Query         001711
Match_columns 1021
No_of_seqs    255 out of 767
Neff          6.3 
Searched_HMMs 46136
Date          Fri Mar 29 07:30:09 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/001711.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/001711hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1984 Vesicle coat complex C 100.0  5E-184  1E-188 1584.2  85.7  730  277-1020  236-1005(1007)
  2 KOG1985 Vesicle coat complex C 100.0  2E-165  3E-170 1430.8  74.5  711  304-1019  160-887 (887)
  3 PTZ00395 Sec24-related protein 100.0  7E-152  2E-156 1365.6  69.0  721  278-1020  599-1556(1560)
  4 COG5028 Vesicle coat complex C 100.0  1E-150  3E-155 1296.1  67.8  706  299-1019  132-861 (861)
  5 PLN00162 transport protein sec 100.0  2E-120  3E-125 1115.3  69.7  656  312-1019    7-760 (761)
  6 KOG1986 Vesicle coat complex C 100.0   4E-90 8.8E-95  791.5  53.9  653  312-1019    7-743 (745)
  7 COG5047 SEC23 Vesicle coat com 100.0 9.8E-83 2.1E-87  712.4  42.5  661  311-1020    6-754 (755)
  8 cd01479 Sec24-like Sec24-like: 100.0 4.5E-54 9.8E-59  466.0  25.8  241  425-666     1-244 (244)
  9 cd01468 trunk_domain trunk dom 100.0 5.9E-50 1.3E-54  433.2  25.7  235  425-660     1-239 (239)
 10 PF04811 Sec23_trunk:  Sec23/Se 100.0   9E-50 1.9E-54  432.7  21.8  237  425-662     1-243 (243)
 11 cd01478 Sec23-like Sec23-like: 100.0   2E-44 4.3E-49  394.5  20.6  225  425-654     1-265 (267)
 12 PF04815 Sec23_helical:  Sec23/  99.9 2.1E-21 4.5E-26  183.8  11.6  103  763-868     1-103 (103)
 13 PF08033 Sec23_BS:  Sec23/Sec24  99.8 1.7E-20 3.7E-25  175.3  11.0   85  667-751     1-96  (96)
 14 PF04810 zf-Sec23_Sec24:  Sec23  99.2   6E-12 1.3E-16   98.6   1.8   35  354-388     6-40  (40)
 15 PRK13685 hypothetical protein;  98.8 3.7E-07   8E-12  104.0  19.7  174  427-661    88-289 (326)
 16 cd01453 vWA_transcription_fact  98.7 5.7E-07 1.2E-11   94.1  17.4  163  429-660     5-177 (183)
 17 cd01467 vWA_BatA_type VWA BatA  98.5 3.5E-06 7.7E-11   86.9  16.7  154  429-643     4-175 (180)
 18 cd01466 vWA_C3HC4_type VWA C3H  98.5 1.8E-06   4E-11   87.6  14.1  147  430-642     3-154 (155)
 19 cd01465 vWA_subgroup VWA subgr  98.5 3.5E-06 7.6E-11   85.8  16.2  155  430-644     3-162 (170)
 20 cd01463 vWA_VGCC_like VWA Volt  98.5   5E-06 1.1E-10   87.2  17.6  163  426-644    12-188 (190)
 21 cd01451 vWA_Magnesium_chelatas  98.5 4.1E-06 8.9E-11   87.0  16.6  160  429-647     2-169 (178)
 22 cd01456 vWA_ywmD_type VWA ywmD  98.5   3E-06 6.4E-11   90.0  15.3  174  423-639    16-196 (206)
 23 TIGR00868 hCaCC calcium-activa  98.4 2.5E-05 5.4E-10   97.8  24.3  167  428-662   305-477 (863)
 24 TIGR03788 marine_srt_targ mari  98.3 0.00045 9.7E-09   85.2  32.3  284  424-803   268-556 (596)
 25 cd01474 vWA_ATR ATR (Anthrax T  98.3 2.3E-05   5E-10   81.8  17.6  167  429-662     6-181 (185)
 26 PF13519 VWA_2:  von Willebrand  98.3   1E-05 2.2E-10   81.7  13.2  151  430-643     2-159 (172)
 27 cd01472 vWA_collagen von Wille  98.3 2.8E-05   6E-10   79.4  16.0  151  430-644     3-163 (164)
 28 TIGR03436 acidobact_VWFA VWFA-  98.2 7.4E-05 1.6E-09   83.9  19.9  158  426-642    52-238 (296)
 29 cd01470 vWA_complement_factors  98.2 4.4E-05 9.6E-10   80.5  15.9  167  430-645     3-190 (198)
 30 cd01461 vWA_interalpha_trypsin  98.2 0.00012 2.5E-09   74.6  18.3  157  427-644     2-161 (171)
 31 cd01452 VWA_26S_proteasome_sub  98.1   8E-05 1.7E-09   78.2  15.4  142  429-634     5-160 (187)
 32 cd01480 vWA_collagen_alpha_1-V  98.0 0.00011 2.4E-09   76.9  14.9  157  429-646     4-173 (186)
 33 PF00626 Gelsolin:  Gelsolin re  98.0 6.7E-06 1.4E-10   73.0   4.5   66  892-983     4-70  (76)
 34 PF13768 VWA_3:  von Willebrand  98.0 0.00011 2.4E-09   74.2  13.6  150  430-641     3-155 (155)
 35 cd01475 vWA_Matrilin VWA_Matri  97.9  0.0002 4.3E-09   77.2  15.5  167  429-662     4-183 (224)
 36 PTZ00441 sporozoite surface pr  97.9 0.00037 8.1E-09   83.4  18.9  163  428-646    43-217 (576)
 37 cd01450 vWFA_subfamily_ECM Von  97.9 0.00022 4.8E-09   71.3  14.5  145  430-635     3-155 (161)
 38 cd01477 vWA_F09G8-8_type VWA F  97.9 0.00038 8.3E-09   73.6  15.9  151  429-638    21-188 (193)
 39 cd01471 vWA_micronemal_protein  97.9 0.00038 8.2E-09   72.5  15.7  149  430-634     3-160 (186)
 40 TIGR02442 Cob-chelat-sub cobal  97.8 0.00018 3.9E-09   89.2  14.6  160  427-642   465-632 (633)
 41 cd01469 vWA_integrins_alpha_su  97.8 0.00065 1.4E-08   70.6  16.3  156  430-646     3-172 (177)
 42 cd01482 vWA_collagen_alphaI-XI  97.8 0.00083 1.8E-08   68.7  15.9  150  430-643     3-162 (164)
 43 TIGR02031 BchD-ChlD magnesium   97.7 0.00044 9.5E-09   84.9  16.0  174  426-647   406-585 (589)
 44 COG1240 ChlD Mg-chelatase subu  97.7 0.00043 9.3E-09   75.0  13.7  166  426-647    77-249 (261)
 45 PHA03247 large tegument protei  97.7   0.069 1.5E-06   72.3  35.3   14  446-459  3114-3127(3151)
 46 smart00327 VWA von Willebrand   97.7  0.0012 2.6E-08   66.9  16.2  153  429-641     3-164 (177)
 47 PRK13406 bchD magnesium chelat  97.7 0.00099 2.1E-08   81.5  18.1  167  426-647   400-572 (584)
 48 cd00198 vWFA Von Willebrand fa  97.7 0.00096 2.1E-08   65.6  15.1  148  429-635     2-155 (161)
 49 PF00092 VWA:  von Willebrand f  97.6 0.00086 1.9E-08   68.3  14.0  155  430-646     2-169 (178)
 50 cd01481 vWA_collagen_alpha3-VI  97.6  0.0024 5.3E-08   65.8  16.0  151  430-645     3-165 (165)
 51 cd01473 vWA_CTRP CTRP for  CS   97.5  0.0037 8.1E-08   66.0  16.9  150  430-634     3-161 (192)
 52 cd01476 VWA_integrin_invertebr  97.4  0.0057 1.2E-07   62.1  16.3  102  430-566     3-115 (163)
 53 cd01464 vWA_subfamily VWA subf  97.3  0.0012 2.6E-08   68.3  10.4  138  430-633     6-159 (176)
 54 smart00262 GEL Gelsolin homolo  97.2  0.0018   4E-08   59.6   9.3   71  896-995    16-87  (90)
 55 KOG1924 RhoA GTPase effector D  97.1  0.0036 7.9E-08   75.5  11.7   12  827-838  1046-1057(1102)
 56 cd01454 vWA_norD_type norD typ  97.0   0.021 4.5E-07   59.0  15.4  147  429-622     2-154 (174)
 57 KOG1984 Vesicle coat complex C  96.9     0.1 2.3E-06   64.5  22.2   33  667-699   717-752 (1007)
 58 cd01458 vWA_ku Ku70/Ku80 N-ter  96.9   0.023   5E-07   61.0  15.1  154  429-621     3-173 (218)
 59 PF04056 Ssl1:  Ssl1-like;  Int  96.8  0.0066 1.4E-07   64.1   9.8  163  433-662     1-173 (193)
 60 KOG1924 RhoA GTPase effector D  96.7   0.011 2.3E-07   71.7  11.6   12  328-339   656-667 (1102)
 61 KOG0443 Actin regulatory prote  96.6  0.0047   1E-07   75.4   8.1   91  866-985   616-706 (827)
 62 COG4245 TerY Uncharacterized p  96.4   0.066 1.4E-06   55.6  13.5  158  428-661     5-180 (207)
 63 KOG2884 26S proteasome regulat  96.3     0.1 2.2E-06   55.2  14.5  154  429-644     5-175 (259)
 64 cd01462 VWA_YIEM_type VWA YIEM  96.2    0.13 2.8E-06   51.6  14.4  130  430-621     3-135 (152)
 65 TIGR00578 ku70 ATP-dependent D  95.5    0.23 4.9E-06   61.4  15.4  162  429-626    12-190 (584)
 66 COG5148 RPN10 26S proteasome r  95.1    0.69 1.5E-05   48.0  14.6  133  428-620     4-146 (243)
 67 cd01457 vWA_ORF176_type VWA OR  94.6    0.42 9.2E-06   50.5  12.5  146  429-634     4-165 (199)
 68 cd01460 vWA_midasin VWA_Midasi  94.4    0.53 1.2E-05   52.4  13.1  132  426-620    59-204 (266)
 69 KOG0443 Actin regulatory prote  94.1    0.19   4E-06   62.1   9.4   79  898-1001  277-358 (827)
 70 cd01455 vWA_F11C1-5a_type Von   93.7     3.2 6.9E-05   44.0  16.5   98  514-644    72-174 (191)
 71 TIGR00627 tfb4 transcription f  93.3     5.4 0.00012   44.8  18.4   95  536-662   117-221 (279)
 72 PF03731 Ku_N:  Ku70/Ku80 N-ter  92.7    0.77 1.7E-05   49.4  10.6  154  429-618     1-172 (224)
 73 PF03850 Tfb4:  Transcription f  92.6     4.9 0.00011   45.2  16.9  184  429-644     3-207 (276)
 74 KOG0444 Cytoskeletal regulator  91.2    0.31 6.6E-06   58.8   5.7   66  894-985   637-703 (1255)
 75 KOG2807 RNA polymerase II tran  90.8     2.6 5.6E-05   47.4  11.9  165  427-660    60-234 (378)
 76 KOG4849 mRNA cleavage factor I  90.2     8.2 0.00018   43.7  15.1   13  448-460   391-403 (498)
 77 COG2425 Uncharacterized protei  89.9     2.1 4.5E-05   50.8  11.0  148  427-643   273-424 (437)
 78 KOG4849 mRNA cleavage factor I  88.8     9.5 0.00021   43.3  14.3    7  354-360   412-418 (498)
 79 PRK10997 yieM hypothetical pro  88.1       2 4.3E-05   51.8   9.3  149  428-644   324-475 (487)
 80 PF06707 DUF1194:  Protein of u  86.9      29 0.00062   37.4  16.1  119  514-666    75-202 (205)
 81 smart00187 INB Integrin beta s  85.2      91   0.002   37.2  26.0  272  427-715    99-389 (423)
 82 KOG2353 L-type voltage-depende  84.2      19 0.00041   47.6  15.7  116  408-553   203-322 (1104)
 83 PF00362 Integrin_beta:  Integr  83.7      94   0.002   37.3  20.2  275  412-715    93-392 (426)
 84 KOG3768 DEAD box RNA helicase   83.1      16 0.00035   44.2  13.2   32  428-459     2-38  (888)
 85 KOG0444 Cytoskeletal regulator  82.4     3.1 6.8E-05   50.7   7.2   56  867-927   731-788 (1255)
 86 KOG2487 RNA polymerase II tran  76.5      46 0.00099   37.1  13.0   55  599-662   185-239 (314)
 87 COG4867 Uncharacterized protei  69.4      27 0.00059   40.9   9.8  160  428-643   464-634 (652)
 88 PF11265 Med25_VWA:  Mediator c  67.2   2E+02  0.0043   31.6  15.4  103  516-641    89-204 (226)
 89 PF09967 DUF2201:  VWA-like dom  63.4      12 0.00026   37.0   5.0   93  431-566     2-94  (126)
 90 COG5242 TFB4 RNA polymerase II  61.1 1.4E+02   0.003   32.5  12.4  177  426-644    19-214 (296)
 91 KOG0307 Vesicle coat complex C  58.8 5.7E+02   0.012   34.0  22.2    9  354-362   960-968 (1049)
 92 PF10138 vWA-TerF-like:  vWA fo  54.3 2.6E+02  0.0057   30.1  13.3  144  430-634     4-155 (200)
 93 PF05762 VWA_CoxE:  VWA domain   44.3      32 0.00069   37.3   4.9  102  425-564    54-159 (222)
 94 KOG2893 Zn finger protein [Gen  40.4 1.3E+02  0.0028   32.8   8.4   10  511-520   323-332 (341)
 95 PF02905 EBV-NA1:  Epstein Barr  32.5      71  0.0015   31.5   4.5   33  446-478   112-145 (146)
 96 KOG1923 Rac1 GTPase effector F  31.7 1.5E+02  0.0033   37.6   8.2    6  477-482   465-470 (830)
 97 KOG4672 Uncharacterized conser  31.5 2.7E+02  0.0059   32.9   9.6    6  150-155   381-386 (487)
 98 PF10058 DUF2296:  Predicted in  25.7      55  0.0012   27.7   2.2   13  370-382    42-54  (54)
 99 KOG1985 Vesicle coat complex C  25.1 1.3E+03   0.028   30.1  14.5   24  359-383   206-230 (887)
100 PF12257 DUF3608:  Protein of u  23.9 8.3E+02   0.018   27.8  11.7   28  596-623   246-273 (281)
101 COG5415 Predicted integral mem  23.5      34 0.00073   36.6   0.7   33  354-386   188-228 (251)
102 COG1580 FliL Flagellar basal b  22.8 2.5E+02  0.0053   29.2   6.8   65  721-799    76-143 (159)
103 COG1592 Rubrerythrin [Energy p  21.8      47   0.001   34.6   1.4   15  369-383   131-145 (166)
104 KOG4368 Predicted RNA binding   21.1 1.6E+03   0.035   28.0  13.7  151   81-253   291-446 (757)
105 PF13894 zf-C2H2_4:  C2H2-type   20.7      47   0.001   21.9   0.8   12  373-384     1-12  (24)
106 COG3285 Predicted eukaryotic-t  20.3 4.2E+02   0.009   30.2   8.3   15  354-368    66-80  (299)

No 1  
>KOG1984 consensus Vesicle coat complex COPII, subunit SFB3 [Intracellular trafficking, secretion, and vesicular transport]
Probab=100.00  E-value=5.5e-184  Score=1584.25  Aligned_cols=730  Identities=37%  Similarity=0.685  Sum_probs=703.8

Q ss_pred             CCCCCCCCCCCCCCCCCCCCCC--------------CCCCCCCCcccc----cCCCCceeccccccCCCHHHHhhcCCce
Q 001711          277 SIPGSIEPGIDLKSLPRPLDGD--------------VEPSSLAETYPL----NCHSRYLRLTTSAIPNSQSLVSRWHLPL  338 (1021)
Q Consensus       277 ~~~~~~dp~~~~~~ip~p~~~~--------------~~pp~~~~~~~~----N~~P~y~R~T~~~iP~t~~l~~~~~lPl  338 (1021)
                      ..+.|+||    ++||+|....              ..||++||+|.+    ||||||||||+|+||+|.++++.++|||
T Consensus       236 ~~~~rldp----~~iPs~~qv~~~d~~~~r~~~~~~~~PPl~TTd~~~~DqGN~sPr~mr~T~Y~iP~T~Dl~~as~iPL  311 (1007)
T KOG1984|consen  236 PPPQRLDP----NAIPSPPQVSIEDDSSFRSTDTRAQPPPLVTTDFFIQDQGNCSPRFMRCTMYTIPCTNDLLKASQIPL  311 (1007)
T ss_pred             CccccCCh----hhCCCchhcccchhhhhhcCCccCCCCCCcccceEEeccCCCCcchheeecccCCccHhHHHhcCCcc
Confidence            46789999    9999997651              579999999986    9999999999999999999999999999


Q ss_pred             EEEEccCCCCCCCCC---------------CccceEEccceeEecCCceEEEcCCCCCCCCCcccccccCcCcccCCCCC
Q 001711          339 GAVVCPLAEPPEGNL---------------FICRTYVNPYVTFTDAGRKWRCNICALLNDVPGDYFAHLDATGRRIDIDQ  403 (1021)
Q Consensus       339 g~vv~Pfa~~~~~e~---------------~rCrAYiNPf~~f~~~g~~W~Cn~C~~~N~vP~~Y~~~l~~~g~R~D~~~  403 (1021)
                      |+||+|||.+.+.|.               +||||||||||+|+++||+|+||||+.+|++|++||++|+++|||.|+++
T Consensus       312 alvIqPfa~l~p~E~~~~vVd~g~sgPvRC~RCkaYinPFmqF~~~gr~f~Cn~C~~~n~vp~~yf~~L~~~grr~D~~e  391 (1007)
T KOG1984|consen  312 ALVIQPFATLTPNEAPVPVVDLGESGPVRCNRCKAYINPFMQFIDGGRKFICNFCGSKNQVPDDYFNHLGPTGRRVDVEE  391 (1007)
T ss_pred             eeEecccccCCcccCCCceecCCCCCCcchhhhhhhcCcceEEecCCceEEecCCCccccCChhhcccCCCccccccccc
Confidence            999999998876553               99999999999999999999999999999999999999999999999999


Q ss_pred             CCccccccEEEEccccccCC--CCCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCC-CCCCceEEEEEEcCeEEEE
Q 001711          404 RPELTKGSVEFVAPTEYMVR--PPMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELP-GFPRTQIGFITFDSTIHFY  480 (1021)
Q Consensus       404 rPEL~~gtVEfvap~eY~~r--~p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp-~~~rt~VgiITFds~V~fy  480 (1021)
                      ||||++|+|||+|+++||++  ++++++|||+||||++|+++|++.++|++|+++|+.|+ ++++++|||||||++||||
T Consensus       392 rpEL~~Gt~dfvatk~Y~~~~k~p~ppafvFmIDVSy~Ai~~G~~~a~ce~ik~~l~~lp~~~p~~~Vgivtfd~tvhFf  471 (1007)
T KOG1984|consen  392 RPELCLGTVDFVATKDYCRKTKPPKPPAFVFMIDVSYNAISNGAVKAACEAIKSVLEDLPREEPNIRVGIVTFDKTVHFF  471 (1007)
T ss_pred             CchhcccccceeeehhhhhcCCCCCCceEEEEEEeehhhhhcchHHHHHHHHHHHHhhcCccCCceEEEEEEecceeEee
Confidence            99999999999999999998  89999999999999999999999999999999999999 6789999999999999999


Q ss_pred             ecCCCCCCcceeeccccccccCCCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-CCEEE
Q 001711          481 NMKSSLTQPQMMVISDLDDIFVPLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-GGKLL  559 (1021)
Q Consensus       481 nl~~~~~~p~mlVvsDldd~f~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-GGkIi  559 (1021)
                      |+++++++++|+||+|++|+|+|+.+++||+..|++..|+.|||+|+.||.+.+.+++|+|+||+||..+||.+ ||||+
T Consensus       472 nl~s~L~qp~mliVsdv~dvfvPf~~g~~V~~~es~~~i~~lLd~Ip~mf~~sk~pes~~g~alqaa~lalk~~~gGKl~  551 (1007)
T KOG1984|consen  472 NLSSNLAQPQMLIVSDVDDVFVPFLDGLFVNPNESRKVIELLLDSIPTMFQDSKIPESVFGSALQAAKLALKAADGGKLF  551 (1007)
T ss_pred             ccCccccCceEEEeecccccccccccCeeccchHHHHHHHHHHHHhhhhhccCCCCchhHHHHHHHHHHHHhccCCceEE
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999998 99999


Q ss_pred             EEecCCCCCCcc-cccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccE
Q 001711          560 IFQNSLPSLGVG-CLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQ  638 (1021)
Q Consensus       560 vF~sg~Pt~GpG-~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~  638 (1021)
                      ||++.+||+|.| +|+.|+| .|+++|+||++|+.+++++|++||++|++.|||||||++...|+|+|+|+.+++.|||+
T Consensus       552 vF~s~Lpt~g~g~kl~~r~D-~~l~~t~kek~l~~pq~~~y~~LA~e~v~~g~svDlF~t~~ayvDvAtlg~v~~~TgG~  630 (1007)
T KOG1984|consen  552 VFHSVLPTAGAGGKLSNRDD-RRLIGTDKEKNLLQPQDKTYTTLAKEFVESGCSVDLFLTPNAYVDVATLGVVPALTGGQ  630 (1007)
T ss_pred             EEecccccccCcccccccch-hhhhcccchhhccCcchhHHHHHHHHHHHhCceEEEEEcccceeeeeeecccccccCce
Confidence            999999999977 8877754 89999999999999999999999999999999999999999999999999999999999


Q ss_pred             EEEeCCCCCchhHHHHHHHHHHhcccccccceEEEEEeCCCeEEEeeecCcccCCCCceeeccCCCCCcEEEEEEecccc
Q 001711          639 VYYYPSFQSTTHGERLRHELSRDLTRETAWEAVMRIRCGKGVRFTNYHGNFMLRSTDLLALPAVDCDKAYAMQLSLEETL  718 (1021)
Q Consensus       639 v~~y~~F~~~~d~~kl~~dL~r~ltr~~g~~a~mrVR~S~Gl~V~~~~Gnf~~rs~~~~~l~~id~d~Sia~~~~~d~~l  718 (1021)
                      +|+|.+|....|+.+|.+||.|++++++||+|+||||||+||++.+|||||+++++++++|+.+|+||+++|+|+|||+|
T Consensus       631 vy~Y~~F~a~~D~~rl~nDL~~~vtk~~gf~a~mrvRtStGirv~~f~Gnf~~~~~tDiela~lD~dkt~~v~fkhDdkL  710 (1007)
T KOG1984|consen  631 VYKYYPFQALTDGPRLLNDLVRNVTKKQGFDAVMRVRTSTGIRVQDFYGNFLMRNPTDIELAALDCDKTLTVEFKHDDKL  710 (1007)
T ss_pred             eEEecchhhcccHHHHHHHHHHhcccceeeeeEEEEeecCceeeeeeechhhhcCCCCccccccccCceeEEEEeccccc
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             CCCceeEEEEEEEEEecCCcEEEEEEeeeecccCCHHHHHHhcCHhHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHH
Q 001711          719 LTTQTVYFQVALLYTASCGERRIRVHTLAAPVVSNLSDMYQQADTGAIVSVFSRLAIEKTLSHKLEDARNAVQLRLVKAL  798 (1021)
Q Consensus       719 ~~~~~~~iQ~AllYTt~~GeRrIRV~Tl~lpvt~~l~~vf~s~D~eai~~~laK~a~~~~l~~~l~d~R~~l~~~lv~iL  798 (1021)
                      +++..++||+|||||+.+|+|||||+|+++++|+.++|+|+++|.|+++++|+|.|+..+.++.++++|+.++++|++||
T Consensus       711 q~~s~~~fQ~AlLYTti~G~RR~Rv~Nlsl~~ts~l~~lyr~~~~d~l~a~maK~a~~~i~~~~lk~vre~l~~~~~~iL  790 (1007)
T KOG1984|consen  711 QDGSDVHFQTALLYTTIDGQRRLRVLNLSLAVTSQLSELYRSADTDPLIAIMAKQAAKAILDKPLKEVREQLVSQCAQIL  790 (1007)
T ss_pred             cCCcceeEEEEEEEeccCCceeEEEEecchhhhhhHHHHHHhcCccHHHHHHHHHHHHhcccccHHHHHHHHHHHHHHHH
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHHhhhhhccCCCCccccccccccHHHHHHHhhhccCCCCCCCCCChhHHHHHHHHHcCCCHHHHHhhhcccEEEeecC
Q 001711          799 KEYRNLYAVQHRLGSRMIYPESLKFLPLYCLAICKSTPIRGGYADVTLDERCAAGYTMMALPVKKLLKLLYPCLIRVDEH  878 (1021)
Q Consensus       799 ~~YRk~~~a~~~~~~qLiLPesLklLPlyil~LlKS~~Lr~g~~~~s~DeR~~~~~~l~s~~v~~~~~~lYPrL~~lh~~  878 (1021)
                      ++||| +|++..+++||||||+||+||+|+++|+||.+|++  .+++.|+|+|++.+++++++++++.++||||+++|++
T Consensus       791 ~~YRk-~cas~~ssgQLILPeslKLlPly~la~lKs~~l~~--~~~~~DdRi~~~~~v~sl~v~~~~~~~YPrl~p~hdl  867 (1007)
T KOG1984|consen  791 ASYRK-NCASPASSGQLILPESLKLLPLYMLALLKSSALRP--QEIRTDDRIYQLQLVTSLSVEQLMPFFYPRLLPFHDL  867 (1007)
T ss_pred             HHHHH-hhcCCCCcccEechhhhHHHHHHHHHHHHhhcccc--cccccchhHHHHHHhhcccHHhhhhhhccceeeeecc
Confidence            99999 99999999999999999999999999999999996  7899999999999999999999999999999999999


Q ss_pred             CCCCCccCCcccccccccccchhhccCCcEEEEECCceeEEEecCCCCHHHHHhhcCCchhhhh--ccccccccchHHHH
Q 001711          879 LLKPSAQLDEYKNIMKRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPDIAMNLLGSEFAAEL--SKVMLREQDNEMSR  956 (1021)
Q Consensus       879 ~~~~~~~~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~ll~~lFgv~~~~~l--~~~~lp~~~n~~s~  956 (1021)
                      ..+++    ....+|.+|++|+|+|+++||||||||+++|||||+++++.|+|+||+|++.+++  ...+||++||.+|+
T Consensus       868 ~i~dt----l~~~~p~~VraS~e~l~negiYll~nG~~~ylwvg~sv~~~llQ~lf~V~s~~~i~s~~~~Lpe~dn~lS~  943 (1007)
T KOG1984|consen  868 DIEDT----LEFVLPKAVRASSEFLSNEGIYLLDNGQKIYLWVGESVDPDLLQDLFSVSSFEQIDSQSGVLPELDNPLSR  943 (1007)
T ss_pred             ccccc----cccccccceecchhhccCCceEEEecCcEEEEEecCCCCHHHHHHHhcCccccccccccccccccCcHHHH
Confidence            64432    2236799999999999999999999999999999999999999999999999999  34789999999999


Q ss_pred             HHHHHHHHHHHhCCCCCceEEEeccCCCcchHHHHHhhccccCCCCCCCHHHHHHHHHHHHhcC
Q 001711          957 KLLGILKKLREQDPSYYQLCQLVRQGEQPREGFLLLANLVEDQIGGSNGYADWIMQIHRQVLQN 1020 (1021)
Q Consensus       957 ~l~~ii~~lr~~r~~~~~l~~vvrqg~~~~~e~~f~~~LVED~~~~~~SY~dFL~~lh~~I~~k 1020 (1021)
                      ++|++|..||+.|..++++ +++|+|++.. +.+|.++||||++++++||+||||.|||+|++|
T Consensus       944 k~r~~i~~i~~~r~~~l~v-~~~k~g~~~~-~~~~~~~lved~~~~~~sY~dyL~~~H~ki~~~ 1005 (1007)
T KOG1984|consen  944 KVRNVISLIRRQRSSELPV-VLVKQGLDGS-EVEFSEYLVEDRGRNISSYVDYLCELHKKIQQK 1005 (1007)
T ss_pred             HHHHHHHHHHhcccccccc-EEEecCCCch-hhhhhhhhhcccccCccccchHHHHHHHHHHhh
Confidence            9999999999999999998 9999999883 588999999999999999999999999999986


No 2  
>KOG1985 consensus Vesicle coat complex COPII, subunit SEC24/subunit SFB2 [Intracellular trafficking, secretion, and vesicular transport]
Probab=100.00  E-value=1.6e-165  Score=1430.85  Aligned_cols=711  Identities=47%  Similarity=0.769  Sum_probs=671.9

Q ss_pred             CCCcccccCCCCceeccccccCCCHHHHhhcCCceEEEEccCCCCCCCCC------------CccceEEccceeEecCCc
Q 001711          304 LAETYPLNCHSRYLRLTTSAIPNSQSLVSRWHLPLGAVVCPLAEPPEGNL------------FICRTYVNPYVTFTDAGR  371 (1021)
Q Consensus       304 ~~~~~~~N~~P~y~R~T~~~iP~t~~l~~~~~lPlg~vv~Pfa~~~~~e~------------~rCrAYiNPf~~f~~~g~  371 (1021)
                      .+.....||+|+|+|+|+++||.++++++|+|||||++|+||+++.+.++            ++||+||||||.|++.|+
T Consensus       160 ~~~~~~~nc~p~y~RsTl~~iP~t~sLl~kskLPlglvv~Pf~~~~d~~~~p~~~~~~IvRCr~CRtYiNPFV~fid~gr  239 (887)
T KOG1985|consen  160 VTPSESSNCSPSYVRSTLSAIPQTQSLLKKSKLPLGLVVHPFAHLDDIDPLPVITSTLIVRCRRCRTYINPFVEFIDQGR  239 (887)
T ss_pred             cCCccccCCCHHHHHHHHHhCCccHHHHHhcCCCceEEEeecccccccCCCCcccCCceeeehhhhhhcCCeEEecCCCc
Confidence            33334569999999999999999999999999999999999997653322            999999999999999999


Q ss_pred             eEEEcCCCCCCCCCcccccccCcCcccCCCCCCCccccccEEEEccccccCCCCCCCeEEEEEecchhHHhhcHHHHHHH
Q 001711          372 KWRCNICALLNDVPGDYFAHLDATGRRIDIDQRPELTKGSVEFVAPTEYMVRPPMPPLYFFLIDVSISAIRSGMLEVVAQ  451 (1021)
Q Consensus       372 ~W~Cn~C~~~N~vP~~Y~~~l~~~g~R~D~~~rPEL~~gtVEfvap~eY~~r~p~pp~yvFvIDvS~~av~sG~l~~~~~  451 (1021)
                      +|+||+|+..|+||.+|+++. -++.+.|.++||||++++|||+||.|||.|+|+|++||||||||.+|+++|+|+++|+
T Consensus       240 ~WrCNlC~~~NdvP~~f~~~~-~t~~~~~~~~RpEl~~s~vE~iAP~eYmlR~P~Pavy~FliDVS~~a~ksG~L~~~~~  318 (887)
T KOG1985|consen  240 RWRCNLCGRVNDVPDDFDWDP-LTGAYGDPYSRPELTSSVVEFIAPSEYMLRPPQPAVYVFLIDVSISAIKSGYLETVAR  318 (887)
T ss_pred             eeeechhhhhcCCcHHhhcCc-cccccCCcccCccccceeEEEecCcccccCCCCCceEEEEEEeehHhhhhhHHHHHHH
Confidence            999999999999999999874 3567889999999999999999999999999999999999999999999999999999


Q ss_pred             HHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCccceehhhhHHHHHHHHhhCCCccc
Q 001711          452 TIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDLLVNLSESRSVVDTLLDSLPSMFQ  531 (1021)
Q Consensus       452 sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~  531 (1021)
                      +|+++||.||+++|++|||||||++||||+++.++.+|+|++|+|+||+|+|.+++|||+++|||+.|+.+|+.|+.||.
T Consensus       319 slL~~LD~lpgd~Rt~igfi~fDs~ihfy~~~~~~~qp~mm~vsdl~d~flp~pd~lLv~L~~ck~~i~~lL~~lp~~F~  398 (887)
T KOG1985|consen  319 SLLENLDALPGDPRTRIGFITFDSTIHFYSVQGDLNQPQMMIVSDLDDPFLPMPDSLLVPLKECKDLIETLLKTLPEMFQ  398 (887)
T ss_pred             HHHHhhhcCCCCCcceEEEEEeeceeeEEecCCCcCCCceeeeccccccccCCchhheeeHHHHHHHHHHHHHHHHHHHh
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             CCCCcccchHHHHHHHHHHHHhcCCEEEEEecCCCCCCcccccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCc
Q 001711          532 DNMNVESAFGPALKAAFMVMSRLGGKLLIFQNSLPSLGVGCLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQI  611 (1021)
Q Consensus       532 ~~~~~~~alG~AL~aA~~lL~~~GGkIivF~sg~Pt~GpG~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gI  611 (1021)
                      +++..++|+|+||++|+++|+.+||||++|++++||.|.|+|+.||+ .++.+++++.+++.+++.|||+||.+|++.||
T Consensus       399 ~~~~t~~alGpALkaaf~li~~~GGri~vf~s~lPnlG~G~L~~rEd-p~~~~s~~~~qlL~~~t~FYK~~a~~cs~~qI  477 (887)
T KOG1985|consen  399 DTRSTGSALGPALKAAFNLIGSTGGRISVFQSTLPNLGAGKLKPRED-PNVRSSDEDSQLLSPATDFYKDLALECSKSQI  477 (887)
T ss_pred             hccCcccccCHHHHHHHHHHhhcCCeEEEEeccCCCCCccccccccc-cccccchhhhhccCCCchHHHHHHHHhccCce
Confidence            99999999999999999999999999999999999999999999954 78888999999999999999999999999999


Q ss_pred             EEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCc--hhHHHHHHHHHHhcccccccceEEEEEeCCCeEEEeeecCc
Q 001711          612 AVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQST--THGERLRHELSRDLTRETAWEAVMRIRCGKGVRFTNYHGNF  689 (1021)
Q Consensus       612 sVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~--~d~~kl~~dL~r~ltr~~g~~a~mrVR~S~Gl~V~~~~Gnf  689 (1021)
                      |||+|+++.+|+|+|||+.|+++|||.+|||++|+..  .|..||.+||.|.|+|++||||+||||||+||+++.|||||
T Consensus       478 ~VDlFl~s~qY~DlAsLs~LskySgG~~y~YP~f~~s~p~~~~Kf~~el~r~Ltr~~~feaVmRiR~S~gl~~~~f~GnF  557 (887)
T KOG1985|consen  478 CVDLFLFSEQYTDLASLSCLSKYSGGQVYYYPSFDGSNPHDVLKFARELARYLTRKIGFEAVMRIRCSTGLRMSSFFGNF  557 (887)
T ss_pred             EEEEEeecccccchhhhhccccccCceeEEccCCCCCCHHHHHHHHHHHHHHhhhhhhhheeEEeeccccccccceeccc
Confidence            9999999999999999999999999999999999987  57889999999999999999999999999999999999999


Q ss_pred             ccCCCCceeeccCCCCCcEEEEEEeccccCCCceeEEEEEEEEEecCCcEEEEEEeeeecccCCHHHHHHhcCHhHHHHH
Q 001711          690 MLRSTDLLALPAVDCDKAYAMQLSLEETLLTTQTVYFQVALLYTASCGERRIRVHTLAAPVVSNLSDMYQQADTGAIVSV  769 (1021)
Q Consensus       690 ~~rs~~~~~l~~id~d~Sia~~~~~d~~l~~~~~~~iQ~AllYTt~~GeRrIRV~Tl~lpvt~~l~~vf~s~D~eai~~~  769 (1021)
                      +.|++|++.++++++|++++|++++|+.+. ...++||+|+|||...|||||||||+++++++++.|||+++|++||+.+
T Consensus       558 F~RStDLla~~~v~~D~sy~~qisiEesl~-~~~~~fQvAlLyT~~~GERRIRV~T~~lpt~~sl~evY~saD~~AI~~l  636 (887)
T KOG1985|consen  558 FVRSTDLLALPNVNPDQSYAFQISIEESLT-TGFCVFQVALLYTLSKGERRIRVHTLCLPTVSSLNEVYASADQEAIASL  636 (887)
T ss_pred             ccCcHHHhcccCCCCCccceEEEEeehhcC-CceeEEEeeeeecccCCceeEEEEEeeccccccHHHHHhhcCHHHHHHH
Confidence            999999999999999999999999999986 4667899999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCccccccccccHHHHHHHhhhccCCCCCCCCCChhHH
Q 001711          770 FSRLAIEKTLSHKLEDARNAVQLRLVKALKEYRNLYAVQHRLGSRMIYPESLKFLPLYCLAICKSTPIRGGYADVTLDER  849 (1021)
Q Consensus       770 laK~a~~~~l~~~l~d~R~~l~~~lv~iL~~YRk~~~a~~~~~~qLiLPesLklLPlyil~LlKS~~Lr~g~~~~s~DeR  849 (1021)
                      |+|+|+++.++..+.|+|+.|+++++++|.+|||++..++.....|.+|.+|++||+|+++|+||++||.| ..++.|+|
T Consensus       637 la~~Av~ksl~ssL~dardal~~~~~D~l~aYk~~~~~~~~~~~~l~~p~~LrllPllvlALlK~~~fr~g-~~~~lD~R  715 (887)
T KOG1985|consen  637 LAKKAVEKSLSSSLSDARDALTNAVVDILNAYKKLVSNQNGQGITLSLPASLRLLPLLVLALLKHPAFRPG-TGTRLDYR  715 (887)
T ss_pred             HHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHhcccccCCcceecCcchhhhHHHHHHHhcCCcccCC-CCCCchHH
Confidence            99999999999999999999999999999999996665556666799999999999999999999999987 69999999


Q ss_pred             HHHHHHHcCCCHHHHHhhhcccEEEeecCCCCCCcc-CCcccccccccccchhhccCCcEEEEECCceeEEEecCCCCHH
Q 001711          850 CAAGYTMMALPVKKLLKLLYPCLIRVDEHLLKPSAQ-LDEYKNIMKRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPD  928 (1021)
Q Consensus       850 ~~~~~~l~s~~v~~~~~~lYPrL~~lh~~~~~~~~~-~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~  928 (1021)
                      ++++++|+++++..++++|||.||++|++..+...- .|+.+.+|++|+|+.+.|+.+|+||||+|..+|||||++++++
T Consensus       716 ~~a~~~~~~lpl~~L~k~IYP~Lysl~~l~~ea~~~~~d~~~~~p~~L~ltae~l~~~GlyL~D~g~~lfl~vg~~a~P~  795 (887)
T KOG1985|consen  716 AYAMCLMSTLPLKYLMKYIYPTLYSLHDLDDEAGLPIHDQTVVLPPPLNLTAELLSRRGLYLMDTGTTLFLWVGSNADPS  795 (887)
T ss_pred             HHHHHHhhcCCHHHHHhhhcccceeccccccccCcccccccccCCCccchHHHHhccCceEEEecCcEEEEEEcCCCCcc
Confidence            999999999999999999999999999984211111 3566788999999999999999999999999999999999999


Q ss_pred             HHHhhcCCchhhhh--ccccccccchHHHHHHHHHHHHHHHhCCCCCceEEEeccCCCcchHHHHHhhccccCCCCCCCH
Q 001711          929 IAMNLLGSEFAAEL--SKVMLREQDNEMSRKLLGILKKLREQDPSYYQLCQLVRQGEQPREGFLLLANLVEDQIGGSNGY 1006 (1021)
Q Consensus       929 ll~~lFgv~~~~~l--~~~~lp~~~n~~s~~l~~ii~~lr~~r~~~~~l~~vvrqg~~~~~e~~f~~~LVED~~~~~~SY 1006 (1021)
                      ++.++||++.+.++  ++.+|++.+|+.+++++++|++||..|..+..+ +|||+++.+.+..||++.||||++.+..||
T Consensus       796 ll~~vfg~~~~adi~~~~~~lp~~~n~~s~r~~~fI~~lR~d~~~~p~~-~ivr~~~~s~~k~~f~~~lvEDrs~~~~SY  874 (887)
T KOG1985|consen  796 LLFDVFGVSTLADIPIGKYTLPELDNEESDRVRRFIKKLRDDRTYFPNL-YIVRGDDNSPLKAWFFSRLVEDRSENSPSY  874 (887)
T ss_pred             ccccccCcchHhhcccccccCcccccchhHHHHHHHHHhhcCCcccceE-EEEecCCCchHHHHHHHHHHhhhhcCcHHH
Confidence            99999999999999  678999999999999999999999777666665 999998777778999999999999999999


Q ss_pred             HHHHHHHHHHHhc
Q 001711         1007 ADWIMQIHRQVLQ 1019 (1021)
Q Consensus      1007 ~dFL~~lh~~I~~ 1019 (1021)
                      +|||.+||++|++
T Consensus       875 ~efLq~lk~qv~~  887 (887)
T KOG1985|consen  875 YEFLQHLKAQVSK  887 (887)
T ss_pred             HHHHHHHHHHhcC
Confidence            9999999999974


No 3  
>PTZ00395 Sec24-related protein; Provisional
Probab=100.00  E-value=7e-152  Score=1365.61  Aligned_cols=721  Identities=24%  Similarity=0.420  Sum_probs=649.1

Q ss_pred             CCCCCCCCCCCCCCCCCCCCC-----------------CCCCCCCCcccc----cCCCCceeccccccCCCHHHHhhcCC
Q 001711          278 IPGSIEPGIDLKSLPRPLDGD-----------------VEPSSLAETYPL----NCHSRYLRLTTSAIPNSQSLVSRWHL  336 (1021)
Q Consensus       278 ~~~~~dp~~~~~~ip~p~~~~-----------------~~pp~~~~~~~~----N~~P~y~R~T~~~iP~t~~l~~~~~l  336 (1021)
                      +.+|||+    ++||||+...                 ..||+.+++|++    ||+|+|||+|||+||.+.++++.++|
T Consensus       599 ~~~ri~~----~~ip~p~~~~~~~~~~~~~~~~~t~k~~~pp~~~~~~~~~dtgn~dP~~~r~tmY~iP~~~~~~~~~~i  674 (1560)
T PTZ00395        599 TINRIDM----NKIPRPIINTQEKKKKKNLKVFETCKYISPPSYYQPYISIDTGKADPRFLKSTLYQIPLFSETLKLSQI  674 (1560)
T ss_pred             cccccCc----ccCCCcccccccccccccchhhhhccCCCCCCCCCceEEeecCCCChhhhhhhhhcCcchHHHHHhcCC
Confidence            5689999    9999998543                 468999999996    99999999999999999999999999


Q ss_pred             ceEEEEccCCCCCCCCC----------------------CccceEEccceeEecCCceEEEcCCCCCCCCCcc----ccc
Q 001711          337 PLGAVVCPLAEPPEGNL----------------------FICRTYVNPYVTFTDAGRKWRCNICALLNDVPGD----YFA  390 (1021)
Q Consensus       337 Plg~vv~Pfa~~~~~e~----------------------~rCrAYiNPf~~f~~~g~~W~Cn~C~~~N~vP~~----Y~~  390 (1021)
                      |||+||+|||.+.+.|.                      .+|++|+|+++.|+.. ++++||||+..+.+...    +++
T Consensus       675 P~gi~v~Pfa~~~~~e~~~~~~~~~~~~d~~~~~~~~rc~~c~~y~~~~~~~~~~-~~~~c~~c~~~~~i~e~~~~~~~~  753 (1560)
T PTZ00395        675 PFGIIVNPFACLNEGEGIDKIDMKDIINDKEENIEILRCPKCLGYLHATILEDIS-SSVQCVFCDTDFLINENVLFDIFQ  753 (1560)
T ss_pred             CceeecchhhhcCCCCCCcccchhhcccchhhccceeecchhHhhhcchheeccc-ceEEEEecCCcchhhHHHHHHHHH
Confidence            99999999999765432                      7999999999999976 99999999999988542    221


Q ss_pred             -ccCcCcccCCCCCC----CccccccEEEEcccccc--------------------------------------------
Q 001711          391 -HLDATGRRIDIDQR----PELTKGSVEFVAPTEYM--------------------------------------------  421 (1021)
Q Consensus       391 -~l~~~g~R~D~~~r----PEL~~gtVEfvap~eY~--------------------------------------------  421 (1021)
                       +..-.-+..|.+++    --|.+|+||+++|.-|.                                            
T Consensus       754 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  833 (1560)
T PTZ00395        754 YNEKIGHKESDHNEHGNSLSPLLKGSVDIIIPPIYYHNVNKFKLTYTYLNKNINQTAFMITNKIMSFTKHISNSLVANDS  833 (1560)
T ss_pred             HhhhhccccccccccccccchhhcCceeEEccchhhccCCccceeeehhhcchhhhhhhhhhhhhhhhhhhcchheeccc
Confidence             11101111222222    14679999999886542                                            


Q ss_pred             --------------------------------------------------------------------------------
Q 001711          422 --------------------------------------------------------------------------------  421 (1021)
Q Consensus       422 --------------------------------------------------------------------------------  421 (1021)
                                                                                                      
T Consensus       834 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  913 (1560)
T PTZ00395        834 KGGNKATSASAFGDSGDANFLAGGGYTNYGGAGGYNTYDNQSGYNNHDVVNNRGGSGAGNHLYGKDHDVQNFDNVMDNAN  913 (1560)
T ss_pred             ccccccchhhhcccccccccccccccccccccccccccccccccccccccccccccCcCcccccCcccccchhhhccCCc
Confidence                                                                                            


Q ss_pred             ---------------------------------CCCCCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceE
Q 001711          422 ---------------------------------VRPPMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQI  468 (1021)
Q Consensus       422 ---------------------------------~r~p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~V  468 (1021)
                                                       ++.++||+||||||||+.||++|+++++|++|+++|+.|+ ++|+||
T Consensus       914 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~PP~YvFLIDVS~~AVkSGLl~tacesIK~sLDsL~-dpRTRV  992 (1560)
T PTZ00395        914 FTIHDMKNLICEKNGEPDSAKIRRNSFLAKYPQVKNMLPPYFVFVVECSYNAIYNNITYTILEGIRYAVQNVK-CPQTKI  992 (1560)
T ss_pred             eeeecchhhhhcccCCchhhhhhccchhhccccccCCCCCEEEEEEECCHHHHhhChHHHHHHHHHHHHhcCC-CCCcEE
Confidence                                             0236889999999999999999999999999999999997 578999


Q ss_pred             EEEEEcCeEEEEecCCC-------------CCCcceeeccccccccCCCC-CccceehhhhHHHHHHHHhhCCCcccCCC
Q 001711          469 GFITFDSTIHFYNMKSS-------------LTQPQMMVISDLDDIFVPLP-DDLLVNLSESRSVVDTLLDSLPSMFQDNM  534 (1021)
Q Consensus       469 giITFds~V~fynl~~~-------------~~~p~mlVvsDldd~f~Pl~-~~lLv~l~es~~~I~~lLd~Lp~~f~~~~  534 (1021)
                      ||||||++||||+|+.+             +++|||+||+||||+|+|++ ++|||++.|+|+.|+.|||.|+.||....
T Consensus       993 GIITFDSsLHFYNLks~l~~~~~~~~~~~~l~qPQMLVVSDLDDPFLPlP~ddLLVnL~ESRevIe~LLDkLPemFt~t~ 1072 (1560)
T PTZ00395        993 AIITFNSSIYFYHCKGGKGVSGEEGDGGGGSGNHQVIVMSDVDDPFLPLPLEDLFFGCVEEIDKINTLIDTIKSVSTTMQ 1072 (1560)
T ss_pred             EEEEecCcEEEEecCcccccccccccccccCCCceEEeecCCccCcCCCCccCeeechHHHHHHHHHHHHHHHHHhhccC
Confidence            99999999999999875             47899999999999999998 89999999999999999999999999999


Q ss_pred             CcccchHHHHHHHHHHHHhcC--CEEEEEecCCCCCCcccccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcE
Q 001711          535 NVESAFGPALKAAFMVMSRLG--GKLLIFQNSLPSLGVGCLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIA  612 (1021)
Q Consensus       535 ~~~~alG~AL~aA~~lL~~~G--GkIivF~sg~Pt~GpG~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIs  612 (1021)
                      ..++|+|+||++|+++|+.+|  |||++|++++|++|+|+|+.|++      +.+|+.++.++++|||+||.+|++++||
T Consensus      1073 ~~esCLGSALqAA~~aLk~~GGGGKIiVF~SSLPniGpGaLK~Re~------~~KEk~Ll~pqd~FYK~LA~ECsk~qIS 1146 (1560)
T PTZ00395       1073 SYGSCGNSALKIAMDMLKERNGLGSICMFYTTTPNCGIGAIKELKK------DLQENFLEVKQKIFYDSLLLDLYAFNIS 1146 (1560)
T ss_pred             CCcccHHHHHHHHHHHHHhcCCCceEEEEEcCCCCCCCCccccccc------ccccccccccchHHHHHHHHHHHhcCCc
Confidence            999999999999999999986  99999999999999999997753      3477788999999999999999999999


Q ss_pred             EEEEEecCCCcC--hhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHHhccc-ccccceEEEEEeCCCeEEEeee--c
Q 001711          613 VNVYAFSDKYTD--IASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSRDLTR-ETAWEAVMRIRCGKGVRFTNYH--G  687 (1021)
Q Consensus       613 VDlF~~s~~~~d--iatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r~ltr-~~g~~a~mrVR~S~Gl~V~~~~--G  687 (1021)
                      ||||+++..|+|  |++|+.|+++|||+||||+.|+..+|..+|++||.+.|++ ++||+|+||||||+||+|++||  |
T Consensus      1147 VDLFLfSsqYvDVDVATLg~Lsr~TGGqlyyYPnFna~rD~~KL~~DL~r~LTre~iGyEAVMRVRCS~GLrVs~fyG~G 1226 (1560)
T PTZ00395       1147 VDIFIISSNNVRVCVPSLQYVAQNTGGKILFVENFLWQKDYKEIYMNIMDTLTSEDIAYCCELKLRYSHHMSVKKLFCCN 1226 (1560)
T ss_pred             eEEEEccCcccccccccccchhcccceeEEEeCCCcccccHHHHHHHHHHHhhccceeeEEEEEEECCCCeEEEEEeccC
Confidence            999999999986  7999999999999999999999999999999999999998 6999999999999999999999  5


Q ss_pred             Ccc--cCCCCceeeccCCCCCcEEEEEEeccccCCCceeEEEEEEEEEecCCcEEEEEEeeeecccCCHHHHHHhcCHhH
Q 001711          688 NFM--LRSTDLLALPAVDCDKAYAMQLSLEETLLTTQTVYFQVALLYTASCGERRIRVHTLAAPVVSNLSDMYQQADTGA  765 (1021)
Q Consensus       688 nf~--~rs~~~~~l~~id~d~Sia~~~~~d~~l~~~~~~~iQ~AllYTt~~GeRrIRV~Tl~lpvt~~l~~vf~s~D~ea  765 (1021)
                      +++  .++++++.|+.+++|++|+|+|+||++|.+...+|||+|||||+.+|||||||||++|+||+++.+||+++|++|
T Consensus      1227 nnF~s~rStDLLaLP~Id~DqSfaVeLk~DEkL~~~~~AYFQaALLYTSssGERRIRVHTLALPVTSsLseVFrsADqdA 1306 (1560)
T PTZ00395       1227 NNFNSIISVDTIKIPKIRHDQTFAFLLNYSDISESKKQIYFQCACIYTNLWGDRFVRLHTTHMNLTSSLSTVFRYTDAEA 1306 (1560)
T ss_pred             CccccccccccccccccCCCceEEEEEEeccccCCCCcEEEEEEEeeccCCCcEEEEEEeeeecccCCHHHHHHhhcHHH
Confidence            555  468899999999999999999999999987889999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCccccccccccHHHHHHHhhhccCCCCCCCCCC
Q 001711          766 IVSVFSRLAIEKTLSHKLEDARNAVQLRLVKALKEYRNLYAVQHRLGSRMIYPESLKFLPLYCLAICKSTPIRGGYADVT  845 (1021)
Q Consensus       766 i~~~laK~a~~~~l~~~l~d~R~~l~~~lv~iL~~YRk~~~a~~~~~~qLiLPesLklLPlyil~LlKS~~Lr~g~~~~s  845 (1021)
                      ++++|+|+|+++++++  .++|+.|.++|+++|++||| +|+...+.+||||||+||+||+|+++|+||.+|+   .+++
T Consensus      1307 IvslLAK~AV~~aLss--sdARe~L~dklVdILtaYRK-~CAsssssgQLILPESLKLLPLYILSLLKS~AfR---t~I~ 1380 (1560)
T PTZ00395       1307 LMNILIKQLCTNILHN--DNYSKIIIDNLAAILFSYRI-NCASSAHSGQLILPDTLKLLPLFTSSLLKHNVTK---KEIL 1380 (1560)
T ss_pred             HHHHHHHHHHHHhccc--HHHHHHHHHHHHHHHHHHHH-HhhccCCCccccchhHHHHHHHHHHHHhcccccc---CCCC
Confidence            9999999999999987  49999999999999999999 9998888999999999999999999999999998   5789


Q ss_pred             hhHHHHHHHHHcCCCHHHHHhhhcccEEEeecCCCCCC---ccCCcccccccccccchhhccCCcEEEEECCceeEEEec
Q 001711          846 LDERCAAGYTMMALPVKKLLKLLYPCLIRVDEHLLKPS---AQLDEYKNIMKRLPLVAESLDSRGLYIFDDGFRFVLWFG  922 (1021)
Q Consensus       846 ~DeR~~~~~~l~s~~v~~~~~~lYPrL~~lh~~~~~~~---~~~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG  922 (1021)
                      .|+|++++++|+++++..++.+||||||+||++..+..   ...++.+.+|..|+||.++|+++||||||+|+.||||||
T Consensus      1381 sDeRVyaL~rL~SmPI~~Li~yLYPRLYpLHdL~~e~e~d~~d~d~~ivLPp~LrLS~ErLesdGIYLLDNGe~IyLWVG 1460 (1560)
T PTZ00395       1381 HDLKVYSLIKLLSMPIISSLLYVYPVMYVIHIKGKTNEIDSMDVDDDLFIPKTIPSSAEKIYSNGIYLLDACTHFYLYFG 1460 (1560)
T ss_pred             ccHHHHHHHHHhCCCHHHHHhhhcCceEEcccccccccCCccCCCCccccCCcccchHHHhcCCcEEEEECCCEEEEEEC
Confidence            99999999999999999999999999999999721111   112345678999999999999999999999999999999


Q ss_pred             CCCCHHHHHhhcCCchhhhhccccccccchHHHHHHHHHHHHHHHhC--CCCCceEEEeccCCCcchHHHHHhhccccCC
Q 001711          923 RMLSPDIAMNLLGSEFAAELSKVMLREQDNEMSRKLLGILKKLREQD--PSYYQLCQLVRQGEQPREGFLLLANLVEDQI 1000 (1021)
Q Consensus       923 ~~v~~~ll~~lFgv~~~~~l~~~~lp~~~n~~s~~l~~ii~~lr~~r--~~~~~l~~vvrqg~~~~~e~~f~~~LVED~~ 1000 (1021)
                      ++++++|++||||+.... ....+||++++++++||++||+.||++|  ..|+++ +|||++++.  |.||+++|||||+
T Consensus      1461 ~~V~PqLLqDLFGv~~~~-~~~~eLPelDT~iS~RVrnII~~LR~~r~~~~Y~pL-~IVRqgDp~--E~~F~s~LVEDRs 1536 (1560)
T PTZ00395       1461 FHSDANFAKEIVGDIPTE-KNAHELNLTDTPNAQKVQRIIKNLSRIHHFNKYVPL-VMVAPKSNE--EEHLISLCVEDKA 1536 (1560)
T ss_pred             CCCCHHHHHHHcCCCccc-cccccccCCCCHHHHHHHHHHHHHHHhccCCCcceE-EEEeCCCch--HHHHHHhCeecCC
Confidence            999999999999974222 2234689999999999999999999986  488998 999999877  8999999999999


Q ss_pred             CCCCCHHHHHHHHHHHHhcC
Q 001711         1001 GGSNGYADWIMQIHRQVLQN 1020 (1021)
Q Consensus      1001 ~~~~SY~dFL~~lh~~I~~k 1020 (1021)
                      .+++||+||||+|||+|++|
T Consensus      1537 ~g~~SYvDFLc~LHKqIq~k 1556 (1560)
T PTZ00395       1537 DKEYSYVNFLCFIHKLVHKR 1556 (1560)
T ss_pred             CCCCCHHHHHHHHHHHHHHh
Confidence            99999999999999999987


No 4  
>COG5028 Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion]
Probab=100.00  E-value=1.3e-150  Score=1296.13  Aligned_cols=706  Identities=37%  Similarity=0.689  Sum_probs=669.9

Q ss_pred             CCCCCCCCcccc----cCCCCceeccccccCCCHHHHhhcCCceEEEEccCCCCCCCCC-------------CccceEEc
Q 001711          299 VEPSSLAETYPL----NCHSRYLRLTTSAIPNSQSLVSRWHLPLGAVVCPLAEPPEGNL-------------FICRTYVN  361 (1021)
Q Consensus       299 ~~pp~~~~~~~~----N~~P~y~R~T~~~iP~t~~l~~~~~lPlg~vv~Pfa~~~~~e~-------------~rCrAYiN  361 (1021)
                      ..||. ++.++.    ||+|+|+|+|+|+||.+.+++++++||||+||+||.++.+.+.             +|||+|||
T Consensus       132 ~~ppl-tt~~~~~e~~n~~p~yvrsT~yaiP~t~dl~~~skiPfgLVI~Pf~~l~~e~~~vpl~~d~~ivRCrrCrsYiN  210 (861)
T COG5028         132 IVPPL-TTNFVGSEQSNCSPKYVRSTMYAIPETNDLLKKSKIPFGLVIRPFLELYPEEDPVPLVEDGSIVRCRRCRSYIN  210 (861)
T ss_pred             CCCCc-ccceeeeccCCCCHHHHHHHHhhCCCchhHHHhcCCCceEEeehhhhcCccCCCCccCCCCcchhhhhhHhhcC
Confidence            34555 777764    9999999999999999999999999999999999999876432             99999999


Q ss_pred             cceeEecCCceEEEcCCCCCCCCCcccccccCcCcccCCCCCCCccccccEEEEccccccCCCCCCCeEEEEEecchhHH
Q 001711          362 PYVTFTDAGRKWRCNICALLNDVPGDYFAHLDATGRRIDIDQRPELTKGSVEFVAPTEYMVRPPMPPLYFFLIDVSISAI  441 (1021)
Q Consensus       362 Pf~~f~~~g~~W~Cn~C~~~N~vP~~Y~~~l~~~g~R~D~~~rPEL~~gtVEfvap~eY~~r~p~pp~yvFvIDvS~~av  441 (1021)
                      ||++|+++|++|+||+|+..|++|.++++...+++.|.|+++|+||.+|+|||+||++|+.|.+.|++|||+||||.+++
T Consensus       211 Pfv~fi~~g~kw~CNiC~~kN~vp~~~~~~~~~~~~r~d~~~r~El~~~vvdf~ap~~Y~~~~p~P~~yvFlIDVS~~a~  290 (861)
T COG5028         211 PFVQFIEQGRKWRCNICRSKNDVPEGFDNPSGPNDPRSDRYSRPELKSGVVDFLAPKEYSLRQPPPPVYVFLIDVSFEAI  290 (861)
T ss_pred             ceEEEecCCcEEEEeeccccccCcccccCcCCCCCccccccccchhhceeeEEecccceeeccCCCCEEEEEEEeehHhh
Confidence            99999999999999999999999999999899999999999999999999999999999999999999999999999999


Q ss_pred             hhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCC-CccceehhhhHHHH
Q 001711          442 RSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLP-DDLLVNLSESRSVV  519 (1021)
Q Consensus       442 ~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~-~~lLv~l~es~~~I  519 (1021)
                      ++|++.+++++|++.|+.+++ ++|+||+||.||++|||++++.+++ .+|++|+|+||+|+|.+ .+|++++.+++..+
T Consensus       291 ~~g~~~a~~r~Il~~l~~~~~~dpr~kIaii~fD~sl~ffk~s~d~~-~~~~~vsdld~pFlPf~s~~fv~pl~~~k~~~  369 (861)
T COG5028         291 KNGLVKAAIRAILENLDQIPNFDPRTKIAIICFDSSLHFFKLSPDLD-EQMLIVSDLDEPFLPFPSGLFVLPLKSCKQII  369 (861)
T ss_pred             hcchHHHHHHHHHhhccCCCCCCCcceEEEEEEcceeeEEecCCCCc-cceeeecccccccccCCcchhcccHHHHHHHH
Confidence            999999999999999999975 7899999999999999999998874 38999999999999998 67899999999999


Q ss_pred             HHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcCCEEEEEecCCCCCCcccccccCCcCcccCCCccccCCCCCcHHH
Q 001711          520 DTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLGGKLLIFQNSLPSLGVGCLKLRGDDLRVYGTDKEHSLRIPEDPFY  599 (1021)
Q Consensus       520 ~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~GGkIivF~sg~Pt~GpG~L~~r~~~~r~~gt~~e~~l~~pa~~fY  599 (1021)
                      +.||+.++.+|.+++.++.|+|+||++|..+++.+||||++|.+++||.|.|+|..|+|        +|+.++.+.+.||
T Consensus       370 etLl~~~~~If~d~~~pk~~~G~aLk~a~~l~g~~GGkii~~~stlPn~G~Gkl~~r~d--------~e~~ll~c~d~fY  441 (861)
T COG5028         370 ETLLDRVPRIFQDNKSPKNALGPALKAAKSLIGGTGGKIIVFLSTLPNMGIGKLQLRED--------KESSLLSCKDSFY  441 (861)
T ss_pred             HHHHHHhhhhhcccCCCccccCHHHHHHHHHhhccCceEEEEeecCCCccccccccccc--------chhhhccccchHH
Confidence            99999999999999999999999999999999999999999999999999999999865        6777999999999


Q ss_pred             HHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCch--hHHHHHHHHHHhcccccccceEEEEEeC
Q 001711          600 KQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTT--HGERLRHELSRDLTRETAWEAVMRIRCG  677 (1021)
Q Consensus       600 ~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~--d~~kl~~dL~r~ltr~~g~~a~mrVR~S  677 (1021)
                      |++|.+|++.||+||+|+++.+|+|+||++.|+++|||++|||++|+..+  |..||.+||.+++++++||+++||||||
T Consensus       442 k~~a~e~~k~gIsvd~Flt~~~yidvaTls~l~~~T~G~~~~Yp~f~~~~~~d~~kl~~dL~~~ls~~~gy~~~~rvR~S  521 (861)
T COG5028         442 KEFAIECSKVGISVDLFLTSEDYIDVATLSHLCRYTGGQTYFYPNFSATRPNDATKLANDLVSHLSMEIGYEAVMRVRCS  521 (861)
T ss_pred             HHHHHHHHHhcceEEEEeccccccchhhhcchhhccCcceEEcCCcccCCchhHHHHHHHHHHhhhhhhhhheeeEeecc
Confidence            99999999999999999999999999999999999999999999999998  9999999999999999999999999999


Q ss_pred             CCeEEEeeecCcccCCCCceeeccCCCCCcEEEEEEeccccCCCceeEEEEEEEEEecCCcEEEEEEeeeecccCCHHHH
Q 001711          678 KGVRFTNYHGNFMLRSTDLLALPAVDCDKAYAMQLSLEETLLTTQTVYFQVALLYTASCGERRIRVHTLAAPVVSNLSDM  757 (1021)
Q Consensus       678 ~Gl~V~~~~Gnf~~rs~~~~~l~~id~d~Sia~~~~~d~~l~~~~~~~iQ~AllYTt~~GeRrIRV~Tl~lpvt~~l~~v  757 (1021)
                      +|+++++|||||+.|+.++++|+.++.|+|+.|+|++|+++.. ..+|||+|+|||+.+|||||||.|+++++++++.|+
T Consensus       522 ~glr~s~fyGnf~~rs~dl~~F~tm~rd~Sl~~~~sid~~l~~-~~v~fQvAlL~T~~~GeRRiRVvn~s~~~ss~~~ev  600 (861)
T COG5028         522 TGLRVSSFYGNFFNRSSDLCAFSTMPRDTSLLVEFSIDEKLMT-SDVYFQVALLYTLNDGERRIRVVNLSLPTSSSIREV  600 (861)
T ss_pred             CceehhhhhccccccCcccccccccCCCceEEEEEEecccccC-CceEEEEEEEeeccCCceEEEEEEeccccchhHHHH
Confidence            9999999999999999999999999999999999999999976 899999999999999999999999999999999999


Q ss_pred             HHhcCHhHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCccccccccccHHHHHHHhhhccCC
Q 001711          758 YQQADTGAIVSVFSRLAIEKTLSHKLEDARNAVQLRLVKALKEYRNLYAVQHRLGSRMIYPESLKFLPLYCLAICKSTPI  837 (1021)
Q Consensus       758 f~s~D~eai~~~laK~a~~~~l~~~l~d~R~~l~~~lv~iL~~YRk~~~a~~~~~~qLiLPesLklLPlyil~LlKS~~L  837 (1021)
                      |+++|+++|+.+|+|+|+.++....++++|+.|.+++++||++||| .|+....++||+||++||+||+++++|+||.+|
T Consensus       601 yasadq~aIa~~lak~a~~~~~~~s~~~~r~~i~~s~~~IL~~Ykk-~~~~snt~tql~Lp~nL~lLPll~lal~Ks~~~  679 (861)
T COG5028         601 YASADQLAIACILAKKASTKALNSSLKEARVLINKSMVDILKAYKK-ELVKSNTSTQLPLPANLKLLPLLMLALLKSSAF  679 (861)
T ss_pred             HHhccHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHH-HHhhccCCccccchhhhHHHHHHHHHHhhhccc
Confidence            9999999999999999999999999999999999999999999999 888888899999999999999999999999999


Q ss_pred             CCCCCCCChhHHHHHHHHHcCCCHHHHHhhhcccEEEeecCCCCCCccCCcccccccccccchhhccCCcEEEEECCcee
Q 001711          838 RGGYADVTLDERCAAGYTMMALPVKKLLKLLYPCLIRVDEHLLKPSAQLDEYKNIMKRLPLVAESLDSRGLYIFDDGFRF  917 (1021)
Q Consensus       838 r~g~~~~s~DeR~~~~~~l~s~~v~~~~~~lYPrL~~lh~~~~~~~~~~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i  917 (1021)
                      |.  ..++.|.|+++++++.+++++++++.|||+||++|++..+....+++...++.+|++|.+.|+++|+||||+|.++
T Consensus       680 rs--~~~~sD~r~~~L~~l~~~p~~~l~~~iYP~lyalHdm~~e~~l~~~~~~~~~~piNaT~s~le~~GlYLidtg~~i  757 (861)
T COG5028         680 RS--GSTPSDIRISALNRLTSLPLKQLMRNIYPTLYALHDMPIEAGLPDEGLLVLPSPINATSSLLESGGLYLIDTGQKI  757 (861)
T ss_pred             cc--CCCccchhHHHHHHhhcCCHHHHHHhhccceeeecccccccCCCcccccccccchhhhHHHHhcCCeEEEEcCCEE
Confidence            95  6789999999999999999999999999999999999643322123456789999999999999999999999999


Q ss_pred             EEEecCCCCHHHHHhhcCCchhhhh--ccccccccchHHHHHHHHHHHHHHH-hCCCCCceEEEeccCCCcchHHHHHhh
Q 001711          918 VLWFGRMLSPDIAMNLLGSEFAAEL--SKVMLREQDNEMSRKLLGILKKLRE-QDPSYYQLCQLVRQGEQPREGFLLLAN  994 (1021)
Q Consensus       918 ~lwvG~~v~~~ll~~lFgv~~~~~l--~~~~lp~~~n~~s~~l~~ii~~lr~-~r~~~~~l~~vvrqg~~~~~e~~f~~~  994 (1021)
                      |||+|+++++.|++|+||++++.+|  .+.++|+.+|++++++++||++||+ .+...+++ ++||+|.++..+.||.++
T Consensus       758 flw~g~d~~p~Ll~dlf~~~~~~~I~~~k~~~p~~~n~~n~~v~~iI~~lrs~~~~~tl~l-vlVR~~~d~s~~~~~~s~  836 (861)
T COG5028         758 FLWFGKDAVPSLLQDLFGVDSLSDIPSGKFTLPPTGNEFNERVRNIIGELRSVNDDSTLPL-VLVRGGGDPSLRLWFFST  836 (861)
T ss_pred             EEEecCCCCHHHHHHhcCcchhhhccccccccCCcCCHHHHHHHHHHHHHHhhCCCCccce-EEEecCCCcchhhheehh
Confidence            9999999999999999999999999  7889999999999999999999999 56777887 999998777668999999


Q ss_pred             ccccCCCCCCCHHHHHHHHHHHHhc
Q 001711          995 LVEDQIGGSNGYADWIMQIHRQVLQ 1019 (1021)
Q Consensus       995 LVED~~~~~~SY~dFL~~lh~~I~~ 1019 (1021)
                      ||||++.+..||.|||+.||++|+.
T Consensus       837 lVEDk~~n~~SY~~yL~~lh~ki~~  861 (861)
T COG5028         837 LVEDKTLNIPSYLDYLQILHEKIKS  861 (861)
T ss_pred             eecccccCCccHHHHHHHHHHHhcC
Confidence            9999999999999999999999974


No 5  
>PLN00162 transport protein sec23; Provisional
Probab=100.00  E-value=1.6e-120  Score=1115.31  Aligned_cols=656  Identities=20%  Similarity=0.283  Sum_probs=584.1

Q ss_pred             CCCCceeccccccCCCHHHHhhcCCceEEEEccCCCCCCC-----CC---CccceEEccceeEecCCceEEEcCCCCCCC
Q 001711          312 CHSRYLRLTTSAIPNSQSLVSRWHLPLGAVVCPLAEPPEG-----NL---FICRTYVNPYVTFTDAGRKWRCNICALLND  383 (1021)
Q Consensus       312 ~~P~y~R~T~~~iP~t~~l~~~~~lPlg~vv~Pfa~~~~~-----e~---~rCrAYiNPf~~f~~~g~~W~Cn~C~~~N~  383 (1021)
                      -+-++||+|||+||+|+.++++++|||||+|+||++..+.     +.   ++|||||||||+|+++|++|+||||+..|+
T Consensus         7 e~~~gvR~s~n~~P~t~~~~~~~~iPlg~v~tPl~~~~~vp~v~~~pvRC~~CraylNPf~~~d~~~~~W~C~~C~~~N~   86 (761)
T PLN00162          7 EAIDGVRMSWNVWPSSKIEASKCVIPLAALYTPLKPLPELPVLPYDPLRCRTCRAVLNPYCRVDFQAKIWICPFCFQRNH   86 (761)
T ss_pred             cccCceEeeeecCCCCHHHHhcCCCCeEEEEecCCcCCCCCcCCCCCCccCCCcCEECCceEEecCCCEEEccCCCCCCC
Confidence            3457999999999999999999999999999999875432     11   899999999999999999999999999999


Q ss_pred             CCcccccccCcCcccCCCCCCCcc--ccccEEEEccccccCCCCCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCC
Q 001711          384 VPGDYFAHLDATGRRIDIDQRPEL--TKGSVEFVAPTEYMVRPPMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELP  461 (1021)
Q Consensus       384 vP~~Y~~~l~~~g~R~D~~~rPEL--~~gtVEfvap~eY~~r~p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp  461 (1021)
                      +|.+|+ +++++      +.+|||  .++||||++|+ |+.+++.||+|+||||+|..+++   ++.++++|+.+|+.||
T Consensus        87 ~P~~Y~-~~~~~------~~p~EL~p~~~TvEY~~p~-~~~~~~~pp~fvFvID~s~~~~~---l~~lk~sl~~~L~~LP  155 (761)
T PLN00162         87 FPPHYS-SISET------NLPAELFPQYTTVEYTLPP-GSGGAPSPPVFVFVVDTCMIEEE---LGALKSALLQAIALLP  155 (761)
T ss_pred             CchHhc-ccCcc------CCChhhcCCceeEEEECCC-CCCCCCCCcEEEEEEecchhHHH---HHHHHHHHHHHHHhCC
Confidence            999997 44433      478999  89999999998 99999999999999999999987   6667899999999999


Q ss_pred             CCCCceEEEEEEcCeEEEEecCCCCCCcceeecc--------cccc----------------------ccCCCCCcccee
Q 001711          462 GFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVIS--------DLDD----------------------IFVPLPDDLLVN  511 (1021)
Q Consensus       462 ~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvs--------Dldd----------------------~f~Pl~~~lLv~  511 (1021)
                      ++  ++|||||||++||||+|+.+. .++++|+.        |++|                      .|+|..++||++
T Consensus       156 ~~--a~VGlITF~s~V~~~~L~~~~-~~~~~Vf~g~k~~t~~~l~~~l~l~~~~~~~~~~~~~~~~~~~~~p~~~~fLvp  232 (761)
T PLN00162        156 EN--ALVGLITFGTHVHVHELGFSE-CSKSYVFRGNKEVSKDQILEQLGLGGKKRRPAGGGIAGARDGLSSSGVNRFLLP  232 (761)
T ss_pred             CC--CEEEEEEECCEEEEEEcCCCC-CcceEEecCCccCCHHHHHHHhccccccccccccccccccccccCCCccceeEE
Confidence            76  999999999999999998653 67777775        2322                      234567899999


Q ss_pred             hhhhHHHHHHHHhhCCCcc---cCCCCcccchHHHHHHHHHHHH----hcCCEEEEEecCCCCCCcccccccC--CcCcc
Q 001711          512 LSESRSVVDTLLDSLPSMF---QDNMNVESAFGPALKAAFMVMS----RLGGKLLIFQNSLPSLGVGCLKLRG--DDLRV  582 (1021)
Q Consensus       512 l~es~~~I~~lLd~Lp~~f---~~~~~~~~alG~AL~aA~~lL~----~~GGkIivF~sg~Pt~GpG~L~~r~--~~~r~  582 (1021)
                      ++||+..|+++||+|+.++   .+++++++|+|+||++|..+|+    .+||||++|++|+||.|||+|+.|+  +..|.
T Consensus       233 l~e~~~~i~~lLe~L~~~~~~~~~~~rp~r~tG~AL~vA~~lL~~~~~~~gGrI~~F~sgppT~GpG~v~~r~~~~~~rs  312 (761)
T PLN00162        233 ASECEFTLNSALEELQKDPWPVPPGHRPARCTGAALSVAAGLLGACVPGTGARIMAFVGGPCTEGPGAIVSKDLSEPIRS  312 (761)
T ss_pred             HHHHHHHHHHHHHhhhccccccCCCCCCCccHHHHHHHHHHHHhhccCCCceEEEEEeCCCCCCCCceeecccccccccC
Confidence            9999999999999998763   6778899999999999999998    5799999999999999999999885  34555


Q ss_pred             cCC--CccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHH
Q 001711          583 YGT--DKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSR  660 (1021)
Q Consensus       583 ~gt--~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r  660 (1021)
                      +.+  +++.++++++.+||++||.+|+++||+||||+++.+|+||++|+.|++.|||.+++|++|+.    ++|.++|+|
T Consensus       313 h~di~k~~~~~~~~a~~fY~~la~~~~~~gisvDlF~~s~dqvglaem~~l~~~TGG~v~~~~sF~~----~~f~~~l~r  388 (761)
T PLN00162        313 HKDLDKDAAPYYKKAVKFYEGLAKQLVAQGHVLDVFACSLDQVGVAEMKVAVERTGGLVVLAESFGH----SVFKDSLRR  388 (761)
T ss_pred             ccccccchhhhcchHHHHHHHHHHHHHHcCceEEEEEccccccCHHHHhhhHhhcCcEEEEeCCcCh----HHHHHHHHH
Confidence            542  45567999999999999999999999999999999999999999999999999999999976    578888898


Q ss_pred             hcccc------cccceEEEEEeCCCeEEEeeecCccc---------------CCCCceeeccCCCCCcEEEEEEecccc-
Q 001711          661 DLTRE------TAWEAVMRIRCGKGVRFTNYHGNFML---------------RSTDLLALPAVDCDKAYAMQLSLEETL-  718 (1021)
Q Consensus       661 ~ltr~------~g~~a~mrVR~S~Gl~V~~~~Gnf~~---------------rs~~~~~l~~id~d~Sia~~~~~d~~l-  718 (1021)
                      .++|+      +||+|+||||||+||+|+++||||+.               +++++|+++++++|+||+|+|+++++. 
T Consensus       389 ~~~r~~~~~~~~gf~a~~~VrtS~glkv~g~~G~~~s~~~~~~~vsd~~iG~g~T~~w~l~~l~~~~t~av~f~~~~~~~  468 (761)
T PLN00162        389 VFERDGEGSLGLSFNGTFEVNCSKDVKVQGAIGPCASLEKKGPSVSDTEIGEGGTTAWKLCGLDKKTSLAVFFEVANSGQ  468 (761)
T ss_pred             HhcccccccccccceeEEEEEecCCeEEeeeEcCcccccccCCccccccccCCCCceeeecCcCcCCEEEEEEEEccccc
Confidence            88864      79999999999999999999999862               457889999999999999999998765 


Q ss_pred             ----CCCceeEEEEEEEEEecCCcEEEEEEeeeecccC--CHHHHHHhcCHhHHHHHHHHHHHHHHhcCCHHHHHHHHHH
Q 001711          719 ----LTTQTVYFQVALLYTASCGERRIRVHTLAAPVVS--NLSDMYQQADTGAIVSVFSRLAIEKTLSHKLEDARNAVQL  792 (1021)
Q Consensus       719 ----~~~~~~~iQ~AllYTt~~GeRrIRV~Tl~lpvt~--~l~~vf~s~D~eai~~~laK~a~~~~l~~~l~d~R~~l~~  792 (1021)
                          .++..+|||+|++||+.+|||||||||++++++.  ++.++|+++|+||++++|+|+|+.+++++++.|+|++|++
T Consensus       469 ~~~~~~~~~~~iQ~a~lYt~~~G~rRiRV~T~~~~~~~~~~~~~v~~~fDqeA~a~llaR~av~k~~~~~~~d~~r~ld~  548 (761)
T PLN00162        469 SNPQPPGQQFFLQFLTRYQHSNGQTRLRVTTVTRRWVEGSSSEELVAGFDQEAAAVVMARLASHKMETEEEFDATRWLDR  548 (761)
T ss_pred             cCCCCCCceEEEEEEEEEEcCCCCEEEEEEccccCccCCCCHHHHHHhcCHHHHHHHHHHHHHHHHhhCCHHHHHHHHHH
Confidence                4557899999999999999999999999999654  8899999999999999999999999999999999999999


Q ss_pred             HHHHHH---HHHHhhhhhccCCCCccccccccccHHHHHHHhhhccCCCCCCCCCChhHHHHHHHHHcCCCHHHHHhhhc
Q 001711          793 RLVKAL---KEYRNLYAVQHRLGSRMIYPESLKFLPLYCLAICKSTPIRGGYADVTLDERCAAGYTMMALPVKKLLKLLY  869 (1021)
Q Consensus       793 ~lv~iL---~~YRk~~~a~~~~~~qLiLPesLklLPlyil~LlKS~~Lr~g~~~~s~DeR~~~~~~l~s~~v~~~~~~lY  869 (1021)
                      +|++++   ..||| .+     +.+|+||++||+||+|||+|+||.+|+.  .++++|||++++++++++++.+++.|||
T Consensus       549 ~li~~~~~f~~Yrk-~~-----~~s~~Lp~~~~~lP~f~~~LrRS~~l~~--~n~spDera~~r~~l~~~~~~~sl~mI~  620 (761)
T PLN00162        549 ALIRLCSKFGDYRK-DD-----PSSFRLSPNFSLYPQFMFNLRRSQFVQV--FNNSPDETAYFRMMLNRENVTNSLVMIQ  620 (761)
T ss_pred             HHHHHHHHHhhhcc-cC-----CccccCCHHHHHHHHHHHHHhhhhhccC--CCCCchHHHHHHHHHhcCCHHHHHHhhC
Confidence            999874   67888 44     3469999999999999999999999995  7899999999999999999999999999


Q ss_pred             ccEEEeecCCCCCCccCCcccccccccccchhhccCCcEEEEECCceeEEEecCCCCHHHHHhhcCCchhhhhccccccc
Q 001711          870 PCLIRVDEHLLKPSAQLDEYKNIMKRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPDIAMNLLGSEFAAELSKVMLRE  949 (1021)
Q Consensus       870 PrL~~lh~~~~~~~~~~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~ll~~lFgv~~~~~l~~~~lp~  949 (1021)
                      |+||++|.-            .+|+++.|+.++|++|||||||+|++++||+|+.+.+|..+.+...           |+
T Consensus       621 P~L~sy~~~------------~~P~pv~Ld~~si~~d~ilLLD~~f~vvi~~G~~ia~w~~~~~~~~-----------~~  677 (761)
T PLN00162        621 PTLISYSFN------------GPPEPVLLDVASIAADRILLLDSYFSVVIFHGSTIAQWRKAGYHNQ-----------PE  677 (761)
T ss_pred             CeEEEecCC------------CCCcceecchhhccCCceEEEeCCCEEEEEecCcccchhhcCCCCC-----------cc
Confidence            999999831            1377899999999999999999999999999999999999888876           44


Q ss_pred             cch--HHHHHHHHHHHHHHHhCCCCCceEEEeccCCCcchHHHHHhhccccCC--------------CCCCCHHHHHHHH
Q 001711          950 QDN--EMSRKLLGILKKLREQDPSYYQLCQLVRQGEQPREGFLLLANLVEDQI--------------GGSNGYADWIMQI 1013 (1021)
Q Consensus       950 ~~n--~~s~~l~~ii~~lr~~r~~~~~l~~vvrqg~~~~~e~~f~~~LVED~~--------------~~~~SY~dFL~~l 1013 (1021)
                      +++  ++.+..++.+++|.+.|.+.+++ +++.||.++  .++++++|---.+              -++.|+..|+.||
T Consensus       678 ~~~~~~~l~~p~~~a~~~~~~Rfp~Pr~-i~~~~~~Sq--aRfl~~klnPs~~~~~~~~~~~~~~~~tdd~sl~~f~~~l  754 (761)
T PLN00162        678 HEAFAQLLEAPQADAQAIIKERFPVPRL-VVCDQHGSQ--ARFLLAKLNPSATYNSANAMGGSDIIFTDDVSLQVFMEHL  754 (761)
T ss_pred             hhhHHHHHHhHHHHHHHHHhcCCCCCeE-EEeCCCCcH--HHHHHHhcCCcccccCCCCCCCCCeeecCCcCHHHHHHHH
Confidence            442  67778888999999999999998 999999988  8888888875411              1579999999999


Q ss_pred             HHHHhc
Q 001711         1014 HRQVLQ 1019 (1021)
Q Consensus      1014 h~~I~~ 1019 (1021)
                      +|.+.+
T Consensus       755 ~~~~v~  760 (761)
T PLN00162        755 QRLAVQ  760 (761)
T ss_pred             HHHhcC
Confidence            998764


No 6  
>KOG1986 consensus Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking, secretion, and vesicular transport]
Probab=100.00  E-value=4e-90  Score=791.54  Aligned_cols=653  Identities=19%  Similarity=0.290  Sum_probs=563.8

Q ss_pred             CCCCceeccccccCCCHHHHhhcCCceEEEEccCCCCCCC-----CC---CccceEEccceeEecCCceEEEcCCCCCCC
Q 001711          312 CHSRYLRLTTSAIPNSQSLVSRWHLPLGAVVCPLAEPPEG-----NL---FICRTYVNPYVTFTDAGRKWRCNICALLND  383 (1021)
Q Consensus       312 ~~P~y~R~T~~~iP~t~~l~~~~~lPlg~vv~Pfa~~~~~-----e~---~rCrAYiNPf~~f~~~g~~W~Cn~C~~~N~  383 (1021)
                      -.-+.+|+|||.+|.++....++.+|++++++||.+.+..     +.   ++|+||+||||.++.+.+.|.|+||+..|.
T Consensus         7 e~~dGvR~twnvwPs~~~~~~~~vvPla~lytPl~e~~~~~~~~y~P~~C~~C~AvlNPyc~vd~~a~~W~CpfC~qrN~   86 (745)
T KOG1986|consen    7 EEIDGVRFTWNVWPSTRAEASRTVVPLACLYTPLKERPDLPPIQYDPLRCSKCGAVLNPYCSVDFRAKSWICPFCNQRNP   86 (745)
T ss_pred             ccCCCcccccccCCCcccccccccccHHHhccccccCCCCCccCCCCchhccchhhcCcceeecccCceEeccccccCCC
Confidence            3446899999999999999999999999999999975541     12   889999999999999999999999999999


Q ss_pred             CCcccccccCcCcccCCCCCCCcc--ccccEEEEccccccCCCCCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCC
Q 001711          384 VPGDYFAHLDATGRRIDIDQRPEL--TKGSVEFVAPTEYMVRPPMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELP  461 (1021)
Q Consensus       384 vP~~Y~~~l~~~g~R~D~~~rPEL--~~gtVEfvap~eY~~r~p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp  461 (1021)
                      +|.+|-. +.++      +..+||  ...+|||+.++..    ..||+|+||||+|....+   ++.++++|+.+|+.||
T Consensus        87 ~p~~Y~~-is~~------n~P~el~Pq~stvEy~l~~~~----~~ppvf~fVvDtc~~eee---L~~LkssL~~~l~lLP  152 (745)
T KOG1986|consen   87 FPPHYSG-ISEN------NLPPELLPQYSTVEYTLSPGR----VSPPVFVFVVDTCMDEEE---LQALKSSLKQSLSLLP  152 (745)
T ss_pred             CChhhcc-cCcc------CCChhhcCCcceeEEecCCCC----CCCceEEEEEeeccChHH---HHHHHHHHHHHHhhCC
Confidence            9999853 3332      466688  7999999998653    358999999999999866   8999999999999999


Q ss_pred             CCCCceEEEEEEcCeEEEEecCCCCCCcceeecc---c-----ccccc------------CCCCCccceehhhhHHHHHH
Q 001711          462 GFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVIS---D-----LDDIF------------VPLPDDLLVNLSESRSVVDT  521 (1021)
Q Consensus       462 ~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvs---D-----ldd~f------------~Pl~~~lLv~l~es~~~I~~  521 (1021)
                      ++  +.||||||++.||+|+|+... ..+..|..   |     +.|..            -.....||.++.+|...+.+
T Consensus       153 ~~--alvGlItfg~~v~v~el~~~~-~sk~~VF~G~ke~s~~q~~~~L~~~~~~~~~~~~~~~~~rFL~P~~~c~~~L~~  229 (745)
T KOG1986|consen  153 EN--ALVGLITFGTMVQVHELGFEE-CSKSYVFSGNKEYSAKQLLDLLGLSGGAGKGSENQSASNRFLLPAQECEFKLTN  229 (745)
T ss_pred             Cc--ceEEEEEecceEEEEEcCCCc-ccceeEEeccccccHHHHHHHhcCCcccccCCcccccchhhhccHHHHHHHHHH
Confidence            87  999999999999999998642 22333432   1     11111            00124799999999999999


Q ss_pred             HHhhCC---CcccCCCCcccchHHHHHHHHHHHHh----cCCEEEEEecCCCCCCcccccccC--CcCcccC--CCcccc
Q 001711          522 LLDSLP---SMFQDNMNVESAFGPALKAAFMVMSR----LGGKLLIFQNSLPSLGVGCLKLRG--DDLRVYG--TDKEHS  590 (1021)
Q Consensus       522 lLd~Lp---~~f~~~~~~~~alG~AL~aA~~lL~~----~GGkIivF~sg~Pt~GpG~L~~r~--~~~r~~g--t~~e~~  590 (1021)
                      +|++|.   +.....+++.||+|.||.+|+.+|+.    +|+||++|++|+||.|||++..+|  +.+|.+.  .++...
T Consensus       230 lle~L~~d~wpV~~g~Rp~RcTG~Al~iA~~Ll~~c~p~~g~rIv~f~gGPcT~GpG~vv~~el~~piRshhdi~~d~a~  309 (745)
T KOG1986|consen  230 LLEELQPDPWPVPPGHRPLRCTGVALSIASGLLEGCFPNTGARIVLFAGGPCTRGPGTVVSRELKEPIRSHHDIEKDNAP  309 (745)
T ss_pred             HHHHhcCCCCCCCCCCCcccchhHHHHHHHHHhcccCCCCcceEEEeccCCCCcCCceecchhhcCCCcCcccccCcchH
Confidence            999994   56677899999999999999999986    699999999999999999999885  5677776  455667


Q ss_pred             CCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHHhcc------c
Q 001711          591 LRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSRDLT------R  664 (1021)
Q Consensus       591 l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r~lt------r  664 (1021)
                      +++.+.+||++||++++.+|++||+|+++.++++|++|..|++.|||.+...++|+.+.++.    .++|.++      .
T Consensus       310 y~kKa~KfY~~La~r~~~~ghvlDifa~~lDQvGi~EMk~l~~~TGG~lvl~dsF~~s~Fk~----sfqR~f~~d~~~~l  385 (745)
T KOG1986|consen  310 YYKKAIKFYEKLAERLANQGHVLDIFAAALDQVGILEMKPLVESTGGVLVLGDSFNTSIFKQ----SFQRIFTRDGEGDL  385 (745)
T ss_pred             HHHHHHHHHHHHHHHHHhCCceEeeeeeeccccchHHHHHHhhcCCcEEEEecccchHHHHH----HHHHHhccccccch
Confidence            88999999999999999999999999999999999999999999999999999998865544    4555555      4


Q ss_pred             ccccceEEEEEeCCCeEEEeeecCccc---------------CCCCceeeccCCCCCcEEEEEEeccc--cCCCceeEEE
Q 001711          665 ETAWEAVMRIRCGKGVRFTNYHGNFML---------------RSTDLLALPAVDCDKAYAMQLSLEET--LLTTQTVYFQ  727 (1021)
Q Consensus       665 ~~g~~a~mrVR~S~Gl~V~~~~Gnf~~---------------rs~~~~~l~~id~d~Sia~~~~~d~~--l~~~~~~~iQ  727 (1021)
                      ..||+|.|+|+||++|+|++.+|++..               +++..|++..++..+++++.|++..+  +.....+|||
T Consensus       386 ~~~fn~~leV~tSkdlkI~g~IGp~~Sl~~k~~~vsdt~ig~g~t~~wkm~~ls~~t~~s~~fei~~~~~~~~~~~~~iQ  465 (745)
T KOG1986|consen  386 KMGFNGTLEVKTSKDLKIQGVIGPCVSLNKKGPNVSDTEIGEGNTSAWKMCGLSPSTTLSLFFEISNQHNIPQSGQGYIQ  465 (745)
T ss_pred             hhhcCceEEEEecCCcEEEecccccccccCCCCccccceeccccccceeeeccCCCceEEEEEEeccccCCCCCCeeEEE
Confidence            689999999999999999999998651               35678999999999999999998643  3345789999


Q ss_pred             EEEEEEecCCcEEEEEEeeeecccCCH-HHHHHhcCHhHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHH---HHHh
Q 001711          728 VALLYTASCGERRIRVHTLAAPVVSNL-SDMYQQADTGAIVSVFSRLAIEKTLSHKLEDARNAVQLRLVKALK---EYRN  803 (1021)
Q Consensus       728 ~AllYTt~~GeRrIRV~Tl~lpvt~~l-~~vf~s~D~eai~~~laK~a~~~~l~~~l~d~R~~l~~~lv~iL~---~YRk  803 (1021)
                      |++.|.+.+|++||||+|++.+.++.. .++-.++|+||.++++||+++.++.++...|++.++++.++++..   .|+|
T Consensus       466 FiT~Yq~s~g~~riRVtT~~r~~~d~~~~~i~~~FDqEaaAV~mAR~~~~kae~e~~~d~~rwlDr~Lirlc~kFg~y~k  545 (745)
T KOG1986|consen  466 FITQYQHSSGQKRIRVTTLARPWADSGSPEISQSFDQEAAAVLMARLALLKAETEDGPDVLRWLDRNLIRLCQKFGDYRK  545 (745)
T ss_pred             EEEEEEcCCCcEEEEEEEeehhhccccchHhhhccchHHHHHHHHHHHHHhhhccccchHHHHHHHHHHHHHHHHhccCC
Confidence            999999999999999999999999987 588899999999999999999999999888999999999988854   5666


Q ss_pred             hhhhccCCCCccccccccccHHHHHHHhhhccCCCCCCCCCChhHHHHHHHHHcCCCHHHHHhhhcccEEEeecCCCCCC
Q 001711          804 LYAVQHRLGSRMIYPESLKFLPLYCLAICKSTPIRGGYADVTLDERCAAGYTMMALPVKKLLKLLYPCLIRVDEHLLKPS  883 (1021)
Q Consensus       804 ~~~a~~~~~~qLiLPesLklLPlyil~LlKS~~Lr~g~~~~s~DeR~~~~~~l~s~~v~~~~~~lYPrL~~lh~~~~~~~  883 (1021)
                            ..+..+.|+++|.++|.|||+|+||+.|.-  .+.|+|||+|++|+|.+.++.+++.||.|+|++++..     
T Consensus       546 ------~dPssf~l~~~fsl~PQfmfhLRRS~fLqv--fNnSPDEt~~yrhll~~e~v~~sliMIqP~L~sySf~-----  612 (745)
T KOG1986|consen  546 ------DDPSSFRLSPNFSLYPQFMFHLRRSPFLQV--FNNSPDETAYYRHLLNREDVDNSLIMIQPTLLSYSFN-----  612 (745)
T ss_pred             ------CCchhhcCChhhhhhHHHHHhhccchhhhc--cCCCcchHHHHHHHHhhccchhhhheecceeeeeecC-----
Confidence                  455679999999999999999999999994  8999999999999999999999999999999999853     


Q ss_pred             ccCCcccccccccccchhhccCCcEEEEECCceeEEEecCCCCHHHHHhhcCCchhhhhccccccccch--HHHHHHHHH
Q 001711          884 AQLDEYKNIMKRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPDIAMNLLGSEFAAELSKVMLREQDN--EMSRKLLGI  961 (1021)
Q Consensus       884 ~~~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~ll~~lFgv~~~~~l~~~~lp~~~n--~~s~~l~~i  961 (1021)
                         .    -|+++.|+..+|.+|.|+|||+++.|+||.|..+..|...++...           ||+++  ++.+..++.
T Consensus       613 ---g----~~epvlLD~~Si~~D~iLLlDt~f~i~i~hG~tIaqWR~~gy~~~-----------pe~~~f~~LL~ap~~d  674 (745)
T KOG1986|consen  613 ---G----PPEPVLLDVASILADRILLLDTYFTIVIFHGSTIAQWRKAGYHEQ-----------PEYENFKELLEAPRED  674 (745)
T ss_pred             ---C----CCceeEecccccCCceEEEeecceEEEEECCchHHHHHhcccccC-----------hhhHHHHHHHHhHHHH
Confidence               1    156789999999999999999999999999999999999888876           55653  788888999


Q ss_pred             HHHHHHhCCCCCceEEEeccCCCcchHHHHHhhccccC--C------------CCCCCHHHHHHHHHHHHhc
Q 001711          962 LKKLREQDPSYYQLCQLVRQGEQPREGFLLLANLVEDQ--I------------GGSNGYADWIMQIHRQVLQ 1019 (1021)
Q Consensus       962 i~~lr~~r~~~~~l~~vvrqg~~~~~e~~f~~~LVED~--~------------~~~~SY~dFL~~lh~~I~~ 1019 (1021)
                      +++|-..|.+.+++ ++++||.++  ..++++++.--.  +            -+++||.+|+.||+|.+..
T Consensus       675 A~el~~~RFP~PR~-v~~~q~GSQ--ARFLlsklnPS~t~~~~~~~~~s~~I~TDDvSlq~fm~hLkklav~  743 (745)
T KOG1986|consen  675 AQELLLERFPMPRY-VVTDQGGSQ--ARFLLSKLNPSETHNNLTAHGGSSIILTDDVSLQVFMEHLKKLAVS  743 (745)
T ss_pred             HHHHHHhhCCCCeE-EEecCCccH--HHhhhhhcCcchhccchhhccCCCeeeeccccHHHHHHHHHhhcCC
Confidence            99999999999998 999999877  677778877521  1            1579999999999987654


No 7  
>COG5047 SEC23 Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion]
Probab=100.00  E-value=9.8e-83  Score=712.42  Aligned_cols=661  Identities=17%  Similarity=0.279  Sum_probs=553.7

Q ss_pred             cCCCCceeccccccCCCHHHHhhcCCceEEEEccCCCCCCC-----CC---Cc-cceEEccceeEecCCceEEEcCCCCC
Q 001711          311 NCHSRYLRLTTSAIPNSQSLVSRWHLPLGAVVCPLAEPPEG-----NL---FI-CRTYVNPYVTFTDAGRKWRCNICALL  381 (1021)
Q Consensus       311 N~~P~y~R~T~~~iP~t~~l~~~~~lPlg~vv~Pfa~~~~~-----e~---~r-CrAYiNPf~~f~~~g~~W~Cn~C~~~  381 (1021)
                      +-+-+.||+|||++|.|+...+++.+|++|+|+||.+.+.-     +.   .. |+||+||||.++.+.+.|+|.||+..
T Consensus         6 iee~dgir~twnvfpat~~da~~~~iPia~lY~Pl~e~~~~~v~~yepv~C~~pC~avlnpyC~id~r~~~W~CpfCnqr   85 (755)
T COG5047           6 IEENDGIRLTWNVFPATRGDATRTVIPIACLYTPLHEDDALTVNYYEPVKCTAPCKAVLNPYCHIDERNQSWICPFCNQR   85 (755)
T ss_pred             hccccceEEEEecccCCccccccccccHHHhccccccccccCcccCCCceecccchhhcCcceeeccCCceEecceecCC
Confidence            34557899999999999999999999999999999987432     12   44 99999999999999999999999999


Q ss_pred             CCCCcccccccCcCcccCCCCCCCcc--ccccEEEEccccccCCCCCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhc
Q 001711          382 NDVPGDYFAHLDATGRRIDIDQRPEL--TKGSVEFVAPTEYMVRPPMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDE  459 (1021)
Q Consensus       382 N~vP~~Y~~~l~~~g~R~D~~~rPEL--~~gtVEfvap~eY~~r~p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~  459 (1021)
                      |.+|..|- .+.+      .+..+||  ++.||||+.+++    .-.||+|+||||++++..+   +.+++++|+.+|..
T Consensus        86 n~lp~qy~-~iS~------~~LplellpqssTiey~lskp----~~~ppvf~fvvD~~~D~e~---l~~Lkdslivslsl  151 (755)
T COG5047          86 NTLPPQYR-DISN------ANLPLELLPQSSTIEYTLSKP----VILPPVFFFVVDACCDEEE---LTALKDSLIVSLSL  151 (755)
T ss_pred             CCCChhhc-CCCc------ccCCccccCCCceEEEEccCC----ccCCceEEEEEEeecCHHH---HHHHHHHHHHHHhc
Confidence            99999884 3332      2566798  799999999875    3578999999999997766   89999999999999


Q ss_pred             CCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccc--------ccccc------CC-------------CCCccceeh
Q 001711          460 LPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISD--------LDDIF------VP-------------LPDDLLVNL  512 (1021)
Q Consensus       460 Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsD--------ldd~f------~P-------------l~~~lLv~l  512 (1021)
                      ||.+  +.||||||++.||+|.++... ..+-.|.+-        |+++.      .+             .+..|+.++
T Consensus       152 lppe--aLvglItygt~i~v~el~ae~-~~r~~VF~g~~eyt~~~L~~ll~~~~~~~~~~~es~is~~~~~~~~rFl~p~  228 (755)
T COG5047         152 LPPE--ALVGLITYGTSIQVHELNAEN-HRRSYVFSGNKEYTKENLQELLALSKPTKSGGFESKISGIGQFASSRFLLPT  228 (755)
T ss_pred             CCcc--ceeeEEEecceeEEEeccccc-cCcceeecchHHHHHHHHHHHhcccCCCCcchhhhhcccccccchhhhhccH
Confidence            9977  999999999999999997642 222233211        22211      11             123589999


Q ss_pred             hhhHHHHHHHHhhCC---CcccCCCCcccchHHHHHHHHHHHHh----cCCEEEEEecCCCCCCcccccccC--CcCccc
Q 001711          513 SESRSVVDTLLDSLP---SMFQDNMNVESAFGPALKAAFMVMSR----LGGKLLIFQNSLPSLGVGCLKLRG--DDLRVY  583 (1021)
Q Consensus       513 ~es~~~I~~lLd~Lp---~~f~~~~~~~~alG~AL~aA~~lL~~----~GGkIivF~sg~Pt~GpG~L~~r~--~~~r~~  583 (1021)
                      .+|...+.++||+|.   +.....+++.||+|+||.+|..+|+.    .|+||++|.+|+||.|||.|..+|  +.+|.+
T Consensus       229 q~ce~~L~n~le~L~pd~~~v~~~~Rp~RCTGsAl~ias~Ll~~~~p~~~~~i~lF~~GPcTvGpG~Vvs~elkEpmRsh  308 (755)
T COG5047         229 QQCEFKLLNILEQLQPDPWPVPAGKRPLRCTGSALNIASSLLEQCFPNAGCHIVLFAGGPCTVGPGTVVSTELKEPMRSH  308 (755)
T ss_pred             HHHHHHHHHHHHHhCCCCccCCCCCCCccccchhHHHHHHHHHhhccCcceeEEEEcCCCccccCceeeehhhccccccc
Confidence            999999999999994   45677899999999999999999986    699999999999999999999874  567766


Q ss_pred             C--CCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHHh
Q 001711          584 G--TDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSRD  661 (1021)
Q Consensus       584 g--t~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r~  661 (1021)
                      .  +.+..++.+++.+||+.||++.+.+|.++|+|+.+.++++|.+|..|...|||.+...++|+.+++...|.+-|.+.
T Consensus       309 H~ie~d~aqh~kka~KFY~~laeR~a~~gh~~DifagcldqIGI~eM~~L~~sTgg~lvlsdsF~t~ifkqSfqrif~~d  388 (755)
T COG5047         309 HDIESDSAQHSKKATKFYKGLAERVANQGHALDIFAGCLDQIGIMEMEPLTTSTGGALVLSDSFTTSIFKQSFQRIFNRD  388 (755)
T ss_pred             ccccccchhhccchHHHHHHHHHHHhccchhHHHHHHHHHhhhhhcchhhccCCcceEEEeccccHHHHHHHHHHHhCcC
Confidence            5  34446889999999999999999999999999999999999999999999999999999999887777766655543


Q ss_pred             ccc--ccccceEEEEEeCCCeEEEeeecCccc---------------CCCCceeeccCCCCCcEEEEEEeccccCC----
Q 001711          662 LTR--ETAWEAVMRIRCGKGVRFTNYHGNFML---------------RSTDLLALPAVDCDKAYAMQLSLEETLLT----  720 (1021)
Q Consensus       662 ltr--~~g~~a~mrVR~S~Gl~V~~~~Gnf~~---------------rs~~~~~l~~id~d~Sia~~~~~d~~l~~----  720 (1021)
                      -..  ..||+|.|+|.|||+|+|++.+|+...               ..++.|.++.+.+.+++++.|++...-..    
T Consensus       389 ~~g~l~~gfNa~m~V~TsKnl~~~g~ig~a~~~~k~~~ni~~~eigi~~t~swkm~slsPk~nyal~fei~~~~~~~~~~  468 (755)
T COG5047         389 SEGYLKMGFNANMEVKTSKNLKIKGLIGHAVSVKKKANNISDSEIGIGATNSWKMASLSPKSNYALYFEIALGAASGSAQ  468 (755)
T ss_pred             cccchhhhhccceeEeeccCceeeeeecceeeecccccccccccccccccccccccccCCCcceEEEEEeccccCCCccC
Confidence            222  479999999999999999999998541               24567999999999999999998643322    


Q ss_pred             -CceeEEEEEEEEEecCCcEEEEEEeeeecccCC-HHHHHHhcCHhHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHH-
Q 001711          721 -TQTVYFQVALLYTASCGERRIRVHTLAAPVVSN-LSDMYQQADTGAIVSVFSRLAIEKTLSHKLEDARNAVQLRLVKA-  797 (1021)
Q Consensus       721 -~~~~~iQ~AllYTt~~GeRrIRV~Tl~lpvt~~-l~~vf~s~D~eai~~~laK~a~~~~l~~~l~d~R~~l~~~lv~i-  797 (1021)
                       ...+|+|+...|.+++|.-||||.|++...++. ...+++++|+||.++++||+|+.++......|+-++++..++++ 
T Consensus       469 ~~~~a~iQfiT~yQhss~t~riRVtTvar~f~~~~~p~i~~SFdqEaaaV~~aR~a~~K~~~ed~~Dv~rw~dr~lirlc  548 (755)
T COG5047         469 RPAEAYIQFITTYQHSSGTYRIRVTTVARMFTDGGLPKINRSFDQEAAAVFMARIAAFKAETEDIIDVFRWIDRNLIRLC  548 (755)
T ss_pred             CcccchhhhhhhhhccCCcEEEEEeehhhhhccCCChhhhhcchhhHHHHHHHHHHHhhcccccchhHHHHHHHHHHHHH
Confidence             268999999999999999999999999777764 56688899999999999999999999888889888888876665 


Q ss_pred             --HHHHHhhhhhccCCCCccccccccccHHHHHHHhhhccCCCCCCCCCChhHHHHHHHHHcCCCHHHHHhhhcccEEEe
Q 001711          798 --LKEYRNLYAVQHRLGSRMIYPESLKFLPLYCLAICKSTPIRGGYADVTLDERCAAGYTMMALPVKKLLKLLYPCLIRV  875 (1021)
Q Consensus       798 --L~~YRk~~~a~~~~~~qLiLPesLklLPlyil~LlKS~~Lr~g~~~~s~DeR~~~~~~l~s~~v~~~~~~lYPrL~~l  875 (1021)
                        ++.|||      ..+..+.|+.++.++|.|||+|+||+.|.-  .+.++|||++++|.+.+.++.+++.|+.|.|.++
T Consensus       549 q~fa~y~k------~dpssfrl~~~f~lypqf~y~lrRSpfL~v--fNnSPDEt~fyrh~l~~~dv~~sLimiqPtL~Sy  620 (755)
T COG5047         549 QKFADYRK------DDPSSFRLDPNFTLYPQFMYHLRRSPFLSV--FNNSPDETAFYRHMLNNADVNDSLIMIQPTLQSY  620 (755)
T ss_pred             HHHHhcCC------CCchhhcCCcchhhhhHHHhhhhccceeec--cCCCcchHHHHHHHHhcccccchhhhhcchheee
Confidence              567777      456679999999999999999999999994  8999999999999999999999999999999999


Q ss_pred             ecCCCCCCccCCcccccccccccchhhccCCcEEEEECCceeEEEecCCCCHHHHHhhcCCchhhhhccccccccchHHH
Q 001711          876 DEHLLKPSAQLDEYKNIMKRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPDIAMNLLGSEFAAELSKVMLREQDNEMS  955 (1021)
Q Consensus       876 h~~~~~~~~~~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~ll~~lFgv~~~~~l~~~~lp~~~n~~s  955 (1021)
                      |...        +    ..++-|++-++++|-|+|+|++++|+||-|+.+..|.-..+.....+..+         .++.
T Consensus       621 s~~~--------~----~~pVlLDs~svkpdviLLlDtff~Ili~hG~~iaqwr~agyq~qpey~~l---------K~Ll  679 (755)
T COG5047         621 SFEK--------G----GVPVLLDSVSVKPDVILLLDTFFHILIFHGSYIAQWRNAGYQEQPEYLNL---------KELL  679 (755)
T ss_pred             eccC--------C----CceEEEeccccCCCeEEEeeceeEEEEECChHHHHHHhhhhhcCchhhhH---------HHHh
Confidence            9641        1    23578899999999999999999999999999999988877766333222         1455


Q ss_pred             HHHHHHHHHHHHhCCCCCceEEEeccCCCcchHHHHHhhccc-cCCC------------CCCCHHHHHHHHHHHHhcC
Q 001711          956 RKLLGILKKLREQDPSYYQLCQLVRQGEQPREGFLLLANLVE-DQIG------------GSNGYADWIMQIHRQVLQN 1020 (1021)
Q Consensus       956 ~~l~~ii~~lr~~r~~~~~l~~vvrqg~~~~~e~~f~~~LVE-D~~~------------~~~SY~dFL~~lh~~I~~k 1020 (1021)
                      +.-+-.+.++-..|.+.+++ ++++||.++  ..++++++.- |..+            +.++|.+|+.+|+|....|
T Consensus       680 ~~p~~ea~ell~dRfP~Prf-i~teqggSQ--aRfLlskinPsd~~~~~~~~~s~tilTddv~lq~fm~hl~~lav~~  754 (755)
T COG5047         680 EAPRLEAAELLQDRFPIPRF-IVTEQGGSQ--ARFLLSKINPSDITNKMSGGGSETILTDDVNLQKFMNHLRKLAVSK  754 (755)
T ss_pred             hchhhHHHHHHHhhCCCCeE-EEecCCccH--HHHHHhhcCccccccccccCccceeeecccCHHHHHHHHHHHhccC
Confidence            55555667777889999998 999999888  7778888875 2221            4699999999999976544


No 8  
>cd01479 Sec24-like Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 24 is very similar to Sec23. The Sec23 and Sec24 
Probab=100.00  E-value=4.5e-54  Score=466.04  Aligned_cols=241  Identities=56%  Similarity=0.965  Sum_probs=231.4

Q ss_pred             CCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCC-CCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCC
Q 001711          425 PMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGF-PRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVP  503 (1021)
Q Consensus       425 p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~-~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~P  503 (1021)
                      |+||+||||||||..++++|+++++|++|+++|+.||++ +|++|||||||+.||||+++...++++|++++|++|+|+|
T Consensus         1 p~pp~~~FvIDvs~~a~~~g~~~~~~~si~~~L~~lp~~~~~~~VgiITfd~~v~~y~l~~~~~~~q~~vv~dl~d~f~P   80 (244)
T cd01479           1 PQPAVYVFLIDVSYNAIKSGLLATACEALLSNLDNLPGDDPRTRVGFITFDSTLHFFNLKSSLEQPQMMVVSDLDDPFLP   80 (244)
T ss_pred             CCCCEEEEEEEccHHHHhhChHHHHHHHHHHHHHhcCCCCCCeEEEEEEECCeEEEEECCCCCCCCeEEEeeCcccccCC
Confidence            579999999999999999999999999999999999987 8999999999999999999998889999999999999999


Q ss_pred             CCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcCCEEEEEecCCCCCCcccccccCCcCccc
Q 001711          504 LPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLGGKLLIFQNSLPSLGVGCLKLRGDDLRVY  583 (1021)
Q Consensus       504 l~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~GGkIivF~sg~Pt~GpG~L~~r~~~~r~~  583 (1021)
                      ++++||++++|+++.|+++||+|+++|.+++++++|+|+||++|..+|+.+||||++|++|+||+|+|+|+.|++ .+..
T Consensus        81 ~~~~~lv~l~e~~~~i~~lL~~L~~~~~~~~~~~~c~G~Al~~A~~lL~~~GGkIi~f~s~~pt~GpG~l~~~~~-~~~~  159 (244)
T cd01479          81 LPDGLLVNLKESRQVIEDLLDQIPEMFQDTKETESALGPALQAAFLLLKETGGKIIVFQSSLPTLGAGKLKSRED-PKLL  159 (244)
T ss_pred             CCcceeecHHHHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHhcCCEEEEEeCCCCCcCCcccccCcc-cccc
Confidence            999999999999999999999999999999999999999999999999999999999999999999999999875 4567


Q ss_pred             CCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeC--CCCCchhHHHHHHHHHHh
Q 001711          584 GTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYP--SFQSTTHGERLRHELSRD  661 (1021)
Q Consensus       584 gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~--~F~~~~d~~kl~~dL~r~  661 (1021)
                      ++++|+++++++++||++||.+|+++||+||+|+++.+|+|+++|+.|+++|||.+++|+  +|+..+|.+||++||+|+
T Consensus       160 ~~~~e~~~~~p~~~fY~~la~~~~~~~isvDlF~~~~~~~dla~l~~l~~~TGG~v~~y~~~~~~~~~d~~kl~~dl~~~  239 (244)
T cd01479         160 STDKEKQLLQPQTDFYKKLALECVKSQISVDLFLFSNQYVDVATLGCLSRLTGGQVYYYPSFNFSAPNDVEKLVNELARY  239 (244)
T ss_pred             CchhhhhhcCcchHHHHHHHHHHHHcCeEEEEEEccCcccChhhhhhhhhhcCceEEEECCccCCchhhHHHHHHHHHHH
Confidence            778888999999999999999999999999999999999999999999999999999999  788889999999999999


Q ss_pred             ccccc
Q 001711          662 LTRET  666 (1021)
Q Consensus       662 ltr~~  666 (1021)
                      ++|++
T Consensus       240 ltr~~  244 (244)
T cd01479         240 LTRKI  244 (244)
T ss_pred             hcccC
Confidence            99864


No 9  
>cd01468 trunk_domain trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface. Some members of this family possess a partial MIDAS motif that is a characteristic feature of most vWA domain proteins.
Probab=100.00  E-value=5.9e-50  Score=433.17  Aligned_cols=235  Identities=46%  Similarity=0.848  Sum_probs=224.2

Q ss_pred             CCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCC
Q 001711          425 PMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPL  504 (1021)
Q Consensus       425 p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl  504 (1021)
                      |+||+||||||+|++|+++|++++++++|+++|+.||++++++|||||||++||||++++...+++|+|++|++|+|+|.
T Consensus         1 p~pp~~vFvID~s~~ai~~~~l~~~~~sl~~~l~~lp~~~~~~igiITf~~~V~~~~~~~~~~~~~~~v~~dl~d~f~p~   80 (239)
T cd01468           1 PQPPVFVFVIDVSYEAIKEGLLQALKESLLASLDLLPGDPRARVGLITYDSTVHFYNLSSDLAQPKMYVVSDLKDVFLPL   80 (239)
T ss_pred             CCCCEEEEEEEcchHhccccHHHHHHHHHHHHHHhCCCCCCcEEEEEEeCCeEEEEECCCCCCCCeEEEeCCCccCcCCC
Confidence            68999999999999999999999999999999999997677999999999999999999887779999999999999999


Q ss_pred             CCccceehhhhHHHHHHHHhhCCCcccC--CCCcccchHHHHHHHHHHHHhc--CCEEEEEecCCCCCCcccccccCCcC
Q 001711          505 PDDLLVNLSESRSVVDTLLDSLPSMFQD--NMNVESAFGPALKAAFMVMSRL--GGKLLIFQNSLPSLGVGCLKLRGDDL  580 (1021)
Q Consensus       505 ~~~lLv~l~es~~~I~~lLd~Lp~~f~~--~~~~~~alG~AL~aA~~lL~~~--GGkIivF~sg~Pt~GpG~L~~r~~~~  580 (1021)
                      ++++|++++|+++.|+++|++|+.++..  +++.++|+|+||++|..+|+..  ||||++|++|+||+|||+|+.|++ .
T Consensus        81 ~~~~l~~~~e~~~~i~~~l~~l~~~~~~~~~~~~~~~~G~Al~~A~~ll~~~~~gGkI~~f~sg~pt~GpG~l~~~~~-~  159 (239)
T cd01468          81 PDRFLVPLSECKKVIHDLLEQLPPMFWPVPTHRPERCLGPALQAAFLLLKGTFAGGRIIVFQGGLPTVGPGKLKSRED-K  159 (239)
T ss_pred             cCceeeeHHHHHHHHHHHHHhhhhhccccCCCCCcccHHHHHHHHHHHHhhcCCCceEEEEECCCCCCCCCccccCcc-c
Confidence            9999999999999999999999999987  8899999999999999999998  999999999999999999999854 4


Q ss_pred             cccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHH
Q 001711          581 RVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSR  660 (1021)
Q Consensus       581 r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r  660 (1021)
                      +..++++|+++++++++||++||++|++++|+||+|+++.+++|+++|+.|++.|||.+++|++|+..+|.++|.+||+|
T Consensus       160 ~~~~~~~e~~~~~~a~~fY~~la~~~~~~~isvdlF~~~~~~~dl~~l~~l~~~TGG~v~~y~~f~~~~~~~~~~~~l~r  239 (239)
T cd01468         160 EPIRSHDEAQLLKPATKFYKSLAKECVKSGICVDLFAFSLDYVDVATLKQLAKSTGGQVYLYDSFQAPNDGSKFKQDLQR  239 (239)
T ss_pred             ccCCCccchhcccccHHHHHHHHHHHHHcCeEEEEEeccccccCHHHhhhhhhcCCceEEEeCCCCCcccHHHHHHHhcC
Confidence            56667889999999999999999999999999999999999999999999999999999999999999999999999975


No 10 
>PF04811 Sec23_trunk:  Sec23/Sec24 trunk domain;  InterPro: IPR006896 COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation [].  Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger (IPR006895 from INTERPRO), an alpha/beta trunk domain, an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes the Sec23/24 alpha/beta trunk domain, which is formed from a single, approximately 250-residue segment plugged into the beta-barrel between strands beta-1 and beta-19. The trunk has an alpha/beta fold with a vWA topology, and it forms the dimer interface, primarily involving strand beta-14 on Sec23 and Sec24; in addition, the trunk domain of Sec23 contacts Sar1.; GO: 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EGD_A 2NUP_A 3EG9_A 3EFO_A 3EGX_A 2NUT_A 1PD0_A 1PD1_A 1M2V_B 1PCX_A ....
Probab=100.00  E-value=9e-50  Score=432.72  Aligned_cols=237  Identities=51%  Similarity=0.915  Sum_probs=205.8

Q ss_pred             CCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCC
Q 001711          425 PMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPL  504 (1021)
Q Consensus       425 p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl  504 (1021)
                      |+||+|+||||+|.+|+++|++++++++|+++|+.|+.+++++|||||||++||||+++.+..+++|+|++|+||+|+|.
T Consensus         1 P~pp~y~FvID~s~~av~~g~~~~~~~sl~~~l~~l~~~~~~~vgiitfd~~V~~y~l~~~~~~~~~~v~~dl~~~~~p~   80 (243)
T PF04811_consen    1 PQPPVYVFVIDVSYEAVQSGLLQSLIESLKSALDSLPGDERTRVGIITFDSSVHFYNLSSSLSQPQMIVVSDLDDPFIPL   80 (243)
T ss_dssp             -S--EEEEEEE-SHHHHHHTHHHHHHHHHHHHGCTSSTSTT-EEEEEEESSSEEEEETTTTSSSTEEEEEHHTTSHHSST
T ss_pred             CCCCEEEEEEECchhhhhccHHHHHHHHHHHHHHhccCCCCcEEEEEEeCCEEEEEECCCCcCCCcccchHHHhhcccCC
Confidence            68999999999999999999999999999999999997778999999999999999999988889999999999999999


Q ss_pred             CCccceehhhhHHHHHHHHhhCCCcccCC--CCcccchHHHHHHHHHHHH--hcCCEEEEEecCCCCCCc-ccccccCCc
Q 001711          505 PDDLLVNLSESRSVVDTLLDSLPSMFQDN--MNVESAFGPALKAAFMVMS--RLGGKLLIFQNSLPSLGV-GCLKLRGDD  579 (1021)
Q Consensus       505 ~~~lLv~l~es~~~I~~lLd~Lp~~f~~~--~~~~~alG~AL~aA~~lL~--~~GGkIivF~sg~Pt~Gp-G~L~~r~~~  579 (1021)
                      +++||+++.|+++.|+++|++|+.++..+  +++++|+|+||++|..+|+  ..||||++|++|+||+|+ |+|+.+++ 
T Consensus        81 ~~~llv~~~e~~~~i~~ll~~L~~~~~~~~~~~~~~c~G~Al~~A~~ll~~~~~gGkI~~F~s~~pt~G~Gg~l~~~~~-  159 (243)
T PF04811_consen   81 PDGLLVPLSECRDAIEELLESLPSIFPETAGKRPERCLGSALSAALSLLSSRNTGGKILVFTSGPPTYGPGGSLKKRED-  159 (243)
T ss_dssp             SSSSSEETTTCHHHHHHHHHHHHHHSTT-TTB-----HHHHHHHHHHHHHHHTS-EEEEEEESS---SSSTTSS-SBTT-
T ss_pred             cccEEEEhHHhHHHHHHHHHHhhhhcccccccCccccHHHHHHHHHHHHhccccCCEEEEEeccCCCCCCCceeccccc-
Confidence            99999999999999999999999988887  8899999999999999999  899999999999999999 77777754 


Q ss_pred             CcccCCCcc-ccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHH
Q 001711          580 LRVYGTDKE-HSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHEL  658 (1021)
Q Consensus       580 ~r~~gt~~e-~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL  658 (1021)
                      .+.+++++| ..++.++++||++||++|+++||+||+|+++.+++|+++|+.|++.|||.+++|++|+.++|.++|++||
T Consensus       160 ~~~~~~~~~~~~~~~~~~~fY~~la~~~~~~~isvDlf~~~~~~~~l~tl~~l~~~TGG~l~~y~~f~~~~~~~~l~~dl  239 (243)
T PF04811_consen  160 SSHYDTEKEKALLLPPANEFYKKLAEECSKQGISVDLFVFSSDYVDLATLGPLARYTGGSLYYYPNFNAERDGEKLRQDL  239 (243)
T ss_dssp             SCCCCHCTTHHCHSHSSSHHHHHHHHHHHHCTEEEEEEEECSS--SHHHHTHHHHCTT-EEEEETTTTCHHHHHHHHHHH
T ss_pred             ccccccccchhhhccccchHHHHHHHHHHhcCCEEEEEeecCCCCCcHhHHHHHHhCceeEEEeCCCCCchhHHHHHHHH
Confidence            456666666 6778888999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHhc
Q 001711          659 SRDL  662 (1021)
Q Consensus       659 ~r~l  662 (1021)
                      +|++
T Consensus       240 ~r~~  243 (243)
T PF04811_consen  240 KRLV  243 (243)
T ss_dssp             HHHH
T ss_pred             HHhC
Confidence            9874


No 11 
>cd01478 Sec23-like Sec23-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 23 is very similar to Sec24. The Sec23 and Sec24 
Probab=100.00  E-value=2e-44  Score=394.49  Aligned_cols=225  Identities=20%  Similarity=0.330  Sum_probs=195.5

Q ss_pred             CCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCC---------------CCc
Q 001711          425 PMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSL---------------TQP  489 (1021)
Q Consensus       425 p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~---------------~~p  489 (1021)
                      |.||+|+||||+|.++++   +++++++|+++|+.||++  ++|||||||++||||||+...               ++.
T Consensus         1 p~pp~~vFviDvs~~~~e---l~~l~~sl~~~L~~lP~~--a~VGlITfd~~V~~~~L~~~~~~~~~vf~g~~~~~~~~~   75 (267)
T cd01478           1 TSPPVFLFVVDTCMDEEE---LDALKESLIMSLSLLPPN--ALVGLITFGTMVQVHELGFEECSKSYVFRGNKDYTAKQI   75 (267)
T ss_pred             CCCCEEEEEEECccCHHH---HHHHHHHHHHHHHhCCCC--CEEEEEEECCEEEEEEcCCCcCceeeeccCCccCCHHHH
Confidence            578999999999999998   889999999999999976  899999999999999998541               111


Q ss_pred             -cee------------eccccccccCCCC-CccceehhhhHHHHHHHHhhCCCc---ccCCCCcccchHHHHHHHHHHHH
Q 001711          490 -QMM------------VISDLDDIFVPLP-DDLLVNLSESRSVVDTLLDSLPSM---FQDNMNVESAFGPALKAAFMVMS  552 (1021)
Q Consensus       490 -~ml------------VvsDldd~f~Pl~-~~lLv~l~es~~~I~~lLd~Lp~~---f~~~~~~~~alG~AL~aA~~lL~  552 (1021)
                       +|+            +.+|++|.|+|.+ ++||++++||++.|+++||+|+.+   +.+++++++|+|+||++|..+|+
T Consensus        76 ~~~l~~~~~~~~~~~~~~~~~~~~~~p~~~~~flvpl~e~~~~i~~lLe~L~~~~~~~~~~~r~~r~~G~Al~~A~~ll~  155 (267)
T cd01478          76 QDMLGLGGPAMRPSASQHPGAGNPLPSAAASRFLLPVSQCEFTLTDLLEQLQPDPWPVPAGHRPLRCTGVALSIAVGLLE  155 (267)
T ss_pred             HHHhccccccccccccCcCCccccccccccccEEEEHHHHHHHHHHHHHhCcccccccCCCCCCCCchHHHHHHHHHHHH
Confidence             222            2245788999876 699999999999999999999875   46678899999999999999998


Q ss_pred             ----hcCCEEEEEecCCCCCCcccccccC--CcCcccC-CCcc-ccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcC
Q 001711          553 ----RLGGKLLIFQNSLPSLGVGCLKLRG--DDLRVYG-TDKE-HSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTD  624 (1021)
Q Consensus       553 ----~~GGkIivF~sg~Pt~GpG~L~~r~--~~~r~~g-t~~e-~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~d  624 (1021)
                          .+||||++|++|+||+|||+|+.|+  +..|.+. .+++ .++++++++||++||.+|+++||+||+|+++.+|+|
T Consensus       156 ~~~~~~gGki~~F~sg~pT~GpG~l~~r~~~~~~r~~~d~~~~~~~~~~~a~~fY~~la~~~~~~~vsvDlF~~s~d~vg  235 (267)
T cd01478         156 ACFPNTGARIMLFAGGPCTVGPGAVVSTELKDPIRSHHDIDKDNAKYYKKAVKFYDSLAKRLAANGHAVDIFAGCLDQVG  235 (267)
T ss_pred             hhcCCCCcEEEEEECCCCCCCCceeeccccccccccccccccchhhhhhhHHHHHHHHHHHHHhCCeEEEEEeccccccC
Confidence                5899999999999999999999885  3455544 4444 469999999999999999999999999999999999


Q ss_pred             hhhhhhhccccccEEEEeCCCCCchhHHHH
Q 001711          625 IASLGTLAKYTGGQVYYYPSFQSTTHGERL  654 (1021)
Q Consensus       625 iatl~~L~~~TGG~v~~y~~F~~~~d~~kl  654 (1021)
                      |++|+.|++.|||.+|+|++|+.+.+.+.|
T Consensus       236 laem~~l~~~TGG~v~~~~~f~~~~f~~s~  265 (267)
T cd01478         236 LLEMKVLVNSTGGHVVLSDSFTTSIFKQSF  265 (267)
T ss_pred             HHHHHHHHHhcCcEEEEeCCcchHHHHHHh
Confidence            999999999999999999999886544443


No 12 
>PF04815 Sec23_helical:  Sec23/Sec24 helical domain;  InterPro: IPR006900 COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation [].  Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger (IPR006895 from INTERPRO), an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region, and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes the all-helical domain, which forms an approximately 105-residue segment with the C-terminal 30 residues. The linker between alpha-M and alpha-N contacts Sar1.; GO: 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EGD_B 2NUP_B 2NUT_B 3EGX_B 3EH2_C 3EH1_A 3EFO_B 3EG9_B 2QTV_A 1M2O_C ....
Probab=99.86  E-value=2.1e-21  Score=183.76  Aligned_cols=103  Identities=41%  Similarity=0.650  Sum_probs=96.9

Q ss_pred             HhHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCccccccccccHHHHHHHhhhccCCCCCCC
Q 001711          763 TGAIVSVFSRLAIEKTLSHKLEDARNAVQLRLVKALKEYRNLYAVQHRLGSRMIYPESLKFLPLYCLAICKSTPIRGGYA  842 (1021)
Q Consensus       763 ~eai~~~laK~a~~~~l~~~l~d~R~~l~~~lv~iL~~YRk~~~a~~~~~~qLiLPesLklLPlyil~LlKS~~Lr~g~~  842 (1021)
                      |||++++++|++++++.+++++|+|+.++++|+++|++||+ +|+..++++||+|||+||+||+|+++|+||++|++  .
T Consensus         1 Qda~~~llak~ai~~~~~~~l~~~r~~l~~~~v~il~~Yr~-~~~~~~~~~qLilPe~lklLPly~l~llKs~alr~--~   77 (103)
T PF04815_consen    1 QDAITSLLAKQAIDKALSSSLKDARESLDNRLVDILAAYRK-NCASSSSSGQLILPESLKLLPLYILALLKSPALRP--T   77 (103)
T ss_dssp             HHHHHHHHHHHHHHHHCCS-HHHHHHHHHHHHHHHHHHHHH-HCTTECCCTEEEEEGGGTTHHHHHHHHHTSTTTSC--S
T ss_pred             CHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHh-hccCCCCchhhhCCHHHHHHHHHHHHHHcchhhcC--C
Confidence            79999999999999999999999999999999999999999 99998888999999999999999999999999996  7


Q ss_pred             CCChhHHHHHHHHHcCCCHHHHHhhh
Q 001711          843 DVTLDERCAAGYTMMALPVKKLLKLL  868 (1021)
Q Consensus       843 ~~s~DeR~~~~~~l~s~~v~~~~~~l  868 (1021)
                      ++++|||+|+++++++++++.++.||
T Consensus        78 ~v~~D~R~~~~~~~~~~~~~~~~~~i  103 (103)
T PF04815_consen   78 NVSPDERAYAMHLLLSMPVDSLLRMI  103 (103)
T ss_dssp             TS-HHHHHHHHHHHHHS-HHHHHHHH
T ss_pred             CCCCcHHHHHHHHHHCCCHHHHHhhC
Confidence            99999999999999999999999875


No 13 
>PF08033 Sec23_BS:  Sec23/Sec24 beta-sandwich domain;  InterPro: IPR012990 COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation [].  Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger (IPR006895 from INTERPRO), an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes part of the Sec23/24 beta-barrel domain, which is formed from approximately 180 residues from three segments of the polypeptide. The strands of the barrel are oriented roughly parallel to the membrane such that one end of the barrel forms part of the inner surface of the coat and the other end part of the membrane-distal surface. The barrel is constructed from two opposed sheets: a six-stranded beta sheet facing partly towards the zinc finger domain and partly towards the solvent, and a five-stranded beta sheet facing the helical domain.; PDB: 3EFO_B 3EG9_B 1PD0_A 1PD1_A 1M2V_B 1PCX_A 3EH2_C 3EGD_A 2NUP_A 3EGX_A ....
Probab=99.83  E-value=1.7e-20  Score=175.26  Aligned_cols=85  Identities=44%  Similarity=0.742  Sum_probs=77.2

Q ss_pred             ccceEEEEEeCCCeEEEeeecCcccCC---------CCc--eeeccCCCCCcEEEEEEeccccCCCceeEEEEEEEEEec
Q 001711          667 AWEAVMRIRCGKGVRFTNYHGNFMLRS---------TDL--LALPAVDCDKAYAMQLSLEETLLTTQTVYFQVALLYTAS  735 (1021)
Q Consensus       667 g~~a~mrVR~S~Gl~V~~~~Gnf~~rs---------~~~--~~l~~id~d~Sia~~~~~d~~l~~~~~~~iQ~AllYTt~  735 (1021)
                      ||+|+||||||+||+|++++||+..++         .+.  |.++++++|++|+|+|++++++...+.+|||+|++||+.
T Consensus         1 g~~~~l~vr~S~gl~v~~~~G~~~~~~~~s~~~~g~~~~~~~~~~~l~~~~s~~~~~~~~~~~~~~~~~~iQ~~~~Yt~~   80 (96)
T PF08033_consen    1 GFNAVLRVRCSKGLKVSGVIGPCFNRSSVSDNEIGEGDTTRWKLPSLDPDTSFAFEFEIDEDLPNGSQAYIQFALLYTDS   80 (96)
T ss_dssp             EEEEEEEEEE-TTEEEEEEESSSEESSTBESSECSBSSCSEEEEEEEETT--EEEEEEESSBTBTTSEEEEEEEEEEEET
T ss_pred             CceEEEEEEECCCeEEEEEEcCccccccccceeeccCCccEEEecccCCCCEEEEEEEECCCCCCCCeEEEEEEEEEECC
Confidence            799999999999999999999998766         455  999999999999999999999877899999999999999


Q ss_pred             CCcEEEEEEeeeeccc
Q 001711          736 CGERRIRVHTLAAPVV  751 (1021)
Q Consensus       736 ~GeRrIRV~Tl~lpvt  751 (1021)
                      +|+|||||+|+++++|
T Consensus        81 ~G~r~iRV~T~~l~vt   96 (96)
T PF08033_consen   81 NGERRIRVTTLSLPVT   96 (96)
T ss_dssp             TSEEEEEEEEEEEEEE
T ss_pred             CCCEEEEEEeeccccC
Confidence            9999999999999986


No 14 
>PF04810 zf-Sec23_Sec24:  Sec23/Sec24 zinc finger;  InterPro: IPR006895 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation [].  Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger, an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes an approximately 55-residue Sec23/24 zinc-binding domain, which lies against the beta-barrel at the periphery of the complex. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EFO_B 3EG9_B 3EGD_A 2YRC_A 2NUP_A 2YRD_A 3EGX_A 2NUT_A 3EH1_A 1PD0_A ....
Probab=99.19  E-value=6e-12  Score=98.55  Aligned_cols=35  Identities=43%  Similarity=1.091  Sum_probs=26.9

Q ss_pred             CccceEEccceeEecCCceEEEcCCCCCCCCCccc
Q 001711          354 FICRTYVNPYVTFTDAGRKWRCNICALLNDVPGDY  388 (1021)
Q Consensus       354 ~rCrAYiNPf~~f~~~g~~W~Cn~C~~~N~vP~~Y  388 (1021)
                      ++|+||||||++|+++|++|+|+||++.|++|.+|
T Consensus         6 ~~C~aylNp~~~~~~~~~~w~C~~C~~~N~lp~~Y   40 (40)
T PF04810_consen    6 RRCRAYLNPFCQFDDGGKTWICNFCGTKNPLPPHY   40 (40)
T ss_dssp             TTT--BS-TTSEEETTTTEEEETTT--EEE--GGG
T ss_pred             CCCCCEECCcceEcCCCCEEECcCCCCcCCCCCCC
Confidence            68999999999999999999999999999999887


No 15 
>PRK13685 hypothetical protein; Provisional
Probab=98.76  E-value=3.7e-07  Score=103.96  Aligned_cols=174  Identities=20%  Similarity=0.282  Sum_probs=122.0

Q ss_pred             CCeEEEEEecchhHHhh----cHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccC
Q 001711          427 PPLYFFLIDVSISAIRS----GMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFV  502 (1021)
Q Consensus       427 pp~yvFvIDvS~~av~s----G~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~  502 (1021)
                      .-.+|||||+|.++-..    ..++.+++.++..|+.+.++  .+||+|+|++..++.                     .
T Consensus        88 ~~~vvlvlD~S~SM~~~D~~p~RL~~ak~~~~~~l~~l~~~--d~vglv~Fa~~a~~~---------------------~  144 (326)
T PRK13685         88 RAVVMLVIDVSQSMRATDVEPNRLAAAQEAAKQFADELTPG--INLGLIAFAGTATVL---------------------V  144 (326)
T ss_pred             CceEEEEEECCccccCCCCCCCHHHHHHHHHHHHHHhCCCC--CeEEEEEEcCceeec---------------------C
Confidence            34689999999998532    46889999999999998654  689999999765421                     0


Q ss_pred             CCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-----------CCEEEEEecCCCCCCcc
Q 001711          503 PLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-----------GGKLLIFQNSLPSLGVG  571 (1021)
Q Consensus       503 Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-----------GGkIivF~sg~Pt~GpG  571 (1021)
                      |        +.+.++.+.+.|+.|..      ...+++|.||..|++.++..           .++|+++++|.-|.|..
T Consensus       145 p--------~t~d~~~l~~~l~~l~~------~~~T~~g~al~~A~~~l~~~~~~~~~~~~~~~~~IILlTDG~~~~~~~  210 (326)
T PRK13685        145 S--------PTTNREATKNAIDKLQL------ADRTATGEAIFTALQAIATVGAVIGGGDTPPPARIVLMSDGKETVPTN  210 (326)
T ss_pred             C--------CCCCHHHHHHHHHhCCC------CCCcchHHHHHHHHHHHHhhhcccccccCCCCCEEEEEcCCCCCCCCC
Confidence            1        22456778888888853      24577899999999888631           36799999987665421


Q ss_pred             cccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCC-------------CcChhhhhhhccccccE
Q 001711          572 CLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDK-------------YTDIASLGTLAKYTGGQ  638 (1021)
Q Consensus       572 ~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~-------------~~diatl~~L~~~TGG~  638 (1021)
                      ..    +               +...  .+.+..+.+.||.|.++.++.+             ..|-..|..+++.|||+
T Consensus       211 ~~----~---------------~~~~--~~aa~~a~~~gi~i~~Ig~G~~~g~~~~~g~~~~~~~d~~~L~~iA~~tgG~  269 (326)
T PRK13685        211 PD----N---------------PRGA--YTAARTAKDQGVPISTISFGTPYGSVEINGQRQPVPVDDESLKKIAQLSGGE  269 (326)
T ss_pred             CC----C---------------cccH--HHHHHHHHHcCCeEEEEEECCCCCCcCcCCceeeecCCHHHHHHHHHhcCCE
Confidence            10    0               0001  2456777889999999998864             26778999999999998


Q ss_pred             EEEeCCCCCchhHHHHHHHHHHh
Q 001711          639 VYYYPSFQSTTHGERLRHELSRD  661 (1021)
Q Consensus       639 v~~y~~F~~~~d~~kl~~dL~r~  661 (1021)
                      .|+..+   ..+-++.+.++.+.
T Consensus       270 ~~~~~~---~~~L~~if~~I~~~  289 (326)
T PRK13685        270 FYTAAS---LEELRAVYATLQQQ  289 (326)
T ss_pred             EEEcCC---HHHHHHHHHHHHHH
Confidence            887654   22334455555443


No 16 
>cd01453 vWA_transcription_factor_IIH_type Transcription factors IIH type: TFIIH is a multiprotein complex that is one of the five general transcription factors that binds RNA polymerase II holoenzyme. Orthologues of these genes are found in all completed eukaryotic genomes and all these proteins contain a VWA domain. The p44 subunit of TFIIH functions as a DNA helicase in RNA polymerase II transcription initiation and DNA repair, and its transcriptional activity is dependent on its C-terminal Zn-binding domains. The function of the vWA domain is unclear, but may be involved in complex assembly. The MIDAS motif is not conserved in this sub-group.
Probab=98.70  E-value=5.7e-07  Score=94.10  Aligned_cols=163  Identities=20%  Similarity=0.198  Sum_probs=109.2

Q ss_pred             eEEEEEecchhHHhh----cHHHHHHHHHHHHHhcCC-CCCCceEEEEEE-cCeEEEEecCCCCCCcceeeccccccccC
Q 001711          429 LYFFLIDVSISAIRS----GMLEVVAQTIKSCLDELP-GFPRTQIGFITF-DSTIHFYNMKSSLTQPQMMVISDLDDIFV  502 (1021)
Q Consensus       429 ~yvFvIDvS~~av~s----G~l~~~~~sI~~~L~~Lp-~~~rt~VgiITF-ds~V~fynl~~~~~~p~mlVvsDldd~f~  502 (1021)
                      -.+|+||+|.++.++    ..++.+++.+...++.+. .++..+||||+| ++.-|+.                     +
T Consensus         5 ~ivi~lD~S~SM~a~D~~ptRl~~ak~~~~~fi~~~~~~~~~~~vglv~f~~~~a~~~---------------------~   63 (183)
T cd01453           5 HLIIVIDCSRSMEEQDLKPSRLAVVLKLLELFIEEFFDQNPISQLGIISIKNGRAEKL---------------------T   63 (183)
T ss_pred             EEEEEEECcHHHhcCCCCchHHHHHHHHHHHHHHHHhhcCccccEEEEEEcCCccEEE---------------------E
Confidence            368999999998643    368888998888887642 234478999999 5543321                     1


Q ss_pred             CCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc----CCEEEEEecCCCCCCcccccccCC
Q 001711          503 PLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL----GGKLLIFQNSLPSLGVGCLKLRGD  578 (1021)
Q Consensus       503 Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~----GGkIivF~sg~Pt~GpG~L~~r~~  578 (1021)
                      |+        +...+.+...|+.+  +.   ...+++++.||+.|...|+..    .++|+++.++.-+.++        
T Consensus        64 Pl--------T~D~~~~~~~L~~~--~~---~~G~t~l~~aL~~A~~~l~~~~~~~~~~iiil~sd~~~~~~--------  122 (183)
T cd01453          64 DL--------TGNPRKHIQALKTA--RE---CSGEPSLQNGLEMALESLKHMPSHGSREVLIIFSSLSTCDP--------  122 (183)
T ss_pred             CC--------CCCHHHHHHHhhcc--cC---CCCchhHHHHHHHHHHHHhcCCccCceEEEEEEcCCCcCCh--------
Confidence            22        12223444455554  11   234589999999999999752    3568888774211100        


Q ss_pred             cCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHH
Q 001711          579 DLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHEL  658 (1021)
Q Consensus       579 ~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL  658 (1021)
                                        .-+.++++++.+.+|.|++..++.   ++..|..+|+.|||+.|.-.      |.+.|...+
T Consensus       123 ------------------~~~~~~~~~l~~~~I~v~~IgiG~---~~~~L~~ia~~tgG~~~~~~------~~~~l~~~~  175 (183)
T cd01453         123 ------------------GNIYETIDKLKKENIRVSVIGLSA---EMHICKEICKATNGTYKVIL------DETHLKELL  175 (183)
T ss_pred             ------------------hhHHHHHHHHHHcCcEEEEEEech---HHHHHHHHHHHhCCeeEeeC------CHHHHHHHH
Confidence                              112567888999999999999974   56789999999999998754      345565555


Q ss_pred             HH
Q 001711          659 SR  660 (1021)
Q Consensus       659 ~r  660 (1021)
                      .+
T Consensus       176 ~~  177 (183)
T cd01453         176 LE  177 (183)
T ss_pred             Hh
Confidence            44


No 17 
>cd01467 vWA_BatA_type VWA BatA type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses. In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=98.50  E-value=3.5e-06  Score=86.93  Aligned_cols=154  Identities=18%  Similarity=0.244  Sum_probs=104.1

Q ss_pred             eEEEEEecchhHHhh-----cHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCC
Q 001711          429 LYFFLIDVSISAIRS-----GMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVP  503 (1021)
Q Consensus       429 ~yvFvIDvS~~av~s-----G~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~P  503 (1021)
                      -++||||+|.++-..     ..++.+++.+...+...+   +.+||+|+|++.++..                     +|
T Consensus         4 ~vv~vlD~S~SM~~~~~~~~~r~~~a~~~~~~~~~~~~---~~~v~lv~f~~~~~~~---------------------~~   59 (180)
T cd01467           4 DIMIALDVSGSMLAQDFVKPSRLEAAKEVLSDFIDRRE---NDRIGLVVFAGAAFTQ---------------------AP   59 (180)
T ss_pred             eEEEEEECCcccccccCCCCCHHHHHHHHHHHHHHhCC---CCeEEEEEEcCCeeec---------------------cC
Confidence            478999999987422     135667777777666544   3689999998765421                     01


Q ss_pred             CCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---CCEEEEEecCCCCCCcccccccCCcC
Q 001711          504 LPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---GGKLLIFQNSLPSLGVGCLKLRGDDL  580 (1021)
Q Consensus       504 l~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---GGkIivF~sg~Pt~GpG~L~~r~~~~  580 (1021)
                              +...+..+.++|+.|....   ...++.++.||..|...+...   ...|++++.|.++.|.  .       
T Consensus        60 --------~~~~~~~~~~~l~~l~~~~---~~g~T~l~~al~~a~~~l~~~~~~~~~iiliTDG~~~~g~--~-------  119 (180)
T cd01467          60 --------LTLDRESLKELLEDIKIGL---AGQGTAIGDAIGLAIKRLKNSEAKERVIVLLTDGENNAGE--I-------  119 (180)
T ss_pred             --------CCccHHHHHHHHHHhhhcc---cCCCCcHHHHHHHHHHHHHhcCCCCCEEEEEeCCCCCCCC--C-------
Confidence                    1123445566666665211   234577999999999998653   2458888887655431  0       


Q ss_pred             cccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCC----------CcChhhhhhhccccccEEEEeC
Q 001711          581 RVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDK----------YTDIASLGTLAKYTGGQVYYYP  643 (1021)
Q Consensus       581 r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~----------~~diatl~~L~~~TGG~v~~y~  643 (1021)
                                       ...+.+..+.+.||.|+.+.+...          ..|...|..|++.|||.+|+..
T Consensus       120 -----------------~~~~~~~~~~~~gi~i~~i~ig~~~~~~~~~~~~~~~~~~l~~la~~tgG~~~~~~  175 (180)
T cd01467         120 -----------------DPATAAELAKNKGVRIYTIGVGKSGSGPKPDGSTILDEDSLVEIADKTGGRIFRAL  175 (180)
T ss_pred             -----------------CHHHHHHHHHHCCCEEEEEEecCCCCCcCCCCcccCCHHHHHHHHHhcCCEEEEec
Confidence                             012334556678999999998862          4788889999999999999865


No 18 
>cd01466 vWA_C3HC4_type VWA C3HC4-type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses  In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, 
Probab=98.50  E-value=1.8e-06  Score=87.60  Aligned_cols=147  Identities=17%  Similarity=0.268  Sum_probs=104.3

Q ss_pred             EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCccc
Q 001711          430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDLL  509 (1021)
Q Consensus       430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~lL  509 (1021)
                      .+||||+|.++-. .-++.+.++|+..++.|+++  .+||||+|++..+.+-                  .+.+.     
T Consensus         3 v~~vlD~S~SM~~-~rl~~ak~a~~~l~~~l~~~--~~~~li~F~~~~~~~~------------------~~~~~-----   56 (155)
T cd01466           3 LVAVLDVSGSMAG-DKLQLVKHALRFVISSLGDA--DRLSIVTFSTSAKRLS------------------PLRRM-----   56 (155)
T ss_pred             EEEEEECCCCCCc-HHHHHHHHHHHHHHHhCCCc--ceEEEEEecCCccccC------------------CCccc-----
Confidence            5799999998743 24777889999999998865  6899999998754320                  00000     


Q ss_pred             eehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-----CCEEEEEecCCCCCCcccccccCCcCcccC
Q 001711          510 VNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-----GGKLLIFQNSLPSLGVGCLKLRGDDLRVYG  584 (1021)
Q Consensus       510 v~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-----GGkIivF~sg~Pt~GpG~L~~r~~~~r~~g  584 (1021)
                        -.+.++.+.++|+.+.      ....++++.||+.|..+++..     ...|++++.|.++.|..             
T Consensus        57 --~~~~~~~~~~~i~~~~------~~g~T~~~~al~~a~~~~~~~~~~~~~~~iillTDG~~~~~~~-------------  115 (155)
T cd01466          57 --TAKGKRSAKRVVDGLQ------AGGGTNVVGGLKKALKVLGDRRQKNPVASIMLLSDGQDNHGAV-------------  115 (155)
T ss_pred             --CHHHHHHHHHHHHhcc------CCCCccHHHHHHHHHHHHhhcccCCCceEEEEEcCCCCCcchh-------------
Confidence              0134566777777763      245689999999999998643     25788888888765500             


Q ss_pred             CCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEe
Q 001711          585 TDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYY  642 (1021)
Q Consensus       585 t~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y  642 (1021)
                                        ..++.+.+|.|..+.++. ..|..+|..|+..|||+.||.
T Consensus       116 ------------------~~~~~~~~v~v~~igig~-~~~~~~l~~iA~~t~G~~~~~  154 (155)
T cd01466         116 ------------------VLRADNAPIPIHTFGLGA-SHDPALLAFIAEITGGTFSYV  154 (155)
T ss_pred             ------------------hhcccCCCceEEEEecCC-CCCHHHHHHHHhccCceEEEe
Confidence                              001224678888888764 468899999999999999874


No 19 
>cd01465 vWA_subgroup VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses  In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if n
Probab=98.50  E-value=3.5e-06  Score=85.85  Aligned_cols=155  Identities=17%  Similarity=0.240  Sum_probs=110.6

Q ss_pred             EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCccc
Q 001711          430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDLL  509 (1021)
Q Consensus       430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~lL  509 (1021)
                      ++||||+|.++-... ++.+++++...+..+..+  .+|++|+|++..+.+-              +.    .  .    
T Consensus         3 ~~~vlD~S~SM~~~~-~~~~k~a~~~~~~~l~~~--~~v~li~f~~~~~~~~--------------~~----~--~----   55 (170)
T cd01465           3 LVFVIDRSGSMDGPK-LPLVKSALKLLVDQLRPD--DRLAIVTYDGAAETVL--------------PA----T--P----   55 (170)
T ss_pred             EEEEEECCCCCCChh-HHHHHHHHHHHHHhCCCC--CEEEEEEecCCccEEe--------------cC----c--c----
Confidence            789999999885433 778888999999988754  6899999997644320              00    0  0    


Q ss_pred             eehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---C--CEEEEEecCCCCCCcccccccCCcCcccC
Q 001711          510 VNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---G--GKLLIFQNSLPSLGVGCLKLRGDDLRVYG  584 (1021)
Q Consensus       510 v~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---G--GkIivF~sg~Pt~GpG~L~~r~~~~r~~g  584 (1021)
                         ...++.+...|+.+..      ...+.++.||+.|+..++..   +  .+|++|+.|.++.|...            
T Consensus        56 ---~~~~~~l~~~l~~~~~------~g~T~~~~al~~a~~~~~~~~~~~~~~~ivl~TDG~~~~~~~~------------  114 (170)
T cd01465          56 ---VRDKAAILAAIDRLTA------GGSTAGGAGIQLGYQEAQKHFVPGGVNRILLATDGDFNVGETD------------  114 (170)
T ss_pred             ---cchHHHHHHHHHcCCC------CCCCCHHHHHHHHHHHHHhhcCCCCeeEEEEEeCCCCCCCCCC------------
Confidence               0123445555665541      34567899999999988652   2  57999999988765311            


Q ss_pred             CCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCC
Q 001711          585 TDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPS  644 (1021)
Q Consensus       585 t~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~  644 (1021)
                                 .+-+++....+.+.+|.|+++.++ ...|...|..+++.++|..++-++
T Consensus       115 -----------~~~~~~~~~~~~~~~v~i~~i~~g-~~~~~~~l~~ia~~~~g~~~~~~~  162 (170)
T cd01465         115 -----------PDELARLVAQKRESGITLSTLGFG-DNYNEDLMEAIADAGNGNTAYIDN  162 (170)
T ss_pred             -----------HHHHHHHHHHhhcCCeEEEEEEeC-CCcCHHHHHHHHhcCCceEEEeCC
Confidence                       122345555667889999999998 678999999999999999887654


No 20 
>cd01463 vWA_VGCC_like VWA Voltage gated Calcium channel like: Voltage-gated calcium channels are a complex of five proteins: alpha 1, beta 1, gamma, alpha 2 and delta. The alpha 2 and delta subunits result from proteolytic processing of a single gene product and carries at its N-terminus the VWA and cache domains, The alpha 2 delta gene family has orthologues in D. melanogaster and C. elegans but none have been detected in aither A. thaliana or yeast. The exact biochemical function of the VWA domain  is not known but the alpha 2 delta complex has been shown to regulate various functional properties of the channel complex.
Probab=98.49  E-value=5e-06  Score=87.17  Aligned_cols=163  Identities=21%  Similarity=0.250  Sum_probs=107.0

Q ss_pred             CCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEe-cCCCCCCcceeeccccccccCCC
Q 001711          426 MPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYN-MKSSLTQPQMMVISDLDDIFVPL  504 (1021)
Q Consensus       426 ~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fyn-l~~~~~~p~mlVvsDldd~f~Pl  504 (1021)
                      .|-..+||||+|.++-.+ -++.++++++..|+.|+++  .+||||+|++.++.+- +..                    
T Consensus        12 ~p~~vv~llD~SgSM~~~-~l~~ak~~~~~ll~~l~~~--d~v~lv~F~~~~~~~~~~~~--------------------   68 (190)
T cd01463          12 SPKDIVILLDVSGSMTGQ-RLHLAKQTVSSILDTLSDN--DFFNIITFSNEVNPVVPCFN--------------------   68 (190)
T ss_pred             CCceEEEEEECCCCCCcH-HHHHHHHHHHHHHHhCCCC--CEEEEEEeCCCeeEEeeecc--------------------
Confidence            456789999999988543 4678899999999999765  7899999999877431 100                    


Q ss_pred             CCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh---c---------CCEEEEEecCCCCCCccc
Q 001711          505 PDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR---L---------GGKLLIFQNSLPSLGVGC  572 (1021)
Q Consensus       505 ~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~---~---------GGkIivF~sg~Pt~GpG~  572 (1021)
                       ..++....+.++.+...|+.|..      ...+.++.||+.|+..|+.   .         ...|++++.|.++.+.  
T Consensus        69 -~~~~~~~~~~~~~~~~~l~~l~~------~G~T~~~~al~~a~~~l~~~~~~~~~~~~~~~~~~iillTDG~~~~~~--  139 (190)
T cd01463          69 -DTLVQATTSNKKVLKEALDMLEA------KGIANYTKALEFAFSLLLKNLQSNHSGSRSQCNQAIMLITDGVPENYK--  139 (190)
T ss_pred             -cceEecCHHHHHHHHHHHhhCCC------CCcchHHHHHHHHHHHHHHhhhcccccccCCceeEEEEEeCCCCCcHh--
Confidence             11111122445556666666652      3357899999999998875   1         1358888888765311  


Q ss_pred             ccccCCcCcccCCCccccCCCCCcHHHHHHHH-HHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCC
Q 001711          573 LKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAA-DLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPS  644 (1021)
Q Consensus       573 L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~-~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~  644 (1021)
                                              ..+.++.. ...+.+|.|..|.++.+..|...|..|+..+||..++.++
T Consensus       140 ------------------------~~~~~~~~~~~~~~~v~i~tigiG~~~~d~~~L~~lA~~~~G~~~~i~~  188 (190)
T cd01463         140 ------------------------EIFDKYNWDKNSEIPVRVFTYLIGREVTDRREIQWMACENKGYYSHIQS  188 (190)
T ss_pred             ------------------------HHHHHhcccccCCCcEEEEEEecCCccccchHHHHHHhhcCCeEEEccc
Confidence                                    01111110 1112245566666666656889999999999999998764


No 21 
>cd01451 vWA_Magnesium_chelatase Magnesium chelatase: Mg-chelatase catalyses the insertion of Mg into protoporphyrin IX (Proto). In chlorophyll biosynthesis, insertion of Mg2+ into protoporphyrin IX is catalysed by magnesium chelatase in an ATP-dependent reaction. Magnesium chelatase is a three sub-unit (BchI, BchD and BchH) enzyme with a novel arrangement of domains: the C-terminal helical domain is located behind the nucleotide binding site. The BchD domain contains a AAA domain at its N-terminus and a VWA domain at its C-terminus. The VWA domain has been speculated to be involved in mediating protein-protein interactions.
Probab=98.48  E-value=4.1e-06  Score=86.96  Aligned_cols=160  Identities=19%  Similarity=0.246  Sum_probs=109.6

Q ss_pred             eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCe-EEEEecCCCCCCcceeeccccccccCCCCCc
Q 001711          429 LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDST-IHFYNMKSSLTQPQMMVISDLDDIFVPLPDD  507 (1021)
Q Consensus       429 ~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~-V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~  507 (1021)
                      .++||||+|.++-...-++.+++++...+..+.. ++.+||||+|++. .++.                     +|    
T Consensus         2 ~v~lvlD~SgSM~~~~rl~~ak~a~~~~~~~~~~-~~d~v~lv~F~~~~~~~~---------------------~~----   55 (178)
T cd01451           2 LVIFVVDASGSMAARHRMAAAKGAVLSLLRDAYQ-RRDKVALIAFRGTEAEVL---------------------LP----   55 (178)
T ss_pred             eEEEEEECCccCCCccHHHHHHHHHHHHHHHhhc-CCCEEEEEEECCCCceEE---------------------eC----
Confidence            3689999999885432577788888887765322 2378999999864 2211                     01    


Q ss_pred             cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHH-h---cC--CEEEEEecCCCCCCcccccccCCcCc
Q 001711          508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMS-R---LG--GKLLIFQNSLPSLGVGCLKLRGDDLR  581 (1021)
Q Consensus       508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~-~---~G--GkIivF~sg~Pt~GpG~L~~r~~~~r  581 (1021)
                          +...++.+...|+.++.      ...+.++.||..|...++ .   .+  ..|++++.|.++.|...         
T Consensus        56 ----~t~~~~~~~~~l~~l~~------~G~T~l~~aL~~a~~~l~~~~~~~~~~~~ivliTDG~~~~g~~~---------  116 (178)
T cd01451          56 ----PTRSVELAKRRLARLPT------GGGTPLAAGLLAAYELAAEQARDPGQRPLIVVITDGRANVGPDP---------  116 (178)
T ss_pred             ----CCCCHHHHHHHHHhCCC------CCCCcHHHHHHHHHHHHHHHhcCCCCceEEEEECCCCCCCCCCc---------
Confidence                11223344556666642      456789999999999982 1   12  46888888877765210         


Q ss_pred             ccCCCccccCCCCCcHHH-HHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCC
Q 001711          582 VYGTDKEHSLRIPEDPFY-KQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQS  647 (1021)
Q Consensus       582 ~~gt~~e~~l~~pa~~fY-~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~  647 (1021)
                                    ...- .+++.++.+.+|.|.++.+...+.|-..|..|++.|||+.|+.++.+.
T Consensus       117 --------------~~~~~~~~~~~l~~~gi~v~~I~~~~~~~~~~~l~~iA~~tgG~~~~~~d~~~  169 (178)
T cd01451         117 --------------TADRALAAARKLRARGISALVIDTEGRPVRRGLAKDLARALGGQYVRLPDLSA  169 (178)
T ss_pred             --------------hhHHHHHHHHHHHhcCCcEEEEeCCCCccCccHHHHHHHHcCCeEEEcCcCCH
Confidence                          0111 567788889999887776666667888899999999999999887543


No 22 
>cd01456 vWA_ywmD_type VWA ywmD type:Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses  In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if 
Probab=98.46  E-value=3e-06  Score=90.03  Aligned_cols=174  Identities=22%  Similarity=0.228  Sum_probs=111.3

Q ss_pred             CCCCCCeEEEEEecchhHHh-----hcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeecccc
Q 001711          423 RPPMPPLYFFLIDVSISAIR-----SGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDL  497 (1021)
Q Consensus       423 r~p~pp~yvFvIDvS~~av~-----sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDl  497 (1021)
                      ....+..++||||+|.++..     ...++.+++++...|+.++++  .+|||++|++.++-.   ..   .. .+++  
T Consensus        16 ~~~~~~~vv~vlD~SgSM~~~~~~~~~rl~~ak~a~~~~l~~l~~~--~~v~lv~F~~~~~~~---~~---~~-~~~p--   84 (206)
T cd01456          16 EPQLPPNVAIVLDNSGSMREVDGGGETRLDNAKAALDETANALPDG--TRLGLWTFSGDGDNP---LD---VR-VLVP--   84 (206)
T ss_pred             ccCCCCcEEEEEeCCCCCcCCCCCcchHHHHHHHHHHHHHHhCCCC--ceEEEEEecCCCCCC---cc---cc-cccc--
Confidence            34567789999999999862     135888999999999998755  789999999854210   00   00 0000  


Q ss_pred             ccccCCCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcC-CEEEEEecCCCCCCccccccc
Q 001711          498 DDIFVPLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLG-GKLLIFQNSLPSLGVGCLKLR  576 (1021)
Q Consensus       498 dd~f~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~G-GkIivF~sg~Pt~GpG~L~~r  576 (1021)
                         ..+.....--.....++.+.+.|+.|..     ...++.++.||+.|...++... ..||+++.|..+.|...+   
T Consensus        85 ---~~~~~~~~~~~~~~~~~~l~~~i~~i~~-----~~G~T~l~~aL~~a~~~l~~~~~~~iillTDG~~~~~~~~~---  153 (206)
T cd01456          85 ---KGCLTAPVNGFPSAQRSALDAALNSLQT-----PTGWTPLAAALAEAAAYVDPGRVNVVVLITDGEDTCGPDPC---  153 (206)
T ss_pred             ---ccccccccCCCCcccHHHHHHHHHhhcC-----CCCcChHHHHHHHHHHHhCCCCcceEEEEcCCCccCCCCHH---
Confidence               0011000000001356677777888751     2456889999999999996222 578888888766542000   


Q ss_pred             CCcCcccCCCccccCCCCCcHHHHHHHHHH-hhCCcEEEEEEecCCCcChhhhhhhccccccEE
Q 001711          577 GDDLRVYGTDKEHSLRIPEDPFYKQMAADL-TKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQV  639 (1021)
Q Consensus       577 ~~~~r~~gt~~e~~l~~pa~~fY~~La~~~-~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v  639 (1021)
                                          +..++++.+. .+.+|.|+++.++.+ .|...|..|++.|||..
T Consensus       154 --------------------~~~~~~~~~~~~~~~i~i~~igiG~~-~~~~~l~~iA~~tgG~~  196 (206)
T cd01456         154 --------------------EVARELAKRRTPAPPIKVNVIDFGGD-ADRAELEAIAEATGGTY  196 (206)
T ss_pred             --------------------HHHHHHHHhcCCCCCceEEEEEecCc-ccHHHHHHHHHhcCCeE
Confidence                                1112222211 225899999999865 67889999999999988


No 23 
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=98.43  E-value=2.5e-05  Score=97.83  Aligned_cols=167  Identities=19%  Similarity=0.260  Sum_probs=109.4

Q ss_pred             CeEEEEEecchhHHhhcHHHHHHHHHHHHHhc-CCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCC
Q 001711          428 PLYFFLIDVSISAIRSGMLEVVAQTIKSCLDE-LPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPD  506 (1021)
Q Consensus       428 p~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~-Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~  506 (1021)
                      ...+||||+|.++-....++.+.++++..|.. ++.+  .+||||+||+..++..              +    +.++.+
T Consensus       305 r~VVLVLDvSGSM~g~dRL~~lkqAA~~fL~~~l~~~--DrVGLVtFsssA~vl~--------------p----Lt~Its  364 (863)
T TIGR00868       305 RIVCLVLDKSGSMTVEDRLKRMNQAAKLFLLQTVEKG--SWVGMVTFDSAAYIKN--------------E----LIQITS  364 (863)
T ss_pred             ceEEEEEECCccccccCHHHHHHHHHHHHHHHhCCCC--CEEEEEEECCceeEee--------------c----cccCCc
Confidence            56899999999985433577777777776654 4433  7999999998765421              0    111111


Q ss_pred             ccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-----CCEEEEEecCCCCCCcccccccCCcCc
Q 001711          507 DLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-----GGKLLIFQNSLPSLGVGCLKLRGDDLR  581 (1021)
Q Consensus       507 ~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-----GGkIivF~sg~Pt~GpG~L~~r~~~~r  581 (1021)
                            ...++.|...|...       ...+++++.||+.|+++|+..     +..||+++.|..+.+            
T Consensus       365 ------~~dr~aL~~~L~~~-------A~GGT~I~~GL~~Alq~L~~~~~~~~~~~IILLTDGedn~~------------  419 (863)
T TIGR00868       365 ------SAERDALTANLPTA-------ASGGTSICSGLKAAFQVIKKSYQSTDGSEIVLLTDGEDNTI------------  419 (863)
T ss_pred             ------HHHHHHHHHhhccc-------cCCCCcHHHHHHHHHHHHHhcccccCCCEEEEEeCCCCCCH------------
Confidence                  12344444333311       245789999999999999763     467777777643210            


Q ss_pred             ccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHHh
Q 001711          582 VYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSRD  661 (1021)
Q Consensus       582 ~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r~  661 (1021)
                                        .+++.++.+.||.|..+.++.+. | ..|..||+.|||..|+..+   ..+...|...|.++
T Consensus       420 ------------------~~~l~~lk~~gVtI~TIg~G~da-d-~~L~~IA~~TGG~~f~asd---~~dl~~L~dAF~~i  476 (863)
T TIGR00868       420 ------------------SSCFEEVKQSGAIIHTIALGPSA-A-KELEELSDMTGGLRFYASD---QADNNGLIDAFGAL  476 (863)
T ss_pred             ------------------HHHHHHHHHcCCEEEEEEeCCCh-H-HHHHHHHHhcCCEEEEeCC---HHHHHHHHHHHHHH
Confidence                              23445567789999999998764 2 4589999999999998864   22334565555554


Q ss_pred             c
Q 001711          662 L  662 (1021)
Q Consensus       662 l  662 (1021)
                      .
T Consensus       477 s  477 (863)
T TIGR00868       477 S  477 (863)
T ss_pred             h
Confidence            3


No 24 
>TIGR03788 marine_srt_targ marine proteobacterial sortase target protein. Members of this protein family are restricted to the Proteobacteria. Each contains a C-terminal sortase-recognition motif, transmembrane domain, and basic residues cluster at the the C-terminus, and is encoded adjacent to a sortase gene. This protein is frequently the only sortase target in its genome, which is as unusual its occurrence in Gram-negative rather than Gram-positive genomes. Many bacteria with this system are marine. In addition to the LPXTG signal, members carry a vault protein inter-alpha-trypsin inhibitor domain (pfam08487) and a von Willebrand factor type A domain (pfam00092).
Probab=98.34  E-value=0.00045  Score=85.22  Aligned_cols=284  Identities=13%  Similarity=0.156  Sum_probs=161.6

Q ss_pred             CCCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCC
Q 001711          424 PPMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVP  503 (1021)
Q Consensus       424 ~p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~P  503 (1021)
                      ...+..++||||+|.++-. .-++.+++++..+|+.|.++  .+|+||+||+.++.+.-..       .          +
T Consensus       268 ~~~p~~vvfvlD~SgSM~g-~~i~~ak~al~~~l~~L~~~--d~~~ii~F~~~~~~~~~~~-------~----------~  327 (596)
T TIGR03788       268 QVLPRELVFVIDTSGSMAG-ESIEQAKSALLLALDQLRPG--DRFNIIQFDSDVTLLFPVP-------V----------P  327 (596)
T ss_pred             cCCCceEEEEEECCCCCCC-ccHHHHHHHHHHHHHhCCCC--CEEEEEEECCcceEecccc-------c----------c
Confidence            3556689999999998843 23677889999999999865  7899999999877542100       0          0


Q ss_pred             CCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-C---CEEEEEecCCCCCCcccccccCCc
Q 001711          504 LPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-G---GKLLIFQNSLPSLGVGCLKLRGDD  579 (1021)
Q Consensus       504 l~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-G---GkIivF~sg~Pt~GpG~L~~r~~~  579 (1021)
                      .       -.+.++.+...|+.|..      ..++.+..||+.|+...... .   -.|+++++|..+          + 
T Consensus       328 ~-------~~~~~~~a~~~i~~l~a------~GgT~l~~aL~~a~~~~~~~~~~~~~~iillTDG~~~----------~-  383 (596)
T TIGR03788       328 A-------TAHNLARARQFVAGLQA------DGGTEMAGALSAALRDDGPESSGALRQVVFLTDGAVG----------N-  383 (596)
T ss_pred             C-------CHHHHHHHHHHHhhCCC------CCCccHHHHHHHHHHhhcccCCCceeEEEEEeCCCCC----------C-
Confidence            0       02334445556666642      35678999999998775332 1   258888887421          0 


Q ss_pred             CcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHH
Q 001711          580 LRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELS  659 (1021)
Q Consensus       580 ~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~  659 (1021)
                              +       ...++.+.  ....++.|..|.++.+ .|-..|..|++.+||..++...  .+...+++.+.+.
T Consensus       384 --------~-------~~~~~~~~--~~~~~~ri~tvGiG~~-~n~~lL~~lA~~g~G~~~~i~~--~~~~~~~~~~~l~  443 (596)
T TIGR03788       384 --------E-------DALFQLIR--TKLGDSRLFTVGIGSA-PNSYFMRKAAQFGRGSFTFIGS--TDEVQRKMSQLFA  443 (596)
T ss_pred             --------H-------HHHHHHHH--HhcCCceEEEEEeCCC-cCHHHHHHHHHcCCCEEEECCC--HHHHHHHHHHHHH
Confidence                    0       11222332  1234567777776654 6778899999999998776543  2222334444444


Q ss_pred             HhcccccccceEEEEEeCCCeEEEeeecCcccCCCCceeeccCCCCCcEEEEEEeccccCCCceeEEEEEEEEEecCCcE
Q 001711          660 RDLTRETAWEAVMRIRCGKGVRFTNYHGNFMLRSTDLLALPAVDCDKAYAMQLSLEETLLTTQTVYFQVALLYTASCGER  739 (1021)
Q Consensus       660 r~ltr~~g~~a~mrVR~S~Gl~V~~~~Gnf~~rs~~~~~l~~id~d~Sia~~~~~d~~l~~~~~~~iQ~AllYTt~~GeR  739 (1021)
                      + +..+..-+..+++....   +..++-         -.++.+-....+.|.-++...   ...+    .+.....++. 
T Consensus       444 ~-~~~p~l~~v~v~~~~~~---~~~v~P---------~~~p~L~~g~~l~v~g~~~~~---~~~i----~v~g~~~~~~-  502 (596)
T TIGR03788       444 K-LEQPALTDIALTFDNGN---AADVYP---------SPIPDLYRGEPLQIAIKLQQA---AGEL----QLTGRTGSQP-  502 (596)
T ss_pred             h-hcCeEEEEEEEEEcCCc---cceecc---------CCCccccCCCEEEEEEEecCC---CCeE----EEEEEcCCce-
Confidence            4 55566666666664322   222221         234556666666666664321   1222    2222322222 


Q ss_pred             EEEEEeeeecccCCHHHHHHhcCHhHHHHHHHHHHHHHHhcCCH-HHHHHHHHHHHHHHHHHHHh
Q 001711          740 RIRVHTLAAPVVSNLSDMYQQADTGAIVSVFSRLAIEKTLSHKL-EDARNAVQLRLVKALKEYRN  803 (1021)
Q Consensus       740 rIRV~Tl~lpvt~~l~~vf~s~D~eai~~~laK~a~~~~l~~~l-~d~R~~l~~~lv~iL~~YRk  803 (1021)
                          .+..+.+...       .+-..+-.+.||+-+..+..... ..-++.+.++++++-.+|+-
T Consensus       503 ----~~~~~~~~~~-------~~~~~l~~lwA~~~I~~L~~~~~~~~~~~~~~~~Ii~Lsl~y~l  556 (596)
T TIGR03788       503 ----WSQQLDLDSA-------APGKGIDKLWARRKIDSLEDSLRYGANEEKVKDQVTALALNHHL  556 (596)
T ss_pred             ----EEEEEecCCC-------CCcchHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHhCC
Confidence                1222333321       13344677788877776653211 01124466677777777765


No 25 
>cd01474 vWA_ATR ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, the causative agent of anthrax. ATR is the cellular receptor for the anthrax protective antigen and facilitates entry of the toxin into cells. The VWA domain in ATR contains the toxin binding site and mediates interaction with protective antigen. The binding is mediated by divalent cations that binds to the MIDAS motif. These proteins are a family of vertebrate ECM receptors expressed by endothelial cells.
Probab=98.32  E-value=2.3e-05  Score=81.85  Aligned_cols=167  Identities=16%  Similarity=0.156  Sum_probs=97.9

Q ss_pred             eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEE-EecCCCCCCcceeeccccccccCCCCCc
Q 001711          429 LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHF-YNMKSSLTQPQMMVISDLDDIFVPLPDD  507 (1021)
Q Consensus       429 ~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~f-ynl~~~~~~p~mlVvsDldd~f~Pl~~~  507 (1021)
                      -.+||||+|.++-. . .....+.++..++.+.. ++.+||||+|++..+. +.+.                        
T Consensus         6 Dvv~llD~SgSm~~-~-~~~~~~~~~~l~~~~~~-~~~rvglv~Fs~~~~~~~~l~------------------------   58 (185)
T cd01474           6 DLYFVLDKSGSVAA-N-WIEIYDFVEQLVDRFNS-PGLRFSFITFSTRATKILPLT------------------------   58 (185)
T ss_pred             eEEEEEeCcCchhh-h-HHHHHHHHHHHHHHcCC-CCcEEEEEEecCCceEEEecc------------------------
Confidence            47999999998743 2 33344667777766532 4589999999876432 1111                        


Q ss_pred             cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHH--hcCC----E-EEEEecCCCCCCcccccccCCcC
Q 001711          508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMS--RLGG----K-LLIFQNSLPSLGVGCLKLRGDDL  580 (1021)
Q Consensus       508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~--~~GG----k-IivF~sg~Pt~GpG~L~~r~~~~  580 (1021)
                            +..+.+.+.|+.|..+..   ...+++|.||+.|...|.  ..||    | |++++.|..+-..+         
T Consensus        59 ------~~~~~~~~~l~~l~~~~~---~g~T~~~~aL~~a~~~l~~~~~~~r~~~~~villTDG~~~~~~~---------  120 (185)
T cd01474          59 ------DDSSAIIKGLEVLKKVTP---SGQTYIHEGLENANEQIFNRNGGGRETVSVIIALTDGQLLLNGH---------  120 (185)
T ss_pred             ------ccHHHHHHHHHHHhccCC---CCCCcHHHHHHHHHHHHHhhccCCCCCCeEEEEEcCCCcCCCCC---------
Confidence                  111123344444443322   357899999999998773  3444    2 67777765431000         


Q ss_pred             cccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEE-EeCCCCCchhHHHHHHHHH
Q 001711          581 RVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVY-YYPSFQSTTHGERLRHELS  659 (1021)
Q Consensus       581 r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~-~y~~F~~~~d~~kl~~dL~  659 (1021)
                                      ..-...+.++.+.||.|..+.+  ...|..+|..++..++ .+| ...+|+.   -..+.++|.
T Consensus       121 ----------------~~~~~~a~~l~~~gv~i~~vgv--~~~~~~~L~~iA~~~~-~~f~~~~~~~~---l~~~~~~~~  178 (185)
T cd01474         121 ----------------KYPEHEAKLSRKLGAIVYCVGV--TDFLKSQLINIADSKE-YVFPVTSGFQA---LSGIIESVV  178 (185)
T ss_pred             ----------------cchHHHHHHHHHcCCEEEEEee--chhhHHHHHHHhCCCC-eeEecCccHHH---HHHHHHHHH
Confidence                            0002335567778886666555  5678899999998774 455 3334432   234455555


Q ss_pred             Hhc
Q 001711          660 RDL  662 (1021)
Q Consensus       660 r~l  662 (1021)
                      +.+
T Consensus       179 ~~~  181 (185)
T cd01474         179 KKA  181 (185)
T ss_pred             Hhh
Confidence            444


No 26 
>PF13519 VWA_2:  von Willebrand factor type A domain; PDB: 3IBS_B 3RAG_B 2X5N_A.
Probab=98.28  E-value=1e-05  Score=81.70  Aligned_cols=151  Identities=17%  Similarity=0.235  Sum_probs=101.0

Q ss_pred             EEEEEecchhHHhhc----HHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCC
Q 001711          430 YFFLIDVSISAIRSG----MLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLP  505 (1021)
Q Consensus       430 yvFvIDvS~~av~sG----~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~  505 (1021)
                      +|||||+|.++-..+    .++.+++++...++.+++   .+|+|++|++..+.                          
T Consensus         2 vv~v~D~SgSM~~~~~~~~~~~~~~~~~~~~~~~~~~---~~v~l~~f~~~~~~--------------------------   52 (172)
T PF13519_consen    2 VVFVLDNSGSMNGYDGNRTRIDQAKDALNELLANLPG---DRVGLVSFSDSSRT--------------------------   52 (172)
T ss_dssp             EEEEEE-SGGGGTTTSSS-HHHHHHHHHHHHHHHHTT---SEEEEEEESTSCEE--------------------------
T ss_pred             EEEEEECCcccCCCCCCCcHHHHHHHHHHHHHHHCCC---CEEEEEEecccccc--------------------------
Confidence            589999999986542    578889999999988763   48999999875311                          


Q ss_pred             CccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcC---CEEEEEecCCCCCCcccccccCCcCcc
Q 001711          506 DDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLG---GKLLIFQNSLPSLGVGCLKLRGDDLRV  582 (1021)
Q Consensus       506 ~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~G---GkIivF~sg~Pt~GpG~L~~r~~~~r~  582 (1021)
                         +.++...++.+.+.|+.+....  ......+++.||..|.+++....   ..|++|+.|.++               
T Consensus        53 ---~~~~t~~~~~~~~~l~~~~~~~--~~~~~t~~~~al~~a~~~~~~~~~~~~~iv~iTDG~~~---------------  112 (172)
T PF13519_consen   53 ---LSPLTSDKDELKNALNKLSPQG--MPGGGTNLYDALQEAAKMLASSDNRRRAIVLITDGEDN---------------  112 (172)
T ss_dssp             ---EEEEESSHHHHHHHHHTHHHHG----SSS--HHHHHHHHHHHHHC-SSEEEEEEEEES-TTH---------------
T ss_pred             ---cccccccHHHHHHHhhcccccc--cCccCCcHHHHHHHHHHHHHhCCCCceEEEEecCCCCC---------------
Confidence               0112234555566666664321  12455789999999999998653   355555554222               


Q ss_pred             cCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeC
Q 001711          583 YGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYP  643 (1021)
Q Consensus       583 ~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~  643 (1021)
                                    .-..+.+..+.+.+|.|.++.+..+...-..|..|++.|||..+...
T Consensus       113 --------------~~~~~~~~~~~~~~i~i~~v~~~~~~~~~~~l~~la~~tgG~~~~~~  159 (172)
T PF13519_consen  113 --------------SSDIEAAKALKQQGITIYTVGIGSDSDANEFLQRLAEATGGRYFHVD  159 (172)
T ss_dssp             --------------CHHHHHHHHHHCTTEEEEEEEES-TT-EHHHHHHHHHHTEEEEEEE-
T ss_pred             --------------cchhHHHHHHHHcCCeEEEEEECCCccHHHHHHHHHHhcCCEEEEec
Confidence                          00113667788999999999998887766789999999999988873


No 27 
>cd01472 vWA_collagen von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins.  This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via  the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.
Probab=98.26  E-value=2.8e-05  Score=79.37  Aligned_cols=151  Identities=18%  Similarity=0.146  Sum_probs=96.5

Q ss_pred             EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCC-CCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCcc
Q 001711          430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGF-PRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDL  508 (1021)
Q Consensus       430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~-~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~l  508 (1021)
                      .+||||+|.++-.. -++.++++++..+..|... .+.+||||+|++..+..-              .+..         
T Consensus         3 vv~vlD~SgSm~~~-~~~~~k~~~~~~~~~l~~~~~~~~~giv~Fs~~~~~~~--------------~~~~---------   58 (164)
T cd01472           3 IVFLVDGSESIGLS-NFNLVKDFVKRVVERLDIGPDGVRVGVVQYSDDPRTEF--------------YLNT---------   58 (164)
T ss_pred             EEEEEeCCCCCCHH-HHHHHHHHHHHHHhhcccCCCCeEEEEEEEcCceeEEE--------------ecCC---------
Confidence            58999999987543 4677888888888877532 347999999998765421              0000         


Q ss_pred             ceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc--------CCEEEEEecCCCCCCcccccccCCcC
Q 001711          509 LVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL--------GGKLLIFQNSLPSLGVGCLKLRGDDL  580 (1021)
Q Consensus       509 Lv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~--------GGkIivF~sg~Pt~GpG~L~~r~~~~  580 (1021)
                          ...++.+.+.|+.|...     ...+.+|.||..|.+.|...        ...|++++.|.++.+           
T Consensus        59 ----~~~~~~~~~~l~~l~~~-----~g~T~~~~al~~a~~~l~~~~~~~~~~~~~~iiliTDG~~~~~-----------  118 (164)
T cd01472          59 ----YRSKDDVLEAVKNLRYI-----GGGTNTGKALKYVRENLFTEASGSREGVPKVLVVITDGKSQDD-----------  118 (164)
T ss_pred             ----CCCHHHHHHHHHhCcCC-----CCCchHHHHHHHHHHHhCCcccCCCCCCCEEEEEEcCCCCCch-----------
Confidence                02244556667777642     34578999999999988641        123556655532110           


Q ss_pred             cccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhcccccc-EEEEeCC
Q 001711          581 RVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGG-QVYYYPS  644 (1021)
Q Consensus       581 r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG-~v~~y~~  644 (1021)
                                       . ...+.++.+.||.|..+.++.  .|...|..++..++| .+|.+..
T Consensus       119 -----------------~-~~~~~~l~~~gv~i~~ig~g~--~~~~~L~~ia~~~~~~~~~~~~~  163 (164)
T cd01472         119 -----------------V-EEPAVELKQAGIEVFAVGVKN--ADEEELKQIASDPKELYVFNVAD  163 (164)
T ss_pred             -----------------H-HHHHHHHHHCCCEEEEEECCc--CCHHHHHHHHCCCchheEEeccC
Confidence                             0 123344556777655554444  499999999999987 5665544


No 28 
>TIGR03436 acidobact_VWFA VWFA-related Acidobacterial domain. Members of this family are bacterial domains that include a region related to the von Willebrand factor type A (VWFA) domain (pfam00092). These domains are restricted to, and have undergone a large paralogous family expansion in, the Acidobacteria, including Solibacter usitatus and Acidobacterium capsulatum ATCC 51196.
Probab=98.22  E-value=7.4e-05  Score=83.85  Aligned_cols=158  Identities=17%  Similarity=0.231  Sum_probs=101.8

Q ss_pred             CCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhc-CCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCC
Q 001711          426 MPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDE-LPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPL  504 (1021)
Q Consensus       426 ~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~-Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl  504 (1021)
                      .|...+||||+|.++..  .+..++++++..|+. +..  +.+|+||+|++.+++..              +        
T Consensus        52 ~p~~vvlvlD~SgSM~~--~~~~a~~a~~~~l~~~l~~--~d~v~lv~f~~~~~~~~--------------~--------  105 (296)
T TIGR03436        52 LPLTVGLVIDTSGSMRN--DLDRARAAAIRFLKTVLRP--NDRVFVVTFNTRLRLLQ--------------D--------  105 (296)
T ss_pred             CCceEEEEEECCCCchH--HHHHHHHHHHHHHHhhCCC--CCEEEEEEeCCceeEee--------------c--------
Confidence            47789999999998753  467788888888877 543  47999999998765421              1        


Q ss_pred             CCccceehhhhHHHHHHHHhhCCCccc---------CCCCcccchHHHHHHH-HHHHHhc-----CCE-EEEEecCCCCC
Q 001711          505 PDDLLVNLSESRSVVDTLLDSLPSMFQ---------DNMNVESAFGPALKAA-FMVMSRL-----GGK-LLIFQNSLPSL  568 (1021)
Q Consensus       505 ~~~lLv~l~es~~~I~~lLd~Lp~~f~---------~~~~~~~alG~AL~aA-~~lL~~~-----GGk-IivF~sg~Pt~  568 (1021)
                             ....++.|...|+.|.....         .....++++..||..| ..++...     |-| ||+|+.|..+ 
T Consensus       106 -------~t~~~~~l~~~l~~l~~~~~~~~~~~~~~~~~~g~T~l~~al~~aa~~~~~~~~~~~p~rk~iIllTDG~~~-  177 (296)
T TIGR03436       106 -------FTSDPRLLEAALNRLKPPLRTDYNSSGAFVRDGGGTALYDAITLAALEQLANALAGIPGRKALIVISDGGDN-  177 (296)
T ss_pred             -------CCCCHHHHHHHHHhccCCCccccccccccccCCCcchhHHHHHHHHHHHHHHhhcCCCCCeEEEEEecCCCc-
Confidence                   01224556666666643110         0124567788887544 4555442     334 4555544211 


Q ss_pred             CcccccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCC------------cChhhhhhhccccc
Q 001711          569 GVGCLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKY------------TDIASLGTLAKYTG  636 (1021)
Q Consensus       569 GpG~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~------------~diatl~~L~~~TG  636 (1021)
                                               ....-++++...|.+.+|.|..+.++...            .+-..|..||+.||
T Consensus       178 -------------------------~~~~~~~~~~~~~~~~~v~vy~I~~~~~~~~~~~~~~~~~~~~~~~L~~iA~~TG  232 (296)
T TIGR03436       178 -------------------------RSRDTLERAIDAAQRADVAIYSIDARGLRAPDLGAGAKAGLGGPEALERLAEETG  232 (296)
T ss_pred             -------------------------chHHHHHHHHHHHHHcCCEEEEeccCccccCCcccccccCCCcHHHHHHHHHHhC
Confidence                                     01234577888888999998888775321            24568999999999


Q ss_pred             cEEEEe
Q 001711          637 GQVYYY  642 (1021)
Q Consensus       637 G~v~~y  642 (1021)
                      |+.|+-
T Consensus       233 G~~~~~  238 (296)
T TIGR03436       233 GRAFYV  238 (296)
T ss_pred             CeEecc
Confidence            997654


No 29 
>cd01470 vWA_complement_factors Complement factors B and C2 are two critical proteases for complement activation. They both contain three CCP or Sushi domains, a trypsin-type serine protease domain and a single VWA domain with a conserved metal ion dependent adhesion site referred commonly as the MIDAS motif. Orthologues of these molecules are found from echinoderms to chordates. During complement activation, the CCP domains are cleaved off, resulting in the formation of an active protease that cleaves and activates complement C3. Complement C2 is in the classical pathway and complement B is in the alternative pathway. The interaction of C2 with C4 and of factor B with C3b are both dependent on Mg2+ binding sites within the VWA domains and the VWA domain of factor B has been shown to mediate the binding of C3. This is consistent with the common inferred function of VWA domains as magnesium-dependent protein interaction domains.
Probab=98.17  E-value=4.4e-05  Score=80.51  Aligned_cols=167  Identities=14%  Similarity=0.181  Sum_probs=101.6

Q ss_pred             EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCC-CCceEEEEEEcCeEEE-EecCCCCCCcceeeccccccccCCCCCc
Q 001711          430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGF-PRTQIGFITFDSTIHF-YNMKSSLTQPQMMVISDLDDIFVPLPDD  507 (1021)
Q Consensus       430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~-~rt~VgiITFds~V~f-ynl~~~~~~p~mlVvsDldd~f~Pl~~~  507 (1021)
                      ++||||+|.++-.+ -++.++++|+..++.|... .+.+||||+|++..+. +.+...                      
T Consensus         3 i~~vlD~SgSM~~~-~~~~~k~~~~~l~~~l~~~~~~~~v~li~Fs~~~~~~~~~~~~----------------------   59 (198)
T cd01470           3 IYIALDASDSIGEE-DFDEAKNAIKTLIEKISSYEVSPRYEIISYASDPKEIVSIRDF----------------------   59 (198)
T ss_pred             EEEEEECCCCccHH-HHHHHHHHHHHHHHHccccCCCceEEEEEecCCceEEEecccC----------------------
Confidence            68999999987544 3678899999999988642 3579999999987653 222110                      


Q ss_pred             cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---------CC--EEEEEecCCCCCCccccccc
Q 001711          508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---------GG--KLLIFQNSLPSLGVGCLKLR  576 (1021)
Q Consensus       508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---------GG--kIivF~sg~Pt~GpG~L~~r  576 (1021)
                          ....++.+...|+.+..... ....++.++.||+.+...|...         ++  .|++|+.|.+|.|.....  
T Consensus        60 ----~~~~~~~~~~~l~~~~~~~~-~~~ggT~~~~Al~~~~~~l~~~~~~~~~~~~~~~~~iillTDG~~~~g~~~~~--  132 (198)
T cd01470          60 ----NSNDADDVIKRLEDFNYDDH-GDKTGTNTAAALKKVYERMALEKVRNKEAFNETRHVIILFTDGKSNMGGSPLP--  132 (198)
T ss_pred             ----CCCCHHHHHHHHHhCCcccc-cCccchhHHHHHHHHHHHHHHHHhcCccchhhcceEEEEEcCCCcCCCCChhH--
Confidence                01123344555666643211 1234678999999988776311         12  378899998886521100  


Q ss_pred             CCcCcccCCCccccCCCCCcHHHHHHHH------HHhhCCcEEEEEEecCCCcChhhhhhhcccccc--EEEEeCCC
Q 001711          577 GDDLRVYGTDKEHSLRIPEDPFYKQMAA------DLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGG--QVYYYPSF  645 (1021)
Q Consensus       577 ~~~~r~~gt~~e~~l~~pa~~fY~~La~------~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG--~v~~y~~F  645 (1021)
                                        ..+.++++..      .+.+.+|+|..+.++. ..|..+|..|+..|||  ++|+..+|
T Consensus       133 ------------------~~~~~~~~~~~~~~~~~~~~~~v~i~~iGvG~-~~~~~~L~~iA~~~~g~~~~f~~~~~  190 (198)
T cd01470         133 ------------------TVDKIKNLVYKNNKSDNPREDYLDVYVFGVGD-DVNKEELNDLASKKDNERHFFKLKDY  190 (198)
T ss_pred             ------------------HHHHHHHHHhcccccccchhcceeEEEEecCc-ccCHHHHHHHhcCCCCCceEEEeCCH
Confidence                              0111222211      1234456665555543 4789999999999999  46665544


No 30 
>cd01461 vWA_interalpha_trypsin_inhibitor vWA_interalpha trypsin inhibitor (ITI): ITI is a glycoprotein composed of three polypeptides- two heavy chains and one light chain (bikunin). Bikunin confers the protease-inhibitor function while the heavy chains are involved in rendering stability to the extracellular matrix by binding to hyaluronic acid. The heavy chains carry the VWA domain with a conserved MIDAS motif. Although the exact role of the VWA domains remains unknown, it has been speculated to be involved in mediating protein-protein interactions with the components of the extracellular matrix.
Probab=98.16  E-value=0.00012  Score=74.65  Aligned_cols=157  Identities=17%  Similarity=0.204  Sum_probs=102.1

Q ss_pred             CCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCC
Q 001711          427 PPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPD  506 (1021)
Q Consensus       427 pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~  506 (1021)
                      |.-++||+|+|.++.. .-++.+.++|...+..++.+  .+|+|++|++.++.+- ..                +.+  .
T Consensus         2 ~~~v~~vlD~S~SM~~-~~~~~~~~al~~~l~~l~~~--~~~~l~~Fs~~~~~~~-~~----------------~~~--~   59 (171)
T cd01461           2 PKEVVFVIDTSGSMSG-TKIEQTKEALLTALKDLPPG--DYFNIIGFSDTVEEFS-PS----------------SVS--A   59 (171)
T ss_pred             CceEEEEEECCCCCCC-hhHHHHHHHHHHHHHhCCCC--CEEEEEEeCCCceeec-Cc----------------cee--C
Confidence            4568999999999843 23778888999999988755  6899999998765431 00                000  0


Q ss_pred             ccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh---cCCEEEEEecCCCCCCcccccccCCcCccc
Q 001711          507 DLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR---LGGKLLIFQNSLPSLGVGCLKLRGDDLRVY  583 (1021)
Q Consensus       507 ~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~---~GGkIivF~sg~Pt~GpG~L~~r~~~~r~~  583 (1021)
                          + .+.++.+.+.|+.+..      ...+.+..||..|...++.   ....|++|+.|..+          +     
T Consensus        60 ----~-~~~~~~~~~~l~~~~~------~g~T~l~~al~~a~~~l~~~~~~~~~iillTDG~~~----------~-----  113 (171)
T cd01461          60 ----T-AENVAAAIEYVNRLQA------LGGTNMNDALEAALELLNSSPGSVPQIILLTDGEVT----------N-----  113 (171)
T ss_pred             ----C-HHHHHHHHHHHHhcCC------CCCcCHHHHHHHHHHhhccCCCCccEEEEEeCCCCC----------C-----
Confidence                0 1223333445555432      4457799999999998874   23456666665411          0     


Q ss_pred             CCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCC
Q 001711          584 GTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPS  644 (1021)
Q Consensus       584 gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~  644 (1021)
                                 .... .+.+.++.+.+|.|..+.++. ..|-..|..+++.|||..++..+
T Consensus       114 -----------~~~~-~~~~~~~~~~~i~i~~i~~g~-~~~~~~l~~ia~~~gG~~~~~~~  161 (171)
T cd01461         114 -----------ESQI-LKNVREALSGRIRLFTFGIGS-DVNTYLLERLAREGRGIARRIYE  161 (171)
T ss_pred             -----------HHHH-HHHHHHhcCCCceEEEEEeCC-ccCHHHHHHHHHcCCCeEEEecC
Confidence                       0122 234445555678777777764 35678899999999999998875


No 31 
>cd01452 VWA_26S_proteasome_subunit 26S proteasome plays a major role in eukaryotic protein breakdown, especially for ubiquitin-tagged proteins. It is an ATP-dependent protease responsible for the bulk of non-lysosomal proteolysis in eukaryotes, often using covalent modification of proteins by ubiquitylation. It consists of a 20S proteolytic core particle (CP) and a 19S regulatory particle (RP). The CP is an ATP independent peptidase consisting of hydrolyzing activities. One or both ends of CP carry the RP that confers both ubiquitin and ATP dependence to the 26S proteosome. The RP's  proposed functions include recognition of substrates and translocation of these to CP for proteolysis. The RP can dissociate into a stable lid and base subcomplexes. The base is composed of three non-ATPase subunits (Rpn 1, 2 and 10). A single residue in the vWA domain of Rpn10 has been implicated to be responsible for stabilizing the lid-base association.
Probab=98.08  E-value=8e-05  Score=78.21  Aligned_cols=142  Identities=15%  Similarity=0.217  Sum_probs=95.8

Q ss_pred             eEEEEEecchhHHhh----cHHHHHHHHHHHHH----hcCCCCCCceEEEEEEcC-eEEEEecCCCCCCcceeecccccc
Q 001711          429 LYFFLIDVSISAIRS----GMLEVVAQTIKSCL----DELPGFPRTQIGFITFDS-TIHFYNMKSSLTQPQMMVISDLDD  499 (1021)
Q Consensus       429 ~yvFvIDvS~~av~s----G~l~~~~~sI~~~L----~~Lp~~~rt~VgiITFds-~V~fynl~~~~~~p~mlVvsDldd  499 (1021)
                      +.+++||+|..+.+.    ..+++.++.+...+    +..+   ..+||||+|.. .-++                    
T Consensus         5 a~vi~lD~S~sM~a~D~~PnRL~aak~~i~~~~~~f~~~np---~~~vGlv~fag~~a~v--------------------   61 (187)
T cd01452           5 ATMICIDNSEYMRNGDYPPTRFQAQADAVNLICQAKTRSNP---ENNVGLMTMAGNSPEV--------------------   61 (187)
T ss_pred             EEEEEEECCHHHHcCCCCCCHHHHHHHHHHHHHHHHHhcCC---CccEEEEEecCCceEE--------------------
Confidence            568999999987432    35778888877664    4444   36899999975 2221                    


Q ss_pred             ccCCCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-----CCEEEEEecCCCCCCccccc
Q 001711          500 IFVPLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-----GGKLLIFQNSLPSLGVGCLK  574 (1021)
Q Consensus       500 ~f~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-----GGkIivF~sg~Pt~GpG~L~  574 (1021)
                               +++++.....+...|+.+..      ..+..+|.||+.|..+|++.     ..||++|.+++-+.      
T Consensus        62 ---------~~plT~D~~~~~~~L~~i~~------~g~~~l~~AL~~A~~~L~~~~~~~~~~rivi~v~S~~~~------  120 (187)
T cd01452          62 ---------LVTLTNDQGKILSKLHDVQP------KGKANFITGIQIAQLALKHRQNKNQKQRIVAFVGSPIEE------  120 (187)
T ss_pred             ---------EECCCCCHHHHHHHHHhCCC------CCcchHHHHHHHHHHHHhcCCCcCCcceEEEEEecCCcC------
Confidence                     22333446667777777641      25567999999999999752     24889998865221      


Q ss_pred             ccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccc
Q 001711          575 LRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKY  634 (1021)
Q Consensus       575 ~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~  634 (1021)
                                  .+        +-..++++++.++||.||+..++...-+..-|..+.+.
T Consensus       121 ------------d~--------~~i~~~~~~lkk~~I~v~vI~~G~~~~~~~~l~~~~~~  160 (187)
T cd01452         121 ------------DE--------KDLVKLAKRLKKNNVSVDIINFGEIDDNTEKLTAFIDA  160 (187)
T ss_pred             ------------CH--------HHHHHHHHHHHHcCCeEEEEEeCCCCCCHHHHHHHHHH
Confidence                        11        11347899999999999999998664444444444433


No 32 
>cd01480 vWA_collagen_alpha_1-VI-type VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far.  Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=98.01  E-value=0.00011  Score=76.93  Aligned_cols=157  Identities=14%  Similarity=0.131  Sum_probs=100.8

Q ss_pred             eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-------CCCceEEEEEEcCeEEEE-ecCCCCCCcceeeccccccc
Q 001711          429 LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-------FPRTQIGFITFDSTIHFY-NMKSSLTQPQMMVISDLDDI  500 (1021)
Q Consensus       429 ~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-------~~rt~VgiITFds~V~fy-nl~~~~~~p~mlVvsDldd~  500 (1021)
                      -.+||||.|.+.-.+. ++.+++.++..++.|..       ....+||+|+|++..++. .+.                 
T Consensus         4 dvv~vlD~S~Sm~~~~-~~~~k~~~~~~~~~l~~~~~~~i~~~~~rvglv~fs~~~~~~~~l~-----------------   65 (186)
T cd01480           4 DITFVLDSSESVGLQN-FDITKNFVKRVAERFLKDYYRKDPAGSWRVGVVQYSDQQEVEAGFL-----------------   65 (186)
T ss_pred             eEEEEEeCCCccchhh-HHHHHHHHHHHHHHHhhhhccCCCCCceEEEEEEecCCceeeEecc-----------------
Confidence            3689999999875444 56667777777777621       234799999999764421 110                 


Q ss_pred             cCCCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh----cC-CEEEEEecCCCCCCcccccc
Q 001711          501 FVPLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR----LG-GKLLIFQNSLPSLGVGCLKL  575 (1021)
Q Consensus       501 f~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~----~G-GkIivF~sg~Pt~GpG~L~~  575 (1021)
                           +.     ...++.+.+.|+.|...     ...+++|.||..|...+..    .. ..|++++.|..+.+      
T Consensus        66 -----~~-----~~~~~~l~~~i~~l~~~-----gg~T~~~~AL~~a~~~l~~~~~~~~~~~iillTDG~~~~~------  124 (186)
T cd01480          66 -----RD-----IRNYTSLKEAVDNLEYI-----GGGTFTDCALKYATEQLLEGSHQKENKFLLVITDGHSDGS------  124 (186)
T ss_pred             -----cc-----cCCHHHHHHHHHhCccC-----CCCccHHHHHHHHHHHHhccCCCCCceEEEEEeCCCcCCC------
Confidence                 00     12356667777777531     3468899999999999864    12 34555555543210      


Q ss_pred             cCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCC
Q 001711          576 RGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQ  646 (1021)
Q Consensus       576 r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~  646 (1021)
                                         ...-..+.+.++.+.||.|-.+.++.  .|...|..++...+|. |+-.+|.
T Consensus       125 -------------------~~~~~~~~~~~~~~~gi~i~~vgig~--~~~~~L~~IA~~~~~~-~~~~~~~  173 (186)
T cd01480         125 -------------------PDGGIEKAVNEADHLGIKIFFVAVGS--QNEEPLSRIACDGKSA-LYRENFA  173 (186)
T ss_pred             -------------------cchhHHHHHHHHHHCCCEEEEEecCc--cchHHHHHHHcCCcch-hhhcchh
Confidence                               00122456677888888866666654  7888899999887776 5555553


No 33 
>PF00626 Gelsolin:  Gelsolin repeat;  InterPro: IPR007123 Gelsolin is a cytoplasmic, calcium-regulated, actin-modulating protein that binds to the barbed ends of actin filaments, preventing monomer exchange (end-blocking or capping) []. It can promote nucleation (the assembly of monomers into filaments), as well as sever existing filaments. In addition, this protein binds with high affinity to fibronectin. Plasma gelsolin and cytoplasmic gelsolin are derived from a single gene by alternate initiation sites and differential splicing. Sequence comparisons indicate an evolutionary relationship between gelsolin, villin, fragmin and severin []. Six large repeating segments occur in gelsolin and villin, and 3 similar segments in severin and fragmin. While the multiple repeats have yet to be related to any known function of the actin-severing proteins, the superfamily appears to have evolved from an ancestral sequence of 120 to 130 amino acid residues [].; PDB: 3FG6_F 1RGI_G 2FGH_A 1D0N_B 3EGD_B 2NUP_B 2NUT_B 3EGX_B 1JHW_A 1J72_A ....
Probab=97.99  E-value=6.7e-06  Score=72.99  Aligned_cols=66  Identities=24%  Similarity=0.488  Sum_probs=50.1

Q ss_pred             cccccccchhhccCCcEEEEECCceeEEEecCCCCHHHHHhhcCCchhhhhccccccccchHHHHHHHHHHHHHH-HhCC
Q 001711          892 IMKRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPDIAMNLLGSEFAAELSKVMLREQDNEMSRKLLGILKKLR-EQDP  970 (1021)
Q Consensus       892 lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~ll~~lFgv~~~~~l~~~~lp~~~n~~s~~l~~ii~~lr-~~r~  970 (1021)
                      ++..++++.+.|.++++||||+|..||+|+|+..  ...++.++.                       .+++++. ..|.
T Consensus         4 ~~~~~~~s~~~L~s~~~yIld~~~~i~vW~G~~~--~~~e~~~a~-----------------------~~a~~~~~~~~~   58 (76)
T PF00626_consen    4 RPEQVPLSQSSLNSDDCYILDCGYEIFVWVGKKS--SPEEKAFAA-----------------------QLAQELLSEERP   58 (76)
T ss_dssp             EEEEESSSGGGEETTSEEEEEESSEEEEEEHTTS--HHHHHHHHH-----------------------HHHHHHHHHHTT
T ss_pred             cCCcCCCCHHHcCCCCEEEEEeCCCcEEEEeccC--CHHHHHHHH-----------------------HHHHHhhhhcCC
Confidence            4677899999999999999999999999999994  344444433                       3445555 6677


Q ss_pred             CCCceEEEeccCC
Q 001711          971 SYYQLCQLVRQGE  983 (1021)
Q Consensus       971 ~~~~l~~vvrqg~  983 (1021)
                      ...++ .++.+|.
T Consensus        59 ~~~~~-~~~~eg~   70 (76)
T PF00626_consen   59 PLPEV-IRVEEGK   70 (76)
T ss_dssp             TTSEE-EEEETTH
T ss_pred             CCCEE-EEecCCC
Confidence            77776 7778874


No 34 
>PF13768 VWA_3:  von Willebrand factor type A domain
Probab=97.97  E-value=0.00011  Score=74.18  Aligned_cols=150  Identities=23%  Similarity=0.305  Sum_probs=99.8

Q ss_pred             EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCccc
Q 001711          430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDLL  509 (1021)
Q Consensus       430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~lL  509 (1021)
                      .|||||+|.++.  |..+.++++|+..|+.|+++  .++.||+||+.++.|.-  .                       +
T Consensus         3 vvilvD~S~Sm~--g~~~~~k~al~~~l~~L~~~--d~fnii~f~~~~~~~~~--~-----------------------~   53 (155)
T PF13768_consen    3 VVILVDTSGSMS--GEKELVKDALRAILRSLPPG--DRFNIIAFGSSVRPLFP--G-----------------------L   53 (155)
T ss_pred             EEEEEeCCCCCC--CcHHHHHHHHHHHHHhCCCC--CEEEEEEeCCEeeEcch--h-----------------------H
Confidence            689999999984  33388999999999999865  79999999998775431  1                       1


Q ss_pred             eeh-hhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh--cCCEEEEEecCCCCCCcccccccCCcCcccCCC
Q 001711          510 VNL-SESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR--LGGKLLIFQNSLPSLGVGCLKLRGDDLRVYGTD  586 (1021)
Q Consensus       510 v~l-~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~--~GGkIivF~sg~Pt~GpG~L~~r~~~~r~~gt~  586 (1021)
                      +.. .+.++...+.++.+..     ....+.+..||+.|+..+..  .--.|++++.|.++.+.                
T Consensus        54 ~~~~~~~~~~a~~~I~~~~~-----~~G~t~l~~aL~~a~~~~~~~~~~~~IilltDG~~~~~~----------------  112 (155)
T PF13768_consen   54 VPATEENRQEALQWIKSLEA-----NSGGTDLLAALRAALALLQRPGCVRAIILLTDGQPVSGE----------------  112 (155)
T ss_pred             HHHhHHHHHHHHHHHHHhcc-----cCCCccHHHHHHHHHHhcccCCCccEEEEEEeccCCCCH----------------
Confidence            111 1344444555555432     25667899999999988632  34577888777653221                


Q ss_pred             ccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEE
Q 001711          587 KEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYY  641 (1021)
Q Consensus       587 ~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~  641 (1021)
                               ....+.+ .++. ..+.|+.|.++. ..+-..|..|++.|||...+
T Consensus       113 ---------~~i~~~v-~~~~-~~~~i~~~~~g~-~~~~~~L~~LA~~~~G~~~f  155 (155)
T PF13768_consen  113 ---------EEILDLV-RRAR-GHIRIFTFGIGS-DADADFLRELARATGGSFHF  155 (155)
T ss_pred             ---------HHHHHHH-HhcC-CCceEEEEEECC-hhHHHHHHHHHHcCCCEEEC
Confidence                     1122222 2222 457777777765 46678899999999998763


No 35 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=97.93  E-value=0.0002  Score=77.24  Aligned_cols=167  Identities=21%  Similarity=0.272  Sum_probs=104.2

Q ss_pred             eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCC-CCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCc
Q 001711          429 LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGF-PRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDD  507 (1021)
Q Consensus       429 ~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~-~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~  507 (1021)
                      -.+||||.|.+.-... ++.+++.++..++.|.-. ..++||||+|++.+++.-              ++.+        
T Consensus         4 DlvfllD~S~Sm~~~~-~~~~k~f~~~l~~~l~~~~~~~rvglv~fs~~~~~~~--------------~l~~--------   60 (224)
T cd01475           4 DLVFLIDSSRSVRPEN-FELVKQFLNQIIDSLDVGPDATRVGLVQYSSTVKQEF--------------PLGR--------   60 (224)
T ss_pred             cEEEEEeCCCCCCHHH-HHHHHHHHHHHHHhcccCCCccEEEEEEecCceeEEe--------------cccc--------
Confidence            4799999999864333 678888899888877432 358999999998765420              1110        


Q ss_pred             cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHH-Hh-cC--------CE-EEEEecCCCCCCccccccc
Q 001711          508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVM-SR-LG--------GK-LLIFQNSLPSLGVGCLKLR  576 (1021)
Q Consensus       508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL-~~-~G--------Gk-IivF~sg~Pt~GpG~L~~r  576 (1021)
                           ..+++.|.+.|+.|..+     ...+.+|.||+.|...+ .. .|        -| |++|+.|.++         
T Consensus        61 -----~~~~~~l~~~i~~i~~~-----~~~t~tg~AL~~a~~~~~~~~~g~r~~~~~~~kvvillTDG~s~---------  121 (224)
T cd01475          61 -----FKSKADLKRAVRRMEYL-----ETGTMTGLAIQYAMNNAFSEAEGARPGSERVPRVGIVVTDGRPQ---------  121 (224)
T ss_pred             -----cCCHHHHHHHHHhCcCC-----CCCChHHHHHHHHHHHhCChhcCCCCCCCCCCeEEEEEcCCCCc---------
Confidence                 01344556667777543     23467899999888653 21 11        13 4566555321         


Q ss_pred             CCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhcccccc-EEEEeCCCCCchhHHHHH
Q 001711          577 GDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGG-QVYYYPSFQSTTHGERLR  655 (1021)
Q Consensus       577 ~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG-~v~~y~~F~~~~d~~kl~  655 (1021)
                                          +-+++.+.++.+.||.|  |+++-...|...|..|+..+++ .+|+-.+|+.   -+++.
T Consensus       122 --------------------~~~~~~a~~lk~~gv~i--~~VgvG~~~~~~L~~ias~~~~~~~f~~~~~~~---l~~~~  176 (224)
T cd01475         122 --------------------DDVSEVAAKARALGIEM--FAVGVGRADEEELREIASEPLADHVFYVEDFST---IEELT  176 (224)
T ss_pred             --------------------ccHHHHHHHHHHCCcEE--EEEeCCcCCHHHHHHHhCCCcHhcEEEeCCHHH---HHHHh
Confidence                                01356778888888655  5544445788999999987754 6666666542   34455


Q ss_pred             HHHHHhc
Q 001711          656 HELSRDL  662 (1021)
Q Consensus       656 ~dL~r~l  662 (1021)
                      .+|...+
T Consensus       177 ~~l~~~~  183 (224)
T cd01475         177 KKFQGKI  183 (224)
T ss_pred             hhccccc
Confidence            5554443


No 36 
>PTZ00441 sporozoite surface protein 2 (SSP2); Provisional
Probab=97.93  E-value=0.00037  Score=83.43  Aligned_cols=163  Identities=11%  Similarity=0.064  Sum_probs=100.5

Q ss_pred             CeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEE-EEecCCCCCCcceeeccccccccCCCC
Q 001711          428 PLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIH-FYNMKSSLTQPQMMVISDLDDIFVPLP  505 (1021)
Q Consensus       428 p~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~-fynl~~~~~~p~mlVvsDldd~f~Pl~  505 (1021)
                      .-++||||+|.+.-...+++.++..++..++.+.. ..+++||+|+|++..+ ++.+....                   
T Consensus        43 lDIvFLLD~SgSMg~~Nfle~AK~Fa~~LV~~l~Is~D~V~VgiV~FSd~~r~vfpL~s~~-------------------  103 (576)
T PTZ00441         43 VDLYLLVDGSGSIGYHNWITHVIPMLMGLIQQLNLSDDAINLYMSLFSNNTTELIRLGSGA-------------------  103 (576)
T ss_pred             ceEEEEEeCCCccCCccHHHHHHHHHHHHHHHhccCCCceEEEEEEeCCCceEEEecCCCc-------------------
Confidence            35799999999886656667788888888887753 3458899999987654 33332211                   


Q ss_pred             CccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcC------CEEEEEecCCCCCCcccccccCCc
Q 001711          506 DDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLG------GKLLIFQNSLPSLGVGCLKLRGDD  579 (1021)
Q Consensus       506 ~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~G------GkIivF~sg~Pt~GpG~L~~r~~~  579 (1021)
                         -.+.......|..++..+.      ....+.+|.||..|...+...+      +.||||+.|.++-+          
T Consensus       104 ---s~Dk~~aL~~I~sL~~~~~------pgGgTnig~AL~~Aae~L~sr~~R~nvpKVVILLTDG~sns~----------  164 (576)
T PTZ00441        104 ---SKDKEQALIIVKSLRKTYL------PYGKTNMTDALLEVRKHLNDRVNRENAIQLVILMTDGIPNSK----------  164 (576)
T ss_pred             ---cccHHHHHHHHHHHHhhcc------CCCCccHHHHHHHHHHHHhhcccccCCceEEEEEecCCCCCc----------
Confidence               0011122333333333321      1245779999999988887543      56778877664311          


Q ss_pred             CcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhc----cccccEEEEeCCCC
Q 001711          580 LRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLA----KYTGGQVYYYPSFQ  646 (1021)
Q Consensus       580 ~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~----~~TGG~v~~y~~F~  646 (1021)
                                      .+. .+.+.++.+.||.|-+|.++. ..|...+..|+    ..++|.+|.+.+|+
T Consensus       165 ----------------~dv-leaAq~LR~~GVeI~vIGVG~-g~n~e~LrlIAgC~p~~g~c~~Y~vadf~  217 (576)
T PTZ00441        165 ----------------YRA-LEESRKLKDRNVKLAVIGIGQ-GINHQFNRLLAGCRPREGKCKFYSDADWE  217 (576)
T ss_pred             ----------------ccH-HHHHHHHHHCCCEEEEEEeCC-CcCHHHHHHHhccCCCCCCCceEEeCCHH
Confidence                            001 134566777888766666643 46666555555    33556788877774


No 37 
>cd01450 vWFA_subfamily_ECM Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses  In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A
Probab=97.91  E-value=0.00022  Score=71.32  Aligned_cols=145  Identities=21%  Similarity=0.198  Sum_probs=98.8

Q ss_pred             EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCcc
Q 001711          430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDL  508 (1021)
Q Consensus       430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~l  508 (1021)
                      ++||||+|.++-. .-++.+++.+...++.+.. +.+.+|+||+|++..+...              ++.       +. 
T Consensus         3 i~~llD~S~Sm~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~li~f~~~~~~~~--------------~~~-------~~-   59 (161)
T cd01450           3 IVFLLDGSESVGP-ENFEKVKDFIEKLVEKLDIGPDKTRVGLVQYSDDVRVEF--------------SLN-------DY-   59 (161)
T ss_pred             EEEEEeCCCCcCH-HHHHHHHHHHHHHHHheeeCCCceEEEEEEEcCCceEEE--------------ECC-------CC-
Confidence            5799999998743 2567788888888887763 2468999999997543210              100       00 


Q ss_pred             ceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcC-------CEEEEEecCCCCCCcccccccCCcCc
Q 001711          509 LVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLG-------GKLLIFQNSLPSLGVGCLKLRGDDLR  581 (1021)
Q Consensus       509 Lv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~G-------GkIivF~sg~Pt~GpG~L~~r~~~~r  581 (1021)
                           ..++.+.+.|+.+.....    ..+.++.||+.|...+....       ..|++|++|.++.+.           
T Consensus        60 -----~~~~~~~~~i~~~~~~~~----~~t~~~~al~~a~~~~~~~~~~~~~~~~~iiliTDG~~~~~~-----------  119 (161)
T cd01450          60 -----KSKDDLLKAVKNLKYLGG----GGTNTGKALQYALEQLFSESNARENVPKVIIVLTDGRSDDGG-----------  119 (161)
T ss_pred             -----CCHHHHHHHHHhcccCCC----CCccHHHHHHHHHHHhcccccccCCCCeEEEEECCCCCCCCc-----------
Confidence                 024455556666543211    46889999999999986542       257777777655431           


Q ss_pred             ccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhcccc
Q 001711          582 VYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYT  635 (1021)
Q Consensus       582 ~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~T  635 (1021)
                                      -..++..++.+.+|.|..+.++.  .|...|..|+..|
T Consensus       120 ----------------~~~~~~~~~~~~~v~v~~i~~g~--~~~~~l~~la~~~  155 (161)
T cd01450         120 ----------------DPKEAAAKLKDEGIKVFVVGVGP--ADEEELREIASCP  155 (161)
T ss_pred             ----------------chHHHHHHHHHCCCEEEEEeccc--cCHHHHHHHhCCC
Confidence                            12566777788888888887766  7888899999888


No 38 
>cd01477 vWA_F09G8-8_type VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses  In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of mo
Probab=97.87  E-value=0.00038  Score=73.64  Aligned_cols=151  Identities=23%  Similarity=0.265  Sum_probs=90.2

Q ss_pred             eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-------CCCceEEEEEEcCeEEE-EecCCCCCCcceeeccccccc
Q 001711          429 LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-------FPRTQIGFITFDSTIHF-YNMKSSLTQPQMMVISDLDDI  500 (1021)
Q Consensus       429 ~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-------~~rt~VgiITFds~V~f-ynl~~~~~~p~mlVvsDldd~  500 (1021)
                      =.|||||.|.+.-..+ ++.+++.|+..+..+..       ...+|||+|+|++..++ ++|.            |.   
T Consensus        21 DivfvlD~S~Sm~~~~-f~~~k~fi~~~~~~~~~~~~~~~~~~~~rVGlV~fs~~a~~~~~L~------------d~---   84 (193)
T cd01477          21 DIVFVVDNSKGMTQGG-LWQVRATISSLFGSSSQIGTDYDDPRSTRVGLVTYNSNATVVADLN------------DL---   84 (193)
T ss_pred             eEEEEEeCCCCcchhh-HHHHHHHHHHHHhhccccccccCCCCCcEEEEEEccCceEEEEecc------------cc---
Confidence            4799999999875433 67788888887776543       13489999999987653 2221            10   


Q ss_pred             cCCCCCccceehhhhHHHHHHHHhh-CCCcccCCCCcccchHHHHHHHHHHHHhc--C-----CE-EEEEecCCCCCCcc
Q 001711          501 FVPLPDDLLVNLSESRSVVDTLLDS-LPSMFQDNMNVESAFGPALKAAFMVMSRL--G-----GK-LLIFQNSLPSLGVG  571 (1021)
Q Consensus       501 f~Pl~~~lLv~l~es~~~I~~lLd~-Lp~~f~~~~~~~~alG~AL~aA~~lL~~~--G-----Gk-IivF~sg~Pt~GpG  571 (1021)
                                   ...+.+.+.|+. +..+.   ...++.+|.||+.|.+++...  +     -| ||+++++--+.+  
T Consensus        85 -------------~~~~~~~~ai~~~~~~~~---~~ggT~ig~aL~~A~~~l~~~~~~~R~~v~kvvIllTDg~~~~~--  146 (193)
T cd01477          85 -------------QSFDDLYSQIQGSLTDVS---STNASYLDTGLQAAEQMLAAGKRTSRENYKKVVIVFASDYNDEG--  146 (193)
T ss_pred             -------------cCHHHHHHHHHHHhhccc---cCCcchHHHHHHHHHHHHHhhhccccCCCCeEEEEEecCccCCC--
Confidence                         011222222332 21111   123678999999999999742  3     46 455544421100  


Q ss_pred             cccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccE
Q 001711          572 CLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQ  638 (1021)
Q Consensus       572 ~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~  638 (1021)
                                      +       . -..+.|.++.+.||.|..+.++. +.|...+..|++..++.
T Consensus       147 ----------------~-------~-~~~~~a~~l~~~GI~i~tVGiG~-~~d~~~~~~L~~ias~~  188 (193)
T cd01477         147 ----------------S-------N-DPRPIAARLKSTGIAIITVAFTQ-DESSNLLDKLGKIASPG  188 (193)
T ss_pred             ----------------C-------C-CHHHHHHHHHHCCCEEEEEEeCC-CCCHHHHHHHHHhcCCC
Confidence                            0       0 02467888999999998888875 45544455555554443


No 39 
>cd01471 vWA_micronemal_protein Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.
Probab=97.86  E-value=0.00038  Score=72.53  Aligned_cols=149  Identities=15%  Similarity=0.153  Sum_probs=92.5

Q ss_pred             EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEEE-EecCCCCCCcceeeccccccccCCCCCc
Q 001711          430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIHF-YNMKSSLTQPQMMVISDLDDIFVPLPDD  507 (1021)
Q Consensus       430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~f-ynl~~~~~~p~mlVvsDldd~f~Pl~~~  507 (1021)
                      ++||||+|.++-....++.+++.++..++.+.- ..+++||+|+|++..+. +++...                      
T Consensus         3 v~~vlD~SgSm~~~~~~~~~k~~~~~~~~~~~~~~~~~~vglv~Fs~~~~~~~~l~~~----------------------   60 (186)
T cd01471           3 LYLLVDGSGSIGYSNWVTHVVPFLHTFVQNLNISPDEINLYLVTFSTNAKELIRLSSP----------------------   60 (186)
T ss_pred             EEEEEeCCCCccchhhHHHHHHHHHHHHHhcccCCCceEEEEEEecCCceEEEECCCc----------------------
Confidence            689999999986655477888888888887752 23589999999987652 323211                      


Q ss_pred             cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-C------CEEEEEecCCCCCCcccccccCCcC
Q 001711          508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-G------GKLLIFQNSLPSLGVGCLKLRGDDL  580 (1021)
Q Consensus       508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-G------GkIivF~sg~Pt~GpG~L~~r~~~~  580 (1021)
                          ....++.+.++++.|..+.  .....++++.||+.|.+.+... +      ..|+++++|.++-+..         
T Consensus        61 ----~~~~~~~~~~~i~~l~~~~--~~~G~T~l~~aL~~a~~~l~~~~~~r~~~~~~villTDG~~~~~~~---------  125 (186)
T cd01471          61 ----NSTNKDLALNAIRALLSLY--YPNGSTNTTSALLVVEKHLFDTRGNRENAPQLVIIMTDGIPDSKFR---------  125 (186)
T ss_pred             ----cccchHHHHHHHHHHHhCc--CCCCCccHHHHHHHHHHHhhccCCCcccCceEEEEEccCCCCCCcc---------
Confidence                0112222223333332211  1235678999999999999652 1      2477777766432100         


Q ss_pred             cccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccc
Q 001711          581 RVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKY  634 (1021)
Q Consensus       581 r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~  634 (1021)
                                      .  .+.+.++.+.||.|-++.++ ...|...|..|+..
T Consensus       126 ----------------~--~~~a~~l~~~gv~v~~igiG-~~~d~~~l~~ia~~  160 (186)
T cd01471         126 ----------------T--LKEARKLRERGVIIAVLGVG-QGVNHEENRSLVGC  160 (186)
T ss_pred             ----------------h--hHHHHHHHHCCCEEEEEEee-hhhCHHHHHHhcCC
Confidence                            0  13456677788776666665 35777777777664


No 40 
>TIGR02442 Cob-chelat-sub cobaltochelatase subunit. A number of genomes (actinobacteria, cyanobacteria, betaproteobacteria and pseudomonads) which apparently biosynthesize B12, encode a cobN gene but are demonstrably lacking cobS and cobT. These genomes do, however contain a homolog (modelled here) of the magnesium chelatase subunits BchI/BchD family. Aside from the cyanobacteria (which have a separate magnesium chelatase trimer), these species do not make chlorins, so do not have any use for a magnesium chelatase. Furthermore, in nearly all cases the members of this family are proximal to either CobN itself or other genes involved in cobalt transport or B12 biosynthesis.
Probab=97.83  E-value=0.00018  Score=89.15  Aligned_cols=160  Identities=21%  Similarity=0.273  Sum_probs=109.7

Q ss_pred             CCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCe-EEEEecCCCCCCcceeeccccccccCCCC
Q 001711          427 PPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDST-IHFYNMKSSLTQPQMMVISDLDDIFVPLP  505 (1021)
Q Consensus       427 pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~-V~fynl~~~~~~p~mlVvsDldd~f~Pl~  505 (1021)
                      .-.++||||+|.++...+-++.++.++...|..... .+.+||||+|++. ..+                          
T Consensus       465 ~~~vv~vvD~SgSM~~~~rl~~ak~a~~~ll~~a~~-~~D~v~lI~F~g~~a~~--------------------------  517 (633)
T TIGR02442       465 GNLVIFVVDASGSMAARGRMAAAKGAVLSLLRDAYQ-KRDKVALITFRGEEAEV--------------------------  517 (633)
T ss_pred             CceEEEEEECCccCCCccHHHHHHHHHHHHHHHhhc-CCCEEEEEEECCCCceE--------------------------
Confidence            457889999999985444577778777777764322 2478999999743 111                          


Q ss_pred             CccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh-------cCCEEEEEecCCCCCCcccccccCC
Q 001711          506 DDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR-------LGGKLLIFQNSLPSLGVGCLKLRGD  578 (1021)
Q Consensus       506 ~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~-------~GGkIivF~sg~Pt~GpG~L~~r~~  578 (1021)
                         ++++..+++.+...|+.|+.      ...+.++.||..|..+++.       ..+.|++++.|..|.|.+.    ++
T Consensus       518 ---~~p~t~~~~~~~~~L~~l~~------gG~Tpl~~aL~~A~~~l~~~~~~~~~~~~~vvliTDG~~n~~~~~----~~  584 (633)
T TIGR02442       518 ---LLPPTSSVELAARRLEELPT------GGRTPLAAGLLKAAEVLSNELLRDDDGRPLLVVITDGRANVADGG----EP  584 (633)
T ss_pred             ---EcCCCCCHHHHHHHHHhCCC------CCCCCHHHHHHHHHHHHHHhhccCCCCceEEEEECCCCCCCCCCC----CC
Confidence               11122344555667777753      4567899999999999883       2367999999998875110    00


Q ss_pred             cCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEe
Q 001711          579 DLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYY  642 (1021)
Q Consensus       579 ~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y  642 (1021)
                                     +..+ -..+|.++.+.+|.+.++-+...+++...+..||+.+||+.|+.
T Consensus       585 ---------------~~~~-~~~~a~~l~~~~i~~~vIdt~~~~~~~~~~~~lA~~~gg~y~~l  632 (633)
T TIGR02442       585 ---------------PTDD-ARTIAAKLAARGILFVVIDTESGFVRLGLAEDLARALGGEYVRL  632 (633)
T ss_pred             ---------------hHHH-HHHHHHHHHhcCCeEEEEeCCCCCcchhHHHHHHHhhCCeEEec
Confidence                           0011 24567777778887766666667777888999999999999864


No 41 
>cd01469 vWA_integrins_alpha_subunit Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions.
Probab=97.81  E-value=0.00065  Score=70.58  Aligned_cols=156  Identities=12%  Similarity=0.183  Sum_probs=100.0

Q ss_pred             EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCC-CCceEEEEEEcCeEEEE-ecCCCCCCcceeeccccccccCCCCCc
Q 001711          430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGF-PRTQIGFITFDSTIHFY-NMKSSLTQPQMMVISDLDDIFVPLPDD  507 (1021)
Q Consensus       430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~-~rt~VgiITFds~V~fy-nl~~~~~~p~mlVvsDldd~f~Pl~~~  507 (1021)
                      ++|+||.|.+.-.. -++.+++.++..++.+..+ ..+|||+|+|++..++. ++.            |.          
T Consensus         3 i~fvlD~S~S~~~~-~f~~~k~fi~~~i~~l~~~~~~~rvgvv~fs~~~~~~~~l~------------~~----------   59 (177)
T cd01469           3 IVFVLDGSGSIYPD-DFQKVKNFLSTVMKKLDIGPTKTQFGLVQYSESFRTEFTLN------------EY----------   59 (177)
T ss_pred             EEEEEeCCCCCCHH-HHHHHHHHHHHHHHHcCcCCCCcEEEEEEECCceeEEEecC------------cc----------
Confidence            68999999886432 3677888899988887643 35899999999876531 221            10          


Q ss_pred             cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHH--HhcCC------EEEEEecCCCCCCcccccccCCc
Q 001711          508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVM--SRLGG------KLLIFQNSLPSLGVGCLKLRGDD  579 (1021)
Q Consensus       508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL--~~~GG------kIivF~sg~Pt~GpG~L~~r~~~  579 (1021)
                            .+.+.+.+.++.+...     ...+.+|.||+.|...+  ...|.      -+++++.|..+-+.         
T Consensus        60 ------~~~~~~~~~i~~~~~~-----~g~T~~~~AL~~a~~~l~~~~~g~R~~~~kv~illTDG~~~~~~---------  119 (177)
T cd01469          60 ------RTKEEPLSLVKHISQL-----LGLTNTATAIQYVVTELFSESNGARKDATKVLVVITDGESHDDP---------  119 (177)
T ss_pred             ------CCHHHHHHHHHhCccC-----CCCccHHHHHHHHHHHhcCcccCCCCCCCeEEEEEeCCCCCCcc---------
Confidence                  1122344455666532     22378999999998876  22332      36666665533211         


Q ss_pred             CcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCC---cChhhhhhhcccccc-EEEEeCCCC
Q 001711          580 LRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKY---TDIASLGTLAKYTGG-QVYYYPSFQ  646 (1021)
Q Consensus       580 ~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~---~diatl~~L~~~TGG-~v~~y~~F~  646 (1021)
                                        ..+..+.++.+.||.|-.+..+..+   .+..+|..++..+++ ++|...+|+
T Consensus       120 ------------------~~~~~~~~~k~~gv~v~~Vgvg~~~~~~~~~~~L~~ias~p~~~h~f~~~~~~  172 (177)
T cd01469         120 ------------------LLKDVIPQAEREGIIRYAIGVGGHFQRENSREELKTIASKPPEEHFFNVTDFA  172 (177)
T ss_pred             ------------------ccHHHHHHHHHCCcEEEEEEecccccccccHHHHHHHhcCCcHHhEEEecCHH
Confidence                              0044566677788877777766543   347889999998874 666666653


No 42 
>cd01482 vWA_collagen_alphaI-XII-like Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=97.76  E-value=0.00083  Score=68.69  Aligned_cols=150  Identities=19%  Similarity=0.185  Sum_probs=93.6

Q ss_pred             EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCcc
Q 001711          430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDL  508 (1021)
Q Consensus       430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~l  508 (1021)
                      .+||||.|.+.-+.+ ++.+++.++..+..+.- .++++||||+|++..+..-              ++++         
T Consensus         3 v~~vlD~S~Sm~~~~-~~~~k~~~~~l~~~~~~~~~~~rvgli~fs~~~~~~~--------------~l~~---------   58 (164)
T cd01482           3 IVFLVDGSWSIGRSN-FNLVRSFLSSVVEAFEIGPDGVQVGLVQYSDDPRTEF--------------DLNA---------   58 (164)
T ss_pred             EEEEEeCCCCcChhh-HHHHHHHHHHHHhheeeCCCceEEEEEEECCCeeEEE--------------ecCC---------
Confidence            689999999886544 57788888888887642 2458999999998654310              0110         


Q ss_pred             ceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHH-Hh-cC------CEEEEEecCCCCCCcccccccCCcC
Q 001711          509 LVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVM-SR-LG------GKLLIFQNSLPSLGVGCLKLRGDDL  580 (1021)
Q Consensus       509 Lv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL-~~-~G------GkIivF~sg~Pt~GpG~L~~r~~~~  580 (1021)
                          ..+++.+.+.|+++..     ....+.+|.||..+...+ +. .|      ..|++|+.|.++-            
T Consensus        59 ----~~~~~~l~~~l~~~~~-----~~g~T~~~~aL~~a~~~~~~~~~~~r~~~~k~iillTDG~~~~------------  117 (164)
T cd01482          59 ----YTSKEDVLAAIKNLPY-----KGGNTRTGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQD------------  117 (164)
T ss_pred             ----CCCHHHHHHHHHhCcC-----CCCCChHHHHHHHHHHHhcccccCCCCCCCEEEEEEcCCCCCc------------
Confidence                0123445555666653     234567999999877644 32 11      2366776654320            


Q ss_pred             cccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhcccccc-EEEEeC
Q 001711          581 RVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGG-QVYYYP  643 (1021)
Q Consensus       581 r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG-~v~~y~  643 (1021)
                                       -.++.+.++.+.||.+-.+  +-+..+...|..|+..+.. +++...
T Consensus       118 -----------------~~~~~a~~lk~~gi~i~~i--g~g~~~~~~L~~ia~~~~~~~~~~~~  162 (164)
T cd01482         118 -----------------DVELPARVLRNLGVNVFAV--GVKDADESELKMIASKPSETHVFNVA  162 (164)
T ss_pred             -----------------hHHHHHHHHHHCCCEEEEE--ecCcCCHHHHHHHhCCCchheEEEcC
Confidence                             1245677888888754444  4444668889999888654 455443


No 43 
>TIGR02031 BchD-ChlD magnesium chelatase ATPase subunit D. This model represents one of two ATPase subunits of the trimeric magnesium chelatase responsible for insertion of magnesium ion into protoporphyrin IX. This is an essential step in the biosynthesis of both chlorophyll and bacteriochlorophyll. This subunit is found in green plants, photosynthetic algae, cyanobacteria and other photosynthetic bacteria. Unlike subunit I (TIGR02030), this subunit is not found in archaea.
Probab=97.75  E-value=0.00044  Score=84.93  Aligned_cols=174  Identities=20%  Similarity=0.242  Sum_probs=117.6

Q ss_pred             CCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCC
Q 001711          426 MPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLP  505 (1021)
Q Consensus       426 ~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~  505 (1021)
                      ..-.++||||+|.++-. .-++.+++++...|..+-. .+-+||||+|++...-+            +        +|  
T Consensus       406 ~~~~v~fvvD~SGSM~~-~rl~~aK~av~~Ll~~~~~-~~D~v~Li~F~~~~a~~------------~--------lp--  461 (589)
T TIGR02031       406 SGRLLIFVVDASGSAAV-ARMSEAKGAVELLLGEAYV-HRDQVSLIAFRGTAAEV------------L--------LP--  461 (589)
T ss_pred             cCceEEEEEECCCCCCh-HHHHHHHHHHHHHHHhhcc-CCCEEEEEEECCCCceE------------E--------CC--
Confidence            45568899999998832 3578888888888875422 23589999997542110            0        11  


Q ss_pred             CccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh---cCC--EEEEEecCCCCCCccc-ccccCCc
Q 001711          506 DDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR---LGG--KLLIFQNSLPSLGVGC-LKLRGDD  579 (1021)
Q Consensus       506 ~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~---~GG--kIivF~sg~Pt~GpG~-L~~r~~~  579 (1021)
                            ...+++.+...|+.|+.      ..++.++.||..|...++.   .++  .|++++.|.+|+|.+. ...... 
T Consensus       462 ------~t~~~~~~~~~L~~l~~------gGgTpL~~gL~~A~~~~~~~~~~~~~~~ivllTDG~~nv~~~~~~~~~~~-  528 (589)
T TIGR02031       462 ------PSRSVEQAKRRLDVLPG------GGGTPLAAGLAAAFQTALQARSSGGTPTIVLITDGRGNIPLDGDPESIKA-  528 (589)
T ss_pred             ------CCCCHHHHHHHHhcCCC------CCCCcHHHHHHHHHHHHHHhcccCCceEEEEECCCCCCCCCCcccccccc-
Confidence                  11233444556777752      4567899999999999864   233  6999999999987531 110000 


Q ss_pred             CcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCC
Q 001711          580 LRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQS  647 (1021)
Q Consensus       580 ~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~  647 (1021)
                            .     .....+-...++.++.+.||.+-++-+...+.+..-+..|++..||..|+.++-++
T Consensus       529 ------~-----~~~~~~~~~~~a~~~~~~gi~~~vid~~~~~~~~~~~~~lA~~~~g~y~~l~~~~a  585 (589)
T TIGR02031       529 ------D-----REQAAEEALALARKIREAGMPALVIDTAMRFVSTGFAQKLARKMGAHYIYLPNATA  585 (589)
T ss_pred             ------c-----chhHHHHHHHHHHHHHhcCCeEEEEeCCCCCccchHHHHHHHhcCCcEEeCCCCCh
Confidence                  0     11223344677888999998877777777777777789999999999999887543


No 44 
>COG1240 ChlD Mg-chelatase subunit ChlD [Coenzyme metabolism]
Probab=97.73  E-value=0.00043  Score=75.00  Aligned_cols=166  Identities=17%  Similarity=0.236  Sum_probs=119.5

Q ss_pred             CCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCC
Q 001711          426 MPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLP  505 (1021)
Q Consensus       426 ~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~  505 (1021)
                      ....+|||||.|.++-...-.++++-++...|.+--. .|-||++|+|...           +                 
T Consensus        77 ~g~lvvfvVDASgSM~~~~Rm~aaKG~~~~lL~dAYq-~RdkvavI~F~G~-----------~-----------------  127 (261)
T COG1240          77 AGNLIVFVVDASGSMAARRRMAAAKGAALSLLRDAYQ-RRDKVAVIAFRGE-----------K-----------------  127 (261)
T ss_pred             cCCcEEEEEeCcccchhHHHHHHHHHHHHHHHHHHHH-ccceEEEEEecCC-----------c-----------------
Confidence            4457899999999986655688888888888875332 3578999999632           1                 


Q ss_pred             CccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcC-------CEEEEEecCCCCCCcccccccCC
Q 001711          506 DDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLG-------GKLLIFQNSLPSLGVGCLKLRGD  578 (1021)
Q Consensus       506 ~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~G-------GkIivF~sg~Pt~GpG~L~~r~~  578 (1021)
                      -.++++...+.+.+++.|+.|+.      ...+=+..||+.|.+++....       -.+++.+.|.+|.+.+.=..   
T Consensus       128 A~lll~pT~sv~~~~~~L~~l~~------GG~TPL~~aL~~a~ev~~r~~r~~p~~~~~~vviTDGr~n~~~~~~~~---  198 (261)
T COG1240         128 AELLLPPTSSVELAERALERLPT------GGKTPLADALRQAYEVLAREKRRGPDRRPVMVVITDGRANVPIPLGPK---  198 (261)
T ss_pred             ceEEeCCcccHHHHHHHHHhCCC------CCCCchHHHHHHHHHHHHHhhccCCCcceEEEEEeCCccCCCCCCchH---
Confidence            13455566677888889999984      344559999999999997532       47888999998876431100   


Q ss_pred             cCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCC
Q 001711          579 DLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQS  647 (1021)
Q Consensus       579 ~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~  647 (1021)
                                        .--...+.++...|+-+=+.=+...++.+.-...||+..||++|+.+..+.
T Consensus       199 ------------------~e~~~~a~~~~~~g~~~lvid~e~~~~~~g~~~~iA~~~Gg~~~~L~~l~~  249 (261)
T COG1240         199 ------------------AETLEAASKLRLRGIQLLVIDTEGSEVRLGLAEEIARASGGEYYHLDDLSD  249 (261)
T ss_pred             ------------------HHHHHHHHHHhhcCCcEEEEecCCccccccHHHHHHHHhCCeEEecccccc
Confidence                              001345666667777666666677777777789999999999999987654


No 45 
>PHA03247 large tegument protein UL36; Provisional
Probab=97.72  E-value=0.069  Score=72.32  Aligned_cols=14  Identities=21%  Similarity=0.228  Sum_probs=8.6

Q ss_pred             HHHHHHHHHHHHhc
Q 001711          446 LEVVAQTIKSCLDE  459 (1021)
Q Consensus       446 l~~~~~sI~~~L~~  459 (1021)
                      |-.+|+.|...|..
T Consensus      3114 Li~ACr~i~r~lr~ 3127 (3151)
T PHA03247       3114 LIEACRRIRRQLRR 3127 (3151)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45566667666653


No 46 
>smart00327 VWA von Willebrand factor (vWF) type A domain. VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.
Probab=97.71  E-value=0.0012  Score=66.89  Aligned_cols=153  Identities=22%  Similarity=0.217  Sum_probs=104.8

Q ss_pred             eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCc
Q 001711          429 LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDD  507 (1021)
Q Consensus       429 ~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~  507 (1021)
                      -++||||+|.++-. ..++.+.+.+...+..+.. .+..+||||+|++..+.+.                     +..  
T Consensus         3 ~v~l~vD~S~SM~~-~~~~~~~~~~~~~~~~~~~~~~~~~i~ii~f~~~~~~~~---------------------~~~--   58 (177)
T smart00327        3 DVVFLLDGSGSMGP-NRFEKAKEFVLKLVEQLDIGPDGDRVGLVTFSDDATVLF---------------------PLN--   58 (177)
T ss_pred             cEEEEEeCCCccch-HHHHHHHHHHHHHHHhcCCCCCCcEEEEEEeCCCceEEE---------------------ccc--
Confidence            47899999998842 4577888888888888764 2358999999998443321                     000  


Q ss_pred             cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh---c-----CCEEEEEecCCCCCCcccccccCCc
Q 001711          508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR---L-----GGKLLIFQNSLPSLGVGCLKLRGDD  579 (1021)
Q Consensus       508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~---~-----GGkIivF~sg~Pt~GpG~L~~r~~~  579 (1021)
                          ....++.+...++.+...    .....-++.||+.+...++.   .     .-.|++|++|.++.+          
T Consensus        59 ----~~~~~~~~~~~i~~~~~~----~~~~~~~~~al~~~~~~~~~~~~~~~~~~~~~iviitDg~~~~~----------  120 (177)
T smart00327       59 ----DSRSKDALLEALASLSYK----LGGGTNLGAALQYALENLFSKSAGSRRGAPKVLILITDGESNDG----------  120 (177)
T ss_pred             ----ccCCHHHHHHHHHhcCCC----CCCCchHHHHHHHHHHHhcCcCCCCCCCCCeEEEEEcCCCCCCC----------
Confidence                123345566677766532    33456789999999998852   1     125666666554422          


Q ss_pred             CcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEE
Q 001711          580 LRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYY  641 (1021)
Q Consensus       580 ~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~  641 (1021)
                                       ..+++...++.+.+|.+..+.++... +...+..++..++|...+
T Consensus       121 -----------------~~~~~~~~~~~~~~i~i~~i~~~~~~-~~~~l~~~~~~~~~~~~~  164 (177)
T smart00327      121 -----------------GDLLKAAKELKRSGVKVFVVGVGNDV-DEEELKKLASAPGGVYVF  164 (177)
T ss_pred             -----------------ccHHHHHHHHHHCCCEEEEEEccCcc-CHHHHHHHhCCCcceEEe
Confidence                             23467778888889888888887653 778899999999987765


No 47 
>PRK13406 bchD magnesium chelatase subunit D; Provisional
Probab=97.71  E-value=0.00099  Score=81.47  Aligned_cols=167  Identities=18%  Similarity=0.179  Sum_probs=111.8

Q ss_pred             CCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCe-EEEEecCCCCCCcceeeccccccccCCC
Q 001711          426 MPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDST-IHFYNMKSSLTQPQMMVISDLDDIFVPL  504 (1021)
Q Consensus       426 ~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~-V~fynl~~~~~~p~mlVvsDldd~f~Pl  504 (1021)
                      ..-.++||||+|.++.. .-+..++.+++..|+..-. .|-+|++|+|++. ..+                         
T Consensus       400 ~~~~vvfvvD~SGSM~~-~rl~~aK~a~~~ll~~ay~-~rD~v~lI~F~g~~a~~-------------------------  452 (584)
T PRK13406        400 SETTTIFVVDASGSAAL-HRLAEAKGAVELLLAEAYV-RRDQVALVAFRGRGAEL-------------------------  452 (584)
T ss_pred             CCccEEEEEECCCCCcH-hHHHHHHHHHHHHHHhhcC-CCCEEEEEEECCCceeE-------------------------
Confidence            34688999999999843 3578888888888876422 3468999999754 211                         


Q ss_pred             CCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---C--CEEEEEecCCCCCCcccccccCCc
Q 001711          505 PDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---G--GKLLIFQNSLPSLGVGCLKLRGDD  579 (1021)
Q Consensus       505 ~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---G--GkIivF~sg~Pt~GpG~L~~r~~~  579 (1021)
                          +++...+.+.+...|+.|+      ...++.++.||..|..+++..   |  -.|++++.|-.|.|.+.-..+++ 
T Consensus       453 ----~lppT~~~~~~~~~L~~l~------~gGgTpL~~gL~~A~~~l~~~~~~~~~~~iVLlTDG~~n~~~~~~~~~~~-  521 (584)
T PRK13406        453 ----LLPPTRSLVRAKRSLAGLP------GGGGTPLAAGLDAAAALALQVRRKGMTPTVVLLTDGRANIARDGTAGRAQ-  521 (584)
T ss_pred             ----EcCCCcCHHHHHHHHhcCC------CCCCChHHHHHHHHHHHHHHhccCCCceEEEEEeCCCCCCCccccccccc-
Confidence                1111123344556667775      246788999999999988642   2  47888999998886532111110 


Q ss_pred             CcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCC
Q 001711          580 LRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQS  647 (1021)
Q Consensus       580 ~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~  647 (1021)
                                    +..+ =..++..+.+.+|.+-++-+....  ...+..|++.+||..|..++-+.
T Consensus       522 --------------~~~~-~~~~a~~~~~~gi~~~vId~g~~~--~~~~~~LA~~~gg~y~~l~~~~a  572 (584)
T PRK13406        522 --------------AEED-ALAAARALRAAGLPALVIDTSPRP--QPQARALAEAMGARYLPLPRADA  572 (584)
T ss_pred             --------------hhhH-HHHHHHHHHhcCCeEEEEecCCCC--cHHHHHHHHhcCCeEEECCCCCH
Confidence                          0001 145678888888876666665444  34478999999999999997544


No 48 
>cd00198 vWFA Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses  In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.
Probab=97.71  E-value=0.00096  Score=65.57  Aligned_cols=148  Identities=22%  Similarity=0.320  Sum_probs=98.2

Q ss_pred             eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCc
Q 001711          429 LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDD  507 (1021)
Q Consensus       429 ~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~  507 (1021)
                      .++|+||+|.++ ....++.+++.+...+..+.. ....+|++++|+...+.+-              ++.+.       
T Consensus         2 ~v~~viD~S~Sm-~~~~~~~~~~~~~~~~~~~~~~~~~~~i~v~~f~~~~~~~~--------------~~~~~-------   59 (161)
T cd00198           2 DIVFLLDVSGSM-GGEKLDKAKEALKALVSSLSASPPGDRVGLVTFGSNARVVL--------------PLTTD-------   59 (161)
T ss_pred             cEEEEEeCCCCc-CcchHHHHHHHHHHHHHhcccCCCCcEEEEEEecCccceee--------------ccccc-------
Confidence            378999999987 345678888889999988875 2348999999997433211              00000       


Q ss_pred             cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-----CCEEEEEecCCCCCCcccccccCCcCcc
Q 001711          508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-----GGKLLIFQNSLPSLGVGCLKLRGDDLRV  582 (1021)
Q Consensus       508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-----GGkIivF~sg~Pt~GpG~L~~r~~~~r~  582 (1021)
                            ..++.+...++.+..    .......+..|+..+.+.+...     ...|++|+.|..+.+.            
T Consensus        60 ------~~~~~~~~~~~~~~~----~~~~~t~~~~al~~~~~~~~~~~~~~~~~~lvvitDg~~~~~~------------  117 (161)
T cd00198          60 ------TDKADLLEAIDALKK----GLGGGTNIGAALRLALELLKSAKRPNARRVIILLTDGEPNDGP------------  117 (161)
T ss_pred             ------CCHHHHHHHHHhccc----CCCCCccHHHHHHHHHHHhcccCCCCCceEEEEEeCCCCCCCc------------
Confidence                  134445556666643    2345677889999999999753     4567777776543321            


Q ss_pred             cCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhcccc
Q 001711          583 YGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYT  635 (1021)
Q Consensus       583 ~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~T  635 (1021)
                                    .-.++...++.+.+|.|.++.++. ..+-..+..|+..|
T Consensus       118 --------------~~~~~~~~~~~~~~v~v~~v~~g~-~~~~~~l~~l~~~~  155 (161)
T cd00198         118 --------------ELLAEAARELRKLGITVYTIGIGD-DANEDELKEIADKT  155 (161)
T ss_pred             --------------chhHHHHHHHHHcCCEEEEEEcCC-CCCHHHHHHHhccc
Confidence                          011345666777799998888776 45666788888887


No 49 
>PF00092 VWA:  von Willebrand factor type A domain;  InterPro: IPR002035 The von Willebrand factor is a large multimeric glycoprotein found in blood plasma. Mutant forms are involved in the aetiology of bleeding disorders []. In von Willebrand factor, the type A domain (vWF) is the prototype for a protein superfamily. The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. Fold recognition algorithms were used to score sequence compatibility with a library of known structures: the vWF domain fold was predicted to be a doubly-wound, open, twisted beta-sheet flanked by alpha-helices []. 3D structures have been determined for the I-domains of integrins CD11b (with bound magnesium) [] and CD11a (with bound manganese) []. The domain adopts a classic alpha/beta Rossmann fold and contains an unusual metal ion coordination site at its surface. It has been suggested that this site represents a general metal ion-dependent adhesion site (MIDAS) for binding protein ligands []. The residues constituting the MIDAS motif in the CD11b and CD11a I-domains are completely conserved, but the manner in which the metal ion is coordinated differs slightly [].; GO: 0005515 protein binding; PDB: 2XGG_B 3ZQK_B 3GXB_A 3PPV_A 3PPX_A 3PPW_A 3PPY_A 1CQP_B 3TCX_B 2ICA_A ....
Probab=97.64  E-value=0.00086  Score=68.30  Aligned_cols=155  Identities=25%  Similarity=0.331  Sum_probs=95.2

Q ss_pred             EEEEEecchhHHhhcHHHHHHHHHHHHHhcCC-CCCCceEEEEEEcCeEEEE-ecCCCCCCcceeeccccccccCCCCCc
Q 001711          430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELP-GFPRTQIGFITFDSTIHFY-NMKSSLTQPQMMVISDLDDIFVPLPDD  507 (1021)
Q Consensus       430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp-~~~rt~VgiITFds~V~fy-nl~~~~~~p~mlVvsDldd~f~Pl~~~  507 (1021)
                      .+|+||.|.++-..+ ++.+++.|...++.+. ...++|||||+|++..+.+ ++..                       
T Consensus         2 ivflvD~S~sm~~~~-~~~~~~~v~~~i~~~~~~~~~~rv~iv~f~~~~~~~~~~~~-----------------------   57 (178)
T PF00092_consen    2 IVFLVDTSGSMSGDN-FEKAKQFVKSIISRLSISNNGTRVGIVTFSDSARVLFSLTD-----------------------   57 (178)
T ss_dssp             EEEEEE-STTSCHHH-HHHHHHHHHHHHHHSTBSTTSEEEEEEEESSSEEEEEETTS-----------------------
T ss_pred             EEEEEeCCCCCchHH-HHHHHHHHHHHHHhhhccccccccceeeeeccccccccccc-----------------------
Confidence            589999999875433 6678888999988773 3456999999999887622 2211                       


Q ss_pred             cceehhhhHHHHHHHH-hhCCCcccCCCCcccchHHHHHHHHHHHHhc--C------CEEEEEecCCCCCCcccccccCC
Q 001711          508 LLVNLSESRSVVDTLL-DSLPSMFQDNMNVESAFGPALKAAFMVMSRL--G------GKLLIFQNSLPSLGVGCLKLRGD  578 (1021)
Q Consensus       508 lLv~l~es~~~I~~lL-d~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~--G------GkIivF~sg~Pt~GpG~L~~r~~  578 (1021)
                           .++.+.+.+.+ +.++.     ....+.+|.||+.|...+...  |      .-|+++++|.++.+.        
T Consensus        58 -----~~~~~~~~~~i~~~~~~-----~~g~t~~~~aL~~a~~~l~~~~~~~r~~~~~~iiliTDG~~~~~~--------  119 (178)
T PF00092_consen   58 -----YQSKNDLLNAINDSIPS-----SGGGTNLGAALKFAREQLFSSNNGGRPNSPKVIILITDGNSNDSD--------  119 (178)
T ss_dssp             -----HSSHHHHHHHHHTTGGC-----CBSSB-HHHHHHHHHHHTTSGGGTTGTTSEEEEEEEESSSSSSHS--------
T ss_pred             -----ccccccccccccccccc-----cchhhhHHHHHhhhhhcccccccccccccccceEEEEeecccCCc--------
Confidence                 01222222222 33332     345677999999999998643  2      235666665543221        


Q ss_pred             cCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccc--cccEEEEeCCCC
Q 001711          579 DLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKY--TGGQVYYYPSFQ  646 (1021)
Q Consensus       579 ~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~--TGG~v~~y~~F~  646 (1021)
                                         .....+..+.+. ..|.+|+++.+..|...|..|+..  .+|++++..+|+
T Consensus       120 -------------------~~~~~~~~~~~~-~~i~~~~ig~~~~~~~~l~~la~~~~~~~~~~~~~~~~  169 (178)
T PF00092_consen  120 -------------------SPSEEAANLKKS-NGIKVIAIGIDNADNEELRELASCPTSEGHVFYLADFS  169 (178)
T ss_dssp             -------------------GHHHHHHHHHHH-CTEEEEEEEESCCHHHHHHHHSHSSTCHHHEEEESSHH
T ss_pred             -------------------chHHHHHHHHHh-cCcEEEEEecCcCCHHHHHHHhCCCCCCCcEEEcCCHH
Confidence                               011122222222 567777777777889999999965  447888877654


No 50 
>cd01481 vWA_collagen_alpha3-VI-like VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far.  Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=97.56  E-value=0.0024  Score=65.76  Aligned_cols=151  Identities=18%  Similarity=0.232  Sum_probs=93.8

Q ss_pred             EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEEE-EecCCCCCCcceeeccccccccCCCCCc
Q 001711          430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIHF-YNMKSSLTQPQMMVISDLDDIFVPLPDD  507 (1021)
Q Consensus       430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~f-ynl~~~~~~p~mlVvsDldd~f~Pl~~~  507 (1021)
                      .+|+||.|.+.-+ .-++.+++.|+..++.+.- ...+|||+|+|++..+. ++|.               +        
T Consensus         3 ivfllD~S~Si~~-~~f~~~k~fi~~lv~~f~i~~~~~rVgvv~ys~~~~~~~~l~---------------~--------   58 (165)
T cd01481           3 IVFLIDGSDNVGS-GNFPAIRDFIERIVQSLDVGPDKIRVAVVQFSDTPRPEFYLN---------------T--------   58 (165)
T ss_pred             EEEEEeCCCCcCH-HHHHHHHHHHHHHHhhccCCCCCcEEEEEEecCCeeEEEecc---------------c--------
Confidence            5899999987543 3477888889999988763 24589999999876542 1221               1        


Q ss_pred             cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHH-Hh-cCC-------EE-EEEecCCCCCCcccccccC
Q 001711          508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVM-SR-LGG-------KL-LIFQNSLPSLGVGCLKLRG  577 (1021)
Q Consensus       508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL-~~-~GG-------kI-ivF~sg~Pt~GpG~L~~r~  577 (1021)
                           ..+++.+.+.++.|+.+    ....+.+|.||+.+.+.+ .. .|+       |+ ++++.|..+          
T Consensus        59 -----~~~~~~l~~~i~~i~~~----~g~~t~t~~AL~~~~~~~f~~~~g~R~~~~~~kv~vviTdG~s~----------  119 (165)
T cd01481          59 -----HSTKADVLGAVRRLRLR----GGSQLNTGSALDYVVKNLFTKSAGSRIEEGVPQFLVLITGGKSQ----------  119 (165)
T ss_pred             -----cCCHHHHHHHHHhcccC----CCCcccHHHHHHHHHHhhcCccccCCccCCCCeEEEEEeCCCCc----------
Confidence                 01233455566666532    112356899999887654 32 232       34 455544211          


Q ss_pred             CcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCC
Q 001711          578 DDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSF  645 (1021)
Q Consensus       578 ~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F  645 (1021)
                                        + -+++-|.++.+.||  .+|..+....|..+|..++..- -.+|...+|
T Consensus       120 ------------------d-~~~~~a~~lr~~gv--~i~~vG~~~~~~~eL~~ias~p-~~vf~v~~f  165 (165)
T cd01481         120 ------------------D-DVERPAVALKRAGI--VPFAIGARNADLAELQQIAFDP-SFVFQVSDF  165 (165)
T ss_pred             ------------------c-hHHHHHHHHHHCCc--EEEEEeCCcCCHHHHHHHhCCC-ccEEEecCC
Confidence                              1 13566778888875  5677776668888988888665 355555443


No 51 
>cd01473 vWA_CTRP CTRP for  CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an important phenomenon in parasite invasion and in malaria associated pathology.CTRP encodes a protein containing a putative signal sequence followed by a long extracellular region of 1990 amino acids, a transmembrane domain, and a short cytoplasmic segment. The extracellular region of CTRP contains two separated adhesive domains. The first domain contains six 210-amino acid-long homologous VWA domain repeats. The second domain contains seven repeats of 87-60  amino acids in length, which share similarities with the thrombospondin type 1 domain found in a variety of adhesive molecules. Finally, CTRP also contains consensus motifs found in the superfamily of haematopoietin receptors. The VWA domains in these proteins likely mediate protein-protein interactions.
Probab=97.51  E-value=0.0037  Score=66.04  Aligned_cols=150  Identities=13%  Similarity=0.127  Sum_probs=91.9

Q ss_pred             EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEEEE-ecCCCCCCcceeeccccccccCCCCCc
Q 001711          430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIHFY-NMKSSLTQPQMMVISDLDDIFVPLPDD  507 (1021)
Q Consensus       430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~fy-nl~~~~~~p~mlVvsDldd~f~Pl~~~  507 (1021)
                      .+|+||.|.+.-+..+-..+++.++..++.+.- ..++|||+|+|++..+++ .+...                      
T Consensus         3 i~fllD~S~Si~~~~f~~~~~~f~~~lv~~l~i~~~~~rvgvv~fs~~~~~~~~~~~~----------------------   60 (192)
T cd01473           3 LTLILDESASIGYSNWRKDVIPFTEKIINNLNISKDKVHVGILLFAEKNRDVVPFSDE----------------------   60 (192)
T ss_pred             EEEEEeCCCcccHHHHHHHHHHHHHHHHHhCccCCCccEEEEEEecCCceeEEecCcc----------------------
Confidence            589999999875544433567778888887653 245899999999866532 22110                      


Q ss_pred             cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcCC------E-EEEEecCCCCCCcccccccCCcC
Q 001711          508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLGG------K-LLIFQNSLPSLGVGCLKLRGDDL  580 (1021)
Q Consensus       508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~GG------k-IivF~sg~Pt~GpG~L~~r~~~~  580 (1021)
                          ....++.+.+.++.|.....  ....+.+|.||+.|.+.+...+|      | +|+++.|-.+-+           
T Consensus        61 ----~~~~~~~l~~~i~~l~~~~~--~~g~T~~~~AL~~a~~~~~~~~~~r~~~~kv~IllTDG~s~~~-----------  123 (192)
T cd01473          61 ----ERYDKNELLKKINDLKNSYR--SGGETYIVEALKYGLKNYTKHGNRRKDAPKVTMLFTDGNDTSA-----------  123 (192)
T ss_pred             ----cccCHHHHHHHHHHHHhccC--CCCcCcHHHHHHHHHHHhccCCCCcccCCeEEEEEecCCCCCc-----------
Confidence                01123444555566543221  13467799999999888754322      3 555555432110           


Q ss_pred             cccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccc
Q 001711          581 RVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKY  634 (1021)
Q Consensus       581 r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~  634 (1021)
                            .+        .--.+.++++.+.||.|-.+..+.  .+..+|..|+..
T Consensus       124 ------~~--------~~~~~~a~~lk~~gV~i~~vGiG~--~~~~el~~ia~~  161 (192)
T cd01473         124 ------SK--------KELQDISLLYKEENVKLLVVGVGA--ASENKLKLLAGC  161 (192)
T ss_pred             ------ch--------hhHHHHHHHHHHCCCEEEEEEecc--ccHHHHHHhcCC
Confidence                  00        112466788888998877777664  467788888764


No 52 
>cd01476 VWA_integrin_invertebrates VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in  cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest  any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.
Probab=97.41  E-value=0.0057  Score=62.07  Aligned_cols=102  Identities=18%  Similarity=0.265  Sum_probs=66.5

Q ss_pred             EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcC--eEEE-EecCCCCCCcceeeccccccccCCCC
Q 001711          430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDS--TIHF-YNMKSSLTQPQMMVISDLDDIFVPLP  505 (1021)
Q Consensus       430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds--~V~f-ynl~~~~~~p~mlVvsDldd~f~Pl~  505 (1021)
                      ++|+||+|.+.-.  -++..++.+++.++.|.. ..+.+||+|+|++  ..++ +.+..                     
T Consensus         3 v~~llD~S~Sm~~--~~~~~~~~~~~~~~~l~~~~~~~~v~lv~f~~~~~~~~~~~l~~---------------------   59 (163)
T cd01476           3 LLFVLDSSGSVRG--KFEKYKKYIERIVEGLEIGPTATRVALITYSGRGRQRVRFNLPK---------------------   59 (163)
T ss_pred             EEEEEeCCcchhh--hHHHHHHHHHHHHHhcCCCCCCcEEEEEEEcCCCceEEEecCCC---------------------
Confidence            6899999998743  366778888888888753 2358999999987  3332 11110                     


Q ss_pred             CccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-C------CEEEEEecCCC
Q 001711          506 DDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-G------GKLLIFQNSLP  566 (1021)
Q Consensus       506 ~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-G------GkIivF~sg~P  566 (1021)
                             ...++.+...|+.|..     ....+.+|.||+.|.+++... +      ..|++++.|.+
T Consensus        60 -------~~~~~~l~~~i~~l~~-----~gg~T~l~~aL~~a~~~l~~~~~~r~~~~~~villTDG~~  115 (163)
T cd01476          60 -------HNDGEELLEKVDNLRF-----IGGTTATGAAIEVALQQLDPSEGRREGIPKVVVVLTDGRS  115 (163)
T ss_pred             -------CCCHHHHHHHHHhCcc-----CCCCccHHHHHHHHHHHhccccCCCCCCCeEEEEECCCCC
Confidence                   1123455556666652     134578999999999999521 1      34666666543


No 53 
>cd01464 vWA_subfamily VWA subfamily: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses  In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=97.33  E-value=0.0012  Score=68.30  Aligned_cols=138  Identities=18%  Similarity=0.243  Sum_probs=84.3

Q ss_pred             EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCC----CCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCC
Q 001711          430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGF----PRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLP  505 (1021)
Q Consensus       430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~----~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~  505 (1021)
                      ++||||+|.++-.. -++.++++++..++.|..+    ++.+|+||+|++..+..-   .        +.++++.     
T Consensus         6 v~~llD~SgSM~~~-~~~~~k~a~~~~~~~l~~~~~~~~~~~v~ii~F~~~a~~~~---~--------l~~~~~~-----   68 (176)
T cd01464           6 IYLLLDTSGSMAGE-PIEALNQGLQMLQSELRQDPYALESVEISVITFDSAARVIV---P--------LTPLESF-----   68 (176)
T ss_pred             EEEEEECCCCCCCh-HHHHHHHHHHHHHHHHhcChhhccccEEEEEEecCCceEec---C--------CccHHhc-----
Confidence            58999999987432 3567778888888777543    467999999998765421   0        0010000     


Q ss_pred             CccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-----C-------CEEEEEecCCCCCCcccc
Q 001711          506 DDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-----G-------GKLLIFQNSLPSLGVGCL  573 (1021)
Q Consensus       506 ~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-----G-------GkIivF~sg~Pt~GpG~L  573 (1021)
                                      .++.|      ....+++++.||+.|.+.|+..     +       ..|++++.|.++-+... 
T Consensus        69 ----------------~~~~l------~~~GgT~l~~aL~~a~~~l~~~~~~~~~~~~~~~~~~iillTDG~~~~~~~~-  125 (176)
T cd01464          69 ----------------QPPRL------TASGGTSMGAALELALDCIDRRVQRYRADQKGDWRPWVFLLTDGEPTDDLTA-  125 (176)
T ss_pred             ----------------CCCcc------cCCCCCcHHHHHHHHHHHHHHHHHHhcccCcCCcCcEEEEEcCCCCCchHHH-
Confidence                            00111      1235689999999999998542     0       15888888776422100 


Q ss_pred             cccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhcc
Q 001711          574 KLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAK  633 (1021)
Q Consensus       574 ~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~  633 (1021)
                                           .    .+...++.+.++.|..|.++. .+|...|..|+.
T Consensus       126 ---------------------~----~~~~~~~~~~~~~i~~igiG~-~~~~~~L~~ia~  159 (176)
T cd01464         126 ---------------------A----IERIKEARDSKGRIVACAVGP-KADLDTLKQITE  159 (176)
T ss_pred             ---------------------H----HHHHHhhcccCCcEEEEEecc-ccCHHHHHHHHC
Confidence                                 0    122233344567777777766 578777777774


No 54 
>smart00262 GEL Gelsolin homology domain. Gelsolin/severin/villin homology domain. Calcium-binding and actin-binding. Both intra- and extracellular domains.
Probab=97.23  E-value=0.0018  Score=59.55  Aligned_cols=71  Identities=25%  Similarity=0.453  Sum_probs=49.8

Q ss_pred             cccchhhccCCcEEEEECCceeEEEecCCCCHHHHHhhcCCchhhhhccccccccchHHHHHHHHHHHHHHH-hCCCCCc
Q 001711          896 LPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPDIAMNLLGSEFAAELSKVMLREQDNEMSRKLLGILKKLRE-QDPSYYQ  974 (1021)
Q Consensus       896 l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~ll~~lFgv~~~~~l~~~~lp~~~n~~s~~l~~ii~~lr~-~r~~~~~  974 (1021)
                      ++++.+.|.++.+||||+|..||+|+|+.++......                         ...+.+.+.+ .+....+
T Consensus        16 ~~~~~~~L~s~d~fild~~~~iyvW~G~~as~~ek~~-------------------------A~~~a~~~~~~~~~~~~~   70 (90)
T smart00262       16 VPFSQGSLNSGDCYILDTGSEIYVWVGKKSSQDEKKK-------------------------AAELAVELDDTLGPGPVQ   70 (90)
T ss_pred             cCCCHHHCCCCCEEEEECCCEEEEEECCCCCHHHHHH-------------------------HHHHHHHHHHhcCCCCce
Confidence            5678899999999999999999999999997765421                         1222333332 2345567


Q ss_pred             eEEEeccCCCcchHHHHHhhc
Q 001711          975 LCQLVRQGEQPREGFLLLANL  995 (1021)
Q Consensus       975 l~~vvrqg~~~~~e~~f~~~L  995 (1021)
                      + .+++||...   ..|..+|
T Consensus        71 i-~~v~eg~E~---~~F~~~f   87 (90)
T smart00262       71 V-RVVDEGKEP---PEFWSLF   87 (90)
T ss_pred             E-EEEeCCCCC---HHHHHHh
Confidence            7 889998754   3565554


No 55 
>KOG1924 consensus RhoA GTPase effector DIA/Diaphanous [Signal transduction mechanisms; Cytoskeleton]
Probab=97.06  E-value=0.0036  Score=75.52  Aligned_cols=12  Identities=17%  Similarity=0.158  Sum_probs=6.7

Q ss_pred             HHHHhhhccCCC
Q 001711          827 YCLAICKSTPIR  838 (1021)
Q Consensus       827 yil~LlKS~~Lr  838 (1021)
                      ++-+|+-..+||
T Consensus      1046 lLeaLqsgaafr 1057 (1102)
T KOG1924|consen 1046 LLEALQSGAAFR 1057 (1102)
T ss_pred             HHHHHHhhcccc
Confidence            455555555555


No 56 
>cd01454 vWA_norD_type norD type: Denitrifying bacteria contain both membrane bound and periplasmic nitrate reductases. Denitrification plays a major role  in completing the nitrogen cycle by converting nitrate or nitrite to nitrogen gas. The pathway for microbial denitrification has been established as NO3-  ------ NO2- ------ NO ------- N2O --------- N2. This reaction generally occurs under oxygen limiting conditions. Genetic and biochemical studies have shown that the first srep of the biochemical pathway is catalyzed by periplasmic nitrate reductases. This family is widely present in proteobacteria and firmicutes. This version of the domain is also present in some archaeal members. The function of the vWA domain in this sub-group is not known. Members of this subgroup have a conserved MIDAS motif.
Probab=96.99  E-value=0.021  Score=58.97  Aligned_cols=147  Identities=16%  Similarity=0.126  Sum_probs=87.0

Q ss_pred             eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCcc
Q 001711          429 LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDL  508 (1021)
Q Consensus       429 ~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~l  508 (1021)
                      .++|+||+|.++....-++.+++++...++.|.. .+.+++|++|++..     . .......+...+.++       .+
T Consensus         2 ~v~~llD~SgSM~~~~kl~~ak~a~~~l~~~l~~-~~d~~~l~~F~~~~-----~-~~~~~~~~~~~~~~~-------~~   67 (174)
T cd01454           2 AVTLLLDLSGSMRSDRRIDVAKKAAVLLAEALEA-CGVPHAILGFTTDA-----G-GRERVRWIKIKDFDE-------SL   67 (174)
T ss_pred             EEEEEEECCCCCCCCcHHHHHHHHHHHHHHHHHH-cCCcEEEEEecCCC-----C-CccceEEEEecCccc-------cc
Confidence            4789999999985433677788877777766654 23689999998752     0 000001111111111       00


Q ss_pred             ceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh---cCCEEEEEecCCCCCCcccccccCCcCcccCC
Q 001711          509 LVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR---LGGKLLIFQNSLPSLGVGCLKLRGDDLRVYGT  585 (1021)
Q Consensus       509 Lv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~---~GGkIivF~sg~Pt~GpG~L~~r~~~~r~~gt  585 (1021)
                             ...+...|+.+..      ...+.+|.||..|...+..   ....|++++.|.|+.+...-          + 
T Consensus        68 -------~~~~~~~l~~~~~------~g~T~~~~al~~a~~~l~~~~~~~~~iiliTDG~~~~~~~~~----------~-  123 (174)
T cd01454          68 -------HERARKRLAALSP------GGNTRDGAAIRHAAERLLARPEKRKILLVISDGEPNDLDYYE----------G-  123 (174)
T ss_pred             -------chhHHHHHHccCC------CCCCcHHHHHHHHHHHHhcCCCcCcEEEEEeCCCcCcccccC----------c-
Confidence                   1122334444431      2357899999999999874   34568888899887653100          0 


Q ss_pred             CccccCCCCCcHHHHHH---HHHHhhCCcEEEEEEecCCC
Q 001711          586 DKEHSLRIPEDPFYKQM---AADLTKFQIAVNVYAFSDKY  622 (1021)
Q Consensus       586 ~~e~~l~~pa~~fY~~L---a~~~~~~gIsVDlF~~s~~~  622 (1021)
                          .+     ...++.   +.++.+.||.|..+.++.+.
T Consensus       124 ----~~-----~~~~~~~~~~~~~~~~gi~v~~igig~~~  154 (174)
T cd01454         124 ----NV-----FATEDALRAVIEARKLGIEVFGITIDRDA  154 (174)
T ss_pred             ----ch-----hHHHHHHHHHHHHHhCCcEEEEEEecCcc
Confidence                00     012233   77888899998877776553


No 57 
>KOG1984 consensus Vesicle coat complex COPII, subunit SFB3 [Intracellular trafficking, secretion, and vesicular transport]
Probab=96.90  E-value=0.1  Score=64.51  Aligned_cols=33  Identities=12%  Similarity=0.138  Sum_probs=14.3

Q ss_pred             ccceEEEEEeCCC---eEEEeeecCcccCCCCceee
Q 001711          667 AWEAVMRIRCGKG---VRFTNYHGNFMLRSTDLLAL  699 (1021)
Q Consensus       667 g~~a~mrVR~S~G---l~V~~~~Gnf~~rs~~~~~l  699 (1021)
                      .|.|.+---|-+|   ++|.++-+++..+..+++++
T Consensus       717 ~fQ~AlLYTti~G~RR~Rv~Nlsl~~ts~l~~lyr~  752 (1007)
T KOG1984|consen  717 HFQTALLYTTIDGQRRLRVLNLSLAVTSQLSELYRS  752 (1007)
T ss_pred             eEEEEEEEeccCCceeEEEEecchhhhhhHHHHHHh
Confidence            3444443334444   44555555544433344333


No 58 
>cd01458 vWA_ku Ku70/Ku80 N-terminal domain. The Ku78 heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks (DSB) in a preferred orientation. DSB's are repaired by either homologues recombination or non-homologues end joining and facilitate repair by the non-homologous end-joining pathway (NHEJ). The Ku heterodimer is required for accurate process that tends to preserve the sequence at the junction. Ku78 is found in all three kingdoms of life. However, only the eukaryotic proteins have a vWA domain fused to them at their N-termini. The vWA domain is not involved in DNA binding but may very likey mediate Ku78's interactions with other proteins. Members of this subgroup lack the conserved MIDAS motif.
Probab=96.87  E-value=0.023  Score=61.03  Aligned_cols=154  Identities=21%  Similarity=0.282  Sum_probs=90.6

Q ss_pred             eEEEEEecchhHHhh------cHHHHHHHHHHHHHhcC-CCCCCceEEEEEEcCeEEEEecCCCCCCcceeecccccccc
Q 001711          429 LYFFLIDVSISAIRS------GMLEVVAQTIKSCLDEL-PGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIF  501 (1021)
Q Consensus       429 ~yvFvIDvS~~av~s------G~l~~~~~sI~~~L~~L-p~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f  501 (1021)
                      ..+|+||+|.++.+.      ..++.+++.|...+... -..+..+||+|.|++.-+--    ...-..+.|+.++..+ 
T Consensus         3 ~ivf~iDvS~SM~~~~~~~~~s~l~~a~~~i~~~~~~ki~~~~~D~vGlilf~t~~~~~----~~~~~~i~v~~~l~~~-   77 (218)
T cd01458           3 SVVFLVDVSPSMFESKDGEYESPFEEALKCIRQLMKSKIISSPKDLVGVVFYGTEESKN----PVGYENIYVLLDLDTP-   77 (218)
T ss_pred             EEEEEEeCCHHHcCCCCCCCCChHHHHHHHHHHHHHhceeCCCCCeEEEEEEcccCCCC----cCCCCceEEeecCCCC-
Confidence            479999999988522      35778888888888852 11233689999997653210    0011123333333211 


Q ss_pred             CCCCCccceehhhhHHHHHHHHhhCCCc-c----cCCCCcccchHHHHHHHHHHHHh-----cCCEEEEEecCCCCCCcc
Q 001711          502 VPLPDDLLVNLSESRSVVDTLLDSLPSM-F----QDNMNVESAFGPALKAAFMVMSR-----LGGKLLIFQNSLPSLGVG  571 (1021)
Q Consensus       502 ~Pl~~~lLv~l~es~~~I~~lLd~Lp~~-f----~~~~~~~~alG~AL~aA~~lL~~-----~GGkIivF~sg~Pt~GpG  571 (1021)
                                   ..+.|+.+++.+..- .    ......+..++.||..|..+++.     ..-+|++|+++--..| |
T Consensus        78 -------------~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~l~~aL~~a~~~~~~~~~~~~~k~IvL~TDg~~p~~-~  143 (218)
T cd01458          78 -------------GAERVEDLKELIEPGGLSFAGQVGDSGQVSLSDALWVCLDLFSKGKKKKSHKRIFLFTNNDDPHG-G  143 (218)
T ss_pred             -------------CHHHHHHHHHHhhcchhhhcccCCCCCCccHHHHHHHHHHHHHhccccccccEEEEECCCCCCCC-C
Confidence                         123334444433211 0    01123577899999999999985     2346888888643222 0


Q ss_pred             cccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCC
Q 001711          572 CLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDK  621 (1021)
Q Consensus       572 ~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~  621 (1021)
                            +        .      -...-+.+++.++.+.||.|.+|.+...
T Consensus       144 ------~--------~------~~~~~~~~~a~~l~~~gI~i~~i~i~~~  173 (218)
T cd01458         144 ------D--------S------IKDSQAAVKAEDLKDKGIELELFPLSSP  173 (218)
T ss_pred             ------C--------H------HHHHHHHHHHHHHHhCCcEEEEEecCCC
Confidence                  0        0      0123356788899999999999887543


No 59 
>PF04056 Ssl1:  Ssl1-like;  InterPro: IPR007198 Ssl1-like proteins are 40 kDa subunits of the transcription factor II H complex. This domain is often found associated with the C2H2 type Zn-finger (IPR007087 from INTERPRO).; GO: 0008270 zinc ion binding, 0006281 DNA repair, 0006355 regulation of transcription, DNA-dependent
Probab=96.80  E-value=0.0066  Score=64.10  Aligned_cols=163  Identities=20%  Similarity=0.263  Sum_probs=103.1

Q ss_pred             EEecchhHHhhc----HHHHHHHHHHHHHhcC-CCCCCceEEEEEEcCe-EEEEecCCCCCCcceeeccccccccCCCCC
Q 001711          433 LIDVSISAIRSG----MLEVVAQTIKSCLDEL-PGFPRTQIGFITFDST-IHFYNMKSSLTQPQMMVISDLDDIFVPLPD  506 (1021)
Q Consensus       433 vIDvS~~av~sG----~l~~~~~sI~~~L~~L-p~~~rt~VgiITFds~-V~fynl~~~~~~p~mlVvsDldd~f~Pl~~  506 (1021)
                      |||.|..+.+.-    .++++++.+..-+++. ..+|-.++|||+.-+. .+.              ++++         
T Consensus         1 viD~S~~m~~~D~~PtRl~~~~~~l~~Fv~eff~qNPiSqlgii~~~~~~a~~--------------ls~l---------   57 (193)
T PF04056_consen    1 VIDMSEAMREKDLKPTRLQCVLKALEEFVREFFDQNPISQLGIIVMRDGRAER--------------LSEL---------   57 (193)
T ss_pred             CeechHhHHhCcCCccHHHHHHHHHHHHHHHHHhcCChhheeeeeeecceeEE--------------eeec---------
Confidence            589998875432    4666777766666653 3467789999987432 221              1221         


Q ss_pred             ccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---CC-EEEEEecCCCCCCcccccccCCcCcc
Q 001711          507 DLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---GG-KLLIFQNSLPSLGVGCLKLRGDDLRV  582 (1021)
Q Consensus       507 ~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---GG-kIivF~sg~Pt~GpG~L~~r~~~~r~  582 (1021)
                            +-+-....+.|+++.+   ..-..+..+-.||+.|...|++.   |. .|+++.+++-|..||.          
T Consensus        58 ------sgn~~~h~~~L~~~~~---~~~~G~~SLqN~Le~A~~~L~~~p~~~srEIlvi~gSl~t~Dp~d----------  118 (193)
T PF04056_consen   58 ------SGNPQEHIEALKKLRK---LEPSGEPSLQNGLEMARSSLKHMPSHGSREILVIFGSLTTCDPGD----------  118 (193)
T ss_pred             ------CCCHHHHHHHHHHhcc---CCCCCChhHHHHHHHHHHHHhhCccccceEEEEEEeecccCCchh----------
Confidence                  1111122223333322   22356678999999999999864   33 5666666665555442          


Q ss_pred             cCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHHhc
Q 001711          583 YGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSRDL  662 (1021)
Q Consensus       583 ~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r~l  662 (1021)
                                      +.+..+.+.+.+|-||+..++.   .+..+..||+.|||.....      .|.+.|..-|....
T Consensus       119 ----------------i~~ti~~l~~~~IrvsvI~laa---Ev~I~k~i~~~T~G~y~V~------lde~H~~~lL~~~~  173 (193)
T PF04056_consen  119 ----------------IHETIESLKKENIRVSVISLAA---EVYICKKICKETGGTYGVI------LDEDHFKELLMEHV  173 (193)
T ss_pred             ----------------HHHHHHHHHHcCCEEEEEEEhH---HHHHHHHHHHhhCCEEEEe------cCHHHHHHHHHhhC
Confidence                            2366788999999999999986   4777899999999954433      34455655555544


No 60 
>KOG1924 consensus RhoA GTPase effector DIA/Diaphanous [Signal transduction mechanisms; Cytoskeleton]
Probab=96.70  E-value=0.011  Score=71.72  Aligned_cols=12  Identities=17%  Similarity=0.382  Sum_probs=5.9

Q ss_pred             HHHHhhcCCceE
Q 001711          328 QSLVSRWHLPLG  339 (1021)
Q Consensus       328 ~~l~~~~~lPlg  339 (1021)
                      .+++.+..+=|+
T Consensus       656 ~dlfakL~~~Fa  667 (1102)
T KOG1924|consen  656 DDLFAKLALKFA  667 (1102)
T ss_pred             hHHHHHHHHHhh
Confidence            455555444443


No 61 
>KOG0443 consensus Actin regulatory proteins (gelsolin/villin family) [Cytoskeleton]
Probab=96.61  E-value=0.0047  Score=75.39  Aligned_cols=91  Identities=16%  Similarity=0.227  Sum_probs=61.0

Q ss_pred             hhhcccEEEeecCCCCCCccCCcccccccccccchhhccCCcEEEEECCceeEEEecCCCCHHHHHhhcCCchhhhhccc
Q 001711          866 KLLYPCLIRVDEHLLKPSAQLDEYKNIMKRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPDIAMNLLGSEFAAELSKV  945 (1021)
Q Consensus       866 ~~lYPrL~~lh~~~~~~~~~~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~ll~~lFgv~~~~~l~~~  945 (1021)
                      .-.-||||..+.-.        +.+.+-+....+.+.|..+.|||||++..+|||||+.++++.....+..         
T Consensus       616 ~~~~PrLF~Cs~~~--------g~f~~~EI~~F~QdDL~tdDi~lLDt~~evfvWvG~~a~~~eK~~Al~~---------  678 (827)
T KOG0443|consen  616 PERDPRLFSCSNKT--------GSFVVEEIYNFTQDDLMTDDIMLLDTWSEVFVWVGQEANEKEKEEALTI---------  678 (827)
T ss_pred             CCCCCcEEEEEecC--------CcEEEEEecCcchhhccccceEEEecCceEEEEecCCCChhHHHHHHHH---------
Confidence            45678999988531        1122223346788999999999999999999999999988877554421         


Q ss_pred             cccccchHHHHHHHHHHHHHHHhCCCCCceEEEeccCCCc
Q 001711          946 MLREQDNEMSRKLLGILKKLREQDPSYYQLCQLVRQGEQP  985 (1021)
Q Consensus       946 ~lp~~~n~~s~~l~~ii~~lr~~r~~~~~l~~vvrqg~~~  985 (1021)
                               .++-.+. + +-+.|.+.-|+ +||+||...
T Consensus       679 ---------~~~yl~~-~-~p~gr~~~TPI-~vV~qG~EP  706 (827)
T KOG0443|consen  679 ---------GQKYLET-D-LPEGRDPRTPI-YVVKQGHEP  706 (827)
T ss_pred             ---------HHHHHhc-c-CcccCCCCCce-EEecCCCCC
Confidence                     1111111 1 23345566788 999998544


No 62 
>COG4245 TerY Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [General function prediction only]
Probab=96.38  E-value=0.066  Score=55.64  Aligned_cols=158  Identities=19%  Similarity=0.313  Sum_probs=92.1

Q ss_pred             CeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCC----CCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCC
Q 001711          428 PLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGF----PRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVP  503 (1021)
Q Consensus       428 p~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~----~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~P  503 (1021)
                      |+ +|++|+|.+++-. -++++-.+|+..++.|..+    .+.+++|||||+.++.|.-           ..|++. |-|
T Consensus         5 P~-~lllDtSgSM~Ge-~IealN~Glq~m~~~Lkqdp~Ale~v~lsIVTF~~~a~~~~p-----------f~~~~n-F~~   70 (207)
T COG4245           5 PC-YLLLDTSGSMIGE-PIEALNAGLQMMIDTLKQDPYALERVELSIVTFGGPARVIQP-----------FTDAAN-FNP   70 (207)
T ss_pred             CE-EEEEecCcccccc-cHHHHHHHHHHHHHHHHhChhhhheeEEEEEEecCcceEEec-----------hhhHhh-cCC
Confidence            44 4699999988643 3677778888888877654    4689999999987666531           122221 111


Q ss_pred             CCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc------CC------EEEEEecCCCCCCcc
Q 001711          504 LPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL------GG------KLLIFQNSLPSLGVG  571 (1021)
Q Consensus       504 l~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~------GG------kIivF~sg~Pt~GpG  571 (1021)
                                             |..+   ...++.+|+||+.|.++++..      .|      -|++.+.|-||    
T Consensus        71 -----------------------p~L~---a~GgT~lGaAl~~a~d~Ie~~~~~~~a~~kgdyrP~vfLiTDG~Pt----  120 (207)
T COG4245          71 -----------------------PILT---AQGGTPLGAALTLALDMIEERKRKYDANGKGDYRPWVFLITDGEPT----  120 (207)
T ss_pred             -----------------------Ccee---cCCCCchHHHHHHHHHHHHHHHhhcccCCccccceEEEEecCCCcc----
Confidence                                   1111   236788999999999999642      11      34555555442    


Q ss_pred             cccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhh--CCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCch
Q 001711          572 CLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTK--FQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTT  649 (1021)
Q Consensus       572 ~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~--~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~  649 (1021)
                                              +++=+.++.....  ...+|=.|.+..+..|...|..+.+    ++..+..    .
T Consensus       121 ------------------------D~w~~~~~~~~~~~~~~k~v~a~~~G~~~ad~~~L~qit~----~V~~~~t----~  168 (207)
T COG4245         121 ------------------------DDWQAGAALVFQGERRAKSVAAFSVGVQGADNKTLNQITE----KVRQFLT----L  168 (207)
T ss_pred             ------------------------hHHHhHHHHhhhcccccceEEEEEecccccccHHHHHHHH----hhccccc----c
Confidence                                    2222222222211  2234555666666678777777653    3333332    3


Q ss_pred             hHHHHHHHHHHh
Q 001711          650 HGERLRHELSRD  661 (1021)
Q Consensus       650 d~~kl~~dL~r~  661 (1021)
                      |..+|...+.+.
T Consensus       169 d~~~f~~fFkW~  180 (207)
T COG4245         169 DGLQFREFFKWL  180 (207)
T ss_pred             chHHHHHHHHHH
Confidence            556676666553


No 63 
>KOG2884 consensus 26S proteasome regulatory complex, subunit RPN10/PSMD4 [Posttranslational modification, protein turnover, chaperones]
Probab=96.30  E-value=0.1  Score=55.21  Aligned_cols=154  Identities=16%  Similarity=0.277  Sum_probs=96.5

Q ss_pred             eEEEEEecchhHHhhc-----HHHHHHHHHHHHHh-cCCCCCCceEEEEEEcC-eEEEEecCCCCCCcceeecccccccc
Q 001711          429 LYFFLIDVSISAIRSG-----MLEVVAQTIKSCLD-ELPGFPRTQIGFITFDS-TIHFYNMKSSLTQPQMMVISDLDDIF  501 (1021)
Q Consensus       429 ~yvFvIDvS~~av~sG-----~l~~~~~sI~~~L~-~Lp~~~rt~VgiITFds-~V~fynl~~~~~~p~mlVvsDldd~f  501 (1021)
                      +.+.|||-|.-+.+ |     .+++=+++|..... .+..++...|||||... .+.+..                    
T Consensus         5 atmi~iDNse~mrN-gDy~PtRf~aQ~daVn~v~~~K~~snpEntvGiitla~a~~~vLs--------------------   63 (259)
T KOG2884|consen    5 ATMICIDNSEYMRN-GDYLPTRFQAQKDAVNLVCQAKLRSNPENTVGIITLANASVQVLS--------------------   63 (259)
T ss_pred             eEEEEEeChHHhhc-CCCChHHHHHHHHHHHHHHHhhhcCCcccceeeEeccCCCceeee--------------------
Confidence            56889999887643 4     35555555554443 34445556799999864 333321                    


Q ss_pred             CCCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcC-----CEEEEEecCCCCCCccccccc
Q 001711          502 VPLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLG-----GKLLIFQNSLPSLGVGCLKLR  576 (1021)
Q Consensus       502 ~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~G-----GkIivF~sg~Pt~GpG~L~~r  576 (1021)
                               .+...+-.|...|..|.      ...+.-++.+|+.|..+||++-     -||++|.+++-.         
T Consensus        64 ---------T~T~d~gkils~lh~i~------~~g~~~~~~~i~iA~lalkhRqnk~~~~riVvFvGSpi~---------  119 (259)
T KOG2884|consen   64 ---------TLTSDRGKILSKLHGIQ------PHGKANFMTGIQIAQLALKHRQNKNQKQRIVVFVGSPIE---------  119 (259)
T ss_pred             ---------eccccchHHHHHhcCCC------cCCcccHHHHHHHHHHHHHhhcCCCcceEEEEEecCcch---------
Confidence                     11222334444555554      2345568999999999999853     588999987621         


Q ss_pred             CCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhcccccc-----EEEEeCC
Q 001711          577 GDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGG-----QVYYYPS  644 (1021)
Q Consensus       577 ~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG-----~v~~y~~  644 (1021)
                               +.|        +-.-++|.++.+.+|.|||.-|+....+-.-+......++|     ++...+.
T Consensus       120 ---------e~e--------keLv~~akrlkk~~Vaidii~FGE~~~~~e~l~~fida~N~~~~gshlv~Vpp  175 (259)
T KOG2884|consen  120 ---------ESE--------KELVKLAKRLKKNKVAIDIINFGEAENNTEKLFEFIDALNGKGDGSHLVSVPP  175 (259)
T ss_pred             ---------hhH--------HHHHHHHHHHHhcCeeEEEEEeccccccHHHHHHHHHHhcCCCCCceEEEeCC
Confidence                     112        22357999999999999999998766664444444444444     3555554


No 64 
>cd01462 VWA_YIEM_type VWA YIEM type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses  In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=96.16  E-value=0.13  Score=51.61  Aligned_cols=130  Identities=15%  Similarity=0.142  Sum_probs=75.0

Q ss_pred             EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCccc
Q 001711          430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDLL  509 (1021)
Q Consensus       430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~lL  509 (1021)
                      ++|+||+|.++-.+ -++.++..+...++.+.. .+.+|+||+|++..+.+.+..                         
T Consensus         3 v~illD~SgSM~~~-k~~~a~~~~~~l~~~~~~-~~~~v~li~F~~~~~~~~~~~-------------------------   55 (152)
T cd01462           3 VILLVDQSGSMYGA-PEEVAKAVALALLRIALA-ENRDTYLILFDSEFQTKIVDK-------------------------   55 (152)
T ss_pred             EEEEEECCCCCCCC-HHHHHHHHHHHHHHHHHH-cCCcEEEEEeCCCceEEecCC-------------------------
Confidence            68999999988532 244455555555555432 125799999998733221110                         


Q ss_pred             eehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---CCEEEEEecCCCCCCcccccccCCcCcccCCC
Q 001711          510 VNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---GGKLLIFQNSLPSLGVGCLKLRGDDLRVYGTD  586 (1021)
Q Consensus       510 v~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---GGkIivF~sg~Pt~GpG~L~~r~~~~r~~gt~  586 (1021)
                         ...   +..+++.|..+.   ...++.++.||..+.+.++..   .+.|++++.|..+.                  
T Consensus        56 ---~~~---~~~~~~~l~~~~---~~ggT~l~~al~~a~~~l~~~~~~~~~ivliTDG~~~~------------------  108 (152)
T cd01462          56 ---TDD---LEEPVEFLSGVQ---LGGGTDINKALRYALELIERRDPRKADIVLITDGYEGG------------------  108 (152)
T ss_pred             ---ccc---HHHHHHHHhcCC---CCCCcCHHHHHHHHHHHHHhcCCCCceEEEECCCCCCC------------------
Confidence               011   122233332221   245678999999999998763   46777777764110                  


Q ss_pred             ccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCC
Q 001711          587 KEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDK  621 (1021)
Q Consensus       587 ~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~  621 (1021)
                             ...+.. +.+....+.++.|..+.++.+
T Consensus       109 -------~~~~~~-~~~~~~~~~~~~v~~~~~g~~  135 (152)
T cd01462         109 -------VSDELL-REVELKRSRVARFVALALGDH  135 (152)
T ss_pred             -------CCHHHH-HHHHHHHhcCcEEEEEEecCC
Confidence                   011222 334444566789999988764


No 65 
>TIGR00578 ku70 ATP-dependent DNA helicase ii, 70 kDa subunit (ku70). Proteins in this family are involved in non-homologous end joining, a process used for the repair of double stranded DNA breaks. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). Cutoff does not detect the putative ku70 homologs in yeast.
Probab=95.48  E-value=0.23  Score=61.39  Aligned_cols=162  Identities=17%  Similarity=0.260  Sum_probs=90.2

Q ss_pred             eEEEEEecchhHHh-------hcHHHHHHHHHHHHHhc-CCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccc
Q 001711          429 LYFFLIDVSISAIR-------SGMLEVVAQTIKSCLDE-LPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDI  500 (1021)
Q Consensus       429 ~yvFvIDvS~~av~-------sG~l~~~~~sI~~~L~~-Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~  500 (1021)
                      ..|||||+|.++.+       ..-+..++++|...+.. +-.+++..|||+.|++.=+    ++.+.-....|+.||+.+
T Consensus        12 ailflIDvs~sM~~~~~~~~~~s~~~~al~~i~~l~q~kIis~~~D~vGivlfgT~~t----~n~~~~~~i~v~~~L~~p   87 (584)
T TIGR00578        12 SLIFLVDASKAMFEESQGEDELTPFDMSIQCIQSVYTSKIISSDKDLLAVVFYGTEKD----KNSVNFKNIYVLQELDNP   87 (584)
T ss_pred             EEEEEEECCHHHcCCCcCcCcCChHHHHHHHHHHHHHhcCCCCCCCeEEEEEEeccCC----CCccCCCceEEEeeCCCC
Confidence            68999999999864       12355666777777764 2234568999999976422    122223355666666542


Q ss_pred             cCCCCCccceehhhhHHHHHHHHhh-CCCcccC--CCCcccchHHHHHHHHHHHHh----cCC-EEEEEecCCCCCCccc
Q 001711          501 FVPLPDDLLVNLSESRSVVDTLLDS-LPSMFQD--NMNVESAFGPALKAAFMVMSR----LGG-KLLIFQNSLPSLGVGC  572 (1021)
Q Consensus       501 f~Pl~~~lLv~l~es~~~I~~lLd~-Lp~~f~~--~~~~~~alG~AL~aA~~lL~~----~GG-kIivF~sg~Pt~GpG~  572 (1021)
                      -           .+....|++|++. -...|..  .......+..||.+|..++..    .+. ||++||+.---     
T Consensus        88 ~-----------a~~i~~L~~l~~~~~~~~~~~~~~~~~~~~l~daL~~~~~~f~~~~~k~~~kRI~lfTd~D~P-----  151 (584)
T TIGR00578        88 G-----------AKRILELDQFKGDQGPKKFRDTYGHGSDYSLSEVLWVCANLFSDVQFRMSHKRIMLFTNEDNP-----  151 (584)
T ss_pred             C-----------HHHHHHHHHHhhccCccchhhccCCCCCCcHHHHHHHHHHHHHhcchhhcCcEEEEECCCCCC-----
Confidence            1           1122223333332 1111111  112234789999999999965    233 58998863211     


Q ss_pred             ccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecC-CCcChh
Q 001711          573 LKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSD-KYTDIA  626 (1021)
Q Consensus       573 L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~-~~~dia  626 (1021)
                                ++.++.      ...-=...|.++.+.||.+++|.++. +.+|+.
T Consensus       152 ----------~~~~~~------~~~~a~~~a~dl~~~gi~ielf~l~~~~~Fd~s  190 (584)
T TIGR00578       152 ----------HGNDSA------KASRARTKAGDLRDTGIFLDLMHLKKPGGFDIS  190 (584)
T ss_pred             ----------CCCchh------HHHHHHHHHHHHHhcCeEEEEEecCCCCCCChh
Confidence                      111100      00111346888999999999996542 224444


No 66 
>COG5148 RPN10 26S proteasome regulatory complex, subunit RPN10/PSMD4 [Posttranslational modification, protein turnover, chaperones]
Probab=95.06  E-value=0.69  Score=48.04  Aligned_cols=133  Identities=20%  Similarity=0.320  Sum_probs=89.4

Q ss_pred             CeEEEEEecchhHHhhc----HHHHHHHHHHHHHhc-CCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccC
Q 001711          428 PLYFFLIDVSISAIRSG----MLEVVAQTIKSCLDE-LPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFV  502 (1021)
Q Consensus       428 p~yvFvIDvS~~av~sG----~l~~~~~sI~~~L~~-Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~  502 (1021)
                      -+.|.+||-|..+.+.-    .+++-++++...+.. ..+++...||||+...           .+|+.           
T Consensus         4 EatvvliDNse~s~NgDy~ptRFeAQkd~ve~if~~K~ndnpEntiGli~~~~-----------a~p~v-----------   61 (243)
T COG5148           4 EATVVLIDNSEASQNGDYLPTRFEAQKDAVESIFSKKFNDNPENTIGLIPLVQ-----------AQPNV-----------   61 (243)
T ss_pred             ceEEEEEeChhhhhcCCCCcHHHHHHHHHHHHHHHHHhcCCccceeeeeeccc-----------CCcch-----------
Confidence            46789999998775422    366777777777763 4455666799998542           12321           


Q ss_pred             CCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---C--CEEEEEecCCCCCCcccccccC
Q 001711          503 PLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---G--GKLLIFQNSLPSLGVGCLKLRG  577 (1021)
Q Consensus       503 Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---G--GkIivF~sg~Pt~GpG~L~~r~  577 (1021)
                            |..+...+-.|...|..++-      +.+--++-+|+.|..+|++.   |  -+|++|.+++-.          
T Consensus        62 ------lsT~T~~~gkilt~lhd~~~------~g~a~~~~~lqiaql~lkhR~nk~q~qriVaFvgSpi~----------  119 (243)
T COG5148          62 ------LSTPTKQRGKILTFLHDIRL------HGGADIMRCLQIAQLILKHRDNKGQRQRIVAFVGSPIQ----------  119 (243)
T ss_pred             ------hccchhhhhHHHHHhccccc------cCcchHHHHHHHHHHHHhcccCCccceEEEEEecCccc----------
Confidence                  22234456667777777752      34445889999999999984   3  689999987521          


Q ss_pred             CcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecC
Q 001711          578 DDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSD  620 (1021)
Q Consensus       578 ~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~  620 (1021)
                              +.|        +-.-.+|..+.+++|.||+.-|+.
T Consensus       120 --------ese--------deLirlak~lkknnVAidii~fGE  146 (243)
T COG5148         120 --------ESE--------DELIRLAKQLKKNNVAIDIIFFGE  146 (243)
T ss_pred             --------ccH--------HHHHHHHHHHHhcCeeEEEEehhh
Confidence                    111        223468999999999999998763


No 67 
>cd01457 vWA_ORF176_type VWA ORF176 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses. In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most
Probab=94.58  E-value=0.42  Score=50.53  Aligned_cols=146  Identities=17%  Similarity=0.221  Sum_probs=80.3

Q ss_pred             eEEEEEecchhHHhh----c--HHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccC
Q 001711          429 LYFFLIDVSISAIRS----G--MLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFV  502 (1021)
Q Consensus       429 ~yvFvIDvS~~av~s----G--~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~  502 (1021)
                      -++|+||+|.++-..    +  -++.+++++...+..+......+|++++|++..+-+                     .
T Consensus         4 dvv~~ID~SgSM~~~~~~~~~~k~~~ak~~~~~l~~~~~~~D~d~i~l~~f~~~~~~~---------------------~   62 (199)
T cd01457           4 DYTLLIDKSGSMAEADEAKERSRWEEAQESTRALARKCEEYDSDGITVYLFSGDFRRY---------------------D   62 (199)
T ss_pred             CEEEEEECCCcCCCCCCCCCchHHHHHHHHHHHHHHHHHhcCCCCeEEEEecCCcccc---------------------C
Confidence            379999999998532    1  256666666666665443223568888886542110                     0


Q ss_pred             CCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHH-HHHhc--------CCEEEEEecCCCCCCcccc
Q 001711          503 PLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFM-VMSRL--------GGKLLIFQNSLPSLGVGCL  573 (1021)
Q Consensus       503 Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~-lL~~~--------GGkIivF~sg~Pt~GpG~L  573 (1021)
                      +        +.  ++.+.++++.+..      ...+.++.||+.++. +++..        +..||+++.|.++- ...+
T Consensus        63 ~--------~~--~~~v~~~~~~~~p------~G~T~l~~~l~~a~~~~~~~~~~~~~~p~~~~vIiiTDG~~~d-~~~~  125 (199)
T cd01457          63 N--------VN--SSKVDQLFAENSP------DGGTNLAAVLQDALNNYFQRKENGATCPEGETFLVITDGAPDD-KDAV  125 (199)
T ss_pred             C--------cC--HHHHHHHHhcCCC------CCcCcHHHHHHHHHHHHHHHHhhccCCCCceEEEEEcCCCCCc-HHHH
Confidence            1        11  4555666655432      245789999998874 33321        34566677766541 1100


Q ss_pred             cccCCcCcccCCCccccCCCCCcHHHHHHHHHHhh-CCcEEEEEEecCCCcChhhhhhhccc
Q 001711          574 KLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTK-FQIAVNVYAFSDKYTDIASLGTLAKY  634 (1021)
Q Consensus       574 ~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~-~gIsVDlF~~s~~~~diatl~~L~~~  634 (1021)
                      .                      +.-.+.+.++.+ .+|++.++.++.+.-+...|..|...
T Consensus       126 ~----------------------~~i~~a~~~l~~~~~i~i~~v~vG~~~~~~~~L~~ld~~  165 (199)
T cd01457         126 E----------------------RVIIKASDELDADNELAISFLQIGRDPAATAFLKALDDQ  165 (199)
T ss_pred             H----------------------HHHHHHHHhhccccCceEEEEEeCCcHHHHHHHHHHhHH
Confidence            0                      000111111111 47888888887766665556665543


No 68 
>cd01460 vWA_midasin VWA_Midasin: Midasin is a member of the AAA ATPase family. The proteins of this family are unified by their common archetectural organization that is based upon a conserved ATPase domain. The AAA domain of midasin contains six tandem AAA protomers. The AAA domains in midasin is followed by a D/E rich domain that is following by a VWA domain. The members of this subgroup have a conserved MIDAS motif. The function of this domain is not exactly known although it has been speculated to play a crucial role in midasin function.
Probab=94.41  E-value=0.53  Score=52.42  Aligned_cols=132  Identities=19%  Similarity=0.201  Sum_probs=77.2

Q ss_pred             CCCeEEEEEecchhHHhhcH----HHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeecccccccc
Q 001711          426 MPPLYFFLIDVSISAIRSGM----LEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIF  501 (1021)
Q Consensus       426 ~pp~yvFvIDvS~~av~sG~----l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f  501 (1021)
                      ...-++|+||+|.++.++..    ++ .+..|.++|+.+..   -+|||+.|+.++.+              +.++++.|
T Consensus        59 r~~qIvlaID~S~SM~~~~~~~~ale-ak~lIs~al~~Le~---g~vgVv~Fg~~~~~--------------v~Plt~d~  120 (266)
T cd01460          59 RDYQILIAIDDSKSMSENNSKKLALE-SLCLVSKALTLLEV---GQLGVCSFGEDVQI--------------LHPFDEQF  120 (266)
T ss_pred             cCceEEEEEecchhcccccccccHHH-HHHHHHHHHHhCcC---CcEEEEEeCCCceE--------------eCCCCCCc
Confidence            45678999999999865443    33 45567777777765   47999999976431              22222211


Q ss_pred             CCCCCccceehhhhHHHHHHHHhhCCC-cccCCCCcccchHHHHHHHHHHHHhc-----CC---EEEEEec-CCCCCCcc
Q 001711          502 VPLPDDLLVNLSESRSVVDTLLDSLPS-MFQDNMNVESAFGPALKAAFMVMSRL-----GG---KLLIFQN-SLPSLGVG  571 (1021)
Q Consensus       502 ~Pl~~~lLv~l~es~~~I~~lLd~Lp~-~f~~~~~~~~alG~AL~aA~~lL~~~-----GG---kIivF~s-g~Pt~GpG  571 (1021)
                                     .. +..++.+.. .|.   ..++.++.||..|..+++..     +|   ++++..| |-+.    
T Consensus       121 ---------------~~-~a~~~~l~~~~f~---~~~Tni~~aL~~a~~~f~~~~~~~~s~~~~qlilLISDG~~~----  177 (266)
T cd01460         121 ---------------SS-QSGPRILNQFTFQ---QDKTDIANLLKFTAQIFEDARTQSSSGSLWQLLLIISDGRGE----  177 (266)
T ss_pred             ---------------hh-hHHHHHhCcccCC---CCCCcHHHHHHHHHHHHHhhhccccccccccEEEEEECCCcc----
Confidence                           11 222333321 222   23456999999999998754     32   5555544 2211    


Q ss_pred             cccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecC
Q 001711          572 CLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSD  620 (1021)
Q Consensus       572 ~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~  620 (1021)
                       ..             |        .--+..+.++.+.+|.|-..+.-.
T Consensus       178 -~~-------------e--------~~~~~~~r~a~e~~i~l~~I~ld~  204 (266)
T cd01460         178 -FS-------------E--------GAQKVRLREAREQNVFVVFIIIDN  204 (266)
T ss_pred             -cC-------------c--------cHHHHHHHHHHHcCCeEEEEEEcC
Confidence             00             0        001345788889998887776544


No 69 
>KOG0443 consensus Actin regulatory proteins (gelsolin/villin family) [Cytoskeleton]
Probab=94.13  E-value=0.19  Score=62.08  Aligned_cols=79  Identities=25%  Similarity=0.284  Sum_probs=53.6

Q ss_pred             cchhhccCCcEEEEECC-ceeEEEecCCCCHHHHHhhcCCchhhhhccccccccchHHHHHHHHHHHHHHHh-CCCCCce
Q 001711          898 LVAESLDSRGLYIFDDG-FRFVLWFGRMLSPDIAMNLLGSEFAAELSKVMLREQDNEMSRKLLGILKKLREQ-DPSYYQL  975 (1021)
Q Consensus       898 LS~~~L~~~giyLlD~G-~~i~lwvG~~v~~~ll~~lFgv~~~~~l~~~~lp~~~n~~s~~l~~ii~~lr~~-r~~~~~l  975 (1021)
                      |+.+-|+.+++||||+| ..||||+|+.++.+-.+..+                     ...+++|   |.. +..+-.+
T Consensus       277 l~qdlLd~~dCYILD~g~~~IfVW~Gr~as~~ERkaAm---------------------~~AeeFl---k~k~yP~~TqV  332 (827)
T KOG0443|consen  277 LTKDLLDTEDCYILDCGGGEIFVWKGRQASLDERKAAM---------------------SSAEEFL---KKKKYPPNTQV  332 (827)
T ss_pred             hhHHhhccCCeEEEecCCceEEEEeCCCCCHHHHHHHH---------------------HHHHHHH---HhccCCCCceE
Confidence            88899999999999999 99999999999776543222                     2233344   443 4566666


Q ss_pred             EEEeccCC-CcchHHHHHhhccccCCC
Q 001711          976 CQLVRQGE-QPREGFLLLANLVEDQIG 1001 (1021)
Q Consensus       976 ~~vvrqg~-~~~~e~~f~~~LVED~~~ 1001 (1021)
                       .+|-+|- +.....+|.+..-+|+++
T Consensus       333 -~rv~EG~Esa~FKq~F~~W~~~~~t~  358 (827)
T KOG0443|consen  333 -VRVLEGAESAPFKQLFDSWPDKDQTN  358 (827)
T ss_pred             -EEecCCCcchhHHHHHhhCccccccc
Confidence             6666653 332234666777777765


No 70 
>cd01455 vWA_F11C1-5a_type Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses  In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A 
Probab=93.70  E-value=3.2  Score=44.05  Aligned_cols=98  Identities=10%  Similarity=0.068  Sum_probs=61.2

Q ss_pred             hhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHH-h--cCCEEEEEec-CCCCCCcccccccCCcCcccCCCccc
Q 001711          514 ESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMS-R--LGGKLLIFQN-SLPSLGVGCLKLRGDDLRVYGTDKEH  589 (1021)
Q Consensus       514 es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~-~--~GGkIivF~s-g~Pt~GpG~L~~r~~~~r~~gt~~e~  589 (1021)
                      +..+.+..+|+.+.--+..   ..++  .||..|++.|+ .  ...|+++..+ |-=|.|              +     
T Consensus        72 ~~~~~l~~~l~~~q~g~ag---~~Ta--dAi~~av~rl~~~~~a~~kvvILLTDG~n~~~--------------~-----  127 (191)
T cd01455          72 ERLETLKMMHAHSQFCWSG---DHTV--EATEFAIKELAAKEDFDEAIVIVLSDANLERY--------------G-----  127 (191)
T ss_pred             hHHHHHHHHHHhcccCccC---ccHH--HHHHHHHHHHHhcCcCCCcEEEEEeCCCcCCC--------------C-----
Confidence            4456788888887543322   1233  88888888886 4  2355555544 321110              0     


Q ss_pred             cCCCCCcHHHHHH-HHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCC
Q 001711          590 SLRIPEDPFYKQM-AADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPS  644 (1021)
Q Consensus       590 ~l~~pa~~fY~~L-a~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~  644 (1021)
                        ..|     .+. |+.+.+.||-|..+.++.  .|-.++..+++.|||+.|.-.+
T Consensus       128 --i~P-----~~aAa~lA~~~gV~iytIgiG~--~d~~~l~~iA~~tgG~~F~A~d  174 (191)
T cd01455         128 --IQP-----KKLADALAREPNVNAFVIFIGS--LSDEADQLQRELPAGKAFVCMD  174 (191)
T ss_pred             --CCh-----HHHHHHHHHhCCCEEEEEEecC--CCHHHHHHHHhCCCCcEEEeCC
Confidence              011     344 355667888887777765  3677899999999999998754


No 71 
>TIGR00627 tfb4 transcription factor tfb4. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=93.29  E-value=5.4  Score=44.82  Aligned_cols=95  Identities=18%  Similarity=0.169  Sum_probs=62.8

Q ss_pred             cccchHHHHHHHHHHHHh----------cCCEEEEEecCCCCCCcccccccCCcCcccCCCccccCCCCCcHHHHHHHHH
Q 001711          536 VESAFGPALKAAFMVMSR----------LGGKLLIFQNSLPSLGVGCLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAAD  605 (1021)
Q Consensus       536 ~~~alG~AL~aA~~lL~~----------~GGkIivF~sg~Pt~GpG~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~  605 (1021)
                      .++.+..||..|+-.+..          ..+||+++..+.            |.             ..+.-=+-+....
T Consensus       117 ~~s~lagals~ALcyinr~~~~~~~~~~~~~RIlii~~s~------------~~-------------~~qYi~~mn~Ifa  171 (279)
T TIGR00627       117 SRTVLAGALSDALGYINRSEQSETASEKLKSRILVISITP------------DM-------------ALQYIPLMNCIFS  171 (279)
T ss_pred             ccccchhHHHhhhhhhcccccccccCcCCcceEEEEECCC------------Cc-------------hHHHHHHHHHHHH
Confidence            466788888888877743          247888887631            10             1112223477788


Q ss_pred             HhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHHhc
Q 001711          606 LTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSRDL  662 (1021)
Q Consensus       606 ~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r~l  662 (1021)
                      |.+.+|.||++..+.+ .|..-+..+++.|||......      |.+.|...|...+
T Consensus       172 aqk~~I~Idv~~L~~e-~~~~~lqQa~~~TgG~Y~~~~------~~~~L~q~L~~~~  221 (279)
T TIGR00627       172 AQKQNIPIDVVSIGGD-FTSGFLQQAADITGGSYLHVK------KPQGLLQYLMTNM  221 (279)
T ss_pred             HHHcCceEEEEEeCCc-cccHHHHHHHHHhCCEEeccC------CHhHHHHHHHHhc
Confidence            9999999999988643 467889999999999544443      2344555554433


No 72 
>PF03731 Ku_N:  Ku70/Ku80 N-terminal alpha/beta domain;  InterPro: IPR005161 The Ku heterodimer (composed of Ku70 P12956 from SWISSPROT and Ku80 P13010 from SWISSPROT) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the N-terminal alpha/beta domain. This domain only makes a small contribution to the dimer interface. The domain comprises a six stranded beta sheet of the Rossman fold [].; PDB: 1JEQ_A 1JEY_A.
Probab=92.72  E-value=0.77  Score=49.37  Aligned_cols=154  Identities=20%  Similarity=0.242  Sum_probs=74.3

Q ss_pred             eEEEEEecchhHHhh-----cHHHHHHHHHHHHHhcC-CCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccC
Q 001711          429 LYFFLIDVSISAIRS-----GMLEVVAQTIKSCLDEL-PGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFV  502 (1021)
Q Consensus       429 ~yvFvIDvS~~av~s-----G~l~~~~~sI~~~L~~L-p~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~  502 (1021)
                      +.|||||+|.++.+.     .-++.++++|...+.+. -..+...||||.|++.-.=-. .....-..+.++.+|+-   
T Consensus         1 ~~vflID~s~sM~~~~~~~~~~l~~al~~i~~~~~~ki~~~~kD~vgvvl~gt~~t~n~-~~~~~~~~i~~l~~l~~---   76 (224)
T PF03731_consen    1 ATVFLIDVSPSMFEPSSESESPLEEALKAIEDLMQQKIISSPKDEVGVVLFGTDETNNP-DEDSGYENIFVLQPLDP---   76 (224)
T ss_dssp             EEEEEEE-SCGGGS-BTTCS-HHHHHHHHHHHHHHHHHHTT---EEEEEEES-SS-BST--TTT-STTEEEEEECC----
T ss_pred             CEEEEEECCHHHCCCCCCcchhHHHHHHHHHHHHHHHHcCCCCCeEEEEEEcCCCCCCc-ccccCCCceEEeecCCc---
Confidence            469999999988532     23666777777777642 122337899999975421000 00111123333333321   


Q ss_pred             CCCCccceehhhhHHHHHHHHhhCCC----cccCCCCcccchHHHHHHHHHHHHh--c-----CCEEEEEecCCCCCCcc
Q 001711          503 PLPDDLLVNLSESRSVVDTLLDSLPS----MFQDNMNVESAFGPALKAAFMVMSR--L-----GGKLLIFQNSLPSLGVG  571 (1021)
Q Consensus       503 Pl~~~lLv~l~es~~~I~~lLd~Lp~----~f~~~~~~~~alG~AL~aA~~lL~~--~-----GGkIivF~sg~Pt~GpG  571 (1021)
                                 -+.+.|..|.+.+..    ........+..+..||.+|..+++.  .     .-||++|++.-   +|-
T Consensus        77 -----------~~~~~l~~L~~~~~~~~~~~~~~~~~~~~~l~~al~v~~~~~~~~~~~~k~~~krI~l~Td~d---~p~  142 (224)
T PF03731_consen   77 -----------PSAERLKELEELLKPGDKFENFFSGSDEGDLSDALWVASDMFRERTCKKKKNKKRIFLFTDND---GPH  142 (224)
T ss_dssp             ------------BHHHHHHHHTTSHHHHHHHHHC-SSS---HHHHHHHHHHHHHCHCTTS-ECEEEEEEEES-S---STT
T ss_pred             -----------cCHHHHHHHHHhhcccccccccCCCCCccCHHHHHHHHHHHHHHHhhcccCCCcEEEEEeCCC---CCC
Confidence                       122333333333322    0011233456799999999999975  1     23677777621   111


Q ss_pred             cccccCCcCcccCCCccccCCCCCcHHHHH-HHHHHhhCCcEEEEEEe
Q 001711          572 CLKLRGDDLRVYGTDKEHSLRIPEDPFYKQ-MAADLTKFQIAVNVYAF  618 (1021)
Q Consensus       572 ~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~-La~~~~~~gIsVDlF~~  618 (1021)
                      .           +.+ +      -..-.++ .+.++...+|.+++|..
T Consensus       143 ~-----------~~~-~------~~~~~~~l~~~Dl~~~~i~~~~~~l  172 (224)
T PF03731_consen  143 E-----------DDD-E------LERIIQKLKAKDLQDNGIEIELFFL  172 (224)
T ss_dssp             T------------CC-C------HHHHHHHHHHHHHHHHTEEEEEEEC
T ss_pred             C-----------CHH-H------HHHHHHhhccccchhcCcceeEeec
Confidence            0           000 0      0011111 26779999999999987


No 73 
>PF03850 Tfb4:  Transcription factor Tfb4;  InterPro: IPR004600 Members of this family are part of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. The core-TFIIH basal transcription factor complex has six subunits, this is the p34 subunit.; GO: 0006281 DNA repair, 0006355 regulation of transcription, DNA-dependent, 0000439 core TFIIH complex
Probab=92.62  E-value=4.9  Score=45.17  Aligned_cols=184  Identities=17%  Similarity=0.167  Sum_probs=96.4

Q ss_pred             eEEEEEecchhHHhh----cHHHHHHHHHHHHHhc-CCCCCCceEEEEEEcC--eEEEEecCCCC--CCcceeecccccc
Q 001711          429 LYFFLIDVSISAIRS----GMLEVVAQTIKSCLDE-LPGFPRTQIGFITFDS--TIHFYNMKSSL--TQPQMMVISDLDD  499 (1021)
Q Consensus       429 ~yvFvIDvS~~av~s----G~l~~~~~sI~~~L~~-Lp~~~rt~VgiITFds--~V~fynl~~~~--~~p~mlVvsDldd  499 (1021)
                      ..+.|||++-.+...    ..+..++++|.--++. |--+..-+|+||....  .-.+|.-....  ....-.-..+.++
T Consensus         3 LLvIILD~nP~~W~~~~~~~~l~~~l~~llvFlNahL~l~~~N~vaVIAs~~~~s~~LYP~~~~~~~~~~~~~~~~~~~~   82 (276)
T PF03850_consen    3 LLVIILDTNPLAWGQLSDQLSLSQFLDSLLVFLNAHLALNHSNQVAVIASHSNSSKFLYPSPSSSESSNSGDVEMNSSDS   82 (276)
T ss_pred             EEEEEEECCHHHHhhccccccHHHHHHHHHHHHHHHHhhCccCCEEEEEEcCCccEEEeCCCccccccCCCccccccccc
Confidence            468899999877432    2344555555555542 2222235799988743  33455543310  0000000111110


Q ss_pred             ccCCCCCccceehhhh-HHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh-----------cCCEEEEEecCCCC
Q 001711          500 IFVPLPDDLLVNLSES-RSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR-----------LGGKLLIFQNSLPS  567 (1021)
Q Consensus       500 ~f~Pl~~~lLv~l~es-~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~-----------~GGkIivF~sg~Pt  567 (1021)
                      .    -.+.+-.++|. .+.+.+++++....-  .....+.+..||..|+-.+..           ..+||+++.++-  
T Consensus        83 ~----~y~~f~~v~~~v~~~l~~l~~~~~~~~--~~~~~s~LagALS~ALCyINR~~~~~~~~~~~~~~RILv~~s~s--  154 (276)
T PF03850_consen   83 N----KYRQFRNVDETVLEELKKLMSETSESS--DSTTSSLLAGALSMALCYINRISRESPSGGTSLKSRILVIVSGS--  154 (276)
T ss_pred             c----hhHHHHHHHHHHHHHHHHHHhhccccc--ccccchhhHHHHHHHHHHHhhhhhcccCCCCCcCccEEEEEecC--
Confidence            0    00111112221 233333333332211  111226788899888876643           235888853321  


Q ss_pred             CCcccccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCC
Q 001711          568 LGVGCLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPS  644 (1021)
Q Consensus       568 ~GpG~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~  644 (1021)
                               +|        .     ..+.-=+-+..-.+.+.+|.||++..+.  .|-.-|...+..|||.-+..+.
T Consensus       155 ---------~d--------~-----~~QYi~~MN~iFaAqk~~v~IDv~~L~~--~~s~fLqQa~d~T~G~y~~~~~  207 (276)
T PF03850_consen  155 ---------PD--------S-----SSQYIPLMNCIFAAQKQKVPIDVCKLGG--KDSTFLQQASDITGGIYLKVSK  207 (276)
T ss_pred             ---------CC--------c-----cHHHHHHHHHHHHHhcCCceeEEEEecC--CchHHHHHHHHHhCceeeccCc
Confidence                     11        0     0112223466677889999999999887  5666789999999999887765


No 74 
>KOG0444 consensus Cytoskeletal regulator Flightless-I (contains leucine-rich and gelsolin repeats) [Cytoskeleton]
Probab=91.25  E-value=0.31  Score=58.81  Aligned_cols=66  Identities=27%  Similarity=0.456  Sum_probs=49.4

Q ss_pred             cccccchhhccCCcEEEEECCceeEEEecCCCCHHHHHhhcCCchhhhhccccccccchHHHHHHHHHHHHHHHh-CCCC
Q 001711          894 KRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPDIAMNLLGSEFAAELSKVMLREQDNEMSRKLLGILKKLREQ-DPSY  972 (1021)
Q Consensus       894 ~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~ll~~lFgv~~~~~l~~~~lp~~~n~~s~~l~~ii~~lr~~-r~~~  972 (1021)
                      ++++|+..+|++.-+||||-|.+||||-|...-                         +..+.+.|-+.++|.+. |.--
T Consensus       637 EPVpl~~tSLDPRf~FlLD~G~~IyiW~G~~s~-------------------------~t~~~KARLfAEkinK~eRKgK  691 (1255)
T KOG0444|consen  637 EPVPLSVTSLDPRFCFLLDAGETIYIWSGYKSR-------------------------ITVSNKARLFAEKINKRERKGK  691 (1255)
T ss_pred             eccCccccccCcceEEEEeCCceEEEEeccchh-------------------------cccchHHHHHHHHhhhhhccCc
Confidence            468999999999999999999999999997641                         13445667677777544 3333


Q ss_pred             CceEEEeccCCCc
Q 001711          973 YQLCQLVRQGEQP  985 (1021)
Q Consensus       973 ~~l~~vvrqg~~~  985 (1021)
                      ..+ .++|||...
T Consensus       692 ~EI-~l~rQg~e~  703 (1255)
T KOG0444|consen  692 SEI-ELCRQGREP  703 (1255)
T ss_pred             eee-ehhhhcCCC
Confidence            455 788998654


No 75 
>KOG2807 consensus RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription; Replication, recombination and repair]
Probab=90.85  E-value=2.6  Score=47.40  Aligned_cols=165  Identities=23%  Similarity=0.311  Sum_probs=99.3

Q ss_pred             CCeEEEEEecchhHHhhc----HHHHHHHHHHHHHhcCC-CCCCceEEEEEEcCeEEEEecCCCCCCcceeecccccccc
Q 001711          427 PPLYFFLIDVSISAIRSG----MLEVVAQTIKSCLDELP-GFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIF  501 (1021)
Q Consensus       427 pp~yvFvIDvS~~av~sG----~l~~~~~sI~~~L~~Lp-~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f  501 (1021)
                      -...+.|||+|-.+.++-    .++.+++.+..-+.+.- .+|-..||||+.-+.         ..    -+++|     
T Consensus        60 iRhl~iviD~S~am~e~Df~P~r~a~~~K~le~Fv~eFFdQNPiSQigii~~k~g---------~A----~~lt~-----  121 (378)
T KOG2807|consen   60 IRHLYIVIDCSRAMEEKDFRPSRFANVIKYLEGFVPEFFDQNPISQIGIISIKDG---------KA----DRLTD-----  121 (378)
T ss_pred             heeEEEEEEhhhhhhhccCCchHHHHHHHHHHHHHHHHhccCchhheeEEEEecc---------hh----hHHHH-----
Confidence            346678999999886654    34555555555555432 356678999875321         10    01122     


Q ss_pred             CCCCCccceehhhh-HHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcCC----EEEEEecCCCCCCccccccc
Q 001711          502 VPLPDDLLVNLSES-RSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLGG----KLLIFQNSLPSLGVGCLKLR  576 (1021)
Q Consensus       502 ~Pl~~~lLv~l~es-~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~GG----kIivF~sg~Pt~GpG~L~~r  576 (1021)
                                ++.+ +..|+.|....      .-.....+-.||+.|...|++.-|    .|++..+++.|.-||-+   
T Consensus       122 ----------ltgnp~~hI~aL~~~~------~~~g~fSLqNaLe~a~~~Lk~~p~H~sREVLii~sslsT~DPgdi---  182 (378)
T KOG2807|consen  122 ----------LTGNPRIHIHALKGLT------ECSGDFSLQNALELAREVLKHMPGHVSREVLIIFSSLSTCDPGDI---  182 (378)
T ss_pred             ----------hcCCHHHHHHHHhccc------ccCCChHHHHHHHHHHHHhcCCCcccceEEEEEEeeecccCcccH---
Confidence                      2222 22333332222      123455688899999999998633    45666677777766633   


Q ss_pred             CCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHH
Q 001711          577 GDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRH  656 (1021)
Q Consensus       577 ~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~  656 (1021)
                                            | +.-..+.+..|-|.++-.+.+   ++.-..||+.|||. |+.     ..|...|..
T Consensus       183 ----------------------~-~tI~~lk~~kIRvsvIgLsaE---v~icK~l~kaT~G~-Y~V-----~lDe~Hlke  230 (378)
T KOG2807|consen  183 ----------------------Y-ETIDKLKAYKIRVSVIGLSAE---VFICKELCKATGGR-YSV-----ALDEGHLKE  230 (378)
T ss_pred             ----------------------H-HHHHHHHhhCeEEEEEeechh---HHHHHHHHHhhCCe-EEE-----EeCHHHHHH
Confidence                                  3 334567888899999887744   66678899999993 222     245555544


Q ss_pred             HHHH
Q 001711          657 ELSR  660 (1021)
Q Consensus       657 dL~r  660 (1021)
                      -|..
T Consensus       231 Ll~e  234 (378)
T KOG2807|consen  231 LLLE  234 (378)
T ss_pred             HHHh
Confidence            4443


No 76 
>KOG4849 consensus mRNA cleavage factor I subunit/CPSF subunit [RNA processing and modification]
Probab=90.19  E-value=8.2  Score=43.74  Aligned_cols=13  Identities=8%  Similarity=0.171  Sum_probs=6.3

Q ss_pred             HHHHHHHHHHhcC
Q 001711          448 VVAQTIKSCLDEL  460 (1021)
Q Consensus       448 ~~~~sI~~~L~~L  460 (1021)
                      .++|+|..+|.-+
T Consensus       391 ~AiETllTAI~lI  403 (498)
T KOG4849|consen  391 GAIETLLTAIQLI  403 (498)
T ss_pred             hHHHHHHHHHHHH
Confidence            3445555555444


No 77 
>COG2425 Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [General function prediction only]
Probab=89.92  E-value=2.1  Score=50.78  Aligned_cols=148  Identities=16%  Similarity=0.216  Sum_probs=94.0

Q ss_pred             CCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCC
Q 001711          427 PPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPD  506 (1021)
Q Consensus       427 pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~  506 (1021)
                      .| ++.|||.|.++  .|..+...+++..+|-.+.--.+.++.++.||+.++=|.+....                    
T Consensus       273 Gp-villlD~SGSM--~G~~e~~AKAvalAl~~~alaenR~~~~~lF~s~~~~~el~~k~--------------------  329 (437)
T COG2425         273 GP-VILLLDKSGSM--SGFKEQWAKAVALALMRIALAENRDCYVILFDSEVIEYELYEKK--------------------  329 (437)
T ss_pred             CC-EEEEEeCCCCc--CCcHHHHHHHHHHHHHHHHHHhccceEEEEecccceeeeecCCc--------------------
Confidence            44 45699999998  57777777777777765432233789999999954444433210                    


Q ss_pred             ccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh---cCCEEEEEecCCCCCCcccccccCCcCccc
Q 001711          507 DLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR---LGGKLLIFQNSLPSLGVGCLKLRGDDLRVY  583 (1021)
Q Consensus       507 ~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~---~GGkIivF~sg~Pt~GpG~L~~r~~~~r~~  583 (1021)
                                -.++++++.|...|..    ++-+-.||..|++.++.   .++.|++.|.|-.                 
T Consensus       330 ----------~~~~e~i~fL~~~f~G----GTD~~~~l~~al~~~k~~~~~~adiv~ITDg~~-----------------  378 (437)
T COG2425         330 ----------IDIEELIEFLSYVFGG----GTDITKALRSALEDLKSRELFKADIVVITDGED-----------------  378 (437)
T ss_pred             ----------cCHHHHHHHHhhhcCC----CCChHHHHHHHHHHhhcccccCCCEEEEeccHh-----------------
Confidence                      0134456666555543    35678899999999985   4678888776421                 


Q ss_pred             CCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCC-cChhhhhhhccccccEEEEeC
Q 001711          584 GTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKY-TDIASLGTLAKYTGGQVYYYP  643 (1021)
Q Consensus       584 gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~-~diatl~~L~~~TGG~v~~y~  643 (1021)
                            ..   .+.|-++..+...+.+.=|.-.+++... -++..+..   ++   +|.++
T Consensus       379 ------~~---~~~~~~~v~e~~k~~~~rl~aV~I~~~~~~~l~~Isd---~~---i~~~~  424 (437)
T COG2425         379 ------ER---LDDFLRKVKELKKRRNARLHAVLIGGYGKPGLMRISD---HI---IYRVE  424 (437)
T ss_pred             ------hh---hhHHHHHHHHHHHHhhceEEEEEecCCCCcccceeee---ee---EEeeC
Confidence                  11   1567777777776777777777766544 55554444   33   66655


No 78 
>KOG4849 consensus mRNA cleavage factor I subunit/CPSF subunit [RNA processing and modification]
Probab=88.82  E-value=9.5  Score=43.26  Aligned_cols=7  Identities=29%  Similarity=0.591  Sum_probs=2.7

Q ss_pred             CccceEE
Q 001711          354 FICRTYV  360 (1021)
Q Consensus       354 ~rCrAYi  360 (1021)
                      .|||-.|
T Consensus       412 dRCrvLi  418 (498)
T KOG4849|consen  412 DRCRVLI  418 (498)
T ss_pred             hHHHHHH
Confidence            3444333


No 79 
>PRK10997 yieM hypothetical protein; Provisional
Probab=88.12  E-value=2  Score=51.76  Aligned_cols=149  Identities=13%  Similarity=0.169  Sum_probs=85.8

Q ss_pred             CeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCc
Q 001711          428 PLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDD  507 (1021)
Q Consensus       428 p~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~  507 (1021)
                      --+++|||+|.++.  |.-+..+.++..+|-.+.-..+.++++|.|++.+..|.+...                      
T Consensus       324 GpiII~VDtSGSM~--G~ke~~AkalAaAL~~iAl~q~dr~~li~Fs~~i~~~~l~~~----------------------  379 (487)
T PRK10997        324 GPFIVCVDTSGSMG--GFNEQCAKAFCLALMRIALAENRRCYIMLFSTEVVTYELTGP----------------------  379 (487)
T ss_pred             CcEEEEEECCCCCC--CCHHHHHHHHHHHHHHHHHhcCCCEEEEEecCCceeeccCCc----------------------
Confidence            45788999999984  554455556666665543323367999999988776644321                      


Q ss_pred             cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---CCEEEEEecCCCCCCcccccccCCcCcccC
Q 001711          508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---GGKLLIFQNSLPSLGVGCLKLRGDDLRVYG  584 (1021)
Q Consensus       508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---GGkIivF~sg~Pt~GpG~L~~r~~~~r~~g  584 (1021)
                            ..+..+..+|+..   +    ..++.+..||+.++..++..   .|-|+++++.....                
T Consensus       380 ------~gl~~ll~fL~~~---f----~GGTDl~~aL~~al~~l~~~~~r~adIVVISDF~~~~----------------  430 (487)
T PRK10997        380 ------DGLEQAIRFLSQS---F----RGGTDLAPCLRAIIEKMQGREWFDADAVVISDFIAQR----------------  430 (487)
T ss_pred             ------cCHHHHHHHHHHh---c----CCCCcHHHHHHHHHHHHcccccCCceEEEECCCCCCC----------------
Confidence                  1112222233321   2    34677899999999888652   46677665543110                


Q ss_pred             CCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCC
Q 001711          585 TDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPS  644 (1021)
Q Consensus       585 t~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~  644 (1021)
                               ..+.+.+.+...-.+.+.-+...+++..  +-..+..++.    +++.|+.
T Consensus       431 ---------~~eel~~~L~~Lk~~~~~rf~~l~i~~~--~~p~l~~ifD----~~W~~d~  475 (487)
T PRK10997        431 ---------LPDELVAKVKELQRQHQHRFHAVAMSAH--GKPGIMRIFD----HIWRFDT  475 (487)
T ss_pred             ---------ChHHHHHHHHHHHHhcCcEEEEEEeCCC--CCchHHHhcC----eeeEecC
Confidence                     0123444444333347777777777642  2233444443    4677664


No 80 
>PF06707 DUF1194:  Protein of unknown function (DUF1194);  InterPro: IPR010607 This family consists of several hypothetical Rhizobiales specific proteins of around 270 residues in length. The function of this family is unknown.
Probab=86.90  E-value=29  Score=37.40  Aligned_cols=119  Identities=18%  Similarity=0.171  Sum_probs=64.5

Q ss_pred             hhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---CCEEEEEecC--CCCCCcccccccCCcCcccCCCcc
Q 001711          514 ESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---GGKLLIFQNS--LPSLGVGCLKLRGDDLRVYGTDKE  588 (1021)
Q Consensus       514 es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---GGkIivF~sg--~Pt~GpG~L~~r~~~~r~~gt~~e  588 (1021)
                      +..+.+-.-|+..+..+    ...+|+|.||..+..+|...   +.|-++=.||  .-|.|+                  
T Consensus        75 ~da~a~A~~l~~~~r~~----~~~Taig~Al~~a~~ll~~~~~~~~RrVIDvSGDG~~N~G~------------------  132 (205)
T PF06707_consen   75 ADAEAFAARLRAAPRRF----GGRTAIGSALDFAAALLAQNPFECWRRVIDVSGDGPNNQGP------------------  132 (205)
T ss_pred             HHHHHHHHHHHhCCCCC----CCCchHHHHHHHHHHHHHhCCCCCceEEEEECCCCCCCCCC------------------
Confidence            34444455555555432    22389999999999999874   3444444442  222221                  


Q ss_pred             ccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCc----ChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHHhccc
Q 001711          589 HSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYT----DIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSRDLTR  664 (1021)
Q Consensus       589 ~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~----diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r~ltr  664 (1021)
                          .|.    +..-..+...||.||=+.+....-    +|...-.=+-.+|---|....    .+.+.|.+-++|-|.|
T Consensus       133 ----~p~----~~ard~~~~~GitINgL~I~~~~~~~~~~L~~yy~~~VIgGpgAFV~~a----~~~~df~~AirrKL~r  200 (205)
T PF06707_consen  133 ----RPV----TSARDAAVAAGITINGLAILDDDPFGGADLDAYYRRCVIGGPGAFVETA----RGFEDFAEAIRRKLIR  200 (205)
T ss_pred             ----Ccc----HHHHHHHHHCCeEEeeeEecCCCCCccccHHHHHhhhcccCCCceEEEc----CCHHHHHHHHHHHHHH
Confidence                122    122234556899999998877655    565544333333322232222    2345566666666655


Q ss_pred             cc
Q 001711          665 ET  666 (1021)
Q Consensus       665 ~~  666 (1021)
                      |+
T Consensus       201 Ei  202 (205)
T PF06707_consen  201 EI  202 (205)
T ss_pred             Hh
Confidence            53


No 81 
>smart00187 INB Integrin beta subunits (N-terminal portion of extracellular region). Portion of beta integrins that lies N-terminal to their EGF-like repeats. Integrins are cell adhesion molecules that mediate cell-extracellular  matrix and cell-cell interactions. They contain both alpha and beta subunits. Beta integrins are proposed to have a von Willebrand factor type-A "insert" or "I" -like domain (although this remains to be confirmed).
Probab=85.16  E-value=91  Score=37.16  Aligned_cols=272  Identities=15%  Similarity=0.192  Sum_probs=140.4

Q ss_pred             CCeEEEEEecchhHHhh-cHHHHHHHHHHHHHhcCCCCCCceEEEEEE-cCeEEEEec--CCCCCCcceeeccccccccC
Q 001711          427 PPLYFFLIDVSISAIRS-GMLEVVAQTIKSCLDELPGFPRTQIGFITF-DSTIHFYNM--KSSLTQPQMMVISDLDDIFV  502 (1021)
Q Consensus       427 pp~yvFvIDvS~~av~s-G~l~~~~~sI~~~L~~Lp~~~rt~VgiITF-ds~V~fynl--~~~~~~p~mlVvsDldd~f~  502 (1021)
                      |-=..|+.|+|+++... .-++.+...|.+.|..+-.+  .|+||=+| |+.|.=|-.  ...+..|-.-.-...+-.| 
T Consensus        99 PvDLYyLMDlS~SM~ddl~~lk~lg~~L~~~m~~it~n--~rlGfGsFVDK~v~P~~~t~p~~l~~PC~~~~~~c~p~f-  175 (423)
T smart00187       99 PVDLYYLMDLSYSMKDDLDNLKSLGDDLAREMKGLTSN--FRLGFGSFVDKTVSPFVSTRPEKLENPCPNYNLTCEPPY-  175 (423)
T ss_pred             ccceEEEEeCCccHHHHHHHHHHHHHHHHHHHHhcccC--ceeeEEEeecCccCCcccCCHHHhcCCCcCCCCCcCCCc-
Confidence            33467899999988431 12445555666666666544  88999988 665532221  1111111000000001111 


Q ss_pred             CCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcC-----CEEEEEecCCCC--CCcccccc
Q 001711          503 PLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLG-----GKLLIFQNSLPS--LGVGCLKL  575 (1021)
Q Consensus       503 Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~G-----GkIivF~sg~Pt--~GpG~L~~  575 (1021)
                        .-.-.++|.+..+.+.+.+.... ...+...+|-.|-+-+++|+ .-+.+|     -||+||.+--.-  .|-|+|-.
T Consensus       176 --~f~~~L~LT~~~~~F~~~V~~~~-iSgN~D~PEgG~DAimQaaV-C~~~IGWR~~a~rllv~~TDa~fH~AGDGkLaG  251 (423)
T smart00187      176 --GFKHVLSLTDDTDEFNEEVKKQR-ISGNLDAPEGGFDAIMQAAV-CTEQIGWREDARRLLVFSTDAGFHFAGDGKLAG  251 (423)
T ss_pred             --ceeeeccCCCCHHHHHHHHhhce-eecCCcCCcccHHHHHHHHh-hccccccCCCceEEEEEEcCCCccccCCcceee
Confidence              11224667777776776666643 23344567777777777774 112233     489999987775  38888765


Q ss_pred             c--CCcCcccC-CCccccC-CCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEE-EeCCCCCchh
Q 001711          576 R--GDDLRVYG-TDKEHSL-RIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVY-YYPSFQSTTH  650 (1021)
Q Consensus       576 r--~~~~r~~g-t~~e~~l-~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~-~y~~F~~~~d  650 (1021)
                      .  .++.+-|= .+.+..- ..-...--.+|++++.+++|-+ ||+.+....++.  ..|+.+-.|... ...  ..+.+
T Consensus       252 Iv~PNDg~CHL~~~g~Yt~s~~~DYPSi~ql~~kL~e~nI~~-IFAVT~~~~~~Y--~~Ls~lipgs~vg~Ls--~DSsN  326 (423)
T smart00187      252 IVQPNDGQCHLDNNGEYTMSTTQDYPSIGQLNQKLAENNINP-IFAVTKKQVSLY--KELSALIPGSSVGVLS--EDSSN  326 (423)
T ss_pred             EecCCCCcceeCCCCCcCccCcCCCCCHHHHHHHHHhcCceE-EEEEcccchhHH--HHHHHhcCcceeeecc--cCcch
Confidence            4  12233221 1101110 0112234578899999999865 788777776653  344444444332 211  12233


Q ss_pred             HHHHHHHHHHhcccccccceEEEEE-eCCCeEEEeeecCccc--CCCCceeeccCCCCCcEEEEEEec
Q 001711          651 GERLRHELSRDLTRETAWEAVMRIR-CGKGVRFTNYHGNFML--RSTDLLALPAVDCDKAYAMQLSLE  715 (1021)
Q Consensus       651 ~~kl~~dL~r~ltr~~g~~a~mrVR-~S~Gl~V~~~~Gnf~~--rs~~~~~l~~id~d~Sia~~~~~d  715 (1021)
                      .-+|..+-++.|.    -.++|+.. ..++++++-.- .+-.  .....-...++.-.+.+.|++++.
T Consensus       327 Iv~LI~~aY~~i~----S~V~l~~~~~p~~v~~~y~s-~C~~g~~~~~~~~C~~v~iG~~V~F~v~vt  389 (423)
T smart00187      327 VVELIKDAYNKIS----SRVELEDNSLPEGVSVTYTS-SCPGGVVGPGTRKCEGVKIGDTVSFEVTVT  389 (423)
T ss_pred             HHHHHHHHHHhhc----eEEEEecCCCCCcEEEEEEe-eCCCCCcccCCcccCCcccCCEEEEEEEEE
Confidence            4455555555443    34455544 36677766332 2111  011112345666667777777654


No 82 
>KOG2353 consensus L-type voltage-dependent Ca2+ channel, alpha2/delta subunit [Inorganic ion transport and metabolism; Signal transduction mechanisms]
Probab=84.19  E-value=19  Score=47.62  Aligned_cols=116  Identities=23%  Similarity=0.344  Sum_probs=73.4

Q ss_pred             ccccEEEEcc---ccccCCCCCCCeEEEEEecchhHHhhc-HHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecC
Q 001711          408 TKGSVEFVAP---TEYMVRPPMPPLYFFLIDVSISAIRSG-MLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMK  483 (1021)
Q Consensus       408 ~~gtVEfvap---~eY~~r~p~pp~yvFvIDvS~~av~sG-~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~  483 (1021)
                      ...++|+...   +-|+.....+--.+|++|+|.+.  +| .+..++.++.++|+.|.++  ..|-|+||++.++.-   
T Consensus       203 ~~~~idl~D~R~r~Wyi~aAt~pKdiviLlD~SgSm--~g~~~~lak~tv~~iLdtLs~~--Dfvni~tf~~~~~~v---  275 (1104)
T KOG2353|consen  203 TDNSIDLYDCRNRSWYIQAATSPKDIVILLDVSGSM--SGLRLDLAKQTVNEILDTLSDN--DFVNILTFNSEVNPV---  275 (1104)
T ss_pred             CCCcceeeecccccccccccCCccceEEEEeccccc--cchhhHHHHHHHHHHHHhcccC--CeEEEEeeccccCcc---
Confidence            3444444433   33565667778899999999987  34 3677888899999999876  789999999876532   


Q ss_pred             CCCCCcceeeccccccccCCCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh
Q 001711          484 SSLTQPQMMVISDLDDIFVPLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR  553 (1021)
Q Consensus       484 ~~~~~p~mlVvsDldd~f~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~  553 (1021)
                                +++..       .+|+----..++.+.++++.|.  .+..    .-+-.|++.|+.+|..
T Consensus       276 ----------~pc~~-------~~lvqAt~~nk~~~~~~i~~l~--~k~~----a~~~~~~e~aF~lL~~  322 (1104)
T KOG2353|consen  276 ----------SPCFN-------GTLVQATMRNKKVFKEAIETLD--AKGI----ANYTAALEYAFSLLRD  322 (1104)
T ss_pred             ----------ccccc-------CceeecchHHHHHHHHHHhhhc--cccc----cchhhhHHHHHHHHHH
Confidence                      22211       1222222244566666666665  1111    2245688888888865


No 83 
>PF00362 Integrin_beta:  Integrin, beta chain;  InterPro: IPR002369 Integrins are the major metazoan receptors for cell adhesion to extracellular matrix proteins and, in vertebrates, also play important roles in certain cell-cell adhesions, make transmembrane connections to the cytoskeleton and activate many intracellular signalling pathways [, ]. The integrin receptors are composed of alpha and beta subunit heterodimers. Each subunit crosses the membrane once, with most of the polypeptide residing in the extracellular space, and has two short cytoplasmic domains. Some members of this family have EGF repeats at the C terminus and also have a vWA domain inserted within the integrin domain at the N terminus.  Most integrins recognise relatively short peptide motifs, and in general require an acidic amino acid to be present. Ligand specificity depends upon both the alpha and beta subunits []. There are at least 18 types of alpha and 8 types of beta subunits recognised in humans []. Each alpha subunit tends to associate only with one type of beta subunit, but there are exceptions to this rule []. Each association of alpha and beta subunits has its own binding specificity and signalling properties. Many integrins require activation on the cell surface before they can bind ligands. Integrins frequently intercommunicate, and binding at one integrin receptor activate or inhibit another.  The structure of unliganded alphaV beta3 showed the molecule to be folded, with the head bent over towards the C termini of the legs which would normally be inserted into the membrane []. The head comprises a beta propeller domain at the end terminus of the alphaV subunit and an I/A domain inserted into a loop on the top of the hybrid domain in the beta subunit. The I/A domain consists of a Rossman fold with a core of beta parallel sheets surrounded by amphipathic alpha helices.  Integrins are important therapeutic targets in conditions such as atherosclerosis, thrombosis, cancer and asthma []. At the N terminus of the beta subunit is a cysteine-containing domain reminiscent of that found in presenillins and semaphorins, which has hence been termed the PSI domain. C-terminal to the PSI domain is an A-domain, which has been predicted to adopt a Rossmann fold similar to that of the alpha subunit, but with additional loops between the second and third beta strands []. The murine gene Pactolus shares significant similarity with the beta subunit [], but lacks either one or both of the inserted loops. The C-terminal portion of the beta subunit extracellular domain contains an internally disulphide-bonded cysteine-rich region, while the intracellular tail contains putative sites of interaction with a variety of intracellular signalling and cytoskeletal proteins, such as focal adhesion kinase and alpha-actinin respectively []. Integrin cytoplasmic domains are normally less than 50 amino acids in length, with the beta-subunit sequences exhibiting greater homology to each other than the alpha-subunit sequences. This is consistent with current evidence that the beta subunit is the principal site for binding of cytoskeletal and signalling molecules, whereas the alpha subunit has a regulatory role. The first 20 amino acids of the beta-subunit cytoplasmic domain are also alpha helical, but the final 25 residues are disordered and, apart from a turn that follows a conserved NPxY motif, appear to lack defined structure, suggesting that this is adopted on effector binding. The two membrane-proximal helices mediate the link between the subunits via a series of hydrophobic and electrostatic contacts. This entry represents the N-terminal portion of the extracellular region of integrin beta subunits.; GO: 0005488 binding, 0007155 cell adhesion, 0007160 cell-matrix adhesion; PDB: 3VI4_B 3VI3_B 2VDQ_B 3IJE_B 1M1X_B 2VDR_B 3NIF_B 3NID_D 1TYE_F 2Q6W_F ....
Probab=83.68  E-value=94  Score=37.25  Aligned_cols=275  Identities=17%  Similarity=0.238  Sum_probs=131.8

Q ss_pred             EEEEccccccCCCCCCCeEEEEEecchhHHhh-cHHHHHHHHHHHHHhcCCCCCCceEEEEEE-cCeEEEEecCCCCCCc
Q 001711          412 VEFVAPTEYMVRPPMPPLYFFLIDVSISAIRS-GMLEVVAQTIKSCLDELPGFPRTQIGFITF-DSTIHFYNMKSSLTQP  489 (1021)
Q Consensus       412 VEfvap~eY~~r~p~pp~yvFvIDvS~~av~s-G~l~~~~~sI~~~L~~Lp~~~rt~VgiITF-ds~V~fynl~~~~~~p  489 (1021)
                      ++|..+++|      |-=.-|++|+|+++... .-++.+-..|...|.++-.+  .|+||=+| |+.|.=|--    ..|
T Consensus        93 v~~~~a~~y------PvDLYyLmDlS~Sm~ddl~~l~~lg~~l~~~~~~it~~--~~~GfGsfvdK~~~P~~~----~~p  160 (426)
T PF00362_consen   93 VTVRPAEDY------PVDLYYLMDLSYSMKDDLENLKSLGQDLAEEMRNITSN--FRLGFGSFVDKPVMPFVS----TTP  160 (426)
T ss_dssp             EEEEBSSS--------EEEEEEEE-SGGGHHHHHHHCCCCHHHHHHHHTT-SS--EEEEEEEESSSSSTTTST-----SS
T ss_pred             EEEeecccc------ceeEEEEeechhhhhhhHHHHHHHHHHHHHHHHhcCcc--ceEechhhcccccCCccc----CCh
Confidence            444445555      33467899999987321 11344556677777777655  88999999 554321110    001


Q ss_pred             ceeecccccccc--------CCCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-----CC
Q 001711          490 QMMVISDLDDIF--------VPLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-----GG  556 (1021)
Q Consensus       490 ~mlVvsDldd~f--------~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-----GG  556 (1021)
                      .     .+.++.        -|..-.-.++|.+..+.+...+.+.. +-.+...++..|-+-++||+= -+.+     .-
T Consensus       161 ~-----~l~~pc~~~~~~c~~~~~f~~~l~Lt~~~~~F~~~v~~~~-is~n~D~PEgg~dal~Qa~vC-~~~igWr~~a~  233 (426)
T PF00362_consen  161 E-----KLKNPCPSKNPNCQPPFSFRHVLSLTDDITEFNEEVNKQK-ISGNLDAPEGGLDALMQAAVC-QEEIGWRNEAR  233 (426)
T ss_dssp             H-----CHHSTSCCTTS--B---SEEEEEEEES-HHHHHHHHHTS---B--SSSSBSHHHHHHHHHH--HHHHT--STSE
T ss_pred             h-----hhcCcccccCCCCCCCeeeEEeecccchHHHHHHhhhhcc-ccCCCCCCccccchheeeeec-ccccCcccCce
Confidence            0     111111        01111234677777777888887753 344556777777777777651 1222     35


Q ss_pred             EEEEEecCCCC--CCcccccccC--CcCccc-CCCcccc-CCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhh
Q 001711          557 KLLIFQNSLPS--LGVGCLKLRG--DDLRVY-GTDKEHS-LRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGT  630 (1021)
Q Consensus       557 kIivF~sg~Pt--~GpG~L~~r~--~~~r~~-gt~~e~~-l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~  630 (1021)
                      ||+||.+--.-  .|-|+|...-  ++.+-| ..+.+.. -..-...-..+|.+.+.+++|.+ ||+......++.  ..
T Consensus       234 ~llv~~TD~~fH~agDg~l~gi~~pnd~~Chl~~~~~y~~~~~~DYPSv~ql~~~l~e~~i~~-IFAVt~~~~~~Y--~~  310 (426)
T PF00362_consen  234 RLLVFSTDAGFHFAGDGKLAGIVKPNDGKCHLDDNGMYTASTEQDYPSVGQLVRKLSENNINP-IFAVTKDVYSIY--EE  310 (426)
T ss_dssp             EEEEEEESS-B--TTGGGGGT--S---SS--BSTTSBBGGGGCS----HHHHHHHHHHTTEEE-EEEEEGGGHHHH--HH
T ss_pred             EEEEEEcCCccccccccccceeeecCCCceEECCCCcccccccccCCCHHHHHHHHHHcCCEE-EEEEchhhhhHH--HH
Confidence            89999887664  4888887542  223322 1111100 01124466778888899988765 777776655543  23


Q ss_pred             hcccc-ccEEEEeCCCCCchhHHHHHHHHHHhcccccccceEEEEE-eCCCeEEEeeecCcccCC--CCceeeccCCCCC
Q 001711          631 LAKYT-GGQVYYYPSFQSTTHGERLRHELSRDLTRETAWEAVMRIR-CGKGVRFTNYHGNFMLRS--TDLLALPAVDCDK  706 (1021)
Q Consensus       631 L~~~T-GG~v~~y~~F~~~~d~~kl~~dL~r~ltr~~g~~a~mrVR-~S~Gl~V~~~~Gnf~~rs--~~~~~l~~id~d~  706 (1021)
                      |+.+- |+.+-....  .+....+|..+-++.++.    .+.|+.. ..++++|+ |..++..+.  ...-...++..++
T Consensus       311 L~~~i~~s~vg~L~~--dSsNIv~LI~~aY~~i~s----~V~L~~~~~p~~v~v~-y~s~C~~~~~~~~~~~C~~V~iG~  383 (426)
T PF00362_consen  311 LSNLIPGSSVGELSS--DSSNIVQLIKEAYNKISS----KVELKHDNAPDGVKVS-YTSNCPNGSTVPGTNECSNVKIGD  383 (426)
T ss_dssp             HHHHSTTEEEEEEST--TSHTHHHHHHHHHHHHCT----EEEEEECS--TTEEEE-EEEEESSSEEEECCEEECSE-TT-
T ss_pred             HhhcCCCceeccccc--CchhHHHHHHHHHHHHhh----eEEEEecCCCCcEEEE-EEEEccCCcccCcCccccCEecCC
Confidence            33332 444444432  223344555555554432    2333322 23456553 222222110  1224455566666


Q ss_pred             cEEEEEEec
Q 001711          707 AYAMQLSLE  715 (1021)
Q Consensus       707 Sia~~~~~d  715 (1021)
                      ++.|++.+.
T Consensus       384 ~V~F~VtVt  392 (426)
T PF00362_consen  384 TVTFNVTVT  392 (426)
T ss_dssp             EEEEEEEEE
T ss_pred             EEEEEEEEE
Confidence            666666554


No 84 
>KOG3768 consensus DEAD box RNA helicase [General function prediction only]
Probab=83.14  E-value=16  Score=44.20  Aligned_cols=32  Identities=22%  Similarity=0.507  Sum_probs=24.0

Q ss_pred             CeEEEEEecchhHHh-----hcHHHHHHHHHHHHHhc
Q 001711          428 PLYFFLIDVSISAIR-----SGMLEVVAQTIKSCLDE  459 (1021)
Q Consensus       428 p~yvFvIDvS~~av~-----sG~l~~~~~sI~~~L~~  459 (1021)
                      |+|+|+||+|.++-+     ..+|+.++.+|..-|+.
T Consensus         2 pi~lFllDTS~SM~qrah~~~tylD~AKgaVEtFiK~   38 (888)
T KOG3768|consen    2 PIFLFLLDTSGSMSQRAHPQFTYLDLAKGAVETFIKQ   38 (888)
T ss_pred             ceEEEEEecccchhhhccCCchhhHHHHHHHHHHHHH
Confidence            689999999998743     34677777777777764


No 85 
>KOG0444 consensus Cytoskeletal regulator Flightless-I (contains leucine-rich and gelsolin repeats) [Cytoskeleton]
Probab=82.43  E-value=3.1  Score=50.67  Aligned_cols=56  Identities=23%  Similarity=0.335  Sum_probs=37.5

Q ss_pred             hhcccEEEeecCCCCCCcc--CCcccccccccccchhhccCCcEEEEECCceeEEEecCCCCH
Q 001711          867 LLYPCLIRVDEHLLKPSAQ--LDEYKNIMKRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSP  927 (1021)
Q Consensus       867 ~lYPrL~~lh~~~~~~~~~--~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~  927 (1021)
                      -..|+||.+. +    ..+  +--.+.+-+...|-.+-|.+.|+|+||+..+||||+|+....
T Consensus       731 p~qpkLYkV~-l----GmGyLELPQvel~P~~~l~q~lL~sk~VyiLDc~sDiF~W~GkKs~R  788 (1255)
T KOG0444|consen  731 PEQPKLYKVN-L----GMGYLELPQVELLPKGILKQDLLGSKGVYILDCNSDIFLWIGKKSNR  788 (1255)
T ss_pred             CCCcceEEEc-c----ccceeecchhhhchhhHHHHHhhcCCeEEEEecCCceEEEecccchH
Confidence            4578999874 2    010  111121212345667778999999999999999999998644


No 86 
>KOG2487 consensus RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB4 [Transcription; Replication, recombination and repair]
Probab=76.54  E-value=46  Score=37.07  Aligned_cols=55  Identities=20%  Similarity=0.189  Sum_probs=40.7

Q ss_pred             HHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHHhc
Q 001711          599 YKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSRDL  662 (1021)
Q Consensus       599 Y~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r~l  662 (1021)
                      |-+.--.+.+++|.||++....+   -..|...|..|||...+.+.      .+.|.+.|...+
T Consensus       185 ~MNciFaAqKq~I~Idv~~l~~~---s~~LqQa~D~TGG~YL~v~~------~~gLLqyLlt~~  239 (314)
T KOG2487|consen  185 YMNCIFAAQKQNIPIDVVSLGGD---SGFLQQACDITGGDYLHVEK------PDGLLQYLLTLL  239 (314)
T ss_pred             HHHHHHHHHhcCceeEEEEecCC---chHHHHHHhhcCCeeEecCC------cchHHHHHHHHh
Confidence            45666678899999999998877   34588999999999888764      234555555443


No 87 
>COG4867 Uncharacterized protein with a von Willebrand factor type A (vWA) domain [General function prediction only]
Probab=69.43  E-value=27  Score=40.88  Aligned_cols=160  Identities=16%  Similarity=0.248  Sum_probs=95.7

Q ss_pred             CeEEEEEecchhHHhhcHHHHHHH---HHHHHHhc-CCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCC
Q 001711          428 PLYFFLIDVSISAIRSGMLEVVAQ---TIKSCLDE-LPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVP  503 (1021)
Q Consensus       428 p~yvFvIDvS~~av~sG~l~~~~~---sI~~~L~~-Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~P  503 (1021)
                      ...+.++|+|++++-.|..--+++   +|...+.. .++   --+.||+|...-            +.+-+++       
T Consensus       464 aAvallvDtS~SM~~eGRw~PmKQtALALhHLv~TrfrG---D~l~~i~Fgr~A------------~~v~v~e-------  521 (652)
T COG4867         464 AAVALLVDTSFSMVMEGRWLPMKQTALALHHLVCTRFRG---DALQIIAFGRYA------------RTVTAAE-------  521 (652)
T ss_pred             cceeeeeeccHHHHHhccCCchHHHHHHHHHHHHhcCCC---cceEEEeccchh------------cccCHHH-------
Confidence            467889999999988885433333   33333332 333   358899986421            1111111       


Q ss_pred             CCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcC---CEEEEEecCCCCC----Cccccccc
Q 001711          504 LPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLG---GKLLIFQNSLPSL----GVGCLKLR  576 (1021)
Q Consensus       504 l~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~G---GkIivF~sg~Pt~----GpG~L~~r  576 (1021)
                                         |..++...    ..++.+--||..|-.+++...   -.|++.+.|-||.    |-|...--
T Consensus       522 -------------------Lt~l~~v~----eqgTNlhhaL~LA~r~l~Rh~~~~~~il~vTDGePtAhle~~DG~~~~f  578 (652)
T COG4867         522 -------------------LTGLAGVY----EQGTNLHHALALAGRHLRRHAGAQPVVLVVTDGEPTAHLEDGDGTSVFF  578 (652)
T ss_pred             -------------------HhcCCCcc----ccccchHHHHHHHHHHHHhCcccCceEEEEeCCCccccccCCCCceEec
Confidence                               12233222    223456678888888887643   4788899999874    33322211


Q ss_pred             CCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeC
Q 001711          577 GDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYP  643 (1021)
Q Consensus       577 ~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~  643 (1021)
                           -|++|-+ .+.    ...+++ ..|.+.|+-|++|....+.-=..-+..+++.|+|.+|+-+
T Consensus       579 -----~yp~DP~-t~~----~Tvr~~-d~~~r~G~q~t~FrLg~DpgL~~Fv~qva~rv~G~vv~pd  634 (652)
T COG4867         579 -----DYPPDPR-TIA----HTVRGF-DDMARLGAQVTIFRLGSDPGLARFIDQVARRVQGRVVVPD  634 (652)
T ss_pred             -----CCCCChh-HHH----HHHHHH-HHHHhccceeeEEeecCCHhHHHHHHHHHHHhCCeEEecC
Confidence                 2333322 111    112233 4589999999999998876545557889999999999643


No 88 
>PF11265 Med25_VWA:  Mediator complex subunit 25 von Willebrand factor type A;  InterPro: IPR021419  The overall function of the full-length Med25 is efficiently to coordinate the transcriptional activation of RAR/RXR (retinoic acid receptor/retinoic X receptor) in higher eukaryotic cells. Human Med25 consists of several domains with different binding properties, the N-terminal, VWA domain which is this one, an SD2 domain from residues 229-381, a PTOV(B) or ACID domain from 395-545, an SD2 domain from residues 564-645 and a C-terminal NR box-containing domain (646-650) from 646-747. This VWA or von Willebrand factor type A domain when bound to RAR and the histone acetyltransferase CBP is responsible for recruiting Med1 to the rest of the Mediator complex []. 
Probab=67.17  E-value=2e+02  Score=31.61  Aligned_cols=103  Identities=16%  Similarity=0.138  Sum_probs=62.9

Q ss_pred             HHHHHHHHhhCCCcccCCCCcccc-hHHHHHHHHHHHHhc-------C-----CEEEEEecCCCCCCcccccccCCcCcc
Q 001711          516 RSVVDTLLDSLPSMFQDNMNVESA-FGPALKAAFMVMSRL-------G-----GKLLIFQNSLPSLGVGCLKLRGDDLRV  582 (1021)
Q Consensus       516 ~~~I~~lLd~Lp~~f~~~~~~~~a-lG~AL~aA~~lL~~~-------G-----GkIivF~sg~Pt~GpG~L~~r~~~~r~  582 (1021)
                      -+.+.+.|++|+  |..+.-.+.| +.-+|.+|+.++...       +     -+.|+..+++|..=|    ..      
T Consensus        89 ~~~fl~~L~~I~--f~GGG~e~~a~iaEGLa~AL~~fd~~~~~r~~~~~~~~~khcILI~nSpP~~~p----~~------  156 (226)
T PF11265_consen   89 PQKFLQWLDAIQ--FSGGGFESCAAIAEGLAEALQCFDDFKQMRQQQQQTDVQKHCILICNSPPYRLP----VN------  156 (226)
T ss_pred             HHHHHHHHHccC--cCCCCcccchhHHHHHHHHHHHhcchhhhccccCcccccceEEEEeCCCCcccc----cc------
Confidence            345566778887  3344444444 777888888877631       1     234555555553211    11      


Q ss_pred             cCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEE
Q 001711          583 YGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYY  641 (1021)
Q Consensus       583 ~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~  641 (1021)
                          +..+   -....++++|..+.+++|.+.++.-    --+..+..|-+..+|....
T Consensus       157 ----~~~~---~~~~~~d~la~~~~~~~I~LSiisP----rklP~l~~Lfeka~~~~~~  204 (226)
T PF11265_consen  157 ----ECPQ---YSGKTCDQLAVLISERNISLSIISP----RKLPSLRSLFEKAKGNPRA  204 (226)
T ss_pred             ----CCCc---ccCCCHHHHHHHHHhcCceEEEEcC----ccCHHHHHHHHhcCCCccc
Confidence                1111   1335678999999999999999863    2356677777777776654


No 89 
>PF09967 DUF2201:  VWA-like domain (DUF2201);  InterPro: IPR018698  This family of various hypothetical bacterial proteins has no known function. 
Probab=63.40  E-value=12  Score=36.96  Aligned_cols=93  Identities=18%  Similarity=0.212  Sum_probs=58.5

Q ss_pred             EEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCccce
Q 001711          431 FFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDLLV  510 (1021)
Q Consensus       431 vFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~lLv  510 (1021)
                      +++||+|.+.-+ ..++.++..|...++...    .+|-+|.||..|+--.           .+.+.++           
T Consensus         2 ~vaiDtSGSis~-~~l~~fl~ev~~i~~~~~----~~v~vi~~D~~v~~~~-----------~~~~~~~-----------   54 (126)
T PF09967_consen    2 VVAIDTSGSISD-EELRRFLSEVAGILRRFP----AEVHVIQFDAEVQDVQ-----------VFRSLED-----------   54 (126)
T ss_pred             EEEEECCCCCCH-HHHHHHHHHHHHHHHhCC----CCEEEEEECCEeeeee-----------EEecccc-----------
Confidence            689999997633 357777888888887762    5699999999887321           1111000           


Q ss_pred             ehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcCCEEEEEecCCC
Q 001711          511 NLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLGGKLLIFQNSLP  566 (1021)
Q Consensus       511 ~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~GGkIivF~sg~P  566 (1021)
                                 .+..+    .-....++++.++++.+.+.. ....-|++|+.|-.
T Consensus        55 -----------~~~~~----~~~GgGGTdf~pvf~~~~~~~-~~~~~vi~fTDg~~   94 (126)
T PF09967_consen   55 -----------ELRDI----KLKGGGGTDFRPVFEYLEENR-PRPSVVIYFTDGEG   94 (126)
T ss_pred             -----------ccccc----ccCCCCCCcchHHHHHHHhcC-CCCCEEEEEeCCCC
Confidence                       00011    112456788888888876543 34566778998654


No 90 
>COG5242 TFB4 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB4 [Transcription / DNA replication, recombination, and repair]
Probab=61.13  E-value=1.4e+02  Score=32.46  Aligned_cols=177  Identities=19%  Similarity=0.272  Sum_probs=93.0

Q ss_pred             CCCeEEEEEecchhHH----hhcHHHHHHHHHHHHHhc-CCCCCCceEEEEE-EcCeEEEEecCCCCCCcceeeccccc-
Q 001711          426 MPPLYFFLIDVSISAI----RSGMLEVVAQTIKSCLDE-LPGFPRTQIGFIT-FDSTIHFYNMKSSLTQPQMMVISDLD-  498 (1021)
Q Consensus       426 ~pp~yvFvIDvS~~av----~sG~l~~~~~sI~~~L~~-Lp~~~rt~VgiIT-Fds~V~fynl~~~~~~p~mlVvsDld-  498 (1021)
                      .|...+.+||.--...    +.|-..-+.+.|.--|.. |.-..+-||++|. |+..+.+.--+...    .+.+++.| 
T Consensus        19 spslL~viid~~p~~W~~~~ek~~~~kvl~di~VFLNAhlaf~~~NrVaVva~~s~~~~yLypss~s----~~k~se~e~   94 (296)
T COG5242          19 SPSLLFVIIDLEPENWELTTEKGSRDKVLNDIVVFLNAHLAFSRNNRVAVVAGYSQGKTYLYPSSES----ALKASESEN   94 (296)
T ss_pred             CCceEEEEEecChhhcccccccccHHHHHHHHHHHHHHHHhhccCCeEEEEEeccCceEEeccCcch----hhhhhcccC
Confidence            3555666788755442    345455555555555553 3322335787764 66666543222211    12233332 


Q ss_pred             ----cccCCCCCccceehhhhHHHHHHHHhhCCCcccCC--CCcccchHHHHHHHHHHHHh------cCCEEEEEecCCC
Q 001711          499 ----DIFVPLPDDLLVNLSESRSVVDTLLDSLPSMFQDN--MNVESAFGPALKAAFMVMSR------LGGKLLIFQNSLP  566 (1021)
Q Consensus       499 ----d~f~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~--~~~~~alG~AL~aA~~lL~~------~GGkIivF~sg~P  566 (1021)
                          |+|.-     +-++      =+.+++.|-..++..  .....-+|-|+.+++.+..+      .-.||++|+.+  
T Consensus        95 tr~sd~yrr-----fr~v------de~~i~eiyrl~e~~~k~sqr~~v~gams~glay~n~~~~e~slkSriliftls--  161 (296)
T COG5242          95 TRNSDMYRR-----FRNV------DETDITEIYRLIEHPHKNSQRYDVGGAMSLGLAYCNHRDEETSLKSRILIFTLS--  161 (296)
T ss_pred             ccchhhhhh-----hccc------chHHHHHHHHHHhCcccccceeehhhhhhhhHHHHhhhcccccccceEEEEEec--
Confidence                12211     1111      122333333322222  22334678899999888755      34899999872  


Q ss_pred             CCCcccccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCC
Q 001711          567 SLGVGCLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPS  644 (1021)
Q Consensus       567 t~GpG~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~  644 (1021)
                        |      |+.         ..+|.     =|-+-.-.+.+.||-||+|-+...   -..|...+..|||.....++
T Consensus       162 --G------~d~---------~~qYi-----p~mnCiF~Aqk~~ipI~v~~i~g~---s~fl~Q~~daTgG~Yl~ve~  214 (296)
T COG5242         162 --G------RDR---------KDQYI-----PYMNCIFAAQKFGIPISVFSIFGN---SKFLLQCCDATGGDYLTVED  214 (296)
T ss_pred             --C------chh---------hhhhc-----hhhhheeehhhcCCceEEEEecCc---cHHHHHHhhccCCeeEeecC
Confidence              2      211         11111     122223345678999999976654   34578899999998777664


No 91 
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=58.82  E-value=5.7e+02  Score=33.99  Aligned_cols=9  Identities=22%  Similarity=0.552  Sum_probs=3.9

Q ss_pred             CccceEEcc
Q 001711          354 FICRTYVNP  362 (1021)
Q Consensus       354 ~rCrAYiNP  362 (1021)
                      .||.+-.++
T Consensus       960 ~r~~a~~~~  968 (1049)
T KOG0307|consen  960 QRCSARTDP  968 (1049)
T ss_pred             HHhhccCCH
Confidence            444444443


No 92 
>PF10138 vWA-TerF-like:  vWA found in TerF C terminus ;  InterPro: IPR019303 This entry represents the N-terminal domain of a family of proteins that confer resistance to the metalloid element tellurium and its salts. 
Probab=54.31  E-value=2.6e+02  Score=30.10  Aligned_cols=144  Identities=17%  Similarity=0.247  Sum_probs=84.8

Q ss_pred             EEEEEecchhH---HhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCC
Q 001711          430 YFFLIDVSISA---IRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPD  506 (1021)
Q Consensus       430 yvFvIDvS~~a---v~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~  506 (1021)
                      ..+|||.|.++   -++|.++.+.+.|...=..+-++  ..|=+.+|++..+=              +.+          
T Consensus         4 V~LVLD~SGSM~~~yk~G~vQ~~~Er~lalA~~~DdD--G~i~v~~Fs~~~~~--------------~~~----------   57 (200)
T PF10138_consen    4 VYLVLDISGSMRPLYKDGTVQRVVERILALAAQFDDD--GEIDVWFFSTEFDR--------------LPD----------   57 (200)
T ss_pred             EEEEEeCCCCCchhhhCccHHHHHHHHHHHHhhcCCC--CceEEEEeCCCCCc--------------CCC----------
Confidence            56899999987   67788888888888776666544  44555555543221              111          


Q ss_pred             ccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-C---CEEEEEec-CCCCCCcccccccCCcCc
Q 001711          507 DLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-G---GKLLIFQN-SLPSLGVGCLKLRGDDLR  581 (1021)
Q Consensus       507 ~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-G---GkIivF~s-g~Pt~GpG~L~~r~~~~r  581 (1021)
                         +.+.+....|..+...+..+   .....+...+||+.++.--... +   .-+++|++ |-|+       .+     
T Consensus        58 ---vt~~~~~~~v~~~~~~~~~~---~~~G~t~y~~vm~~v~~~y~~~~~~~~P~~VlFiTDG~~~-------~~-----  119 (200)
T PF10138_consen   58 ---VTLDNYEGYVDELHAGLPDW---GRMGGTNYAPVMEDVLDHYFKREPSDAPALVLFITDGGPD-------DR-----  119 (200)
T ss_pred             ---cCHHHHHHHHHHHhcccccc---CCCCCcchHHHHHHHHHHHhhcCCCCCCeEEEEEecCCcc-------ch-----
Confidence               12334445555555444222   2234477889999988776532 1   24555554 3221       11     


Q ss_pred             ccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccc
Q 001711          582 VYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKY  634 (1021)
Q Consensus       582 ~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~  634 (1021)
                                     .--+++-.+++...|-.-..-++.+..++  |..|-.+
T Consensus       120 ---------------~~~~~~i~~as~~pifwqFVgiG~~~f~f--L~kLD~l  155 (200)
T PF10138_consen  120 ---------------RAIEKLIREASDEPIFWQFVGIGDSNFGF--LEKLDDL  155 (200)
T ss_pred             ---------------HHHHHHHHhccCCCeeEEEEEecCCcchH--HHHhhcc
Confidence                           11245566667777888877777776554  6666664


No 93 
>PF05762 VWA_CoxE:  VWA domain containing CoxE-like protein;  InterPro: IPR008912 This group of proteins contains a VWA type domain and the function of this family is unknown. It is found as part of a CO oxidising (Cox) system operon in several bacteria [].
Probab=44.34  E-value=32  Score=37.27  Aligned_cols=102  Identities=16%  Similarity=0.244  Sum_probs=52.8

Q ss_pred             CCCC-eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCC
Q 001711          425 PMPP-LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVP  503 (1021)
Q Consensus       425 p~pp-~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~P  503 (1021)
                      +..+ -+|+|+|||.++..  +...++..+..+....     .+|.++.|++.|.  .+.               +.+. 
T Consensus        54 ~~~~~~lvvl~DvSGSM~~--~s~~~l~~~~~l~~~~-----~~~~~f~F~~~l~--~vT---------------~~l~-  108 (222)
T PF05762_consen   54 PRKPRRLVVLCDVSGSMAG--YSEFMLAFLYALQRQF-----RRVRVFVFSTRLT--EVT---------------PLLR-  108 (222)
T ss_pred             cCCCccEEEEEeCCCChHH--HHHHHHHHHHHHHHhC-----CCEEEEEEeeehh--hhh---------------hhhc-
Confidence            3444 89999999998853  3333333333333222     2577778876543  111               1110 


Q ss_pred             CCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---CCEEEEEecC
Q 001711          504 LPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---GGKLLIFQNS  564 (1021)
Q Consensus       504 l~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---GGkIivF~sg  564 (1021)
                        .      .+-.+.+..+......     -..++.+|.||+.+...+...   +-.|+++.++
T Consensus       109 --~------~~~~~~l~~~~~~~~~-----~~GgTdi~~aL~~~~~~~~~~~~~~t~vvIiSDg  159 (222)
T PF05762_consen  109 --R------RDPEEALARLSALVQS-----FGGGTDIGQALREFLRQYARPDLRRTTVVIISDG  159 (222)
T ss_pred             --c------CCHHHHHHHHHhhccC-----CCCccHHHHHHHHHHHHhhcccccCcEEEEEecc
Confidence              0      0111223333222221     345677899999888877632   3456666554


No 94 
>KOG2893 consensus Zn finger protein [General function prediction only]
Probab=40.44  E-value=1.3e+02  Score=32.81  Aligned_cols=10  Identities=30%  Similarity=0.464  Sum_probs=4.7

Q ss_pred             ehhhhHHHHH
Q 001711          511 NLSESRSVVD  520 (1021)
Q Consensus       511 ~l~es~~~I~  520 (1021)
                      .|+|.|..+-
T Consensus       323 sleerraqlp  332 (341)
T KOG2893|consen  323 SLEERRAQLP  332 (341)
T ss_pred             cHHHHhhhhh
Confidence            3455555443


No 95 
>PF02905 EBV-NA1:  Epstein Barr virus nuclear antigen-1, DNA-binding domain;  InterPro: IPR004186 The Epstein-Barr virus (strain GD1) nuclear antigen 1 (EBNA1) binds to and activates DNA replication from the latent origin of replication. The crystal structure of the DNA-binding and dimerization domains were solved [], and it was found that EBNA1 appears to bind DNA via two independent regions, the core and the flanking DNA-binding domains. This DNA-binding domain has a ferredoxin-like fold.; GO: 0003677 DNA binding, 0003688 DNA replication origin binding, 0006260 DNA replication, 0006275 regulation of DNA replication, 0045893 positive regulation of transcription, DNA-dependent, 0042025 host cell nucleus; PDB: 1B3T_B 1VHI_B.
Probab=32.45  E-value=71  Score=31.52  Aligned_cols=33  Identities=24%  Similarity=0.338  Sum_probs=24.5

Q ss_pred             HHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEE
Q 001711          446 LEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIH  478 (1021)
Q Consensus       446 l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~  478 (1021)
                      .+.++++|+.-+..-|. ..+++|.+++||+.|-
T Consensus       112 Ae~vkDAi~Dyi~T~P~PT~~~~Vt~~~Fd~~V~  145 (146)
T PF02905_consen  112 AECVKDAIRDYIMTRPQPTCNTQVTVCSFDDGVM  145 (146)
T ss_dssp             HHHHHHHHHHHHCTS-TTGGGEEEEEEEEEEEE-
T ss_pred             HHHHHHHHHHHhcCCCCCCcceEEEEEeCCCCCc
Confidence            36788899888877653 3458999999998764


No 96 
>KOG1923 consensus Rac1 GTPase effector FRL [Signal transduction mechanisms; Cytoskeleton]
Probab=31.72  E-value=1.5e+02  Score=37.59  Aligned_cols=6  Identities=33%  Similarity=0.622  Sum_probs=2.6

Q ss_pred             EEEEec
Q 001711          477 IHFYNM  482 (1021)
Q Consensus       477 V~fynl  482 (1021)
                      ||-++|
T Consensus       465 ih~~dL  470 (830)
T KOG1923|consen  465 IHPLDL  470 (830)
T ss_pred             hhhccc
Confidence            444444


No 97 
>KOG4672 consensus Uncharacterized conserved low complexity protein [Function unknown]
Probab=31.48  E-value=2.7e+02  Score=32.87  Aligned_cols=6  Identities=67%  Similarity=1.370  Sum_probs=2.3

Q ss_pred             CCCCCC
Q 001711          150 PMGSPV  155 (1021)
Q Consensus       150 ~~~~~~  155 (1021)
                      +||++|
T Consensus       381 p~Gp~p  386 (487)
T KOG4672|consen  381 PMGPPP  386 (487)
T ss_pred             CCCCCC
Confidence            344333


No 98 
>PF10058 DUF2296:  Predicted integral membrane metal-binding protein (DUF2296);  InterPro: IPR019273  This domain, found mainly in the eukaryotic lunapark proteins, has no known function []. 
Probab=25.72  E-value=55  Score=27.74  Aligned_cols=13  Identities=38%  Similarity=0.912  Sum_probs=11.0

Q ss_pred             CceEEEcCCCCCC
Q 001711          370 GRKWRCNICALLN  382 (1021)
Q Consensus       370 g~~W~Cn~C~~~N  382 (1021)
                      .-+|+|..|+..|
T Consensus        42 ~i~y~C~~Cg~~N   54 (54)
T PF10058_consen   42 EIQYRCPYCGALN   54 (54)
T ss_pred             ceEEEcCCCCCcC
Confidence            4589999999887


No 99 
>KOG1985 consensus Vesicle coat complex COPII, subunit SEC24/subunit SFB2 [Intracellular trafficking, secretion, and vesicular transport]
Probab=25.13  E-value=1.3e+03  Score=30.11  Aligned_cols=24  Identities=25%  Similarity=0.433  Sum_probs=15.6

Q ss_pred             EEccceeEecCCceEEEcCCCCC-CC
Q 001711          359 YVNPYVTFTDAGRKWRCNICALL-ND  383 (1021)
Q Consensus       359 YiNPf~~f~~~g~~W~Cn~C~~~-N~  383 (1021)
                      ++++-+.+.. +.--+|.-|.+. |.
T Consensus       206 d~~~~p~~~~-~~IvRCr~CRtYiNP  230 (887)
T KOG1985|consen  206 DIDPLPVITS-TLIVRCRRCRTYINP  230 (887)
T ss_pred             ccCCCCcccC-CceeeehhhhhhcCC
Confidence            5555555544 568889999864 53


No 100
>PF12257 DUF3608:  Protein of unknown function (DUF3608);  InterPro: IPR022046  This domain family is found in eukaryotes, and is approximately 280 amino acids in length. The family is found in association with PF00610 from PFAM. 
Probab=23.95  E-value=8.3e+02  Score=27.77  Aligned_cols=28  Identities=11%  Similarity=0.113  Sum_probs=22.4

Q ss_pred             cHHHHHHHHHHhhCCcEEEEEEecCCCc
Q 001711          596 DPFYKQMAADLTKFQIAVNVYAFSDKYT  623 (1021)
Q Consensus       596 ~~fY~~La~~~~~~gIsVDlF~~s~~~~  623 (1021)
                      .+.++-..+++...||++|+.+.+..-.
T Consensus       246 ~~ll~~T~~rl~~~gi~~DlIcL~~~PL  273 (281)
T PF12257_consen  246 YDLLRLTTQRLLDNGIGIDLICLSKPPL  273 (281)
T ss_pred             HHHHHHHHHHHHhcCccEEEEEcCCCCc
Confidence            3456778889999999999999876543


No 101
>COG5415 Predicted integral membrane metal-binding protein [General function prediction only]
Probab=23.51  E-value=34  Score=36.58  Aligned_cols=33  Identities=15%  Similarity=0.215  Sum_probs=25.4

Q ss_pred             CccceEEccceeEecC--------CceEEEcCCCCCCCCCc
Q 001711          354 FICRTYVNPYVTFTDA--------GRKWRCNICALLNDVPG  386 (1021)
Q Consensus       354 ~rCrAYiNPf~~f~~~--------g~~W~Cn~C~~~N~vP~  386 (1021)
                      ..=.|.|+|.|.+-.|        -..|+|.+|++.|+.+.
T Consensus       188 ~~~~alIC~~C~hhngl~~~~ek~~~efiC~~Cn~~n~~~~  228 (251)
T COG5415         188 SPFKALICPQCHHHNGLYRLAEKPIIEFICPHCNHKNDEVK  228 (251)
T ss_pred             CchhhhccccccccccccccccccchheecccchhhcCccc
Confidence            5566888888877654        34799999999997664


No 102
>COG1580 FliL Flagellar basal body-associated protein [Cell motility and secretion]
Probab=22.77  E-value=2.5e+02  Score=29.19  Aligned_cols=65  Identities=15%  Similarity=0.253  Sum_probs=42.7

Q ss_pred             CceeEEEEEEEEEecCCcEEEEEEeeeecccCCHHHHHHhcCH--hHHHHHHHHHHHHHHhc-CCHHHHHHHHHHHHHHH
Q 001711          721 TQTVYFQVALLYTASCGERRIRVHTLAAPVVSNLSDMYQQADT--GAIVSVFSRLAIEKTLS-HKLEDARNAVQLRLVKA  797 (1021)
Q Consensus       721 ~~~~~iQ~AllYTt~~GeRrIRV~Tl~lpvt~~l~~vf~s~D~--eai~~~laK~a~~~~l~-~~l~d~R~~l~~~lv~i  797 (1021)
                      ....|+|+++.|--.+              .....++=+.-..  ++++.+|+++.++.+.. .+.++.|+++.++|-.+
T Consensus        76 ~~~~~v~i~i~l~~~n--------------~~~~~el~~~~p~vrd~li~lfsskt~~eL~t~~Gke~Lk~ei~~~in~~  141 (159)
T COG1580          76 PKDRYVKIAITLEVAN--------------KALLEELEEKKPEVRDALLMLFSSKTAAELSTPEGKEKLKAEIKDRINTI  141 (159)
T ss_pred             CCcEEEEEEEEEeeCC--------------HHHHHHHHHhhHHHHHHHHHHHHhCCHHHhcCchhHHHHHHHHHHHHHHH
Confidence            3557777777765332              1112333333332  79999999999988877 67777888888887776


Q ss_pred             HH
Q 001711          798 LK  799 (1021)
Q Consensus       798 L~  799 (1021)
                      |.
T Consensus       142 L~  143 (159)
T COG1580         142 LK  143 (159)
T ss_pred             Hh
Confidence            63


No 103
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=21.81  E-value=47  Score=34.59  Aligned_cols=15  Identities=27%  Similarity=1.068  Sum_probs=11.7

Q ss_pred             CCceEEEcCCCCCCC
Q 001711          369 AGRKWRCNICALLND  383 (1021)
Q Consensus       369 ~g~~W~Cn~C~~~N~  383 (1021)
                      +|+.|+|..||+.-.
T Consensus       131 ~~~~~vC~vCGy~~~  145 (166)
T COG1592         131 EGKVWVCPVCGYTHE  145 (166)
T ss_pred             cCCEEEcCCCCCccc
Confidence            466899999998653


No 104
>KOG4368 consensus Predicted RNA binding protein, contains SWAP, RPR and G-patch domains [General function prediction only]
Probab=21.09  E-value=1.6e+03  Score=28.04  Aligned_cols=151  Identities=17%  Similarity=0.121  Sum_probs=0.0

Q ss_pred             CCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcccC
Q 001711           81 FNDPSVSSSPITYVPPTSGPF-QRFPTPQFPPVAQAPPVRGPPVGLPPVSHPIGQVPNPPVPLRAQPPPVPMGSPVQRAN  159 (1021)
Q Consensus        81 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  159 (1021)
                      +..||...|-.-..++.++++ |.+|..+       +.+...+-+++.+.+.++-..++.+++..-.-.    .|+    
T Consensus       291 ~~~~p~~GPgdH~h~~~~~p~dq~hpqA~-------~~~~~~prqpp~p~~~~~~P~~p~~~~~h~~~~----~pg----  355 (757)
T KOG4368|consen  291 TPPPPAPGPGPHDQIPPNKPFDQPHPVAP-------WGQQQPPEQPPYPHHQGGPPHCPPWNNSHEGRG----DPG----  355 (757)
T ss_pred             cCCCCCCCCCcccccCCCCCCCCCCCCCC-------CCCCCCccCCCCCCcccCCCCCCCCCcccccCC----CCC----


Q ss_pred             CCCCCCCCCCCCCCCCccCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC----CCCCCCCCCCCCCCCCCCCCCCCCCCC
Q 001711          160 FAPSGVNVPQPLSDSSFSASRPNSPPDSSYPFARPTPQQPLPGYVTTQP----NAVSQGPTMPSSFPSHPRSYVPPPPTS  235 (1021)
Q Consensus       160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~  235 (1021)
                      +.|+..+++.+       ....-+++...++...+.-++.++++..+.+    +.-++.++.+++|..-+.....+--+.
T Consensus       356 ~pGp~~~n~g~-------a~g~q~~~p~~~~~~q~p~~g~epp~~~q~~~~~~qq~~Q~~qp~hp~n~~ppgq~q~d~s~  428 (757)
T KOG4368|consen  356 WNGPWNNNPDA-------AWGSQFEGPWNSQHEQPPWGGGEPPFRMQGPFPPHQQHPQFNQPPHPFNRFPPRFMQDDFPP  428 (757)
T ss_pred             CCCCCCCCCCC-------CcccccCCccccccccCcccCCCCchhhcCcCchhhhccccCCCCCccccCChhhcccccCc


Q ss_pred             CCCCCCCCCCCCCCCCCC
Q 001711          236 ASSFPAHQGGYVPPGVQS  253 (1021)
Q Consensus       236 ~~~~~~~~~~~~~~~~~~  253 (1021)
                      ..++..+......+++..
T Consensus       429 ~~~~~~~p~~~~~~~p~~  446 (757)
T KOG4368|consen  429 RHPFERPPYPHRFDYPQG  446 (757)
T ss_pred             ccccccCccccccCCCCC


No 105
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=20.69  E-value=47  Score=21.86  Aligned_cols=12  Identities=25%  Similarity=0.642  Sum_probs=7.5

Q ss_pred             EEEcCCCCCCCC
Q 001711          373 WRCNICALLNDV  384 (1021)
Q Consensus       373 W~Cn~C~~~N~v  384 (1021)
                      |+|.+|+....-
T Consensus         1 ~~C~~C~~~~~~   12 (24)
T PF13894_consen    1 FQCPICGKSFRS   12 (24)
T ss_dssp             EE-SSTS-EESS
T ss_pred             CCCcCCCCcCCc
Confidence            789999886543


No 106
>COG3285 Predicted eukaryotic-type DNA primase [DNA replication, recombination, and repair]
Probab=20.28  E-value=4.2e+02  Score=30.21  Aligned_cols=15  Identities=13%  Similarity=0.040  Sum_probs=12.4

Q ss_pred             CccceEEccceeEec
Q 001711          354 FICRTYVNPYVTFTD  368 (1021)
Q Consensus       354 ~rCrAYiNPf~~f~~  368 (1021)
                      ++|-.++.++++-.+
T Consensus        66 Kha~~~~p~~v~~~~   80 (299)
T COG3285          66 KHAPRGAPPWVQTVR   80 (299)
T ss_pred             ccCCCCCCchheeee
Confidence            899999999987654


Done!