Query         001172
Match_columns 1133
No_of_seqs    394 out of 1255
Neff          4.2 
Searched_HMMs 46136
Date          Thu Mar 28 17:48:08 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/001172.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/001172hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd03772 MATH_HAUSP Herpesvirus  99.9 3.3E-23 7.1E-28  203.1  15.3  125   67-193     2-134 (137)
  2 cd03775 MATH_Ubp21p Ubiquitin-  99.9 3.5E-23 7.6E-28  202.7  13.8  121   69-190     2-134 (134)
  3 cd03774 MATH_SPOP Speckle-type  99.9 6.9E-22 1.5E-26  194.0  14.3  123   67-194     4-139 (139)
  4 cd03773 MATH_TRIM37 Tripartite  99.9 1.3E-21 2.9E-26  189.3  13.1  120   66-190     3-130 (132)
  5 cd03776 MATH_TRAF6 Tumor Necro  99.9 1.3E-21 2.7E-26  194.5  11.2  123   68-190     1-147 (147)
  6 cd00270 MATH_TRAF_C Tumor Necr  99.9 1.8E-21 3.8E-26  192.4  11.4  122   68-189     1-148 (149)
  7 COG5077 Ubiquitin carboxyl-ter  99.9 2.6E-22 5.6E-27  234.7   5.9  241   65-349    36-287 (1089)
  8 cd03781 MATH_TRAF4 Tumor Necro  99.8   7E-21 1.5E-25  191.5  13.1  122   68-189     1-153 (154)
  9 cd03780 MATH_TRAF5 Tumor Necro  99.8   3E-20 6.4E-25  187.0  12.8  122   68-189     1-147 (148)
 10 cd03777 MATH_TRAF3 Tumor Necro  99.8 4.5E-20 9.8E-25  192.1  13.7  126   65-190    36-184 (186)
 11 cd00121 MATH MATH (meprin and   99.8 1.1E-19 2.4E-24  168.5  14.6  123   68-190     1-126 (126)
 12 cd03779 MATH_TRAF1 Tumor Necro  99.8 1.1E-19 2.4E-24  182.9  12.3  122   68-189     1-146 (147)
 13 cd03771 MATH_Meprin Meprin fam  99.7 1.2E-17 2.7E-22  171.4  13.7  122   68-189     2-166 (167)
 14 PF00917 MATH:  MATH domain;  I  99.7 7.4E-18 1.6E-22  157.4   9.2  115   74-191     1-119 (119)
 15 cd03778 MATH_TRAF2 Tumor Necro  99.7 5.5E-16 1.2E-20  158.9  13.2  125   65-189    16-163 (164)
 16 smart00061 MATH meprin and TRA  99.6 5.2E-15 1.1E-19  132.8  10.6   93   70-168     2-95  (95)
 17 cd03783 MATH_Meprin_Alpha Mepr  99.3 5.3E-12 1.2E-16  129.9  10.7  121   69-189     3-166 (167)
 18 KOG1863 Ubiquitin carboxyl-ter  99.3 3.2E-12 6.9E-17  162.3   6.8  236   68-347    27-273 (1093)
 19 cd03782 MATH_Meprin_Beta Mepri  99.3 1.8E-11   4E-16  125.7  10.7  122   68-189     2-166 (167)
 20 KOG1987 Speckle-type POZ prote  97.7 0.00021 4.6E-09   78.6  10.4  116   70-194     6-128 (297)
 21 PHA03098 kelch-like protein; P  96.7 0.00032   7E-09   83.2  -1.1  134  264-419     7-154 (534)
 22 PHA02713 hypothetical protein;  96.6 0.00078 1.7E-08   81.5   1.1  134  266-423    25-176 (557)
 23 KOG1029 Endocytic adaptor prot  96.3   0.012 2.7E-07   72.2   8.9   38  399-436   314-352 (1118)
 24 KOG2412 Nuclear-export-signal   96.2   0.048   1E-06   65.4  13.0   53  416-468   211-264 (591)
 25 KOG4364 Chromatin assembly fac  96.2   0.014 3.1E-07   70.9   8.8    7  499-505   380-386 (811)
 26 KOG4364 Chromatin assembly fac  95.9   0.074 1.6E-06   65.0  12.9   14  493-506   378-391 (811)
 27 PTZ00121 MAEBL; Provisional     95.9   0.043 9.4E-07   71.2  11.4    6   30-35    575-580 (2084)
 28 PTZ00266 NIMA-related protein   95.8     0.1 2.2E-06   67.7  14.2    7  976-982   868-874 (1021)
 29 PTZ00121 MAEBL; Provisional     95.6   0.077 1.7E-06   69.1  11.7    8  272-279   986-993 (2084)
 30 KOG0163 Myosin class VI heavy   95.2   0.087 1.9E-06   65.0   9.9   32  419-450   928-959 (1259)
 31 PHA02790 Kelch-like protein; P  94.6    0.01 2.2E-07   70.7   0.3  136  262-420    18-172 (480)
 32 PTZ00266 NIMA-related protein   94.2    0.19   4E-06   65.3  10.0   14  967-980   893-906 (1021)
 33 PRK00247 putative inner membra  94.0    0.32   7E-06   57.9  10.8   12  326-337   256-267 (429)
 34 KOG2412 Nuclear-export-signal   93.2    0.64 1.4E-05   56.3  11.4   15  285-299    65-79  (591)
 35 KOG1029 Endocytic adaptor prot  92.9    0.37 8.1E-06   60.0   9.1    9  834-842   857-865 (1118)
 36 PF05262 Borrelia_P83:  Borreli  92.4    0.85 1.8E-05   55.3  11.1   17  227-243    81-97  (489)
 37 KOG4350 Uncharacterized conser  92.4    0.15 3.2E-06   59.7   4.7   79  266-349    44-133 (620)
 38 PF05262 Borrelia_P83:  Borreli  92.4    0.55 1.2E-05   56.8   9.5   11  133-143     8-18  (489)
 39 KOG0163 Myosin class VI heavy   92.3     1.6 3.5E-05   54.6  13.1   10  233-242   649-658 (1259)
 40 KOG0345 ATP-dependent RNA heli  92.3    0.19 4.2E-06   59.9   5.5   16  336-351   436-451 (567)
 41 KOG2072 Translation initiation  91.3     1.8 3.9E-05   54.8  12.3   22  384-405   743-764 (988)
 42 KOG4441 Proteins containing BT  91.3   0.061 1.3E-06   65.9   0.0  137  264-423    34-187 (571)
 43 KOG2002 TPR-containing nuclear  91.1     1.5 3.3E-05   56.3  11.6   15  381-395   783-797 (1018)
 44 KOG2891 Surface glycoprotein [  91.1     2.3 4.9E-05   48.1  11.7   27  449-475   387-414 (445)
 45 KOG2072 Translation initiation  90.4     2.7 5.8E-05   53.3  12.7   28  406-433   757-785 (988)
 46 KOG3054 Uncharacterized conser  89.8     1.2 2.5E-05   49.7   8.1   38  531-574   213-250 (299)
 47 PRK00247 putative inner membra  89.7     1.4   3E-05   52.8   9.3   15  285-300   112-126 (429)
 48 CHL00118 atpG ATP synthase CF0  89.6     3.3 7.1E-05   42.8  10.9   30  410-439    49-78  (156)
 49 KOG0297 TNF receptor-associate  89.4    0.23 5.1E-06   58.3   2.8   77   65-141   277-365 (391)
 50 TIGR03321 alt_F1F0_F0_B altern  89.3       3 6.5E-05   46.1  11.1   50  387-437    10-59  (246)
 51 PRK13428 F0F1 ATP synthase sub  89.1     2.5 5.4E-05   50.8  11.0   49  388-437     7-55  (445)
 52 PF07946 DUF1682:  Protein of u  89.0     1.5 3.2E-05   50.4   8.6    8  180-187   110-117 (321)
 53 PF00651 BTB:  BTB/POZ domain;   88.7   0.048   1E-06   50.8  -2.9   32  265-296     9-44  (111)
 54 KOG2002 TPR-containing nuclear  88.4     7.1 0.00015   50.6  14.5    9  212-220   639-647 (1018)
 55 PF07946 DUF1682:  Protein of u  88.3       1 2.2E-05   51.6   6.9   15  380-394   234-248 (321)
 56 PRK14474 F0F1 ATP synthase sub  87.7       4 8.6E-05   45.6  10.7   44  387-433    10-55  (250)
 57 PRK14472 F0F1 ATP synthase sub  87.5     5.3 0.00011   41.9  11.0   54  385-439    21-74  (175)
 58 PRK14471 F0F1 ATP synthase sub  87.5     5.4 0.00012   41.3  10.9   29  410-438    35-63  (164)
 59 PRK13453 F0F1 ATP synthase sub  87.3     5.6 0.00012   41.8  11.0   52  386-438    22-73  (173)
 60 PRK09174 F0F1 ATP synthase sub  87.3     4.7  0.0001   43.8  10.7   55  384-439    55-109 (204)
 61 PRK06231 F0F1 ATP synthase sub  87.1     5.6 0.00012   43.2  11.1   51  385-436    51-101 (205)
 62 PRK13454 F0F1 ATP synthase sub  87.0       5 0.00011   42.6  10.5   54  384-438    33-86  (181)
 63 PRK08475 F0F1 ATP synthase sub  86.7     5.8 0.00013   41.6  10.7   52  385-439    25-78  (167)
 64 PRK07352 F0F1 ATP synthase sub  86.3       7 0.00015   41.0  11.1   49  388-437    25-73  (174)
 65 KOG3809 Microtubule-binding pr  86.2     2.6 5.6E-05   50.2   8.5   12  383-394   101-112 (583)
 66 PRK13455 F0F1 ATP synthase sub  85.8     5.4 0.00012   42.2  10.0   48  389-439    33-83  (184)
 67 PRK07353 F0F1 ATP synthase sub  85.6     8.7 0.00019   38.6  10.9   31  409-439    31-61  (140)
 68 PRK13460 F0F1 ATP synthase sub  85.1     9.6 0.00021   40.0  11.4   53  385-438    19-71  (173)
 69 PRK14475 F0F1 ATP synthase sub  84.8     8.4 0.00018   40.3  10.7   31  409-439    36-66  (167)
 70 PRK06569 F0F1 ATP synthase sub  84.8      14  0.0003   39.0  12.2   58  385-443    13-70  (155)
 71 CHL00019 atpF ATP synthase CF0  84.6     8.9 0.00019   40.6  11.0   50  388-438    30-79  (184)
 72 PF06098 Radial_spoke_3:  Radia  84.4      11 0.00023   43.4  12.1   47  386-433   130-178 (291)
 73 PRK14473 F0F1 ATP synthase sub  84.1     9.4  0.0002   39.6  10.7   29  410-438    35-63  (164)
 74 KOG3054 Uncharacterized conser  84.0       4 8.7E-05   45.6   8.2   14  416-429   116-129 (299)
 75 PRK09510 tolA cell envelope in  83.1     6.8 0.00015   46.6  10.2    9  507-515   259-267 (387)
 76 KOG2891 Surface glycoprotein [  82.6     5.3 0.00011   45.3   8.5   19  154-174   123-141 (445)
 77 PRK08476 F0F1 ATP synthase sub  82.5      14 0.00029   37.9  10.9   30  409-438    33-62  (141)
 78 COG3064 TolA Membrane protein   82.2     4.6  0.0001   46.6   8.0   11  417-427   100-110 (387)
 79 PF09727 CortBP2:  Cortactin-bi  82.2      12 0.00025   40.9  10.6   38  379-432    90-127 (192)
 80 PRK13461 F0F1 ATP synthase sub  82.1      14  0.0003   38.1  11.0   47  386-433     9-55  (159)
 81 PRK06568 F0F1 ATP synthase sub  81.9      12 0.00027   39.3  10.5   49  388-439    10-60  (154)
 82 KOG2357 Uncharacterized conser  81.7      11 0.00025   44.9  11.2   26  165-190   189-219 (440)
 83 PF07888 CALCOCO1:  Calcium bin  80.9      67  0.0015   40.1  17.6   63  457-520   429-494 (546)
 84 COG3064 TolA Membrane protein   79.8     6.6 0.00014   45.3   8.2   12  463-474   178-189 (387)
 85 TIGR01144 ATP_synt_b ATP synth  79.7      19 0.00041   36.5  10.8   36  393-433    10-45  (147)
 86 COG4499 Predicted membrane pro  79.7     3.4 7.4E-05   48.7   6.1   19   87-105    48-71  (434)
 87 KOG0742 AAA+-type ATPase [Post  78.7      21 0.00046   43.1  12.0   18  285-302    10-27  (630)
 88 PRK05759 F0F1 ATP synthase sub  78.0      23  0.0005   36.1  10.9   24  410-433    31-54  (156)
 89 smart00225 BTB Broad-Complex,   77.8    0.46   1E-05   41.3  -1.2   75  268-350     1-86  (90)
 90 KOG2963 RNA-binding protein re  77.6     3.5 7.5E-05   48.0   5.3   10  233-242   207-216 (405)
 91 KOG4848 Extracellular matrix-a  77.5      20 0.00044   39.0  10.5   17  440-456   198-214 (225)
 92 KOG1103 Predicted coiled-coil   77.1      11 0.00023   44.1   8.9   17  575-591   316-332 (561)
 93 PF10211 Ax_dynein_light:  Axon  76.3      43 0.00094   36.1  12.8   46  379-424    76-121 (189)
 94 KOG3915 Transcription regulato  75.5      22 0.00047   43.1  11.0   16  404-419   513-528 (641)
 95 cd02666 Peptidase_C19J A subfa  75.4    0.96 2.1E-05   52.4   0.2   34  213-255     1-34  (343)
 96 KOG4786 Ubinuclein, nuclear pr  75.1     2.3   5E-05   53.0   3.2   79  401-479   673-754 (1136)
 97 KOG0556 Aspartyl-tRNA syntheta  73.6     3.7   8E-05   48.8   4.2   20  502-522    68-87  (533)
 98 KOG1924 RhoA GTPase effector D  73.4     9.7 0.00021   48.4   7.8   19  389-411   975-993 (1102)
 99 KOG2357 Uncharacterized conser  73.1      19  0.0004   43.2   9.7   26  318-345   262-293 (440)
100 KOG4691 Uncharacterized conser  72.9      24 0.00052   38.6   9.6   23  428-450   122-144 (227)
101 TIGR02794 tolA_full TolA prote  72.7      18 0.00039   42.5   9.5    7  400-406    95-101 (346)
102 PRK14474 F0F1 ATP synthase sub  72.4      49  0.0011   37.2  12.5   20  388-407    15-34  (250)
103 KOG4031 Vesicle coat protein c  72.3      14 0.00031   40.6   7.9   24  413-436   112-136 (216)
104 PF09756 DDRGK:  DDRGK domain;   71.4     1.3 2.8E-05   47.7   0.0   18  451-468    37-54  (188)
105 PRK09173 F0F1 ATP synthase sub  71.0      40 0.00087   34.8  10.7   30  409-438    28-57  (159)
106 PF06936 Selenoprotein_S:  Sele  70.1      25 0.00055   38.2   9.2   27  381-407    32-58  (190)
107 PTZ00234 variable surface prot  70.1     6.1 0.00013   47.6   5.1   27  825-851   341-375 (433)
108 PF15236 CCDC66:  Coiled-coil d  69.9      21 0.00046   37.8   8.4   23  417-439    80-102 (157)
109 COG5269 ZUO1 Ribosome-associat  69.8      31 0.00067   39.5  10.0   59  368-428   184-243 (379)
110 PRK09174 F0F1 ATP synthase sub  69.5      68  0.0015   35.1  12.5   22  389-410    64-85  (204)
111 PF06637 PV-1:  PV-1 protein (P  69.2      46   0.001   39.7  11.6   73  395-467   286-364 (442)
112 CHL00118 atpG ATP synthase CF0  68.8      51  0.0011   34.2  10.9   10  340-349    38-47  (156)
113 TIGR03321 alt_F1F0_F0_B altern  68.2      69  0.0015   35.6  12.4   13  389-401    16-28  (246)
114 KOG0112 Large RNA-binding prot  68.1       6 0.00013   50.8   4.6   14  500-513   782-795 (975)
115 COG0711 AtpF F0F1-type ATP syn  68.0      85  0.0018   33.0  12.4   30  408-437    31-60  (161)
116 PRK13454 F0F1 ATP synthase sub  66.9      37 0.00081   36.2   9.7   26  386-411    39-64  (181)
117 KOG2192 PolyC-binding hnRNP-K   66.8      27 0.00058   39.8   8.8  102  978-1091  183-288 (390)
118 PF00430 ATP-synt_B:  ATP synth  66.1      18  0.0004   35.4   6.8   35  392-431    13-47  (132)
119 PF06098 Radial_spoke_3:  Radia  65.6      38 0.00083   39.1  10.0   19  404-422   158-176 (291)
120 PF06658 DUF1168:  Protein of u  64.9      44 0.00096   35.0   9.4   19  440-458    85-103 (142)
121 PRK13460 F0F1 ATP synthase sub  64.6 1.1E+02  0.0023   32.3  12.4   25  386-410    24-48  (173)
122 PF06936 Selenoprotein_S:  Sele  64.2      27 0.00059   38.0   8.1   13  402-414    76-88  (190)
123 PRK14472 F0F1 ATP synthase sub  64.0 1.1E+02  0.0024   32.2  12.4   17  390-406    30-46  (175)
124 PRK13461 F0F1 ATP synthase sub  63.5 1.2E+02  0.0027   31.3  12.5   20  389-408    16-35  (159)
125 KOG3915 Transcription regulato  62.6      68  0.0015   39.2  11.4   16  417-432   535-550 (641)
126 KOG0742 AAA+-type ATPase [Post  62.5      77  0.0017   38.6  11.8    7  339-345    76-82  (630)
127 PRK13453 F0F1 ATP synthase sub  62.5 1.2E+02  0.0027   32.0  12.4   24  385-408    25-48  (173)
128 KOG2505 Ankyrin repeat protein  62.1      14  0.0003   45.2   5.8   14  109-122   177-190 (591)
129 KOG3654 Uncharacterized CH dom  62.0      16 0.00034   44.5   6.3   26  217-243   190-215 (708)
130 PRK00409 recombination and DNA  61.5      46   0.001   43.1  10.8   16  287-302   410-425 (782)
131 PF08229 SHR3_chaperone:  ER me  61.4      10 0.00022   41.4   4.3   13  415-427   160-172 (196)
132 KOG2689 Predicted ubiquitin re  61.2      15 0.00032   42.0   5.6   10  410-419    99-108 (290)
133 KOG2341 TATA box binding prote  60.6      11 0.00023   46.7   4.8   33  393-425   408-440 (563)
134 PF11208 DUF2992:  Protein of u  60.4      26 0.00057   36.1   6.8   45  413-457    85-129 (132)
135 PF10147 CR6_interact:  Growth   60.2 1.7E+02  0.0036   32.8  13.2   14  440-453   187-200 (217)
136 KOG1363 Predicted regulator of  59.8      25 0.00054   42.9   7.6   11  233-243   146-156 (460)
137 PF08229 SHR3_chaperone:  ER me  59.7     7.6 0.00017   42.3   3.0   17  407-423   156-172 (196)
138 PF10500 SR-25:  Nuclear RNA-sp  59.6      11 0.00025   41.7   4.3    9  498-506   156-164 (225)
139 KOG0949 Predicted helicase, DE  57.8      22 0.00047   46.8   6.8   17  971-987  1279-1295(1330)
140 KOG2341 TATA box binding prote  57.1      32  0.0007   42.8   7.9   27  418-444   430-456 (563)
141 KOG0783 Uncharacterized conser  56.8     6.3 0.00014   50.3   2.0   49  337-394   791-844 (1267)
142 KOG3634 Troponin [Cytoskeleton  56.4      22 0.00047   41.6   6.0   31  420-450    85-115 (361)
143 KOG4661 Hsp27-ERE-TATA-binding  56.0      56  0.0012   40.7   9.4   25  459-484   670-694 (940)
144 PRK00409 recombination and DNA  55.5      59  0.0013   42.1  10.3   15   89-103   188-202 (782)
145 PRK08475 F0F1 ATP synthase sub  55.4 1.2E+02  0.0026   32.0  10.9   10  340-349    38-47  (167)
146 KOG0345 ATP-dependent RNA heli  55.2      18 0.00039   44.2   5.3   41  381-424   451-495 (567)
147 PF04696 Pinin_SDK_memA:  pinin  54.7 1.7E+02  0.0036   30.1  11.4   10  385-394     8-17  (131)
148 COG4499 Predicted membrane pro  54.3      12 0.00026   44.4   3.6   12  414-425   365-376 (434)
149 KOG3859 Septins (P-loop GTPase  54.0      72  0.0016   37.1   9.4   15  405-419   318-332 (406)
150 PRK13428 F0F1 ATP synthase sub  53.8      88  0.0019   38.0  10.8   19  389-407    12-30  (445)
151 PF12037 DUF3523:  Domain of un  53.6 1.6E+02  0.0034   34.1  12.1   12  405-416   109-120 (276)
152 TIGR01069 mutS2 MutS2 family p  53.6   1E+02  0.0022   40.0  11.9   15   89-103   183-197 (771)
153 PRK07352 F0F1 ATP synthase sub  53.5 2.2E+02  0.0047   30.1  12.4   23  385-407    26-48  (174)
154 PHA00451 protein kinase         53.3      32  0.0007   39.5   6.6   14  223-236   141-154 (362)
155 PRK14473 F0F1 ATP synthase sub  53.1 2.3E+02   0.005   29.5  12.5   22  389-410    19-40  (164)
156 PRK14471 F0F1 ATP synthase sub  53.1 1.4E+02  0.0029   31.1  10.8   20  391-410    21-40  (164)
157 KOG3654 Uncharacterized CH dom  52.4      26 0.00056   42.9   5.9   15  453-467   430-444 (708)
158 PRK06231 F0F1 ATP synthase sub  52.3 1.3E+02  0.0029   32.8  10.9   14  337-350    61-74  (205)
159 KOG1150 Predicted molecular ch  51.6      36 0.00078   37.6   6.4   42  440-481   168-212 (250)
160 KOG2689 Predicted ubiquitin re  51.5      79  0.0017   36.4   9.2   27  406-432    99-125 (290)
161 KOG1363 Predicted regulator of  51.4      48   0.001   40.5   8.1    9  248-256   169-177 (460)
162 PF01086 Clathrin_lg_ch:  Clath  50.8      20 0.00043   39.7   4.5   14  415-428   120-133 (225)
163 TIGR01144 ATP_synt_b ATP synth  50.0 2.9E+02  0.0063   28.0  12.4    6  396-401    21-26  (147)
164 KOG4001 Axonemal dynein light   49.9 1.1E+02  0.0024   34.0   9.6   69  404-472   166-235 (259)
165 PRK07353 F0F1 ATP synthase sub  49.9 1.9E+02   0.004   29.1  10.9   18  392-409    19-36  (140)
166 KOG1962 B-cell receptor-associ  49.8 1.3E+02  0.0029   33.6  10.4   22  396-418   110-131 (216)
167 PRK08476 F0F1 ATP synthase sub  49.7 1.8E+02  0.0039   29.9  10.9   15  392-406    21-35  (141)
168 KOG2007 Cysteinyl-tRNA synthet  49.6      71  0.0015   39.6   9.0   46  391-437   491-536 (586)
169 KOG4715 SWI/SNF-related matrix  49.5      86  0.0019   36.7   9.1   41  501-545   343-383 (410)
170 PF08243 SPT2:  SPT2 chromatin   49.3      24 0.00053   35.5   4.4   38  402-439    73-110 (116)
171 KOG0343 RNA Helicase [RNA proc  49.1      44 0.00095   41.8   7.2    6  110-115   272-277 (758)
172 TIGR01069 mutS2 MutS2 family p  48.9   2E+02  0.0043   37.5  13.4   16  287-302   405-420 (771)
173 PRK05759 F0F1 ATP synthase sub  48.1 3.2E+02   0.007   27.9  12.4   19  392-410    18-36  (156)
174 PF12569 NARP1:  NMDA receptor-  48.0      14 0.00031   45.4   3.2   16  226-241   126-141 (517)
175 KOG3634 Troponin [Cytoskeleton  48.0      77  0.0017   37.3   8.6    7  608-614   203-209 (361)
176 KOG0066 eIF2-interacting prote  47.7      35 0.00077   41.6   6.0   46  406-451   496-541 (807)
177 KOG3088 Secretory carrier memb  47.4      22 0.00047   41.0   4.1   23  450-472    70-92  (313)
178 PF02731 SKIP_SNW:  SKIP/SNW do  47.2      48  0.0011   35.3   6.4   20  379-399    88-107 (158)
179 PF15066 CAGE1:  Cancer-associa  46.8 1.8E+02   0.004   35.7  11.6   39  439-477   484-522 (527)
180 KOG1772 Vacuolar H+-ATPase V1   46.8      57  0.0012   32.7   6.3   25  441-465    26-50  (108)
181 cd02664 Peptidase_C19H A subfa  46.4      10 0.00023   43.1   1.6   33  215-256     1-33  (327)
182 KOG0681 Actin-related protein   46.3      31 0.00066   42.8   5.4   20  399-418   351-370 (645)
183 COG5644 Uncharacterized conser  46.3      55  0.0012   40.8   7.4   32  409-444   370-401 (869)
184 TIGR00570 cdk7 CDK-activating   45.9      98  0.0021   36.2   9.1    8  335-342    96-103 (309)
185 KOG2635 Medium subunit of clat  45.0      50  0.0011   40.1   6.7   41  385-429   102-143 (512)
186 KOG3756 Pinin (desmosome-assoc  44.9 2.5E+02  0.0055   33.1  11.9   43  385-430   140-183 (340)
187 KOG3878 Protein involved in ma  44.9 2.1E+02  0.0045   34.0  11.3   33  401-433   115-147 (469)
188 cd02659 peptidase_C19C A subfa  44.6     8.5 0.00018   43.3   0.5   34  212-254     1-34  (334)
189 KOG2441 mRNA splicing factor/p  44.4      69  0.0015   38.4   7.6   20  379-399   261-280 (506)
190 CHL00019 atpF ATP synthase CF0  44.3 2.2E+02  0.0048   30.3  10.9   11  339-349    39-49  (184)
191 PF15346 ARGLU:  Arginine and g  44.2 3.9E+02  0.0084   28.5  12.2   16  465-480   100-115 (149)
192 KOG4571 Activating transcripti  43.9   1E+02  0.0022   35.8   8.6   52  407-459   242-293 (294)
193 KOG1015 Transcription regulato  43.8      19 0.00042   47.0   3.4   54  419-472   549-615 (1567)
194 PRK12704 phosphodiesterase; Pr  43.6      62  0.0013   40.1   7.6   20  569-588   195-214 (520)
195 PRK04863 mukB cell division pr  42.9 1.9E+02  0.0041   40.5  12.4   40  377-416   941-982 (1486)
196 smart00784 SPT2 SPT2 chromatin  42.7      41  0.0009   33.9   4.8   36  403-438    69-104 (111)
197 KOG4657 Uncharacterized conser  42.5 4.4E+02  0.0096   29.9  12.9   53  402-456    62-116 (246)
198 PF12757 DUF3812:  Protein of u  42.1      58  0.0013   33.2   5.9   17  409-425    75-91  (126)
199 PF09831 DUF2058:  Uncharacteri  41.1      45 0.00097   36.1   5.1    7  433-439    15-21  (177)
200 PRK06569 F0F1 ATP synthase sub  41.0 3.5E+02  0.0077   28.9  11.5   12  396-407    28-39  (155)
201 PF10186 Atg14:  UV radiation r  40.7 5.4E+02   0.012   28.5  13.8   28  381-408    21-48  (302)
202 KOG0288 WD40 repeat protein Ti  40.6 1.2E+02  0.0027   36.7   8.9   14  486-499   118-131 (459)
203 PF15236 CCDC66:  Coiled-coil d  40.4 1.5E+02  0.0033   31.7   8.7   26  419-444    72-98  (157)
204 COG5374 Uncharacterized conser  39.7 1.2E+02  0.0026   33.2   7.9   68  412-479   116-191 (192)
205 COG4942 Membrane-bound metallo  39.4 5.6E+02   0.012   31.4  14.2   14  335-348   118-131 (420)
206 cd02669 Peptidase_C19M A subfa  39.3      14 0.00029   44.3   1.1   37  210-255   116-152 (440)
207 PF00430 ATP-synt_B:  ATP synth  39.2      94   0.002   30.5   6.8   14  403-416    35-48  (132)
208 KOG0112 Large RNA-binding prot  39.1      28 0.00062   45.1   3.8   19  498-516   784-803 (975)
209 PHA03247 large tegument protei  39.0 1.2E+03   0.025   35.2  18.4   11  289-299  2226-2236(3151)
210 PRK11637 AmiB activator; Provi  38.3 3.3E+02   0.007   32.7  12.2    9  528-536   314-322 (428)
211 PF03879 Cgr1:  Cgr1 family;  I  38.0 3.9E+02  0.0085   27.1  10.7   23  404-426    30-52  (108)
212 KOG2507 Ubiquitin regulatory p  37.8      28 0.00061   42.0   3.3   11  408-418   204-214 (506)
213 PF09731 Mitofilin:  Mitochondr  37.0 6.4E+02   0.014   31.4  14.9   34  375-411   253-286 (582)
214 TIGR00570 cdk7 CDK-activating   36.9 1.2E+02  0.0026   35.5   8.0    8  429-436   151-158 (309)
215 KOG1265 Phospholipase C [Lipid  36.4 2.4E+02  0.0051   37.4  10.9  119  367-495  1024-1157(1189)
216 PF04889 Cwf_Cwc_15:  Cwf15/Cwc  36.4      69  0.0015   36.2   5.9    7  418-424   150-156 (244)
217 PLN02316 synthase/transferase   36.3      78  0.0017   42.4   7.2   25  573-600   335-359 (1036)
218 PF11861 DUF3381:  Domain of un  35.8      81  0.0018   33.4   6.0   14  331-344    30-43  (159)
219 smart00786 SHR3_chaperone ER m  35.8      44 0.00096   36.6   4.2   13  407-419   156-168 (196)
220 cd03405 Band_7_HflC Band_7_Hfl  35.2   2E+02  0.0042   31.3   9.1   32  430-461   176-207 (242)
221 cd02661 Peptidase_C19E A subfa  35.2      23  0.0005   38.8   2.1   34  213-255     1-34  (304)
222 COG4942 Membrane-bound metallo  35.1 2.4E+02  0.0053   34.4  10.3    9  528-536   298-306 (420)
223 PF13904 DUF4207:  Domain of un  35.0 2.2E+02  0.0048   32.3   9.6   12  386-397    85-96  (264)
224 TIGR03319 YmdA_YtgF conserved   34.6   1E+02  0.0023   38.1   7.6   20  569-588   189-208 (514)
225 KOG4673 Transcription factor T  34.0 1.5E+02  0.0031   38.1   8.4   37  401-437   416-452 (961)
226 COG5269 ZUO1 Ribosome-associat  33.9 2.6E+02  0.0056   32.5   9.7    8  387-394   219-226 (379)
227 PF05890 Ebp2:  Eukaryotic rRNA  33.6 1.7E+02  0.0038   33.5   8.5   25  408-432   116-140 (271)
228 KOG0717 Molecular chaperone (D  33.2 3.7E+02  0.0081   33.3  11.4   16  410-425   196-211 (508)
229 KOG1103 Predicted coiled-coil   33.2 1.9E+02  0.0041   34.5   8.7   12  458-469   223-234 (561)
230 KOG2573 Ribosome biogenesis pr  33.0      31 0.00068   41.3   2.7   13  229-241   278-290 (498)
231 PF02029 Caldesmon:  Caldesmon;  33.0 1.7E+02  0.0036   36.4   8.8    7  594-600   429-435 (492)
232 PF00769 ERM:  Ezrin/radixin/mo  32.9 4.1E+02   0.009   29.9  11.2   20  379-398     4-23  (246)
233 KOG2062 26S proteasome regulat  32.8      16 0.00034   46.6   0.3   18  440-457   793-810 (929)
234 KOG4739 Uncharacterized protei  32.6 3.6E+02  0.0078   30.6  10.5   10  598-607   220-229 (233)
235 PRK12585 putative monovalent c  32.6 1.2E+02  0.0026   33.5   6.6   11  378-388   108-118 (197)
236 COG5493 Uncharacterized conser  32.5 4.8E+02    0.01   29.2  11.1   24  401-425    24-47  (231)
237 PF15086 UPF0542:  Uncharacteri  32.5      64  0.0014   30.5   4.0   14  417-430    42-55  (74)
238 cd03404 Band_7_HflK Band_7_Hfl  32.3 1.6E+02  0.0035   32.6   8.0   17  177-193    80-96  (266)
239 cd02671 Peptidase_C19O A subfa  32.3      13 0.00029   42.8  -0.4   34  210-252    21-54  (332)
240 TIGR01933 hflK HflK protein. H  32.1 2.5E+02  0.0054   31.1   9.4   15  178-192    54-68  (261)
241 PRK12704 phosphodiesterase; Pr  31.8 2.5E+02  0.0055   35.0  10.2    8  584-591   216-223 (520)
242 KOG2505 Ankyrin repeat protein  31.8      81  0.0018   39.0   5.8   14  176-189   289-303 (591)
243 PF07543 PGA2:  Protein traffic  31.8 1.3E+02  0.0028   31.5   6.5   25  396-424    26-50  (140)
244 PF15295 CCDC50_N:  Coiled-coil  31.7 4.8E+02    0.01   27.4  10.4    9  392-400    26-34  (132)
245 KOG4369 RTK signaling protein   31.7      30 0.00065   46.0   2.4   27  973-1000 1931-1957(2131)
246 PF04747 DUF612:  Protein of un  31.5 1.9E+02  0.0041   34.8   8.5   24  390-413    34-57  (510)
247 KOG2484 GTPase [General functi  31.4      41  0.0009   40.5   3.3   18  575-593   226-243 (435)
248 PHA02571 a-gt.4 hypothetical p  31.1 1.2E+02  0.0027   30.6   5.9   37  417-453    17-55  (109)
249 COG5192 BMS1 GTP-binding prote  30.8      58  0.0013   40.8   4.4   29  440-468  1030-1058(1077)
250 cd02657 Peptidase_C19A A subfa  30.7      25 0.00054   39.1   1.4   33  215-256     1-33  (305)
251 PF04747 DUF612:  Protein of un  30.5 2.7E+02  0.0058   33.6   9.4   14  603-616   250-263 (510)
252 COG2811 NtpF Archaeal/vacuolar  30.4 3.2E+02  0.0069   27.7   8.7   17  413-429    27-43  (108)
253 KOG4722 Zn-finger protein [Gen  30.3 3.8E+02  0.0082   32.8  10.6   96  383-480   334-439 (672)
254 PF02029 Caldesmon:  Caldesmon;  30.0 2.6E+02  0.0055   34.9   9.6   29  494-522   369-398 (492)
255 KOG0783 Uncharacterized conser  29.9      21 0.00046   46.0   0.7   57  266-322   558-637 (1267)
256 PF14235 DUF4337:  Domain of un  29.8 2.3E+02   0.005   30.1   8.1   27  442-468    80-106 (157)
257 KOG2457 A/G-specific adenine D  29.7      48   0.001   39.6   3.4   21  966-986   443-463 (555)
258 KOG0994 Extracellular matrix g  29.7 5.2E+02   0.011   35.5  12.4   18  381-398  1627-1644(1758)
259 PTZ00436 60S ribosomal protein  29.7 3.6E+02  0.0078   31.8  10.0    7  236-242    13-19  (357)
260 PF12999 PRKCSH-like:  Glucosid  29.7 1.9E+02  0.0041   31.4   7.6   35  379-422   116-150 (176)
261 KOG2668 Flotillins [Intracellu  29.5 2.1E+02  0.0045   34.3   8.3   41  433-473   293-334 (428)
262 cd02658 Peptidase_C19B A subfa  29.3      21 0.00045   39.9   0.5   30  215-253     1-30  (311)
263 KOG2270 Serine/threonine prote  29.0      40 0.00087   40.7   2.7   14  180-193   231-244 (520)
264 KOG1156 N-terminal acetyltrans  28.9      57  0.0012   41.3   4.0   32  441-472   584-615 (700)
265 PRK07720 fliJ flagellar biosyn  28.7 6.4E+02   0.014   25.7  11.5   14  402-415    71-84  (146)
266 KOG4591 Uncharacterized conser  28.5      20 0.00044   39.4   0.2   89  266-363    66-165 (280)
267 KOG1962 B-cell receptor-associ  28.5 2.5E+02  0.0055   31.5   8.4    8  336-343    88-95  (216)
268 KOG4657 Uncharacterized conser  28.3 7.3E+02   0.016   28.3  11.8   12  553-564   207-218 (246)
269 TIGR01147 V_ATP_synt_G vacuola  28.1 2.4E+02  0.0052   28.7   7.5   26  443-468    28-53  (113)
270 KOG4661 Hsp27-ERE-TATA-binding  27.7 3.6E+02  0.0079   34.1  10.1    9  476-484   680-688 (940)
271 PLN02850 aspartate-tRNA ligase  27.5      52  0.0011   40.8   3.4   19  889-907   302-321 (530)
272 KOG0343 RNA Helicase [RNA proc  27.4   1E+02  0.0022   38.9   5.6   15  513-527   733-747 (758)
273 cd02668 Peptidase_C19L A subfa  27.4      20 0.00044   40.6  -0.1   32  215-255     1-32  (324)
274 PRK03963 V-type ATP synthase s  27.3 1.2E+02  0.0026   32.3   5.7    7  417-423    20-26  (198)
275 KOG2007 Cysteinyl-tRNA synthet  27.3 1.1E+02  0.0023   38.1   5.7    6   95-100    56-61  (586)
276 KOG4055 Uncharacterized conser  27.2 3.9E+02  0.0084   29.5   9.2    6  404-409   109-114 (213)
277 PF03879 Cgr1:  Cgr1 family;  I  27.2 2.9E+02  0.0064   27.9   7.8   30  399-428    33-62  (108)
278 KOG2573 Ribosome biogenesis pr  26.7      70  0.0015   38.6   4.0   11   67-77     11-21  (498)
279 KOG4326 Mitochondrial F1F0-ATP  26.5 4.7E+02    0.01   25.0   8.4   12  383-394    12-23  (81)
280 COG1422 Predicted membrane pro  26.5      56  0.0012   36.0   3.0   38  442-479    72-109 (201)
281 PTZ00074 60S ribosomal protein  26.2 1.6E+02  0.0035   30.8   6.0    7  401-407    87-93  (135)
282 PHA01750 hypothetical protein   26.2 2.6E+02  0.0055   26.2   6.6   28  391-419    17-44  (75)
283 PRK00106 hypothetical protein;  26.2 3.1E+02  0.0068   34.5   9.6   25  610-634   242-267 (535)
284 PF05529 Bap31:  B-cell recepto  26.1 4.3E+02  0.0092   28.2   9.5   14  386-399   108-121 (192)
285 TIGR02473 flagell_FliJ flagell  25.7 6.6E+02   0.014   24.9  11.0   10  402-411    68-77  (141)
286 KOG2385 Uncharacterized conser  25.6      56  0.0012   40.5   3.1   43  390-434   568-610 (633)
287 PF05529 Bap31:  B-cell recepto  25.5 4.4E+02  0.0094   28.2   9.5    6  389-394   105-110 (192)
288 PTZ00491 major vault protein;   25.4 2.6E+02  0.0057   36.9   8.9   67  370-446   633-700 (850)
289 PF03538 VRP1:  Salmonella viru  25.1      26 0.00057   40.9   0.3   48  971-1020  171-223 (325)
290 PF00443 UCH:  Ubiquitin carbox  25.1      29 0.00064   36.3   0.7   33  213-254     1-33  (269)
291 PF05279 Asp-B-Hydro_N:  Aspart  24.9 1.8E+02   0.004   33.0   6.7   10  309-318    43-52  (243)
292 PF12569 NARP1:  NMDA receptor-  24.8      85  0.0018   38.9   4.5   20  444-463   413-432 (517)
293 PTZ00401 aspartyl-tRNA synthet  24.7      58  0.0013   40.7   3.1   19  889-907   290-309 (550)
294 PRK05689 fliJ flagellar biosyn  24.6   7E+02   0.015   25.3  10.4   11  440-450   113-123 (147)
295 TIGR03319 YmdA_YtgF conserved   24.5 4.5E+02  0.0098   32.8  10.5    6  628-633   240-245 (514)
296 KOG4043 Uncharacterized conser  24.5      49  0.0011   35.4   2.1   20  436-455   179-198 (214)
297 PF05384 DegS:  Sensor protein   24.4 6.9E+02   0.015   26.8  10.5   33  393-425    65-101 (159)
298 PF15295 CCDC50_N:  Coiled-coil  24.4 2.1E+02  0.0045   29.9   6.4    6  315-320     7-12  (132)
299 PF12037 DUF3523:  Domain of un  24.2 9.7E+02   0.021   28.0  12.2   33  378-410    61-94  (276)
300 cd02667 Peptidase_C19K A subfa  24.1      34 0.00073   38.1   0.9   30  215-253     1-30  (279)
301 PF11214 Med2:  Mediator comple  23.9      74  0.0016   31.9   3.1    6  454-459    94-99  (105)
302 PF12072 DUF3552:  Domain of un  23.8 9.6E+02   0.021   26.1  12.1   19  392-410    18-36  (201)
303 PF07763 FEZ:  FEZ-like protein  23.8 4.4E+02  0.0096   30.1   9.3   51  382-438   163-213 (244)
304 PF05672 MAP7:  MAP7 (E-MAP-115  23.8 1.8E+02  0.0039   31.6   6.0   19  415-433    33-51  (171)
305 KOG2962 Prohibitin-related mem  23.7 8.6E+02   0.019   27.9  11.3    6  110-115    29-34  (322)
306 cd08045 TAF4 TATA Binding Prot  23.5 9.8E+02   0.021   26.2  11.8   12  335-346    48-59  (212)
307 KOG2223 Uncharacterized conser  23.5 1.7E+02  0.0037   35.9   6.4    8  518-525   356-363 (586)
308 KOG0267 Microtubule severing p  23.4   3E+02  0.0066   35.7   8.7   59  784-842   498-565 (825)
309 PRK01005 V-type ATP synthase s  23.2 4.1E+02  0.0089   29.4   8.9   14  396-409    15-28  (207)
310 PLN02381 valyl-tRNA synthetase  23.1      94   0.002   41.8   4.8   60  429-490    16-75  (1066)
311 PTZ00436 60S ribosomal protein  23.1 5.1E+02   0.011   30.6   9.7    6  433-438   159-164 (357)
312 KOG0949 Predicted helicase, DE  23.0 2.7E+02  0.0059   37.5   8.3   19  407-425   862-880 (1330)
313 cd07658 F-BAR_NOSTRIN The F-BA  22.8 1.1E+03   0.024   26.4  12.5   15  333-347    20-35  (239)
314 PF14738 PaaSYMP:  Solute carri  22.8 4.1E+02  0.0088   28.3   8.4   11  384-394    75-85  (154)
315 KOG2572 Ribosome biogenesis pr  22.6      30 0.00064   41.5   0.1   11  272-282   290-300 (498)
316 cd08045 TAF4 TATA Binding Prot  22.6 2.7E+02  0.0059   30.5   7.4    6  424-429   126-131 (212)
317 COG5638 Uncharacterized conser  22.5      98  0.0021   37.3   4.2   19  496-514   524-544 (622)
318 PF08701 GN3L_Grn1:  GNL3L/Grn1  22.4 1.9E+02   0.004   27.7   5.3   16  417-432    45-60  (79)
319 KOG2391 Vacuolar sorting prote  22.3 6.4E+02   0.014   30.3  10.4   48  403-450   211-261 (365)
320 PTZ00491 major vault protein;   22.1 1.5E+03   0.032   30.4  14.5   28  226-253   571-598 (850)
321 PF04568 IATP:  Mitochondrial A  22.1 2.8E+02  0.0061   27.7   6.6   27  396-423    51-77  (100)
322 PLN03188 kinesin-12 family pro  22.0 7.6E+02   0.016   34.3  12.2   24  378-401  1090-1115(1320)
323 PLN03237 DNA topoisomerase 2;   21.9 1.9E+02  0.0041   40.2   7.1   17  335-351  1060-1076(1465)
324 PF09731 Mitofilin:  Mitochondr  21.7 7.9E+02   0.017   30.6  12.0    9  231-239   119-127 (582)
325 KOG4709 Uncharacterized conser  21.3   3E+02  0.0064   30.6   7.1   14  512-525   121-134 (217)
326 PRK05689 fliJ flagellar biosyn  21.3 8.7E+02   0.019   24.6  11.6   69  402-470    71-139 (147)
327 PRK04654 sec-independent trans  21.1   6E+02   0.013   28.6   9.5   38  380-429    23-60  (214)
328 KOG2481 Protein required for n  21.0 1.8E+02   0.004   36.2   6.1   30  440-469   535-564 (570)
329 COG1390 NtpE Archaeal/vacuolar  20.9 5.5E+02   0.012   28.1   9.2   19  509-527   112-130 (194)
330 KOG0566 Inositol-1,4,5-triphos  20.7 6.8E+02   0.015   33.8  11.1   29  814-842  1015-1043(1080)
331 PF11675 DUF3271:  Protein of u  20.3      36 0.00078   38.3   0.2   48  974-1021   49-104 (249)
332 PRK07720 fliJ flagellar biosyn  20.2 9.3E+02    0.02   24.5  11.1   12  439-450   112-123 (146)
333 PF10669 Phage_Gp23:  Protein g  20.1 6.2E+02   0.014   25.4   8.3   13  493-505   104-116 (121)

No 1  
>cd03772 MATH_HAUSP Herpesvirus-associated ubiquitin-specific protease (HAUSP, also known as USP7) family, N-terminal MATH (TRAF-like) domain; composed of proteins similar to human HAUSP, an enzyme that specifically catalyzes the deubiquitylation of p53 and MDM2, hence playing an important role in the p53-MDM2 pathway. It contains an N-terminal TRAF-like domain and a C-terminal catalytic protease (C19 family) domain. The tumor suppressor p53 protein is a transcription factor that responds to many cellular stress signals and is regulated primarily through ubiquitylation and subsequent degradation. MDM2 is a RING-finger E3 ubiquitin ligase that promotes p53 ubiquitinylation. p53 and MDM2 bind to the same site in the N-terminal TRAF-like domain of HAUSP in a mutually exclusive manner. HAUSP also interacts with the Epstein-Barr nuclear antigen 1 (EBNA1) protein of the Epstein-Barr virus (EBV), which efficiently immortalizes infected cells predisposing the host to a variety of cancers. EBNA1
Probab=99.90  E-value=3.3e-23  Score=203.08  Aligned_cols=125  Identities=26%  Similarity=0.493  Sum_probs=106.0

Q ss_pred             ccEEEEEEccccccCCCeeeCCceeecCeEEEEEEEeCCCCC----CCeEEEEEEEecCCCCCCCCeEEEEEEEEEEeCC
Q 001172           67 YGKYTWRIEKFSQISKRELRSNAFEVGGYKWYILIYPQGCDV----CNHLSLFLCVANHDKLLPGWSHFAQFTIAVVNRD  142 (1133)
Q Consensus        67 tGshtwkIdNFS~LKge~I~Sp~FsVGGhkWRI~~YPnG~~~----~dyLSLYLeL~s~~sl~p~WsV~AqFtf~LLNQd  142 (1133)
                      +++|+|+|.|||.+ ++.++|+.|.||||+|+|++||+|+..    .+||||||+|.... ....|++.|+|+|+|+|++
T Consensus         2 ~~~~~~~I~~~S~l-~e~~~S~~f~vgG~~W~i~~~P~g~~~~~~~~~~lsvyL~~~~~~-~~~~w~i~a~~~~~l~~~~   79 (137)
T cd03772           2 EATFSFTVERFSRL-SESVLSPPCFVRNLPWKIMVMPRNYPDRNPHQKSVGFFLQCNAES-DSTSWSCHAQAVLRIINYK   79 (137)
T ss_pred             CcEEEEEECCcccC-CCcEECCCEEECCcceEEEEEeCCCCCCCCCCCeEEEEEeeCCcC-CCCCCeEEEEEEEEEEcCC
Confidence            58999999999998 678999999999999999999999642    48999999997643 2347999999999999997


Q ss_pred             CC-CceecccceecccCCCCcccccceeccccc---cCCccCCEEEEEEEEEEee
Q 001172          143 PK-KSKYSDTLHRFWKKEHDWGWKKFMELSKVS---DGFKDGDTLIIKAQVQVIR  193 (1133)
Q Consensus       143 Gk-~S~skes~h~FsskesdWGfskFIkrsdLE---sGFLvDDsLTIeCdVtVIk  193 (1133)
                      +. +.......|.|......|||.+||+|++|+   ++||+||+|+|+|+|+|-+
T Consensus        80 ~~~~~~~~~~~~~f~~~~~~~G~~~fi~~~~L~~~~sgyl~~D~l~Ie~~V~~~~  134 (137)
T cd03772          80 DDEPSFSRRISHLFFSKENDWGFSNFMTWSEVTDPEKGFIEDDTITLEVYVQADA  134 (137)
T ss_pred             CCcccEEEeeeeEEcCCCCCccchheeEHHHhcCCCCCcEECCEEEEEEEEEeeC
Confidence            43 333334557887667899999999999995   7999999999999999865


No 2  
>cd03775 MATH_Ubp21p Ubiquitin-specific protease 21 (Ubp21p) family, MATH domain; composed of fungal proteins with similarity to Ubp21p of fission yeast. Ubp21p is a deubiquitinating enzyme that may be involved in the regulation of the protein kinase Prp4p, which controls the formation of active spliceosomes. Members of this family are similar to human HAUSP (Herpesvirus-associated ubiquitin-specific protease) in that they contain an N-terminal MATH domain and a C-terminal catalytic protease (C19 family) domain. HAUSP is also an ubiquitin-specific protease that specifically catalyzes the deubiquitylation of p53 and MDM2. The MATH domain of HAUSP contains the binding site for p53 and MDM2. Similarly, the MATH domain of members in this family may be involved in substrate binding.
Probab=99.90  E-value=3.5e-23  Score=202.67  Aligned_cols=121  Identities=32%  Similarity=0.702  Sum_probs=103.6

Q ss_pred             EEEEEEccccccCCCeeeCCceeecCeEEEEEEEeCCCCCCCeEEEEEEEecCCC----CCCCCeEEEEEEEEEEeCC-C
Q 001172           69 KYTWRIEKFSQISKRELRSNAFEVGGYKWYILIYPQGCDVCNHLSLFLCVANHDK----LLPGWSHFAQFTIAVVNRD-P  143 (1133)
Q Consensus        69 shtwkIdNFS~LKge~I~Sp~FsVGGhkWRI~~YPnG~~~~dyLSLYLeL~s~~s----l~p~WsV~AqFtf~LLNQd-G  143 (1133)
                      +|+|+|.|||.+ ++.+.|++|.||||+|+|++||+|+...+||||||++.+...    .+++|.|+|+|.|+|+|+. +
T Consensus         2 ~f~w~I~~fS~~-~~~~~S~~F~vGG~~W~l~~yP~G~~~~~~iSlyL~l~~~~~~~~~~~~~~~v~a~f~~~l~n~~~~   80 (134)
T cd03775           2 SFTWRIKNWSEL-EKKVHSPKFKCGGFEWRILLFPQGNSQTGGVSIYLEPHPEEEEKAPLDEDWSVCAQFALVISNPGDP   80 (134)
T ss_pred             cEEEEECCcccC-CcceeCCCEEECCeeEEEEEeCCCCCCCCeEEEEEEecCcccccccCCCCCeEEEEEEEEEEcCCCC
Confidence            599999999996 678999999999999999999999876799999999976443    2468999999999999994 4


Q ss_pred             CCceecccceecccCCCCcccccceeccccc-------cCCccCCEEEEEEEEE
Q 001172          144 KKSKYSDTLHRFWKKEHDWGWKKFMELSKVS-------DGFKDGDTLIIKAQVQ  190 (1133)
Q Consensus       144 k~S~skes~h~FsskesdWGfskFIkrsdLE-------sGFLvDDsLTIeCdVt  190 (1133)
                      ..+......|.|.....+|||.+||++++|+       +|||+||+|+|+|.|.
T Consensus        81 ~~~~~~~~~~~F~~~~~~wG~~~fi~~~~L~~~~~~~~~g~l~nD~l~I~~~~~  134 (134)
T cd03775          81 SIQLSNVAHHRFNAEDKDWGFTRFIELRKLAHRTPDKPSPFLENGELNITVYVR  134 (134)
T ss_pred             ccceEccceeEeCCCCCCCChhHcccHHHHcccccCCCCceeECCEEEEEEEEC
Confidence            3333345678998777899999999999997       5999999999999873


No 3  
>cd03774 MATH_SPOP Speckle-type POZ protein (SPOP) family, MATH domain; composed of proteins with similarity to human SPOP. SPOP was isolated as a novel antigen recognized by serum from a scleroderma patient, whose overexpression in COS cells results in a discrete speckled pattern in the nuclei. It contains an N-terminal MATH domain and a C-terminal BTB (also called POZ) domain. Together with Cul3, SPOP constitutes an ubiquitin E3 ligase which is able to ubiquitinate the PcG protein BMI1, the variant histone macroH2A1 and the death domain-associated protein Daxx. Therefore, SPOP may be involved in the regulation of these proteins and may play a role in transcriptional regulation, apoptosis and X-chromosome inactivation. Cul3 binds to the BTB domain of SPOP whereas Daxx and the macroH2A1 nonhistone region have been shown to bind to the MATH domain. Both MATH and BTB domains are necessary for the nuclear speckled accumulation of SPOP. There are many proteins, mostly uncharacterized, conta
Probab=99.87  E-value=6.9e-22  Score=193.98  Aligned_cols=123  Identities=28%  Similarity=0.516  Sum_probs=103.0

Q ss_pred             ccEEEEEEccccccC---CCeeeCCceeecCe---EEEEEEEeCCCC--CCCeEEEEEEEecCCCCCCCCeEEEEEEEEE
Q 001172           67 YGKYTWRIEKFSQIS---KRELRSNAFEVGGY---KWYILIYPQGCD--VCNHLSLFLCVANHDKLLPGWSHFAQFTIAV  138 (1133)
Q Consensus        67 tGshtwkIdNFS~LK---ge~I~Sp~FsVGGh---kWRI~~YPnG~~--~~dyLSLYLeL~s~~sl~p~WsV~AqFtf~L  138 (1133)
                      ..+|+|+|+|||.++   ++.+.|++|.||||   +|+|++||+|..  ..+||||||++....    .|.++|+|+|+|
T Consensus         4 ~~~~~w~I~~fS~~~~~~~~~i~S~~F~vgg~~~~~W~l~~yP~G~~~~~~~~iSlyL~l~~~~----~~~v~a~f~~~l   79 (139)
T cd03774           4 KFCYMWTISNFSFCREEMGEVIKSSTFSSGANDKLKWCLRVNPKGLDEESKDYLSLYLLLVSCP----KSEVRAKFKFSI   79 (139)
T ss_pred             EEEEEEEECCchhhhhcCCCEEECCCeecCCcCCceEEEEEeCCCCCCCCCCeEEEEEEEccCC----CCcEEEEEEEEE
Confidence            468999999999875   67899999999995   999999999974  368999999997532    367999999999


Q ss_pred             EeCCCCCce--ecccceecccCCCCcccccceeccccc---cCCccCCEEEEEEEEEEeec
Q 001172          139 VNRDPKKSK--YSDTLHRFWKKEHDWGWKKFMELSKVS---DGFKDGDTLIIKAQVQVIRE  194 (1133)
Q Consensus       139 LNQdGk~S~--skes~h~FsskesdWGfskFIkrsdLE---sGFLvDDsLTIeCdVtVIkd  194 (1133)
                      +|+++....  .....+.|.. ..+|||.+||++++|+   ++||+||+|+|+|+|+|+++
T Consensus        80 ~n~~~~~~~~~~~~~~~~f~~-~~~wG~~~fi~~~~L~~~~~g~l~dD~l~I~c~I~V~~~  139 (139)
T cd03774          80 LNAKGEETKAMESQRAYRFVQ-GKDWGFKKFIRRDFLLDEANGLLPDDKLTLFCEVSVVQD  139 (139)
T ss_pred             EecCCCeeeeecccCcEeCCC-CCccCHHHeeeHHHhhhhhcccccCCEEEEEEEEEEEcC
Confidence            999876432  1234577864 5789999999999996   58999999999999999863


No 4  
>cd03773 MATH_TRIM37 Tripartite motif containing protein 37 (TRIM37) family, MATH domain; TRIM37 is a peroxisomal protein and is a member of the tripartite motif (TRIM) protein subfamily, also known as the RING-B-box-coiled-coil (RBCC) subfamily of zinc-finger proteins. Mutations in the human TRIM37 gene (also known as MUL) cause Mulibrey (muscle-liver-brain-eye) nanism, a rare growth disorder of prenatal onset characterized by dysmorphic features, pericardial constriction and hepatomegaly. TRIM37, similar to other TRIMs, contains a cysteine-rich, zinc-binding RING-finger domain followed by another cysteine-rich zinc-binding domain, the B-box, and a coiled-coil domain. TRIM37 is autoubiquitinated in a RING domain-dependent manner, indicating that it functions as an ubiquitin E3 ligase. In addition to the tripartite motif, TRIM37 also contains a MATH domain C-terminal to the coiled-coil domain. The MATH domain of TRIM37 has been shown to interact with the TRAF domain of six known TRAFs i
Probab=99.86  E-value=1.3e-21  Score=189.34  Aligned_cols=120  Identities=27%  Similarity=0.512  Sum_probs=101.4

Q ss_pred             cccEEEEEEccccccC--CCeeeCCceeecCeEEEEEEEeCCCCC--CCeEEEEEEEecCCCCCCCCeEEEEEEEEEEeC
Q 001172           66 LYGKYTWRIEKFSQIS--KRELRSNAFEVGGYKWYILIYPQGCDV--CNHLSLFLCVANHDKLLPGWSHFAQFTIAVVNR  141 (1133)
Q Consensus        66 ~tGshtwkIdNFS~LK--ge~I~Sp~FsVGGhkWRI~~YPnG~~~--~dyLSLYLeL~s~~sl~p~WsV~AqFtf~LLNQ  141 (1133)
                      ...+|+|+|.|||.++  ++.+.|+.|.||||+|+|++||+|...  .+||||||++....    .|.+.++|+|+|+|+
T Consensus         3 ~~~~~~~~I~~fS~~~~~~~~~~S~~F~vgG~~W~i~~yP~G~~~~~~~~lSl~L~l~~~~----~~~~~~~~~l~llnq   78 (132)
T cd03773           3 PYDSATFTLENFSTLRQSADPVYSDPLNVDGLCWRLKVYPDGNGEVRGNFLSVFLELCSGL----GEASKYEYRVEMVHQ   78 (132)
T ss_pred             CCcccEEEECChhhhhcCCcceeCCCeEeCCccEEEEEECCCCCCCCCCEEEEEEEeecCC----CCceeEEEEEEEEcC
Confidence            4678999999999985  578999999999999999999999753  68999999987642    367889999999999


Q ss_pred             -CCCCceecccceecccCCCCcccccceeccccc-cCCccC--CEEEEEEEEE
Q 001172          142 -DPKKSKYSDTLHRFWKKEHDWGWKKFMELSKVS-DGFKDG--DTLIIKAQVQ  190 (1133)
Q Consensus       142 -dGk~S~skes~h~FsskesdWGfskFIkrsdLE-sGFLvD--DsLTIeCdVt  190 (1133)
                       ++..+......+.|.. ..+|||.+||++++|+ +|||+|  |+|+|+|.|.
T Consensus        79 ~~~~~~~~~~~~~~f~~-~~~wG~~~Fi~~~~L~~~gfl~~~~D~l~i~~~v~  130 (132)
T cd03773          79 ANPTKNIKREFASDFEV-GECWGYNRFFRLDLLINEGYLLPENDTLILRFSVR  130 (132)
T ss_pred             CCCccceEEeccccccC-CCCcCHHHhccHHHHhhCCCcCCCCCEEEEEEEEe
Confidence             5544443445677865 4679999999999998 799999  9999999985


No 5  
>cd03776 MATH_TRAF6 Tumor Necrosis Factor Receptor (TNFR)-Associated Factor (TRAF) family, TRAF6 subfamily, TRAF domain, C-terminal MATH subdomain; composed of proteins with similarity to human TRAF6, including the Drosophila protein DTRAF2. TRAF molecules serve as adapter proteins that link TNFRs and downstream kinase cascades resulting in the activation of transcription factors and the regulation of cell survival, proliferation and stress responses. TRAF6 is the most divergent in its TRAF domain among the mammalian TRAFs. In addition to mediating TNFR family signaling, it is also an essential signaling molecule of the interleukin-1/Toll-like receptor superfamily. Whereas other TRAF molecules display similar and overlapping TNFR-binding specificities, TRAF6 binds completely different sites on receptors such as CD40 and RANK. TRAF6 serves as a molecular bridge between innate and adaptive immunity and plays a central role in osteoimmunology. DTRAF2, as an activator of nuclear factor-kapp
Probab=99.86  E-value=1.3e-21  Score=194.49  Aligned_cols=123  Identities=24%  Similarity=0.454  Sum_probs=98.6

Q ss_pred             cEEEEEEccccccC-----CC--eeeCCceee--cCeEEEEEEEeCCCCC--CCeEEEEEEEecCCCC-CCCCeEEEEEE
Q 001172           68 GKYTWRIEKFSQIS-----KR--ELRSNAFEV--GGYKWYILIYPQGCDV--CNHLSLFLCVANHDKL-LPGWSHFAQFT  135 (1133)
Q Consensus        68 GshtwkIdNFS~LK-----ge--~I~Sp~FsV--GGhkWRI~~YPnG~~~--~dyLSLYLeL~s~~sl-~p~WsV~AqFt  135 (1133)
                      |+|+|+|.|||.++     ++  .|+|+.|.+  |||+|+|+|||+|...  .+||||||+|+..... ...|.+.|+|+
T Consensus         1 g~h~~~I~~yS~~~~~~~~g~~~~i~S~~F~~~~gGy~W~i~~yP~G~~~~~~~~lS~~L~l~~~~~d~~l~wpv~a~~~   80 (147)
T cd03776           1 GIYVWKIKNFSNLRRSMEAGSPVVIHSPGFYTSPPGYKLCARLNLSLPEARCPNYISLFVHLMQGENDSHLDWPFQGTIT   80 (147)
T ss_pred             CEEEEEECCHHHHHHHHhcCCCceEECCCcccCCCCceEEEEEEeCCCCCCCCCEEEEEEEEeccCCCcccCCcccceeE
Confidence            68999999999754     34  488999985  7999999999999863  6899999999875432 23699999999


Q ss_pred             EEEEeCCCCCc-ee-----cccceeccc-----CCCCcccccceeccccc-cCCccCCEEEEEEEEE
Q 001172          136 IAVVNRDPKKS-KY-----SDTLHRFWK-----KEHDWGWKKFMELSKVS-DGFKDGDTLIIKAQVQ  190 (1133)
Q Consensus       136 f~LLNQdGk~S-~s-----kes~h~Fss-----kesdWGfskFIkrsdLE-sGFLvDDsLTIeCdVt  190 (1133)
                      |+|+||++... ..     ....+.|..     .+..|||.+||++++|+ .+||+||+|+|+|+|.
T Consensus        81 ~~lldq~~~~~~~~~~~~~~~~~~~F~~p~~~~~~~~~G~~~fi~~~~Le~~~yl~dD~l~I~c~V~  147 (147)
T cd03776          81 LTLLDQSEPRQNIHETMMSKPELLAFQRPTTDRNPKGFGYVEFAHIEDLLQRGFVKNDTLLIKIEVN  147 (147)
T ss_pred             EEEECCCcccCccEEEEEcCCChHhhcCCCcCCCCCCeeEceeeEHHHhhhCCCccCCEEEEEEEEC
Confidence            99999986422 11     112355753     24679999999999999 6899999999999984


No 6  
>cd00270 MATH_TRAF_C Tumor Necrosis Factor Receptor (TNFR)-Associated Factor (TRAF) family, TRAF domain, C-terminal MATH subdomain; TRAF molecules serve as adapter proteins that link cell surface TNFRs and receptors of the interleukin-1/Toll-like family to downstream kinase signaling cascades which results in the activation of transcription factors and the regulation of cell survival, proliferation and stress responses in the immune and inflammatory systems. There are at least six mammalian and three Drosophila proteins containing TRAF domains. The mammalian TRAFs display varying expression profiles, indicating independent and cell type-specific regulation. They display distinct, as well as overlapping functions and interactions with receptors. Most TRAFs, except TRAF1, share N-terminal homology and contain a RING domain, multiple zinc finger domains, and a TRAF domain. TRAFs form homo- and heterotrimers through its TRAF domain. The TRAF domain can be divided into a more divergent N-ter
Probab=99.86  E-value=1.8e-21  Score=192.37  Aligned_cols=122  Identities=28%  Similarity=0.507  Sum_probs=99.2

Q ss_pred             cEEEEEEccccccC-------CCeeeCCceeec--CeEEEEEEEeCCCC--CCCeEEEEEEEecCCCC-CCCCeEEEEEE
Q 001172           68 GKYTWRIEKFSQIS-------KRELRSNAFEVG--GYKWYILIYPQGCD--VCNHLSLFLCVANHDKL-LPGWSHFAQFT  135 (1133)
Q Consensus        68 GshtwkIdNFS~LK-------ge~I~Sp~FsVG--GhkWRI~~YPnG~~--~~dyLSLYLeL~s~~sl-~p~WsV~AqFt  135 (1133)
                      |+|+|+|.|||.++       ++.++|+.|+||  ||+|+|+|||+|..  ..+||||||++.+...+ ...|.+.|+|+
T Consensus         1 g~~~w~I~~fs~~~~~~~~~~~~~~~S~~F~vg~~G~~w~i~~yP~G~~~~~~~~lsl~L~l~~~~~d~~~~w~~~~~~~   80 (149)
T cd00270           1 GVLIWKIKDYSRKLQEAVAGSNTVLYSPPFYTSRYGYKLCLRLYLNGDGTGKGTHLSLFVHVMKGEYDALLEWPFRGKIT   80 (149)
T ss_pred             CEEEEEECCHHHHHHHHhcCCCceEECCCcccCCCCceEEEEEEeCCCCCCCCCEEEEEEEEeccCCCccccCCccceEE
Confidence            68999999999974       358999999999  99999999999975  36799999999876432 34699999999


Q ss_pred             EEEEeCCCC---Cceecc-----cceecc-----cCCCCcccccceeccccc-cCCccCCEEEEEEEE
Q 001172          136 IAVVNRDPK---KSKYSD-----TLHRFW-----KKEHDWGWKKFMELSKVS-DGFKDGDTLIIKAQV  189 (1133)
Q Consensus       136 f~LLNQdGk---~S~ske-----s~h~Fs-----skesdWGfskFIkrsdLE-sGFLvDDsLTIeCdV  189 (1133)
                      |+|+|+.+.   .+....     ..+.|.     ....+|||.+||++++|+ .+||+||+|+|+|+|
T Consensus        81 ~~l~d~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~G~~~fi~~~~L~~~gfl~dD~l~I~~~v  148 (149)
T cd00270          81 LTLLDQSDDSKRKHITETFMPDPNSSAFQRPPTGENNIGFGYPEFVPLEKLESRGYVKDDTLFIKVEV  148 (149)
T ss_pred             EEEECCCCccccCceEEEEEcCCchHhhcCCCcccCCCCcCcceEeEHHHhccCCCEeCCEEEEEEEE
Confidence            999999763   221111     234564     135789999999999999 589999999999998


No 7  
>COG5077 Ubiquitin carboxyl-terminal hydrolase [Posttranslational modification, protein turnover, chaperones]
Probab=99.85  E-value=2.6e-22  Score=234.71  Aligned_cols=241  Identities=25%  Similarity=0.461  Sum_probs=188.7

Q ss_pred             ccccEEEEEEccccccCCCeeeCCceeecCeEEEEEEEeCCCCCCCeEEEEEEEecCCCC--C-CCCeEEEEEEEEEEeC
Q 001172           65 ELYGKYTWRIEKFSQISKRELRSNAFEVGGYKWYILIYPQGCDVCNHLSLFLCVANHDKL--L-PGWSHFAQFTIAVVNR  141 (1133)
Q Consensus        65 ~~tGshtwkIdNFS~LKge~I~Sp~FsVGGhkWRI~~YPnG~~~~dyLSLYLeL~s~~sl--~-p~WsV~AqFtf~LLNQ  141 (1133)
                      .....|+|+|.+||.+.. ...|+.|.||||.|+|.++|.|+...+ +||||+.......  . ..|.|+|+|.|.|-|.
T Consensus        36 ~~~~sftW~vk~wsel~~-k~~Sp~F~vg~~twki~lfPqG~nq~~-~sVyLe~~pqe~e~~~gk~~~ccaqFaf~Is~p  113 (1089)
T COG5077          36 LLEMSFTWKVKRWSELAK-KVESPPFSVGGHTWKIILFPQGNNQCN-VSVYLEYEPQELEETGGKYYDCCAQFAFDISNP  113 (1089)
T ss_pred             HhhcccceecCChhhhhh-hccCCcccccCeeEEEEEecccCCccc-cEEEEEeccchhhhhcCcchhhhhheeeecCCC
Confidence            456789999999999865 578999999999999999999988666 9999998643211  1 2499999999999887


Q ss_pred             -CCCCceecccceecccCCCCcccccceeccccc------cCCccCCEEEEEEEEEEeeccCCCCc-cccccccccceee
Q 001172          142 -DPKKSKYSDTLHRFWKKEHDWGWKKFMELSKVS------DGFKDGDTLIIKAQVQVIREKTDRPF-RCLDCQYRRELVR  213 (1133)
Q Consensus       142 -dGk~S~skes~h~FsskesdWGfskFIkrsdLE------sGFLvDDsLTIeCdVtVIkd~~~~~~-r~ld~dskk~tI~  213 (1133)
                       ++.........|+|+....+|||.+||.+..|.      ..|+.+|.+.|.+.|.|++++++... ..++|++++++++
T Consensus       114 ~~pti~~iN~sHhrFs~~~tDwGFt~f~dL~kl~~psp~~Ppfleeg~l~ItvyVRvlkdPTGVLWHsF~nYnSKkeTGY  193 (1089)
T COG5077         114 KYPTIEYINKSHHRFSMESTDWGFTNFIDLNKLIEPSPGRPPFLEEGTLVITVYVRVLKDPTGVLWHSFLNYNSKKETGY  193 (1089)
T ss_pred             CCCchhhhhcccccccccccccchhhhhhhhhhcCCCCCCCCcccCCeEEEEEEEEEEeCCccceeecccccccccceee
Confidence             332223346789999888999999999999887      25899999999999999999887542 2348999999999


Q ss_pred             ecCCchHHHHhhhhhHHHhhHHHHHhhhccccccchhhhccCHHHHhhhccccchhhhhhhhhhhcchhhhhhhHHHHHH
Q 001172          214 VYLTNVEQICRRFVEERRGKLGRLIEDKARWSSFCAFWLGIDQNARRRMSREKTDAILKVVVKHFFIEKEVTSTLVMDSL  293 (1133)
Q Consensus       214 VPpSNLgaTC~~fved~~~hLG~LLqS~~~~ssFRaf~~a~~~~~r~~m~~e~sDVilkV~~k~Ff~~keVtSplvmd~L  293 (1133)
                      |++.|+|+||         ||++|||+...+..||.....++...-     .+-|.+                ++..+.+
T Consensus       194 VGlrNqGATC---------YmNSLlQslffi~~FRk~Vy~ipTd~p-----~grdSV----------------alaLQr~  243 (1089)
T COG5077         194 VGLRNQGATC---------YMNSLLQSLFFIAKFRKDVYGIPTDHP-----RGRDSV----------------ALALQRL  243 (1089)
T ss_pred             eeeccCCcee---------eHHHHHHHHHHHHHHHHHhhcCCCCCC-----CccchH----------------HHHHHHH
Confidence            9999999999         999999999877778877666653210     121222                6677777


Q ss_pred             hhcccccccccccccccccccccCCCCCCceEeecceeeehhhHHHHHHHhhcCCC
Q 001172          294 YSGLKALEGQSKSKKTKAKLLDAEDTPPPIVHVENDMFVLVDDVLLLLERAALEPL  349 (1133)
Q Consensus       294 ~~glk~le~~t~~~k~~~~~~~t~e~p~~~~~v~~d~~~l~~Dll~~leR~al~~L  349 (1133)
                      |.-|+..+.          -++|-||..++.|..-|.|. ..|+. -+.|+.-|.|
T Consensus       244 Fynlq~~~~----------PvdTteltrsfgWds~dsf~-QHDiq-EfnrVl~DnL  287 (1089)
T COG5077         244 FYNLQTGEE----------PVDTTELTRSFGWDSDDSFM-QHDIQ-EFNRVLQDNL  287 (1089)
T ss_pred             HHHHhccCC----------CcchHHhhhhcCcccchHHH-HHhHH-HHHHHHHHHH
Confidence            777776654          37888898888888777776 44666 5677766665


No 8  
>cd03781 MATH_TRAF4 Tumor Necrosis Factor Receptor (TNFR)-Associated Factor (TRAF) family, TRAF4 subfamily, TRAF domain, C-terminal MATH subdomain; composed of proteins with similarity to human TRAF4, including the Drosophila protein DTRAF1. TRAF molecules serve as adapter proteins that link TNFRs and downstream kinase cascades resulting in the activation of transcription factors and the regulation of cell survival, proliferation and stress responses. TRAF4 is highly expressed during embryogenesis, especially in the central and peripheral nervous system. Studies using TRAF4-deficient mice show that TRAF4 is required for neurogenesis, as well as the development of the trachea and the axial skeleton. In addition, TRAF4 augments nuclear factor-kappaB activation triggered by GITR (glucocorticoid-induced TNFR), a receptor expressed in T-cells, B-cells and macrophages. It also participates in counteracting the signaling mediated by Toll-like receptors through its association with TRAF6 and TR
Probab=99.85  E-value=7e-21  Score=191.49  Aligned_cols=122  Identities=22%  Similarity=0.415  Sum_probs=99.1

Q ss_pred             cEEEEEEccccccC-------CCeeeCCceeec--CeEEEEEEEeCCCCC--CCeEEEEEEEecCCCCC-CCCeEEEEEE
Q 001172           68 GKYTWRIEKFSQIS-------KRELRSNAFEVG--GYKWYILIYPQGCDV--CNHLSLFLCVANHDKLL-PGWSHFAQFT  135 (1133)
Q Consensus        68 GshtwkIdNFS~LK-------ge~I~Sp~FsVG--GhkWRI~~YPnG~~~--~dyLSLYLeL~s~~sl~-p~WsV~AqFt  135 (1133)
                      |+|.|+|.|||.++       +..|.|+.|.+|  ||+|+|++||+|...  .+||||||+|+..+... ..|.+.++|+
T Consensus         1 g~~~~~I~gys~~~~~~~~~~~~~i~S~~F~vg~~Gy~w~i~~yPnG~~~~~~~~vs~~l~l~~ge~d~~l~wp~~a~~~   80 (154)
T cd03781           1 GTLLWKITDYSRKLQEAKGRDNLELFSPPFYTHRYGYKLQVSAFLNGNGSGEGSHLSVYIRVLPGEYDNLLEWPFSHRIT   80 (154)
T ss_pred             CEEEEEECCHHHHHHHhhcCCCceEECCCeecCCCCEEEEEEEECCCCCCCCCCEEEEEEEEecCCcccccCCceeeEEE
Confidence            68999999999875       258999999999  999999999999753  68999999998754322 3799999999


Q ss_pred             EEEEeCCCCC-----cee-c----ccceeccc--------CCCCcccccceeccccc-cCCccCCEEEEEEEE
Q 001172          136 IAVVNRDPKK-----SKY-S----DTLHRFWK--------KEHDWGWKKFMELSKVS-DGFKDGDTLIIKAQV  189 (1133)
Q Consensus       136 f~LLNQdGk~-----S~s-k----es~h~Fss--------kesdWGfskFIkrsdLE-sGFLvDDsLTIeCdV  189 (1133)
                      |+|+||.+..     ... .    .....|..        .+..|||.+||++++|+ .+||+||+|+|+|+|
T Consensus        81 ~~llDq~~~~~~~~~~~~~~~~~~~~~~~F~rp~~~~~~~~~~~~G~~~fi~~~~Le~~~yl~dD~l~Irc~v  153 (154)
T cd03781          81 FTLLDQSDPSLSKPQHITETFTPDPTWKNFQKPSASRLDESTLGFGYPKFISHEDLKKRNYIKDDAIFLRASV  153 (154)
T ss_pred             EEEECCCCCccccCcceEEEEEcCCchhhhcCCcccccCCCCCccchhHeeEHHHHhhCCcccCCEEEEEEEe
Confidence            9999997641     110 0    11345542        33579999999999999 799999999999998


No 9  
>cd03780 MATH_TRAF5 Tumor Necrosis Factor Receptor (TNFR)-Associated Factor (TRAF) family, TRAF5 subfamily, TRAF domain, C-terminal MATH subdomain; TRAF molecules serve as adapter proteins that link TNFRs and downstream kinase cascades resulting in the activation of transcription factors and the regulation of cell survival, proliferation and stress responses. TRAF5 was identified as an activator of nuclear factor-kappaB and a regulator of lymphotoxin-beta receptor and CD40 signaling. Its interaction with CD40 is indirect, involving hetero-oligomerization with TRAF3. In addition, TRAF5 has been shown to associate with other TNFRs including CD27, CD30, OX40 and GITR (glucocorticoid-induced TNFR). It plays a role in modulating Th2 immune responses (driven by OX40 costimulation) and T-cell activation (triggered by GITR). It is also involved in osteoclastogenesis. TRAF5 contains a RING finger domain, five zinc finger domains, and a TRAF domain. The TRAF domain can be divided into a more dive
Probab=99.83  E-value=3e-20  Score=187.03  Aligned_cols=122  Identities=23%  Similarity=0.409  Sum_probs=100.3

Q ss_pred             cEEEEEEccccccC-----CC--eeeCCce--eecCeEEEEEEEeCCCCC--CCeEEEEEEEecCCCC-CCCCeEEEEEE
Q 001172           68 GKYTWRIEKFSQIS-----KR--ELRSNAF--EVGGYKWYILIYPQGCDV--CNHLSLFLCVANHDKL-LPGWSHFAQFT  135 (1133)
Q Consensus        68 GshtwkIdNFS~LK-----ge--~I~Sp~F--sVGGhkWRI~~YPnG~~~--~dyLSLYLeL~s~~sl-~p~WsV~AqFt  135 (1133)
                      |+|+|+|+|||+++     ++  .+.|+.|  .+|||+|+|++||||...  ++||||||+|+..+.+ ...|.+.++|+
T Consensus         1 g~~vwkI~~ys~~~~~~~~g~~~~i~S~~Fyt~~~Gy~w~i~~ypnG~~~~~~~~iSv~l~l~~g~~D~~l~wp~~~~~t   80 (148)
T cd03780           1 GKLIWKVTDYKMKKKEAVDGHTVSIFSQPFYTSRCGYRLCARAYLNGDGSGKGTHLSLYFVVMRGEFDSLLQWPFRQRVT   80 (148)
T ss_pred             CEEEEEECCHHHHHHhhcCCCccEEECCCcccCCCCeeEEEEEEcCCCCCCCCCEEEEEEEEecCccccccCcceEEEEE
Confidence            68999999999986     34  7999999  999999999999999863  6799999999875322 23699999999


Q ss_pred             EEEEeCCCCCce--ec----ccceecccC----CCCcccccceecccccc---CCccCCEEEEEEEE
Q 001172          136 IAVVNRDPKKSK--YS----DTLHRFWKK----EHDWGWKKFMELSKVSD---GFKDGDTLIIKAQV  189 (1133)
Q Consensus       136 f~LLNQdGk~S~--sk----es~h~Fssk----esdWGfskFIkrsdLEs---GFLvDDsLTIeCdV  189 (1133)
                      |.|+||++++..  ..    ...+.|...    +..||+.+||++++|+.   +||+||+|+|+|.|
T Consensus        81 fsLlDq~~~~~~~~~~~~~~~~~~~F~rp~~~~n~~~G~~~Fi~~~~Le~s~~~ylkdD~~~Ik~~v  147 (148)
T cd03780          81 LMLLDQSGKKNHIMETFKADPNSSSFKRPDGEMNIASGCPRFVAHSVLENAKNTYIKDDTLFLKVAV  147 (148)
T ss_pred             EEEECCCCCCCCcceeeecCCccccccCCCCCCCCCcChhheeEHHHhhcccCCcCcCCEEEEEEEE
Confidence            999999765321  11    124668654    45799999999999983   89999999999987


No 10 
>cd03777 MATH_TRAF3 Tumor Necrosis Factor Receptor (TNFR)-Associated Factor (TRAF) family, TRAF3 subfamily, TRAF domain; TRAF molecules serve as adapter proteins that link TNFRs and downstream kinase cascades resulting in the activation of transcription factors and the regulation of cell survival, proliferation and stress responses. TRAF3 was first described as a molecule that binds the cytoplasmic tail of CD40. However, it is not required for CD40 signaling. More recently, TRAF3 has been identified as a key regulator of type I interferon (IFN) production and the mammalian innate antiviral immunity. It mediates IFN responses in Toll-like receptor (TLR)-dependent as well as TLR-independent viral recognition pathways. It is also a key element in immunological homeostasis through its regulation of the anti-inflammatory cytokine interleukin-10. TRAF3 contains a RING finger domain, five zinc finger domains, and a TRAF domain. The TRAF domain can be divided into a more divergent N-terminal al
Probab=99.83  E-value=4.5e-20  Score=192.06  Aligned_cols=126  Identities=22%  Similarity=0.392  Sum_probs=101.6

Q ss_pred             ccccEEEEEEccccccC-----CC--eeeCCceeec--CeEEEEEEEeCCCC--CCCeEEEEEEEecCCCC-CCCCeEEE
Q 001172           65 ELYGKYTWRIEKFSQIS-----KR--ELRSNAFEVG--GYKWYILIYPQGCD--VCNHLSLFLCVANHDKL-LPGWSHFA  132 (1133)
Q Consensus        65 ~~tGshtwkIdNFS~LK-----ge--~I~Sp~FsVG--GhkWRI~~YPnG~~--~~dyLSLYLeL~s~~sl-~p~WsV~A  132 (1133)
                      ...|+|+|+|.|||.++     ++  .++|+.|++|  ||+|+|++||+|..  .++||||||+|+.+..+ ...|.|.+
T Consensus        36 ~~~G~hvwkI~~yS~~~~~~~~g~~~~i~S~~Fyvg~~GY~w~i~~ypnG~g~~~~~~iSvyl~L~~ge~D~~L~WP~~~  115 (186)
T cd03777          36 SYNGVLIWKIRDYKRRKQEAVMGKTLSLYSQPFYTGYFGYKMCARVYLNGDGMGKGTHLSLFFVIMRGEYDALLPWPFKQ  115 (186)
T ss_pred             ccceEEEEEECChhHHHHhhccCCCcEEECCCeEeCCCCeeEEEEEEcCCCCCCCCCEEEEEEEEecCCcccccCCceeE
Confidence            45799999999999875     34  7999999999  99999999999986  36799999999875422 23699999


Q ss_pred             EEEEEEEeCCCCC-ce----e-cccceecc-cC---CCCcccccceeccccc-cCCccCCEEEEEEEEE
Q 001172          133 QFTIAVVNRDPKK-SK----Y-SDTLHRFW-KK---EHDWGWKKFMELSKVS-DGFKDGDTLIIKAQVQ  190 (1133)
Q Consensus       133 qFtf~LLNQdGk~-S~----s-kes~h~Fs-sk---esdWGfskFIkrsdLE-sGFLvDDsLTIeCdVt  190 (1133)
                      +|+|.|+||++.. ..    . ....+.|. ..   +..|||.+||++++|+ ++||+||+|+|+|.|.
T Consensus       116 ~~tfsLlDQ~~~~~~~~~~~~p~p~~~~F~rp~~~~n~~~G~~~Fi~~~~Le~~~ylkdD~l~Irv~v~  184 (186)
T cd03777         116 KVTLMLMDQGSSRRHLGDAFKPDPNSSSFKKPTGEMNIASGCPVFVAQTVLENGTYIKDDTIFIKVIVD  184 (186)
T ss_pred             EEEEEEEcCCCccccccceeccCCccccccCCccCCCCCCCchheeEHHHhccCCcEeCCEEEEEEEEe
Confidence            9999999997521 11    0 01235575 22   4579999999999999 6899999999999885


No 11 
>cd00121 MATH MATH (meprin and TRAF-C homology) domain; an independent folding unit with an eight-stranded beta-sandwich structure found in meprins, TRAFs and other proteins. Meprins comprise a class of extracellular metalloproteases which are anchored to the membrane and are capable of cleaving growth factors, extracellular matrix proteins, and biologically active peptides. TRAF molecules serve as adapter proteins that link cell surface receptors of the Tumor Necrosis Factor and 1nterleukin-1/Toll-like families to downstream kinase cascades, which results in the activation of transcription factors and the regulation of cell survival, proliferation and stress responses in the immune and inflammatory systems. Other members include the ubiquitin ligases, TRIM37 and SPOP, and the ubiquitin-specific proteases, HAUSP and Ubp21p. A large number of uncharacterized members mostly from lineage-specific expansions in C. elegans and rice contain MATH and BTB domains, similar to SPOP. The MATH doma
Probab=99.82  E-value=1.1e-19  Score=168.51  Aligned_cols=123  Identities=38%  Similarity=0.694  Sum_probs=102.1

Q ss_pred             cEEEEEEccccccCCCeeeCCceeecCeEEEEEEEeCCCCC-CCeEEEEEEEecCCCCCCCCeEEEEEEEEEEeCCCCCc
Q 001172           68 GKYTWRIEKFSQISKRELRSNAFEVGGYKWYILIYPQGCDV-CNHLSLFLCVANHDKLLPGWSHFAQFTIAVVNRDPKKS  146 (1133)
Q Consensus        68 GshtwkIdNFS~LKge~I~Sp~FsVGGhkWRI~~YPnG~~~-~dyLSLYLeL~s~~sl~p~WsV~AqFtf~LLNQdGk~S  146 (1133)
                      ++|+|+|.+|+...++.+.|+.|.++|+.|+|++||+|... .+||||||+|.........|.+.|+|+|.|+|+++.+.
T Consensus         1 ~~~~~~i~~~~~~~~~~~~S~~f~~~g~~W~l~~~p~~~~~~~~~lsv~L~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~   80 (126)
T cd00121           1 GKHTWKIVNFSELEGESIYSPPFEVGGYKWRIRIYPNGDGESGDYLSLYLELDKGESDLEKWSVRAEFTLKLVNQNGGKS   80 (126)
T ss_pred             CEEEEEECCCCCCCCcEEECCCEEEcCEeEEEEEEcCCCCCCCCEEEEEEEecCCCCCCCCCcEEEEEEEEEECCCCCcc
Confidence            47999999999866788999999999999999999998643 57999999998665433469999999999999985444


Q ss_pred             eecccceecc-cCCCCcccccceeccccccCC-ccCCEEEEEEEEE
Q 001172          147 KYSDTLHRFW-KKEHDWGWKKFMELSKVSDGF-KDGDTLIIKAQVQ  190 (1133)
Q Consensus       147 ~skes~h~Fs-skesdWGfskFIkrsdLEsGF-LvDDsLTIeCdVt  190 (1133)
                      ......+.|. ....+|||.+||+|++|++.+ ++||+|+|+|+|.
T Consensus        81 ~~~~~~~~~~~~~~~~~G~~~fi~~~~l~~~~~~~~d~l~i~~~v~  126 (126)
T cd00121          81 LSKSFTHVFFSEKGSGWGFPKFISWDDLEDSYYLVDDSLTIEVEVK  126 (126)
T ss_pred             ceEeccCCcCCCCCCCCChHHeeEHHHhccCCcEECCEEEEEEEEC
Confidence            4444444553 456899999999999999655 9999999999984


No 12 
>cd03779 MATH_TRAF1 Tumor Necrosis Factor Receptor (TNFR) Associated Factor (TRAF) family, TRAF1 subfamily, TRAF domain, C-terminal MATH subdomain; TRAF molecules serve as adapter proteins that link TNFRs and downstream kinase cascades resulting in the activation of transcription factors and the regulation of cell survival, proliferation and stress responses. TRAF1 expression is the most restricted among the TRAFs. It is found exclusively in activated lymphocytes, dendritic cells and certain epithelia. TRAF1 associates, directly or indirectly through heterodimerization with TRAF2, with the TNFR family receptors TNFR-2, CD30, RANK, CD40 and LMP1, among others. It also binds the intracellular proteins TRADD, TANK, TRIP, RIP1, RIP2 and FLIP. TRAF1 is unique among the TRAFs in that it lacks a RING domain, which is critical for the activation of  nuclear factor-kappaB and Jun NH2-terminal kinase. Studies on TRAF1-deficient mice suggest that TRAF1 has a negative regulatory role in TNFR-mediat
Probab=99.81  E-value=1.1e-19  Score=182.90  Aligned_cols=122  Identities=25%  Similarity=0.415  Sum_probs=98.8

Q ss_pred             cEEEEEEccccccC-----C--CeeeCCceeec--CeEEEEEEEeCCCCC--CCeEEEEEEEecCCCC-CCCCeEEEEEE
Q 001172           68 GKYTWRIEKFSQIS-----K--RELRSNAFEVG--GYKWYILIYPQGCDV--CNHLSLFLCVANHDKL-LPGWSHFAQFT  135 (1133)
Q Consensus        68 GshtwkIdNFS~LK-----g--e~I~Sp~FsVG--GhkWRI~~YPnG~~~--~dyLSLYLeL~s~~sl-~p~WsV~AqFt  135 (1133)
                      |+|+|+|.||++..     +  ..++|+.|+.+  ||+|+|++||||...  ++||||||+|+....+ ...|.|.++|+
T Consensus         1 g~~~W~i~~f~~~~~~a~~~~~~~~~S~~Fyt~~~Gy~w~i~~ypnG~~~~~~~~iSv~l~l~~g~~D~~l~wpv~~~~t   80 (147)
T cd03779           1 GTFLWKITDVSQKQRESSHGRDVSLCSPAFYTAKYGYKVCLRLYLNGDGAGKGTHISLFFVIMKGEYDALLPWPFRHKVT   80 (147)
T ss_pred             CeEEEEECcHHHHHHHHhcCCCceEECCCcccCCCCceEEEEEEcCCCCCCCCCEEEEEEEEecCCcccccCcceEEEEE
Confidence            68999999999753     2  36999999986  999999999999863  6899999999865321 23699999999


Q ss_pred             EEEEeCCCCCcee--ccc---ceecc----cCCCCcccccceecccccc---CCccCCEEEEEEEE
Q 001172          136 IAVVNRDPKKSKY--SDT---LHRFW----KKEHDWGWKKFMELSKVSD---GFKDGDTLIIKAQV  189 (1133)
Q Consensus       136 f~LLNQdGk~S~s--kes---~h~Fs----skesdWGfskFIkrsdLEs---GFLvDDsLTIeCdV  189 (1133)
                      |.|+||++.+...  ...   .+.|.    ..+..|||.+||++++|+.   +||+||+|+|+|+|
T Consensus        81 fsLlDq~~~~~~~~~~~~~~~~~~F~rP~~~~n~~~G~~~Fi~~~~Le~s~~~ylkDD~~~Irc~V  146 (147)
T cd03779          81 FMLLDQNNREHVIDAFRPDLSSASFQRPVSDMNVASGCPLFFPLKKLQSPKHAYCKDDTIYIKCVV  146 (147)
T ss_pred             EEEECCCCCCCCcEeecCCcccccccCcccCCCCCcchhheeEHHHhcccCCCcEeCCEEEEEEEE
Confidence            9999998654321  111   36686    3345799999999999994   89999999999998


No 13 
>cd03771 MATH_Meprin Meprin family, MATH domain; Meprins are multidomain, highly glycosylated extracellular metalloproteases, which are either anchored to the membrane or secreted into extracellular spaces. They are expressed in renal and intestinal brush border membranes, leukocytes, and cancer cells, and are capable of cleaving growth factors, cytokines, extracellular matrix proteins, and biologically active peptides. Meprin proteases are composed of two related subunits, alpha and beta, which form homo- or hetro-complexes where the basic unit is a disulfide-linked dimer. Despite their similarity, the two subunits differ in their ability to self-associate, in proteolytic processing during biosynthesis and in substrate specificity. Both subunits are synthesized as membrane spanning proteins, however, the alpha subunit is cleaved during biosynthesis and loses its transmembrane domain. Meprin beta forms homodimers or heterotetramers while meprin alpha oligomerizes into large complexes co
Probab=99.74  E-value=1.2e-17  Score=171.41  Aligned_cols=122  Identities=25%  Similarity=0.508  Sum_probs=94.8

Q ss_pred             cEEEEEEccccccC-----CCeeeCCce-eecCeEEEEEEEeCCCC-CCCeEEEEEEEecCCCC-CCCCe-EEEEEEEEE
Q 001172           68 GKYTWRIEKFSQIS-----KRELRSNAF-EVGGYKWYILIYPQGCD-VCNHLSLFLCVANHDKL-LPGWS-HFAQFTIAV  138 (1133)
Q Consensus        68 GshtwkIdNFS~LK-----ge~I~Sp~F-sVGGhkWRI~~YPnG~~-~~dyLSLYLeL~s~~sl-~p~Ws-V~AqFtf~L  138 (1133)
                      .+|+|+|.|||.++     +..++|+.| ++|||+|+|++||+|.. .++||||||+|+..... ...|. +.|+|+|+|
T Consensus         2 p~hvwkI~~yS~~~~~~~~g~~i~S~~FysvgGy~w~I~~YPnG~~~~~~~lSlyL~L~~g~~d~~L~WP~v~a~~t~~L   81 (167)
T cd03771           2 PEAVWRVRNFSQLLETTPKGTKIYSPRFYSPEGYAFQVGLYPNGTESYPGYTGLYFHLCSGENDDVLEWPCPNRQATMTL   81 (167)
T ss_pred             CeEEEEEcCchhhhhcCCCCCEEECCCCCccCCeEEEEEEEeCCCCCCCCcceEEEEEecCCccccccCcceeEEEEEEE
Confidence            47999999999985     358999999 99999999999999976 46799999999875432 34688 689999999


Q ss_pred             EeCCCC----Cceec----c----c----ceeccc-----------------CCCCcccccceeccccc-cCCccCCEEE
Q 001172          139 VNRDPK----KSKYS----D----T----LHRFWK-----------------KEHDWGWKKFMELSKVS-DGFKDGDTLI  184 (1133)
Q Consensus       139 LNQdGk----~S~sk----e----s----~h~Fss-----------------kesdWGfskFIkrsdLE-sGFLvDDsLT  184 (1133)
                      +||++.    .+.+.    +    .    ...|..                 .+.+|||..||++++|+ ..||+||+|.
T Consensus        82 lDQ~~~~~~r~~~~~~~~~dp~~~~~~~~~~~~~rP~~~~~~~~~~~~~~~~~~~g~G~~~Fis~~~L~~r~ylk~dtl~  161 (167)
T cd03771          82 LDQDPDIQQRMSNQRSFTTDPSMTSSDNGEYFWDRPSKVGSYDTDTNGCTCYRGPGYGWSTFISHSRLRRRDFLKGDDLI  161 (167)
T ss_pred             ECCCCcccccCcceEEEecCCcccccccccccccCCccccccccccccccccccCccccccceeHHHhccCCCCcCCEEE
Confidence            999631    11111    0    0    011211                 23479999999999999 5799999999


Q ss_pred             EEEEE
Q 001172          185 IKAQV  189 (1133)
Q Consensus       185 IeCdV  189 (1133)
                      |+++|
T Consensus       162 i~~~~  166 (167)
T cd03771         162 ILLDF  166 (167)
T ss_pred             EEEEe
Confidence            99986


No 14 
>PF00917 MATH:  MATH domain;  InterPro: IPR002083 Although apparently functionally unrelated, intracellular TRAFs and extracellular meprins share a conserved region of about 180 residues, the meprin and TRAF homology (MATH) domain []. Meprins are mammalian tissue-specific metalloendopeptidases of the astacin family implicated in developmental, normal and pathological processes by hydrolysing a variety of proteins. Various growth factors, cytokines, and extracellular matrix proteins are substrates for meprins. They are composed of five structural domains: an N-terminal endopeptidase domain, a MAM domain (see PDOC00604 from PROSITEDOC), a MATH domain, an EGF-like domain (see PDOC00021 from PROSITEDOC) and a C-terminal transmembrane region. Meprin A and B form membrane bound homotetramer whereas homooligomers of meprin A are secreted. A proteolitic site adjacent to the MATH domain, only present in meprin A, allows the release of the protein from the membrane []. TRAF proteins were first isolated by their ability to interact with TNF receptors []. They promote cell survival by the activation of downstream protein kinases and, finally, transcription factors of the NF-kB and AP-1 family. The TRAF proteins are composed of 3 structural domains: a RING finger (see PDOC00449 from PROSITEDOC) in the N-terminal part of the protein, one to seven TRAF zinc fingers (see PDOC50145 from PROSITEDOC) in the middle and the MATH domain in the C-terminal part []. The MATH domain is necessary and sufficient for self-association and receptor interaction. From the structural analysis two consensus sequence recognised by the TRAF domain have been defined: a major one, [PSAT]x[QE]E and a minor one, PxQxxD []. The structure of the TRAF2 protein reveals a trimeric self-association of the MATH domain []. The domain forms a new, light-stranded antiparallel beta sandwich structure. A coiled-coil region adjacent to the MATH domain is also important for the trimerisation. The oligomerisation is essential for establishing appropriate connections to form signalling complexes with TNF receptor-1. The ligand binding surface of TRAF proteins is located in beta-strands 6 and 7 [].; GO: 0005515 protein binding; PDB: 1D00_E 1CZY_A 1D01_F 1CA9_A 1D0J_D 1F3V_B 1CA4_C 1D0A_A 1QSC_C 1CZZ_C ....
Probab=99.73  E-value=7.4e-18  Score=157.35  Aligned_cols=115  Identities=29%  Similarity=0.561  Sum_probs=94.7

Q ss_pred             EccccccC-C-CeeeCCceeecCeEEEEEEEeCCCCCCCeEEEEEEEecCCCCC-CCCeEEEEEEEEEEeCCCCCceecc
Q 001172           74 IEKFSQIS-K-RELRSNAFEVGGYKWYILIYPQGCDVCNHLSLFLCVANHDKLL-PGWSHFAQFTIAVVNRDPKKSKYSD  150 (1133)
Q Consensus        74 IdNFS~LK-g-e~I~Sp~FsVGGhkWRI~~YPnG~~~~dyLSLYLeL~s~~sl~-p~WsV~AqFtf~LLNQdGk~S~ske  150 (1133)
                      |+|||+++ + ..+.|+.|.+||++|+|.+||+|.  .+||++||+|....... ..|+|.|+|+|.|+++.+.......
T Consensus         1 i~nfs~l~~~~~~~~s~~~~~~g~~W~l~~~~~~~--~~~l~~~L~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~   78 (119)
T PF00917_consen    1 IKNFSKLKEGEEYSSSFVFSHGGYPWRLKVYPKGN--GKYLSVYLHCDKGENDSDLEWSIEAEFRFRLLNQNGKSISKRI   78 (119)
T ss_dssp             ETTGGGHHTSEEEEEEEESSTTSEEEEEEEETTES--TTEEEEEEEEECSTTGGGSSSSEEEEEEEEEE-TTSCEEEEEE
T ss_pred             CcccceEeCCCcEECCCeEEECCEEEEEEEEeCCC--cCcEEEEEEEeecccccccceeeeEEEEEEEecCCCCcceeee
Confidence            78999997 3 345568999999999999999986  67999999999875442 4799999999999999876522222


Q ss_pred             cceecccCCCCcccccceeccccccC-CccCCEEEEEEEEEE
Q 001172          151 TLHRFWKKEHDWGWKKFMELSKVSDG-FKDGDTLIIKAQVQV  191 (1133)
Q Consensus       151 s~h~FsskesdWGfskFIkrsdLEsG-FLvDDsLTIeCdVtV  191 (1133)
                      ..+.|... .+|||.+||+|++|++. ||+||+|+|+|+|+|
T Consensus        79 ~~~~F~~~-~~~g~~~fi~~~~l~~~~fl~dd~l~ie~~v~I  119 (119)
T PF00917_consen   79 KSHSFNNP-SSWGWSSFISWEDLEDPYFLVDDSLTIEVEVKI  119 (119)
T ss_dssp             ECEEECTT-SEEEEEEEEEHHHHTTCTTSBTTEEEEEEEEEE
T ss_pred             eeeEEeee-cccchhheeEHHHhCccCCeECCEEEEEEEEEC
Confidence            25888764 88999999999999964 799999999999986


No 15 
>cd03778 MATH_TRAF2 Tumor Necrosis Factor Receptor (TNFR) Associated Factor (TRAF) family, TRAF2 subfamily, TRAF domain; TRAF molecules serve as adapter proteins that link TNFRs and downstream kinase cascades resulting in the activation of transcription factors and the regulation of cell survival, proliferation and stress responses. TRAF2 associates with the receptors TNFR-1, TNFR-2, RANK (which mediates differentiation and maturation of osteoclasts) and CD40 (which is important for the proliferation and activation of B cells), among others. It regulates distinct pathways that lead to the activation of nuclear factor-kappaB and Jun NH2-terminal kinases. TRAF2 also indirectly associates with death receptors through its interaction with TRADD (TNFR-associated death domain protein). It is involved in regulating oxidative stress or ROS-induced cell death and in the preconditioning of cells by sublethal stress for protection from subsequent injury. TRAF2 contains a RING finger domain, five z
Probab=99.66  E-value=5.5e-16  Score=158.90  Aligned_cols=125  Identities=26%  Similarity=0.422  Sum_probs=100.0

Q ss_pred             ccccEEEEEEccccccC-----C--CeeeCCceee--cCeEEEEEEEeCCCCC--CCeEEEEEEEecCCCCC-CCCeEEE
Q 001172           65 ELYGKYTWRIEKFSQIS-----K--RELRSNAFEV--GGYKWYILIYPQGCDV--CNHLSLFLCVANHDKLL-PGWSHFA  132 (1133)
Q Consensus        65 ~~tGshtwkIdNFS~LK-----g--e~I~Sp~FsV--GGhkWRI~~YPnG~~~--~dyLSLYLeL~s~~sl~-p~WsV~A  132 (1133)
                      ...|+|+|+|.||+.+.     +  ..++|+.|+.  +||+|+|++||+|+..  +.||||||+++.++.+. ..|.+..
T Consensus        16 ~~~g~fiWkI~~fs~~~~~a~~~~~~~i~Sp~Fyt~~~GYk~~l~~ylnG~g~~~g~~LSly~~l~~Ge~D~~L~WPf~~   95 (164)
T cd03778          16 TYDGVFIWKISDFARKRQEAVAGRIPAIFSPAFYTSRYGYKMCLRIYLNGDGTGRGTHLSLFFVVMKGPNDALLRWPFNQ   95 (164)
T ss_pred             ccCCEEEEEECcHHHHHHHHhcCCCceEECCCcccCCCCeEEEEEEEeCCCCCCCCCEEEEEEEEecCCcCcccCCceee
Confidence            45799999999999875     2  3699999986  5899999999999864  67999999999887654 6899999


Q ss_pred             EEEEEEEeCCCCCceec----c-cceecc----cCCCCcccccceeccccc--cCCccCCEEEEEEEE
Q 001172          133 QFTIAVVNRDPKKSKYS----D-TLHRFW----KKEHDWGWKKFMELSKVS--DGFKDGDTLIIKAQV  189 (1133)
Q Consensus       133 qFtf~LLNQdGk~S~sk----e-s~h~Fs----skesdWGfskFIkrsdLE--sGFLvDDsLTIeCdV  189 (1133)
                      +++|+|+||+.......    + ....|.    ..+..|||..||++++|+  .+||+||+|.|+|.|
T Consensus        96 ~itl~llDQ~~r~hi~~~~~pd~~~~~f~RP~~~~n~~~G~~~Fv~l~~l~~~~~Yv~dDtlfIk~~V  163 (164)
T cd03778          96 KVTLMLLDQNNREHVIDAFRPDVTSSSFQRPVNDMNIASGCPLFCPVSKXEAKNSYVRDDAIFIKAIV  163 (164)
T ss_pred             EEEEEEECCCCCCcceeEEEcCcchHhcCCCCcccccCcCcceEEEhhHccccCCcccCCeEEEEEEE
Confidence            99999999965332211    1 112342    234579999999999997  589999999999987


No 16 
>smart00061 MATH meprin and TRAF homology.
Probab=99.60  E-value=5.2e-15  Score=132.75  Aligned_cols=93  Identities=26%  Similarity=0.411  Sum_probs=79.2

Q ss_pred             EEEEEccccccC-CCeeeCCceeecCeEEEEEEEeCCCCCCCeEEEEEEEecCCCCCCCCeEEEEEEEEEEeCCCCCcee
Q 001172           70 YTWRIEKFSQIS-KRELRSNAFEVGGYKWYILIYPQGCDVCNHLSLFLCVANHDKLLPGWSHFAQFTIAVVNRDPKKSKY  148 (1133)
Q Consensus        70 htwkIdNFS~LK-ge~I~Sp~FsVGGhkWRI~~YPnG~~~~dyLSLYLeL~s~~sl~p~WsV~AqFtf~LLNQdGk~S~s  148 (1133)
                      ++|+|.+|+.+. ++.+.|+.|.+|||.|+|++||+    .+||||||+|.........|.|.|+|+|+|+|++++.. .
T Consensus         2 ~~~~~~~~~~~~~~~~~~S~~f~~~g~~W~i~~~p~----~~~lsl~L~~~~~~~~~~~w~v~a~~~~~l~~~~~~~~-~   76 (95)
T smart00061        2 LSHTFKNVSRLEEGESYFSPSEEHFNIPWRLKIYRK----NGFLSLYLHCEKEECDSRKWSIEAEFTLKLVSQNGKSL-S   76 (95)
T ss_pred             ceeEEEchhhcccCceEeCChhEEcCceeEEEEEEc----CCEEEEEEEeCCCcCCCCCeEEEEEEEEEEEeCCCCEE-e
Confidence            679999999984 68899999999999999999999    48999999998665443479999999999999987643 3


Q ss_pred             cccceecccCCCCcccccce
Q 001172          149 SDTLHRFWKKEHDWGWKKFM  168 (1133)
Q Consensus       149 kes~h~FsskesdWGfskFI  168 (1133)
                      ....+.|.. ..+|||.+||
T Consensus        77 ~~~~~~F~~-~~~~G~~~fi   95 (95)
T smart00061       77 KKDKHVFEK-PSGWGFSKFI   95 (95)
T ss_pred             eeeeEEEcC-CCccceeeEC
Confidence            445788876 6889999986


No 17 
>cd03783 MATH_Meprin_Alpha Meprin family, Alpha subunit, MATH domain; Meprins are multidomain extracellular metalloproteases capable of cleaving growth factors, cytokines, extracellular matrix proteins, and biologically active peptides. They are composed of two related subunits, alpha and beta, which form homo- or hetro-complexes where the basic unit is a disulfide-linked dimer. The alpha subunit is synthesized as a membrane spanning protein, however, it is cleaved during biosynthesis and loses its transmembrane domain. It oligomerizes into large complexes, containing 10-100 subunits (dimers that associate noncovalently), which are secreted as latent proteases and can move through extracellular spaces in a nondestructive manner. This allows delivery of the concentrated protease to sites containing activating enzymes, such as sites of inflammation, infection or cancerous growth. Meprin alpha shows preference for small or hydrophobic residues at the P1 and P1' sites of its substrate. Both
Probab=99.33  E-value=5.3e-12  Score=129.93  Aligned_cols=121  Identities=29%  Similarity=0.538  Sum_probs=93.6

Q ss_pred             EEEEEEccccccC-----CCeeeCCceee-cCeEEEEEEEeCCCC---CCCeEEEEEEEecCCCC-CCCCeEE-EEEEEE
Q 001172           69 KYTWRIEKFSQIS-----KRELRSNAFEV-GGYKWYILIYPQGCD---VCNHLSLFLCVANHDKL-LPGWSHF-AQFTIA  137 (1133)
Q Consensus        69 shtwkIdNFS~LK-----ge~I~Sp~FsV-GGhkWRI~~YPnG~~---~~dyLSLYLeL~s~~sl-~p~WsV~-AqFtf~  137 (1133)
                      .|+|+|.||+++.     ...++|+.|+. .||+.+|++||+|..   .+.|||||++++.++.+ ...|.+. -+++|.
T Consensus         3 ~~iWkI~nfs~~~~~a~~~~~i~Sp~Fyt~~GYk~~l~~~lng~~~~~~g~~lSl~~~lm~Ge~D~~L~WP~~~~~itl~   82 (167)
T cd03783           3 NAVWRVRNFSQILENTTKGDVLQSPRFYSPEGYGYGVSLYPLSNESDYSGNYTGLYFHLCSGENDAVLEWPALNRQAIIT   82 (167)
T ss_pred             ceeEEECcHHHHHHhCcCCCeEECCCCccCCCceEEEEEEecCCCCCCCCCEEEEEEEEecccCCCcccCCCcCCEEEEE
Confidence            5899999999864     35799999988 499999999999975   36799999999987654 3479964 589999


Q ss_pred             EEeCCCC----Cce----ecc---c------ceeccc--------------CCCCcccccceeccccc-cCCccCCEEEE
Q 001172          138 VVNRDPK----KSK----YSD---T------LHRFWK--------------KEHDWGWKKFMELSKVS-DGFKDGDTLII  185 (1133)
Q Consensus       138 LLNQdGk----~S~----ske---s------~h~Fss--------------kesdWGfskFIkrsdLE-sGFLvDDsLTI  185 (1133)
                      |+||++.    .+.    ...   .      ...|.+              .+.+|||..||.++.|+ .+||+||+|.|
T Consensus        83 llDQ~~~~~~r~~~~~sf~~d~~~~~~~~~~~~~f~rP~~~~~~~~~~~~~~~~gfG~~~Fish~~L~~r~yikdDtlfI  162 (167)
T cd03783          83 VLDQDPDVRLRMSSSRSFTTDKSQTSSAINGTLRWDRPSRVGTYDTSCDCFRGIDFGWSTFISHSQLRRRSFLKNDDLII  162 (167)
T ss_pred             EEcCCcchhhccccceeeecCCCcccccccccccccCCcccccccccccccCCcccccccceeHHHHhhCCcccCCeEEE
Confidence            9999741    111    001   0      011332              23579999999999999 78999999999


Q ss_pred             EEEE
Q 001172          186 KAQV  189 (1133)
Q Consensus       186 eCdV  189 (1133)
                      .+++
T Consensus       163 ~~~~  166 (167)
T cd03783         163 FVDF  166 (167)
T ss_pred             EEec
Confidence            9876


No 18 
>KOG1863 consensus Ubiquitin carboxyl-terminal hydrolase [Posttranslational modification, protein turnover, chaperones]
Probab=99.28  E-value=3.2e-12  Score=162.27  Aligned_cols=236  Identities=22%  Similarity=0.244  Sum_probs=170.0

Q ss_pred             cEEEEEEccccccCCCeeeCCceeecCeEEEEEEEeCCCCCCCeEEEEEEEecCCCCCCCCeEEEEEEEEEEeC-CCCCc
Q 001172           68 GKYTWRIEKFSQISKRELRSNAFEVGGYKWYILIYPQGCDVCNHLSLFLCVANHDKLLPGWSHFAQFTIAVVNR-DPKKS  146 (1133)
Q Consensus        68 GshtwkIdNFS~LKge~I~Sp~FsVGGhkWRI~~YPnG~~~~dyLSLYLeL~s~~sl~p~WsV~AqFtf~LLNQ-dGk~S  146 (1133)
                      ..++|.+.+...+.. ...++.|..|+..|+|++.|+++ ....+++||.|...... ..|++++++.+.++|. +....
T Consensus        27 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~~~s~~~~~~~~v~~~~~~~~~  103 (1093)
T KOG1863|consen   27 QSTTIDGIDDKSLLY-RALSSNFGAGATKWKILIAPKVN-SLQSTRKKLEVMPSQSL-KSWSCGAQAVLRVKNTIDNLPD  103 (1093)
T ss_pred             ccccccCcCcchhhh-HhcCccccccccceeeeeccccC-cccceeEEeeeccCCCC-cceEecchhhhccccCCCCchh
Confidence            334466655555444 67899999999999999999987 45789999999876654 4599999999999993 22222


Q ss_pred             eecccceecccCCCCcccccceeccccc---cCCccCCEEEEEEEEEEeeccCCCCccccccccccceee-ecCCchHHH
Q 001172          147 KYSDTLHRFWKKEHDWGWKKFMELSKVS---DGFKDGDTLIIKAQVQVIREKTDRPFRCLDCQYRRELVR-VYLTNVEQI  222 (1133)
Q Consensus       147 ~skes~h~FsskesdWGfskFIkrsdLE---sGFLvDDsLTIeCdVtVIkd~~~~~~r~ld~dskk~tI~-VPpSNLgaT  222 (1133)
                      ......|.|.....+|||.+|+.+++|.   .+|+++|++.+++.|.|...+...    ..||.++.+++ |++.|+|+|
T Consensus       104 ~~~~~~h~~~~~~~dwg~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~----~~~d~k~~tg~~vGL~N~GaT  179 (1093)
T KOG1863|consen  104 PEKAIHHVFTADERDWGFSCFSTSSDIRKPEDGYVRNGLEKLEKRVRVEQPTSLM----NPYDSKRLTGFPVGLKNLGAT  179 (1093)
T ss_pred             hhhhhhhcccccccchhhccchhHhhccCcccccccccceeeeeeeeeecCCccc----chhhhhhcCCCCccccCCCce
Confidence            3346789999889999999999999998   599999999999999998875431    36888899998 999999999


Q ss_pred             HhhhhhHHHhhHHHHHhhhccccccchhhhccCHHHHhhhccccchhhhhhhhhhhcchhhhhhhHHHHHHhhccccccc
Q 001172          223 CRRFVEERRGKLGRLIEDKARWSSFCAFWLGIDQNARRRMSREKTDAILKVVVKHFFIEKEVTSTLVMDSLYSGLKALEG  302 (1133)
Q Consensus       223 C~~fved~~~hLG~LLqS~~~~ssFRaf~~a~~~~~r~~m~~e~sDVilkV~~k~Ff~~keVtSplvmd~L~~glk~le~  302 (1133)
                      |         |++.|||+..-...||.-++.+.+...   ...+.              +  .-++.|+.||.-|+.-..
T Consensus       180 C---------Y~NsllQ~lf~~~~FR~~Vy~~~~~~~---~~~~~--------------~--~v~~~lq~lF~~LQ~s~~  231 (1093)
T KOG1863|consen  180 C---------YVNSLLQVLFLIPEFRRAVYSIPPFTG---HEDPR--------------R--SIPLALQRLFYELQMSKR  231 (1093)
T ss_pred             e---------eehHHHHHHHccHHHHHHHhcCCCCCC---ccccc--------------c--hHHHHHHHHHHHHhhcCC
Confidence            9         999999998888888887777664111   11110              0  126678888888876522


Q ss_pred             ccccccccccccccCCCCCCceEeecceee------ehhhHHHHHHHhhcC
Q 001172          303 QSKSKKTKAKLLDAEDTPPPIVHVENDMFV------LVDDVLLLLERAALE  347 (1133)
Q Consensus       303 ~t~~~k~~~~~~~t~e~p~~~~~v~~d~~~------l~~Dll~~leR~al~  347 (1133)
                               +.++|.++.....+...+.|.      |..-++..||+...+
T Consensus       232 ---------k~Vdt~~~~~~~~~~~~~~~~QqDvqEf~~~l~d~LE~~~~~  273 (1093)
T KOG1863|consen  232 ---------KYVDTSELTKSLGWDSNDSFEQQDVQEFLTKLLDWLEDSMID  273 (1093)
T ss_pred             ---------CCcCchhhhhhhhcccccHHhhhhHHHHHHHHHHHHHhhccc
Confidence                     356666664333322212222      333444566666433


No 19 
>cd03782 MATH_Meprin_Beta Meprin family, Beta subunit, MATH domain; Meprins are multidomain extracellular metalloproteases capable of cleaving growth factors, cytokines, extracellular matrix proteins, and biologically active peptides. They are composed of two related subunits, alpha and beta, which form homo- or hetro-complexes where the basic unit is a disulfide-linked dimer. The beta subunit is a type I membrane protein, which forms homodimers or heterotetramers (alpha2beta2 or alpha3beta). Meprin beta shows preference for acidic residues at the P1 and P1' sites of its substrate. Among its best substrates are growth factors and chemokines such as gastrin and osteopontin. Both alpha and beta subunits contain a catalytic astacin (M12 family) protease domain followed by the adhesion or interaction domains MAM, MATH and AM. The MATH and MAM domains provide symmetrical intersubunit disulfide bonds necessary for the dimerization of meprin subunits. The MATH domain may also be required for f
Probab=99.27  E-value=1.8e-11  Score=125.74  Aligned_cols=122  Identities=20%  Similarity=0.394  Sum_probs=94.8

Q ss_pred             cEEEEEEccccccC-----CCeeeCCceee-cCeEEEEEEEeCCCCC-CCeEEEEEEEecCCCC-CCCCeEE-EEEEEEE
Q 001172           68 GKYTWRIEKFSQIS-----KRELRSNAFEV-GGYKWYILIYPQGCDV-CNHLSLFLCVANHDKL-LPGWSHF-AQFTIAV  138 (1133)
Q Consensus        68 GshtwkIdNFS~LK-----ge~I~Sp~FsV-GGhkWRI~~YPnG~~~-~dyLSLYLeL~s~~sl-~p~WsV~-AqFtf~L  138 (1133)
                      -.|+|+|.||+++.     ...++|+.|+. .||+.+|.+||+|.+. +.|||||++|+.++.+ ...|.+. -+++|.|
T Consensus         2 p~~iWkI~~fs~~~~~~~~~~~i~Sp~FYt~~GYkl~l~~ylnG~g~~~~~lsl~~~lm~Ge~D~~L~WPf~~~qit~~L   81 (167)
T cd03782           2 PEHIWHIRNFTQLLATTPPNGKIYSPPFLSSTGYSFQVGLYLNGTDDYPGNLAIYLHLTSGPNDDQLQWPCPWQQATMML   81 (167)
T ss_pred             CcEEEEeCcHHHHHHhcCCCceEECCCCcCccCceeEEEEEecCCCCCCCEEEEEEEEeccCCCccccCCCcCCeEEEEE
Confidence            35899999999864     35799999975 7999999999999753 5799999999987654 3479999 8999999


Q ss_pred             EeCCC---C-Cceec------c---c-ceec--ccC-----------------CCCcccccceeccccc-cCCccCCEEE
Q 001172          139 VNRDP---K-KSKYS------D---T-LHRF--WKK-----------------EHDWGWKKFMELSKVS-DGFKDGDTLI  184 (1133)
Q Consensus       139 LNQdG---k-~S~sk------e---s-~h~F--ssk-----------------esdWGfskFIkrsdLE-sGFLvDDsLT  184 (1133)
                      +||++   + .+...      .   . ...|  ...                 +.+|||..||++++|+ ..||+||.|.
T Consensus        82 lDQ~~d~~~r~~~~~~~t~~P~~~s~~n~~f~w~rP~kvg~~~~~~~~~~~~r~~~~G~~~Fish~~L~~r~yikdD~if  161 (167)
T cd03782          82 LDQHPDIRQRMSNQRSVTTDPNMTSTDSDEYFWDDPRKVGSEVTDTDGSTFYRGPGYGTSAFITHLRLRSRDFIKGDDVI  161 (167)
T ss_pred             EcCCCchhhccceeeeEEecCCcccccCccceecCCcccCcccccccccccccccccCccceeeHHHHhhcCcccCCeEE
Confidence            99974   1 12111      0   0 1134  211                 4689999999999999 6899999999


Q ss_pred             EEEEE
Q 001172          185 IKAQV  189 (1133)
Q Consensus       185 IeCdV  189 (1133)
                      |-+++
T Consensus       162 i~~~~  166 (167)
T cd03782         162 FLLTM  166 (167)
T ss_pred             EEEec
Confidence            98775


No 20 
>KOG1987 consensus Speckle-type POZ protein SPOP and related proteins with TRAF, MATH and BTB/POZ domains [Cell cycle control, cell division, chromosome partitioning; General function prediction only]
Probab=97.67  E-value=0.00021  Score=78.65  Aligned_cols=116  Identities=32%  Similarity=0.653  Sum_probs=88.9

Q ss_pred             EEEEEccccccCCCeeeCCceeecCeEEEEEEEeCCCCCCCeEEEEEEEecCCCCCCCCeEEEEEEEEEEeCCCCCc-ee
Q 001172           70 YTWRIEKFSQISKRELRSNAFEVGGYKWYILIYPQGCDVCNHLSLFLCVANHDKLLPGWSHFAQFTIAVVNRDPKKS-KY  148 (1133)
Q Consensus        70 htwkIdNFS~LKge~I~Sp~FsVGGhkWRI~~YPnG~~~~dyLSLYLeL~s~~sl~p~WsV~AqFtf~LLNQdGk~S-~s  148 (1133)
                      +.|.|.+|+... -.++|..|..||..|++.+||.|+    |+++|+.+....    +|.+.+.|.|.+.|+..... ..
T Consensus         6 ~~~~~~~~~~~~-l~~ys~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~----~~~~~~~~~l~v~n~~~~~~~~~   76 (297)
T KOG1987|consen    6 FTWVISNFSSVG-LVIYSNGFVKGGCKWRLSAYPKGN----YLSLTLSVSDSP----GWERYAKLRLTVVNQKSEKYLST   76 (297)
T ss_pred             cceeeccCcchh-hhccccceeecCceEEEEEecCCC----EEEEEEEeccCC----CcceeEEEEEEEccCCCcceeee
Confidence            338999998765 668899999999999999999984    899999886543    69999999999999966532 22


Q ss_pred             c-ccceecccC--CCCcccccceeccccc---cCCccCCEEEEEEEEEEeec
Q 001172          149 S-DTLHRFWKK--EHDWGWKKFMELSKVS---DGFKDGDTLIIKAQVQVIRE  194 (1133)
Q Consensus       149 k-es~h~Fssk--esdWGfskFIkrsdLE---sGFLvDDsLTIeCdVtVIkd  194 (1133)
                      . .....|...  ...||+..++....+.   .||+.++.++|-+.+.|++.
T Consensus        77 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~g~~~~~~~~~~a~~~V~~~  128 (297)
T KOG1987|consen   77 VEEGFSWFRFNKVLKEWGFGKMLPLTLLIDCSNGFLVAHKLVLVARSEVFEA  128 (297)
T ss_pred             eeeeEEeccccccccccCcccccChHHhhcccCcEEEcCceEEEeeecceee
Confidence            2 233344332  4689998888877776   58999988888777776654


No 21 
>PHA03098 kelch-like protein; Provisional
Probab=96.66  E-value=0.00032  Score=83.16  Aligned_cols=134  Identities=10%  Similarity=0.019  Sum_probs=93.4

Q ss_pred             cccchhhhhh--hhhhhcchhhh---hhhHHHHHHhhccc--cccc--ccccccccccccccCCCCCCceEeecceeeeh
Q 001172          264 REKTDAILKV--VVKHFFIEKEV---TSTLVMDSLYSGLK--ALEG--QSKSKKTKAKLLDAEDTPPPIVHVENDMFVLV  334 (1133)
Q Consensus       264 ~e~sDVilkV--~~k~Ff~~keV---tSplvmd~L~~glk--~le~--~t~~~k~~~~~~~t~e~p~~~~~v~~d~~~l~  334 (1133)
                      ++.+||+|.+  .+++|.+||.|   +|++|..=+.++|+  .++-  ....++.+++++|++++.     +..+   -+
T Consensus         7 ~~~~Dv~l~~~~~~~~~~~Hk~vLaa~S~yF~~mf~~~~~~~~i~l~~~~~~~~~~l~y~Ytg~~~-----i~~~---~~   78 (534)
T PHA03098          7 QKFCDESIIIVNGGGIIKVHKIILSSSSEYFKKMFKNNFKENEINLNIDYDSFNEVIKYIYTGKIN-----ITSN---NV   78 (534)
T ss_pred             CCCCCEEEEEEcCCEEEEeHHHHHHhhhHHHHHHHhCCCCCceEEecCCHHHHHHHHHHhcCCceE-----EcHH---HH
Confidence            4569999998  99999999987   66777654444553  1111  112377788999998863     2222   27


Q ss_pred             hhHHHHHHHhhcCCCCCCCCCCCccccccCCCCCCcccchhH-----HHHHHhhHHhhhHHHHHHHhhhhhHHHHHHHHH
Q 001172          335 DDVLLLLERAALEPLPPKDEKGPQNRTKESNSGEDFNKDSIE-----RDERRLTELGRRTVEIFVLAHIFSNKIEVAYQE  409 (1133)
Q Consensus       335 ~Dll~~leR~al~~Lp~~d~~~~q~~~~~~~~~~dvs~dtve-----~ee~~~~Elgrr~~~ifa~~k~F~~rv~~Ay~E  409 (1133)
                      .|||.+++++.++.|..   .|.+.+.+.      ++.++..     -+.++|.+|...|+++..  ++|.   ++.-+|
T Consensus        79 ~~ll~~A~~l~~~~l~~---~C~~~l~~~------l~~~nc~~~~~~a~~~~~~~L~~~~~~~i~--~nf~---~v~~~~  144 (534)
T PHA03098         79 KDILSIANYLIIDFLIN---LCINYIIKI------IDDNNCIDIYRFSFFYGCKKLYSAAYNYIR--NNIE---LIYNDP  144 (534)
T ss_pred             HHHHHHHHHhCcHHHHH---HHHHHHHHh------CCHhHHHHHHHHHHHcCcHHHHHHHHHHHH--HHHH---HHhcCc
Confidence            89999999999999986   688888776      5555533     289999999999999976  5564   334445


Q ss_pred             HHHHHHHHHH
Q 001172          410 AVALKRQEEL  419 (1133)
Q Consensus       410 ~~A~~rQ~~L  419 (1133)
                      .+..+-.+.|
T Consensus       145 ~f~~l~~~~l  154 (534)
T PHA03098        145 DFIYLSKNEL  154 (534)
T ss_pred             hhhcCCHHHH
Confidence            5555544433


No 22 
>PHA02713 hypothetical protein; Provisional
Probab=96.56  E-value=0.00078  Score=81.46  Aligned_cols=134  Identities=13%  Similarity=0.009  Sum_probs=88.9

Q ss_pred             cchhhhhhh-hhhhcchhhh---hhhHHHHHHhhccccc--------cccc-ccccccccccccCCCCCCceEeecceee
Q 001172          266 KTDAILKVV-VKHFFIEKEV---TSTLVMDSLYSGLKAL--------EGQS-KSKKTKAKLLDAEDTPPPIVHVENDMFV  332 (1133)
Q Consensus       266 ~sDVilkV~-~k~Ff~~keV---tSplvmd~L~~glk~l--------e~~t-~~~k~~~~~~~t~e~p~~~~~v~~d~~~  332 (1133)
                      .+||+|.|. ++.|.+||.|   .|++|..=+-++|+.-        ..-+ ..++.++.++||..+.       .+   
T Consensus        25 l~DV~L~v~~~~~f~~Hr~vLaa~S~YF~amF~~~~~e~~~~~~v~l~~v~~~~~~~ll~y~Yt~~i~-------~~---   94 (557)
T PHA02713         25 LCDVIITIGDGEEIKAHKTILAAGSKYFRTLFTTPMIIRDLVTRVNLQMFDKDAVKNIVQYLYNRHIS-------SM---   94 (557)
T ss_pred             CCCEEEEeCCCCEEeehHHHHhhcCHHHHHHhcCCchhhccCceEEeccCCHHHHHHHHHHhcCCCCC-------HH---
Confidence            589999998 8999999988   5566544333455421        1111 1256668899997642       11   


Q ss_pred             ehhhHHHHHHHhhcCCCCCCCCCCCccccccCCCCCCcccchhH-----HHHHHhhHHhhhHHHHHHHhhhhhHHHHHHH
Q 001172          333 LVDDVLLLLERAALEPLPPKDEKGPQNRTKESNSGEDFNKDSIE-----RDERRLTELGRRTVEIFVLAHIFSNKIEVAY  407 (1133)
Q Consensus       333 l~~Dll~~leR~al~~Lp~~d~~~~q~~~~~~~~~~dvs~dtve-----~ee~~~~Elgrr~~~ifa~~k~F~~rv~~Ay  407 (1133)
                      -+.|||.+++.+.++.|..   .|.+.+.+.      ++.++.-     -+.+.|.+|..++.+|.  .++|.   +++-
T Consensus        95 nv~~ll~aA~~lqi~~l~~---~C~~~l~~~------l~~~NCl~i~~~~~~~~~~~L~~~a~~~i--~~~f~---~v~~  160 (557)
T PHA02713         95 NVIDVLKCADYLLIDDLVT---DCESYIKDY------TNHDTCIYMYHRLYEMSHIPIVKYIKRML--MSNIP---TLIT  160 (557)
T ss_pred             HHHHHHHHHHHHCHHHHHH---HHHHHHHhh------CCccchHHHHHHHHhccchHHHHHHHHHH--HHHHH---HHhC
Confidence            3799999999999999986   688888776      5544422     15566677776665554  47775   5555


Q ss_pred             HHHHHHHHHHHHHHHH
Q 001172          408 QEAVALKRQEELIREE  423 (1133)
Q Consensus       408 ~E~~A~~rQ~~Ll~E~  423 (1133)
                      +|.|..+..++|++=+
T Consensus       161 ~~ef~~L~~~~l~~lL  176 (557)
T PHA02713        161 TDAFKKTVFEILFDII  176 (557)
T ss_pred             ChhhhhCCHHHHHHHh
Confidence            6777776666665433


No 23 
>KOG1029 consensus Endocytic adaptor protein intersectin [Signal transduction mechanisms; Intracellular trafficking, secretion, and vesicular transport]
Probab=96.29  E-value=0.012  Score=72.18  Aligned_cols=38  Identities=29%  Similarity=0.451  Sum_probs=17.2

Q ss_pred             hhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          399 FSNKIEVAYQEAVA-LKRQEELIREEEAAWLAESEQKAK  436 (1133)
Q Consensus       399 F~~rv~~Ay~E~~A-~~rQ~~Ll~E~E~e~~~~~e~~~k  436 (1133)
                      |+.+-+.-|.-+.| ++|+.+.|+|.+..+|++.|+|+|
T Consensus       314 FEDKrkeNy~kGqaELerRRq~leeqqqreree~eqkEr  352 (1118)
T KOG1029|consen  314 FEDKRKENYEKGQAELERRRQALEEQQQREREEVEQKER  352 (1118)
T ss_pred             hhhhhHHhHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            44444444444444 334455555554444444444333


No 24 
>KOG2412 consensus Nuclear-export-signal (NES)-containing protein/polyadenylated-RNA export factor [RNA processing and modification]
Probab=96.22  E-value=0.048  Score=65.42  Aligned_cols=53  Identities=23%  Similarity=0.262  Sum_probs=33.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHhhh-HHHHHHHHHHHHHHHHhhhhhhhHHHHHH
Q 001172          416 QEELIREEEAAWLAESEQKAKRGA-AEKEKKAKKKLAKQKRNNRKGKEKKREER  468 (1133)
Q Consensus       416 Q~~Ll~E~E~e~~~~~e~~~kk~~-k~kekk~~~k~ak~~e~~rk~~e~~~ee~  468 (1133)
                      |.+-..|++++++.+.||++++.. .+++|+..++..+++|+.|+++||..||+
T Consensus       211 qi~~~~~~~e~kr~Eaerk~~~~qEe~Rqk~d~~~~~~eqekiR~~eekqeee~  264 (591)
T KOG2412|consen  211 QIRERKERSEEKREEAERKRRAHQEELRQKEDEEAELQEQEKIRAEEEKQEEER  264 (591)
T ss_pred             HHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHH
Confidence            334444556666777777666654 55556666666777888888888844443


No 25 
>KOG4364 consensus Chromatin assembly factor-I [Chromatin structure and dynamics]
Probab=96.22  E-value=0.014  Score=70.85  Aligned_cols=7  Identities=29%  Similarity=-0.054  Sum_probs=3.6

Q ss_pred             CCCCCcc
Q 001172          499 EKPDVLE  505 (1133)
Q Consensus       499 ~k~d~~e  505 (1133)
                      .|+++.-
T Consensus       380 qk~~~k~  386 (811)
T KOG4364|consen  380 QKIDNKF  386 (811)
T ss_pred             ccccccc
Confidence            5555543


No 26 
>KOG4364 consensus Chromatin assembly factor-I [Chromatin structure and dynamics]
Probab=95.93  E-value=0.074  Score=64.96  Aligned_cols=14  Identities=14%  Similarity=-0.078  Sum_probs=7.3

Q ss_pred             ccCCCCCCCCCccc
Q 001172          493 DAQPLPEKPDVLED  506 (1133)
Q Consensus       493 ~~~~~~~k~d~~e~  506 (1133)
                      =.|.+-.|--+..+
T Consensus       378 ffqk~~~k~~~~~~  391 (811)
T KOG4364|consen  378 FFQKIDNKFSTTCE  391 (811)
T ss_pred             hhcccccccCCccc
Confidence            34566666534444


No 27 
>PTZ00121 MAEBL; Provisional
Probab=95.93  E-value=0.043  Score=71.23  Aligned_cols=6  Identities=33%  Similarity=1.165  Sum_probs=2.9

Q ss_pred             hhhhcc
Q 001172           30 LAEWRS   35 (1133)
Q Consensus        30 l~~~~~   35 (1133)
                      ++.|.|
T Consensus       575 ~~~~~~  580 (2084)
T PTZ00121        575 ISHWNS  580 (2084)
T ss_pred             cccccc
Confidence            445554


No 28 
>PTZ00266 NIMA-related protein kinase; Provisional
Probab=95.81  E-value=0.1  Score=67.69  Aligned_cols=7  Identities=43%  Similarity=1.132  Sum_probs=3.3

Q ss_pred             CCchhhh
Q 001172          976 FPHLDII  982 (1133)
Q Consensus       976 FPHLDII  982 (1133)
                      .||..-|
T Consensus       868 ~~~~~~~  874 (1021)
T PTZ00266        868 LPHMNAI  874 (1021)
T ss_pred             ccccchh
Confidence            4555444


No 29 
>PTZ00121 MAEBL; Provisional
Probab=95.57  E-value=0.077  Score=69.07  Aligned_cols=8  Identities=0%  Similarity=-0.106  Sum_probs=3.3

Q ss_pred             hhhhhhhc
Q 001172          272 KVVVKHFF  279 (1133)
Q Consensus       272 kV~~k~Ff  279 (1133)
                      .+.|+.|+
T Consensus       986 ~~~GkCyi  993 (2084)
T PTZ00121        986 IANGECYH  993 (2084)
T ss_pred             hhcCeeEE
Confidence            33444443


No 30 
>KOG0163 consensus Myosin class VI heavy chain [Cytoskeleton]
Probab=95.17  E-value=0.087  Score=65.02  Aligned_cols=32  Identities=34%  Similarity=0.258  Sum_probs=20.1

Q ss_pred             HHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHH
Q 001172          419 LIREEEAAWLAESEQKAKRGAAEKEKKAKKKL  450 (1133)
Q Consensus       419 Ll~E~E~e~~~~~e~~~kk~~k~kekk~~~k~  450 (1133)
                      -|+|+-|.||+.+|-++|++.++.|||+.+.+
T Consensus       928 k~qE~~E~ER~rrEaeek~rre~ee~k~~k~e  959 (1259)
T KOG0163|consen  928 KIQELAEAERKRREAEEKRRREEEEKKRAKAE  959 (1259)
T ss_pred             HHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHH
Confidence            35566666677777777777666666555444


No 31 
>PHA02790 Kelch-like protein; Provisional
Probab=94.63  E-value=0.01  Score=70.66  Aligned_cols=136  Identities=12%  Similarity=0.057  Sum_probs=87.0

Q ss_pred             hccccchhhhhhhhhhhcchhhh---hhhHHHHHHhhccccc---------ccccccccccccccccCCCCCCceEeecc
Q 001172          262 MSREKTDAILKVVVKHFFIEKEV---TSTLVMDSLYSGLKAL---------EGQSKSKKTKAKLLDAEDTPPPIVHVEND  329 (1133)
Q Consensus       262 m~~e~sDVilkV~~k~Ff~~keV---tSplvmd~L~~glk~l---------e~~t~~~k~~~~~~~t~e~p~~~~~v~~d  329 (1133)
                      |-++..||++ +.|+.|.+||.|   -||+|.+=+.++|+.=         +......+.++.++||.++.     +..+
T Consensus        18 ~~~~~~~~~~-~~~~~~~~HR~VLAa~S~YFraMF~~~~~Es~~~v~~~~~~v~~~~l~~lldy~YTg~l~-----it~~   91 (480)
T PHA02790         18 MTKKFKTIIE-AIGGNIIVNSTILKKLSPYFRTHLRQKYTKNKDPVTRVCLDLDIHSLTSIVIYSYTGKVY-----IDSH   91 (480)
T ss_pred             hhhhhceEEE-EcCcEEeeehhhhhhcCHHHHHHhcCCccccccceEEEecCcCHHHHHHHHHhheeeeEE-----Eecc
Confidence            3344466665 556699999988   5577765444567411         11111255567788987753     2222


Q ss_pred             eeeehhhHHHHHHHhhcCCCCCCCCCCCccccccCCCCCCcccchhH-----HHHHHhhHHhhhHHHHHHHhhhhhHHHH
Q 001172          330 MFVLVDDVLLLLERAALEPLPPKDEKGPQNRTKESNSGEDFNKDSIE-----RDERRLTELGRRTVEIFVLAHIFSNKIE  404 (1133)
Q Consensus       330 ~~~l~~Dll~~leR~al~~Lp~~d~~~~q~~~~~~~~~~dvs~dtve-----~ee~~~~Elgrr~~~ifa~~k~F~~rv~  404 (1133)
                         =++|||.++....++.+..   .|.+.+.+.      ++.++--     -+.|.|.+|.+++.+|..  ++|.   +
T Consensus        92 ---nV~~ll~aA~~Lqi~~v~~---~C~~fL~~~------l~~~NCl~i~~~A~~y~~~~L~~~a~~fi~--~nF~---~  154 (480)
T PHA02790         92 ---NVVNLLRASILTSVEFIIY---TCINFILRD------FRKEYCVECYMMGIEYGLSNLLCHTKDFIA--KHFL---E  154 (480)
T ss_pred             ---cHHHHHHHHHHhChHHHHH---HHHHHHHhh------CCcchHHHHHHHHHHhCHHHHHHHHHHHHH--HhHH---H
Confidence               2789999999999988765   588888887      6666622     289999999999887765  6664   2


Q ss_pred             HHHH--HHHHHHHHHHHH
Q 001172          405 VAYQ--EAVALKRQEELI  420 (1133)
Q Consensus       405 ~Ay~--E~~A~~rQ~~Ll  420 (1133)
                      ++-+  |.+..+-..+||
T Consensus       155 v~~~~~~ef~~L~~~~lL  172 (480)
T PHA02790        155 LEDDIIDNFDYLSMKLIL  172 (480)
T ss_pred             HhcccchhhhhCCHHHhc
Confidence            2222  445444344555


No 32 
>PTZ00266 NIMA-related protein kinase; Provisional
Probab=94.15  E-value=0.19  Score=65.32  Aligned_cols=14  Identities=21%  Similarity=0.508  Sum_probs=7.4

Q ss_pred             ccCCcccccCCchh
Q 001172          967 QTQGVLVDEFPHLD  980 (1133)
Q Consensus       967 Q~qg~v~dEFPHLD  980 (1133)
                      |.-|-+.--|-||.
T Consensus       893 ~~~~~~~~~~~~~~  906 (1021)
T PTZ00266        893 KKGGEVDRSYKHLE  906 (1021)
T ss_pred             ccCcccchhhhhhh
Confidence            34445555566654


No 33 
>PRK00247 putative inner membrane protein translocase component YidC; Validated
Probab=94.00  E-value=0.32  Score=57.90  Aligned_cols=12  Identities=25%  Similarity=0.808  Sum_probs=5.6

Q ss_pred             eecceeeehhhH
Q 001172          326 VENDMFVLVDDV  337 (1133)
Q Consensus       326 v~~d~~~l~~Dl  337 (1133)
                      +.++.|++.+-+
T Consensus       256 v~snlwtl~Qq~  267 (429)
T PRK00247        256 VANNLWTLIQNI  267 (429)
T ss_pred             HHhhHHHHHHHH
Confidence            445555544433


No 34 
>KOG2412 consensus Nuclear-export-signal (NES)-containing protein/polyadenylated-RNA export factor [RNA processing and modification]
Probab=93.21  E-value=0.64  Score=56.29  Aligned_cols=15  Identities=13%  Similarity=0.014  Sum_probs=7.5

Q ss_pred             hhhHHHHHHhhcccc
Q 001172          285 TSTLVMDSLYSGLKA  299 (1133)
Q Consensus       285 tSplvmd~L~~glk~  299 (1133)
                      .++++|+.+-..++.
T Consensus        65 ~~~~~e~e~~~~~~~   79 (591)
T KOG2412|consen   65 VMHVSEDEMESDEGE   79 (591)
T ss_pred             cchhHHHHHHhcccc
Confidence            345555555444433


No 35 
>KOG1029 consensus Endocytic adaptor protein intersectin [Signal transduction mechanisms; Intracellular trafficking, secretion, and vesicular transport]
Probab=92.91  E-value=0.37  Score=59.98  Aligned_cols=9  Identities=44%  Similarity=1.095  Sum_probs=6.5

Q ss_pred             CCCCccccc
Q 001172          834 GYIPQSYRN  842 (1133)
Q Consensus       834 sy~pqsYrN  842 (1133)
                      .+.|-||=.
T Consensus       857 GwFPksYVk  865 (1118)
T KOG1029|consen  857 GWFPKSYVK  865 (1118)
T ss_pred             CcCcHHhhh
Confidence            477888865


No 36 
>PF05262 Borrelia_P83:  Borrelia P83/100 protein;  InterPro: IPR007926 This family consists of several Borrelia P83/P100 antigen proteins.
Probab=92.40  E-value=0.85  Score=55.28  Aligned_cols=17  Identities=12%  Similarity=0.180  Sum_probs=11.7

Q ss_pred             hhHHHhhHHHHHhhhcc
Q 001172          227 VEERRGKLGRLIEDKAR  243 (1133)
Q Consensus       227 ved~~~hLG~LLqS~~~  243 (1133)
                      |..+++=|..-|...++
T Consensus        81 I~nlrrIiagyl~~ayg   97 (489)
T PF05262_consen   81 INNLRRIIAGYLEAAYG   97 (489)
T ss_pred             HHHHHHHHHHHHHHhcC
Confidence            45566677777777765


No 37 
>KOG4350 consensus Uncharacterized conserved protein, contains BTB/POZ domain [General function prediction only]
Probab=92.37  E-value=0.15  Score=59.69  Aligned_cols=79  Identities=22%  Similarity=0.250  Sum_probs=58.9

Q ss_pred             cchhhhhhhhhhhcchhh---hhhhHHHHHHhhcccc-------cccc-cccccccccccccCCCCCCceEeecceeeeh
Q 001172          266 KTDAILKVVVKHFFIEKE---VTSTLVMDSLYSGLKA-------LEGQ-SKSKKTKAKLLDAEDTPPPIVHVENDMFVLV  334 (1133)
Q Consensus       266 ~sDVilkV~~k~Ff~~ke---VtSplvmd~L~~glk~-------le~~-t~~~k~~~~~~~t~e~p~~~~~v~~d~~~l~  334 (1133)
                      -+||.|.|..++|.+|+.   +||.+|-+-||+||+.       |+.. .-.|+++++.|||..+--..  ++-|   ..
T Consensus        44 y~DVtfvve~~rfpAHRvILAaRs~yFRAlLYgGm~Es~q~~ipLq~t~~eAF~~lLrYiYtg~~~l~~--~~ed---~l  118 (620)
T KOG4350|consen   44 YSDVTFVVEDTRFPAHRVILAARSSYFRALLYGGMQESHQQLIPLQETNSEAFRALLRYIYTGKIDLAG--VEED---IL  118 (620)
T ss_pred             ccceEEEEeccccchhhhhHHHHHHHHHHHHhhhhhhhhhcccccccccHHHHHHHHHHHhhcceeccc--chHH---HH
Confidence            399999999999999994   4999999999999974       2222 33589999999987653111  2222   23


Q ss_pred             hhHHHHHHHhhcCCC
Q 001172          335 DDVLLLLERAALEPL  349 (1133)
Q Consensus       335 ~Dll~~leR~al~~L  349 (1133)
                      =|+|.|+.||++.-|
T Consensus       119 ld~LslAh~Ygf~~L  133 (620)
T KOG4350|consen  119 LDYLSLAHRYGFIQL  133 (620)
T ss_pred             HHHHHHHHhcCcHHH
Confidence            478889999987765


No 38 
>PF05262 Borrelia_P83:  Borrelia P83/100 protein;  InterPro: IPR007926 This family consists of several Borrelia P83/P100 antigen proteins.
Probab=92.36  E-value=0.55  Score=56.81  Aligned_cols=11  Identities=27%  Similarity=0.392  Sum_probs=6.5

Q ss_pred             EEEEEEEeCCC
Q 001172          133 QFTIAVVNRDP  143 (1133)
Q Consensus       133 qFtf~LLNQdG  143 (1133)
                      .-++.++|-.|
T Consensus         8 ~~~i~F~Ny~g   18 (489)
T PF05262_consen    8 NMTIEFINYSG   18 (489)
T ss_pred             cCceEEEecCC
Confidence            34566677654


No 39 
>KOG0163 consensus Myosin class VI heavy chain [Cytoskeleton]
Probab=92.27  E-value=1.6  Score=54.65  Aligned_cols=10  Identities=20%  Similarity=0.474  Sum_probs=5.7

Q ss_pred             hHHHHHhhhc
Q 001172          233 KLGRLIEDKA  242 (1133)
Q Consensus       233 hLG~LLqS~~  242 (1133)
                      +|+.||+...
T Consensus       649 QL~~LldKL~  658 (1259)
T KOG0163|consen  649 QLSELLDKLE  658 (1259)
T ss_pred             HHHHHHHHHH
Confidence            5666665443


No 40 
>KOG0345 consensus ATP-dependent RNA helicase [RNA processing and modification]
Probab=92.26  E-value=0.19  Score=59.93  Aligned_cols=16  Identities=31%  Similarity=0.212  Sum_probs=14.2

Q ss_pred             hHHHHHHHhhcCCCCC
Q 001172          336 DVLLLLERAALEPLPP  351 (1133)
Q Consensus       336 Dll~~leR~al~~Lp~  351 (1133)
                      |+-.++.-|+|-+||-
T Consensus       436 d~~~lA~~YgLl~lP~  451 (567)
T KOG0345|consen  436 DLGKLATLYGLLRLPK  451 (567)
T ss_pred             cHHHHHHHHHHHhCCC
Confidence            7778999999999996


No 41 
>KOG2072 consensus Translation initiation factor 3, subunit a (eIF-3a) [Translation, ribosomal structure and biogenesis]
Probab=91.29  E-value=1.8  Score=54.76  Aligned_cols=22  Identities=9%  Similarity=0.148  Sum_probs=10.9

Q ss_pred             HHhhhHHHHHHHhhhhhHHHHH
Q 001172          384 ELGRRTVEIFVLAHIFSNKIEV  405 (1133)
Q Consensus       384 Elgrr~~~ifa~~k~F~~rv~~  405 (1133)
                      ..+.++-+|.--...|+++|+.
T Consensus       743 k~k~~l~rm~~d~~~f~e~vk~  764 (988)
T KOG2072|consen  743 KDKKRLSRMYDDRDKFKEHVKG  764 (988)
T ss_pred             HHHHHHHHHhhhHHHHHHHHhh
Confidence            3455555555545555544443


No 42 
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=91.25  E-value=0.061  Score=65.87  Aligned_cols=137  Identities=17%  Similarity=0.129  Sum_probs=84.9

Q ss_pred             cccchhhhhhhhhhhcchhhh---hhhHHHHHHhhc-cccccccccc--------ccccccccccCCCCCCceEeeccee
Q 001172          264 REKTDAILKVVVKHFFIEKEV---TSTLVMDSLYSG-LKALEGQSKS--------KKTKAKLLDAEDTPPPIVHVENDMF  331 (1133)
Q Consensus       264 ~e~sDVilkV~~k~Ff~~keV---tSplvmd~L~~g-lk~le~~t~~--------~k~~~~~~~t~e~p~~~~~v~~d~~  331 (1133)
                      ++..||+|.|.++.|.+||.|   .||+|. ++|.+ |+.-....-.        ...++.++||..+.     +..+  
T Consensus        34 ~~lcDv~L~v~~~~~~aHR~VLAa~S~YFr-aMFt~~l~e~~~~~i~l~~v~~~~l~~ll~y~Yt~~i~-----i~~~--  105 (571)
T KOG4441|consen   34 GLLCDVTLLVGDREFPAHRVVLAACSPYFR-AMFTSGLKESKQKEINLEGVDPETLELLLDYAYTGKLE-----ISED--  105 (571)
T ss_pred             CCCceEEEEECCeeechHHHHHHhccHHHH-HHhcCCcccccceEEEEecCCHHHHHHHHHHhhcceEE-----echH--
Confidence            446999999999999999977   567654 46663 3221111001        22234566665532     1111  


Q ss_pred             eehhhHHHHHHHhhcCCCCCCCCCCCccccccCCCCCCcccchhH-----HHHHHhhHHhhhHHHHHHHhhhhhHHHHHH
Q 001172          332 VLVDDVLLLLERAALEPLPPKDEKGPQNRTKESNSGEDFNKDSIE-----RDERRLTELGRRTVEIFVLAHIFSNKIEVA  406 (1133)
Q Consensus       332 ~l~~Dll~~leR~al~~Lp~~d~~~~q~~~~~~~~~~dvs~dtve-----~ee~~~~Elgrr~~~ifa~~k~F~~rv~~A  406 (1133)
                       =+++||.++..+.+..+-.   .|..-+.+.      +..++..     .|.+.|.||... .+.|+ -++|.   ++.
T Consensus       106 -nVq~ll~aA~~lQi~~v~~---~C~~fL~~~------l~~~Nclgi~~~a~~~~~~~L~~~-a~~~i-~~~F~---~v~  170 (571)
T KOG4441|consen  106 -NVQELLEAASLLQIPEVVD---ACCEFLESQ------LDPSNCLGIRRFAELHSCTELLEV-ADEYI-LQHFA---EVS  170 (571)
T ss_pred             -hHHHHHHHHHHhhhHHHHH---HHHHHHHhc------CCHHHHHHHHHHHHhcCcHHHHHH-HHHHH-HHHHH---HHh
Confidence             2578888888877766543   576666665      4444432     278888877644 44455 45675   778


Q ss_pred             HHHHHHHHHHHHHHHHH
Q 001172          407 YQEAVALKRQEELIREE  423 (1133)
Q Consensus       407 y~E~~A~~rQ~~Ll~E~  423 (1133)
                      -+|.+-++-+++|+.=+
T Consensus       171 ~~eefl~L~~~~l~~ll  187 (571)
T KOG4441|consen  171 KTEEFLLLSLEELIGLL  187 (571)
T ss_pred             ccHHhhCCCHHHHHhhc
Confidence            88888888888887644


No 43 
>KOG2002 consensus TPR-containing nuclear phosphoprotein that regulates K(+) uptake [Inorganic ion transport and metabolism]
Probab=91.08  E-value=1.5  Score=56.27  Aligned_cols=15  Identities=20%  Similarity=0.096  Sum_probs=10.0

Q ss_pred             HhhHHhhhHHHHHHH
Q 001172          381 RLTELGRRTVEIFVL  395 (1133)
Q Consensus       381 ~~~Elgrr~~~ifa~  395 (1133)
                      ...|+++|.|+-+..
T Consensus       783 ~~le~a~r~F~~ls~  797 (1018)
T KOG2002|consen  783 KELEEARRLFTELSK  797 (1018)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            335778888886654


No 44 
>KOG2891 consensus Surface glycoprotein [General function prediction only]
Probab=91.05  E-value=2.3  Score=48.13  Aligned_cols=27  Identities=26%  Similarity=0.375  Sum_probs=16.6

Q ss_pred             HHHHHHHhhhhhhhHHHHH-Hhhhhhhh
Q 001172          449 KLAKQKRNNRKGKEKKREE-RSSMALSD  475 (1133)
Q Consensus       449 k~ak~~e~~rk~~e~~~ee-~~~~~~~~  475 (1133)
                      .+|+|+++++|++||...| ...+..+|
T Consensus       387 ieareerrkqkeeeklk~e~qkikelee  414 (445)
T KOG2891|consen  387 IEAREERRKQKEEEKLKAEEQKIKELEE  414 (445)
T ss_pred             HHHHHHHHhhhHHHHHHHHHHHHHHHHH
Confidence            3477777777877776544 44444444


No 45 
>KOG2072 consensus Translation initiation factor 3, subunit a (eIF-3a) [Translation, ribosomal structure and biogenesis]
Probab=90.45  E-value=2.7  Score=53.32  Aligned_cols=28  Identities=25%  Similarity=0.239  Sum_probs=17.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHH-HHHHHHHH
Q 001172          406 AYQEAVALKRQEELIREEEA-AWLAESEQ  433 (1133)
Q Consensus       406 Ay~E~~A~~rQ~~Ll~E~E~-e~~~~~e~  433 (1133)
                      .|.|.+--.||..+-+-+-+ +++++.||
T Consensus       757 ~f~e~vk~~rqs~~~e~~~~~ea~leaer  785 (988)
T KOG2072|consen  757 KFKEHVKGERQSEYEEKLKQFEARLEAER  785 (988)
T ss_pred             HHHHHHhhhhHHHHHHHHHHHHHHHHHHH
Confidence            37777777788777665533 44556555


No 46 
>KOG3054 consensus Uncharacterized conserved protein [Function unknown]
Probab=89.81  E-value=1.2  Score=49.68  Aligned_cols=38  Identities=29%  Similarity=0.191  Sum_probs=19.8

Q ss_pred             CCCCcCCCCcccCCCCccCCCCcCCCCccCCCcccccccccccC
Q 001172          531 SPVNWDTDASEVIPPTEASSSGVCNLSSVPNGVTEKRNASVMDD  574 (1133)
Q Consensus       531 s~v~w~~d~~e~~~~~~~~~~~~~~~~~~~n~~~~k~~~s~~~~  574 (1133)
                      --|+-+.=||+++=-|..+.|      -+||=+++.--+.+|||
T Consensus       213 KvV~ledLas~f~Lrtqd~in------riq~~l~eg~ltGVmDD  250 (299)
T KOG3054|consen  213 KVVPLEDLASEFGLRTQDSIN------RIQELLAEGLLTGVMDD  250 (299)
T ss_pred             CeeeHHHHHHHhCccHHHHHH------HHHHHHHhhhheeeecC
Confidence            345555556666665554443      23444444444566666


No 47 
>PRK00247 putative inner membrane protein translocase component YidC; Validated
Probab=89.70  E-value=1.4  Score=52.77  Aligned_cols=15  Identities=20%  Similarity=0.355  Sum_probs=6.7

Q ss_pred             hhhHHHHHHhhccccc
Q 001172          285 TSTLVMDSLYSGLKAL  300 (1133)
Q Consensus       285 tSplvmd~L~~glk~l  300 (1133)
                      ..|+|+. ||..+..+
T Consensus       112 QiPIfig-Ly~vir~m  126 (429)
T PRK00247        112 QIPVFLG-LYQVLLRM  126 (429)
T ss_pred             HHHHHHH-HHHHHHhc
Confidence            3454443 45444443


No 48 
>CHL00118 atpG ATP synthase CF0 B' subunit; Validated
Probab=89.64  E-value=3.3  Score=42.80  Aligned_cols=30  Identities=13%  Similarity=-0.011  Sum_probs=21.5

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q 001172          410 AVALKRQEELIREEEAAWLAESEQKAKRGA  439 (1133)
Q Consensus       410 ~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~  439 (1133)
                      ++...|++++...+++.++.++|-++.+.+
T Consensus        49 ~~l~~R~~~I~~~l~~Ae~~~~ea~~~~~e   78 (156)
T CHL00118         49 KVLDERKEYIRKNLTKASEILAKANELTKQ   78 (156)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            445678888888888888777766555554


No 49 
>KOG0297 consensus TNF receptor-associated factor [Signal transduction mechanisms]
Probab=89.43  E-value=0.23  Score=58.34  Aligned_cols=77  Identities=27%  Similarity=0.444  Sum_probs=62.9

Q ss_pred             ccccEEEEEEccccccC-------CCeeeCCceee--cCeEEEEEEEeCCCC--CCCeEEEEEEEecCCCC-CCCCeEEE
Q 001172           65 ELYGKYTWRIEKFSQIS-------KRELRSNAFEV--GGYKWYILIYPQGCD--VCNHLSLFLCVANHDKL-LPGWSHFA  132 (1133)
Q Consensus        65 ~~tGshtwkIdNFS~LK-------ge~I~Sp~FsV--GGhkWRI~~YPnG~~--~~dyLSLYLeL~s~~sl-~p~WsV~A  132 (1133)
                      ...|+..|+|.+|...+       ...+.|+.|+.  .||+-+.++|-+|+.  .+-++|+|+.+..+..+ ...|.++-
T Consensus       277 ~~~g~~iwki~~~~~~~~e~~~~~~~~~~S~~f~t~~~Gyk~~~~~~lng~g~~~~~~~s~~~~~~~ge~d~~l~wpf~~  356 (391)
T KOG0297|consen  277 SYDGTLIWKIPDYGRKKQEAVAGATLSLFSPAFYTSKYGYKLCARIYLNGDGTGKGTHLSLYFVVMRGEYDALLPWPFRQ  356 (391)
T ss_pred             ccCCEEEEEecchhhhhHHHHhccCccccccccccccccHHHHhHhhhcCCCCCCcceeeeeeeecccCcccccccCCCC
Confidence            45799999999996544       24688999986  799999999988865  36799999999887654 34699999


Q ss_pred             EEEEEEEeC
Q 001172          133 QFTIAVVNR  141 (1133)
Q Consensus       133 qFtf~LLNQ  141 (1133)
                      ..+|.|+++
T Consensus       357 ~v~~~l~dq  365 (391)
T KOG0297|consen  357 KVTLMLLDQ  365 (391)
T ss_pred             ceEEEEecc
Confidence            999999999


No 50 
>TIGR03321 alt_F1F0_F0_B alternate F1F0 ATPase, F0 subunit B. CC and in principle may run in either direction. This model represents the F0 subunit B of this apparent second ATP synthase.
Probab=89.35  E-value=3  Score=46.13  Aligned_cols=50  Identities=12%  Similarity=0.143  Sum_probs=27.6

Q ss_pred             hhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 001172          387 RRTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKR  437 (1133)
Q Consensus       387 rr~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk  437 (1133)
                      -.+|.|++|--++. ++.---=-++...|++++...+++.++.++|-++.+
T Consensus        10 ~qiInFlil~~lL~-kfl~kPi~~~l~~R~~~I~~~l~~Ae~~~~eA~~~~   59 (246)
T TIGR03321        10 AQLINFLILVWLLK-RFLYRPILDAMDAREKKIAGELADADTKKREAEQER   59 (246)
T ss_pred             HHHHHHHHHHHHHH-HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34566666544443 222222234556778888888877776665544333


No 51 
>PRK13428 F0F1 ATP synthase subunit delta; Provisional
Probab=89.11  E-value=2.5  Score=50.79  Aligned_cols=49  Identities=10%  Similarity=0.096  Sum_probs=28.5

Q ss_pred             hHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 001172          388 RTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKR  437 (1133)
Q Consensus       388 r~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk  437 (1133)
                      .+|.|+++--++. ++..----++..+||+++..++++.+..+++-++.+
T Consensus         7 qlInFlIl~~lL~-kfl~~Pi~~~l~~R~~~I~~~L~eAe~a~~ea~~~~   55 (445)
T PRK13428          7 QLIGFAVIVFLVW-RFVVPPVRRLMAARQDTVRQQLAESATAADRLAEAD   55 (445)
T ss_pred             HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3455555444443 222222345567889999999988876655544433


No 52 
>PF07946 DUF1682:  Protein of unknown function (DUF1682);  InterPro: IPR012879 The members of this family are all hypothetical eukaryotic proteins of unknown function. One member (Q920S6 from SWISSPROT) is described as being an adipocyte-specific protein, but no evidence of this was found. 
Probab=88.95  E-value=1.5  Score=50.45  Aligned_cols=8  Identities=25%  Similarity=0.376  Sum_probs=5.9

Q ss_pred             CCEEEEEE
Q 001172          180 GDTLIIKA  187 (1133)
Q Consensus       180 DDsLTIeC  187 (1133)
                      .|.++|.+
T Consensus       110 ~D~v~i~i  117 (321)
T PF07946_consen  110 KDRVTIEI  117 (321)
T ss_pred             CCeEEEEE
Confidence            67777777


No 53 
>PF00651 BTB:  BTB/POZ domain;  InterPro: IPR013069 The BTB (for BR-C, ttk and bab) [] or POZ (for Pox virus and Zinc finger) [] domain is present near the N terminus of a fraction of zinc finger (IPR007087 from INTERPRO) proteins and in proteins that contain the IPR006652 from INTERPRO motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation []. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule []. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT [, , ]. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.; GO: 0005515 protein binding; PDB: 3M5B_A 1R28_B 3LBZ_A 3E4U_F 3BIM_B 1R2B_A 1R29_A 2VPK_A 2YY9_B 3GA1_A ....
Probab=88.67  E-value=0.048  Score=50.82  Aligned_cols=32  Identities=22%  Similarity=0.363  Sum_probs=22.9

Q ss_pred             ccchhhhhhh-hhhhcchhhh---hhhHHHHHHhhc
Q 001172          265 EKTDAILKVV-VKHFFIEKEV---TSTLVMDSLYSG  296 (1133)
Q Consensus       265 e~sDVilkV~-~k~Ff~~keV---tSplvmd~L~~g  296 (1133)
                      +.+|+++++. +++|++||.+   +||+|.+-+...
T Consensus         9 ~~~D~~i~v~d~~~~~vhk~iL~~~S~~F~~~~~~~   44 (111)
T PF00651_consen    9 EFSDVTIRVGDGKTFYVHKNILAARSPYFRNLFEGS   44 (111)
T ss_dssp             TS--EEEEETTTEEEEE-HHHHHHHBHHHHHHHTTT
T ss_pred             CCCCEEEEECCCEEEeechhhhhccchhhhhccccc
Confidence            4699999999 8999999977   778776655444


No 54 
>KOG2002 consensus TPR-containing nuclear phosphoprotein that regulates K(+) uptake [Inorganic ion transport and metabolism]
Probab=88.36  E-value=7.1  Score=50.57  Aligned_cols=9  Identities=22%  Similarity=0.076  Sum_probs=4.8

Q ss_pred             eeecCCchH
Q 001172          212 VRVYLTNVE  220 (1133)
Q Consensus       212 I~VPpSNLg  220 (1133)
                      +.+-|+|+.
T Consensus       639 L~~dpkN~y  647 (1018)
T KOG2002|consen  639 LRNDPKNMY  647 (1018)
T ss_pred             HhcCcchhh
Confidence            445566653


No 55 
>PF07946 DUF1682:  Protein of unknown function (DUF1682);  InterPro: IPR012879 The members of this family are all hypothetical eukaryotic proteins of unknown function. One member (Q920S6 from SWISSPROT) is described as being an adipocyte-specific protein, but no evidence of this was found. 
Probab=88.35  E-value=1  Score=51.65  Aligned_cols=15  Identities=13%  Similarity=0.082  Sum_probs=6.8

Q ss_pred             HHhhHHhhhHHHHHH
Q 001172          380 RRLTELGRRTVEIFV  394 (1133)
Q Consensus       380 ~~~~Elgrr~~~ifa  394 (1133)
                      ..+..+.-.+++.++
T Consensus       234 ~~l~~~v~~l~D~~~  248 (321)
T PF07946_consen  234 EPLLKLVFYLIDKLA  248 (321)
T ss_pred             HHHHHHHHHHHHHhh
Confidence            344444444444444


No 56 
>PRK14474 F0F1 ATP synthase subunit B; Provisional
Probab=87.67  E-value=4  Score=45.59  Aligned_cols=44  Identities=18%  Similarity=0.281  Sum_probs=24.7

Q ss_pred             hhHHHHHHHhhhhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          387 RRTVEIFVLAHIFSNKIEVAYQ--EAVALKRQEELIREEEAAWLAESEQ  433 (1133)
Q Consensus       387 rr~~~ifa~~k~F~~rv~~Ay~--E~~A~~rQ~~Ll~E~E~e~~~~~e~  433 (1133)
                      -.+|.|++|--+++   +..|+  =++-.+|++++..++++++..++|-
T Consensus        10 ~qiInFlILv~lL~---~fl~kPi~~~l~eR~~~I~~~l~~Ae~~~~eA   55 (250)
T PRK14474         10 AQIINFLILVYLLR---RFLYKPIIQVMKKRQQRIANRWQDAEQRQQEA   55 (250)
T ss_pred             HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34555555544443   22322  2345677888888887766655443


No 57 
>PRK14472 F0F1 ATP synthase subunit B; Provisional
Probab=87.52  E-value=5.3  Score=41.94  Aligned_cols=54  Identities=17%  Similarity=0.104  Sum_probs=31.4

Q ss_pred             HhhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q 001172          385 LGRRTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKRGA  439 (1133)
Q Consensus       385 lgrr~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~  439 (1133)
                      .-..+|.|++|--++. ++.---=-++-..|++++...+++.+...++-++.+.+
T Consensus        21 ~~~~~i~Flil~~lL~-~~l~kpi~~~l~~R~~~I~~~l~~Ae~~~~eA~~~~~e   74 (175)
T PRK14472         21 IFWTAVTFVIVLLILK-KIAWGPILSALEEREKGIQSSIDRAHSAKDEAEAILRK   74 (175)
T ss_pred             HHHHHHHHHHHHHHHH-HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3455666666544443 22222234455778888888888877766654444433


No 58 
>PRK14471 F0F1 ATP synthase subunit B; Provisional
Probab=87.50  E-value=5.4  Score=41.32  Aligned_cols=29  Identities=14%  Similarity=0.121  Sum_probs=19.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 001172          410 AVALKRQEELIREEEAAWLAESEQKAKRG  438 (1133)
Q Consensus       410 ~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~  438 (1133)
                      ++-.+|+..+-.++++.+..+++-++.+.
T Consensus        35 ~~l~~R~~~I~~~l~~A~~~~~ea~~~~~   63 (164)
T PRK14471         35 GAVKEREDSIKNALASAEEARKEMQNLQA   63 (164)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            44567788888888877766665544443


No 59 
>PRK13453 F0F1 ATP synthase subunit B; Provisional
Probab=87.26  E-value=5.6  Score=41.84  Aligned_cols=52  Identities=17%  Similarity=0.139  Sum_probs=28.4

Q ss_pred             hhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 001172          386 GRRTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKRG  438 (1133)
Q Consensus       386 grr~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~  438 (1133)
                      ...+|.|++|--++. ++.---=-++-..|++.+..++++.+....+-++.+.
T Consensus        22 ~~~iInFliL~~lL~-~~l~~pi~~~l~~R~~~I~~~l~~Ae~~~~eA~~~~~   73 (173)
T PRK13453         22 IVTVLTFIVLLALLK-KFAWGPLKDVMDKRERDINRDIDDAEQAKLNAQKLEE   73 (173)
T ss_pred             HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344555555444442 2222222344567888888888887776555444443


No 60 
>PRK09174 F0F1 ATP synthase subunit B'; Validated
Probab=87.26  E-value=4.7  Score=43.84  Aligned_cols=55  Identities=13%  Similarity=0.129  Sum_probs=34.4

Q ss_pred             HHhhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q 001172          384 ELGRRTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKRGA  439 (1133)
Q Consensus       384 Elgrr~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~  439 (1133)
                      .+-.-+|.|++|--++. |+..=-=-++..+||+++...+++.++.+.|-++.+.+
T Consensus        55 ~l~w~~I~FliL~~lL~-k~~~~pI~~vLe~R~~~I~~~L~~Ae~~k~eAe~~~~~  109 (204)
T PRK09174         55 QLLWLAITFGLFYLFMS-RVILPRIGGIIETRRDRIAQDLDQAARLKQEADAAVAA  109 (204)
T ss_pred             HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34556666666544443 32222234566889999999998888877765555544


No 61 
>PRK06231 F0F1 ATP synthase subunit B; Validated
Probab=87.08  E-value=5.6  Score=43.19  Aligned_cols=51  Identities=12%  Similarity=-0.050  Sum_probs=30.1

Q ss_pred             HhhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          385 LGRRTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAK  436 (1133)
Q Consensus       385 lgrr~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~k  436 (1133)
                      +--.+|.|+++--+|. ++.-----++...|++.+..++++.+...+|-++.
T Consensus        51 ~i~qlInFlIlv~lL~-k~l~kPi~~~L~~R~~~I~~~L~~Ae~~~~eA~~~  101 (205)
T PRK06231         51 FIAHLIAFSILLLLGI-FLFWKPTQRFLNKRKELIEAEINQANELKQQAQQL  101 (205)
T ss_pred             HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4455666666655554 22222234556778888888887776665554333


No 62 
>PRK13454 F0F1 ATP synthase subunit B'; Provisional
Probab=86.97  E-value=5  Score=42.61  Aligned_cols=54  Identities=13%  Similarity=0.129  Sum_probs=28.6

Q ss_pred             HHhhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 001172          384 ELGRRTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKRG  438 (1133)
Q Consensus       384 Elgrr~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~  438 (1133)
                      .+-.-+|-||+|--++. |+..-==-++..+|+.++-..+++.+..+.+-+..+.
T Consensus        33 q~~~~lI~F~iL~~ll~-k~l~~PI~~~l~~R~~~I~~~l~~Ae~~~~eA~~~~~   86 (181)
T PRK13454         33 QIFWLLVTLVAIYFVLT-RVALPRIGAVLAERQGTITNDLAAAEELKQKAVEAEK   86 (181)
T ss_pred             HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH
Confidence            34445666666444443 2222222455677788887777776666554443333


No 63 
>PRK08475 F0F1 ATP synthase subunit B; Validated
Probab=86.68  E-value=5.8  Score=41.64  Aligned_cols=52  Identities=17%  Similarity=0.178  Sum_probs=31.3

Q ss_pred             HhhhHHHHHHHhhhhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q 001172          385 LGRRTVEIFVLAHIFSNKIEVAYQ--EAVALKRQEELIREEEAAWLAESEQKAKRGA  439 (1133)
Q Consensus       385 lgrr~~~ifa~~k~F~~rv~~Ay~--E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~  439 (1133)
                      +-..+|.|+++--++.   +.+|+  -++-.+|++++..++++.+....+-+..+.+
T Consensus        25 ~~~~~inflil~~lL~---~fl~kPi~~~l~~R~~~I~~~l~~Ae~~~~ea~~~~~e   78 (167)
T PRK08475         25 IIERTINFLIFVGILW---YFAAKPLKNFYKSRINKISKRLEEIQEKLKESKEKKED   78 (167)
T ss_pred             HHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3344677777655554   23332  3456788888888888777665554444433


No 64 
>PRK07352 F0F1 ATP synthase subunit B; Validated
Probab=86.32  E-value=7  Score=41.00  Aligned_cols=49  Identities=6%  Similarity=0.095  Sum_probs=29.5

Q ss_pred             hHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 001172          388 RTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKR  437 (1133)
Q Consensus       388 r~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk  437 (1133)
                      .+|.|+++--+|. ++.--.=-++...||+++...+++.+..+++-++..
T Consensus        25 ~iinflIl~~lL~-~fl~kpI~~~l~~R~~~I~~~l~~A~~~~~ea~~~~   73 (174)
T PRK07352         25 NLINLAIVIGLLY-YFGRGFLGKILEERREAILQALKEAEERLRQAAQAL   73 (174)
T ss_pred             HHHHHHHHHHHHH-HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4566555544443 222223356678889999998888776655544333


No 65 
>KOG3809 consensus Microtubule-binding protein MIP-T3 [Cytoskeleton]
Probab=86.22  E-value=2.6  Score=50.15  Aligned_cols=12  Identities=8%  Similarity=0.111  Sum_probs=8.5

Q ss_pred             hHHhhhHHHHHH
Q 001172          383 TELGRRTVEIFV  394 (1133)
Q Consensus       383 ~Elgrr~~~ifa  394 (1133)
                      +|+-..+||+++
T Consensus       101 pE~TNelLQ~~g  112 (583)
T KOG3809|consen  101 PEETNELLQMLG  112 (583)
T ss_pred             HHHHHHHHHHHH
Confidence            566777777776


No 66 
>PRK13455 F0F1 ATP synthase subunit B; Provisional
Probab=85.77  E-value=5.4  Score=42.21  Aligned_cols=48  Identities=19%  Similarity=0.159  Sum_probs=30.5

Q ss_pred             HHHHHHHhhhhhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q 001172          389 TVEIFVLAHIFSNKIEVAYQ---EAVALKRQEELIREEEAAWLAESEQKAKRGA  439 (1133)
Q Consensus       389 ~~~ifa~~k~F~~rv~~Ay~---E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~  439 (1133)
                      +|.|+++.-++.   ..+|.   -++-..||+++..++++.++.++|-+++..+
T Consensus        33 ~inflil~~iL~---~f~~~~~v~~~L~~R~~~I~~~l~~Ae~~~~eA~~~l~e   83 (184)
T PRK13455         33 TLAFLLFIGILV---YFKVPGMIGGMLDKRAEGIRSELEEARALREEAQTLLAS   83 (184)
T ss_pred             HHHHHHHHHHHH---HHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            466666555553   22321   4567889999999998887776665544443


No 67 
>PRK07353 F0F1 ATP synthase subunit B'; Validated
Probab=85.61  E-value=8.7  Score=38.56  Aligned_cols=31  Identities=13%  Similarity=0.058  Sum_probs=20.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q 001172          409 EAVALKRQEELIREEEAAWLAESEQKAKRGA  439 (1133)
Q Consensus       409 E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~  439 (1133)
                      -++-..|+.++...+++.+....+-++...+
T Consensus        31 ~~~l~~R~~~I~~~l~~Ae~~~~ea~~~~~~   61 (140)
T PRK07353         31 GKVVEEREDYIRTNRAEAKERLAEAEKLEAQ   61 (140)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3455678888888887777666655444443


No 68 
>PRK13460 F0F1 ATP synthase subunit B; Provisional
Probab=85.11  E-value=9.6  Score=40.01  Aligned_cols=53  Identities=13%  Similarity=0.009  Sum_probs=28.9

Q ss_pred             HhhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 001172          385 LGRRTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKRG  438 (1133)
Q Consensus       385 lgrr~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~  438 (1133)
                      ....+|.|++|--++. ++..--=-++-..|++++..++++.+....|-++.+.
T Consensus        19 ~~~~~i~Flil~~iL~-~~~~kpi~~~l~~R~~~I~~~l~~Ae~~~~eA~~~~~   71 (173)
T PRK13460         19 VVWTLVTFLVVVLVLK-KFAWDVILKALDERASGVQNDINKASELRLEAEALLK   71 (173)
T ss_pred             HHHHHHHHHHHHHHHH-HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3445666555444442 2222222445567888888888777766555444333


No 69 
>PRK14475 F0F1 ATP synthase subunit B; Provisional
Probab=84.78  E-value=8.4  Score=40.29  Aligned_cols=31  Identities=13%  Similarity=-0.018  Sum_probs=23.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q 001172          409 EAVALKRQEELIREEEAAWLAESEQKAKRGA  439 (1133)
Q Consensus       409 E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~  439 (1133)
                      -++-..|++++..+++++++.++|-++.+.+
T Consensus        36 ~~~le~R~~~I~~~l~~Ae~~k~eAe~~~~~   66 (167)
T PRK14475         36 AGALDAYAAKIQAELDEAQRLREEAQALLAD   66 (167)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            5667889999999999988887776555544


No 70 
>PRK06569 F0F1 ATP synthase subunit B'; Validated
Probab=84.77  E-value=14  Score=39.03  Aligned_cols=58  Identities=16%  Similarity=-0.024  Sum_probs=37.9

Q ss_pred             HhhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHH
Q 001172          385 LGRRTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKRGAAEKE  443 (1133)
Q Consensus       385 lgrr~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~k~ke  443 (1133)
                      +=.-+|.|++|--+|+ |+..=-=.++...||.++-..++++++.+.|-++.+++-+++
T Consensus        13 ifw~iI~FlILy~ll~-kf~~ppI~~iLe~R~~~I~~~L~~Ae~~k~eAe~l~a~ye~~   70 (155)
T PRK06569         13 IFWLIVTFGLLYIFVY-KFITPKAEEIFNNRQTNIQDNITQADTLTIEVEKLNKYYNEE   70 (155)
T ss_pred             HHHHHHHHHHHHHHHH-HHhHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHH
Confidence            3445566665444443 333333466778899999999999988888776666653433


No 71 
>CHL00019 atpF ATP synthase CF0 B subunit
Probab=84.61  E-value=8.9  Score=40.65  Aligned_cols=50  Identities=8%  Similarity=0.048  Sum_probs=28.0

Q ss_pred             hHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 001172          388 RTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKRG  438 (1133)
Q Consensus       388 r~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~  438 (1133)
                      .+|.|++|--++. ++.--.=-++-.+||..+...+++.++.+.+-+.+..
T Consensus        30 ~~Inflill~lL~-~fl~kPI~~~l~~R~~~I~~~l~~Ae~~~~eA~~~~~   79 (184)
T CHL00019         30 NLINLSVVLGVLI-YFGKGVLSDLLDNRKQTILNTIRNSEERREEAIEKLE   79 (184)
T ss_pred             HHHHHHHHHHHHH-HHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3556555444443 2222223445577888888888877766555444333


No 72 
>PF06098 Radial_spoke_3:  Radial spoke protein 3;  InterPro: IPR009290 This family consists of several radial spoke protein 3 (RSP3) sequences. Eukaryotic cilia and flagella present in diverse types of cells perform motile, sensory, and developmental functions in organisms from protists to humans. They are centred by precisely organised, microtubule-based structures, the axonemes. The axoneme consists of two central singlet microtubules, called the central pair, and nine outer doublet microtubules. These structures are well conserved during evolution. The outer doublet microtubules, each composed of A and B sub-fibres, are connected to each other by nexin links, while the central pair is held at the centre of the axoneme by radial spokes. The radial spokes are T-shaped structures extending from the A-tubule of each outer doublet microtubule to the centre of the axoneme. Radial spoke protein 3 (RSP3), is present at the proximal end of the spoke stalk and helps in anchoring the radial spoke to the outer doublet. It is thought that radial spokes regulate the activity of inner arm dynein through protein phosphorylation and dephosphorylation [].
Probab=84.35  E-value=11  Score=43.41  Aligned_cols=47  Identities=26%  Similarity=0.349  Sum_probs=31.5

Q ss_pred             hhhHHHHHHHhhhhhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHH
Q 001172          386 GRRTVEIFVLAHIFSNKIEVAYQEA--VALKRQEELIREEEAAWLAESEQ  433 (1133)
Q Consensus       386 grr~~~ifa~~k~F~~rv~~Ay~E~--~A~~rQ~~Ll~E~E~e~~~~~e~  433 (1133)
                      =.=+|++.| .|.++|-++..+.|-  -++.+|.+..+++..+++.+-+|
T Consensus       130 V~PiLeVLV-gKtlEQAl~EV~EEeEL~~lr~~q~~fe~~R~aEl~e~qr  178 (291)
T PF06098_consen  130 VKPILEVLV-GKTLEQALMEVMEEEELAALRRQQRAFEELRNAELAEVQR  178 (291)
T ss_pred             hhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            336778888 888887777776654  35566666777776666666554


No 73 
>PRK14473 F0F1 ATP synthase subunit B; Provisional
Probab=84.06  E-value=9.4  Score=39.59  Aligned_cols=29  Identities=7%  Similarity=-0.004  Sum_probs=18.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 001172          410 AVALKRQEELIREEEAAWLAESEQKAKRG  438 (1133)
Q Consensus       410 ~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~  438 (1133)
                      ++-..|+.++...+++.+....+-++.+.
T Consensus        35 ~~l~~R~~~I~~~l~~Ae~~~~ea~~~~~   63 (164)
T PRK14473         35 NLLNERTRRIEESLRDAEKVREQLANAKR   63 (164)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            44567788888888877766655444433


No 74 
>KOG3054 consensus Uncharacterized conserved protein [Function unknown]
Probab=84.02  E-value=4  Score=45.63  Aligned_cols=14  Identities=29%  Similarity=0.174  Sum_probs=6.5

Q ss_pred             HHHHHHHHHHHHHH
Q 001172          416 QEELIREEEAAWLA  429 (1133)
Q Consensus       416 Q~~Ll~E~E~e~~~  429 (1133)
                      +.++.+|-|+.+|.
T Consensus       116 err~qRe~E~~eRE  129 (299)
T KOG3054|consen  116 ERRAQREAEEAERE  129 (299)
T ss_pred             HHHHHHHHHHHHHH
Confidence            34445555544443


No 75 
>PRK09510 tolA cell envelope integrity inner membrane protein TolA; Provisional
Probab=83.09  E-value=6.8  Score=46.57  Aligned_cols=9  Identities=33%  Similarity=0.542  Sum_probs=4.1

Q ss_pred             cccCCCCCC
Q 001172          507 VSDVSDSVD  515 (1133)
Q Consensus       507 ~sd~sd~~d  515 (1133)
                      +.|+-..++
T Consensus       259 ~dd~~~gl~  267 (387)
T PRK09510        259 VDDLFGGLD  267 (387)
T ss_pred             HHHHhhccc
Confidence            444444444


No 76 
>KOG2891 consensus Surface glycoprotein [General function prediction only]
Probab=82.57  E-value=5.3  Score=45.32  Aligned_cols=19  Identities=32%  Similarity=0.700  Sum_probs=9.6

Q ss_pred             ecccCCCCcccccceeccccc
Q 001172          154 RFWKKEHDWGWKKFMELSKVS  174 (1133)
Q Consensus       154 ~FsskesdWGfskFIkrsdLE  174 (1133)
                      -|.. .++|- .-||...++.
T Consensus       123 dfps-rhdwd-d~fm~~kdmd  141 (445)
T KOG2891|consen  123 DFPS-RHDWD-DFFMDAKDMD  141 (445)
T ss_pred             CCCc-ccchH-HHHhhhhhhh
Confidence            3443 36663 2366666554


No 77 
>PRK08476 F0F1 ATP synthase subunit B'; Validated
Probab=82.51  E-value=14  Score=37.92  Aligned_cols=30  Identities=10%  Similarity=-0.016  Sum_probs=18.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 001172          409 EAVALKRQEELIREEEAAWLAESEQKAKRG  438 (1133)
Q Consensus       409 E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~  438 (1133)
                      -++...|+++.-..+++.++.+.+-++.+.
T Consensus        33 ~~~l~~R~~~I~~~l~~A~~~~~ea~~~~~   62 (141)
T PRK08476         33 LKFMDNRNASIKNDLEKVKTNSSDVSEIEH   62 (141)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            345566777777777776666555444433


No 78 
>COG3064 TolA Membrane protein involved in colicin uptake [Cell envelope biogenesis, outer membrane]
Probab=82.23  E-value=4.6  Score=46.56  Aligned_cols=11  Identities=18%  Similarity=0.278  Sum_probs=4.6

Q ss_pred             HHHHHHHHHHH
Q 001172          417 EELIREEEAAW  427 (1133)
Q Consensus       417 ~~Ll~E~E~e~  427 (1133)
                      ++.|++||.+.
T Consensus       100 qErlkQle~er  110 (387)
T COG3064         100 QERLKQLEKER  110 (387)
T ss_pred             HHHHHHHHHHH
Confidence            34444444433


No 79 
>PF09727 CortBP2:  Cortactin-binding protein-2;  InterPro: IPR019131  This entry represents a N-terminal domain found in cortactin-binding protein 2 and in filamin A interacting protein 1 (Filip1). In addition to being a positional candidate for autism, cortactin-binding protein 2 is expressed at highest levels in the brain in humans. Towards the C-terminal end of this protein are a series of proline-rich regions which are likely to be the points of interaction with the SH3 domain of cortactin. The human protein has six associated ankyrin repeat domains (IPR002110 from INTERPRO) towards the C terminus of the protein which act as protein-protein interaction domains [].  Filip1 controls the start of neocortical cell migration from the ventricular zone by acting through a filamin-A/F-actin axis. It may be able to induce the degradation of Filamin A [, ].
Probab=82.15  E-value=12  Score=40.85  Aligned_cols=38  Identities=26%  Similarity=0.283  Sum_probs=27.0

Q ss_pred             HHHhhHHhhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          379 ERRLTELGRRTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESE  432 (1133)
Q Consensus       379 e~~~~Elgrr~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e  432 (1133)
                      +..|-+--+||++..+                -|..++.+.|.|||+|+|+-.+
T Consensus        90 ~~~qk~~q~Rm~~qL~----------------~aE~rhrr~i~eLe~EKrkh~~  127 (192)
T PF09727_consen   90 MEHQKKMQRRMLEQLA----------------AAEKRHRRTIQELEEEKRKHAE  127 (192)
T ss_pred             HHHHHHHHHHHHHHHH----------------HHHHHHHHHHHHHHHHHHHHHH
Confidence            3444455566666655                2678899999999999887553


No 80 
>PRK13461 F0F1 ATP synthase subunit B; Provisional
Probab=82.08  E-value=14  Score=38.14  Aligned_cols=47  Identities=19%  Similarity=0.109  Sum_probs=25.0

Q ss_pred             hhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          386 GRRTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQ  433 (1133)
Q Consensus       386 grr~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~  433 (1133)
                      ...+|.|++|--++. ++.---=-++-..|+..+...+++.+....+-
T Consensus         9 ~~~~inF~il~~iL~-~f~~kpi~~~l~~R~~~I~~~l~~A~~~~~eA   55 (159)
T PRK13461          9 IATIINFIILLLILK-HFFFDKIKAVIDSRQSEIDNKIEKADEDQKKA   55 (159)
T ss_pred             HHHHHHHHHHHHHHH-HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            345566655544443 22211123445677788777777766554443


No 81 
>PRK06568 F0F1 ATP synthase subunit B; Validated
Probab=81.93  E-value=12  Score=39.26  Aligned_cols=49  Identities=8%  Similarity=-0.082  Sum_probs=28.9

Q ss_pred             hHHHHHHHhhhhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q 001172          388 RTVEIFVLAHIFSNKIEVAYQ--EAVALKRQEELIREEEAAWLAESEQKAKRGA  439 (1133)
Q Consensus       388 r~~~ifa~~k~F~~rv~~Ay~--E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~  439 (1133)
                      -+|-|++|--+|.   +.+|+  -++-.+|+++.-.+++++++.++|-+.-+..
T Consensus        10 q~I~FlIll~ll~---kfawkPI~~~LeeR~~~I~~~Ld~Ae~~r~eA~~l~~e   60 (154)
T PRK06568         10 LAVSFVIFVYLIY---RPAKKAILNSLDAKILEVQEKVLKAEKLKEDAALLFEQ   60 (154)
T ss_pred             HHHHHHHHHHHHH---HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3455555444554   33333  2344667888888888888777765544443


No 82 
>KOG2357 consensus Uncharacterized conserved protein [Function unknown]
Probab=81.70  E-value=11  Score=44.85  Aligned_cols=26  Identities=12%  Similarity=0.286  Sum_probs=15.0

Q ss_pred             ccceecccccc----CCc-cCCEEEEEEEEE
Q 001172          165 KKFMELSKVSD----GFK-DGDTLIIKAQVQ  190 (1133)
Q Consensus       165 skFIkrsdLEs----GFL-vDDsLTIeCdVt  190 (1133)
                      .++++|.+|.+    -++ ..|.++|++.+.
T Consensus       189 L~l~~RqDl~S~v~~~v~P~~D~vt~ev~l~  219 (440)
T KOG2357|consen  189 LKLVKRQDLLSRVMNSVRPVGDQVTFEVTLN  219 (440)
T ss_pred             hhhhhhccHHHHHHHhcccccceEEEEEecC
Confidence            35666766662    222 357777776653


No 83 
>PF07888 CALCOCO1:  Calcium binding and coiled-coil domain (CALCOCO1) like;  InterPro: IPR012852 Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein expressed by Mus musculus (CoCoA, Q8CGU1 from SWISSPROT). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1 (Q61026 from SWISSPROT), and thus enhances transcriptional activation by a number of nuclear receptors. CoCoA has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region []. 
Probab=80.86  E-value=67  Score=40.07  Aligned_cols=63  Identities=22%  Similarity=0.197  Sum_probs=35.0

Q ss_pred             hhhhhhHHHHHHhhhhhhhhhcccCCchhhhhhhhcccCCCCCCCCCccc---cccCCCCCCCCccc
Q 001172          457 NRKGKEKKREERSSMALSDRLEDENPSDEKKEFIVEDAQPLPEKPDVLED---VSDVSDSVDGGAEV  520 (1133)
Q Consensus       457 ~rk~~e~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~d~~e~---~sd~sd~~d~~~e~  520 (1133)
                      ..||||+..+|+..-...-+++|..-.+- -+.+-++|--+.+-......   .+|+|||-|.+-|.
T Consensus       429 ~qkEKEql~~EkQeL~~yi~~Le~r~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~s~s~de~p~~  494 (546)
T PF07888_consen  429 AQKEKEQLQEEKQELLEYIERLEQRLDKV-ADEKWKEAAALTEDATAASPPSCPADLSDSEDESPED  494 (546)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hhhhhhccccccCccccccCCCCCcCCCCcccCCccc
Confidence            34677777777777666666666332222 23333344444443333322   27888888887774


No 84 
>COG3064 TolA Membrane protein involved in colicin uptake [Cell envelope biogenesis, outer membrane]
Probab=79.85  E-value=6.6  Score=45.34  Aligned_cols=12  Identities=33%  Similarity=0.260  Sum_probs=5.2

Q ss_pred             HHHHHHhhhhhh
Q 001172          463 KKREERSSMALS  474 (1133)
Q Consensus       463 ~~~ee~~~~~~~  474 (1133)
                      ++++|+..++++
T Consensus       178 kA~eeAkaKAe~  189 (387)
T COG3064         178 KAAEEAKAKAEA  189 (387)
T ss_pred             HHHHHHHHHHHH
Confidence            344444444443


No 85 
>TIGR01144 ATP_synt_b ATP synthase, F0 subunit b. This model describes the F1/F0 ATP synthase b subunit in bacteria only. Scoring just below the trusted cutoff are the N-terminal domains of Mycobacterial b/delta fusion proteins and a subunit from an archaeon, Methanosarcina barkeri, in which the ATP synthase homolog differs in architecture and is not experimentally confirmed. This model helps resolve b from the related b' subunit. Within the family is an example from a sodium-translocating rather than proton-translocating ATP synthase.
Probab=79.71  E-value=19  Score=36.48  Aligned_cols=36  Identities=14%  Similarity=0.062  Sum_probs=19.9

Q ss_pred             HHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          393 FVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQ  433 (1133)
Q Consensus       393 fa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~  433 (1133)
                      |+|-+.|...|..     +-..|+.....++++.+....+-
T Consensus        10 ~il~~~~~~pi~~-----~l~~R~~~I~~~l~~A~~~~~ea   45 (147)
T TIGR01144        10 WFCMKYVWPPLAK-----AIETRQKKIADGLASAERAKKEA   45 (147)
T ss_pred             HHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHH
Confidence            3344444444443     34567777777777766554443


No 86 
>COG4499 Predicted membrane protein [Function unknown]
Probab=79.66  E-value=3.4  Score=48.72  Aligned_cols=19  Identities=21%  Similarity=0.060  Sum_probs=12.2

Q ss_pred             CCceeecC-----eEEEEEEEeCC
Q 001172           87 SNAFEVGG-----YKWYILIYPQG  105 (1133)
Q Consensus        87 Sp~FsVGG-----hkWRI~~YPnG  105 (1133)
                      +++|.+..     -.|.|.|-|..
T Consensus        48 ~~~f~~~eit~~~Ds~vIsy~i~~   71 (434)
T COG4499          48 SPPFIVAEITEDNDSFVISYPIPE   71 (434)
T ss_pred             CCCccceeecccCceeEEEecCcc
Confidence            66666543     46888877654


No 87 
>KOG0742 consensus AAA+-type ATPase [Posttranslational modification, protein turnover, chaperones]
Probab=78.65  E-value=21  Score=43.07  Aligned_cols=18  Identities=33%  Similarity=0.288  Sum_probs=10.6

Q ss_pred             hhhHHHHHHhhccccccc
Q 001172          285 TSTLVMDSLYSGLKALEG  302 (1133)
Q Consensus       285 tSplvmd~L~~glk~le~  302 (1133)
                      -+.++|..||+--|+...
T Consensus        10 i~a~~~S~~~~~nka~ad   27 (630)
T KOG0742|consen   10 ISALAMSWLFGINKAYAD   27 (630)
T ss_pred             HHHHHHHHHhccchhhcc
Confidence            456666666666664433


No 88 
>PRK05759 F0F1 ATP synthase subunit B; Validated
Probab=77.95  E-value=23  Score=36.14  Aligned_cols=24  Identities=25%  Similarity=0.125  Sum_probs=14.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          410 AVALKRQEELIREEEAAWLAESEQ  433 (1133)
Q Consensus       410 ~~A~~rQ~~Ll~E~E~e~~~~~e~  433 (1133)
                      ++-..|+.++...+++.+....+-
T Consensus        31 ~~l~~R~~~I~~~l~~a~~~~~~a   54 (156)
T PRK05759         31 KALEERQKKIADGLAAAERAKKEL   54 (156)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344566777777776665554443


No 89 
>smart00225 BTB Broad-Complex, Tramtrack and Bric a brac. Domain in Broad-Complex, Tramtrack and Bric a brac. Also known as POZ (poxvirus and zinc finger) domain. Known to be a protein-protein interaction motif found at the N-termini of several C2H2-type transcription factors as well as Shaw-type potassium channels. Known structure reveals a tightly intertwined dimer formed via interactions between N-terminal strand and helix structures. However in a subset of BTB/POZ domains, these two secondary structures appear to be missing. Be aware SMART predicts BTB/POZ domains without the beta1- and alpha1-secondary structures.
Probab=77.77  E-value=0.46  Score=41.30  Aligned_cols=75  Identities=21%  Similarity=0.198  Sum_probs=46.9

Q ss_pred             hhhhhhhhhhhcchhhh---hhhHHHHHHhhccc-----c--ccc-ccccccccccccccCCCCCCceEeecceeeehhh
Q 001172          268 DAILKVVVKHFFIEKEV---TSTLVMDSLYSGLK-----A--LEG-QSKSKKTKAKLLDAEDTPPPIVHVENDMFVLVDD  336 (1133)
Q Consensus       268 DVilkV~~k~Ff~~keV---tSplvmd~L~~glk-----~--le~-~t~~~k~~~~~~~t~e~p~~~~~v~~d~~~l~~D  336 (1133)
                      ||++.+.++.|.+||.+   .|+.|..-+..+..     .  +.. ....++..++++|++....+..        .+.+
T Consensus         1 dv~i~v~~~~~~~h~~iL~~~s~~f~~~~~~~~~~~~~~~i~l~~~~~~~f~~~l~~ly~~~~~~~~~--------~~~~   72 (90)
T smart00225        1 DVTLVVGGKKFKAHKAVLAACSPYFKALFSGDFKESKKSEIYLDDVSPEDFRALLEFLYTGKLDLPEE--------NVEE   72 (90)
T ss_pred             CeEEEECCEEEehHHHHHhhcCHHHHHHHcCCCccCCCCEEEecCCCHHHHHHHHHeecCceeecCHH--------HHHH
Confidence            67889999999999987   44655444333221     1  111 1123666778888876432221        4568


Q ss_pred             HHHHHHHhhcCCCC
Q 001172          337 VLLLLERAALEPLP  350 (1133)
Q Consensus       337 ll~~leR~al~~Lp  350 (1133)
                      |+.+++.+.++.|-
T Consensus        73 l~~~a~~~~~~~l~   86 (90)
T smart00225       73 LLELADYLQIPGLV   86 (90)
T ss_pred             HHHHHHHHCcHHHH
Confidence            88888888877663


No 90 
>KOG2963 consensus RNA-binding protein required for 60S ribosomal subunit biogenesis [Translation, ribosomal structure and biogenesis]
Probab=77.56  E-value=3.5  Score=48.04  Aligned_cols=10  Identities=20%  Similarity=0.501  Sum_probs=5.2

Q ss_pred             hHHHHHhhhc
Q 001172          233 KLGRLIEDKA  242 (1133)
Q Consensus       233 hLG~LLqS~~  242 (1133)
                      -+-.|++.+.
T Consensus       207 ~vkkl~k~~~  216 (405)
T KOG2963|consen  207 NVKKLVKAKK  216 (405)
T ss_pred             HHHHHHhhhc
Confidence            4555555443


No 91 
>KOG4848 consensus Extracellular matrix-associated peroxidase [Extracellular structures; Defense mechanisms]
Probab=77.53  E-value=20  Score=39.05  Aligned_cols=17  Identities=47%  Similarity=0.468  Sum_probs=8.3

Q ss_pred             HHHHHHHHHHHHHHHHh
Q 001172          440 AEKEKKAKKKLAKQKRN  456 (1133)
Q Consensus       440 k~kekk~~~k~ak~~e~  456 (1133)
                      ||||.|+++|+||++++
T Consensus       198 kEkeekK~~KeaKrk~k  214 (225)
T KOG4848|consen  198 KEKEEKKAVKEAKRKEK  214 (225)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            44444555555554443


No 92 
>KOG1103 consensus Predicted coiled-coil protein [Function unknown]
Probab=77.14  E-value=11  Score=44.14  Aligned_cols=17  Identities=35%  Similarity=0.499  Sum_probs=11.7

Q ss_pred             CCCcccCCCccceeeec
Q 001172          575 SSSTCSTDSVPSVVMYG  591 (1133)
Q Consensus       575 ~sstcs~ds~~~~~~~g  591 (1133)
                      -|--|-|+|.|---..|
T Consensus       316 msvfcqtesfpaerthg  332 (561)
T KOG1103|consen  316 MSVFCQTESFPAERTHG  332 (561)
T ss_pred             hhhhhhcccCchhhccc
Confidence            36679999988554444


No 93 
>PF10211 Ax_dynein_light:  Axonemal dynein light chain;  InterPro: IPR019347  Axonemal dynein light chain proteins play a dynamic role in flagellar and cilial motility. Eukaryotic cilia and flagella are complex organelles consisting of a core structure, the axoneme, which is composed of nine microtubule doublets forming a cylinder that surrounds a pair of central singlet microtubules. This ultra-structural arrangement seems to be one of the most stable micro-tubular assemblies known and is responsible for the flagellar and ciliary movement of a large number of organisms ranging from protozoan to mammals. This light chain interacts directly with the N-terminal half of the heavy chains []. 
Probab=76.26  E-value=43  Score=36.13  Aligned_cols=46  Identities=13%  Similarity=0.125  Sum_probs=28.6

Q ss_pred             HHHhhHHhhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          379 ERRLTELGRRTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEE  424 (1133)
Q Consensus       379 e~~~~Elgrr~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E  424 (1133)
                      .-.|+|-|.=++.|+---.+.-.++..-|...++.-.++.|..|..
T Consensus        76 Ti~C~ERGlLL~rvrde~~~~l~~y~~l~~s~~~f~~rk~l~~e~~  121 (189)
T PF10211_consen   76 TIDCPERGLLLLRVRDEYRMTLDAYQTLYESSIAFGMRKALQAEQG  121 (189)
T ss_pred             HhCcHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3457888866666655333333456666777777766776666543


No 94 
>KOG3915 consensus Transcription regulator dachshund, contains SKI/SNO domain [Transcription]
Probab=75.46  E-value=22  Score=43.06  Aligned_cols=16  Identities=38%  Similarity=0.395  Sum_probs=7.4

Q ss_pred             HHHHHHHHHHHHHHHH
Q 001172          404 EVAYQEAVALKRQEEL  419 (1133)
Q Consensus       404 ~~Ay~E~~A~~rQ~~L  419 (1133)
                      ++|..-+-++++|-++
T Consensus       513 kva~dnar~qekQiq~  528 (641)
T KOG3915|consen  513 KVAIDNARAQEKQIQL  528 (641)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4444444444444443


No 95 
>cd02666 Peptidase_C19J A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=75.41  E-value=0.96  Score=52.42  Aligned_cols=34  Identities=21%  Similarity=0.262  Sum_probs=31.0

Q ss_pred             eecCCchHHHHhhhhhHHHhhHHHHHhhhccccccchhhhccC
Q 001172          213 RVYLTNVEQICRRFVEERRGKLGRLIEDKARWSSFCAFWLGID  255 (1133)
Q Consensus       213 ~VPpSNLgaTC~~fved~~~hLG~LLqS~~~~ssFRaf~~a~~  255 (1133)
                      +||+.|+|.||         ||++|||.......||.+++..+
T Consensus         1 PvGL~NlGNTC---------YmNSlLQ~L~~i~~lR~~i~~~~   34 (343)
T cd02666           1 PAGLDNIGNTC---------YLNSLLQYFFTIKPLRDLVLNFD   34 (343)
T ss_pred             CCCcccCCcee---------HHHHHHHHHHccHHHHHHHHcCC
Confidence            48999999999         99999999999999999888765


No 96 
>KOG4786 consensus Ubinuclein, nuclear protein interacting with cellular and viral transcription factors [Transcription; Signal transduction mechanisms]
Probab=75.06  E-value=2.3  Score=52.95  Aligned_cols=79  Identities=27%  Similarity=0.361  Sum_probs=54.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh---HHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhhhhhhhh
Q 001172          401 NKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKRGA---AEKEKKAKKKLAKQKRNNRKGKEKKREERSSMALSDRL  477 (1133)
Q Consensus       401 ~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~---k~kekk~~~k~ak~~e~~rk~~e~~~ee~~~~~~~~~~  477 (1133)
                      .-+++|-+|--|+.-+.+=+.|+|+.-..++++++-+..   +|++.+.++.+.+++++.|++.++..+...+.+.+.++
T Consensus       673 ~~feAaSKELAaL~~RA~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~K~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~  752 (1136)
T KOG4786|consen  673 YFFEAASKELAALKERAERLKKLEEKLKKEKEKQAEKDRIEAKKFEERMKKEQEKQEEKERKEREKREEKERKEIMERKK  752 (1136)
T ss_pred             HHHHHHHHHHHHHHHHhhhhhhccccccccchhhhhccchhhHHHHhhcccccccCcchhhhhhhhhHHHHHHHHhhhhh
Confidence            334789999999998888888888766666665555543   67777777777888888888777655444444444444


Q ss_pred             cc
Q 001172          478 ED  479 (1133)
Q Consensus       478 ~~  479 (1133)
                      +|
T Consensus       753 ~E  754 (1136)
T KOG4786|consen  753 RE  754 (1136)
T ss_pred             hc
Confidence            44


No 97 
>KOG0556 consensus Aspartyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]
Probab=73.60  E-value=3.7  Score=48.82  Aligned_cols=20  Identities=55%  Similarity=0.727  Sum_probs=15.4

Q ss_pred             CCccccccCCCCCCCCccccC
Q 001172          502 DVLEDVSDVSDSVDGGAEVLQ  522 (1133)
Q Consensus       502 d~~e~~sd~sd~~d~~~e~~~  522 (1133)
                      -++-+|||++.+.|+ .|||-
T Consensus        68 ~~~~~v~dl~~~~~~-~~V~v   87 (533)
T KOG0556|consen   68 RELTDVSDLDESNDG-SEVLV   87 (533)
T ss_pred             cceeehhhhhhhcCC-ceEEE
Confidence            455579999999999 66654


No 98 
>KOG1924 consensus RhoA GTPase effector DIA/Diaphanous [Signal transduction mechanisms; Cytoskeleton]
Probab=73.39  E-value=9.7  Score=48.43  Aligned_cols=19  Identities=37%  Similarity=0.427  Sum_probs=12.5

Q ss_pred             HHHHHHHhhhhhHHHHHHHHHHH
Q 001172          389 TVEIFVLAHIFSNKIEVAYQEAV  411 (1133)
Q Consensus       389 ~~~ifa~~k~F~~rv~~Ay~E~~  411 (1133)
                      |=+||+-.++|+    .||.|++
T Consensus       975 mEEFFaDi~tFr----naf~ea~  993 (1102)
T KOG1924|consen  975 MEEFFADIRTFR----NAFLEAV  993 (1102)
T ss_pred             HHHHHHHHHHHH----HHHHHHH
Confidence            345888788884    5565555


No 99 
>KOG2357 consensus Uncharacterized conserved protein [Function unknown]
Probab=73.13  E-value=19  Score=43.19  Aligned_cols=26  Identities=27%  Similarity=0.446  Sum_probs=14.2

Q ss_pred             CCCCCceE------eecceeeehhhHHHHHHHhh
Q 001172          318 DTPPPIVH------VENDMFVLVDDVLLLLERAA  345 (1133)
Q Consensus       318 e~p~~~~~------v~~d~~~l~~Dll~~leR~a  345 (1133)
                      .+|..++.      |.++||.  .-++..|..|+
T Consensus       262 ~lP~~~~vmSE~nEvs~~i~~--~~v~~~l~k~~  293 (440)
T KOG2357|consen  262 NLPESFVVMSESNEVSGAIFE--DKVVSQLNKYG  293 (440)
T ss_pred             CCCcceeeeecchhhhhhhhc--hHHHHHHHHHH
Confidence            55655544      4455554  35666666663


No 100
>KOG4691 consensus Uncharacterized conserved protein [Function unknown]
Probab=72.93  E-value=24  Score=38.56  Aligned_cols=23  Identities=13%  Similarity=0.101  Sum_probs=11.6

Q ss_pred             HHHHHHHHHhhhHHHHHHHHHHH
Q 001172          428 LAESEQKAKRGAAEKEKKAKKKL  450 (1133)
Q Consensus       428 ~~~~e~~~kk~~k~kekk~~~k~  450 (1133)
                      ++.++|+++-++++||++....+
T Consensus       122 ~~~~~R~~Rla~~~~E~~~~i~e  144 (227)
T KOG4691|consen  122 RLHELRIARLAQEEREQEQRIAE  144 (227)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHH
Confidence            33444555555555665544433


No 101
>TIGR02794 tolA_full TolA protein. TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB.
Probab=72.67  E-value=18  Score=42.49  Aligned_cols=7  Identities=0%  Similarity=0.140  Sum_probs=3.0

Q ss_pred             hHHHHHH
Q 001172          400 SNKIEVA  406 (1133)
Q Consensus       400 ~~rv~~A  406 (1133)
                      ++|....
T Consensus        95 ~~~~~~~  101 (346)
T TIGR02794        95 EQRAAAE  101 (346)
T ss_pred             HHHHHHH
Confidence            3444433


No 102
>PRK14474 F0F1 ATP synthase subunit B; Provisional
Probab=72.44  E-value=49  Score=37.17  Aligned_cols=20  Identities=10%  Similarity=0.182  Sum_probs=10.4

Q ss_pred             hHHHHHHHhhhhhHHHHHHH
Q 001172          388 RTVEIFVLAHIFSNKIEVAY  407 (1133)
Q Consensus       388 r~~~ifa~~k~F~~rv~~Ay  407 (1133)
                      =+|=+|+|-+.|..-|..++
T Consensus        15 FlILv~lL~~fl~kPi~~~l   34 (250)
T PRK14474         15 FLILVYLLRRFLYKPIIQVM   34 (250)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            34556666666644444443


No 103
>KOG4031 consensus Vesicle coat protein clathrin, light chain [Intracellular trafficking, secretion, and vesicular transport]
Probab=72.29  E-value=14  Score=40.56  Aligned_cols=24  Identities=17%  Similarity=0.194  Sum_probs=15.0

Q ss_pred             HHHHHHHHHHHHH-HHHHHHHHHHH
Q 001172          413 LKRQEELIREEEA-AWLAESEQKAK  436 (1133)
Q Consensus       413 ~~rQ~~Ll~E~E~-e~~~~~e~~~k  436 (1133)
                      ...|+++|+|..+ ++++++|-++|
T Consensus       112 keeQ~~rl~ekD~~sek~k~Elrek  136 (216)
T KOG4031|consen  112 KEEQMKRLQEKDEASEKLKEELREK  136 (216)
T ss_pred             HHHHHHHHHHhhhhHHHHHHHHHHH
Confidence            3578899999855 44445554333


No 104
>PF09756 DDRGK:  DDRGK domain;  InterPro: IPR019153  This is a family of proteins of approximately 300 residues. They contain a highly conserved DDRGK motif. The function is unknown. ; PDB: 1WI9_A.
Probab=71.37  E-value=1.3  Score=47.73  Aligned_cols=18  Identities=22%  Similarity=0.499  Sum_probs=0.0

Q ss_pred             HHHHHhhhhhhhHHHHHH
Q 001172          451 AKQKRNNRKGKEKKREER  468 (1133)
Q Consensus       451 ak~~e~~rk~~e~~~ee~  468 (1133)
                      .+++++++++++.+++|+
T Consensus        37 ~~ee~r~k~ee~~~~~E~   54 (188)
T PF09756_consen   37 EREEERRKKEEEEEEEEE   54 (188)
T ss_dssp             ------------------
T ss_pred             HHHHHHHHHHHHHHHHHH
Confidence            333333333333333333


No 105
>PRK09173 F0F1 ATP synthase subunit B; Validated
Probab=71.04  E-value=40  Score=34.82  Aligned_cols=30  Identities=13%  Similarity=0.015  Sum_probs=20.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 001172          409 EAVALKRQEELIREEEAAWLAESEQKAKRG  438 (1133)
Q Consensus       409 E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~  438 (1133)
                      -++-..|++++...+++.+..+++-++.+.
T Consensus        28 ~~~l~~R~~~I~~~l~~Ae~~~~eA~~~~~   57 (159)
T PRK09173         28 ARSLDARADRIKNELAEARRLREEAQQLLA   57 (159)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            455678888888888887766665444443


No 106
>PF06936 Selenoprotein_S:  Selenoprotein S (SelS);  InterPro: IPR009703 This family consists of several mammalian selenoprotein S (SelS) sequences. SelS is a plasma membrane protein and is present in a variety of tissues and cell types. These proteins are involved in the degradation process of misfolded endoplasmic reticulum (ER) luminal proteins which participate in the transfer of misfolded proteins from the ER to the cytosol, where they are destroyed by the proteasome in a ubiquitin-dependent manner []. They probably serve as a linker between DER1, which mediates the retro-translocation of misfolded proteins into the cytosol, and the ATPase complex VCP, which mediates the translocation and ubiquitination.; GO: 0008430 selenium binding, 0006886 intracellular protein transport, 0030176 integral to endoplasmic reticulum membrane; PDB: 2Q2F_A.
Probab=70.11  E-value=25  Score=38.22  Aligned_cols=27  Identities=15%  Similarity=0.177  Sum_probs=2.0

Q ss_pred             HhhHHhhhHHHHHHHhhhhhHHHHHHH
Q 001172          381 RLTELGRRTVEIFVLAHIFSNKIEVAY  407 (1133)
Q Consensus       381 ~~~Elgrr~~~ifa~~k~F~~rv~~Ay  407 (1133)
                      -+...|+=+|-++++..++-++|...|
T Consensus        32 ~L~~yGWyil~~~I~ly~l~qkl~~~~   58 (190)
T PF06936_consen   32 FLSSYGWYILFGCILLYLLWQKLSPSF   58 (190)
T ss_dssp             ------------------------HHH
T ss_pred             HHHHhCHHHHHHHHHHHHHHHHHHHHH
Confidence            455678777776666666557777666


No 107
>PTZ00234 variable surface protein Vir12; Provisional
Probab=70.06  E-value=6.1  Score=47.57  Aligned_cols=27  Identities=19%  Similarity=0.592  Sum_probs=19.5

Q ss_pred             CCCCCCCCCCCCCcc--------cccccccCCCCC
Q 001172          825 GPDLAPATHGYIPQS--------YRNVKMGNPVGS  851 (1133)
Q Consensus       825 g~dpS~~~~sy~pqs--------YrNAi~g~~~~~  851 (1133)
                      +++|.+..|+.+--+        |||+|||-.+-+
T Consensus       341 ~~~~~~~~~~~l~~~~d~~dSn~~rniim~~ailG  375 (433)
T PTZ00234        341 APEVNPDTSNFLEKTFGILKSEYFRHSIVGASIIG  375 (433)
T ss_pred             CCcCCCCCCchhhhhhhhhhhHHHHHHHHHHHHHH
Confidence            777777777766543        999999955433


No 108
>PF15236 CCDC66:  Coiled-coil domain-containing protein 66
Probab=69.86  E-value=21  Score=37.80  Aligned_cols=23  Identities=17%  Similarity=0.061  Sum_probs=12.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHhhh
Q 001172          417 EELIREEEAAWLAESEQKAKRGA  439 (1133)
Q Consensus       417 ~~Ll~E~E~e~~~~~e~~~kk~~  439 (1133)
                      ..+++|+++|.|+..|++.-+..
T Consensus        80 rr~~EE~~EE~Rl~rere~~q~~  102 (157)
T PF15236_consen   80 RRRREEEEEEERLAREREELQRQ  102 (157)
T ss_pred             HHHHHhHHHHHHHHHHHHHHHHH
Confidence            34445555666776666544443


No 109
>COG5269 ZUO1 Ribosome-associated chaperone zuotin [Translation, ribosomal structure and biogenesis / Posttranslational modification, protein turnover, chaperones]
Probab=69.85  E-value=31  Score=39.53  Aligned_cols=59  Identities=22%  Similarity=0.229  Sum_probs=29.0

Q ss_pred             CCcccchhHHHHHHhhHHhhhHHH-HHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          368 EDFNKDSIERDERRLTELGRRTVE-IFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWL  428 (1133)
Q Consensus       368 ~dvs~dtve~ee~~~~Elgrr~~~-ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~  428 (1133)
                      +|+..|.-++|-.|..|-+.|--- -.-...+  -||....+=+.+...+-++++|.|.+..
T Consensus       184 ed~pdd~e~rD~kry~e~KNr~~r~k~K~~Dn--aRl~~LV~~A~~~DPRIK~fkEqeK~~k  243 (379)
T COG5269         184 EDYPDDMEERDRKRYSEAKNREKRAKLKNQDN--ARLKRLVQIAKKRDPRIKSFKEQEKEMK  243 (379)
T ss_pred             hhcCcchhhhhhHHHHHhhhHHHHHhhhchhH--HHHHHHHHHHHhcCcchhhHHHHHHHHH
Confidence            335444444565555554433211 0000111  2455555556666777788888766543


No 110
>PRK09174 F0F1 ATP synthase subunit B'; Validated
Probab=69.50  E-value=68  Score=35.10  Aligned_cols=22  Identities=5%  Similarity=0.092  Sum_probs=11.9

Q ss_pred             HHHHHHHhhhhhHHHHHHHHHH
Q 001172          389 TVEIFVLAHIFSNKIEVAYQEA  410 (1133)
Q Consensus       389 ~~~ifa~~k~F~~rv~~Ay~E~  410 (1133)
                      .|=+|+|.++|-.+|.....+.
T Consensus        64 liL~~lL~k~~~~pI~~vLe~R   85 (204)
T PRK09174         64 GLFYLFMSRVILPRIGGIIETR   85 (204)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            3445555666655555554443


No 111
>PF06637 PV-1:  PV-1 protein (PLVAP);  InterPro: IPR009538 This family consists of several PV-1 (PLVAP) proteins, which seem to be specific to mammals. PV-1 is a novel protein component of the endothelial fenestral and stomatal diaphragms []. The function of this family is unknown.
Probab=69.20  E-value=46  Score=39.66  Aligned_cols=73  Identities=26%  Similarity=0.276  Sum_probs=38.6

Q ss_pred             HhhhhhHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhh-hhhhhHHHHH
Q 001172          395 LAHIFSNKIEVAYQEAVALKRQEELIREE-----EAAWLAESEQKAKRGAAEKEKKAKKKLAKQKRNN-RKGKEKKREE  467 (1133)
Q Consensus       395 ~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~-----E~e~~~~~e~~~kk~~k~kekk~~~k~ak~~e~~-rk~~e~~~ee  467 (1133)
                      |++-+.-.|..+-+|---+.||+..++--     |+.++.++|-.++-++..-|--++.+.|-||+.+ ||+++-.+.|
T Consensus       286 lar~Lr~~I~~VarENs~LqrQKle~e~~l~a~qeakek~~KEAqareaklqaec~rQ~qlaLEEKaaLrkerd~L~ke  364 (442)
T PF06637_consen  286 LARSLRAGIERVARENSDLQRQKLEAEQGLQASQEAKEKAGKEAQAREAKLQAECARQTQLALEEKAALRKERDSLAKE  364 (442)
T ss_pred             HHHHHhhhHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            45666667777888888788887665543     4444444444343333333444444445444422 3444433333


No 112
>CHL00118 atpG ATP synthase CF0 B' subunit; Validated
Probab=68.75  E-value=51  Score=34.19  Aligned_cols=10  Identities=40%  Similarity=0.577  Sum_probs=4.8

Q ss_pred             HHHHhhcCCC
Q 001172          340 LLERAALEPL  349 (1133)
Q Consensus       340 ~leR~al~~L  349 (1133)
                      +|-++...||
T Consensus        38 lL~k~l~~Pi   47 (156)
T CHL00118         38 LLNIILYKPL   47 (156)
T ss_pred             HHHHHHHHHH
Confidence            4445554444


No 113
>TIGR03321 alt_F1F0_F0_B alternate F1F0 ATPase, F0 subunit B. CC and in principle may run in either direction. This model represents the F0 subunit B of this apparent second ATP synthase.
Probab=68.18  E-value=69  Score=35.64  Aligned_cols=13  Identities=8%  Similarity=0.337  Sum_probs=6.3

Q ss_pred             HHHHHHHhhhhhH
Q 001172          389 TVEIFVLAHIFSN  401 (1133)
Q Consensus       389 ~~~ifa~~k~F~~  401 (1133)
                      +|=+|+|-+.|..
T Consensus        16 lil~~lL~kfl~k   28 (246)
T TIGR03321        16 LILVWLLKRFLYR   28 (246)
T ss_pred             HHHHHHHHHHhHH
Confidence            4445555555533


No 114
>KOG0112 consensus Large RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=68.13  E-value=6  Score=50.78  Aligned_cols=14  Identities=57%  Similarity=0.601  Sum_probs=8.2

Q ss_pred             CCCCccccccCCCC
Q 001172          500 KPDVLEDVSDVSDS  513 (1133)
Q Consensus       500 k~d~~e~~sd~sd~  513 (1133)
                      ||-..++++|-+|+
T Consensus       782 ~p~~k~da~~~~ds  795 (975)
T KOG0112|consen  782 KPHRKGDASDESDS  795 (975)
T ss_pred             CCCCcCcCccCccc
Confidence            66666666555443


No 115
>COG0711 AtpF F0F1-type ATP synthase, subunit b [Energy production and conversion]
Probab=67.96  E-value=85  Score=32.98  Aligned_cols=30  Identities=17%  Similarity=0.006  Sum_probs=19.5

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 001172          408 QEAVALKRQEELIREEEAAWLAESEQKAKR  437 (1133)
Q Consensus       408 ~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk  437 (1133)
                      -.++...|+.++=.++.+++++++|-++-.
T Consensus        31 i~~~l~~R~~~I~~~l~~A~~~~~ea~~~~   60 (161)
T COG0711          31 ILKALDERQAKIADDLAEAERLKEEAQALL   60 (161)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            345556677777778877777766544433


No 116
>PRK13454 F0F1 ATP synthase subunit B'; Provisional
Probab=66.93  E-value=37  Score=36.18  Aligned_cols=26  Identities=19%  Similarity=0.153  Sum_probs=15.8

Q ss_pred             hhhHHHHHHHhhhhhHHHHHHHHHHH
Q 001172          386 GRRTVEIFVLAHIFSNKIEVAYQEAV  411 (1133)
Q Consensus       386 grr~~~ifa~~k~F~~rv~~Ay~E~~  411 (1133)
                      --=.|=+|+|.++|-.+|.....+..
T Consensus        39 I~F~iL~~ll~k~l~~PI~~~l~~R~   64 (181)
T PRK13454         39 VTLVAIYFVLTRVALPRIGAVLAERQ   64 (181)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            33455566667776666766665553


No 117
>KOG2192 consensus PolyC-binding hnRNP-K protein HRB57A/hnRNP, contains KH domain [RNA processing and modification; General function prediction only]
Probab=66.83  E-value=27  Score=39.80  Aligned_cols=102  Identities=17%  Similarity=0.177  Sum_probs=57.0

Q ss_pred             chhhhhhhhcccccccccccccccccccCCCCcccccccc-CCCCCCCcccCccCCccccccCCcccccCccccccCCCC
Q 001172          978 HLDIINDLLDDEHGVGMAAGASTVLQSLSNGPHTLNRQFS-FPRDISMSSDIGSSAGSCKFERTRSYHDDGFQRGYSSSV 1056 (1133)
Q Consensus       978 HLDIINDLL~eeq~~g~~~~~~~~~~~~~~~~~~f~~~fs-~~g~~~~~~d~~s~~~~~rf~~t~~y~d~g~~~~y~~~~ 1056 (1133)
                      -+.||=|||.|--.-|++.---..|.--.-+|-.|-.+|- .||+-+-.+--|        .+--.=--..+.++|+.-+
T Consensus       183 ~i~~il~~i~e~pikgsa~py~p~fyd~t~dyggf~M~f~d~pg~pgpapqrg--------gqgpp~~~~sdlmay~r~G  254 (390)
T KOG2192|consen  183 CIKIILDLISESPIKGSAQPYDPNFYDETYDYGGFTMMFDDRPGRPGPAPQRG--------GQGPPPPRGSDLMAYDRRG  254 (390)
T ss_pred             HHHHHHHHhhcCCcCCcCCcCCccccCcccccCCceeecCCCCCCCCCCCCCC--------CCCCCCCCccccceeccCC
Confidence            3677788888877777765433334333345666665554 444433211111        1111111122456666554


Q ss_pred             ---CcccccccccCccCCCCCCCCccCCCCCCCCCCCC
Q 001172         1057 ---GHFDSVREFIPQATALPYSNGQIDGMIPTMWPMPG 1091 (1133)
Q Consensus      1057 ---~~~~~~r~~~~~~~~~~Y~ng~~D~~~~~~W~~~~ 1091 (1133)
                         .-|+++=+|+++    --.+|-+|...|+-|+|+.
T Consensus       255 rpG~rydg~vdFs~d----etw~saidtw~~SewqmaY  288 (390)
T KOG2192|consen  255 RPGDRYDGMVDFSAD----ETWPSAIDTWSPSEWQMAY  288 (390)
T ss_pred             CCCcccccccccccc----ccCCCcCCCcCcccccccc
Confidence               236666666554    3358899999999999953


No 118
>PF00430 ATP-synt_B:  ATP synthase B/B' CF(0);  InterPro: IPR002146 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include:   F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP.   F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis []. These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents subunits B and B' from the F0 complex in F-ATPases found in chloroplasts and in bacterial plasma membranes. The B subunits are part of the peripheral stalk that links the F1 and F0 complexes together, and which acts as a stator to prevent certain subunits from rotating with the central rotary element. The peripheral stalk differs in subunit composition between mitochondrial, chloroplast and bacterial F-ATPases. In bacterial and chloroplast F-ATPases, the peripheral stalk is composed of one copy of the delta subunit (homologous to OSCP in mitochondria), and two copies of subunit B in bacteria, or one copy each of subunits B and B' in chloroplasts and photosynthetic bacteria []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0015078 hydrogen ion transmembrane transporter activity, 0015986 ATP synthesis coupled proton transport, 0045263 proton-transporting ATP synthase complex, coupling factor F(o); PDB: 1L2P_A 2KHK_A 1B9U_A.
Probab=66.07  E-value=18  Score=35.40  Aligned_cols=35  Identities=20%  Similarity=0.209  Sum_probs=15.1

Q ss_pred             HHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          392 IFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAES  431 (1133)
Q Consensus       392 ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~  431 (1133)
                      +|++.++|...|...+     ..|+.++..++++.+...+
T Consensus        13 ~~~l~~~~~~pi~~~l-----~~R~~~I~~~~~~a~~~~~   47 (132)
T PF00430_consen   13 FFLLNKFLYKPIKKFL-----DERKAKIQSELEEAEELKE   47 (132)
T ss_dssp             HHHHHHHTHHHHHHHC-----S--S-HHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHH-----HHHHHHHHHHHHhHHHHHH
Confidence            4444555554444433     3445555555554444433


No 119
>PF06098 Radial_spoke_3:  Radial spoke protein 3;  InterPro: IPR009290 This family consists of several radial spoke protein 3 (RSP3) sequences. Eukaryotic cilia and flagella present in diverse types of cells perform motile, sensory, and developmental functions in organisms from protists to humans. They are centred by precisely organised, microtubule-based structures, the axonemes. The axoneme consists of two central singlet microtubules, called the central pair, and nine outer doublet microtubules. These structures are well conserved during evolution. The outer doublet microtubules, each composed of A and B sub-fibres, are connected to each other by nexin links, while the central pair is held at the centre of the axoneme by radial spokes. The radial spokes are T-shaped structures extending from the A-tubule of each outer doublet microtubule to the centre of the axoneme. Radial spoke protein 3 (RSP3), is present at the proximal end of the spoke stalk and helps in anchoring the radial spoke to the outer doublet. It is thought that radial spokes regulate the activity of inner arm dynein through protein phosphorylation and dephosphorylation [].
Probab=65.61  E-value=38  Score=39.08  Aligned_cols=19  Identities=26%  Similarity=0.078  Sum_probs=11.5

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q 001172          404 EVAYQEAVALKRQEELIRE  422 (1133)
Q Consensus       404 ~~Ay~E~~A~~rQ~~Ll~E  422 (1133)
                      +...++.|.+.|..+|.+.
T Consensus       158 lr~~q~~fe~~R~aEl~e~  176 (291)
T PF06098_consen  158 LRRQQRAFEELRNAELAEV  176 (291)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            4456667766666666543


No 120
>PF06658 DUF1168:  Protein of unknown function (DUF1168);  InterPro: IPR009548 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=64.92  E-value=44  Score=34.97  Aligned_cols=19  Identities=32%  Similarity=0.557  Sum_probs=8.6

Q ss_pred             HHHHHHHHHHHHHHHHhhh
Q 001172          440 AEKEKKAKKKLAKQKRNNR  458 (1133)
Q Consensus       440 k~kekk~~~k~ak~~e~~r  458 (1133)
                      |+++|+.++|+++...+..
T Consensus        85 KkR~KR~KkK~kk~~~k~~  103 (142)
T PF06658_consen   85 KKRAKRQKKKQKKKKKKKK  103 (142)
T ss_pred             HHHHHHHHHHHHHHHhhhc
Confidence            4444444444444444433


No 121
>PRK13460 F0F1 ATP synthase subunit B; Provisional
Probab=64.57  E-value=1.1e+02  Score=32.34  Aligned_cols=25  Identities=24%  Similarity=0.107  Sum_probs=13.8

Q ss_pred             hhhHHHHHHHhhhhhHHHHHHHHHH
Q 001172          386 GRRTVEIFVLAHIFSNKIEVAYQEA  410 (1133)
Q Consensus       386 grr~~~ifa~~k~F~~rv~~Ay~E~  410 (1133)
                      -.=+|=+|+|-+.|...|...+.+.
T Consensus        24 i~Flil~~iL~~~~~kpi~~~l~~R   48 (173)
T PRK13460         24 VTFLVVVLVLKKFAWDVILKALDER   48 (173)
T ss_pred             HHHHHHHHHHHHHhHHHHHHHHHHH
Confidence            3445566666677655555554433


No 122
>PF06936 Selenoprotein_S:  Selenoprotein S (SelS);  InterPro: IPR009703 This family consists of several mammalian selenoprotein S (SelS) sequences. SelS is a plasma membrane protein and is present in a variety of tissues and cell types. These proteins are involved in the degradation process of misfolded endoplasmic reticulum (ER) luminal proteins which participate in the transfer of misfolded proteins from the ER to the cytosol, where they are destroyed by the proteasome in a ubiquitin-dependent manner []. They probably serve as a linker between DER1, which mediates the retro-translocation of misfolded proteins into the cytosol, and the ATPase complex VCP, which mediates the translocation and ubiquitination.; GO: 0008430 selenium binding, 0006886 intracellular protein transport, 0030176 integral to endoplasmic reticulum membrane; PDB: 2Q2F_A.
Probab=64.18  E-value=27  Score=37.98  Aligned_cols=13  Identities=31%  Similarity=0.294  Sum_probs=5.9

Q ss_pred             HHHHHHHHHHHHH
Q 001172          402 KIEVAYQEAVALK  414 (1133)
Q Consensus       402 rv~~Ay~E~~A~~  414 (1133)
                      -+.++.+|++..-
T Consensus        76 d~v~~rqEa~eaA   88 (190)
T PF06936_consen   76 DVVVRRQEAMEAA   88 (190)
T ss_dssp             HHHHHHHHHHHHH
T ss_pred             hHHHHHHHHHHHH
Confidence            3344455554433


No 123
>PRK14472 F0F1 ATP synthase subunit B; Provisional
Probab=63.99  E-value=1.1e+02  Score=32.21  Aligned_cols=17  Identities=24%  Similarity=0.278  Sum_probs=7.3

Q ss_pred             HHHHHHhhhhhHHHHHH
Q 001172          390 VEIFVLAHIFSNKIEVA  406 (1133)
Q Consensus       390 ~~ifa~~k~F~~rv~~A  406 (1133)
                      |=+|+|-+.+...|...
T Consensus        30 il~~lL~~~l~kpi~~~   46 (175)
T PRK14472         30 IVLLILKKIAWGPILSA   46 (175)
T ss_pred             HHHHHHHHHhHHHHHHH
Confidence            44444444443333333


No 124
>PRK13461 F0F1 ATP synthase subunit B; Provisional
Probab=63.54  E-value=1.2e+02  Score=31.31  Aligned_cols=20  Identities=25%  Similarity=0.557  Sum_probs=9.8

Q ss_pred             HHHHHHHhhhhhHHHHHHHH
Q 001172          389 TVEIFVLAHIFSNKIEVAYQ  408 (1133)
Q Consensus       389 ~~~ifa~~k~F~~rv~~Ay~  408 (1133)
                      +|=+|+|.+.+...|.....
T Consensus        16 ~il~~iL~~f~~kpi~~~l~   35 (159)
T PRK13461         16 IILLLILKHFFFDKIKAVID   35 (159)
T ss_pred             HHHHHHHHHHhHHHHHHHHH
Confidence            44455555555444444433


No 125
>KOG3915 consensus Transcription regulator dachshund, contains SKI/SNO domain [Transcription]
Probab=62.60  E-value=68  Score=39.15  Aligned_cols=16  Identities=25%  Similarity=0.289  Sum_probs=7.8

Q ss_pred             HHHHHHHHHHHHHHHH
Q 001172          417 EELIREEEAAWLAESE  432 (1133)
Q Consensus       417 ~~Ll~E~E~e~~~~~e  432 (1133)
                      +++|||-|-.+.+|++
T Consensus       535 md~lrerelreslekq  550 (641)
T KOG3915|consen  535 MDFLRERELRESLEKQ  550 (641)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4455555554444443


No 126
>KOG0742 consensus AAA+-type ATPase [Posttranslational modification, protein turnover, chaperones]
Probab=62.50  E-value=77  Score=38.64  Aligned_cols=7  Identities=71%  Similarity=0.624  Sum_probs=4.7

Q ss_pred             HHHHHhh
Q 001172          339 LLLERAA  345 (1133)
Q Consensus       339 ~~leR~a  345 (1133)
                      .+|||+|
T Consensus        76 eaLERaA   82 (630)
T KOG0742|consen   76 EALERAA   82 (630)
T ss_pred             HHHHHHH
Confidence            4777774


No 127
>PRK13453 F0F1 ATP synthase subunit B; Provisional
Probab=62.45  E-value=1.2e+02  Score=31.98  Aligned_cols=24  Identities=8%  Similarity=-0.031  Sum_probs=12.5

Q ss_pred             HhhhHHHHHHHhhhhhHHHHHHHH
Q 001172          385 LGRRTVEIFVLAHIFSNKIEVAYQ  408 (1133)
Q Consensus       385 lgrr~~~ifa~~k~F~~rv~~Ay~  408 (1133)
                      +-.=+|=+|+|-+.|...|.....
T Consensus        25 iInFliL~~lL~~~l~~pi~~~l~   48 (173)
T PRK13453         25 VLTFIVLLALLKKFAWGPLKDVMD   48 (173)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344455566666666544444433


No 128
>KOG2505 consensus Ankyrin repeat protein [General function prediction only]
Probab=62.11  E-value=14  Score=45.19  Aligned_cols=14  Identities=21%  Similarity=0.489  Sum_probs=10.0

Q ss_pred             CCeEEEEEEEecCC
Q 001172          109 CNHLSLFLCVANHD  122 (1133)
Q Consensus       109 ~dyLSLYLeL~s~~  122 (1133)
                      +.+|+||-++....
T Consensus       177 ~~~l~vyr~~~~~~  190 (591)
T KOG2505|consen  177 GSVLSVYRCILHVG  190 (591)
T ss_pred             CceEEEeeeccccc
Confidence            57888887776544


No 129
>KOG3654 consensus Uncharacterized CH domain protein [Cytoskeleton]
Probab=61.96  E-value=16  Score=44.55  Aligned_cols=26  Identities=23%  Similarity=0.342  Sum_probs=15.3

Q ss_pred             CchHHHHhhhhhHHHhhHHHHHhhhcc
Q 001172          217 TNVEQICRRFVEERRGKLGRLIEDKAR  243 (1133)
Q Consensus       217 SNLgaTC~~fved~~~hLG~LLqS~~~  243 (1133)
                      ..|+. |.+-|+.+-.-|..|-+...+
T Consensus       190 eg~~e-y~~siekln~~ls~lqq~mqr  215 (708)
T KOG3654|consen  190 EGLGE-YNRSIEKLNAALSSLQQDMQR  215 (708)
T ss_pred             hhHHH-HHHHHHHHHHHHHHHHHHHHH
Confidence            33554 666777666666665555543


No 130
>PRK00409 recombination and DNA strand exchange inhibitor protein; Reviewed
Probab=61.55  E-value=46  Score=43.07  Aligned_cols=16  Identities=31%  Similarity=0.464  Sum_probs=8.3

Q ss_pred             hHHHHHHhhccccccc
Q 001172          287 TLVMDSLYSGLKALEG  302 (1133)
Q Consensus       287 plvmd~L~~glk~le~  302 (1133)
                      =+++|+++.|+...+.
T Consensus       410 LvLlDE~~~GtDp~eg  425 (782)
T PRK00409        410 LVLFDELGAGTDPDEG  425 (782)
T ss_pred             EEEecCCCCCCCHHHH
Confidence            3455555555554444


No 131
>PF08229 SHR3_chaperone:  ER membrane protein SH3 ;  InterPro: IPR013248 This family of proteins are membrane localised chaperones that are required for correct plasma membrane localisation of amino acid permeases (AAPs) []. Shr3 prevents AAPs proteins from aggregating and assists in their correct folding. In the absence of Shr3, AAPs are retained in the ER.
Probab=61.37  E-value=10  Score=41.40  Aligned_cols=13  Identities=15%  Similarity=0.100  Sum_probs=5.1

Q ss_pred             HHHHHHHHHHHHH
Q 001172          415 RQEELIREEEAAW  427 (1133)
Q Consensus       415 rQ~~Ll~E~E~e~  427 (1133)
                      ..+++++|.++++
T Consensus       160 ~~~~~~~~~~~~e  172 (196)
T PF08229_consen  160 KDAKELEEFEKEE  172 (196)
T ss_pred             hhHHHHHHHHHHH
Confidence            3334444443333


No 132
>KOG2689 consensus Predicted ubiquitin regulatory protein [Posttranslational modification, protein turnover, chaperones]
Probab=61.23  E-value=15  Score=41.98  Aligned_cols=10  Identities=10%  Similarity=0.079  Sum_probs=3.7

Q ss_pred             HHHHHHHHHH
Q 001172          410 AVALKRQEEL  419 (1133)
Q Consensus       410 ~~A~~rQ~~L  419 (1133)
                      ++.++.-+++
T Consensus        99 kik~k~~k~~  108 (290)
T KOG2689|consen   99 KIKQKQIKRE  108 (290)
T ss_pred             HhcchhHHHH
Confidence            3333333333


No 133
>KOG2341 consensus TATA box binding protein (TBP)-associated factor, RNA polymerase II [Transcription]
Probab=60.65  E-value=11  Score=46.70  Aligned_cols=33  Identities=15%  Similarity=0.207  Sum_probs=15.5

Q ss_pred             HHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          393 FVLAHIFSNKIEVAYQEAVALKRQEELIREEEA  425 (1133)
Q Consensus       393 fa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~  425 (1133)
                      ++.-+++..++..-|.=.=-..+|.++||+++.
T Consensus       408 iA~~R~es~k~~~ry~~~sDvr~QLrflEqldk  440 (563)
T KOG2341|consen  408 IAEHRVESEKSRERYEIISDVRRQLRFLEQLDK  440 (563)
T ss_pred             HHHHHHHHhhccchheehHHHHHHHHHHHHHHH
Confidence            443444444444444433344555555555544


No 134
>PF11208 DUF2992:  Protein of unknown function (DUF2992);  InterPro: IPR016787 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=60.36  E-value=26  Score=36.13  Aligned_cols=45  Identities=27%  Similarity=0.272  Sum_probs=22.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhh
Q 001172          413 LKRQEELIREEEAAWLAESEQKAKRGAAEKEKKAKKKLAKQKRNN  457 (1133)
Q Consensus       413 ~~rQ~~Ll~E~E~e~~~~~e~~~kk~~k~kekk~~~k~ak~~e~~  457 (1133)
                      -+-|+.|-.+.|+.+...+.+...+++.+|+++.+.|++|.+++-
T Consensus        85 TkAQ~ALk~q~E~~K~~rk~~~k~~re~~k~~kf~lkq~K~k~KH  129 (132)
T PF11208_consen   85 TKAQQALKLQREQRKKERKKRSKEQREAEKERKFELKQQKRKEKH  129 (132)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence            445566655555544433333222333455566666666655543


No 135
>PF10147 CR6_interact:  Growth arrest and DNA-damage-inducible proteins-interacting protein 1;  InterPro: IPR018472 Members of this family of proteins act as negative regulators of G1 to S cell cycle phase progression by inhibiting cyclin-dependent kinases. Inhibitory effects are additive with GADD45 proteins but occur also in the absence of GADD45 proteins. Furthermore, they act as a repressor of the orphan nuclear receptor NR4A1 by inhibiting AB domain-mediated transcriptional activity []. They may be involved in the hormone-mediated regulation of NR4A1 transcriptional activity.; GO: 0007049 cell cycle, 0005634 nucleus
Probab=60.23  E-value=1.7e+02  Score=32.84  Aligned_cols=14  Identities=64%  Similarity=0.665  Sum_probs=5.8

Q ss_pred             HHHHHHHHHHHHHH
Q 001172          440 AEKEKKAKKKLAKQ  453 (1133)
Q Consensus       440 k~kekk~~~k~ak~  453 (1133)
                      ++||.|+++|+||.
T Consensus       187 kEkeeKKk~K~aKk  200 (217)
T PF10147_consen  187 KEKEEKKKKKEAKK  200 (217)
T ss_pred             HHHHHHHHHHHHHH
Confidence            34444444444443


No 136
>KOG1363 consensus Predicted regulator of the ubiquitin pathway (contains UAS and UBX domains) [Signal transduction mechanisms]
Probab=59.80  E-value=25  Score=42.91  Aligned_cols=11  Identities=9%  Similarity=-0.000  Sum_probs=5.9

Q ss_pred             hHHHHHhhhcc
Q 001172          233 KLGRLIEDKAR  243 (1133)
Q Consensus       233 hLG~LLqS~~~  243 (1133)
                      +|...+.+-++
T Consensus       146 ~f~~~f~~ry~  156 (460)
T KOG1363|consen  146 TFVDNFGDRYG  156 (460)
T ss_pred             HHHHHHHHhcC
Confidence            35555555554


No 137
>PF08229 SHR3_chaperone:  ER membrane protein SH3 ;  InterPro: IPR013248 This family of proteins are membrane localised chaperones that are required for correct plasma membrane localisation of amino acid permeases (AAPs) []. Shr3 prevents AAPs proteins from aggregating and assists in their correct folding. In the absence of Shr3, AAPs are retained in the ER.
Probab=59.72  E-value=7.6  Score=42.31  Aligned_cols=17  Identities=41%  Similarity=0.507  Sum_probs=6.6

Q ss_pred             HHHHHHHHHHHHHHHHH
Q 001172          407 YQEAVALKRQEELIREE  423 (1133)
Q Consensus       407 y~E~~A~~rQ~~Ll~E~  423 (1133)
                      |.|..-.+.+++..+|+
T Consensus       156 YAe~~~~~~~~~~~~~e  172 (196)
T PF08229_consen  156 YAERKDAKELEEFEKEE  172 (196)
T ss_pred             HHhhhhHHHHHHHHHHH
Confidence            33333333333333333


No 138
>PF10500 SR-25:  Nuclear RNA-splicing-associated protein;  InterPro: IPR019532  SR-25, otherwise known as ADP-ribosylation factor-like factor 6-interacting protein 4, is expressed in virtually all tissue types. At the N terminus there is a repeat of serine-arginine (SR repeat), and towards the middle of the protein there are clusters of both serines and of basic amino acids. The presence of many nuclear localisation signals strongly implies that this is a nuclear protein that may contribute to RNA splicing []. SR-25 is also implicated, along with heat-shock-protein-27, as a mediator in the Rac1 (GTPase ras-related C3 botulinum toxin substrate 1; also see IPR019093 from INTERPRO) signalling pathway [].
Probab=59.61  E-value=11  Score=41.65  Aligned_cols=9  Identities=22%  Similarity=0.250  Sum_probs=6.8

Q ss_pred             CCCCCCccc
Q 001172          498 PEKPDVLED  506 (1133)
Q Consensus       498 ~~k~d~~e~  506 (1133)
                      +++|-|-||
T Consensus       156 am~PmTkEE  164 (225)
T PF10500_consen  156 AMAPMTKEE  164 (225)
T ss_pred             hcCCCCHHH
Confidence            467888887


No 139
>KOG0949 consensus Predicted helicase, DEAD-box superfamily [General function prediction only]
Probab=57.77  E-value=22  Score=46.76  Aligned_cols=17  Identities=18%  Similarity=0.313  Sum_probs=12.0

Q ss_pred             cccccCCchhhhhhhhc
Q 001172          971 VLVDEFPHLDIINDLLD  987 (1133)
Q Consensus       971 ~v~dEFPHLDIINDLL~  987 (1133)
                      +||+-|-.|+..|++..
T Consensus      1279 ~vg~lf~~ls~v~sv~~ 1295 (1330)
T KOG0949|consen 1279 GVGELFKSLSVVNSVVL 1295 (1330)
T ss_pred             ccchhhhhhhcchHHHH
Confidence            67777777777777543


No 140
>KOG2341 consensus TATA box binding protein (TBP)-associated factor, RNA polymerase II [Transcription]
Probab=57.14  E-value=32  Score=42.78  Aligned_cols=27  Identities=30%  Similarity=0.173  Sum_probs=14.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhhHHHHH
Q 001172          418 ELIREEEAAWLAESEQKAKRGAAEKEK  444 (1133)
Q Consensus       418 ~Ll~E~E~e~~~~~e~~~kk~~k~kek  444 (1133)
                      .=|+++|.-.++|+||+..|+++++.+
T Consensus       430 ~QLrflEqldk~E~Erk~~~ere~l~r  456 (563)
T KOG2341|consen  430 RQLRFLEQLDKAEEERKESREREELLR  456 (563)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence            345666666666666655555444433


No 141
>KOG0783 consensus Uncharacterized conserved protein, contains ankyrin and BTB/POZ domains [Function unknown]
Probab=56.81  E-value=6.3  Score=50.33  Aligned_cols=49  Identities=16%  Similarity=0.211  Sum_probs=31.9

Q ss_pred             HHHHHHHhhcCCCCCCCCCCCccccccCCCCCCcccchh----HH-HHHHhhHHhhhHHHHHH
Q 001172          337 VLLLLERAALEPLPPKDEKGPQNRTKESNSGEDFNKDSI----ER-DERRLTELGRRTVEIFV  394 (1133)
Q Consensus       337 ll~~leR~al~~Lp~~d~~~~q~~~~~~~~~~dvs~dtv----e~-ee~~~~Elgrr~~~ifa  394 (1133)
                      +|.++|.+.+..|+.   .|.+.+++.      .+..+.    |- -+|+|-||..+||+|..
T Consensus       791 il~iaDqlli~~Lk~---Ice~~ll~k------l~lk~~~~llefaamY~ak~L~~~C~dfic  844 (1267)
T KOG0783|consen  791 ILSIADQLLILELKS---ICEQSLLRK------LNLKTLPTLLEFAAMYHAKELYSRCIDFIC  844 (1267)
T ss_pred             HHHHHHHHHHHHHHH---HHHHHHHhH------hcccchHHHHHHHHHhhHHHHHHHHHHHHH
Confidence            445556666666654   466665554      333332    22 69999999999999876


No 142
>KOG3634 consensus Troponin [Cytoskeleton]
Probab=56.39  E-value=22  Score=41.55  Aligned_cols=31  Identities=35%  Similarity=0.279  Sum_probs=19.0

Q ss_pred             HHHHHHHHHHHHHHHHHhhhHHHHHHHHHHH
Q 001172          420 IREEEAAWLAESEQKAKRGAAEKEKKAKKKL  450 (1133)
Q Consensus       420 l~E~E~e~~~~~e~~~kk~~k~kekk~~~k~  450 (1133)
                      |.|.++.++.+.+|.++..++-|+|..+.+.
T Consensus        85 l~ey~~~~~~~rer~E~eL~eLkekq~~r~~  115 (361)
T KOG3634|consen   85 LGEYEDFDRIEREREEKELKELKEKQEKRKL  115 (361)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            5566666666666666666655555555554


No 143
>KOG4661 consensus Hsp27-ERE-TATA-binding protein/Scaffold attachment factor (SAF-B) [Transcription]
Probab=55.99  E-value=56  Score=40.65  Aligned_cols=25  Identities=16%  Similarity=0.260  Sum_probs=10.1

Q ss_pred             hhhhHHHHHHhhhhhhhhhcccCCch
Q 001172          459 KGKEKKREERSSMALSDRLEDENPSD  484 (1133)
Q Consensus       459 k~~e~~~ee~~~~~~~~~~~~~~~~~  484 (1133)
                      -|.|+.+-|+ .+-+++++.+..++.
T Consensus       670 mErERLEreR-M~ve~eRr~eqeRih  694 (940)
T KOG4661|consen  670 MERERLERER-MKVEEERRDEQERIH  694 (940)
T ss_pred             HHHHHHHHHH-HHHHHhhcchhhhhh
Confidence            3333333333 333344444444444


No 144
>PRK00409 recombination and DNA strand exchange inhibitor protein; Reviewed
Probab=55.48  E-value=59  Score=42.14  Aligned_cols=15  Identities=0%  Similarity=0.164  Sum_probs=7.7

Q ss_pred             ceeecCeEEEEEEEe
Q 001172           89 AFEVGGYKWYILIYP  103 (1133)
Q Consensus        89 ~FsVGGhkWRI~~YP  103 (1133)
                      .++..+-.+.|-+-.
T Consensus       188 ~it~r~~r~~i~vk~  202 (782)
T PRK00409        188 IITIRNDRYVLPVKA  202 (782)
T ss_pred             eEEEECCEEEEEech
Confidence            344555556555543


No 145
>PRK08475 F0F1 ATP synthase subunit B; Validated
Probab=55.40  E-value=1.2e+02  Score=31.99  Aligned_cols=10  Identities=40%  Similarity=0.415  Sum_probs=3.9

Q ss_pred             HHHHhhcCCC
Q 001172          340 LLERAALEPL  349 (1133)
Q Consensus       340 ~leR~al~~L  349 (1133)
                      +|-+++-.||
T Consensus        38 lL~~fl~kPi   47 (167)
T PRK08475         38 ILWYFAAKPL   47 (167)
T ss_pred             HHHHHHHHHH
Confidence            3344433333


No 146
>KOG0345 consensus ATP-dependent RNA helicase [RNA processing and modification]
Probab=55.19  E-value=18  Score=44.17  Aligned_cols=41  Identities=32%  Similarity=0.434  Sum_probs=22.0

Q ss_pred             HhhHHhhhHHHHHH---HhhhhhHHHHHHHHHHHHHH-HHHHHHHHHH
Q 001172          381 RLTELGRRTVEIFV---LAHIFSNKIEVAYQEAVALK-RQEELIREEE  424 (1133)
Q Consensus       381 ~~~Elgrr~~~ifa---~~k~F~~rv~~Ay~E~~A~~-rQ~~Ll~E~E  424 (1133)
                      .++|++..-+..|.   -+-.|.   +..|+++.-++ ||.+|..+.|
T Consensus       451 ~M~Elk~~~~~~~~~~~~~id~~---~ikykdkkrEk~Rq~~l~~~~~  495 (567)
T KOG0345|consen  451 KMPELKQYKIGEFFFPKPAIDFS---EIKYKDKKREKQRQQKLKVRKE  495 (567)
T ss_pred             CcHHHhhhhccceeccCCCcchh---hhcccchHHHHHHHHHHHHHHH
Confidence            34677776666542   234455   56677766554 4444443333


No 147
>PF04696 Pinin_SDK_memA:  pinin/SDK/memA/ protein conserved region;  InterPro: IPR006786 This conserved region is located adjacent and C-terminal to a N-terminal pinin/SKD domain IPR006787 from INTERPRO. Members of this family have very varied localisations within the eukaryotic cell. Pinin is known to localise at the desmosomes and is implicated in anchoring intermediate filaments to the desmosomal plaque []. SDK2/3 is a dynamically localised nuclear protein thought to be involved in modulation of alternative pre-mRNA splicing []. MemA is a tumour marker preferentially expressed in human melanoma cell lines. A common feature of the members of this family is that they may all participate in regulating protein-protein interactions [].
Probab=54.66  E-value=1.7e+02  Score=30.09  Aligned_cols=10  Identities=20%  Similarity=0.188  Sum_probs=7.4

Q ss_pred             HhhhHHHHHH
Q 001172          385 LGRRTVEIFV  394 (1133)
Q Consensus       385 lgrr~~~ifa  394 (1133)
                      =|||||..+.
T Consensus         8 RnRRmFG~Ll   17 (131)
T PF04696_consen    8 RNRRMFGGLL   17 (131)
T ss_pred             hhhhHHHHHH
Confidence            3889998654


No 148
>COG4499 Predicted membrane protein [Function unknown]
Probab=54.28  E-value=12  Score=44.44  Aligned_cols=12  Identities=25%  Similarity=0.581  Sum_probs=6.0

Q ss_pred             HHHHHHHHHHHH
Q 001172          414 KRQEELIREEEA  425 (1133)
Q Consensus       414 ~rQ~~Ll~E~E~  425 (1133)
                      +...+||.|.++
T Consensus       365 ~~r~e~lk~~n~  376 (434)
T COG4499         365 DKRQELLKEYNK  376 (434)
T ss_pred             hHHHHHHHHHHH
Confidence            334455555544


No 149
>KOG3859 consensus Septins (P-loop GTPases) [Cell cycle control, cell division, chromosome partitioning]
Probab=53.99  E-value=72  Score=37.15  Aligned_cols=15  Identities=40%  Similarity=0.403  Sum_probs=10.2

Q ss_pred             HHHHHHHHHHHHHHH
Q 001172          405 VAYQEAVALKRQEEL  419 (1133)
Q Consensus       405 ~Ay~E~~A~~rQ~~L  419 (1133)
                      +.++|.++.+|++-|
T Consensus       318 ~s~qet~eaKr~e~~  332 (406)
T KOG3859|consen  318 FSLQETYEAKRNEFL  332 (406)
T ss_pred             ccHHHHHHHHHHHHH
Confidence            347777777777654


No 150
>PRK13428 F0F1 ATP synthase subunit delta; Provisional
Probab=53.78  E-value=88  Score=38.00  Aligned_cols=19  Identities=11%  Similarity=0.190  Sum_probs=8.8

Q ss_pred             HHHHHHHhhhhhHHHHHHH
Q 001172          389 TVEIFVLAHIFSNKIEVAY  407 (1133)
Q Consensus       389 ~~~ifa~~k~F~~rv~~Ay  407 (1133)
                      +|=+|+|-+.|...|....
T Consensus        12 lIl~~lL~kfl~~Pi~~~l   30 (445)
T PRK13428         12 AVIVFLVWRFVVPPVRRLM   30 (445)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            3444555555544444443


No 151
>PF12037 DUF3523:  Domain of unknown function (DUF3523);  InterPro: IPR021911  This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is typically between 257 to 277 amino acids in length. This domain is found associated with PF00004 from PFAM. This domain has a conserved LER sequence motif. 
Probab=53.59  E-value=1.6e+02  Score=34.07  Aligned_cols=12  Identities=33%  Similarity=0.564  Sum_probs=5.7

Q ss_pred             HHHHHHHHHHHH
Q 001172          405 VAYQEAVALKRQ  416 (1133)
Q Consensus       405 ~Ay~E~~A~~rQ  416 (1133)
                      +=|++..+.+|.
T Consensus       109 aqY~D~LaRkR~  120 (276)
T PF12037_consen  109 AQYEDELARKRY  120 (276)
T ss_pred             HHHHHHHHHHHH
Confidence            335555554443


No 152
>TIGR01069 mutS2 MutS2 family protein. Function of MutS2 is unknown. It should not be considered a DNA mismatch repair protein. It is likely a DNA mismatch binding protein of unknown cellular function.
Probab=53.57  E-value=1e+02  Score=40.03  Aligned_cols=15  Identities=0%  Similarity=0.113  Sum_probs=8.7

Q ss_pred             ceeecCeEEEEEEEe
Q 001172           89 AFEVGGYKWYILIYP  103 (1133)
Q Consensus        89 ~FsVGGhkWRI~~YP  103 (1133)
                      .+++.+-.|.|-+-.
T Consensus       183 ~it~r~~r~vipvk~  197 (771)
T TIGR01069       183 IVTIRNGRYVLPLKS  197 (771)
T ss_pred             eEEEECCEEEEEeeH
Confidence            355556667665543


No 153
>PRK07352 F0F1 ATP synthase subunit B; Validated
Probab=53.52  E-value=2.2e+02  Score=30.06  Aligned_cols=23  Identities=13%  Similarity=-0.116  Sum_probs=12.8

Q ss_pred             HhhhHHHHHHHhhhhhHHHHHHH
Q 001172          385 LGRRTVEIFVLAHIFSNKIEVAY  407 (1133)
Q Consensus       385 lgrr~~~ifa~~k~F~~rv~~Ay  407 (1133)
                      .-.=+|=||+|-+.+..-|..+.
T Consensus        26 iinflIl~~lL~~fl~kpI~~~l   48 (174)
T PRK07352         26 LINLAIVIGLLYYFGRGFLGKIL   48 (174)
T ss_pred             HHHHHHHHHHHHHHhHHHHHHHH
Confidence            45556666666666644444443


No 154
>PHA00451 protein kinase
Probab=53.31  E-value=32  Score=39.54  Aligned_cols=14  Identities=14%  Similarity=0.259  Sum_probs=6.2

Q ss_pred             HhhhhhHHHhhHHH
Q 001172          223 CRRFVEERRGKLGR  236 (1133)
Q Consensus       223 C~~fved~~~hLG~  236 (1133)
                      |.++-.+.+.|+..
T Consensus       141 C~r~~nd~h~~ya~  154 (362)
T PHA00451        141 CYRESNDVHHHYAA  154 (362)
T ss_pred             cccccchHHHHHHH
Confidence            33344444445544


No 155
>PRK14473 F0F1 ATP synthase subunit B; Provisional
Probab=53.07  E-value=2.3e+02  Score=29.46  Aligned_cols=22  Identities=18%  Similarity=0.229  Sum_probs=12.0

Q ss_pred             HHHHHHHhhhhhHHHHHHHHHH
Q 001172          389 TVEIFVLAHIFSNKIEVAYQEA  410 (1133)
Q Consensus       389 ~~~ifa~~k~F~~rv~~Ay~E~  410 (1133)
                      ++=+|+|-+.|...|..+..+.
T Consensus        19 lil~~lL~~fl~kpi~~~l~~R   40 (164)
T PRK14473         19 LLLIFLLRTFLYRPVLNLLNER   40 (164)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            3345556666665555555443


No 156
>PRK14471 F0F1 ATP synthase subunit B; Provisional
Probab=53.07  E-value=1.4e+02  Score=31.14  Aligned_cols=20  Identities=25%  Similarity=0.170  Sum_probs=11.7

Q ss_pred             HHHHHhhhhhHHHHHHHHHH
Q 001172          391 EIFVLAHIFSNKIEVAYQEA  410 (1133)
Q Consensus       391 ~ifa~~k~F~~rv~~Ay~E~  410 (1133)
                      =+|+|.+.|-..|.....+.
T Consensus        21 l~~ll~~~l~~pi~~~l~~R   40 (164)
T PRK14471         21 LLLLLAKFAWKPILGAVKER   40 (164)
T ss_pred             HHHHHHHHhHHHHHHHHHHH
Confidence            34455666666666665544


No 157
>KOG3654 consensus Uncharacterized CH domain protein [Cytoskeleton]
Probab=52.38  E-value=26  Score=42.86  Aligned_cols=15  Identities=27%  Similarity=0.361  Sum_probs=7.7

Q ss_pred             HHHhhhhhhhHHHHH
Q 001172          453 QKRNNRKGKEKKREE  467 (1133)
Q Consensus       453 ~~e~~rk~~e~~~ee  467 (1133)
                      |||+.+||+|+++.|
T Consensus       430 eeer~~keee~arre  444 (708)
T KOG3654|consen  430 EEERAPKEEEVARRE  444 (708)
T ss_pred             HhhhcchhhhhhHHH
Confidence            344555555555543


No 158
>PRK06231 F0F1 ATP synthase subunit B; Validated
Probab=52.29  E-value=1.3e+02  Score=32.85  Aligned_cols=14  Identities=29%  Similarity=0.159  Sum_probs=10.0

Q ss_pred             HHHHHHHhhcCCCC
Q 001172          337 VLLLLERAALEPLP  350 (1133)
Q Consensus       337 ll~~leR~al~~Lp  350 (1133)
                      |+.+|-++.-.|+.
T Consensus        61 lv~lL~k~l~kPi~   74 (205)
T PRK06231         61 LLLLGIFLFWKPTQ   74 (205)
T ss_pred             HHHHHHHHHHHHHH
Confidence            34578888888874


No 159
>KOG1150 consensus Predicted molecular chaperone (DnaJ superfamily) [Posttranslational modification, protein turnover, chaperones]
Probab=51.65  E-value=36  Score=37.59  Aligned_cols=42  Identities=26%  Similarity=0.341  Sum_probs=23.7

Q ss_pred             HHHHHHHHHHHHH--HHHhhhhhhhHHHHHHhhhhhh-hhhcccC
Q 001172          440 AEKEKKAKKKLAK--QKRNNRKGKEKKREERSSMALS-DRLEDEN  481 (1133)
Q Consensus       440 k~kekk~~~k~ak--~~e~~rk~~e~~~ee~~~~~~~-~~~~~~~  481 (1133)
                      .+.|+|+++.+++  +|+++.+++|-+++|+..++.| ++.-||.
T Consensus       168 ae~erkRk~~e~r~~~eRkr~re~eIeaeek~Kr~~E~qKnfEEs  212 (250)
T KOG1150|consen  168 AELERKRKELEARANEERKRQREEEIEAEEKRKREREWQKNFEES  212 (250)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            6667777666644  4444446666677765555444 3333443


No 160
>KOG2689 consensus Predicted ubiquitin regulatory protein [Posttranslational modification, protein turnover, chaperones]
Probab=51.55  E-value=79  Score=36.43  Aligned_cols=27  Identities=11%  Similarity=0.004  Sum_probs=16.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          406 AYQEAVALKRQEELIREEEAAWLAESE  432 (1133)
Q Consensus       406 Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e  432 (1133)
                      -...+.....+.+|+|-++.++++.++
T Consensus        99 kik~k~~k~~e~e~~E~~~~~k~~~ks  125 (290)
T KOG2689|consen   99 KIKQKQIKREELELREALEREKQRRKS  125 (290)
T ss_pred             HhcchhHHHHHHHHHHhhhhHHhhhhc
Confidence            344555566777777777766666554


No 161
>KOG1363 consensus Predicted regulator of the ubiquitin pathway (contains UAS and UBX domains) [Signal transduction mechanisms]
Probab=51.39  E-value=48  Score=40.53  Aligned_cols=9  Identities=33%  Similarity=0.438  Sum_probs=3.8

Q ss_pred             chhhhccCH
Q 001172          248 CAFWLGIDQ  256 (1133)
Q Consensus       248 Raf~~a~~~  256 (1133)
                      -+|+.|.+.
T Consensus       169 ~a~~~A~~~  177 (460)
T KOG1363|consen  169 NAFLEAFDR  177 (460)
T ss_pred             HHHHHHHhh
Confidence            344444443


No 162
>PF01086 Clathrin_lg_ch:  Clathrin light chain;  InterPro: IPR000996 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ].  Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins [].  This entry represents clathrin light chains, which are more divergent in sequence than the heavy chains []. In higher eukaryotes, two genes encode distinct but related light chains, each of which can yield two separate forms via alternative splicing. In yeast there is a single light chain whose sequence is only distantly related to that of higher eukaryotes. Clathrin light chains have a conserved acidic N-terminal domain, a central coiled-coil domain and a conserved C-terminal domain.  More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030130 clathrin coat of trans-Golgi network vesicle, 0030132 clathrin coat of coated pit; PDB: 3LVG_E 3LVH_D.
Probab=50.83  E-value=20  Score=39.65  Aligned_cols=14  Identities=21%  Similarity=0.259  Sum_probs=6.3

Q ss_pred             HHHHHHHHHHHHHH
Q 001172          415 RQEELIREEEAAWL  428 (1133)
Q Consensus       415 rQ~~Ll~E~E~e~~  428 (1133)
                      +|.+.|+|.+++++
T Consensus       120 ~~~~~i~ekD~~e~  133 (225)
T PF01086_consen  120 ERDKRIEEKDAEEE  133 (225)
T ss_dssp             HHTTTSTHHHHHHH
T ss_pred             HHHHHHHHHHHHHH
Confidence            34445544444333


No 163
>TIGR01144 ATP_synt_b ATP synthase, F0 subunit b. This model describes the F1/F0 ATP synthase b subunit in bacteria only. Scoring just below the trusted cutoff are the N-terminal domains of Mycobacterial b/delta fusion proteins and a subunit from an archaeon, Methanosarcina barkeri, in which the ATP synthase homolog differs in architecture and is not experimentally confirmed. This model helps resolve b from the related b' subunit. Within the family is an example from a sodium-translocating rather than proton-translocating ATP synthase.
Probab=49.96  E-value=2.9e+02  Score=27.99  Aligned_cols=6  Identities=17%  Similarity=0.219  Sum_probs=2.3

Q ss_pred             hhhhhH
Q 001172          396 AHIFSN  401 (1133)
Q Consensus       396 ~k~F~~  401 (1133)
                      .+++.+
T Consensus        21 ~~~l~~   26 (147)
T TIGR01144        21 AKAIET   26 (147)
T ss_pred             HHHHHH
Confidence            344433


No 164
>KOG4001 consensus Axonemal dynein light chain [Cytoskeleton]
Probab=49.91  E-value=1.1e+02  Score=34.00  Aligned_cols=69  Identities=19%  Similarity=0.198  Sum_probs=37.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhhhh-hhhHHHHHHhhhh
Q 001172          404 EVAYQEAVALKRQEELIREEEAAWLAESEQKAKRGAAEKEKKAKKKLAKQKRNNRK-GKEKKREERSSMA  472 (1133)
Q Consensus       404 ~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~k~kekk~~~k~ak~~e~~rk-~~e~~~ee~~~~~  472 (1133)
                      .+.|.-.||-=+++.|..|.|.+....+=+..+-.+++-|.+-.+..+|.+--.++ +++.+-||++-+.
T Consensus       166 qtlyeSsvAfGmRKALqae~ek~~~~~~~k~le~~k~~Le~~ia~~k~K~e~~e~r~~E~r~ieEkk~~e  235 (259)
T KOG4001|consen  166 QTLYESSVAFGMRKALQAENEKTRATTEWKVLEDKKKELELKIAQLKKKLETDEIRSEEEREIEEKKMKE  235 (259)
T ss_pred             HHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHH
Confidence            56677888888999998888766544443333333444454444444444333332 3333334444333


No 165
>PRK07353 F0F1 ATP synthase subunit B'; Validated
Probab=49.86  E-value=1.9e+02  Score=29.13  Aligned_cols=18  Identities=22%  Similarity=0.471  Sum_probs=9.0

Q ss_pred             HHHHhhhhhHHHHHHHHH
Q 001172          392 IFVLAHIFSNKIEVAYQE  409 (1133)
Q Consensus       392 ifa~~k~F~~rv~~Ay~E  409 (1133)
                      +|+|.++|-..|.....+
T Consensus        19 ~~ll~~~l~~pi~~~l~~   36 (140)
T PRK07353         19 TFILNALFYKPVGKVVEE   36 (140)
T ss_pred             HHHHHHHHHHHHHHHHHH
Confidence            444555555555444443


No 166
>KOG1962 consensus B-cell receptor-associated protein and related proteins [Defense mechanisms]
Probab=49.78  E-value=1.3e+02  Score=33.56  Aligned_cols=22  Identities=14%  Similarity=0.372  Sum_probs=12.4

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHH
Q 001172          396 AHIFSNKIEVAYQEAVALKRQEE  418 (1133)
Q Consensus       396 ~k~F~~rv~~Ay~E~~A~~rQ~~  418 (1133)
                      +-++ .|+....++-+.+..+++
T Consensus       110 ~lvI-~R~~~ll~~l~~l~~~~~  131 (216)
T KOG1962|consen  110 SLVI-RRLHTLLRELATLRANEK  131 (216)
T ss_pred             HHHH-HHHHHHHHHHHHHHhhHH
Confidence            4444 566666666665555533


No 167
>PRK08476 F0F1 ATP synthase subunit B'; Validated
Probab=49.73  E-value=1.8e+02  Score=29.88  Aligned_cols=15  Identities=13%  Similarity=0.118  Sum_probs=6.3

Q ss_pred             HHHHhhhhhHHHHHH
Q 001172          392 IFVLAHIFSNKIEVA  406 (1133)
Q Consensus       392 ifa~~k~F~~rv~~A  406 (1133)
                      +|+|.+.|-..|...
T Consensus        21 ~~~l~kfl~kPi~~~   35 (141)
T PRK08476         21 IVILNSWLYKPLLKF   35 (141)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            333444444444433


No 168
>KOG2007 consensus Cysteinyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]
Probab=49.62  E-value=71  Score=39.55  Aligned_cols=46  Identities=13%  Similarity=0.156  Sum_probs=26.0

Q ss_pred             HHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 001172          391 EIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKR  437 (1133)
Q Consensus       391 ~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk  437 (1133)
                      +|..+.--+...++..|.+.+-...+.-|++-+-.|+ .-+++++|.
T Consensus       491 ~iL~l~d~lR~~~l~~~g~~led~~g~~lvk~v~~e~-~l~~~eek~  536 (586)
T KOG2007|consen  491 EILELCDTLRDSDLLELGVRLEDRKGSTLVKAVDREE-LLQEREEKT  536 (586)
T ss_pred             HHHHHhHHHHHhhHHHhhhHHHhCCchHHHHHhhhHH-HHHHHHHHh
Confidence            6666666665556666666665555555665555554 333444443


No 169
>KOG4715 consensus SWI/SNF-related matrix-associated actin-dependent regulator of chromatin  [Chromatin structure and dynamics]
Probab=49.52  E-value=86  Score=36.71  Aligned_cols=41  Identities=15%  Similarity=0.186  Sum_probs=24.6

Q ss_pred             CCCccccccCCCCCCCCccccCCCCCCCCCCCCCcCCCCcccCCC
Q 001172          501 PDVLEDVSDVSDSVDGGAEVLQPDSEDRDTSPVNWDTDASEVIPP  545 (1133)
Q Consensus       501 ~d~~e~~sd~sd~~d~~~e~~~~d~edr~~s~v~w~~d~~e~~~~  545 (1133)
                      +.-+|+.+.+.-+    .+--..+.+|.++--.+|.+-.-|+..+
T Consensus       343 ~~~~ee~tespa~----g~eptstp~a~es~t~~~t~~~~~~p~~  383 (410)
T KOG4715|consen  343 ETHLEESTESPAN----GEEPTSTPEAKESGTTGTTTENKEKPEE  383 (410)
T ss_pred             cccccccccCCCC----CCCCCCCccccccccccCccccccCcch
Confidence            3445555444332    2333456788888888998876665443


No 170
>PF08243 SPT2:  SPT2 chromatin protein;  InterPro: IPR013256 This entry includes the Saccharomyces cerevisiae (Baker's yeast) protein SPT2 which is a chromatin protein involved in transcriptional regulation []. These proteins shows conservation of several domains across numerous species, including having a cluster of positively charged amino acids. This cluster probably functions in the binding properties of the proteins []. Sin1p/Spt2p probably modulates the local chromatin structure by binding two strands of double-stranded DNA at their crossover point. Sin1p/Spt2p has sequence similarity to HMG1 and serves as a negative transcriptional regulator of a small family of genes that are activated by the SWI/SNF chromatin-remodelling complex. It is also involved in maintaining the integrity of chromatin during transcription elongation. Sin1p/Spt2 is required for, and is directly involved in, the efficient recruitment of the mRNA cleavage/polyadenylation complex []. Spt2 is also involved in regulating levels of histone H3 over transcribed regions [].
Probab=49.26  E-value=24  Score=35.54  Aligned_cols=38  Identities=13%  Similarity=0.214  Sum_probs=25.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q 001172          402 KIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKRGA  439 (1133)
Q Consensus       402 rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~  439 (1133)
                      -+++.|.+....+.+-..|..+|.++.++.|++.++.|
T Consensus        73 dMEA~~~~I~~EE~rS~r~A~~ED~~e~~~e~~~~~~K  110 (116)
T PF08243_consen   73 DMEASFADIEKEEKRSARIARLEDEREERREEEEEREK  110 (116)
T ss_pred             hhhcCHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHH
Confidence            35667888777777777777777766555555554444


No 171
>KOG0343 consensus RNA Helicase [RNA processing and modification]
Probab=49.15  E-value=44  Score=41.81  Aligned_cols=6  Identities=17%  Similarity=0.074  Sum_probs=3.1

Q ss_pred             CeEEEE
Q 001172          110 NHLSLF  115 (1133)
Q Consensus       110 dyLSLY  115 (1133)
                      .|||++
T Consensus       272 ~~vsvh  277 (758)
T KOG0343|consen  272 VYVSVH  277 (758)
T ss_pred             cEEEEe
Confidence            455555


No 172
>TIGR01069 mutS2 MutS2 family protein. Function of MutS2 is unknown. It should not be considered a DNA mismatch repair protein. It is likely a DNA mismatch binding protein of unknown cellular function.
Probab=48.92  E-value=2e+02  Score=37.54  Aligned_cols=16  Identities=31%  Similarity=0.464  Sum_probs=7.4

Q ss_pred             hHHHHHHhhccccccc
Q 001172          287 TLVMDSLYSGLKALEG  302 (1133)
Q Consensus       287 plvmd~L~~glk~le~  302 (1133)
                      =+++|+++.|+...|.
T Consensus       405 LvLlDE~g~GtD~~eg  420 (771)
T TIGR01069       405 LVLFDELGAGTDPDEG  420 (771)
T ss_pred             EEEecCCCCCCCHHHH
Confidence            3444555555544433


No 173
>PRK05759 F0F1 ATP synthase subunit B; Validated
Probab=48.05  E-value=3.2e+02  Score=27.92  Aligned_cols=19  Identities=16%  Similarity=0.233  Sum_probs=9.2

Q ss_pred             HHHHhhhhhHHHHHHHHHH
Q 001172          392 IFVLAHIFSNKIEVAYQEA  410 (1133)
Q Consensus       392 ifa~~k~F~~rv~~Ay~E~  410 (1133)
                      +|+|-+++...|.....+.
T Consensus        18 ~~il~~~~~~pi~~~l~~R   36 (156)
T PRK05759         18 VWFIMKFVWPPIMKALEER   36 (156)
T ss_pred             HHHHHHHhHHHHHHHHHHH
Confidence            3444555555555554433


No 174
>PF12569 NARP1:  NMDA receptor-regulated protein 1 ;  InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=47.99  E-value=14  Score=45.37  Aligned_cols=16  Identities=19%  Similarity=0.148  Sum_probs=9.5

Q ss_pred             hhhHHHhhHHHHHhhh
Q 001172          226 FVEERRGKLGRLIEDK  241 (1133)
Q Consensus       226 fved~~~hLG~LLqS~  241 (1133)
                      |...+..||-.+|..+
T Consensus       126 F~~~~~~yl~~~l~Kg  141 (517)
T PF12569_consen  126 FKERLDEYLRPQLRKG  141 (517)
T ss_pred             HHHHHHHHHHHHHhcC
Confidence            4445566777777553


No 175
>KOG3634 consensus Troponin [Cytoskeleton]
Probab=47.96  E-value=77  Score=37.30  Aligned_cols=7  Identities=29%  Similarity=0.283  Sum_probs=3.3

Q ss_pred             CCCCCCC
Q 001172          608 SRGKNQR  614 (1133)
Q Consensus       608 ~rg~~~~  614 (1133)
                      ++||-|.
T Consensus       203 ~mtKeQq  209 (361)
T KOG3634|consen  203 GMTKEQQ  209 (361)
T ss_pred             cccHHHH
Confidence            4455443


No 176
>KOG0066 consensus eIF2-interacting protein ABC50 (ABC superfamily) [Translation, ribosomal structure and biogenesis]
Probab=47.67  E-value=35  Score=41.56  Aligned_cols=46  Identities=22%  Similarity=0.210  Sum_probs=23.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHH
Q 001172          406 AYQEAVALKRQEELIREEEAAWLAESEQKAKRGAAEKEKKAKKKLA  451 (1133)
Q Consensus       406 Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~k~kekk~~~k~a  451 (1133)
                      .|+-.|++++|+.+-+=+..|+++++-+..-+.+|.-||+-+++.+
T Consensus       496 ~FKKmY~Qk~~e~~K~yekQeK~LkelKa~GkS~KqAEkq~Ke~lt  541 (807)
T KOG0066|consen  496 LFKKMYAQKMQEHEKNYEKQEKQLKELKAEGKSAKQAEKQVKEQLT  541 (807)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcchHHHHHHHHHHHH
Confidence            3556677777655444333344444444333444555555555543


No 177
>KOG3088 consensus Secretory carrier membrane protein [Intracellular trafficking, secretion, and vesicular transport]
Probab=47.44  E-value=22  Score=40.97  Aligned_cols=23  Identities=39%  Similarity=0.437  Sum_probs=14.1

Q ss_pred             HHHHHHhhhhhhhHHHHHHhhhh
Q 001172          450 LAKQKRNNRKGKEKKREERSSMA  472 (1133)
Q Consensus       450 ~ak~~e~~rk~~e~~~ee~~~~~  472 (1133)
                      ++||||.+|||+|-.|.|++...
T Consensus        70 ~~rqeEL~Rke~ELdRREr~~a~   92 (313)
T KOG3088|consen   70 LKKQEELRRKEQELDRRERALAR   92 (313)
T ss_pred             HHHHHHHHHHHHHHhHHHHHHhh
Confidence            45666777777766666655443


No 178
>PF02731 SKIP_SNW:  SKIP/SNW domain;  InterPro: IPR004015  SKIP (SKI-interacting protein) is an essential spliceosomal component and transcriptional coregulator, which may provide regulatory coupling of transcription initiation and splicing []. SKIP was identified in a yeast 2-hybrid screen, where it was shown to interact with both the cellular and viral forms of SKI through the highly conserved region on SKIP known as the SNW domain []. SKIP is now known to interact with a number of other proteins as well. SKIP potentiates the activity of important transcription factors, such as vitamin D receptor, CBF1 (RBP-Jkappa), Smad2/3, and MyoD. It works with Ski in overcoming pRb-mediated cell cycle arrest, and it is targeted by the viral transactivators EBNA2 and E7 []. This entry represents the SNW domain.; GO: 0000398 nuclear mRNA splicing, via spliceosome, 0005681 spliceosomal complex
Probab=47.23  E-value=48  Score=35.26  Aligned_cols=20  Identities=25%  Similarity=0.471  Sum_probs=12.2

Q ss_pred             HHHhhHHhhhHHHHHHHhhhh
Q 001172          379 ERRLTELGRRTVEIFVLAHIF  399 (1133)
Q Consensus       379 e~~~~Elgrr~~~ifa~~k~F  399 (1133)
                      ++|++--||.+.+.-+ -..|
T Consensus        88 DKRlaadgr~l~~~~I-Nd~F  107 (158)
T PF02731_consen   88 DKRLAADGRGLQDVEI-NDKF  107 (158)
T ss_pred             HHHHhhcccccCCccc-cHHH
Confidence            5666656777666655 4555


No 179
>PF15066 CAGE1:  Cancer-associated gene protein 1 family
Probab=46.84  E-value=1.8e+02  Score=35.74  Aligned_cols=39  Identities=23%  Similarity=0.285  Sum_probs=29.8

Q ss_pred             hHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhhhhhhhh
Q 001172          439 AAEKEKKAKKKLAKQKRNNRKGKEKKREERSSMALSDRL  477 (1133)
Q Consensus       439 ~k~kekk~~~k~ak~~e~~rk~~e~~~ee~~~~~~~~~~  477 (1133)
                      ++|||.+.++=++-++|-.|.+++..+|....|...+|.
T Consensus       484 krEKe~~EqefLslqeEfQk~ekenl~ERqkLKs~leKL  522 (527)
T PF15066_consen  484 KREKETREQEFLSLQEEFQKHEKENLEERQKLKSRLEKL  522 (527)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Confidence            367777777777999999999999888887777665543


No 180
>KOG1772 consensus Vacuolar H+-ATPase V1 sector, subunit G [Energy production and conversion]
Probab=46.79  E-value=57  Score=32.75  Aligned_cols=25  Identities=20%  Similarity=0.223  Sum_probs=12.7

Q ss_pred             HHHHHHHHHHHHHHHhhhhhhhHHH
Q 001172          441 EKEKKAKKKLAKQKRNNRKGKEKKR  465 (1133)
Q Consensus       441 ~kekk~~~k~ak~~e~~rk~~e~~~  465 (1133)
                      .|-|-++-||||+|-...-++=+..
T Consensus        26 Rk~K~~RLKQAKeEA~~Eie~yr~q   50 (108)
T KOG1772|consen   26 RKRKLRRLKQAKEEAEKEIEEYRSQ   50 (108)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3444555666666655544433333


No 181
>cd02664 Peptidase_C19H A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=46.44  E-value=10  Score=43.10  Aligned_cols=33  Identities=15%  Similarity=0.208  Sum_probs=28.9

Q ss_pred             cCCchHHHHhhhhhHHHhhHHHHHhhhccccccchhhhccCH
Q 001172          215 YLTNVEQICRRFVEERRGKLGRLIEDKARWSSFCAFWLGIDQ  256 (1133)
Q Consensus       215 PpSNLgaTC~~fved~~~hLG~LLqS~~~~ssFRaf~~a~~~  256 (1133)
                      |+.|+|.||         ||+.+||...+...||.+++...+
T Consensus         1 GL~NlGnTC---------Y~NS~LQ~L~~~~~fr~~ll~~~~   33 (327)
T cd02664           1 GLINLGNTC---------YMNSVLQALFMAKDFRRQVLSLNL   33 (327)
T ss_pred             CCcCCcccH---------HHHHHHHHHHCcHHHHHHHHcCCc
Confidence            578999999         999999999998899998777653


No 182
>KOG0681 consensus Actin-related protein - Arp5p [Cytoskeleton]
Probab=46.34  E-value=31  Score=42.82  Aligned_cols=20  Identities=25%  Similarity=0.127  Sum_probs=14.8

Q ss_pred             hhHHHHHHHHHHHHHHHHHH
Q 001172          399 FSNKIEVAYQEAVALKRQEE  418 (1133)
Q Consensus       399 F~~rv~~Ay~E~~A~~rQ~~  418 (1133)
                      ++|||+-|-.++...-|+++
T Consensus       351 ~~qr~lkas~dar~rar~ek  370 (645)
T KOG0681|consen  351 KKQRILKASTDARLRARVEK  370 (645)
T ss_pred             HHHHHHHhhhhhhccccccc
Confidence            45788888888876667666


No 183
>COG5644 Uncharacterized conserved protein [Function unknown]
Probab=46.26  E-value=55  Score=40.80  Aligned_cols=32  Identities=41%  Similarity=0.358  Sum_probs=20.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHH
Q 001172          409 EAVALKRQEELIREEEAAWLAESEQKAKRGAAEKEK  444 (1133)
Q Consensus       409 E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~k~kek  444 (1133)
                      |---.-+|.+|.+||.=    .+||||||.+|-|-|
T Consensus       370 e~~~rt~qlr~mrelmf----reerkakr~~kiksk  401 (869)
T COG5644         370 EVAERTRQLRFMRELMF----REERKAKRVAKIKSK  401 (869)
T ss_pred             HHHHHHHHHHHHHHHHH----HHHHHHHHHHHHhHH
Confidence            33345578888888743    456778877655544


No 184
>TIGR00570 cdk7 CDK-activating kinase assembly factor MAT1. All proteins in this family for which functions are known are cyclin dependent protein kinases that are components of TFIIH, a complex that is involved in nucleotide excision repair and transcription initiation. Also known as MAT1 (menage a trois 1). This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=45.91  E-value=98  Score=36.21  Aligned_cols=8  Identities=38%  Similarity=0.401  Sum_probs=3.2

Q ss_pred             hhHHHHHH
Q 001172          335 DDVLLLLE  342 (1133)
Q Consensus       335 ~Dll~~le  342 (1133)
                      .|-|...|
T Consensus        96 NdYLE~vE  103 (309)
T TIGR00570        96 NDYLEEVE  103 (309)
T ss_pred             HHHHHHHH
Confidence            34444333


No 185
>KOG2635 consensus Medium subunit of clathrin adaptor complex [Intracellular trafficking, secretion, and vesicular transport]
Probab=44.98  E-value=50  Score=40.07  Aligned_cols=41  Identities=20%  Similarity=0.321  Sum_probs=20.5

Q ss_pred             HhhhHHH-HHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          385 LGRRTVE-IFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLA  429 (1133)
Q Consensus       385 lgrr~~~-ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~  429 (1133)
                      .-+.+|+ |||.+.+.    --.|+|-+-+--=+..++.+-.||+.
T Consensus       102 I~~~~FelifAFDEiv----sLGyre~v~laQikty~eMdSHEEKi  143 (512)
T KOG2635|consen  102 ILENAFELIFAFDEIV----SLGYRENVNLAQIKTYLEMDSHEEKI  143 (512)
T ss_pred             HHHhhhhhhhccchhh----hhcccccccHHHhhhhhccccHHHHH
Confidence            3345666 45544443    34488887554444444444344433


No 186
>KOG3756 consensus Pinin (desmosome-associated protein) [Cytoskeleton]
Probab=44.92  E-value=2.5e+02  Score=33.08  Aligned_cols=43  Identities=14%  Similarity=0.150  Sum_probs=23.8

Q ss_pred             HhhhHHHHHH-HhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          385 LGRRTVEIFV-LAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAE  430 (1133)
Q Consensus       385 lgrr~~~ifa-~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~  430 (1133)
                      -+||||-.+. .-.-|++   ..-+..-++.+|.++.+-+|+..+.+
T Consensus       140 RNRRmFG~LlGTL~KFkq---E~kr~t~rq~KraEieqKlEeq~~eE  183 (340)
T KOG3756|consen  140 RNRRMFGLLLGTLQKFKQ---ESKRATERQVKRAEIEQKLEEQAEEE  183 (340)
T ss_pred             HhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4899998433 4455763   22222225666777766665544433


No 187
>KOG3878 consensus Protein involved in maintenance of Golgi structure and ER-Golgi transport [Intracellular trafficking, secretion, and vesicular transport]
Probab=44.90  E-value=2.1e+02  Score=33.98  Aligned_cols=33  Identities=15%  Similarity=0.159  Sum_probs=16.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          401 NKIEVAYQEAVALKRQEELIREEEAAWLAESEQ  433 (1133)
Q Consensus       401 ~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~  433 (1133)
                      .|+-.+|+--||..+|.+=-.+-|+|.|+-+|+
T Consensus       115 dr~C~~F~~yia~~k~~kde~lkE~e~r~~ee~  147 (469)
T KOG3878|consen  115 DRMCSAFRPYIAAVKQDKDETLKEKELRLMEEK  147 (469)
T ss_pred             HhcchhhhhHHHHhhhhhhhHHHHHHHHHHHhh
Confidence            455556666666666655333333344443333


No 188
>cd02659 peptidase_C19C A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=44.63  E-value=8.5  Score=43.27  Aligned_cols=34  Identities=21%  Similarity=0.159  Sum_probs=29.9

Q ss_pred             eeecCCchHHHHhhhhhHHHhhHHHHHhhhccccccchhhhcc
Q 001172          212 VRVYLTNVEQICRRFVEERRGKLGRLIEDKARWSSFCAFWLGI  254 (1133)
Q Consensus       212 I~VPpSNLgaTC~~fved~~~hLG~LLqS~~~~ssFRaf~~a~  254 (1133)
                      ++||+.|+|.||         ||+.+||...+...||.+++..
T Consensus         1 g~~GL~N~GntC---------Y~NsvLQ~L~~~~~f~~~~l~~   34 (334)
T cd02659           1 GYVGLKNQGATC---------YMNSLLQQLYMTPEFRNAVYSI   34 (334)
T ss_pred             CCCCcccCCcch---------HHHHHHHHHhcCHHHHHHHHcC
Confidence            468999999999         9999999999888888876664


No 189
>KOG2441 consensus mRNA splicing factor/probable chromatin binding snw family nuclear protein [RNA processing and modification; Chromatin structure and dynamics]
Probab=44.40  E-value=69  Score=38.44  Aligned_cols=20  Identities=25%  Similarity=0.522  Sum_probs=11.8

Q ss_pred             HHHhhHHhhhHHHHHHHhhhh
Q 001172          379 ERRLTELGRRTVEIFVLAHIF  399 (1133)
Q Consensus       379 e~~~~Elgrr~~~ifa~~k~F  399 (1133)
                      +++|+-.||-+-++-+ -.+|
T Consensus       261 dkRlaadgrglq~v~I-NdnF  280 (506)
T KOG2441|consen  261 DKRLAADGRGLQDVHI-NDNF  280 (506)
T ss_pred             hhhhhhccCCccccee-cccH
Confidence            5667767776655533 4444


No 190
>CHL00019 atpF ATP synthase CF0 B subunit
Probab=44.34  E-value=2.2e+02  Score=30.31  Aligned_cols=11  Identities=18%  Similarity=-0.167  Sum_probs=6.6

Q ss_pred             HHHHHhhcCCC
Q 001172          339 LLLERAALEPL  349 (1133)
Q Consensus       339 ~~leR~al~~L  349 (1133)
                      .+|-+++-.|+
T Consensus        39 ~lL~~fl~kPI   49 (184)
T CHL00019         39 GVLIYFGKGVL   49 (184)
T ss_pred             HHHHHHhHhHH
Confidence            45666666665


No 191
>PF15346 ARGLU:  Arginine and glutamate-rich 1
Probab=44.21  E-value=3.9e+02  Score=28.51  Aligned_cols=16  Identities=44%  Similarity=0.337  Sum_probs=6.2

Q ss_pred             HHHHhhhhhhhhhccc
Q 001172          465 REERSSMALSDRLEDE  480 (1133)
Q Consensus       465 ~ee~~~~~~~~~~~~~  480 (1133)
                      ++|+..+..+..+..+
T Consensus       100 aeEr~~~~Eer~~~~e  115 (149)
T PF15346_consen  100 AEERLRMEEERRRELE  115 (149)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3334444333333333


No 192
>KOG4571 consensus Activating transcription factor 4 [Transcription]
Probab=43.95  E-value=1e+02  Score=35.85  Aligned_cols=52  Identities=23%  Similarity=0.228  Sum_probs=40.5

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhhhh
Q 001172          407 YQEAVALKRQEELIREEEAAWLAESEQKAKRGAAEKEKKAKKKLAKQKRNNRK  459 (1133)
Q Consensus       407 y~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~k~kekk~~~k~ak~~e~~rk  459 (1133)
                      |+++. ..-++.|+-|++..+++.+|.|.+-..-+||=+..|+.-.|.-++|+
T Consensus       242 YRqKk-Rae~E~l~ge~~~Le~rN~~LK~qa~~lerEI~ylKqli~e~~~~r~  293 (294)
T KOG4571|consen  242 YRQKK-RAEKEALLGELEGLEKRNEELKDQASELEREIRYLKQLILEVYKKRV  293 (294)
T ss_pred             HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence            88886 45567788888888888888888777788888888888766666553


No 193
>KOG1015 consensus Transcription regulator XNP/ATRX, DEAD-box superfamily [Transcription]
Probab=43.78  E-value=19  Score=47.01  Aligned_cols=54  Identities=22%  Similarity=0.276  Sum_probs=31.0

Q ss_pred             HHHHHHHHHHHHHHHHHHhhh-------------HHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhhh
Q 001172          419 LIREEEAAWLAESEQKAKRGA-------------AEKEKKAKKKLAKQKRNNRKGKEKKREERSSMA  472 (1133)
Q Consensus       419 Ll~E~E~e~~~~~e~~~kk~~-------------k~kekk~~~k~ak~~e~~rk~~e~~~ee~~~~~  472 (1133)
                      +=-|.|++++++.|+++|+..             |+|-+|+.+|..|..+..+.-.+++++|++++.
T Consensus       549 se~E~ee~ekK~~ek~~kk~esseSd~vn~~sksK~K~rKkiRkII~d~kL~keT~~a~k~EkeRrk  615 (1567)
T KOG1015|consen  549 SESEEEEEEKKEEEKEEKKEESSESDNVNDDSKSKGKGRKKIRKIIKDDKLRKETQNALKEEKERRK  615 (1567)
T ss_pred             ccccHHHHHHHHHhhhhhcccccccCCcCccccccccchhHHHhhcchhhhchhHHHHHHHHHHHHH
Confidence            445666666667777777752             444456666666666555544444455544433


No 194
>PRK12704 phosphodiesterase; Provisional
Probab=43.56  E-value=62  Score=40.08  Aligned_cols=20  Identities=20%  Similarity=0.298  Sum_probs=12.0

Q ss_pred             cccccCCCCcccCCCcccee
Q 001172          569 ASVMDDSSSTCSTDSVPSVV  588 (1133)
Q Consensus       569 ~s~~~~~sstcs~ds~~~~~  588 (1133)
                      ..+|.=-++-|.++...|||
T Consensus       195 ~~a~qr~a~~~~~e~~~~~v  214 (520)
T PRK12704        195 AQAIQRCAADHVAETTVSVV  214 (520)
T ss_pred             HHHHHhhcchhhhhhceeee
Confidence            34555566666666666555


No 195
>PRK04863 mukB cell division protein MukB; Provisional
Probab=42.91  E-value=1.9e+02  Score=40.46  Aligned_cols=40  Identities=13%  Similarity=0.387  Sum_probs=29.5

Q ss_pred             HHHHHhhHHhhhHHH--HHHHhhhhhHHHHHHHHHHHHHHHH
Q 001172          377 RDERRLTELGRRTVE--IFVLAHIFSNKIEVAYQEAVALKRQ  416 (1133)
Q Consensus       377 ~ee~~~~Elgrr~~~--ifa~~k~F~~rv~~Ay~E~~A~~rQ  416 (1133)
                      +.++..++...+-+.  +|+|+.+...|-..+|.+++-.+..
T Consensus       941 r~e~~~~~~~~~~~~~~~~~l~~~~~~~~~~~y~~~~~~l~~  982 (1486)
T PRK04863        941 KQDYQQAQQTQRDAKQQAFALTEVVQRRAHFSYEDAAEMLAK  982 (1486)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccHHHHHhHhhc
Confidence            355555555555543  8999999999999999999885543


No 196
>smart00784 SPT2 SPT2 chromatin protein. This entry includes the Saccharomyces cerevisiae protein SPT2 which is a chromatin protein involved in transcriptional regulation PUBMED:15563464.
Probab=42.69  E-value=41  Score=33.91  Aligned_cols=36  Identities=14%  Similarity=0.101  Sum_probs=20.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 001172          403 IEVAYQEAVALKRQEELIREEEAAWLAESEQKAKRG  438 (1133)
Q Consensus       403 v~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~  438 (1133)
                      +++.|.+....+.+-..|..+|.++.++.|+++|+.
T Consensus        69 MEa~~~~i~~EE~~s~r~a~~ED~~e~~~e~~~~~~  104 (111)
T smart00784       69 MEAGGADIQEEERRSARLARLEDREEERLEKEEERE  104 (111)
T ss_pred             hhcCHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHH
Confidence            556666666666666666665555554444444433


No 197
>KOG4657 consensus Uncharacterized conserved protein [Function unknown]
Probab=42.49  E-value=4.4e+02  Score=29.94  Aligned_cols=53  Identities=23%  Similarity=0.175  Sum_probs=26.5

Q ss_pred             HHHHHHHHHHHH-HHHHHHHHHHHHHH-HHHHHHHHHhhhHHHHHHHHHHHHHHHHh
Q 001172          402 KIEVAYQEAVAL-KRQEELIREEEAAW-LAESEQKAKRGAAEKEKKAKKKLAKQKRN  456 (1133)
Q Consensus       402 rv~~Ay~E~~A~-~rQ~~Ll~E~E~e~-~~~~e~~~kk~~k~kekk~~~k~ak~~e~  456 (1133)
                      +++..++|..++ .++.+|++|.+++. +.+.|-+++++  |-|.-++.+|-+++|+
T Consensus        62 ~l~~~l~etene~~~~neL~~ek~~~q~~ieqeik~~q~--elEvl~~n~Q~lkeE~  116 (246)
T KOG4657|consen   62 NLKADLRETENELVKVNELKTEKEARQMGIEQEIKATQS--ELEVLRRNLQLLKEEK  116 (246)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHh
Confidence            334455555544 36777777766544 33445555554  3444344444333333


No 198
>PF12757 DUF3812:  Protein of unknown function (DUF3812);  InterPro: IPR024527 This family of fungal proteins represents the eisosome 1 family. Eisosome protein 1 is required for normal formation of eisosomes, large cytoplasmic protein assemblies that localize to specialised domains on plasma membrane and mark the site of endocytosis [].
Probab=42.12  E-value=58  Score=33.17  Aligned_cols=17  Identities=29%  Similarity=0.470  Sum_probs=8.4

Q ss_pred             HHHHHHHHHHHHHHHHH
Q 001172          409 EAVALKRQEELIREEEA  425 (1133)
Q Consensus       409 E~~A~~rQ~~Ll~E~E~  425 (1133)
                      +++|..+=.=+|-|..+
T Consensus        75 d~IA~~rVqPvLdeI~e   91 (126)
T PF12757_consen   75 DAIARKRVQPVLDEIDE   91 (126)
T ss_pred             HHHHHHhcCCchHHHHH
Confidence            44444444555555544


No 199
>PF09831 DUF2058:  Uncharacterized protein conserved in bacteria (DUF2058);  InterPro: IPR018636  This family, found in various prokaryotic proteins, has no known function. 
Probab=41.07  E-value=45  Score=36.08  Aligned_cols=7  Identities=43%  Similarity=0.624  Sum_probs=2.7

Q ss_pred             HHHHhhh
Q 001172          433 QKAKRGA  439 (1133)
Q Consensus       433 ~~~kk~~  439 (1133)
                      +|+|+.+
T Consensus        15 kkakka~   21 (177)
T PF09831_consen   15 KKAKKAK   21 (177)
T ss_pred             HHHHHHH
Confidence            3333333


No 200
>PRK06569 F0F1 ATP synthase subunit B'; Validated
Probab=41.01  E-value=3.5e+02  Score=28.86  Aligned_cols=12  Identities=17%  Similarity=0.263  Sum_probs=4.7

Q ss_pred             hhhhhHHHHHHH
Q 001172          396 AHIFSNKIEVAY  407 (1133)
Q Consensus       396 ~k~F~~rv~~Ay  407 (1133)
                      .+.|-.+|...+
T Consensus        28 ~kf~~ppI~~iL   39 (155)
T PRK06569         28 YKFITPKAEEIF   39 (155)
T ss_pred             HHHhHHHHHHHH
Confidence            444333443333


No 201
>PF10186 Atg14:  UV radiation resistance protein and autophagy-related subunit 14;  InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 [].  The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=40.68  E-value=5.4e+02  Score=28.46  Aligned_cols=28  Identities=21%  Similarity=0.201  Sum_probs=19.7

Q ss_pred             HhhHHhhhHHHHHHHhhhhhHHHHHHHH
Q 001172          381 RLTELGRRTVEIFVLAHIFSNKIEVAYQ  408 (1133)
Q Consensus       381 ~~~Elgrr~~~ifa~~k~F~~rv~~Ay~  408 (1133)
                      ++.++.-++.++...-..++.+|..+..
T Consensus        21 ~L~~~~~~l~~~~~~~~~l~~~i~~~l~   48 (302)
T PF10186_consen   21 RLLELRSELQQLKEENEELRRRIEEILE   48 (302)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4666676777777766777777777766


No 202
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=40.63  E-value=1.2e+02  Score=36.69  Aligned_cols=14  Identities=21%  Similarity=0.102  Sum_probs=8.0

Q ss_pred             hhhhhhcccCCCCC
Q 001172          486 KKEFIVEDAQPLPE  499 (1133)
Q Consensus       486 ~~~~~~~~~~~~~~  499 (1133)
                      ++..++.++.++++
T Consensus       118 ~~~~~~r~~e~la~  131 (459)
T KOG0288|consen  118 EMRRKMRIAERLAE  131 (459)
T ss_pred             HHHHHHHHHHHHHH
Confidence            44555556666665


No 203
>PF15236 CCDC66:  Coiled-coil domain-containing protein 66
Probab=40.37  E-value=1.5e+02  Score=31.66  Aligned_cols=26  Identities=31%  Similarity=0.358  Sum_probs=12.7

Q ss_pred             HHHHHHHHHHH-HHHHHHHhhhHHHHH
Q 001172          419 LIREEEAAWLA-ESEQKAKRGAAEKEK  444 (1133)
Q Consensus       419 Ll~E~E~e~~~-~~e~~~kk~~k~kek  444 (1133)
                      .++++|.+.++ +++.++.|..++++.
T Consensus        72 r~k~~E~err~~EE~~EE~Rl~rere~   98 (157)
T PF15236_consen   72 RQKQEEEERRRREEEEEEERLAREREE   98 (157)
T ss_pred             HHHHHHHHHHHHHhHHHHHHHHHHHHH
Confidence            34444444433 445555566555553


No 204
>COG5374 Uncharacterized conserved protein [Function unknown]
Probab=39.75  E-value=1.2e+02  Score=33.19  Aligned_cols=68  Identities=16%  Similarity=0.059  Sum_probs=34.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhhh---HH-----HHHHHHHHHHHHHHhhhhhhhHHHHHHhhhhhhhhhcc
Q 001172          412 ALKRQEELIREEEAAWLAESEQKAKRGA---AE-----KEKKAKKKLAKQKRNNRKGKEKKREERSSMALSDRLED  479 (1133)
Q Consensus       412 A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~---k~-----kekk~~~k~ak~~e~~rk~~e~~~ee~~~~~~~~~~~~  479 (1133)
                      ...++..+++|+=+++-...-+.-+...   .-     |.-+..+..+|..|...|.-|+..|-++.++.++.+.+
T Consensus       116 ~v~~~~~~v~~ml~~~~~~~~k~D~~eA~~t~lk~~~~~~~~~le~Lqkn~~~~~k~~d~~ne~~~~v~~e~skkk  191 (192)
T COG5374         116 VVMRVMSIVEEMLEENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESSKKK  191 (192)
T ss_pred             HHHHHHHHHHHHHHHhhhcccchhhhhcchHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHhhcC
Confidence            4567888888876665332211111111   11     11122223355555556666666666666666555443


No 205
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=39.40  E-value=5.6e+02  Score=31.44  Aligned_cols=14  Identities=29%  Similarity=0.496  Sum_probs=9.8

Q ss_pred             hhHHHHHHHhhcCC
Q 001172          335 DDVLLLLERAALEP  348 (1133)
Q Consensus       335 ~Dll~~leR~al~~  348 (1133)
                      --+|.++-|.+..|
T Consensus       118 a~~L~A~~r~g~~p  131 (420)
T COG4942         118 AEQLAALQRSGRNP  131 (420)
T ss_pred             HHHHHHHHhccCCC
Confidence            44667778877776


No 206
>cd02669 Peptidase_C19M A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=39.34  E-value=14  Score=44.30  Aligned_cols=37  Identities=16%  Similarity=0.062  Sum_probs=32.6

Q ss_pred             ceeeecCCchHHHHhhhhhHHHhhHHHHHhhhccccccchhhhccC
Q 001172          210 ELVRVYLTNVEQICRRFVEERRGKLGRLIEDKARWSSFCAFWLGID  255 (1133)
Q Consensus       210 ~tI~VPpSNLgaTC~~fved~~~hLG~LLqS~~~~ssFRaf~~a~~  255 (1133)
                      ..+.||+.|+|.||         +|+.+||...+...||.+++...
T Consensus       116 ~~G~vGL~NlGnTC---------YmNsvLQ~L~~~p~lr~~~l~~~  152 (440)
T cd02669         116 LPGFVGLNNIKNND---------YANVIIQALSHVKPIRNFFLLYE  152 (440)
T ss_pred             cCCccCccCCCCch---------HHHHHHHHHHCCHHHHHHHhhcc
Confidence            35789999999999         99999999999999999877543


No 207
>PF00430 ATP-synt_B:  ATP synthase B/B' CF(0);  InterPro: IPR002146 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include:   F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP.   F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis []. These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents subunits B and B' from the F0 complex in F-ATPases found in chloroplasts and in bacterial plasma membranes. The B subunits are part of the peripheral stalk that links the F1 and F0 complexes together, and which acts as a stator to prevent certain subunits from rotating with the central rotary element. The peripheral stalk differs in subunit composition between mitochondrial, chloroplast and bacterial F-ATPases. In bacterial and chloroplast F-ATPases, the peripheral stalk is composed of one copy of the delta subunit (homologous to OSCP in mitochondria), and two copies of subunit B in bacteria, or one copy each of subunits B and B' in chloroplasts and photosynthetic bacteria []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0015078 hydrogen ion transmembrane transporter activity, 0015986 ATP synthesis coupled proton transport, 0045263 proton-transporting ATP synthase complex, coupling factor F(o); PDB: 1L2P_A 2KHK_A 1B9U_A.
Probab=39.16  E-value=94  Score=30.46  Aligned_cols=14  Identities=36%  Similarity=0.472  Sum_probs=5.8

Q ss_pred             HHHHHHHHHHHHHH
Q 001172          403 IEVAYQEAVALKRQ  416 (1133)
Q Consensus       403 v~~Ay~E~~A~~rQ  416 (1133)
                      |.....+....+.+
T Consensus        35 I~~~~~~a~~~~~e   48 (132)
T PF00430_consen   35 IQSELEEAEELKEE   48 (132)
T ss_dssp             HHHHHHHHHHHHHH
T ss_pred             HHHHHHhHHHHHHH
Confidence            44444444433333


No 208
>KOG0112 consensus Large RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=39.10  E-value=28  Score=45.07  Aligned_cols=19  Identities=26%  Similarity=0.275  Sum_probs=14.7

Q ss_pred             CCCCCCccc-cccCCCCCCC
Q 001172          498 PEKPDVLED-VSDVSDSVDG  516 (1133)
Q Consensus       498 ~~k~d~~e~-~sd~sd~~d~  516 (1133)
                      ..|-|..++ +||+.|-++=
T Consensus       784 ~~k~da~~~~dsd~~~~~sl  803 (975)
T KOG0112|consen  784 HRKGDASDESDSDSNDKLSL  803 (975)
T ss_pred             CCcCcCccCccccccCcccc
Confidence            357788888 8999998763


No 209
>PHA03247 large tegument protein UL36; Provisional
Probab=38.99  E-value=1.2e+03  Score=35.24  Aligned_cols=11  Identities=18%  Similarity=0.359  Sum_probs=5.5

Q ss_pred             HHHHHhhcccc
Q 001172          289 VMDSLYSGLKA  299 (1133)
Q Consensus       289 vmd~L~~glk~  299 (1133)
                      ...+|+..|+-
T Consensus      2226 aLaalWs~L~P 2236 (3151)
T PHA03247       2226 ALAALWSCMFP 2236 (3151)
T ss_pred             HHHHHHHhhCc
Confidence            34455555543


No 210
>PRK11637 AmiB activator; Provisional
Probab=38.29  E-value=3.3e+02  Score=32.72  Aligned_cols=9  Identities=0%  Similarity=-0.226  Sum_probs=4.9

Q ss_pred             CCCCCCCcC
Q 001172          528 RDTSPVNWD  536 (1133)
Q Consensus       528 r~~s~v~w~  536 (1133)
                      +.+++..|-
T Consensus       314 ~i~~~fg~~  322 (428)
T PRK11637        314 PTLHRFGEQ  322 (428)
T ss_pred             CccCCCCCC
Confidence            555555554


No 211
>PF03879 Cgr1:  Cgr1 family;  InterPro: IPR005579 Cgr1 is involved in nucleolar integrity and is required for processing pre-rRNA for the 60S ribosome subunit. In Saccharomyces cerevisiae, this protein is conserved and contributes to compartmentalisation of nucleolar constituents []. Cgr1 is a small hydrophilic protein and members of this family are coiled-coil proteins []. Its primary role appears to be in ribosome biogenesis [, ]. Expression of CGR1 is also associated with a cessation of yeast cell growth, which is a prerequisite for germination in this organism [].
Probab=37.95  E-value=3.9e+02  Score=27.08  Aligned_cols=23  Identities=13%  Similarity=0.313  Sum_probs=12.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHH
Q 001172          404 EVAYQEAVALKRQEELIREEEAA  426 (1133)
Q Consensus       404 ~~Ay~E~~A~~rQ~~Ll~E~E~e  426 (1133)
                      ...|.....+..+++.|.+.|.|
T Consensus        30 ~tSwekr~~~R~~~~~~K~~ekE   52 (108)
T PF03879_consen   30 RTSWEKRMEKRLELKAIKEKEKE   52 (108)
T ss_pred             cCcHHHHHHHHHHHHHHHHHHHH
Confidence            34455555555555555555444


No 212
>KOG2507 consensus Ubiquitin regulatory protein UBXD2, contains UAS and UBX domains [General function prediction only]
Probab=37.81  E-value=28  Score=41.95  Aligned_cols=11  Identities=27%  Similarity=0.374  Sum_probs=4.1

Q ss_pred             HHHHHHHHHHH
Q 001172          408 QEAVALKRQEE  418 (1133)
Q Consensus       408 ~E~~A~~rQ~~  418 (1133)
                      ++..-+++-++
T Consensus       204 tk~leqkr~Ek  214 (506)
T KOG2507|consen  204 TKNLEQKRVEK  214 (506)
T ss_pred             Hhhccccchhh
Confidence            33333333333


No 213
>PF09731 Mitofilin:  Mitochondrial inner membrane protein;  InterPro: IPR019133  Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex []. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence (residues 1-43) rich in positive-charged and hydroxylated residues and a membrane anchor domain (residues 47-66). In addition, it has three centrally located coiled coil domains (residues 200-240,280-310 and 400-420) []. ; GO: 0031305 integral to mitochondrial inner membrane
Probab=36.95  E-value=6.4e+02  Score=31.39  Aligned_cols=34  Identities=21%  Similarity=0.248  Sum_probs=15.6

Q ss_pred             hHHHHHHhhHHhhhHHHHHHHhhhhhHHHHHHHHHHH
Q 001172          375 IERDERRLTELGRRTVEIFVLAHIFSNKIEVAYQEAV  411 (1133)
Q Consensus       375 ve~ee~~~~Elgrr~~~ifa~~k~F~~rv~~Ay~E~~  411 (1133)
                      |..=..+...|.+++-++-.   .++.+|+.+..+..
T Consensus       253 i~~a~~~i~~L~~~l~~l~~---~~~~~l~~~L~~q~  286 (582)
T PF09731_consen  253 IAHAKERIDALQKELAELKE---EEEEELERALEEQR  286 (582)
T ss_pred             HHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHH
Confidence            44333344455665555333   33345555544444


No 214
>TIGR00570 cdk7 CDK-activating kinase assembly factor MAT1. All proteins in this family for which functions are known are cyclin dependent protein kinases that are components of TFIIH, a complex that is involved in nucleotide excision repair and transcription initiation. Also known as MAT1 (menage a trois 1). This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=36.89  E-value=1.2e+02  Score=35.53  Aligned_cols=8  Identities=25%  Similarity=0.297  Sum_probs=3.0

Q ss_pred             HHHHHHHH
Q 001172          429 AESEQKAK  436 (1133)
Q Consensus       429 ~~~e~~~k  436 (1133)
                      ++.|++++
T Consensus       151 ~~~E~~~~  158 (309)
T TIGR00570       151 LEFEKEEE  158 (309)
T ss_pred             HHHHHHHH
Confidence            33333333


No 215
>KOG1265 consensus Phospholipase C [Lipid transport and metabolism]
Probab=36.42  E-value=2.4e+02  Score=37.43  Aligned_cols=119  Identities=24%  Similarity=0.189  Sum_probs=0.0

Q ss_pred             CCCcccchhHHHHHHhhHHhhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHH
Q 001172          367 GEDFNKDSIERDERRLTELGRRTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKRGAAEKEKKA  446 (1133)
Q Consensus       367 ~~dvs~dtve~ee~~~~Elgrr~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~k~kekk~  446 (1133)
                      +.|...-..+-..++-.||-+-|=+-..         ...+.-..-++-|-.||+++ ..+....+.|+-|..-+||+|.
T Consensus      1024 ~~d~~~r~~el~~rq~~el~~~~~~~~~---------~e~e~k~~hl~~~~~~l~kl-~~eaq~~Q~k~LK~~~e~e~kE 1093 (1189)
T KOG1265|consen 1024 NSDNAGRVRELVNRQTQELLEMRREQYE---------EEFELKEEHLKEQISLLRKL-LSEAQTNQTKALKESLEKETKE 1093 (1189)
T ss_pred             cchhhhhHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH


Q ss_pred             HHHHHHHHHhhhhh-----hhHHHHHHhhhh----------hhhhhcccCCchhhhhhhhcccC
Q 001172          447 KKKLAKQKRNNRKG-----KEKKREERSSMA----------LSDRLEDENPSDEKKEFIVEDAQ  495 (1133)
Q Consensus       447 ~~k~ak~~e~~rk~-----~e~~~ee~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~  495 (1133)
                      .+|...++..++-.     ++|++.|+..+.          ++-|++++...+..+..++++.+
T Consensus      1094 lk~~l~kkr~e~ik~~~~~kdK~e~er~~rE~n~s~i~~~V~e~krL~~~~~k~~e~L~k~~~~ 1157 (1189)
T KOG1265|consen 1094 LKKKLDKKRMEDIKVDKVIKDKAERERRKRELNSSNIKEFVEERKRLAEKQSKRQEQLVKKHLE 1157 (1189)
T ss_pred             HHHHHHHHHHHhhhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH


No 216
>PF04889 Cwf_Cwc_15:  Cwf15/Cwc15 cell cycle control protein;  InterPro: IPR006973 This family represents Cwf15/Cwc15 (from Schizosaccharomyces pombe and Saccharomyces cerevisiae respectively) and their homologues. The function of these proteins is unknown, but they form part of the spliceosome and are thus thought to be involved in mRNA splicing [].; GO: 0000398 nuclear mRNA splicing, via spliceosome, 0005681 spliceosomal complex
Probab=36.41  E-value=69  Score=36.15  Aligned_cols=7  Identities=57%  Similarity=0.757  Sum_probs=3.8

Q ss_pred             HHHHHHH
Q 001172          418 ELIREEE  424 (1133)
Q Consensus       418 ~Ll~E~E  424 (1133)
                      .||+|||
T Consensus       150 ~Ll~ELe  156 (244)
T PF04889_consen  150 ALLRELE  156 (244)
T ss_pred             HHHHHHH
Confidence            4555664


No 217
>PLN02316 synthase/transferase
Probab=36.31  E-value=78  Score=42.41  Aligned_cols=25  Identities=24%  Similarity=0.266  Sum_probs=15.3

Q ss_pred             cCCCCcccCCCccceeeecCccCCCccc
Q 001172          573 DDSSSTCSTDSVPSVVMYGPYKGNSLAN  600 (1133)
Q Consensus       573 ~~~sstcs~ds~~~~~~~g~~~~~~~~~  600 (1133)
                      =+-++++.+.+ +.|.+.|.|  |+|++
T Consensus       335 YN~~~~~L~~~-~~v~i~gg~--N~W~~  359 (1036)
T PLN02316        335 YNRSSGPLAHS-TEIWIHGGY--NNWID  359 (1036)
T ss_pred             ECCCCCCCCCC-CcEEEEEeE--cCCCC
Confidence            34455666653 667788854  66654


No 218
>PF11861 DUF3381:  Domain of unknown function (DUF3381);  InterPro: IPR024576 This uncharacterised domain is found in eukaryotic rRNA methyltransferases like yeast Spb1 and mammalian homologue Ftsj3. Spb1 is required for proper assembly of pre-ribosomal particles during the biogenesis of the 60S ribosomal subunit [, ].
Probab=35.80  E-value=81  Score=33.40  Aligned_cols=14  Identities=36%  Similarity=0.496  Sum_probs=7.1

Q ss_pred             eeehhhHHHHHHHh
Q 001172          331 FVLVDDVLLLLERA  344 (1133)
Q Consensus       331 ~~l~~Dll~~leR~  344 (1133)
                      |+-++|.+.+|-.+
T Consensus        30 Fi~s~dpi~~L~~~   43 (159)
T PF11861_consen   30 FIKSEDPIDLLGSA   43 (159)
T ss_pred             HhcCCcHHHHHHHc
Confidence            33345555555554


No 219
>smart00786 SHR3_chaperone ER membrane protein SH3. This family of proteins are membrane localised chaperones that are required for correct plasma membrane localisation of amino acid permeases (AAPs) PUBMED:15623581. Shr3 prevents AAPs proteins from aggregating and assists in their correct folding. In the absence of Shr3, AAPs are retained in the ER.
Probab=35.78  E-value=44  Score=36.63  Aligned_cols=13  Identities=38%  Similarity=0.585  Sum_probs=5.2

Q ss_pred             HHHHHHHHHHHHH
Q 001172          407 YQEAVALKRQEEL  419 (1133)
Q Consensus       407 y~E~~A~~rQ~~L  419 (1133)
                      |-|..-.+.+++.
T Consensus       156 YAer~~~~~~~~~  168 (196)
T smart00786      156 YAERKDAKQKEEF  168 (196)
T ss_pred             HHHHhHHHHHHHH
Confidence            4444433333333


No 220
>cd03405 Band_7_HflC Band_7_HflC: The band 7 domain of flotillin (reggie) like proteins. This group includes proteins similar to prokaryotic HlfC (High frequency of lysogenization C). Although many members of the band 7 family are lipid raft associated, prokaryote plasma membranes lack cholesterol and are unlikely to have lipid raft domains.  Individual proteins of this band 7 domain family may cluster to form membrane microdomains which may in turn recruit multiprotein complexes. Escherichia coli HflC is an integral membrane protein which may localize to the plasma membrane. HflC associates with another band 7 family member (HflK) to form an HflKC complex.  HflKC interacts with FtsH in a large complex termed the FtsH holo-enzyme. FtsH is an AAA ATP-dependent protease which exerts progressive proteolysis against membrane-embedded and soluble substrate proteins.  HflKC can modulate the activity of FtsH. HflKC plays a role in the decision between lysogenic and lytic cycle growth during la
Probab=35.24  E-value=2e+02  Score=31.34  Aligned_cols=32  Identities=25%  Similarity=0.337  Sum_probs=15.5

Q ss_pred             HHHHHHHhhhHHHHHHHHHHHHHHHHhhhhhh
Q 001172          430 ESEQKAKRGAAEKEKKAKKKLAKQKRNNRKGK  461 (1133)
Q Consensus       430 ~~e~~~kk~~k~kekk~~~k~ak~~e~~rk~~  461 (1133)
                      +.|+++.+.+.+-++++.+.+|+.+..++..+
T Consensus       176 e~~~~a~~~~aea~~~~~~~~Aea~a~a~~~~  207 (242)
T cd03405         176 EGEEEAERIRADADRERTVILAEAYREAQEIR  207 (242)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            33444444444445555555555555554433


No 221
>cd02661 Peptidase_C19E A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=35.23  E-value=23  Score=38.79  Aligned_cols=34  Identities=15%  Similarity=0.268  Sum_probs=29.4

Q ss_pred             eecCCchHHHHhhhhhHHHhhHHHHHhhhccccccchhhhccC
Q 001172          213 RVYLTNVEQICRRFVEERRGKLGRLIEDKARWSSFCAFWLGID  255 (1133)
Q Consensus       213 ~VPpSNLgaTC~~fved~~~hLG~LLqS~~~~ssFRaf~~a~~  255 (1133)
                      +||+.|+|.||         +|+.+||...+...||.+++...
T Consensus         1 ~~GL~N~gntC---------Y~NsvLQ~L~~~~~~~~~~~~~~   34 (304)
T cd02661           1 GAGLQNLGNTC---------FLNSVLQCLTHTPPLANYLLSRE   34 (304)
T ss_pred             CCCccccCchh---------HHHHHHHHhhCCHHHHHHHhcch
Confidence            37899999999         99999999998888988776544


No 222
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=35.07  E-value=2.4e+02  Score=34.40  Aligned_cols=9  Identities=22%  Similarity=0.076  Sum_probs=4.6

Q ss_pred             CCCCCCCcC
Q 001172          528 RDTSPVNWD  536 (1133)
Q Consensus       528 r~~s~v~w~  536 (1133)
                      +..-||+..
T Consensus       298 ~l~~PV~G~  306 (420)
T COG4942         298 QLAWPVTGR  306 (420)
T ss_pred             CcCCCCCCc
Confidence            445566554


No 223
>PF13904 DUF4207:  Domain of unknown function (DUF4207)
Probab=34.99  E-value=2.2e+02  Score=32.34  Aligned_cols=12  Identities=17%  Similarity=0.133  Sum_probs=5.4

Q ss_pred             hhhHHHHHHHhh
Q 001172          386 GRRTVEIFVLAH  397 (1133)
Q Consensus       386 grr~~~ifa~~k  397 (1133)
                      ..-.++-+.+.|
T Consensus        85 ~~~aye~Wl~~K   96 (264)
T PF13904_consen   85 SLSAYEEWLSAK   96 (264)
T ss_pred             ccchHHHHHHHH
Confidence            334555554333


No 224
>TIGR03319 YmdA_YtgF conserved hypothetical protein YmdA/YtgF.
Probab=34.56  E-value=1e+02  Score=38.13  Aligned_cols=20  Identities=20%  Similarity=0.275  Sum_probs=13.7

Q ss_pred             cccccCCCCcccCCCcccee
Q 001172          569 ASVMDDSSSTCSTDSVPSVV  588 (1133)
Q Consensus       569 ~s~~~~~sstcs~ds~~~~~  588 (1133)
                      ..+|.=-++-|.++...+||
T Consensus       189 ~~aiqr~a~~~~~e~~~~~v  208 (514)
T TIGR03319       189 ATAIQRYAGDHVAETTVSVV  208 (514)
T ss_pred             HHHHHhccchhhhhheeeeE
Confidence            45566667777777777666


No 225
>KOG4673 consensus Transcription factor TMF, TATA element modulatory factor [Transcription]
Probab=34.04  E-value=1.5e+02  Score=38.14  Aligned_cols=37  Identities=30%  Similarity=0.374  Sum_probs=29.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 001172          401 NKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKR  437 (1133)
Q Consensus       401 ~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk  437 (1133)
                      .+|-++++|.=|+.+|.+=|+++-+....+.|.++|.
T Consensus       416 kKvqa~~kERDalr~e~kslk~ela~~l~~DeLaEkd  452 (961)
T KOG4673|consen  416 KKVQALTKERDALRREQKSLKKELAAALLKDELAEKD  452 (961)
T ss_pred             HHHHHHHHhHHHHHHHHHHHHHHHHHhhhhHHHHHHH
Confidence            4567889999999999998888888777766655554


No 226
>COG5269 ZUO1 Ribosome-associated chaperone zuotin [Translation, ribosomal structure and biogenesis / Posttranslational modification, protein turnover, chaperones]
Probab=33.87  E-value=2.6e+02  Score=32.50  Aligned_cols=8  Identities=38%  Similarity=0.468  Sum_probs=5.0

Q ss_pred             hhHHHHHH
Q 001172          387 RRTVEIFV  394 (1133)
Q Consensus       387 rr~~~ifa  394 (1133)
                      +|++|+..
T Consensus       219 ~~LV~~A~  226 (379)
T COG5269         219 KRLVQIAK  226 (379)
T ss_pred             HHHHHHHH
Confidence            46666665


No 227
>PF05890 Ebp2:  Eukaryotic rRNA processing protein EBP2;  InterPro: IPR008610 This family consists of several eukaryotic rRNA processing protein EBP2 sequences. Ebp2p is required for the maturation of 25S rRNA and 60S subunit assembly. Ebp2p may be one of the target proteins of Rrs1p for executing the signal to regulate ribosome biogenesis [].
Probab=33.62  E-value=1.7e+02  Score=33.55  Aligned_cols=25  Identities=16%  Similarity=0.229  Sum_probs=14.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          408 QEAVALKRQEELIREEEAAWLAESE  432 (1133)
Q Consensus       408 ~E~~A~~rQ~~Ll~E~E~e~~~~~e  432 (1133)
                      .+....+...+||.|.+.-+..|+.
T Consensus       116 SD~HM~KVr~kLl~~~~~ie~~E~~  140 (271)
T PF05890_consen  116 SDEHMEKVRQKLLKEQKRIEASEEA  140 (271)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3455566677777776554444443


No 228
>KOG0717 consensus Molecular chaperone (DnaJ superfamily) [Posttranslational modification, protein turnover, chaperones]
Probab=33.24  E-value=3.7e+02  Score=33.33  Aligned_cols=16  Identities=19%  Similarity=0.144  Sum_probs=9.9

Q ss_pred             HHHHHHHHHHHHHHHH
Q 001172          410 AVALKRQEELIREEEA  425 (1133)
Q Consensus       410 ~~A~~rQ~~Ll~E~E~  425 (1133)
                      .+|...|+++++-|+.
T Consensus       196 q~ak~eyNe~Vr~lv~  211 (508)
T KOG0717|consen  196 QYARQEYNETVRNLVG  211 (508)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4566666776666654


No 229
>KOG1103 consensus Predicted coiled-coil protein [Function unknown]
Probab=33.17  E-value=1.9e+02  Score=34.45  Aligned_cols=12  Identities=42%  Similarity=0.722  Sum_probs=5.9

Q ss_pred             hhhhhHHHHHHh
Q 001172          458 RKGKEKKREERS  469 (1133)
Q Consensus       458 rk~~e~~~ee~~  469 (1133)
                      +|.++.++.|++
T Consensus       223 ~k~eee~aaERe  234 (561)
T KOG1103|consen  223 KKGEEEAAAERE  234 (561)
T ss_pred             CCChHHHHHHHh
Confidence            345555555544


No 230
>KOG2573 consensus Ribosome biogenesis protein - Nop56p/Sik1p [RNA processing and modification; Translation, ribosomal structure and biogenesis]
Probab=33.02  E-value=31  Score=41.29  Aligned_cols=13  Identities=38%  Similarity=0.588  Sum_probs=5.7

Q ss_pred             HHHhhHHHHHhhh
Q 001172          229 ERRGKLGRLIEDK  241 (1133)
Q Consensus       229 d~~~hLG~LLqS~  241 (1133)
                      +-++.|..-|.++
T Consensus       278 eyRk~L~~YL~~K  290 (498)
T KOG2573|consen  278 EYRKQLSDYLKDK  290 (498)
T ss_pred             HHHHHHHHHHHHH
Confidence            3444444444443


No 231
>PF02029 Caldesmon:  Caldesmon;  InterPro: IPR006018  This group of proteins includes two protein families: caldesmon and lymphocyte specific protein.  Caldesmon (CDM) is an actin- and myosin-binding protein implicated in the regulation of actomyosin interactions in smooth muscle and non-muscle cells, possibly acting as a bridge between myosin and actin filaments []. CDM is believed to be an elongated molecule, with an N-terminal myosin/calmodulin- binding domain and a C-terminal tropomyosin/actin/calmodulin-binding domain, separated by a 40nm-long central helix []. A high-molecular-weight form of CDM is predominantly expressed in smooth muscles, while a low-molecular-weight form is widely distributed in non- muscle tissues and cells (the protein is not expressed in skeletal muscle or heart). 
Probab=32.99  E-value=1.7e+02  Score=36.41  Aligned_cols=7  Identities=43%  Similarity=0.790  Sum_probs=4.4

Q ss_pred             cCCCccc
Q 001172          594 KGNSLAN  600 (1133)
Q Consensus       594 ~~~~~~~  600 (1133)
                      |||.|+.
T Consensus       429 ~g~v~~~  435 (492)
T PF02029_consen  429 KGNVFSS  435 (492)
T ss_pred             cCCcccC
Confidence            6676644


No 232
>PF00769 ERM:  Ezrin/radixin/moesin family;  InterPro: IPR011259 The ERM family consists of three closely-related proteins, ezrin, radixin and moesin []. Ezrin was first identified as a constituent of microvilli [], radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions [], and moesin as a heparin binding protein []. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain; an extended alpha-helical domain; and a charged C-terminal domain []. Ezrin, radixin and merlin also contain a polyproline region between the helical and C-terminal domains. The N-terminal domain is highly conserved, and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily. ERM proteins crosslink actin filaments with plasma membranes. They co-localise with CD44 at actin filament-plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains [].; GO: 0008092 cytoskeletal protein binding, 0005737 cytoplasm, 0019898 extrinsic to membrane; PDB: 2I1J_A 2I1K_A 1E5W_A 1EF1_C.
Probab=32.87  E-value=4.1e+02  Score=29.94  Aligned_cols=20  Identities=25%  Similarity=0.140  Sum_probs=11.8

Q ss_pred             HHHhhHHhhhHHHHHHHhhh
Q 001172          379 ERRLTELGRRTVEIFVLAHI  398 (1133)
Q Consensus       379 e~~~~Elgrr~~~ifa~~k~  398 (1133)
                      |+...||-+|+++|=--.+.
T Consensus         4 Er~k~Ele~rL~q~eee~~~   23 (246)
T PF00769_consen    4 EREKQELEERLRQMEEEMRR   23 (246)
T ss_dssp             HHHCHHHHHHHHHHHHHHHH
T ss_pred             hHHHHHHHHHHHHHHHHHHH
Confidence            44556677777776653333


No 233
>KOG2062 consensus 26S proteasome regulatory complex, subunit RPN2/PSMD1 [Posttranslational modification, protein turnover, chaperones]
Probab=32.81  E-value=16  Score=46.57  Aligned_cols=18  Identities=44%  Similarity=0.427  Sum_probs=10.4

Q ss_pred             HHHHHHHHHHHHHHHHhh
Q 001172          440 AEKEKKAKKKLAKQKRNN  457 (1133)
Q Consensus       440 k~kekk~~~k~ak~~e~~  457 (1133)
                      ..|.|++.++.+||++.+
T Consensus       793 t~kakar~k~~~~ek~~~  810 (929)
T KOG2062|consen  793 TAKAKARAKKEAKEKEPN  810 (929)
T ss_pred             hhhhhhhhhhhhhhcccc
Confidence            555666666666665433


No 234
>KOG4739 consensus Uncharacterized protein involved in synaptonemal complex formation [Cell cycle control, cell division, chromosome partitioning; General function prediction only]
Probab=32.65  E-value=3.6e+02  Score=30.62  Aligned_cols=10  Identities=20%  Similarity=0.345  Sum_probs=3.9

Q ss_pred             ccccccccCC
Q 001172          598 LANYQNQKSP  607 (1133)
Q Consensus       598 ~~~~~~~~s~  607 (1133)
                      |+-.++.+.|
T Consensus       220 F~~~~~~~~~  229 (233)
T KOG4739|consen  220 FSPVRVHKLP  229 (233)
T ss_pred             cchhhhcccc
Confidence            3333444433


No 235
>PRK12585 putative monovalent cation/H+ antiporter subunit G; Reviewed
Probab=32.57  E-value=1.2e+02  Score=33.47  Aligned_cols=11  Identities=0%  Similarity=0.253  Sum_probs=4.9

Q ss_pred             HHHHhhHHhhh
Q 001172          378 DERRLTELGRR  388 (1133)
Q Consensus       378 ee~~~~Elgrr  388 (1133)
                      ++|+....+++
T Consensus       108 ~~~~k~~~~~~  118 (197)
T PRK12585        108 RSVKKDDIKKK  118 (197)
T ss_pred             HHHHhhhhhhc
Confidence            44554444433


No 236
>COG5493 Uncharacterized conserved protein containing a coiled-coil domain [Function unknown]
Probab=32.53  E-value=4.8e+02  Score=29.16  Aligned_cols=24  Identities=17%  Similarity=0.251  Sum_probs=12.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          401 NKIEVAYQEAVALKRQEELIREEEA  425 (1133)
Q Consensus       401 ~rv~~Ay~E~~A~~rQ~~Ll~E~E~  425 (1133)
                      +|-+..| |.++.+.|-+-|.+.+.
T Consensus        24 ~Rpeily-e~l~kL~pwq~latk~d   47 (231)
T COG5493          24 QRPEILY-EVLAKLTPWQQLATKQD   47 (231)
T ss_pred             hChHHHH-HHHHhhchHHHHHHHHH
Confidence            5666666 45555555544444333


No 237
>PF15086 UPF0542:  Uncharacterised protein family UPF0542
Probab=32.49  E-value=64  Score=30.46  Aligned_cols=14  Identities=14%  Similarity=-0.044  Sum_probs=8.6

Q ss_pred             HHHHHHHHHHHHHH
Q 001172          417 EELIREEEAAWLAE  430 (1133)
Q Consensus       417 ~~Ll~E~E~e~~~~  430 (1133)
                      -+|..|+|.+|+++
T Consensus        42 wkLaK~ie~~ere~   55 (74)
T PF15086_consen   42 WKLAKAIEKEEREK   55 (74)
T ss_pred             HHHHHHHHHHHHHH
Confidence            46777776665543


No 238
>cd03404 Band_7_HflK Band_7_HflK: The band 7 domain of flotillin (reggie) like proteins. This group includes proteins similar to prokaryotic HlfK (High frequency of lysogenization K). Although many members of the band 7 family are lipid raft associated, prokaryote plasma membranes lack cholesterol and are unlikely to have lipid raft domains.  Individual proteins of this band 7 domain family may cluster to form membrane microdomains which may in turn recruit multiprotein complexes. Escherichia coli HflK is an integral membrane protein which may localize to the plasma membrane. HflK associates with another band 7 family member (HflC) to form an HflKC complex.  HflKC interacts with FtsH in a large complex termed the FtsH holo-enzyme. FtsH is an AAA ATP-dependent protease which exerts progressive proteolysis against membrane-embedded and soluble substrate proteins.  HflKC can modulate the activity of FtsH. HflKC plays a role in the decision between lysogenic and lytic cycle growth during la
Probab=32.34  E-value=1.6e+02  Score=32.60  Aligned_cols=17  Identities=24%  Similarity=0.329  Sum_probs=10.4

Q ss_pred             CccCCEEEEEEEEEEee
Q 001172          177 FKDGDTLIIKAQVQVIR  193 (1133)
Q Consensus       177 FLvDDsLTIeCdVtVIk  193 (1133)
                      ++..|...|.+++.|.-
T Consensus        80 v~T~D~~~v~vd~~v~y   96 (266)
T cd03404          80 MLTGDENIVDVEFAVQY   96 (266)
T ss_pred             eEeCCCCEEEEEEEEEE
Confidence            34556667777766643


No 239
>cd02671 Peptidase_C19O A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=32.33  E-value=13  Score=42.82  Aligned_cols=34  Identities=21%  Similarity=0.097  Sum_probs=29.3

Q ss_pred             ceeeecCCchHHHHhhhhhHHHhhHHHHHhhhccccccchhhh
Q 001172          210 ELVRVYLTNVEQICRRFVEERRGKLGRLIEDKARWSSFCAFWL  252 (1133)
Q Consensus       210 ~tI~VPpSNLgaTC~~fved~~~hLG~LLqS~~~~ssFRaf~~  252 (1133)
                      ....||+.|+|.||         ||+.+||...+...||..+.
T Consensus        21 ~~~~~GL~NlGnTC---------YmNSvLQ~L~~~p~fr~~l~   54 (332)
T cd02671          21 LLPFVGLNNLGNTC---------YLNSVLQVLYFCPGFKHGLK   54 (332)
T ss_pred             CCCCcceeccCceE---------eHHHHHHHHHcChHHHHHHH
Confidence            46789999999999         99999999998778876543


No 240
>TIGR01933 hflK HflK protein. HflK and HflC are paralogs encoded by tandem genes in Proteobacteria, spirochetes, and some other bacterial lineages. The HflKC complex is anchored in the membrane and exposed to the periplasm. The complex is not active as a protease, but rather binds to and appears to modulate the ATP-dependent protease FtsH. The overall function of HflKC is not fully described.//Regulation of FtsH by HflKC appears to be negative (PubMed:8947034,PubMed:96367)
Probab=32.13  E-value=2.5e+02  Score=31.10  Aligned_cols=15  Identities=27%  Similarity=0.437  Sum_probs=8.5

Q ss_pred             ccCCEEEEEEEEEEe
Q 001172          178 KDGDTLIIKAQVQVI  192 (1133)
Q Consensus       178 LvDDsLTIeCdVtVI  192 (1133)
                      +..|...|.+++.|.
T Consensus        54 ~T~D~~~v~vd~~v~   68 (261)
T TIGR01933        54 LTGDENIVNVEMNVQ   68 (261)
T ss_pred             EeCCCCEEEEEEEEE
Confidence            445555666666554


No 241
>PRK12704 phosphodiesterase; Provisional
Probab=31.82  E-value=2.5e+02  Score=34.97  Aligned_cols=8  Identities=38%  Similarity=0.596  Sum_probs=3.0

Q ss_pred             ccceeeec
Q 001172          584 VPSVVMYG  591 (1133)
Q Consensus       584 ~~~~~~~g  591 (1133)
                      +|+-=|.|
T Consensus       216 lp~d~mkg  223 (520)
T PRK12704        216 LPNDEMKG  223 (520)
T ss_pred             cCCchhhc
Confidence            44333433


No 242
>KOG2505 consensus Ankyrin repeat protein [General function prediction only]
Probab=31.80  E-value=81  Score=38.98  Aligned_cols=14  Identities=21%  Similarity=0.470  Sum_probs=9.5

Q ss_pred             CCccC-CEEEEEEEE
Q 001172          176 GFKDG-DTLIIKAQV  189 (1133)
Q Consensus       176 GFLvD-DsLTIeCdV  189 (1133)
                      .||.. |+|.|+|--
T Consensus       289 ~~Le~~~~IFiRA~~  303 (591)
T KOG2505|consen  289 KYLEKADLIFIRAPA  303 (591)
T ss_pred             HHHccCCeEEEEccc
Confidence            46654 888888753


No 243
>PF07543 PGA2:  Protein trafficking PGA2;  InterPro: IPR011431 A Saccharomyces cerevisiae (Baker's yeast) member of this family (PGA2, P53903 from SWISSPROT) is a single pass membrane protein which has been implicated in protein trafficking [, ].
Probab=31.79  E-value=1.3e+02  Score=31.50  Aligned_cols=25  Identities=24%  Similarity=0.325  Sum_probs=12.2

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          396 AHIFSNKIEVAYQEAVALKRQEELIREEE  424 (1133)
Q Consensus       396 ~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E  424 (1133)
                      +.||=    -=|--+++.+.|.+=++++.
T Consensus        26 gYiLl----RPY~~kl~~k~~~kq~eke~   50 (140)
T PF07543_consen   26 GYILL----RPYFRKLAAKDQKKQLEKEK   50 (140)
T ss_pred             HHHHH----HHHHHHHHHHHHHHHHHHHH
Confidence            66663    23334555555555444443


No 244
>PF15295 CCDC50_N:  Coiled-coil domain-containing protein 50  N-terminus
Probab=31.75  E-value=4.8e+02  Score=27.38  Aligned_cols=9  Identities=33%  Similarity=0.124  Sum_probs=5.8

Q ss_pred             HHHHhhhhh
Q 001172          392 IFVLAHIFS  400 (1133)
Q Consensus       392 ifa~~k~F~  400 (1133)
                      =++||+.++
T Consensus        26 DgaLA~~LQ   34 (132)
T PF15295_consen   26 DGALAHRLQ   34 (132)
T ss_pred             hHHHHHHHH
Confidence            346677776


No 245
>KOG4369 consensus RTK signaling protein MASK/UNC-44 [Signal transduction mechanisms]
Probab=31.74  E-value=30  Score=46.01  Aligned_cols=27  Identities=22%  Similarity=0.315  Sum_probs=20.4

Q ss_pred             cccCCchhhhhhhhcccccccccccccc
Q 001172          973 VDEFPHLDIINDLLDDEHGVGMAAGAST 1000 (1133)
Q Consensus       973 ~dEFPHLDIINDLL~eeq~~g~~~~~~~ 1000 (1133)
                      .-|||||- --|||--|-+.|-|+..++
T Consensus      1931 s~~y~~~~-~~~l~h~e~~~g~~~~~~s 1957 (2131)
T KOG4369|consen 1931 SNLYNHLA-AQLLTHQESSTGAPGPTSS 1957 (2131)
T ss_pred             cccchhHH-HHHHHhhhccCCCCCCCCC
Confidence            35889985 4688888888887777555


No 246
>PF04747 DUF612:  Protein of unknown function, DUF612;  InterPro: IPR006836 This family includes several uncharacterised proteins from Caenorhabditis elegans.
Probab=31.50  E-value=1.9e+02  Score=34.81  Aligned_cols=24  Identities=25%  Similarity=0.347  Sum_probs=10.8

Q ss_pred             HHHHHHhhhhhHHHHHHHHHHHHH
Q 001172          390 VEIFVLAHIFSNKIEVAYQEAVAL  413 (1133)
Q Consensus       390 ~~ifa~~k~F~~rv~~Ay~E~~A~  413 (1133)
                      |-.|.|+-.|-.-|-.--+|.||-
T Consensus        34 frqwlltavlpnsind~r~e~fas   57 (510)
T PF04747_consen   34 FRQWLLTAVLPNSINDKRKEAFAS   57 (510)
T ss_pred             HHHHHHHHhccccccHHHHHHhhh
Confidence            333444444444444444555553


No 247
>KOG2484 consensus GTPase [General function prediction only]
Probab=31.38  E-value=41  Score=40.47  Aligned_cols=18  Identities=28%  Similarity=0.405  Sum_probs=10.1

Q ss_pred             CCCcccCCCccceeeecCc
Q 001172          575 SSSTCSTDSVPSVVMYGPY  593 (1133)
Q Consensus       575 ~sstcs~ds~~~~~~~g~~  593 (1133)
                      .+|.|.+-+.-+- +.|.|
T Consensus       226 ~~s~c~gae~l~~-~lgny  243 (435)
T KOG2484|consen  226 QSSVCFGAETLMK-VLGNY  243 (435)
T ss_pred             ccchhhhHHHHHH-HhcCc
Confidence            4677776664422 45555


No 248
>PHA02571 a-gt.4 hypothetical protein; Provisional
Probab=31.07  E-value=1.2e+02  Score=30.57  Aligned_cols=37  Identities=41%  Similarity=0.406  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHHh--hhHHHHHHHHHHHHHH
Q 001172          417 EELIREEEAAWLAESEQKAKR--GAAEKEKKAKKKLAKQ  453 (1133)
Q Consensus       417 ~~Ll~E~E~e~~~~~e~~~kk--~~k~kekk~~~k~ak~  453 (1133)
                      ++++.++++.-+++.+++|.|  +|..+|=||+++-|.+
T Consensus        17 ee~~~~~q~~~e~eA~kkA~K~lkKN~rEIkRL~~HAe~   55 (109)
T PHA02571         17 EELLSELQARNEAEAEKKAAKILKKNRREIKRLKKHAEE   55 (109)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHH


No 249
>COG5192 BMS1 GTP-binding protein required for 40S ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=30.84  E-value=58  Score=40.77  Aligned_cols=29  Identities=31%  Similarity=0.406  Sum_probs=18.0

Q ss_pred             HHHHHHHHHHHHHHHHhhhhhhhHHHHHH
Q 001172          440 AEKEKKAKKKLAKQKRNNRKGKEKKREER  468 (1133)
Q Consensus       440 k~kekk~~~k~ak~~e~~rk~~e~~~ee~  468 (1133)
                      ++||++.-++.||+||-.+||+|++...+
T Consensus      1030 ~~ke~~~s~~r~k~~~i~k~e~er~qr~r 1058 (1077)
T COG5192        1030 EEKERMESLQRAKEEEIGKKEKEREQRIR 1058 (1077)
T ss_pred             HHHHHHHHHHHHHHHHHhhhHHHHHHHHH
Confidence            56666666666677776666666554443


No 250
>cd02657 Peptidase_C19A A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyse bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=30.66  E-value=25  Score=39.12  Aligned_cols=33  Identities=15%  Similarity=0.125  Sum_probs=28.2

Q ss_pred             cCCchHHHHhhhhhHHHhhHHHHHhhhccccccchhhhccCH
Q 001172          215 YLTNVEQICRRFVEERRGKLGRLIEDKARWSSFCAFWLGIDQ  256 (1133)
Q Consensus       215 PpSNLgaTC~~fved~~~hLG~LLqS~~~~ssFRaf~~a~~~  256 (1133)
                      |+.|+|.||         ||+.+||...+...||.+++....
T Consensus         1 Gl~N~GntC---------y~NsvLQ~L~~~~~f~~~~~~~~~   33 (305)
T cd02657           1 GLTNLGNTC---------YLNSTLQCLRSVPELRDALKNYNP   33 (305)
T ss_pred             Ccccccchh---------HHHHHHHHHhCCHHHHHHHHhccc
Confidence            467999999         999999999998889888766543


No 251
>PF04747 DUF612:  Protein of unknown function, DUF612;  InterPro: IPR006836 This family includes several uncharacterised proteins from Caenorhabditis elegans.
Probab=30.52  E-value=2.7e+02  Score=33.64  Aligned_cols=14  Identities=36%  Similarity=0.620  Sum_probs=7.2

Q ss_pred             cccCCCCCCCCCCC
Q 001172          603 NQKSPSRGKNQRGK  616 (1133)
Q Consensus       603 ~~~s~~rg~~~~~~  616 (1133)
                      -|--|-.-||..+|
T Consensus       250 q~a~p~~kk~kk~k  263 (510)
T PF04747_consen  250 QQAAPQEKKNKKNK  263 (510)
T ss_pred             hhcCcchhhccccc
Confidence            34445555565554


No 252
>COG2811 NtpF Archaeal/vacuolar-type H+-ATPase subunit H [Energy production and conversion]
Probab=30.45  E-value=3.2e+02  Score=27.72  Aligned_cols=17  Identities=18%  Similarity=0.243  Sum_probs=7.8

Q ss_pred             HHHHHHHHHHHHHHHHH
Q 001172          413 LKRQEELIREEEAAWLA  429 (1133)
Q Consensus       413 ~~rQ~~Ll~E~E~e~~~  429 (1133)
                      .+.|+++|.|-.++.++
T Consensus        27 kEe~~~~i~eAr~eare   43 (108)
T COG2811          27 KEEAEQIIKEAREEARE   43 (108)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            44455555544444333


No 253
>KOG4722 consensus Zn-finger protein [General function prediction only]
Probab=30.25  E-value=3.8e+02  Score=32.81  Aligned_cols=96  Identities=22%  Similarity=0.237  Sum_probs=0.0

Q ss_pred             hHHhhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh---------HHHHHHHHHHH-HH
Q 001172          383 TELGRRTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKRGA---------AEKEKKAKKKL-AK  452 (1133)
Q Consensus       383 ~Elgrr~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~---------k~kekk~~~k~-ak  452 (1133)
                      .|.-.+--++++-..-.++|+..--.|-  +.||++.|.|-|.=.-..+.-+++|+.         ++.|-...+|| ++
T Consensus       334 Lea~NKRhdlLaK~~e~Eekl~E~nee~--~~r~ee~~a~dea~~e~~~a~eAekqa~~eE~L~k~~eqEa~ienkq~~~  411 (672)
T KOG4722|consen  334 LEADNKRHDLLAKDAEVEEKLLEKNEEF--LNRFEEFLAEDEAFFEIFEAFEAEKQAFFEEFLDKFFEQEACIENKQLAD  411 (672)
T ss_pred             HHHhhhHHHHHHhhHHHHHHHHHHHHHH--HHHHHHHhhhhHHHHHHHHHHhHHHHHhHHHHHHHHHHHHHHHHHHHHHH


Q ss_pred             HHHhhhhhhhHHHHHHhhhhhhhhhccc
Q 001172          453 QKRNNRKGKEKKREERSSMALSDRLEDE  480 (1133)
Q Consensus       453 ~~e~~rk~~e~~~ee~~~~~~~~~~~~~  480 (1133)
                      |+.+.-+.+||+++..++.|+.-..+.|
T Consensus       412 EkARedaa~EkA~dkeEq~AA~TAaq~e  439 (672)
T KOG4722|consen  412 EKAREDAAEEKAADKEEQEAAATAAQAE  439 (672)
T ss_pred             HHhhhhHHHHHhhhhHHHHHHHHHHHHH


No 254
>PF02029 Caldesmon:  Caldesmon;  InterPro: IPR006018  This group of proteins includes two protein families: caldesmon and lymphocyte specific protein.  Caldesmon (CDM) is an actin- and myosin-binding protein implicated in the regulation of actomyosin interactions in smooth muscle and non-muscle cells, possibly acting as a bridge between myosin and actin filaments []. CDM is believed to be an elongated molecule, with an N-terminal myosin/calmodulin- binding domain and a C-terminal tropomyosin/actin/calmodulin-binding domain, separated by a 40nm-long central helix []. A high-molecular-weight form of CDM is predominantly expressed in smooth muscles, while a low-molecular-weight form is widely distributed in non- muscle tissues and cells (the protein is not expressed in skeletal muscle or heart). 
Probab=29.98  E-value=2.6e+02  Score=34.88  Aligned_cols=29  Identities=17%  Similarity=0.142  Sum_probs=16.4

Q ss_pred             cCCCCCCCCCcc-ccccCCCCCCCCccccC
Q 001172          494 AQPLPEKPDVLE-DVSDVSDSVDGGAEVLQ  522 (1133)
Q Consensus       494 ~~~~~~k~d~~e-~~sd~sd~~d~~~e~~~  522 (1133)
                      +|..+.|.--+- -||-|.|-|.--+-.++
T Consensus       369 ~~k~~~k~~~~~~~vskid~rleqyt~a~~  398 (492)
T PF02029_consen  369 IQKSSVKKSHPPAVVSKIDDRLEQYTSAIE  398 (492)
T ss_pred             HhhcCCcccCCCcccccchhhHHHHHHHhh
Confidence            444444543333 57777777766665555


No 255
>KOG0783 consensus Uncharacterized conserved protein, contains ankyrin and BTB/POZ domains [Function unknown]
Probab=29.87  E-value=21  Score=45.96  Aligned_cols=57  Identities=16%  Similarity=0.137  Sum_probs=39.1

Q ss_pred             cchhhhhhhhhhhcchh---hhhhhHHHHHHhhcccc-------cccccc-------------cccccccccccCCCCCC
Q 001172          266 KTDAILKVVVKHFFIEK---EVTSTLVMDSLYSGLKA-------LEGQSK-------------SKKTKAKLLDAEDTPPP  322 (1133)
Q Consensus       266 ~sDVilkV~~k~Ff~~k---eVtSplvmd~L~~glk~-------le~~t~-------------~~k~~~~~~~t~e~p~~  322 (1133)
                      ..||+|.|++..|.+||   ++||+.|-+-|....+.       +|+.+.             -|.-++.+||||-+-.|
T Consensus       558 ~hDVtf~vg~~~F~aHKfIl~~rs~flrkL~l~~~~~s~~~dIY~~~~~~~~~~~~ve~i~p~mfe~lL~~iYtdt~~~P  637 (1267)
T KOG0783|consen  558 FHDVTFYVGTSMFHAHKFILCARSSFLRKLLLQKKKSSVSNDIYIEEITQSHSTIRVEDIPPLMFEILLHYIYTDTLLSP  637 (1267)
T ss_pred             cceEEEEecCeecccceEEEEeccHHHHHHHHhhccccccceeeeecccccCceeeeccCCHHHHHHHHHHHhcccccCC
Confidence            38999999999999999   66999777655554444       222221             13445778899865444


No 256
>PF14235 DUF4337:  Domain of unknown function (DUF4337)
Probab=29.82  E-value=2.3e+02  Score=30.08  Aligned_cols=27  Identities=26%  Similarity=0.244  Sum_probs=13.2

Q ss_pred             HHHHHHHHHHHHHHhhhhhhhHHHHHH
Q 001172          442 KEKKAKKKLAKQKRNNRKGKEKKREER  468 (1133)
Q Consensus       442 kekk~~~k~ak~~e~~rk~~e~~~ee~  468 (1133)
                      +||++-++++++-+++.|+.|+++|+.
T Consensus        80 ~~~~~~~~e~~~l~~~A~~~e~~~d~~  106 (157)
T PF14235_consen   80 KEKARYKSEAEELEAKAKEAEAESDHA  106 (157)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhHHHH
Confidence            334444444555555555555555544


No 257
>KOG2457 consensus A/G-specific adenine DNA glycosylase [Replication, recombination and repair]
Probab=29.71  E-value=48  Score=39.65  Aligned_cols=21  Identities=24%  Similarity=0.426  Sum_probs=13.3

Q ss_pred             cccCCcccccCCchhhhhhhh
Q 001172          966 RQTQGVLVDEFPHLDIINDLL  986 (1133)
Q Consensus       966 ~Q~qg~v~dEFPHLDIINDLL  986 (1133)
                      +|--|-+.+-|-|+-.--|.|
T Consensus       443 ~~~~G~~~htfshi~~ts~V~  463 (555)
T KOG2457|consen  443 DLFKGQAKHTFSHICVTSDVL  463 (555)
T ss_pred             hhcccchhhhhhhhhhHHHHH
Confidence            555666667777766665554


No 258
>KOG0994 consensus Extracellular matrix glycoprotein Laminin subunit beta [Extracellular structures]
Probab=29.66  E-value=5.2e+02  Score=35.49  Aligned_cols=18  Identities=22%  Similarity=0.231  Sum_probs=7.4

Q ss_pred             HhhHHhhhHHHHHHHhhh
Q 001172          381 RLTELGRRTVEIFVLAHI  398 (1133)
Q Consensus       381 ~~~Elgrr~~~ifa~~k~  398 (1133)
                      ++-||+++..+.=..++.
T Consensus      1627 ~~e~lk~~~~qns~~A~~ 1644 (1758)
T KOG0994|consen 1627 RMEELKHKAAQNSAEAKQ 1644 (1758)
T ss_pred             HHHHHHHHHHhccHHHHH
Confidence            333444444444433333


No 259
>PTZ00436 60S ribosomal protein L19-like protein; Provisional
Probab=29.66  E-value=3.6e+02  Score=31.77  Aligned_cols=7  Identities=0%  Similarity=0.040  Sum_probs=3.2

Q ss_pred             HHHhhhc
Q 001172          236 RLIEDKA  242 (1133)
Q Consensus       236 ~LLqS~~  242 (1133)
                      ++|.++.
T Consensus        13 sVL~cGk   19 (357)
T PTZ00436         13 DILRCGR   19 (357)
T ss_pred             HHhCCCC
Confidence            4444444


No 260
>PF12999 PRKCSH-like:  Glucosidase II beta subunit-like
Probab=29.66  E-value=1.9e+02  Score=31.45  Aligned_cols=35  Identities=31%  Similarity=0.493  Sum_probs=17.5

Q ss_pred             HHHhhHHhhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHH
Q 001172          379 ERRLTELGRRTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIRE  422 (1133)
Q Consensus       379 e~~~~Elgrr~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E  422 (1133)
                      .-.|.|+|+......       +.....|++++..  +++||++
T Consensus       116 ~N~C~e~~~~~~~~~-------~~~~~~~~~G~~~--r~~~i~~  150 (176)
T PF12999_consen  116 PNTCAELGKEYREEL-------EEEEEIYKEGLKI--RQELIEE  150 (176)
T ss_pred             ccHHHHHHHHHHHHH-------HHHHHHHHHHHHH--HHHHHHH
Confidence            456777775544332       2334556666532  3355543


No 261
>KOG2668 consensus Flotillins [Intracellular trafficking, secretion, and vesicular transport; Cytoskeleton]
Probab=29.48  E-value=2.1e+02  Score=34.30  Aligned_cols=41  Identities=22%  Similarity=0.101  Sum_probs=18.8

Q ss_pred             HHHHhhhHHHHHHHHHHHHHHHHh-hhhhhhHHHHHHhhhhh
Q 001172          433 QKAKRGAAEKEKKAKKKLAKQKRN-NRKGKEKKREERSSMAL  473 (1133)
Q Consensus       433 ~~~kk~~k~kekk~~~k~ak~~e~-~rk~~e~~~ee~~~~~~  473 (1133)
                      |..|=++.+|++--+++||..+.- .++++|.-+-|+..+++
T Consensus       293 r~~klaEAnk~~~~~qaqAEA~~irk~geAEA~~ieA~akae  334 (428)
T KOG2668|consen  293 RETKLAEANKELYNKQAQAEAELIRKQGEAEAFAIEADAKAE  334 (428)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHhhhhhH
Confidence            344444455555555555544432 23444444444444443


No 262
>cd02658 Peptidase_C19B A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=29.31  E-value=21  Score=39.91  Aligned_cols=30  Identities=20%  Similarity=0.165  Sum_probs=25.6

Q ss_pred             cCCchHHHHhhhhhHHHhhHHHHHhhhccccccchhhhc
Q 001172          215 YLTNVEQICRRFVEERRGKLGRLIEDKARWSSFCAFWLG  253 (1133)
Q Consensus       215 PpSNLgaTC~~fved~~~hLG~LLqS~~~~ssFRaf~~a  253 (1133)
                      |+.|+|.||         ||+.+||...+...||.+++.
T Consensus         1 GL~NlGNTC---------Y~NsvLQ~L~~~~~f~~~l~~   30 (311)
T cd02658           1 GLRNLGNSC---------YLNSVLQVLFSIPSFQWRYDD   30 (311)
T ss_pred             CcccCCcch---------HHHHHHHHHHCCHHHHHHHhh
Confidence            578999999         999999999988888876543


No 263
>KOG2270 consensus Serine/threonine protein kinase involved in cell cycle control [Signal transduction mechanisms; Cell cycle control, cell division, chromosome partitioning]
Probab=28.99  E-value=40  Score=40.73  Aligned_cols=14  Identities=21%  Similarity=0.520  Sum_probs=7.8

Q ss_pred             CCEEEEEEEEEEee
Q 001172          180 GDTLIIKAQVQVIR  193 (1133)
Q Consensus       180 DDsLTIeCdVtVIk  193 (1133)
                      ---+.++-+|-|+.
T Consensus       231 PePIlLk~hVLVM~  244 (520)
T KOG2270|consen  231 PEPILLKNHVLVME  244 (520)
T ss_pred             CCceeeecceEeee
Confidence            34555666666654


No 264
>KOG1156 consensus N-terminal acetyltransferase [Chromatin structure and dynamics]
Probab=28.88  E-value=57  Score=41.30  Aligned_cols=32  Identities=38%  Similarity=0.544  Sum_probs=20.8

Q ss_pred             HHHHHHHHHHHHHHHhhhhhhhHHHHHHhhhh
Q 001172          441 EKEKKAKKKLAKQKRNNRKGKEKKREERSSMA  472 (1133)
Q Consensus       441 ~kekk~~~k~ak~~e~~rk~~e~~~ee~~~~~  472 (1133)
                      +.+||.++|+.|+++|+.|+++++.++....+
T Consensus       584 ~e~kk~~~k~rk~~kk~~~e~~~~~~~~~~~~  615 (700)
T KOG1156|consen  584 DEEKKIKKKQRKAKKKAKKEAKKKKDKKKKEA  615 (700)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence            44666777777777777777776665544433


No 265
>PRK07720 fliJ flagellar biosynthesis chaperone; Validated
Probab=28.72  E-value=6.4e+02  Score=25.68  Aligned_cols=14  Identities=0%  Similarity=0.081  Sum_probs=7.1

Q ss_pred             HHHHHHHHHHHHHH
Q 001172          402 KIEVAYQEAVALKR  415 (1133)
Q Consensus       402 rv~~Ay~E~~A~~r  415 (1133)
                      .....+.+.+....
T Consensus        71 ~fl~~L~~~i~~q~   84 (146)
T PRK07720         71 QFVTNLERTIDHYQ   84 (146)
T ss_pred             HHHHHHHHHHHHHH
Confidence            44555555554443


No 266
>KOG4591 consensus Uncharacterized conserved protein, contains BTB/POZ domain [General function prediction only]
Probab=28.55  E-value=20  Score=39.43  Aligned_cols=89  Identities=18%  Similarity=0.085  Sum_probs=52.5

Q ss_pred             cchhhhhhh---hhhhcchhhh---hhhHHHHHHhh----ccccccccc-ccccccccccccCCCCCCceEeecceeeeh
Q 001172          266 KTDAILKVV---VKHFFIEKEV---TSTLVMDSLYS----GLKALEGQS-KSKKTKAKLLDAEDTPPPIVHVENDMFVLV  334 (1133)
Q Consensus       266 ~sDVilkV~---~k~Ff~~keV---tSplvmd~L~~----glk~le~~t-~~~k~~~~~~~t~e~p~~~~~v~~d~~~l~  334 (1133)
                      .+|+.||++   .|+--+||=|   |+-..-=+--+    ...++..+. .-+..++|.|||||+.-.+    .|.|  .
T Consensus        66 fSDlk~K~~gns~k~i~AHKfVLAARsD~WkfaN~~dekse~~~~dDad~Ea~~t~iRWIYTDEidfk~----dD~~--L  139 (280)
T KOG4591|consen   66 FSDLKFKFAGNSDKHIPAHKFVLAARSDFWKFANGGDEKSEELDLDDADFEAFHTAIRWIYTDEIDFKE----DDEF--L  139 (280)
T ss_pred             ccceeEEecCCccccCchhhhhhhhhcchhhhccCCCcchhhhcccccCHHHHHHhheeeecccccccc----chHH--H
Confidence            589999999   6888889955   44311100000    111222221 0155678999999975222    2222  3


Q ss_pred             hhHHHHHHHhhcCCCCCCCCCCCcccccc
Q 001172          335 DDVLLLLERAALEPLPPKDEKGPQNRTKE  363 (1133)
Q Consensus       335 ~Dll~~leR~al~~Lp~~d~~~~q~~~~~  363 (1133)
                      -.|+.++.|+.|+-|+.   -|.+.+-.-
T Consensus       140 ~el~e~An~FqLe~Lke---~C~k~l~a~  165 (280)
T KOG4591|consen  140 LELCELANRFQLELLKE---RCEKGLGAL  165 (280)
T ss_pred             HHHHHHHHHHHHHHHHH---HHHHHHhhH
Confidence            46778999999999976   466655443


No 267
>KOG1962 consensus B-cell receptor-associated protein and related proteins [Defense mechanisms]
Probab=28.53  E-value=2.5e+02  Score=31.46  Aligned_cols=8  Identities=38%  Similarity=0.293  Sum_probs=2.9

Q ss_pred             hHHHHHHH
Q 001172          336 DVLLLLER  343 (1133)
Q Consensus       336 Dll~~leR  343 (1133)
                      +++.++=|
T Consensus        88 ~~~~~l~r   95 (216)
T KOG1962|consen   88 HLLEALFR   95 (216)
T ss_pred             HHHHHHHH
Confidence            33333333


No 268
>KOG4657 consensus Uncharacterized conserved protein [Function unknown]
Probab=28.27  E-value=7.3e+02  Score=28.30  Aligned_cols=12  Identities=25%  Similarity=0.268  Sum_probs=4.9

Q ss_pred             cCCCCccCCCcc
Q 001172          553 VCNLSSVPNGVT  564 (1133)
Q Consensus       553 ~~~~~~~~n~~~  564 (1133)
                      +.++-+..|.+.
T Consensus       207 L~gl~~a~s~vr  218 (246)
T KOG4657|consen  207 LQGLTSAISAVR  218 (246)
T ss_pred             cccHHHHHHHhh
Confidence            344444444433


No 269
>TIGR01147 V_ATP_synt_G vacuolar ATP synthase, subunit G. This model describes the vacuolar ATP synthase G subunit in eukaryotes and includes members from diverse groups e.g., fungi, plants, parasites etc. V-ATPases are multi-subunit enzymes composed of two functional domains: A transmembrane Vo domain and a peripheral catalytic domain V1. The G subunit is one of the subunits of the catalytic domain. V-ATPases are responsible for the acidification of endosomes and lysosomes, which are part of the central vacuolar system.
Probab=28.11  E-value=2.4e+02  Score=28.66  Aligned_cols=26  Identities=27%  Similarity=0.337  Sum_probs=12.6

Q ss_pred             HHHHHHHHHHHHHhhhhhhhHHHHHH
Q 001172          443 EKKAKKKLAKQKRNNRKGKEKKREER  468 (1133)
Q Consensus       443 ekk~~~k~ak~~e~~rk~~e~~~ee~  468 (1133)
                      .|-++-||||++-.+.-++=++..|+
T Consensus        28 ~r~~RLKqAK~EA~~EI~~yr~~kE~   53 (113)
T TIGR01147        28 RKTKRLKQAKEEAQKEVEKYKQQREK   53 (113)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            44445556666555544443333333


No 270
>KOG4661 consensus Hsp27-ERE-TATA-binding protein/Scaffold attachment factor (SAF-B) [Transcription]
Probab=27.69  E-value=3.6e+02  Score=34.10  Aligned_cols=9  Identities=22%  Similarity=0.191  Sum_probs=3.4

Q ss_pred             hhcccCCch
Q 001172          476 RLEDENPSD  484 (1133)
Q Consensus       476 ~~~~~~~~~  484 (1133)
                      -+.++.+.+
T Consensus       680 M~ve~eRr~  688 (940)
T KOG4661|consen  680 MKVEEERRD  688 (940)
T ss_pred             HHHHHhhcc
Confidence            333333333


No 271
>PLN02850 aspartate-tRNA ligase
Probab=27.45  E-value=52  Score=40.81  Aligned_cols=19  Identities=11%  Similarity=0.149  Sum_probs=12.7

Q ss_pred             CCCC-CCCCCcccccccchh
Q 001172          889 RIDP-NSVQSAFPFSMVTRD  907 (1133)
Q Consensus       889 r~~~-~s~~~gf~~g~~~~e  907 (1133)
                      |++. ...+|=-.|-|.-.|
T Consensus       302 RaE~s~t~RHl~EFt~Le~E  321 (530)
T PLN02850        302 RAEDSFTHRHLCEFTGLDLE  321 (530)
T ss_pred             ccCCCCCCccchhhccchhh
Confidence            7776 466776677766665


No 272
>KOG0343 consensus RNA Helicase [RNA processing and modification]
Probab=27.41  E-value=1e+02  Score=38.88  Aligned_cols=15  Identities=40%  Similarity=0.477  Sum_probs=7.1

Q ss_pred             CCCCCccccCCCCCC
Q 001172          513 SVDGGAEVLQPDSED  527 (1133)
Q Consensus       513 ~~d~~~e~~~~d~ed  527 (1133)
                      .-||..|+-..+.||
T Consensus       733 ~~~g~k~~~~~~~eD  747 (758)
T KOG0343|consen  733 AADGIKEVSNSSVED  747 (758)
T ss_pred             ccccccccCCcchhh
Confidence            345666663333333


No 273
>cd02668 Peptidase_C19L A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=27.38  E-value=20  Score=40.61  Aligned_cols=32  Identities=13%  Similarity=0.052  Sum_probs=27.7

Q ss_pred             cCCchHHHHhhhhhHHHhhHHHHHhhhccccccchhhhccC
Q 001172          215 YLTNVEQICRRFVEERRGKLGRLIEDKARWSSFCAFWLGID  255 (1133)
Q Consensus       215 PpSNLgaTC~~fved~~~hLG~LLqS~~~~ssFRaf~~a~~  255 (1133)
                      |+.|+|.||         ||+.+||...+...||.+++...
T Consensus         1 GL~NlGnTC---------Y~NsvLQ~L~~~~~fr~~~l~~~   32 (324)
T cd02668           1 GLKNLGATC---------YVNSFLQLWFMNLEFRKAVYECN   32 (324)
T ss_pred             CcccCCcee---------HHHHHHHHHHCCHHHHHHHHccC
Confidence            568999999         99999999999889988776554


No 274
>PRK03963 V-type ATP synthase subunit E; Provisional
Probab=27.34  E-value=1.2e+02  Score=32.27  Aligned_cols=7  Identities=29%  Similarity=0.501  Sum_probs=2.7

Q ss_pred             HHHHHHH
Q 001172          417 EELIREE  423 (1133)
Q Consensus       417 ~~Ll~E~  423 (1133)
                      +++|+|-
T Consensus        20 ~~il~~A   26 (198)
T PRK03963         20 EYILEEA   26 (198)
T ss_pred             HHHHHHH
Confidence            3344333


No 275
>KOG2007 consensus Cysteinyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]
Probab=27.26  E-value=1.1e+02  Score=38.14  Aligned_cols=6  Identities=50%  Similarity=1.303  Sum_probs=3.5

Q ss_pred             eEEEEE
Q 001172           95 YKWYIL  100 (1133)
Q Consensus        95 hkWRI~  100 (1133)
                      ..|+++
T Consensus        56 v~wY~C   61 (586)
T KOG2007|consen   56 VTWYIC   61 (586)
T ss_pred             EEEEEe
Confidence            566654


No 276
>KOG4055 consensus Uncharacterized conserved protein [Function unknown]
Probab=27.23  E-value=3.9e+02  Score=29.51  Aligned_cols=6  Identities=17%  Similarity=0.440  Sum_probs=2.4

Q ss_pred             HHHHHH
Q 001172          404 EVAYQE  409 (1133)
Q Consensus       404 ~~Ay~E  409 (1133)
                      .++|+-
T Consensus       109 daefq~  114 (213)
T KOG4055|consen  109 DAEFQI  114 (213)
T ss_pred             HHHHHH
Confidence            334443


No 277
>PF03879 Cgr1:  Cgr1 family;  InterPro: IPR005579 Cgr1 is involved in nucleolar integrity and is required for processing pre-rRNA for the 60S ribosome subunit. In Saccharomyces cerevisiae, this protein is conserved and contributes to compartmentalisation of nucleolar constituents []. Cgr1 is a small hydrophilic protein and members of this family are coiled-coil proteins []. Its primary role appears to be in ribosome biogenesis [, ]. Expression of CGR1 is also associated with a cessation of yeast cell growth, which is a prerequisite for germination in this organism [].
Probab=27.18  E-value=2.9e+02  Score=27.93  Aligned_cols=30  Identities=20%  Similarity=0.303  Sum_probs=12.9

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          399 FSNKIEVAYQEAVALKRQEELIREEEAAWL  428 (1133)
Q Consensus       399 F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~  428 (1133)
                      |+.|.+.--........+.+|-+|-|++..
T Consensus        33 wekr~~~R~~~~~~K~~ekElKeEKe~er~   62 (108)
T PF03879_consen   33 WEKRMEKRLELKAIKEKEKELKEEKEAERQ   62 (108)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            334444333344444444455554444433


No 278
>KOG2573 consensus Ribosome biogenesis protein - Nop56p/Sik1p [RNA processing and modification; Translation, ribosomal structure and biogenesis]
Probab=26.71  E-value=70  Score=38.57  Aligned_cols=11  Identities=9%  Similarity=0.199  Sum_probs=5.7

Q ss_pred             ccEEEEEEccc
Q 001172           67 YGKYTWRIEKF   77 (1133)
Q Consensus        67 tGshtwkIdNF   77 (1133)
                      .|+..|++.++
T Consensus        11 ~GY~lf~~~~~   21 (498)
T KOG2573|consen   11 TGYGLFKVKEQ   21 (498)
T ss_pred             CceeEEEEech
Confidence            45555555553


No 279
>KOG4326 consensus Mitochondrial F1F0-ATP synthase, subunit e [Energy production and conversion]
Probab=26.53  E-value=4.7e+02  Score=24.97  Aligned_cols=12  Identities=17%  Similarity=0.218  Sum_probs=5.9

Q ss_pred             hHHhhhHHHHHH
Q 001172          383 TELGRRTVEIFV  394 (1133)
Q Consensus       383 ~Elgrr~~~ifa  394 (1133)
                      .-+||...-++.
T Consensus        12 IkfGRysaL~lG   23 (81)
T KOG4326|consen   12 IKFGRYSALSLG   23 (81)
T ss_pred             HHhhHHHHHHHH
Confidence            345665544443


No 280
>COG1422 Predicted membrane protein [Function unknown]
Probab=26.50  E-value=56  Score=36.02  Aligned_cols=38  Identities=11%  Similarity=0.198  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHHHHhhhhhhhHHHHHHhhhhhhhhhcc
Q 001172          442 KEKKAKKKLAKQKRNNRKGKEKKREERSSMALSDRLED  479 (1133)
Q Consensus       442 kekk~~~k~ak~~e~~rk~~e~~~ee~~~~~~~~~~~~  479 (1133)
                      ++=++.||++||=.++.+|+.|+.++++-+..++++.|
T Consensus        72 ekm~~~qk~m~efq~e~~eA~~~~d~~~lkkLq~~qme  109 (201)
T COG1422          72 EKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQME  109 (201)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHH


No 281
>PTZ00074 60S ribosomal protein L34; Provisional
Probab=26.23  E-value=1.6e+02  Score=30.82  Aligned_cols=7  Identities=29%  Similarity=0.596  Sum_probs=3.4

Q ss_pred             HHHHHHH
Q 001172          401 NKIEVAY  407 (1133)
Q Consensus       401 ~rv~~Ay  407 (1133)
                      +||..||
T Consensus        87 ~rIirAF   93 (135)
T PTZ00074         87 DRIVRAF   93 (135)
T ss_pred             HHHHHHH
Confidence            4555444


No 282
>PHA01750 hypothetical protein
Probab=26.22  E-value=2.6e+02  Score=26.25  Aligned_cols=28  Identities=29%  Similarity=0.469  Sum_probs=22.4

Q ss_pred             HHHHHhhhhhHHHHHHHHHHHHHHHHHHH
Q 001172          391 EIFVLAHIFSNKIEVAYQEAVALKRQEEL  419 (1133)
Q Consensus       391 ~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~L  419 (1133)
                      -+|+...++- +|+.++++++-.-.|.+|
T Consensus        17 tlFaIiqlYl-KIKq~lkdAvkeIV~~EL   44 (75)
T PHA01750         17 TLFAIIQLYL-KIKQALKDAVKEIVNSEL   44 (75)
T ss_pred             HHHHHHHHHH-HHHHHHHHHHHHHHHHHH
Confidence            3677777774 899999999988888776


No 283
>PRK00106 hypothetical protein; Provisional
Probab=26.17  E-value=3.1e+02  Score=34.48  Aligned_cols=25  Identities=12%  Similarity=0.177  Sum_probs=10.5

Q ss_pred             CCCCCC-CcccCCCcccccccCCCCC
Q 001172          610 GKNQRG-KSTYDGNVWATETENQPSR  634 (1133)
Q Consensus       610 g~~~~~-~~~~~~~~~~~~~~~~ps~  634 (1133)
                      ||.-|| |....-|+..--.|+-|..
T Consensus       242 GreGrNir~~E~~tGvdliiddtp~~  267 (535)
T PRK00106        242 GREGRNIRTLESLTGIDVIIDDTPEV  267 (535)
T ss_pred             CCCcchHHHHHHHhCceEEEcCCCCe
Confidence            444444 3333333333335555543


No 284
>PF05529 Bap31:  B-cell receptor-associated protein 31-like ;  InterPro: IPR008417 Bap31 is a polytopic integral protein of the endoplasmic reticulum membrane and a substrate of caspase-8. Bap31 is cleaved within its cytosolic domain, generating pro-apoptotic p20 Bap31 [].; GO: 0006886 intracellular protein transport, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=26.12  E-value=4.3e+02  Score=28.24  Aligned_cols=14  Identities=14%  Similarity=0.197  Sum_probs=6.7

Q ss_pred             hhhHHHHHHHhhhh
Q 001172          386 GRRTVEIFVLAHIF  399 (1133)
Q Consensus       386 grr~~~ifa~~k~F  399 (1133)
                      |--+|=.|++.+++
T Consensus       108 Gf~LfL~l~I~r~~  121 (192)
T PF05529_consen  108 GFALFLSLVIRRVH  121 (192)
T ss_pred             HHHHHHHHHHHHHH
Confidence            54555555544444


No 285
>TIGR02473 flagell_FliJ flagellar export protein FliJ. Members of this family are the FliJ protein found, in nearly every case, in the midst of other flagellar biosynthesis genes in bacgterial genomes. Typically the fliJ gene is found adjacent to the gene for the flagellum-specific ATPase FliI. Sequence scoring in the gray zone between trusted and noise cutoffs include both probable FliJ proteins and components of bacterial type III secretion systems.
Probab=25.68  E-value=6.6e+02  Score=24.85  Aligned_cols=10  Identities=0%  Similarity=0.132  Sum_probs=4.3

Q ss_pred             HHHHHHHHHH
Q 001172          402 KIEVAYQEAV  411 (1133)
Q Consensus       402 rv~~Ay~E~~  411 (1133)
                      +......+.+
T Consensus        68 ~f~~~l~~~i   77 (141)
T TIGR02473        68 RFIRQLDQRI   77 (141)
T ss_pred             HHHHHHHHHH
Confidence            3344444444


No 286
>KOG2385 consensus Uncharacterized conserved protein [Function unknown]
Probab=25.55  E-value=56  Score=40.54  Aligned_cols=43  Identities=28%  Similarity=0.188  Sum_probs=30.4

Q ss_pred             HHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          390 VEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQK  434 (1133)
Q Consensus       390 ~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~  434 (1133)
                      |.|-++..-|.+ ++..-.|++ ..||.+|+-+.|+|+..++|++
T Consensus       568 lg~d~~~~~f~~-~le~~~e~~-~~rqr~l~~~~eaee~~~~e~~  610 (633)
T KOG2385|consen  568 LGIDVLSEEFRQ-TLEVQEEKY-ESRQRTLYSQKEAEEEIEQEVL  610 (633)
T ss_pred             hCccccCHHHHH-HhhcCChHH-HHHHHHHHhHHhHHHHHHHHHH
Confidence            666666666743 344444554 7889999999999888877776


No 287
>PF05529 Bap31:  B-cell receptor-associated protein 31-like ;  InterPro: IPR008417 Bap31 is a polytopic integral protein of the endoplasmic reticulum membrane and a substrate of caspase-8. Bap31 is cleaved within its cytosolic domain, generating pro-apoptotic p20 Bap31 [].; GO: 0006886 intracellular protein transport, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=25.55  E-value=4.4e+02  Score=28.17  Aligned_cols=6  Identities=17%  Similarity=0.224  Sum_probs=2.5

Q ss_pred             HHHHHH
Q 001172          389 TVEIFV  394 (1133)
Q Consensus       389 ~~~ifa  394 (1133)
                      -|--|+
T Consensus       105 YIsGf~  110 (192)
T PF05529_consen  105 YISGFA  110 (192)
T ss_pred             HHHHHH
Confidence            344444


No 288
>PTZ00491 major vault protein; Provisional
Probab=25.38  E-value=2.6e+02  Score=36.85  Aligned_cols=67  Identities=22%  Similarity=0.308  Sum_probs=37.8

Q ss_pred             cccchhHH-HHHHhhHHhhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHH
Q 001172          370 FNKDSIER-DERRLTELGRRTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKRGAAEKEKKA  446 (1133)
Q Consensus       370 vs~dtve~-ee~~~~Elgrr~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~k~kekk~  446 (1133)
                      |++-+||+ |.+-+. .=.|-+++.+         +...+--.|.-+|+....|-|+..++|.+|-.-+.+.|+.+|.
T Consensus       633 VDvqsvEpvD~~tr~-~LqkSVqlAi---------EItt~sqEa~A~h~a~~~eQea~g~Lerqk~~d~~~aE~~r~~  700 (850)
T PTZ00491        633 VDVQSVEPVDERTRD-SLQKSVQLAI---------EITTKSQEAAARHQAELLEQEARGRLERQKMHDKAKAEEQRTK  700 (850)
T ss_pred             EeeeeeeecCHHHHH-HHHHHHHHhh---------hhhchhHHHHHHHHHHHHHHHhhchhHHHhhhhHHHHHHHHHH
Confidence            55666776 554433 2234444444         5555555566677777777777777766665555554444443


No 289
>PF03538 VRP1:  Salmonella virulence plasmid 28.1kDa A protein;  InterPro: IPR018003 This entry included insecticidal toxin complex proteins (TcaA, TccA, TcbA, TcdA) from Photorhabdus luminescens subsp. laumondii and Xenorhabdus nematophilus (Achromobacter nematophilus) [], and virulence proteins from Salmonella typhimurium that are encoded on a 90kb plasmid. P. luminescens and X. nematophilus are Gram-negative bacteria that form entomopathogenic symbioses with soil nematodes. The bacteria are found in the gut of entomopathogenic nematodes that invade and kill insects. When the nematode invades an insect host the bacteria are released into the insect haemocoel (the open circulatory system), both the bacteria and the nematode undergo multiple rounds of replication which kills the insect host. Mapping of the insecticidal toxin loci and studies on knockout mutants in P. luminescens showed that deletion of either tca or tcd loci dramatically reduced toxicity, while the double mutant tca/tcd abolished toxicity []. However the biology of toxin action is unclear as is the species range of insects the toxins are active against. S. typhimurium contains a 90kb plasmid that is associated with virulence. This plasmid encodes at least 6 genes needed by the bacterium for invading host macrophages during infection. These include the 70kDa mkaA protein [], a recognised virulence factor, and more recently described, four spv genes under the control of a regulator []. Deletion studies on the virulence plasmid have shown that an open reading frame encoding a 28kDa protein was needed for successful invasion of the host. This protein, designated mkfA [], VRP4 [] or VirA [] by different groups, is utilised by the microbe upon entry into macrophages, although the exact mechanism is unclear.
Probab=25.13  E-value=26  Score=40.92  Aligned_cols=48  Identities=25%  Similarity=0.418  Sum_probs=32.1

Q ss_pred             cccccCCchhhhhhhhccc--cc---ccccccccccccccCCCCccccccccCCC
Q 001172          971 VLVDEFPHLDIINDLLDDE--HG---VGMAAGASTVLQSLSNGPHTLNRQFSFPR 1020 (1133)
Q Consensus       971 ~v~dEFPHLDIINDLL~ee--q~---~g~~~~~~~~~~~~~~~~~~f~~~fs~~g 1020 (1133)
                      -|-.|-|.|||+|++|+.-  .-   .|..  .-.+.+.|..-+..|.+||.+|-
T Consensus       171 n~~~~vp~L~ivNevL~~~i~~~~~~~~~~--~~~v~~~La~~~~p~~LPY~~~~  223 (325)
T PF03538_consen  171 NMNTEVPTLDIVNEVLESAIKAYLDQTGDT--RKSVYQALATARYPFTLPYHDPH  223 (325)
T ss_pred             HhcCcCcHHHHHHHHHHHHHHHhcccCCcc--hhHHHHHHHhccCCCCCCCccHH
Confidence            4677889999999999741  00   1211  11456667667777888887765


No 290
>PF00443 UCH:  Ubiquitin carboxyl-terminal hydrolase;  InterPro: IPR001394 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases belong to the MEROPS peptidase family C19 (ubiquitin-specific protease family, clan CA). Families within the CA clan are loosely termed papain-like as protein fold of the peptidase unit resembles that of papain, the type example for clan CA. Predicted active site residues for members of this family and family C1 occur in the same order in the sequence: N/Q, C, H. The type example is human ubiquitin-specific protease 14. Ubiquitin is highly conserved, commonly found conjugated to proteins in eukaryotic cells, where it may act as a marker for rapid degradation, or it may have a chaperone function in protein assembly []. The ubiquitin is released by cleavage from the bound protein by a protease []. A number of deubiquitinising proteases are known: all are activated by thiol compounds [, ], and inhibited by thiol-blocking agents and ubiquitin aldehyde [, ], and as such have the properties of cysteine proteases []. The deubiquitinsing proteases can be split into 2 size ranges (20-30 kDa, IPR001578 from INTERPRO, and 100-200 kDa) []: this family are the 100-200 kDa peptides which includes the Ubp1 ubiquitin peptidase from yeast. Only one conserved cysteine can be identified, along with two conserved histidines. The spacing between the cysteine and the second histidine is thought to be more representative of the cysteine/histidine spacing of a cysteine protease catalytic dyad [].; GO: 0004221 ubiquitin thiolesterase activity, 0006511 ubiquitin-dependent protein catabolic process; PDB: 2LBC_A 3MHH_A 3MHS_A 3M99_A 2Y6E_D 2VHF_A 2HD5_A 3NHE_A 2IBI_A 1NBF_B ....
Probab=25.09  E-value=29  Score=36.30  Aligned_cols=33  Identities=21%  Similarity=0.337  Sum_probs=26.6

Q ss_pred             eecCCchHHHHhhhhhHHHhhHHHHHhhhccccccchhhhcc
Q 001172          213 RVYLTNVEQICRRFVEERRGKLGRLIEDKARWSSFCAFWLGI  254 (1133)
Q Consensus       213 ~VPpSNLgaTC~~fved~~~hLG~LLqS~~~~ssFRaf~~a~  254 (1133)
                      +||+.|+|.||         ||+..||...+...||.++...
T Consensus         1 ~~Gl~N~gntC---------ylNs~lQ~L~~~~~~~~~l~~~   33 (269)
T PF00443_consen    1 PVGLQNIGNTC---------YLNSVLQCLFHIPPFRNYLLSY   33 (269)
T ss_dssp             --EESBSSSTH---------HHHHHHHHHHTSHHHHHHHHTT
T ss_pred             CCCcEeCCCch---------HHhHHHHhhhhhhhhhhhhhhc
Confidence            37889999999         9999999998887888766543


No 291
>PF05279 Asp-B-Hydro_N:  Aspartyl beta-hydroxylase N-terminal region;  InterPro: IPR007943 This domain is found in members of the junctin, junctate and aspartyl beta-hydroxylase protein families. Junctate is an integral ER/SR membrane calcium binding protein, which comes from an alternatively spliced form of the same gene that generates aspartyl beta-hydroxylase and junctin []. Aspartyl beta-hydroxylase catalyses the post-translational hydroxylation of aspartic acid or asparagine residues contained within epidermal growth factor (EGF) domains of proteins []. This domain is also found in several eukaryotic triadin proteins. Triadin is a ryanodine receptor and calsequestrin binding protein located in junctional sarcoplasmic reticulum of striated muscles [].; GO: 0016020 membrane
Probab=24.93  E-value=1.8e+02  Score=33.00  Aligned_cols=10  Identities=30%  Similarity=0.418  Sum_probs=4.7

Q ss_pred             ccccccccCC
Q 001172          309 TKAKLLDAED  318 (1133)
Q Consensus       309 ~~~~~~~t~e  318 (1133)
                      ++|-..|+|.
T Consensus        43 ~kL~iYDaDG   52 (243)
T PF05279_consen   43 GKLGIYDADG   52 (243)
T ss_pred             hhheeeeccC
Confidence            4454445443


No 292
>PF12569 NARP1:  NMDA receptor-regulated protein 1 ;  InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=24.77  E-value=85  Score=38.93  Aligned_cols=20  Identities=40%  Similarity=0.503  Sum_probs=9.4

Q ss_pred             HHHHHHHHHHHHhhhhhhhH
Q 001172          444 KKAKKKLAKQKRNNRKGKEK  463 (1133)
Q Consensus       444 kk~~~k~ak~~e~~rk~~e~  463 (1133)
                      ||.++|+.|+++|+++++++
T Consensus       413 Kk~~kK~kK~~~k~~~~~~~  432 (517)
T PF12569_consen  413 KKAKKKAKKAAKKAKKEEAE  432 (517)
T ss_pred             HHHHHHHHHHHHHHhHHHHH
Confidence            44445555554444444333


No 293
>PTZ00401 aspartyl-tRNA synthetase; Provisional
Probab=24.68  E-value=58  Score=40.67  Aligned_cols=19  Identities=11%  Similarity=0.123  Sum_probs=10.8

Q ss_pred             CCCCCC-CCCcccccccchh
Q 001172          889 RIDPNS-VQSAFPFSMVTRD  907 (1133)
Q Consensus       889 r~~~~s-~~~gf~~g~~~~e  907 (1133)
                      |++... .+|=-.|-|.-.|
T Consensus       290 RaE~s~T~RHl~EFt~Le~E  309 (550)
T PTZ00401        290 RSENSNTHRHLTEFVGLDVE  309 (550)
T ss_pred             eCCCCCCCCCccchhhhhhh
Confidence            666553 4666566555554


No 294
>PRK05689 fliJ flagellar biosynthesis chaperone; Validated
Probab=24.59  E-value=7e+02  Score=25.31  Aligned_cols=11  Identities=9%  Similarity=0.063  Sum_probs=4.7

Q ss_pred             HHHHHHHHHHH
Q 001172          440 AEKEKKAKKKL  450 (1133)
Q Consensus       440 k~kekk~~~k~  450 (1133)
                      +-|||+.....
T Consensus       113 kL~ek~~~~~~  123 (147)
T PRK05689        113 TLQERYQTEAR  123 (147)
T ss_pred             HHHHHHHHHHH
Confidence            44444444333


No 295
>TIGR03319 YmdA_YtgF conserved hypothetical protein YmdA/YtgF.
Probab=24.45  E-value=4.5e+02  Score=32.79  Aligned_cols=6  Identities=17%  Similarity=0.673  Sum_probs=2.7

Q ss_pred             ccCCCC
Q 001172          628 TENQPS  633 (1133)
Q Consensus       628 ~~~~ps  633 (1133)
                      .|+-|.
T Consensus       240 iddtp~  245 (514)
T TIGR03319       240 IDDTPE  245 (514)
T ss_pred             EcCCCC
Confidence            444443


No 296
>KOG4043 consensus Uncharacterized conserved protein [Function unknown]
Probab=24.45  E-value=49  Score=35.40  Aligned_cols=20  Identities=25%  Similarity=0.358  Sum_probs=7.6

Q ss_pred             HhhhHHHHHHHHHHHHHHHH
Q 001172          436 KRGAAEKEKKAKKKLAKQKR  455 (1133)
Q Consensus       436 kk~~k~kekk~~~k~ak~~e  455 (1133)
                      ||.++.||+|+++++.+..|
T Consensus       179 KkhedDKeRKK~KeEk~~~e  198 (214)
T KOG4043|consen  179 KKHEDDKERKKEKEEKGDAE  198 (214)
T ss_pred             hhhhhhHHHHHHHHHhhhhh
Confidence            33333444443333333333


No 297
>PF05384 DegS:  Sensor protein DegS;  InterPro: IPR008595 This is a group of Bacillus DegS proteins. The DegS-DegU two-component regulatory system of Bacillus subtilis controls various processes that characterise the transition from the exponential to the stationary growth phase, including the induction of extracellular degradative enzymes, expression of late competence genes and down-regulation of the sigma D regulon []. The entry also contains one sequence Q8R9D3 from SWISSPROT from Thermoanaerobacter tengcongensis which is described as a sensory transduction histidine kinase.; GO: 0016301 kinase activity, 0007165 signal transduction
Probab=24.39  E-value=6.9e+02  Score=26.80  Aligned_cols=33  Identities=30%  Similarity=0.445  Sum_probs=24.2

Q ss_pred             HHHhhhh----hHHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          393 FVLAHIF----SNKIEVAYQEAVALKRQEELIREEEA  425 (1133)
Q Consensus       393 fa~~k~F----~~rv~~Ay~E~~A~~rQ~~Ll~E~E~  425 (1133)
                      .-+.++|    +.-|+.||.++.-...+..+++|.|.
T Consensus        65 ~eVS~~f~~ysE~dik~AYe~A~~lQ~~L~~~re~E~  101 (159)
T PF05384_consen   65 AEVSRNFDRYSEEDIKEAYEEAHELQVRLAMLREREK  101 (159)
T ss_pred             HHHHhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3358888    67899999998766666666666554


No 298
>PF15295 CCDC50_N:  Coiled-coil domain-containing protein 50  N-terminus
Probab=24.36  E-value=2.1e+02  Score=29.94  Aligned_cols=6  Identities=33%  Similarity=0.734  Sum_probs=2.4

Q ss_pred             ccCCCC
Q 001172          315 DAEDTP  320 (1133)
Q Consensus       315 ~t~e~p  320 (1133)
                      |...||
T Consensus         7 d~~~lP   12 (132)
T PF15295_consen    7 DQSNLP   12 (132)
T ss_pred             ccccCC
Confidence            333444


No 299
>PF12037 DUF3523:  Domain of unknown function (DUF3523);  InterPro: IPR021911  This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is typically between 257 to 277 amino acids in length. This domain is found associated with PF00004 from PFAM. This domain has a conserved LER sequence motif. 
Probab=24.23  E-value=9.7e+02  Score=27.98  Aligned_cols=33  Identities=21%  Similarity=0.194  Sum_probs=16.3

Q ss_pred             HHHHhhHHhhhHHHHHHH-hhhhhHHHHHHHHHH
Q 001172          378 DERRLTELGRRTVEIFVL-AHIFSNKIEVAYQEA  410 (1133)
Q Consensus       378 ee~~~~Elgrr~~~ifa~-~k~F~~rv~~Ay~E~  410 (1133)
                      |+-+..|+..++-++-+. +.+=-+|+.+.+.|.
T Consensus        61 E~TkQ~E~~ak~~e~ea~~~q~~~e~~rv~~EE~   94 (276)
T PF12037_consen   61 EETKQAELQAKIAEYEAAQAQAEIERQRVEAEER   94 (276)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            455556666666665442 222224455544443


No 300
>cd02667 Peptidase_C19K A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=24.12  E-value=34  Score=38.06  Aligned_cols=30  Identities=10%  Similarity=0.238  Sum_probs=26.7

Q ss_pred             cCCchHHHHhhhhhHHHhhHHHHHhhhccccccchhhhc
Q 001172          215 YLTNVEQICRRFVEERRGKLGRLIEDKARWSSFCAFWLG  253 (1133)
Q Consensus       215 PpSNLgaTC~~fved~~~hLG~LLqS~~~~ssFRaf~~a  253 (1133)
                      |+.|+|.||         ||+..||...+...||.+++.
T Consensus         1 Gl~N~GntC---------y~NsvLQ~L~~~~~~~~~~l~   30 (279)
T cd02667           1 GLSNLGNTC---------FFNAVMQNLSQTPALRELLSE   30 (279)
T ss_pred             CCcCCCCch---------HHHHHHHHHhcCHHHHHHHHH
Confidence            567999999         999999999998889887766


No 301
>PF11214 Med2:  Mediator complex subunit 2;  InterPro: IPR021017 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins.  The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11.  The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation.   The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22.  The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4.  The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16.  The CDK8 module contains: MED12, MED13, CCNC and CDK8.   Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP.  This family of mediator complex subunit 2 proteins is conserved in fungi. Cyclin-dependent kinase CDK8 or Srb10 interacts with and phosphorylates Med2. Post-translational modifications of Mediator subunits are important for regulation of gene expression [, ]. 
Probab=23.93  E-value=74  Score=31.91  Aligned_cols=6  Identities=33%  Similarity=0.590  Sum_probs=2.2

Q ss_pred             HHhhhh
Q 001172          454 KRNNRK  459 (1133)
Q Consensus       454 ~e~~rk  459 (1133)
                      +|++||
T Consensus        94 ~E~~rk   99 (105)
T PF11214_consen   94 EERKRK   99 (105)
T ss_pred             HHHHHH
Confidence            333333


No 302
>PF12072 DUF3552:  Domain of unknown function (DUF3552);  InterPro: IPR022711  This presumed domain is functionally uncharacterised. This domain is found in bacteria, archaea and eukaryotes. This domain is about 200 amino acids in length. This domain is found associated with PF00013 from PFAM, PF01966 from PFAM. This domain has a single completely conserved residue A that may be functionally important. ; GO: 0008663 2',3'-cyclic-nucleotide 2'-phosphodiesterase activity
Probab=23.83  E-value=9.6e+02  Score=26.09  Aligned_cols=19  Identities=37%  Similarity=0.343  Sum_probs=8.6

Q ss_pred             HHHHhhhhhHHHHHHHHHH
Q 001172          392 IFVLAHIFSNKIEVAYQEA  410 (1133)
Q Consensus       392 ifa~~k~F~~rv~~Ay~E~  410 (1133)
                      +|+..+++..++..|-.++
T Consensus        18 ~~~~~~~~~~~~~~A~~~A   36 (201)
T PF12072_consen   18 YLVRKKINRKKLEQAEKEA   36 (201)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            3443444445555444443


No 303
>PF07763 FEZ:  FEZ-like protein;  InterPro: IPR011680 This is a family of eukaryotic proteins thought to be involved in axonal outgrowth and fasciculation []. The N-terminal regions of these sequences are less conserved than the C-terminal regions, and are highly acidic []. The Caenorhabditis elegans homolog, UNC-76 (Q7JNU9 from SWISSPROT), may play structural and signalling roles in the control of axonal extension and adhesion (particularly in the presence of adjacent neuronal cells []) and these roles have also been postulated for other FEZ family proteins []. Certain homologs have been definitively found to interact with the N-terminal variable region (V1) of PKC-zeta, and this interaction causes cytoplasmic translocation of the FEZ family protein in mammalian neuronal cells []. The C-terminal region probably participates in the association with the regulatory domain of PKC-zeta []. The members of this family are predicted to form coiled-coil structures [, ], which may interact with members of the RhoA family of signalling proteins [], but are not thought to contain other characteristic protein motifs []. Certain members of this family are expressed almost exclusively in the brain, whereas others (such as FEZ2, Q76LN0 from SWISSPROT) are expressed in other tissues, and are thought to perform similar but unknown functions in these tissues [].
Probab=23.81  E-value=4.4e+02  Score=30.15  Aligned_cols=51  Identities=25%  Similarity=0.263  Sum_probs=30.7

Q ss_pred             hhHHhhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 001172          382 LTELGRRTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKRG  438 (1133)
Q Consensus       382 ~~Elgrr~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~  438 (1133)
                      +-+++=|.+.+--|..++ .+++++-++-=     +.||.+|.-...++=||+.|-.
T Consensus       163 ~~~e~L~~LS~seL~~ll-~e~E~~Ir~yS-----EeLV~qLA~RDELefEKEvKN~  213 (244)
T PF07763_consen  163 SYEEGLRQLSLSELNELL-EEMETAIREYS-----EELVQQLALRDELEFEKEVKNT  213 (244)
T ss_pred             CcHHHHHhcCHHHHHHHH-HHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHH
Confidence            456666666666666666 46777655544     4455556555556666666553


No 304
>PF05672 MAP7:  MAP7 (E-MAP-115) family;  InterPro: IPR008604 The organisation of microtubules varies with the cell type and is presumably controlled by tissue-specific microtubule-associated proteins (MAPs). The 115 kDa epithelial MAP (E-MAP-115) has been identified as a microtubule-stabilising protein predominantly expressed in cell lines of epithelial origin []. The binding of this microtubule associated protein is nucleotide independent [].
Probab=23.75  E-value=1.8e+02  Score=31.55  Aligned_cols=19  Identities=26%  Similarity=0.181  Sum_probs=12.8

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q 001172          415 RQEELIREEEAAWLAESEQ  433 (1133)
Q Consensus       415 rQ~~Ll~E~E~e~~~~~e~  433 (1133)
                      ||..+..|-|+.+++++|.
T Consensus        33 RlAReQkErEE~ER~ekEE   51 (171)
T PF05672_consen   33 RLAREQKEREEQERLEKEE   51 (171)
T ss_pred             HHHHHHhhhHHHHHHHHHH
Confidence            6677777777777665543


No 305
>KOG2962 consensus Prohibitin-related membrane protease subunits [General function prediction only]
Probab=23.72  E-value=8.6e+02  Score=27.90  Aligned_cols=6  Identities=17%  Similarity=0.861  Sum_probs=3.0

Q ss_pred             CeEEEE
Q 001172          110 NHLSLF  115 (1133)
Q Consensus       110 dyLSLY  115 (1133)
                      +|+++|
T Consensus        29 GHvgvY   34 (322)
T KOG2962|consen   29 GHVGVY   34 (322)
T ss_pred             CceEEE
Confidence            455554


No 306
>cd08045 TAF4 TATA Binding Protein (TBP) Associated Factor 4 (TAF4) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 4 (TAF4) is one of several TAFs that bind TBP and are involved in forming the Transcription Factor IID (TFIID) complex. TFIID is one of seven General Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryote. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. TAFs from various species were originally named by their predicted molecular weight or their electrophoretic mobility in polyacrylamide gels. A new, unified nomenclature for the pol II TAFs has been suggested to show the relationship between TAF orthologs and paralogs. Several hypotheses are
Probab=23.46  E-value=9.8e+02  Score=26.21  Aligned_cols=12  Identities=8%  Similarity=0.047  Sum_probs=6.0

Q ss_pred             hhHHHHHHHhhc
Q 001172          335 DDVLLLLERAAL  346 (1133)
Q Consensus       335 ~Dll~~leR~al  346 (1133)
                      ..|...|.+++-
T Consensus        48 ~~l~~~~~~i~~   59 (212)
T cd08045          48 SPLAKKIRKIAK   59 (212)
T ss_pred             HHHHHHHHHHHH
Confidence            344455555544


No 307
>KOG2223 consensus Uncharacterized conserved protein, contains TBC domain [Signal transduction mechanisms; General function prediction only]
Probab=23.46  E-value=1.7e+02  Score=35.91  Aligned_cols=8  Identities=25%  Similarity=0.434  Sum_probs=3.4

Q ss_pred             ccccCCCC
Q 001172          518 AEVLQPDS  525 (1133)
Q Consensus       518 ~e~~~~d~  525 (1133)
                      .|.+++|.
T Consensus       356 v~~i~LDi  363 (586)
T KOG2223|consen  356 VELIKLDI  363 (586)
T ss_pred             hhheeech
Confidence            34444443


No 308
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=23.36  E-value=3e+02  Score=35.65  Aligned_cols=59  Identities=25%  Similarity=0.254  Sum_probs=40.8

Q ss_pred             CCcCCCCCCCCCCC--CCC---CC--Cccceeeccc-hhhhhhc-cccccCCCCCCCCCCCCCccccc
Q 001172          784 PNMSRPSSAPLVPG--PRP---TA--PVVSVVHTAP-LLARSVS-AAGRLGPDLAPATHGYIPQSYRN  842 (1133)
Q Consensus       784 ~~~sRPsSAP~~p~--pr~---t~--p~~s~vqt~p-lLsRS~s-a~gr~g~dpS~~~~sy~pqsYrN  842 (1133)
                      +.-.|++|+|+-|.  |.+   |.  |++.+-|+++ +-+|.+= -.+|..|-+++-+|.|+-+.-||
T Consensus       498 t~~~rtssspvmpv~lp~~s~~ty~~~~v~a~~~a~~s~~r~~~~~~~~a~~~~~~~~~~~l~r~r~~  565 (825)
T KOG0267|consen  498 TVPERTSSSPVMPVILPQASMSTYPEPPVGASSTARTSSARILPVTFNQANNISSEEAPVTLRRQRRN  565 (825)
T ss_pred             ccccccccCCccccccCCCcccccCCCCccccCcccccccccccccccccccccCcCCccccccccCC
Confidence            35679999998776  222   22  3455556644 4456666 36788888888899999888887


No 309
>PRK01005 V-type ATP synthase subunit E; Provisional
Probab=23.21  E-value=4.1e+02  Score=29.40  Aligned_cols=14  Identities=14%  Similarity=0.166  Sum_probs=6.1

Q ss_pred             hhhhhHHHHHHHHH
Q 001172          396 AHIFSNKIEVAYQE  409 (1133)
Q Consensus       396 ~k~F~~rv~~Ay~E  409 (1133)
                      .+|.++.+..|=.|
T Consensus        15 dki~~eiL~eA~~e   28 (207)
T PRK01005         15 DALREETLKPAEEE   28 (207)
T ss_pred             HHHHHHHHHHHHHH
Confidence            44444444444333


No 310
>PLN02381 valyl-tRNA synthetase
Probab=23.08  E-value=94  Score=41.77  Aligned_cols=60  Identities=23%  Similarity=0.246  Sum_probs=0.0

Q ss_pred             HHHHHHHHhhhHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhhhhhhhhcccCCchhhhhhh
Q 001172          429 AESEQKAKRGAAEKEKKAKKKLAKQKRNNRKGKEKKREERSSMALSDRLEDENPSDEKKEFI  490 (1133)
Q Consensus       429 ~~~e~~~kk~~k~kekk~~~k~ak~~e~~rk~~e~~~ee~~~~~~~~~~~~~~~~~~~~~~~  490 (1133)
                      ++-|||.||.+|.|||..++-.|.+++.+.|.+.+.+.  ...+...|..+.+++.+-++..
T Consensus        16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~   75 (1066)
T PLN02381         16 EELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQAS--DGTNVPKKSEKKSRKRDVEDEN   75 (1066)
T ss_pred             HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc--ccCCCcccchhhhhhhhhhhcc


No 311
>PTZ00436 60S ribosomal protein L19-like protein; Provisional
Probab=23.06  E-value=5.1e+02  Score=30.61  Aligned_cols=6  Identities=50%  Similarity=0.219  Sum_probs=2.3

Q ss_pred             HHHHhh
Q 001172          433 QKAKRG  438 (1133)
Q Consensus       433 ~~~kk~  438 (1133)
                      .+++|.
T Consensus       159 ~eArR~  164 (357)
T PTZ00436        159 LAAKRL  164 (357)
T ss_pred             HHHHHH
Confidence            334433


No 312
>KOG0949 consensus Predicted helicase, DEAD-box superfamily [General function prediction only]
Probab=23.01  E-value=2.7e+02  Score=37.50  Aligned_cols=19  Identities=32%  Similarity=0.340  Sum_probs=12.5

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q 001172          407 YQEAVALKRQEELIREEEA  425 (1133)
Q Consensus       407 y~E~~A~~rQ~~Ll~E~E~  425 (1133)
                      +.|+.|.++-.+|-+-|+|
T Consensus       862 fcekla~kv~~~Le~~e~E  880 (1330)
T KOG0949|consen  862 FCEKLALKVHRQLESMEME  880 (1330)
T ss_pred             HHHHHHHHHHHHHHHHHHh
Confidence            4477777777777655555


No 313
>cd07658 F-BAR_NOSTRIN The F-BAR (FES-CIP4 Homology and Bin/Amphiphysin/Rvs) domain of Nitric Oxide Synthase TRaffic INducer (NOSTRIN). F-BAR domains are dimerization modules that bind and bend membranes and are found in proteins involved in membrane dynamics and actin reorganization. Nitric Oxide Synthase TRaffic INducer (NOSTRIN) is expressed in endothelial and epithelial cells and is involved in the regulation, trafficking and targeting of endothelial NOS (eNOS). NOSTRIN facilitates the endocytosis of eNOS by coordinating the functions of dynamin and the Wiskott-Aldrich syndrome protein (WASP). Increased expression of NOSTRIN may be correlated to preeclampsia. NOSTRIN contains an N-terminal F-BAR domain and a C-terminal SH3 domain. F-BAR domains form banana-shaped dimers with a positively-charged concave surface that binds to negatively-charged lipid membranes. They can induce membrane deformation in the form of long tubules. The F-BAR domain of NOSTRIN is necessary and sufficient fo
Probab=22.78  E-value=1.1e+03  Score=26.36  Aligned_cols=15  Identities=40%  Similarity=0.636  Sum_probs=9.9

Q ss_pred             ehhhHHHHH-HHhhcC
Q 001172          333 LVDDVLLLL-ERAALE  347 (1133)
Q Consensus       333 l~~Dll~~l-eR~al~  347 (1133)
                      ++.||+..+ ||+.++
T Consensus        20 ~ckel~~f~kERa~IE   35 (239)
T cd07658          20 FCKELATVLQERAELE   35 (239)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            567777655 777666


No 314
>PF14738 PaaSYMP:  Solute carrier (proton/amino acid symporter), TRAMD3 or PAT1
Probab=22.76  E-value=4.1e+02  Score=28.32  Aligned_cols=11  Identities=27%  Similarity=0.290  Sum_probs=5.9

Q ss_pred             HHhhhHHHHHH
Q 001172          384 ELGRRTVEIFV  394 (1133)
Q Consensus       384 Elgrr~~~ifa  394 (1133)
                      ++.++||+-|=
T Consensus        75 ~kRr~mme~~E   85 (154)
T PF14738_consen   75 EKRRKMMEEME   85 (154)
T ss_pred             HHHHHHHHHHH
Confidence            45556665443


No 315
>KOG2572 consensus Ribosome biogenesis protein - Nop58p/Nop5p [RNA processing and modification; Translation, ribosomal structure and biogenesis]
Probab=22.56  E-value=30  Score=41.49  Aligned_cols=11  Identities=9%  Similarity=0.138  Sum_probs=6.0

Q ss_pred             hhhhhhhcchh
Q 001172          272 KVVVKHFFIEK  282 (1133)
Q Consensus       272 kV~~k~Ff~~k  282 (1133)
                      ..+|-+|-+|.
T Consensus       290 ElVGaRlIsha  300 (498)
T KOG2572|consen  290 ELVGARLISHA  300 (498)
T ss_pred             HHHHHHHHHhh
Confidence            34455665665


No 316
>cd08045 TAF4 TATA Binding Protein (TBP) Associated Factor 4 (TAF4) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 4 (TAF4) is one of several TAFs that bind TBP and are involved in forming the Transcription Factor IID (TFIID) complex. TFIID is one of seven General Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryote. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. TAFs from various species were originally named by their predicted molecular weight or their electrophoretic mobility in polyacrylamide gels. A new, unified nomenclature for the pol II TAFs has been suggested to show the relationship between TAF orthologs and paralogs. Several hypotheses are
Probab=22.55  E-value=2.7e+02  Score=30.46  Aligned_cols=6  Identities=17%  Similarity=-0.130  Sum_probs=2.1

Q ss_pred             HHHHHH
Q 001172          424 EAAWLA  429 (1133)
Q Consensus       424 E~e~~~  429 (1133)
                      |++.+.
T Consensus       126 e~err~  131 (212)
T cd08045         126 EEEKRD  131 (212)
T ss_pred             HHHhHH
Confidence            333333


No 317
>COG5638 Uncharacterized conserved protein [Function unknown]
Probab=22.53  E-value=98  Score=37.34  Aligned_cols=19  Identities=37%  Similarity=0.401  Sum_probs=14.2

Q ss_pred             CCCCCCCCccc--cccCCCCC
Q 001172          496 PLPEKPDVLED--VSDVSDSV  514 (1133)
Q Consensus       496 ~~~~k~d~~e~--~sd~sd~~  514 (1133)
                      ++--|.-++||  +-|++|-+
T Consensus       524 k~kkk~~~leegF~~D~~DpR  544 (622)
T COG5638         524 KLKKKASNLEEGFVFDPKDPR  544 (622)
T ss_pred             hhhhhhhccccccccCCCCcc
Confidence            45567889998  78888765


No 318
>PF08701 GN3L_Grn1:  GNL3L/Grn1 putative GTPase;  InterPro: IPR014813 Grn1 (yeast) and GNL3L (human) are putative GTPases which are required for growth and play a role in processing of nucleolar pre-rRNA []. This family contains a potential nuclear localisation signal. 
Probab=22.41  E-value=1.9e+02  Score=27.71  Aligned_cols=16  Identities=31%  Similarity=0.345  Sum_probs=9.0

Q ss_pred             HHHHHHHHHHHHHHHH
Q 001172          417 EELIREEEAAWLAESE  432 (1133)
Q Consensus       417 ~~Ll~E~E~e~~~~~e  432 (1133)
                      ++||.|.|+..+..+|
T Consensus        45 e~iL~eie~~k~~~ee   60 (79)
T PF08701_consen   45 EEILKEIEEKKERAEE   60 (79)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4567777665544433


No 319
>KOG2391 consensus Vacuolar sorting protein/ubiquitin receptor VPS23 [Posttranslational modification, protein turnover, chaperones; Intracellular trafficking, secretion, and vesicular transport]
Probab=22.27  E-value=6.4e+02  Score=30.28  Aligned_cols=48  Identities=23%  Similarity=0.213  Sum_probs=22.3

Q ss_pred             HHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHH
Q 001172          403 IEVAYQEAVALKRQEE---LIREEEAAWLAESEQKAKRGAAEKEKKAKKKL  450 (1133)
Q Consensus       403 v~~Ay~E~~A~~rQ~~---Ll~E~E~e~~~~~e~~~kk~~k~kekk~~~k~  450 (1133)
                      |..|+.|++-..++++   |..|.|.=+|.++|-+.=+++=+.++.++++|
T Consensus       211 visa~~eklR~r~eeeme~~~aeq~slkRt~EeL~~G~~kL~~~~etLEqq  261 (365)
T KOG2391|consen  211 VISAVREKLRRRREEEMERLQAEQESLKRTEEELNIGKQKLVAMKETLEQQ  261 (365)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhhHHHHHHHHHHHHHH
Confidence            4555666654444432   34444554555555544444433333333333


No 320
>PTZ00491 major vault protein; Provisional
Probab=22.15  E-value=1.5e+03  Score=30.39  Aligned_cols=28  Identities=14%  Similarity=0.208  Sum_probs=19.9

Q ss_pred             hhhHHHhhHHHHHhhhccccccchhhhc
Q 001172          226 FVEERRGKLGRLIEDKARWSSFCAFWLG  253 (1133)
Q Consensus       226 fved~~~hLG~LLqS~~~~ssFRaf~~a  253 (1133)
                      |+.|.|++|++.+.....-..|-.|-.+
T Consensus       571 FvGd~Ck~iaSrIR~aVA~~~Fd~FHkn  598 (850)
T PTZ00491        571 FVGDACKTIASRVRAAVASEPFDEFHKN  598 (850)
T ss_pred             hHHHHHHHHHHHHHHHHhcCCHHHHhcc
Confidence            7788999999988766655566555333


No 321
>PF04568 IATP:  Mitochondrial ATPase inhibitor, IATP;  InterPro: IPR007648  ATP synthase inhibitor prevents the enzyme from switching to ATP hydrolysis during collapse of the electrochemical gradient, for example during oxygen deprivation [] ATP synthase inhibitor forms a one to one complex with the F1 ATPase, possibly by binding at the alpha-beta interface. It is thought to inhibit ATP synthesis by preventing the release of ATP []. The minimum inhibitory region for bovine inhibitor (P01096 from SWISSPROT) is from residues 39 to 72 []. The inhibitor has two oligomeric states, dimer (the active state) and tetramer. At low pH , the inhibitor forms a dimer via antiparallel coiled coil interactions between the C-terminal regions of two monomers. At high pH, the inhibitor forms tetramers and higher oligomers by coiled coil interactions involving the N terminus and inhibitory region, thus preventing the inhibitory activity []. ; GO: 0004857 enzyme inhibitor activity, 0045980 negative regulation of nucleotide metabolic process, 0005739 mitochondrion; PDB: 1GMJ_B 1OHH_H 1HF9_B 2V7Q_J.
Probab=22.06  E-value=2.8e+02  Score=27.67  Aligned_cols=27  Identities=26%  Similarity=0.311  Sum_probs=15.7

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          396 AHIFSNKIEVAYQEAVALKRQEELIREE  423 (1133)
Q Consensus       396 ~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~  423 (1133)
                      ...| .+-+.|-.|-|+.+.|.+-|+.|
T Consensus        51 g~~f-~krE~A~E~~Y~r~~EkEqL~~L   77 (100)
T PF04568_consen   51 GGAF-GKREAAQEEQYFRKKEKEQLKKL   77 (100)
T ss_dssp             CHHH-HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             CCcc-chHHHhhHHHHHHHHHHHHHHHH
Confidence            4556 35566666666666665555544


No 322
>PLN03188 kinesin-12 family protein; Provisional
Probab=22.00  E-value=7.6e+02  Score=34.32  Aligned_cols=24  Identities=17%  Similarity=0.292  Sum_probs=17.5

Q ss_pred             HHHHhhHHhhhHHHHHHH--hhhhhH
Q 001172          378 DERRLTELGRRTVEIFVL--AHIFSN  401 (1133)
Q Consensus       378 ee~~~~Elgrr~~~ifa~--~k~F~~  401 (1133)
                      -|+.|.|+=.-.+|..++  ++|++|
T Consensus      1090 ~eK~c~eel~~a~q~am~ghar~~e~ 1115 (1320)
T PLN03188       1090 TEKRCAEELKEAMQMAMEGHARMLEQ 1115 (1320)
T ss_pred             HhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            578888888888887775  445543


No 323
>PLN03237 DNA topoisomerase 2; Provisional
Probab=21.88  E-value=1.9e+02  Score=40.25  Aligned_cols=17  Identities=24%  Similarity=0.528  Sum_probs=13.7

Q ss_pred             hhHHHHHHHhhcCCCCC
Q 001172          335 DDVLLLLERAALEPLPP  351 (1133)
Q Consensus       335 ~Dll~~leR~al~~Lp~  351 (1133)
                      .+|+..|+....++.|.
T Consensus      1060 ~~l~~~L~~~gf~~~~~ 1076 (1465)
T PLN03237       1060 ADLVEELRQKGFTPFPK 1076 (1465)
T ss_pred             HHHHHHHHHcCCCccch
Confidence            48888999988888764


No 324
>PF09731 Mitofilin:  Mitochondrial inner membrane protein;  InterPro: IPR019133  Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex []. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence (residues 1-43) rich in positive-charged and hydroxylated residues and a membrane anchor domain (residues 47-66). In addition, it has three centrally located coiled coil domains (residues 200-240,280-310 and 400-420) []. ; GO: 0031305 integral to mitochondrial inner membrane
Probab=21.75  E-value=7.9e+02  Score=30.63  Aligned_cols=9  Identities=11%  Similarity=0.076  Sum_probs=3.8

Q ss_pred             HhhHHHHHh
Q 001172          231 RGKLGRLIE  239 (1133)
Q Consensus       231 ~~hLG~LLq  239 (1133)
                      ..++..++.
T Consensus       119 ~~~~e~~~~  127 (582)
T PF09731_consen  119 EEALEEAAK  127 (582)
T ss_pred             hhhhhhhhh
Confidence            334444443


No 325
>KOG4709 consensus Uncharacterized conserved protein [Function unknown]
Probab=21.32  E-value=3e+02  Score=30.59  Aligned_cols=14  Identities=29%  Similarity=0.432  Sum_probs=5.9

Q ss_pred             CCCCCCccccCCCC
Q 001172          512 DSVDGGAEVLQPDS  525 (1133)
Q Consensus       512 d~~d~~~e~~~~d~  525 (1133)
                      |.--+.+-+|.++.
T Consensus       121 ~~esS~~p~l~te~  134 (217)
T KOG4709|consen  121 DLESSGAPLLGTEP  134 (217)
T ss_pred             cccccccccccccC
Confidence            43334444554443


No 326
>PRK05689 fliJ flagellar biosynthesis chaperone; Validated
Probab=21.29  E-value=8.7e+02  Score=24.64  Aligned_cols=69  Identities=13%  Similarity=0.107  Sum_probs=31.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhh
Q 001172          402 KIEVAYQEAVALKRQEELIREEEAAWLAESEQKAKRGAAEKEKKAKKKLAKQKRNNRKGKEKKREERSS  470 (1133)
Q Consensus       402 rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~~~e~~~kk~~k~kekk~~~k~ak~~e~~rk~~e~~~ee~~~  470 (1133)
                      .+...+.+++....+.--.-+.+-+...+.=.++.+..|.=||=+.++.++......|.+.|.-+|-+.
T Consensus        71 ~fi~~L~~~I~~q~~~v~~~~~~ve~~r~~~~~a~~~~k~lEkL~ek~~~~~~~~e~r~EQk~~DE~a~  139 (147)
T PRK05689         71 QFLQQLEKAITQQRQQLTQWTQKVDNARKYWQEKKQRLEALETLQERYQTEARLAENKREQKQMDEFAQ  139 (147)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHH
Confidence            334444444433333322233333333333344444445555555555555555555566666666443


No 327
>PRK04654 sec-independent translocase; Provisional
Probab=21.14  E-value=6e+02  Score=28.59  Aligned_cols=38  Identities=26%  Similarity=0.103  Sum_probs=22.5

Q ss_pred             HHhhHHhhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 001172          380 RRLTELGRRTVEIFVLAHIFSNKIEVAYQEAVALKRQEELIREEEAAWLA  429 (1133)
Q Consensus       380 ~~~~Elgrr~~~ifa~~k~F~~rv~~Ay~E~~A~~rQ~~Ll~E~E~e~~~  429 (1133)
                      ++|++++|.+-.++.       +++     ..+...+++|.+|++.++.+
T Consensus        23 erLPe~aRtlGk~ir-------k~R-----~~~~~vk~El~~El~~~ELr   60 (214)
T PRK04654         23 ERLPKAARFAGLWVR-------RAR-----MQWDSVKQELERELEAEELK   60 (214)
T ss_pred             hHHHHHHHHHHHHHH-------HHH-----HHHHHHHHHHHHhhhHHHHH
Confidence            466777777665544       222     12456677777777765543


No 328
>KOG2481 consensus Protein required for normal rRNA processing [RNA processing and modification]
Probab=21.05  E-value=1.8e+02  Score=36.18  Aligned_cols=30  Identities=20%  Similarity=0.298  Sum_probs=20.5

Q ss_pred             HHHHHHHHHHHHHHHHhhhhhhhHHHHHHh
Q 001172          440 AEKEKKAKKKLAKQKRNNRKGKEKKREERS  469 (1133)
Q Consensus       440 k~kekk~~~k~ak~~e~~rk~~e~~~ee~~  469 (1133)
                      +.|+|+.=+|..-.+.++++++++.++.+.
T Consensus       535 s~k~~~ly~k~~~~~~rk~~e~~~l~~kkk  564 (570)
T KOG2481|consen  535 SKKERKLYQKMKFSKSRKEQEALKLKKKKK  564 (570)
T ss_pred             chHHHHHHHHHhhhhhhhhhHHHHHHHHHH
Confidence            777777777777666666666666665443


No 329
>COG1390 NtpE Archaeal/vacuolar-type H+-ATPase subunit E [Energy production and conversion]
Probab=20.90  E-value=5.5e+02  Score=28.14  Aligned_cols=19  Identities=21%  Similarity=0.373  Sum_probs=7.6

Q ss_pred             cCCCCCCCCccccCCCCCC
Q 001172          509 DVSDSVDGGAEVLQPDSED  527 (1133)
Q Consensus       509 d~sd~~d~~~e~~~~d~ed  527 (1133)
                      .+.+.+.|.--+..+.-.|
T Consensus       112 ~~~~~~~~~~~iV~~~e~d  130 (194)
T COG1390         112 EALEKLLGGELVVYLNEKD  130 (194)
T ss_pred             HHHHhcCCCCeEEEeCccc
Confidence            3444444433334433333


No 330
>KOG0566 consensus Inositol-1,4,5-triphosphate 5-phosphatase (synaptojanin), INP51/INP52/INP53 family [Intracellular trafficking, secretion, and vesicular transport]
Probab=20.68  E-value=6.8e+02  Score=33.77  Aligned_cols=29  Identities=28%  Similarity=0.165  Sum_probs=24.0

Q ss_pred             hhhhhccccccCCCCCCCCCCCCCccccc
Q 001172          814 LARSVSAAGRLGPDLAPATHGYIPQSYRN  842 (1133)
Q Consensus       814 LsRS~sa~gr~g~dpS~~~~sy~pqsYrN  842 (1133)
                      ..+..+++.|+|+.+-.++|-+.||+--+
T Consensus      1015 ~~~~a~~a~~~~~~~~~~~P~~~~~s~~~ 1043 (1080)
T KOG0566|consen 1015 MPSPASAATLSGPWYVISKPLAPPQSNNG 1043 (1080)
T ss_pred             CCCccchhccCCCcccccCCccchhhccc
Confidence            67778888899998888899999988654


No 331
>PF11675 DUF3271:  Protein of unknown function (DUF3271);  InterPro: IPR021689  This family of proteins with unknown function appears to be restricted to Plasmodium. 
Probab=20.27  E-value=36  Score=38.26  Aligned_cols=48  Identities=31%  Similarity=0.300  Sum_probs=29.5

Q ss_pred             ccCCchhhhhhhhccccccccccccccccc--------ccCCCCccccccccCCCC
Q 001172          974 DEFPHLDIINDLLDDEHGVGMAAGASTVLQ--------SLSNGPHTLNRQFSFPRD 1021 (1133)
Q Consensus       974 dEFPHLDIINDLL~eeq~~g~~~~~~~~~~--------~~~~~~~~f~~~fs~~g~ 1021 (1133)
                      +-.|+||||||.|.||--+=+-+=...-+|        +++++.+.+.+-|+-.+.
T Consensus        49 ~h~~YLdiIN~il~~eSeN~Kyayeg~nYHwvITdFdIsIdNsS~~lk~~fs~~k~  104 (249)
T PF11675_consen   49 KHTKYLDIINDILRDESENIKYAYEGGNYHWVITDFDISIDNSSRYLKKKFSKNKI  104 (249)
T ss_pred             cchhHHHHHHHHHhccccccceeeeCCceEEEEEeeeEEecCCCHHHHhhhcccch
Confidence            457999999999999765422221111111        356666666666766664


No 332
>PRK07720 fliJ flagellar biosynthesis chaperone; Validated
Probab=20.20  E-value=9.3e+02  Score=24.52  Aligned_cols=12  Identities=33%  Similarity=0.166  Sum_probs=5.3

Q ss_pred             hHHHHHHHHHHH
Q 001172          439 AAEKEKKAKKKL  450 (1133)
Q Consensus       439 ~k~kekk~~~k~  450 (1133)
                      .|-|||+...-.
T Consensus       112 ekLker~~~~~~  123 (146)
T PRK07720        112 EKMKEKKQEMFA  123 (146)
T ss_pred             HHHHHHHHHHHH
Confidence            344444444333


No 333
>PF10669 Phage_Gp23:  Protein gp23 (Bacteriophage A118);  InterPro: IPR018926  This entry is represented by the major tail subunit protein, Gp23 of Listeria phage A118 and prophage found in Bacilli. The function is currently unknown. 
Probab=20.12  E-value=6.2e+02  Score=25.41  Aligned_cols=13  Identities=38%  Similarity=0.493  Sum_probs=6.3

Q ss_pred             ccCCCCCCCCCcc
Q 001172          493 DAQPLPEKPDVLE  505 (1133)
Q Consensus       493 ~~~~~~~k~d~~e  505 (1133)
                      -++.++-|+|-||
T Consensus       104 svgklahkvdlle  116 (121)
T PF10669_consen  104 SVGKLAHKVDLLE  116 (121)
T ss_pred             HHHHHHHHHHHHH
Confidence            3444555555444


Done!