Query         psy14697
Match_columns 175
No_of_seqs    111 out of 204
Neff          4.1 
Searched_HMMs 46136
Date          Fri Aug 16 18:18:59 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy14697.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14697hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF02064 MAS20:  MAS20 protein  100.0 9.3E-60   2E-64  366.7   6.3  121   51-171     1-121 (121)
  2 TIGR00985 3a0801s04tom mitocho 100.0 2.5E-58 5.4E-63  369.5  14.6  127   44-170     4-148 (148)
  3 KOG4056|consensus              100.0 8.7E-56 1.9E-60  352.1  13.5  132   44-175     6-143 (143)
  4 TIGR00990 3a0801s09 mitochondr  97.5 0.00049 1.1E-08   63.9   8.4   45  105-149   119-163 (615)
  5 PF00515 TPR_1:  Tetratricopept  94.5   0.061 1.3E-06   31.3   3.4   29  117-145     5-33  (34)
  6 PF07719 TPR_2:  Tetratricopept  94.0   0.083 1.8E-06   30.3   3.3   29  117-145     5-33  (34)
  7 PF13428 TPR_14:  Tetratricopep  90.7    0.64 1.4E-05   29.0   4.4   37  117-153     5-41  (44)
  8 smart00028 TPR Tetratricopepti  90.3    0.36 7.9E-06   24.8   2.6   29  117-145     5-33  (34)
  9 PF13371 TPR_9:  Tetratricopept  90.2    0.66 1.4E-05   30.6   4.3   39  117-155    33-71  (73)
 10 PF13176 TPR_7:  Tetratricopept  89.4    0.22 4.8E-06   30.1   1.4   32  117-148     3-34  (36)
 11 PF13181 TPR_8:  Tetratricopept  89.4    0.58 1.3E-05   26.9   3.2   28  117-144     5-32  (34)
 12 PF13414 TPR_11:  TPR repeat; P  88.3    0.78 1.7E-05   30.0   3.6   34  117-150     7-40  (69)
 13 PF14561 TPR_20:  Tetratricopep  88.3     1.1 2.5E-05   32.8   4.8   47  102-155    18-70  (90)
 14 PRK10370 formate-dependent nit  84.0      13 0.00029   30.3   9.4   39  117-155    77-115 (198)
 15 PRK11677 hypothetical protein;  83.7     9.5 0.00021   30.6   8.3   36  107-144    50-86  (134)
 16 PRK13183 psbN photosystem II r  83.2       1 2.2E-05   30.5   2.1   25   43-67      4-28  (46)
 17 PF13432 TPR_16:  Tetratricopep  79.3     2.6 5.7E-05   27.3   3.1   29  117-145    35-63  (65)
 18 PF14559 TPR_19:  Tetratricopep  79.1     7.1 0.00015   25.2   5.1   41  102-149    21-61  (68)
 19 PF04689 S1FA:  DNA binding pro  79.0     2.1 4.5E-05   31.1   2.6   38   51-88     19-62  (69)
 20 PF13432 TPR_16:  Tetratricopep  78.7     3.5 7.6E-05   26.7   3.6   34  118-151     2-35  (65)
 21 PF06295 DUF1043:  Protein of u  78.2      13 0.00028   29.0   7.2   36  108-145    47-83  (128)
 22 TIGR00540 hemY_coli hemY prote  78.1      34 0.00074   30.5  10.7   45  104-148   132-188 (409)
 23 PF10602 RPN7:  26S proteasome   77.7      12 0.00025   30.4   7.1   49  110-158    33-81  (177)
 24 PF13174 TPR_6:  Tetratricopept  77.7     2.8 6.2E-05   23.4   2.6   29  117-145     4-32  (33)
 25 CHL00020 psbN photosystem II p  77.6     1.8   4E-05   28.8   1.9   23   45-67      3-25  (43)
 26 PF10841 DUF2644:  Protein of u  75.7     2.9 6.3E-05   29.7   2.6   25   45-69     13-37  (60)
 27 cd00189 TPR Tetratricopeptide   74.7     7.1 0.00015   23.9   4.0   31  117-147    38-68  (100)
 28 PF13374 TPR_10:  Tetratricopep  72.8     4.6  0.0001   23.4   2.7   27  117-143     6-32  (42)
 29 PF02468 PsbN:  Photosystem II   70.5     3.6 7.8E-05   27.4   1.9   24   45-68      3-26  (43)
 30 TIGR02795 tol_pal_ybgF tol-pal  70.4     9.4  0.0002   26.3   4.2   33  119-151    82-114 (119)
 31 KOG4648|consensus               69.4     2.9 6.4E-05   39.6   1.9   30  118-147   102-132 (536)
 32 PF14559 TPR_19:  Tetratricopep  69.3     6.4 0.00014   25.4   3.0   28  123-150     1-28  (68)
 33 PF10516 SHNi-TPR:  SHNi-TPR;    68.5     6.4 0.00014   25.0   2.8   25  117-141     5-29  (38)
 34 PF01102 Glycophorin_A:  Glycop  66.6     4.1 8.9E-05   32.2   1.9   20   46-65     68-88  (122)
 35 COG4968 PilE Tfp pilus assembl  66.1      15 0.00032   30.0   5.0   33   75-116    29-61  (139)
 36 KOG4234|consensus               65.4      13 0.00028   33.0   4.8   53  100-152    62-135 (271)
 37 COG4235 Cytochrome c biogenesi  64.1      30 0.00064   31.1   7.0   39  117-155   160-198 (287)
 38 KOG0548|consensus               63.0     6.4 0.00014   38.2   2.8   33  113-145     2-34  (539)
 39 PF13424 TPR_12:  Tetratricopep  61.4     7.1 0.00015   26.1   2.1   33  111-143    44-76  (78)
 40 KOG1174|consensus               61.1      12 0.00025   36.3   4.1   39  116-154   474-512 (564)
 41 PF08572 PRP3:  pre-mRNA proces  60.6      25 0.00055   30.2   5.8   44   78-121   146-209 (223)
 42 cd08323 CARD_APAF1 Caspase act  60.5      20 0.00044   26.4   4.5   40  116-158    47-86  (86)
 43 PF12868 DUF3824:  Domain of un  59.7     2.9 6.4E-05   33.6  -0.0   18   46-63      7-24  (137)
 44 PRK02603 photosystem I assembl  59.4      27 0.00058   27.1   5.3   36  116-151    75-110 (172)
 45 KOG3119|consensus               59.1     5.2 0.00011   35.0   1.4   28   65-92    194-221 (269)
 46 PLN03098 LPA1 LOW PSII ACCUMUL  59.0     8.1 0.00018   36.8   2.7   41  107-147    69-109 (453)
 47 PRK15359 type III secretion sy  58.8      20 0.00043   27.5   4.5   38  102-146    88-125 (144)
 48 cd00189 TPR Tetratricopeptide   58.7      10 0.00022   23.2   2.3   30  116-145    71-100 (100)
 49 PRK11447 cellulose synthase su  57.6      47   0.001   34.0   8.0   48  102-149    92-148 (1157)
 50 KOG0624|consensus               57.4      20 0.00043   34.1   4.9   47  103-155    32-78  (504)
 51 PF07721 TPR_4:  Tetratricopept  56.9      14  0.0003   20.8   2.5   21  116-136     4-24  (26)
 52 TIGR02795 tol_pal_ybgF tol-pal  56.2      16 0.00034   25.2   3.2   30  117-146    43-72  (119)
 53 PRK10803 tol-pal system protei  55.0      29 0.00064   30.0   5.3   56  117-172   184-250 (263)
 54 PF13414 TPR_11:  TPR repeat; P  54.4      15 0.00033   23.8   2.7   28  117-144    41-69  (69)
 55 TIGR02552 LcrH_SycD type III s  53.9      29 0.00063   25.0   4.4   38  117-154    89-126 (135)
 56 PRK15363 pathogenicity island   53.9      32 0.00069   28.3   5.1   36  102-144    99-134 (157)
 57 PF13424 TPR_12:  Tetratricopep  53.5      17 0.00037   24.2   2.9   25  118-142    10-34  (78)
 58 PF11118 DUF2627:  Protein of u  51.7      17 0.00038   26.9   2.9   36   44-79     40-77  (77)
 59 COG1729 Uncharacterized protei  51.4      15 0.00032   32.6   2.9   54  119-172   184-248 (262)
 60 PF06522 B12D:  NADH-ubiquinone  50.8      17 0.00038   25.8   2.7   16   65-82     34-49  (73)
 61 PRK12370 invasion protein regu  50.3      37 0.00079   31.8   5.5   29  118-146   377-405 (553)
 62 CHL00033 ycf3 photosystem I as  49.8      31 0.00067   26.5   4.3   29  119-147    41-69  (168)
 63 PF06365 CD34_antigen:  CD34/Po  49.7      10 0.00022   32.5   1.6   19   56-74    112-130 (202)
 64 PF00731 AIRC:  AIR carboxylase  48.6     7.8 0.00017   31.5   0.7   20   71-90    129-148 (150)
 65 PF05942 PaREP1:  Archaeal PaRE  48.4      26 0.00056   26.4   3.5   28  114-141     7-34  (115)
 66 PF08066 PMC2NT:  PMC2NT (NUC01  48.3      34 0.00074   25.0   4.0   58   67-124    16-78  (91)
 67 KOG4555|consensus               47.4      19 0.00041   30.1   2.8   29  122-150    52-80  (175)
 68 KOG0553|consensus               46.9      27 0.00058   31.8   3.9   37  111-147   147-183 (304)
 69 PRK11234 nfrB bacteriophage N4  46.8      42  0.0009   33.5   5.5   57  105-163   639-698 (727)
 70 TIGR00990 3a0801s09 mitochondr  45.9      27 0.00058   32.8   3.9   37  116-152   545-581 (615)
 71 PRK09720 cybC cytochrome b562;  45.0      88  0.0019   24.2   6.0   60   73-133    16-82  (100)
 72 PF03704 BTAD:  Bacterial trans  44.8      32 0.00069   25.6   3.5   32  113-144     6-37  (146)
 73 PRK10866 outer membrane biogen  43.7 1.9E+02  0.0042   24.4   8.5   42  103-148    63-104 (243)
 74 PF12895 Apc3:  Anaphase-promot  43.2      37 0.00081   23.1   3.4   27  113-139    58-84  (84)
 75 PLN02539 glucose-6-phosphate 1  41.7      73  0.0016   30.6   6.1  115   31-172    23-150 (491)
 76 PRK09400 secE preprotein trans  41.2      30 0.00064   24.3   2.6   23   44-66     32-54  (61)
 77 KOG0543|consensus               41.0      82  0.0018   29.7   6.2   41  102-149   287-327 (397)
 78 smart00386 HAT HAT (Half-A-TPR  40.8      51  0.0011   17.6   3.2   27  127-153     1-27  (33)
 79 PRK15359 type III secretion sy  40.2      53  0.0011   25.1   4.2   30  117-146    62-91  (144)
 80 PLN03088 SGT1,  suppressor of   39.9      49  0.0011   29.4   4.5   33  117-149    40-72  (356)
 81 PF12301 CD99L2:  CD99 antigen   39.6      16 0.00035   30.3   1.3   32   40-71    109-140 (169)
 82 TIGR00327 secE_euk_arch protei  39.4      33 0.00071   24.2   2.6   24   44-67     28-51  (61)
 83 PF09976 TPR_21:  Tetratricopep  39.3      25 0.00054   26.6   2.2   23  118-140   123-145 (145)
 84 KOG4516|consensus               39.2     9.4  0.0002   30.3  -0.2   51   15-71     27-77  (118)
 85 PF03605 DcuA_DcuB:  Anaerobic   39.0      41 0.00088   31.3   3.9   22   64-85    184-205 (364)
 86 COG5343 Uncharacterized protei  38.9      32  0.0007   30.3   3.1   24   52-75    108-131 (240)
 87 PRK15174 Vi polysaccharide exp  38.5      67  0.0014   31.0   5.4   33  116-148   113-145 (656)
 88 PRK02603 photosystem I assembl  37.8      65  0.0014   25.0   4.4   27  120-146   113-139 (172)
 89 PTZ00459 mucin-associated surf  37.6      19 0.00042   31.9   1.5   10   40-49      1-10  (291)
 90 TIGR01495 ETRAMP Plasmodium ri  37.6      36 0.00078   25.3   2.8   16   59-74     67-82  (85)
 91 KOG1174|consensus               37.1      54  0.0012   31.9   4.5   74   55-154   299-372 (564)
 92 PF09976 TPR_21:  Tetratricopep  36.8      63  0.0014   24.3   4.1   33  114-146    49-81  (145)
 93 PF15147 DUF4578:  Domain of un  36.6      13 0.00029   29.7   0.4   11    4-14      7-17  (127)
 94 TIGR02521 type_IV_pilW type IV  36.6      80  0.0017   23.4   4.6   29  118-146    70-98  (234)
 95 PRK11788 tetratricopeptide rep  35.7      70  0.0015   27.2   4.6   39  117-155   286-324 (389)
 96 TIGR02521 type_IV_pilW type IV  35.4      80  0.0017   23.4   4.4   29  117-145    35-63  (234)
 97 TIGR02552 LcrH_SycD type III s  34.4      93   0.002   22.3   4.5   29  118-146    56-84  (135)
 98 PRK12489 anaerobic C4-dicarbox  34.4      47   0.001   31.4   3.6   22   63-84    191-212 (443)
 99 KOG1125|consensus               34.1      69  0.0015   31.6   4.7   59   99-157   403-474 (579)
100 TIGR03302 OM_YfiO outer membra  34.0 1.2E+02  0.0025   24.2   5.4   29  117-145   170-198 (235)
101 PF06305 DUF1049:  Protein of u  33.4      44 0.00095   22.4   2.5    7   78-84     47-53  (68)
102 PF13371 TPR_9:  Tetratricopept  33.3      54  0.0012   21.2   2.8   31  120-150     2-32  (73)
103 PRK10866 outer membrane biogen  32.9   1E+02  0.0022   26.0   5.2   33  118-150    37-69  (243)
104 PF11712 Vma12:  Endoplasmic re  32.7   2E+02  0.0043   22.5   6.4   28   65-92      9-39  (142)
105 COG2443 Sss1 Preprotein transl  32.6      48   0.001   23.8   2.6   22   45-66     34-55  (65)
106 PRK11189 lipoprotein NlpI; Pro  32.5   1E+02  0.0022   26.4   5.1   28  117-144   240-267 (296)
107 PF13838 Clathrin_H_link:  Clat  32.2      33 0.00072   24.5   1.8   27  110-139     6-32  (66)
108 PF06024 DUF912:  Nucleopolyhed  32.1      54  0.0012   24.6   3.0   17   51-67     67-83  (101)
109 PF08359 TetR_C_4:  YsiA-like p  31.8 1.8E+02   0.004   21.3   5.8   73   66-138    47-128 (133)
110 KOG0547|consensus               31.5      70  0.0015   31.6   4.3   41  106-146   108-149 (606)
111 PF11846 DUF3366:  Domain of un  31.4 2.7E+02  0.0058   22.1   8.9   27  119-145   150-176 (193)
112 PF12487 DUF3703:  Protein of u  31.3      88  0.0019   24.5   4.1   41  107-148     3-43  (112)
113 KOG2769|consensus               31.3 1.2E+02  0.0025   29.7   5.7   44   78-121   293-356 (522)
114 PF06703 SPC25:  Microsomal sig  31.1      34 0.00074   27.1   1.9   21   51-71     30-50  (162)
115 PRK14748 kdpF potassium-transp  31.0      54  0.0012   20.2   2.3   14   53-66      8-21  (29)
116 PRK10747 putative protoheme IX  31.0 2.6E+02  0.0056   25.0   7.6   32  117-148   332-363 (398)
117 PF08802 CytB6-F_Fe-S:  Cytochr  30.8      54  0.0012   21.3   2.4   22   46-67     14-35  (39)
118 TIGR03302 OM_YfiO outer membra  30.6 2.2E+02  0.0047   22.6   6.5   32  117-148    74-105 (235)
119 PRK09782 bacteriophage N4 rece  30.4 1.4E+02   0.003   30.9   6.4   95   68-171    38-140 (987)
120 PF09164 VitD-bind_III:  Vitami  30.3      40 0.00087   24.5   1.9   21   68-88      5-25  (68)
121 PRK01001 putative inner membra  30.2      65  0.0014   32.9   4.0   75   46-124     3-81  (795)
122 KOG0548|consensus               29.8 2.3E+02   0.005   27.9   7.4   29  120-148   365-393 (539)
123 PF04654 DUF599:  Protein of un  29.1      70  0.0015   27.0   3.5   32   55-86      6-37  (216)
124 KOG0909|consensus               29.0      55  0.0012   31.6   3.1   47  124-171    84-131 (500)
125 PLN03088 SGT1,  suppressor of   29.0   1E+02  0.0022   27.5   4.6   37  117-153    74-110 (356)
126 KOG3498|consensus               28.9      65  0.0014   23.4   2.8   22   44-65     33-54  (67)
127 KOG0772|consensus               28.2      59  0.0013   32.1   3.2   57  107-165   568-624 (641)
128 PRK02898 cobalt transport prot  28.0      38 0.00083   26.2   1.6   26   47-72     68-93  (100)
129 COG1687 AzlD Predicted branche  27.8      68  0.0015   25.1   2.9   24  143-166    29-52  (106)
130 PF09716 ETRAMP:  Malarial earl  27.8      59  0.0013   23.8   2.5   14   56-69     68-81  (84)
131 PRK10370 formate-dependent nit  27.6 2.9E+02  0.0063   22.4   6.8   31  117-147   148-178 (198)
132 PF13431 TPR_17:  Tetratricopep  27.5      81  0.0018   18.7   2.7   25  102-133     9-33  (34)
133 PF12688 TPR_5:  Tetratrico pep  27.2 1.3E+02  0.0028   23.1   4.5   31  115-145    40-70  (120)
134 PF12072 DUF3552:  Domain of un  27.2 1.7E+02  0.0036   24.2   5.4   16   50-65      5-20  (201)
135 PF10828 DUF2570:  Protein of u  27.2 2.7E+02  0.0059   20.9   7.2  100   49-165     5-105 (110)
136 PF13429 TPR_15:  Tetratricopep  27.1      96  0.0021   25.6   3.9   36  117-152   150-185 (280)
137 PLN03098 LPA1 LOW PSII ACCUMUL  26.9 1.1E+02  0.0024   29.4   4.7   38  102-143   105-142 (453)
138 PF04415 DUF515:  Protein of un  26.9 1.3E+02  0.0028   28.6   5.1   67   46-120    31-97  (416)
139 PRK11462 putative transporter;  26.5      58  0.0013   29.4   2.7   23   65-87    426-448 (460)
140 PRK11447 cellulose synthase su  26.4 2.2E+02  0.0048   29.3   7.0   27  119-145   391-417 (1157)
141 PF07862 Nif11:  Nitrogen fixat  25.7      41 0.00088   21.6   1.2   12   72-83     14-25  (49)
142 KOG3289|consensus               25.2      31 0.00068   29.6   0.7   54   20-76     23-92  (199)
143 TIGR01162 purE phosphoribosyla  25.1      90  0.0019   25.7   3.3   20   71-90    127-146 (156)
144 KOG2072|consensus               24.8 3.4E+02  0.0075   28.5   7.9   89   63-155   186-280 (988)
145 PRK00888 ftsB cell division pr  24.7 1.4E+02   0.003   22.6   4.1   29   58-86     12-41  (105)
146 COG1575 MenA 1,4-dihydroxy-2-n  24.5      21 0.00047   32.3  -0.4   15   66-80     66-83  (303)
147 PF14938 SNAP:  Soluble NSF att  24.4      63  0.0014   27.4   2.4   28  117-144   118-146 (282)
148 PF02526 GBP_repeat:  Glycophor  24.3      41 0.00089   21.7   1.0   10   73-82     15-24  (38)
149 PRK15058 cytochrome b562; Prov  24.1 2.7E+02  0.0058   22.4   5.8   22  112-133    89-110 (128)
150 PRK10747 putative protoheme IX  23.9 5.2E+02   0.011   23.0   8.8   34  120-153   160-193 (398)
151 smart00745 MIT Microtubule Int  23.9   1E+02  0.0022   21.1   3.0   20  129-148    31-50  (77)
152 PF02234 CDI:  Cyclin-dependent  23.8      58  0.0012   21.8   1.7   22  103-124     6-27  (51)
153 PF13480 Acetyltransf_6:  Acety  23.7      73  0.0016   22.7   2.3   37   76-116     2-38  (142)
154 COG1453 Predicted oxidoreducta  23.6      48   0.001   31.2   1.6   49   66-119    55-105 (391)
155 PF00646 F-box:  F-box domain;   23.6      62  0.0013   19.8   1.7   16  156-171     5-20  (48)
156 PF04341 DUF485:  Protein of un  23.4      65  0.0014   23.6   2.0   18   72-89      2-19  (91)
157 PF06864 PAP_PilO:  Pilin acces  23.3 1.5E+02  0.0032   27.1   4.7    9  133-141   274-282 (414)
158 COG4700 Uncharacterized protei  23.3 1.4E+02  0.0029   26.5   4.2   42  114-155    90-133 (251)
159 PF08631 SPO22:  Meiosis protei  23.3 1.5E+02  0.0032   25.3   4.5   46  103-148    25-71  (278)
160 PF10883 DUF2681:  Protein of u  23.1      75  0.0016   23.9   2.3    9   58-66     15-23  (87)
161 PF01034 Syndecan:  Syndecan do  23.0      29 0.00064   24.9   0.1    9   58-66     26-34  (64)
162 PF13525 YfiO:  Outer membrane   22.7      94   0.002   25.1   3.0   28  118-145    10-37  (203)
163 KOG1130|consensus               22.7 1.8E+02   0.004   28.5   5.3   60  110-169    14-87  (639)
164 PF14407 Frankia_peptide:  Ribo  22.6      45 0.00096   23.8   1.0   12   73-84     11-22  (61)
165 PRK11788 tetratricopeptide rep  22.5 1.7E+02  0.0037   24.8   4.7   25  117-141    73-97  (389)
166 PF12777 MT:  Microtubule-bindi  22.5      68  0.0015   28.6   2.4   23   56-84    316-338 (344)
167 PRK15179 Vi polysaccharide bio  22.4 2.1E+02  0.0046   28.4   5.9   36  115-150   156-191 (694)
168 PHA03065 Hypothetical protein;  22.4      87  0.0019   29.9   3.1   20   72-91     75-94  (438)
169 PRK10153 DNA-binding transcrip  22.3 4.6E+02  0.0099   25.0   7.9   77   53-145   376-452 (517)
170 PF14964 DUF4507:  Domain of un  22.2      84  0.0018   29.2   2.9   30  129-159   325-361 (362)
171 PRK14750 kdpF potassium-transp  22.1   1E+02  0.0022   19.1   2.3   15   52-66      7-21  (29)
172 KOG0624|consensus               22.1 1.5E+02  0.0032   28.5   4.5   68  100-167    92-160 (504)
173 TIGR03798 ocin_TIGR03798 bacte  22.0      40 0.00086   23.0   0.6   15   69-83      9-23  (64)
174 TIGR02917 PEP_TPR_lipo putativ  21.8 1.7E+02  0.0037   26.7   4.8   34  117-150    60-93  (899)
175 TIGR03644 marine_trans_1 proba  21.6      97  0.0021   28.8   3.2   22   44-65    243-264 (404)
176 smart00299 CLH Clathrin heavy   21.4 2.4E+02  0.0051   20.9   4.8   39  126-167    95-133 (140)
177 PF12569 NARP1:  NMDA receptor-  21.4 1.7E+02  0.0037   28.0   4.9   23  119-141   462-484 (517)
178 cd02577 PSTD1 PSTD1: Pseudouri  21.2 5.4E+02   0.012   23.1   7.8   15   59-73    133-147 (319)
179 COG1930 CbiN ABC-type cobalt t  21.0      34 0.00073   26.4   0.1   33   33-73     60-92  (97)
180 KOG0425|consensus               20.7      73  0.0016   26.8   2.0   20   70-89    145-164 (171)
181 KOG3060|consensus               20.7 1.1E+02  0.0024   27.7   3.3   37  118-154   193-233 (289)
182 PF11817 Foie-gras_1:  Foie gra  20.5      96  0.0021   26.1   2.8   30  117-146   182-211 (247)
183 PF02158 Neuregulin:  Neureguli  20.4      34 0.00073   32.3   0.0   33   51-88     15-48  (404)
184 PRK12370 invasion protein regu  20.4 1.5E+02  0.0033   27.7   4.3   35  117-151   342-376 (553)
185 PF08690 GET2:  GET complex sub  20.3      69  0.0015   28.7   1.9   13   78-90      7-19  (302)
186 KOG4162|consensus               20.1 1.4E+02  0.0029   30.7   4.0   27  117-143   758-784 (799)
187 PF13512 TPR_18:  Tetratricopep  20.0 1.4E+02  0.0031   24.1   3.5   34  115-148    49-82  (142)

No 1  
>PF02064 MAS20:  MAS20 protein import receptor;  InterPro: IPR002056 Virtually all mitochondrial precursors are imported via the same mechanism []: precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 []. MAS20 is also known as TOM20.; GO: 0006605 protein targeting, 0006886 intracellular protein transport, 0005742 mitochondrial outer membrane translocase complex; PDB: 3AX3_A 3AWR_B 2V1S_A 3AX5_C 3AX2_C 1OM2_A 2V1T_B.
Probab=100.00  E-value=9.3e-60  Score=366.74  Aligned_cols=121  Identities=50%  Similarity=0.899  Sum_probs=68.5

Q ss_pred             HHHHHHHHHHhhhhheeccCCCCChhhHHHhHHHHHHhHHHHHHhhcCCCCCCCChHHHHHHHHHHHHhHhHHhhcCChh
Q psy14697         51 TAVAVGAAVVVGYCLYFDKKRRSDPLFKEKLKERRRKNRELLQQKNKRGIPDLKDHSAVQQYFLQEIQLGETLLAAGDLD  130 (175)
Q Consensus        51 ~~agvaga~flGYcIYFD~KRR~dP~FRKkLrerRrk~~~~~~~a~~~~~p~~~d~e~~E~fFl~eV~lGE~L~a~G~~~  130 (175)
                      +++|++|++|+|||||||||||||||||||||+|||+++++++.+....+|+++|++++|+|||+||++||+|+++|+++
T Consensus         1 ~~ag~a~~~~lgYciYFD~KRR~dP~frkkL~~rrk~~~~~~~~~~~~~~~~~~d~~~~e~~Fl~qV~lGE~L~~~G~~~   80 (121)
T PF02064_consen    1 IAAGVAAAAFLGYCIYFDYKRRSDPDFRKKLRERRKKQAKAQEQAKSEKLPDPNDPEEMERFFLQQVQLGEQLLAQGDYE   80 (121)
T ss_dssp             ------------------------------------------------SSSHHHHHHHHHHHHHHHHHHHHHHHHTT-HH
T ss_pred             ChHHHHHHHHHHHHhhcccccccChHHHHHHHHHHHHHHHHHhhhhcccccCcCCHHHHHHHHHHHHHHHHHHHhCCCHH
Confidence            47888999999999999999999999999999999988876556677889999999999999999999999999999999


Q ss_pred             hHHHHHHHHHHhcCChHHHHHHHhhcCCHHHHHHHHHhcCC
Q psy14697        131 NGVEHLANALTVCGQPNQLLGVLQQTLPPNVFNALIEKLPP  171 (175)
Q Consensus       131 eaa~hf~nAl~VcpqP~~LL~IyQqTLPp~Vf~~Li~kLp~  171 (175)
                      +|++||||||+|||||++||+|||+||||+||++|+++||+
T Consensus        81 ~aa~hf~nAl~V~~qP~~LL~i~q~tlP~~vf~~l~~~l~e  121 (121)
T PF02064_consen   81 EAAEHFYNALKVCPQPAELLQIYQKTLPPPVFEMLVQKLPE  121 (121)
T ss_dssp             HHHHHHHHHHHTSSSHHHHHHHHHHHS-HHHHHHHHHTS--
T ss_pred             HHHHHHHHHHHhCCCHHHHHHHHHhhCCHHHHHHHHHHcCC
Confidence            99999999999999999999999999999999999999996


No 2  
>TIGR00985 3a0801s04tom mitochondrial import receptor subunit translocase of outer membrane 20 kDa subunit.
Probab=100.00  E-value=2.5e-58  Score=369.53  Aligned_cols=127  Identities=45%  Similarity=0.746  Sum_probs=116.3

Q ss_pred             hhhhHHHHHHHHHHHHHhhhhheeccCCCCChhhHHHhHHHHHHhHHHH--------HH---------hhcCCCCCCCCh
Q psy14697         44 ISGNKVVTAVAVGAAVVVGYCLYFDKKRRSDPLFKEKLKERRRKNRELL--------QQ---------KNKRGIPDLKDH  106 (175)
Q Consensus        44 ~~~~~~~~~agvaga~flGYcIYFD~KRR~dP~FRKkLrerRrk~~~~~--------~~---------a~~~~~p~~~d~  106 (175)
                      ++++++++++|++|++|+||||||||||||||||||+||+|||++++++        ++         ++....|+|+|+
T Consensus         4 ~~~~~~~~~ag~a~~~flgYciYFD~KRR~dPdFRkkLr~rr~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~p~d~   83 (148)
T TIGR00985         4 SNKSNVVIAAGIAAAAFLGYAIYFDYKRRNDPDFRKKLRRRRKKQAGAEKQYGGLAKEKQRQRIRPAAAGLAKAPDPTDP   83 (148)
T ss_pred             chHHHHHHHHHHHHHHHHHHHHhhhhhhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCCH
Confidence            5677888999999999999999999999999999999999998877431        10         234568999999


Q ss_pred             HHHHHHHHHHHHhHhHHhhcC-ChhhHHHHHHHHHHhcCChHHHHHHHhhcCCHHHHHHHHHhcC
Q psy14697        107 SAVQQYFLQEIQLGETLLAAG-DLDNGVEHLANALTVCGQPNQLLGVLQQTLPPNVFNALIEKLP  170 (175)
Q Consensus       107 e~~E~fFl~eV~lGE~L~a~G-~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLPp~Vf~~Li~kLp  170 (175)
                      +++|+|||+||++||+|+++| +.++|+.||||||+|||||++||+|||+|||++||+|||.++|
T Consensus        84 ~e~E~~Fl~eV~~GE~L~~~g~~~~ega~hf~nAl~Vc~qP~~LL~iyq~tlP~~Vf~~lia~~p  148 (148)
T TIGR00985        84 SEKEAFFLQEVQLGEELMAQGTNVDEGAVHFYNALKVYPQPQQLLSIYQQTLPEPIFRLMIAQLP  148 (148)
T ss_pred             HHHHHHHHHHHHHHHHHHhCCCchHHHHHHHHHHHHhCCCHHHHHHHHHhhCCHHHHHHHHhhcC
Confidence            999999999999999999999 8999999999999999999999999999999999999999987


No 3  
>KOG4056|consensus
Probab=100.00  E-value=8.7e-56  Score=352.09  Aligned_cols=132  Identities=50%  Similarity=0.831  Sum_probs=121.1

Q ss_pred             hhhhHHHHHHHHHHHHHhhhhheeccCCCCChhhHHHhHHHHHHhHHH---HHHhhcCC---CCCCCChHHHHHHHHHHH
Q psy14697         44 ISGNKVVTAVAVGAAVVVGYCLYFDKKRRSDPLFKEKLKERRRKNREL---LQQKNKRG---IPDLKDHSAVQQYFLQEI  117 (175)
Q Consensus        44 ~~~~~~~~~agvaga~flGYcIYFD~KRR~dP~FRKkLrerRrk~~~~---~~~a~~~~---~p~~~d~e~~E~fFl~eV  117 (175)
                      ...+++.+++|+||++|+||||||||||||||+|||+||+||+++..+   +++++.++   .|||+|++++|+|||+||
T Consensus         6 ~v~~~~vI~agiag~af~gYciYFd~KRrsdP~fk~~lr~rrk~q~~ae~a~~~~a~sk~~~~pd~~d~~~~E~~Fmqqv   85 (143)
T KOG4056|consen    6 PVLRTSVIAAGIAGLAFIGYCIYFDKKRRSDPDFKKKLRERRKKQASAELAKERAAASKADNIPDPSDAEEVEKFFMQQV   85 (143)
T ss_pred             chhhhHHHHHHHHHHHHHHHHhhcccccccChhHHHHHHHHHHHHhhcccccccchhhccccCCCCCCHHHHHHHHHHHH
Confidence            445667799999999999999999999999999999999999998843   34444555   999999999999999999


Q ss_pred             HhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhhcCCHHHHHHHHHhcCCCCCC
Q psy14697        118 QLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQTLPPNVFNALIEKLPPAGIV  175 (175)
Q Consensus       118 ~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLPp~Vf~~Li~kLp~~~i~  175 (175)
                      ++||+|+++|++++||+||||||.|||||++||+|||+||||+||++|+.+||.+.++
T Consensus        86 ~lGE~L~~qg~~e~ga~h~~nAi~vcgqpaqLL~vlq~tlp~~if~~i~~~l~~~~~~  143 (143)
T KOG4056|consen   86 QLGEELLAQGNEEEGAEHLANAIVVCGQPAQLLQVLQQTLPEAIFAMLLKKLPSASER  143 (143)
T ss_pred             HhHHHHHHccCHHHHHHHHHHHHhhcCCHHHHHHHHHhhCCHHHHHHHHHHhhcccCC
Confidence            9999999999999999999999999999999999999999999999999999998753


No 4  
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=97.47  E-value=0.00049  Score=63.92  Aligned_cols=45  Identities=11%  Similarity=0.200  Sum_probs=39.6

Q ss_pred             ChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHH
Q psy14697        105 DHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQL  149 (175)
Q Consensus       105 d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~L  149 (175)
                      ++++++.+...-..+|..+...|++++|+.+|.+||...|+|.-.
T Consensus       119 ~~~~~~~~a~~~k~~G~~~~~~~~~~~Ai~~y~~al~~~p~~~~~  163 (615)
T TIGR00990       119 SEEERKKYAAKLKEKGNKAYRNKDFNKAIKLYSKAIECKPDPVYY  163 (615)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHhcCCchHHH
Confidence            455778888888999999999999999999999999999987543


No 5  
>PF00515 TPR_1:  Tetratricopeptide repeat;  InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=94.54  E-value=0.061  Score=31.34  Aligned_cols=29  Identities=24%  Similarity=0.410  Sum_probs=25.5

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQ  145 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcpq  145 (175)
                      ..+|..+...|++++|+.+|.+||.+.|+
T Consensus         5 ~~~g~~~~~~~~~~~A~~~~~~al~~~p~   33 (34)
T PF00515_consen    5 YNLGNAYFQLGDYEEALEYYQRALELDPD   33 (34)
T ss_dssp             HHHHHHHHHTT-HHHHHHHHHHHHHHSTT
T ss_pred             HHHHHHHHHhCCchHHHHHHHHHHHHCcC
Confidence            46899999999999999999999999885


No 6  
>PF07719 TPR_2:  Tetratricopeptide repeat;  InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=94.04  E-value=0.083  Score=30.28  Aligned_cols=29  Identities=21%  Similarity=0.384  Sum_probs=25.7

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQ  145 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcpq  145 (175)
                      ..+|..+...|++++|..||-+|+..+|+
T Consensus         5 ~~lg~~~~~~~~~~~A~~~~~~al~l~p~   33 (34)
T PF07719_consen    5 YYLGQAYYQLGNYEEAIEYFEKALELDPN   33 (34)
T ss_dssp             HHHHHHHHHTT-HHHHHHHHHHHHHHSTT
T ss_pred             HHHHHHHHHhCCHHHHHHHHHHHHHHCcC
Confidence            46899999999999999999999999986


No 7  
>PF13428 TPR_14:  Tetratricopeptide repeat
Probab=90.73  E-value=0.64  Score=29.00  Aligned_cols=37  Identities=19%  Similarity=0.199  Sum_probs=32.6

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHH
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVL  153 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~Iy  153 (175)
                      ..+|+.+...|++++|...|-++|+.+|+-.+....|
T Consensus         5 ~~la~~~~~~G~~~~A~~~~~~~l~~~P~~~~a~~~L   41 (44)
T PF13428_consen    5 LALARAYRRLGQPDEAERLLRRALALDPDDPEAWRAL   41 (44)
T ss_pred             HHHHHHHHHcCCHHHHHHHHHHHHHHCcCCHHHHHHh
Confidence            5789999999999999999999999999887766544


No 8  
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=90.34  E-value=0.36  Score=24.76  Aligned_cols=29  Identities=31%  Similarity=0.460  Sum_probs=25.0

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQ  145 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcpq  145 (175)
                      ..+|..+...|++++|+.+|.+++.+.|+
T Consensus         5 ~~~a~~~~~~~~~~~a~~~~~~~~~~~~~   33 (34)
T smart00028        5 YNLGNAYLKLGDYDEALEYYEKALELDPN   33 (34)
T ss_pred             HHHHHHHHHHhhHHHHHHHHHHHHccCCC
Confidence            46788899999999999999999988653


No 9  
>PF13371 TPR_9:  Tetratricopeptide repeat
Probab=90.21  E-value=0.66  Score=30.61  Aligned_cols=39  Identities=18%  Similarity=0.267  Sum_probs=34.6

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhh
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQ  155 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQq  155 (175)
                      ...|..+...|++++|...|-+++..+|++.+...+.++
T Consensus        33 ~~~a~~~~~~g~~~~A~~~l~~~l~~~p~~~~~~~~~a~   71 (73)
T PF13371_consen   33 LQRARCLFQLGRYEEALEDLERALELSPDDPDARALRAM   71 (73)
T ss_pred             HHHHHHHHHhccHHHHHHHHHHHHHHCCCcHHHHHHHHh
Confidence            347889999999999999999999999999998887654


No 10 
>PF13176 TPR_7:  Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=89.45  E-value=0.22  Score=30.13  Aligned_cols=32  Identities=25%  Similarity=0.468  Sum_probs=26.2

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQ  148 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~  148 (175)
                      ..+|..+...|++++|++.+-+||.+..+|.+
T Consensus         3 ~~Lg~~~~~~g~~~~Ai~~y~~aL~l~~~~~~   34 (36)
T PF13176_consen    3 NNLGRIYRQQGDYEKAIEYYEQALALARDPED   34 (36)
T ss_dssp             HHHHHHHHHCT-HHHHHHHHHHHHHHHHHCT-
T ss_pred             HHHHHHHHHcCCHHHHHHHHHHHHHhcccccC
Confidence            36899999999999999999999988776653


No 11 
>PF13181 TPR_8:  Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=89.39  E-value=0.58  Score=26.88  Aligned_cols=28  Identities=25%  Similarity=0.395  Sum_probs=25.5

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcC
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCG  144 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcp  144 (175)
                      ..+|..+...|++++|..+|-+|+.+.|
T Consensus         5 ~~lg~~y~~~~~~~~A~~~~~~a~~~~~   32 (34)
T PF13181_consen    5 YNLGKIYEQLGDYEEALEYFEKALELNP   32 (34)
T ss_dssp             HHHHHHHHHTTSHHHHHHHHHHHHHHHT
T ss_pred             HHHHHHHHHcCCHHHHHHHHHHHHhhCC
Confidence            4689999999999999999999999876


No 12 
>PF13414 TPR_11:  TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=88.33  E-value=0.78  Score=30.02  Aligned_cols=34  Identities=18%  Similarity=0.375  Sum_probs=28.8

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHH
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLL  150 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL  150 (175)
                      ..+|..+...|++++|+.+|.+||.+.|+=.+++
T Consensus         7 ~~~g~~~~~~~~~~~A~~~~~~ai~~~p~~~~~~   40 (69)
T PF13414_consen    7 YNLGQIYFQQGDYEEAIEYFEKAIELDPNNAEAY   40 (69)
T ss_dssp             HHHHHHHHHTTHHHHHHHHHHHHHHHSTTHHHHH
T ss_pred             HHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHH
Confidence            4678999999999999999999999988655543


No 13 
>PF14561 TPR_20:  Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=88.33  E-value=1.1  Score=32.79  Aligned_cols=47  Identities=26%  Similarity=0.400  Sum_probs=36.5

Q ss_pred             CCCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCCh------HHHHHHHhh
Q psy14697        102 DLKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQP------NQLLGVLQQ  155 (175)
Q Consensus       102 ~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP------~~LL~IyQq  155 (175)
                      +|+|++++       .++++.++..|++++|.++|-..++.-+..      ..||.||+-
T Consensus        18 ~P~D~~ar-------~~lA~~~~~~g~~e~Al~~Ll~~v~~dr~~~~~~ar~~ll~~f~~   70 (90)
T PF14561_consen   18 NPDDLDAR-------YALADALLAAGDYEEALDQLLELVRRDRDYEDDAARKRLLDIFEL   70 (90)
T ss_dssp             STT-HHHH-------HHHHHHHHHTT-HHHHHHHHHHHHCC-TTCCCCHHHHHHHHHHHH
T ss_pred             CCCCHHHH-------HHHHHHHHHCCCHHHHHHHHHHHHHhCccccccHHHHHHHHHHHH
Confidence            46787766       678999999999999999999999987766      677777763


No 14 
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=83.98  E-value=13  Score=30.30  Aligned_cols=39  Identities=31%  Similarity=0.388  Sum_probs=33.3

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhh
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQ  155 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQq  155 (175)
                      ..+|..+...|++++|+.+|.+|++.-|+-.+++.-|-.
T Consensus        77 ~~Lg~~~~~~g~~~~A~~a~~~Al~l~P~~~~~~~~lA~  115 (198)
T PRK10370         77 ALLGEYYLWRNDYDNALLAYRQALQLRGENAELYAALAT  115 (198)
T ss_pred             HHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHH
Confidence            668999999999999999999999999987777655433


No 15 
>PRK11677 hypothetical protein; Provisional
Probab=83.73  E-value=9.5  Score=30.60  Aligned_cols=36  Identities=17%  Similarity=0.216  Sum_probs=28.0

Q ss_pred             HHHHHHHHHHHHhHhHHhhcCChhhHHHHHHH-HHHhcC
Q psy14697        107 SAVQQYFLQEIQLGETLLAAGDLDNGVEHLAN-ALTVCG  144 (175)
Q Consensus       107 e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~n-Al~Vcp  144 (175)
                      .++...|.+-.+|=+.|.  .+|.+=-.|+++ |-.+||
T Consensus        50 qeV~~HFa~TA~Ll~~L~--~~Y~~Ly~HlA~~s~~Llp   86 (134)
T PRK11677         50 QELVSHFARSAELLDTMA--KDYRQLYQHMAKSSSELLP   86 (134)
T ss_pred             HHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHcc
Confidence            357889998888877776  467788889977 667777


No 16 
>PRK13183 psbN photosystem II reaction center protein N; Provisional
Probab=83.23  E-value=1  Score=30.46  Aligned_cols=25  Identities=8%  Similarity=0.215  Sum_probs=18.0

Q ss_pred             hhhhhHHHHHHHHHHHHHhhhhhee
Q psy14697         43 LISGNKVVTAVAVGAAVVVGYCLYF   67 (175)
Q Consensus        43 m~~~~~~~~~agvaga~flGYcIYF   67 (175)
                      |-+.+.+++..+....+++||+||=
T Consensus         4 me~A~~~~i~i~~lL~~~TgyaiYt   28 (46)
T PRK13183          4 MSPALSLAITILAILLALTGFGIYT   28 (46)
T ss_pred             cchhHHHHHHHHHHHHHHhhheeee
Confidence            3456666666666667899999993


No 17 
>PF13432 TPR_16:  Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=79.28  E-value=2.6  Score=27.30  Aligned_cols=29  Identities=24%  Similarity=0.326  Sum_probs=25.4

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQ  145 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcpq  145 (175)
                      ..+|..+..+|++++|+..|-.++...|+
T Consensus        35 ~~lg~~~~~~g~~~~A~~~~~~a~~~~P~   63 (65)
T PF13432_consen   35 YLLGRILYQQGRYDEALAYYERALELDPD   63 (65)
T ss_dssp             HHHHHHHHHTT-HHHHHHHHHHHHHHSTT
T ss_pred             HHHHHHHHHcCCHHHHHHHHHHHHHHCcC
Confidence            56899999999999999999999999886


No 18 
>PF14559 TPR_19:  Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=79.07  E-value=7.1  Score=25.17  Aligned_cols=41  Identities=20%  Similarity=0.225  Sum_probs=31.8

Q ss_pred             CCCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHH
Q psy14697        102 DLKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQL  149 (175)
Q Consensus       102 ~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~L  149 (175)
                      +|.|++.+       +.+++.++..|++++|...|-+++...|+-.+.
T Consensus        21 ~p~~~~~~-------~~la~~~~~~g~~~~A~~~l~~~~~~~~~~~~~   61 (68)
T PF14559_consen   21 NPDNPEAR-------LLLAQCYLKQGQYDEAEELLERLLKQDPDNPEY   61 (68)
T ss_dssp             TTTSHHHH-------HHHHHHHHHTT-HHHHHHHHHCCHGGGTTHHHH
T ss_pred             CCCCHHHH-------HHHHHHHHHcCCHHHHHHHHHHHHHHCcCHHHH
Confidence            34566655       468999999999999999999999999984333


No 19 
>PF04689 S1FA:  DNA binding protein S1FA;  InterPro: IPR006779  S1FA is an unusual small plant peptide of only 70 amino acids with a basic domain which contains a nuclear localization signal and a putative DNA binding helix. S1FA is highly conserved between dicotyledonous and monocotyledonous plants and may be a DNA-binding protein that specifically recognises the negative promoter element S1F [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=78.97  E-value=2.1  Score=31.07  Aligned_cols=38  Identities=29%  Similarity=0.530  Sum_probs=21.2

Q ss_pred             HHHHHHHHHHhh-hhheeccC-----CCCChhhHHHhHHHHHHh
Q psy14697         51 TAVAVGAAVVVG-YCLYFDKK-----RRSDPLFKEKLKERRRKN   88 (175)
Q Consensus        51 ~~agvaga~flG-YcIYFD~K-----RR~dP~FRKkLrerRrk~   88 (175)
                      .+++.+.+.++| |.+|.=-|     |...|--+||++++|-+|
T Consensus        19 vV~g~ll~flvGnyvlY~Yaqk~lpp~kkkpvskkk~k~e~lkq   62 (69)
T PF04689_consen   19 VVAGLLLVFLVGNYVLYVYAQKTLPPKKKKPVSKKKMKRERLKQ   62 (69)
T ss_pred             hHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccHHHHHHHHHhc
Confidence            455555566666 77775433     344555566666555443


No 20 
>PF13432 TPR_16:  Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=78.72  E-value=3.5  Score=26.69  Aligned_cols=34  Identities=21%  Similarity=0.261  Sum_probs=29.1

Q ss_pred             HhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHH
Q psy14697        118 QLGETLLAAGDLDNGVEHLANALTVCGQPNQLLG  151 (175)
Q Consensus       118 ~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~  151 (175)
                      ++|..+..+|++++|..+|-.++...|+=.+.+.
T Consensus         2 ~~a~~~~~~g~~~~A~~~~~~~l~~~P~~~~a~~   35 (65)
T PF13432_consen    2 ALARALYQQGDYDEAIAAFEQALKQDPDNPEAWY   35 (65)
T ss_dssp             HHHHHHHHCTHHHHHHHHHHHHHCCSTTHHHHHH
T ss_pred             hHHHHHHHcCCHHHHHHHHHHHHHHCCCCHHHHH
Confidence            5789999999999999999999999987555443


No 21 
>PF06295 DUF1043:  Protein of unknown function (DUF1043);  InterPro: IPR009386 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=78.16  E-value=13  Score=29.00  Aligned_cols=36  Identities=25%  Similarity=0.290  Sum_probs=26.7

Q ss_pred             HHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHH-HhcCC
Q psy14697        108 AVQQYFLQEIQLGETLLAAGDLDNGVEHLANAL-TVCGQ  145 (175)
Q Consensus       108 ~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl-~Vcpq  145 (175)
                      ++..+|.+..++-..|.  .+|.+--.|+++.- .+||+
T Consensus        47 ~V~~HF~~ta~Ll~~l~--~~Y~~l~~Hla~~a~~Ll~~   83 (128)
T PF06295_consen   47 EVNDHFAQTAELLDNLT--QDYQKLYQHLAKGAEELLPD   83 (128)
T ss_pred             HHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHhccC
Confidence            58999999998888886  45777777887753 44553


No 22 
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=78.09  E-value=34  Score=30.53  Aligned_cols=45  Identities=18%  Similarity=0.226  Sum_probs=31.1

Q ss_pred             CChHHHHHHHHHH------------HHhHhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697        104 KDHSAVQQYFLQE------------IQLGETLLAAGDLDNGVEHLANALTVCGQPNQ  148 (175)
Q Consensus       104 ~d~e~~E~fFl~e------------V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~  148 (175)
                      .|.+...+||.+-            +..++.++.+|++++|...|...+...|+-.+
T Consensus       132 g~~~~A~~~l~~a~~~~p~~~l~~~~~~a~l~l~~~~~~~Al~~l~~l~~~~P~~~~  188 (409)
T TIGR00540       132 GDEARANQHLEEAAELAGNDNILVEIARTRILLAQNELHAARHGVDKLLEMAPRHKE  188 (409)
T ss_pred             CCHHHHHHHHHHHHHhCCcCchHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHH
Confidence            5666666676552            22366666678888888888888888886553


No 23 
>PF10602 RPN7:  26S proteasome subunit RPN7;  InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis [].  The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity [].   The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=77.70  E-value=12  Score=30.43  Aligned_cols=49  Identities=14%  Similarity=0.301  Sum_probs=40.7

Q ss_pred             HHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhhcCC
Q psy14697        110 QQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQTLP  158 (175)
Q Consensus       110 E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLP  158 (175)
                      |.-=+-...+|+.+...|+.++|+.++.++..=|..|...++++=..+-
T Consensus        33 esir~~~~~l~~~~~~~Gd~~~A~k~y~~~~~~~~~~~~~id~~l~~ir   81 (177)
T PF10602_consen   33 ESIRMALEDLADHYCKIGDLEEALKAYSRARDYCTSPGHKIDMCLNVIR   81 (177)
T ss_pred             HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhcCCHHHHHHHHHHHHH
Confidence            3333456789999999999999999999999999999999988765443


No 24 
>PF13174 TPR_6:  Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=77.66  E-value=2.8  Score=23.40  Aligned_cols=29  Identities=17%  Similarity=0.301  Sum_probs=25.6

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQ  145 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcpq  145 (175)
                      ..+|..+...|++++|...|-+.+.-||+
T Consensus         4 ~~~a~~~~~~g~~~~A~~~~~~~~~~~P~   32 (33)
T PF13174_consen    4 YRLARCYYKLGDYDEAIEYFQRLIKRYPD   32 (33)
T ss_dssp             HHHHHHHHHHCHHHHHHHHHHHHHHHSTT
T ss_pred             HHHHHHHHHccCHHHHHHHHHHHHHHCcC
Confidence            46788888889999999999999999986


No 25 
>CHL00020 psbN photosystem II protein N
Probab=77.62  E-value=1.8  Score=28.83  Aligned_cols=23  Identities=22%  Similarity=0.291  Sum_probs=17.0

Q ss_pred             hhhHHHHHHHHHHHHHhhhhhee
Q psy14697         45 SGNKVVTAVAVGAAVVVGYCLYF   67 (175)
Q Consensus        45 ~~~~~~~~agvaga~flGYcIYF   67 (175)
                      +.+.+++..+....+++||+||=
T Consensus         3 ~A~~~~i~i~~ll~~~Tgy~iYt   25 (43)
T CHL00020          3 TATLVAIFISGLLVSFTGYALYT   25 (43)
T ss_pred             chhhHHHHHHHHHHHhhheeeee
Confidence            45566666666667899999993


No 26 
>PF10841 DUF2644:  Protein of unknown function (DUF2644);  InterPro: IPR020300 This entry is represented by Bacteriophage PY100, Orf49. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.  This entry contains membrane proteins with no known function.
Probab=75.65  E-value=2.9  Score=29.67  Aligned_cols=25  Identities=24%  Similarity=0.382  Sum_probs=18.1

Q ss_pred             hhhHHHHHHHHHHHHHhhhhheecc
Q psy14697         45 SGNKVVTAVAVGAAVVVGYCLYFDK   69 (175)
Q Consensus        45 ~~~~~~~~agvaga~flGYcIYFD~   69 (175)
                      +...+-....++.++++-||+|.||
T Consensus        13 TT~~iQffg~lv~agil~~~vy~dr   37 (60)
T PF10841_consen   13 TTAFIQFFGALVMAGILIYCVYLDR   37 (60)
T ss_pred             hHHHHHHHHHHHHHHHHHhheeeCC
Confidence            3444556666666789999999995


No 27 
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=74.72  E-value=7.1  Score=23.86  Aligned_cols=31  Identities=16%  Similarity=0.263  Sum_probs=22.3

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCChH
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQPN  147 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~  147 (175)
                      ..+|..+...|++++|..+|-+++...|...
T Consensus        38 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~   68 (100)
T cd00189          38 YNLAAAYYKLGKYEEALEDYEKALELDPDNA   68 (100)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhCCCcch
Confidence            4566777777777778888888777776655


No 28 
>PF13374 TPR_10:  Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=72.77  E-value=4.6  Score=23.44  Aligned_cols=27  Identities=26%  Similarity=0.370  Sum_probs=21.9

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhc
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVC  143 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~Vc  143 (175)
                      -.+|..+..+|++++|...+-.|+.++
T Consensus         6 ~~la~~~~~~g~~~~A~~~~~~al~~~   32 (42)
T PF13374_consen    6 NNLANAYRAQGRYEEALELLEEALEIR   32 (42)
T ss_dssp             HHHHHHHHHCT-HHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHhhhhcchhhHHHHHHHHHH
Confidence            467888999999999999998887754


No 29 
>PF02468 PsbN:  Photosystem II reaction centre N protein (psbN);  InterPro: IPR003398 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection [].   This family represents the low molecular weight transmembrane protein PsbN found in PSII. PsbN may have a role in PSII stability, however its actual function unknown. PsbN does not appear to be essential for photoautotrophic growth or normal PSII function.; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane
Probab=70.52  E-value=3.6  Score=27.43  Aligned_cols=24  Identities=13%  Similarity=0.329  Sum_probs=17.6

Q ss_pred             hhhHHHHHHHHHHHHHhhhhheec
Q psy14697         45 SGNKVVTAVAVGAAVVVGYCLYFD   68 (175)
Q Consensus        45 ~~~~~~~~agvaga~flGYcIYFD   68 (175)
                      +.+.+++..++...+++||+||.-
T Consensus         3 ~a~~~~i~i~~~lv~~Tgy~iYta   26 (43)
T PF02468_consen    3 TATVLAIFISCLLVSITGYAIYTA   26 (43)
T ss_pred             ceeeHHHHHHHHHHHHHhhhhhhe
Confidence            455666666666678999999954


No 30 
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=70.45  E-value=9.4  Score=26.35  Aligned_cols=33  Identities=18%  Similarity=0.187  Sum_probs=16.3

Q ss_pred             hHhHHhhcCChhhHHHHHHHHHHhcCChHHHHH
Q psy14697        119 LGETLLAAGDLDNGVEHLANALTVCGQPNQLLG  151 (175)
Q Consensus       119 lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~  151 (175)
                      +|+.+...|+.++|..++-.++..+|+.....+
T Consensus        82 ~~~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~  114 (119)
T TIGR02795        82 LGMSLQELGDKEKAKATLQQVIKRYPGSSAAKL  114 (119)
T ss_pred             HHHHHHHhCChHHHHHHHHHHHHHCcCChhHHH
Confidence            344444445555555555555555555444433


No 31 
>KOG4648|consensus
Probab=69.45  E-value=2.9  Score=39.55  Aligned_cols=30  Identities=20%  Similarity=0.461  Sum_probs=27.4

Q ss_pred             HhHhHHhhcCChhhHHHHHHHHHHhcC-ChH
Q psy14697        118 QLGETLLAAGDLDNGVEHLANALTVCG-QPN  147 (175)
Q Consensus       118 ~lGE~L~a~G~~~eaa~hf~nAl~Vcp-qP~  147 (175)
                      +.|.....||.++||+.|+.++|.++| +|.
T Consensus       102 E~GN~yFKQgKy~EAIDCYs~~ia~~P~NpV  132 (536)
T KOG4648|consen  102 ERGNTYFKQGKYEEAIDCYSTAIAVYPHNPV  132 (536)
T ss_pred             HhhhhhhhccchhHHHHHhhhhhccCCCCcc
Confidence            468899999999999999999999999 774


No 32 
>PF14559 TPR_19:  Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=69.28  E-value=6.4  Score=25.41  Aligned_cols=28  Identities=32%  Similarity=0.392  Sum_probs=20.8

Q ss_pred             HhhcCChhhHHHHHHHHHHhcCChHHHH
Q psy14697        123 LLAAGDLDNGVEHLANALTVCGQPNQLL  150 (175)
Q Consensus       123 L~a~G~~~eaa~hf~nAl~VcpqP~~LL  150 (175)
                      |+.+|++++|+.+|-+++...|+-.++.
T Consensus         1 ll~~~~~~~A~~~~~~~l~~~p~~~~~~   28 (68)
T PF14559_consen    1 LLKQGDYDEAIELLEKALQRNPDNPEAR   28 (68)
T ss_dssp             HHHTTHHHHHHHHHHHHHHHTTTSHHHH
T ss_pred             ChhccCHHHHHHHHHHHHHHCCCCHHHH
Confidence            4567888888888888888888655443


No 33 
>PF10516 SHNi-TPR:  SHNi-TPR;  InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B.  This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat []. 
Probab=68.54  E-value=6.4  Score=25.03  Aligned_cols=25  Identities=28%  Similarity=0.419  Sum_probs=22.1

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHH
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALT  141 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~  141 (175)
                      ..+||--+-.++++.|+..|-+||.
T Consensus         5 ~~Lgeisle~e~f~qA~~D~~~aL~   29 (38)
T PF10516_consen    5 DLLGEISLENENFEQAIEDYEKALE   29 (38)
T ss_pred             HHHHHHHHHhccHHHHHHHHHHHHH
Confidence            4689988888999999999999875


No 34 
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=66.63  E-value=4.1  Score=32.23  Aligned_cols=20  Identities=20%  Similarity=0.438  Sum_probs=13.8

Q ss_pred             hhHHHHHHHHHHH-HHhhhhh
Q psy14697         46 GNKVVTAVAVGAA-VVVGYCL   65 (175)
Q Consensus        46 ~~~~~~~agvaga-~flGYcI   65 (175)
                      ..+++++||++|. +++.|||
T Consensus        68 ~Ii~gv~aGvIg~Illi~y~i   88 (122)
T PF01102_consen   68 GIIFGVMAGVIGIILLISYCI   88 (122)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHH
T ss_pred             ehhHHHHHHHHHHHHHHHHHH
Confidence            4456677777764 6778988


No 35 
>COG4968 PilE Tfp pilus assembly protein PilE [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=66.08  E-value=15  Score=29.99  Aligned_cols=33  Identities=15%  Similarity=0.382  Sum_probs=21.0

Q ss_pred             hhhHHHhHHHHHHhHHHHHHhhcCCCCCCCChHHHHHHHHHH
Q psy14697         75 PLFKEKLKERRRKNRELLQQKNKRGIPDLKDHSAVQQYFLQE  116 (175)
Q Consensus        75 P~FRKkLrerRrk~~~~~~~a~~~~~p~~~d~e~~E~fFl~e  116 (175)
                      |.|+.-++|-||.+.++.=         ..+...+|+||..|
T Consensus        29 PSY~~yv~rs~R~~a~A~L---------~~~a~~~Er~y~~q   61 (139)
T COG4968          29 PSYQNYVLRSRRSAAKAAL---------LENAQFMERYYALQ   61 (139)
T ss_pred             HhHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHc
Confidence            8888888887777765421         11234578888544


No 36 
>KOG4234|consensus
Probab=65.37  E-value=13  Score=33.03  Aligned_cols=53  Identities=17%  Similarity=0.332  Sum_probs=39.9

Q ss_pred             CCCCCChHHHHHHHHHHH--------------------HhHhHHhhcCChhhHHHHHHHHHHhcCC-hHHHHHH
Q psy14697        100 IPDLKDHSAVQQYFLQEI--------------------QLGETLLAAGDLDNGVEHLANALTVCGQ-PNQLLGV  152 (175)
Q Consensus       100 ~p~~~d~e~~E~fFl~eV--------------------~lGE~L~a~G~~~eaa~hf~nAl~Vcpq-P~~LL~I  152 (175)
                      -|+|.+.|-.+.+.|.++                    ..|..|...|++++|...+..||..||. |+++-.|
T Consensus        62 se~~~~SEl~eEeLmae~E~i~~deek~k~~~kad~lK~EGN~~F~ngdyeeA~skY~~Ale~cp~~~~e~rsI  135 (271)
T KOG4234|consen   62 SEEPQHSELKEEELMAEIEKIFSDEEKDKAIEKADSLKKEGNELFKNGDYEEANSKYQEALESCPSTSTEERSI  135 (271)
T ss_pred             CCCccchhhhHHHHHHHHHHhcCcHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHhCccccHHHHHH
Confidence            367778777777777332                    2478888999999999999999999986 4444443


No 37 
>COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones]
Probab=64.07  E-value=30  Score=31.13  Aligned_cols=39  Identities=26%  Similarity=0.347  Sum_probs=34.0

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhh
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQ  155 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQq  155 (175)
                      ..||+..+..|+..+|+..+.||++.-|.-.+++.-|=.
T Consensus       160 ~~Lg~~ym~~~~~~~A~~AY~~A~rL~g~n~~~~~g~ae  198 (287)
T COG4235         160 DLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAE  198 (287)
T ss_pred             HHHHHHHHHhcchhHHHHHHHHHHHhCCCCHHHHHHHHH
Confidence            569999999999999999999999999987777766543


No 38 
>KOG0548|consensus
Probab=62.97  E-value=6.4  Score=38.23  Aligned_cols=33  Identities=12%  Similarity=0.345  Sum_probs=30.0

Q ss_pred             HHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697        113 FLQEIQLGETLLAAGDLDNGVEHLANALTVCGQ  145 (175)
Q Consensus       113 Fl~eV~lGE~L~a~G~~~eaa~hf~nAl~Vcpq  145 (175)
                      |.+..+.|-..++.|+++.|+.||.+||.+-|.
T Consensus         2 a~e~k~kgnaa~s~~d~~~ai~~~t~ai~l~p~   34 (539)
T KOG0548|consen    2 AVELKEKGNAAFSSGDFETAIRLFTEAIMLSPT   34 (539)
T ss_pred             hhHHHHHHHhhcccccHHHHHHHHHHHHccCCC
Confidence            456788899999999999999999999999988


No 39 
>PF13424 TPR_12:  Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=61.41  E-value=7.1  Score=26.06  Aligned_cols=33  Identities=24%  Similarity=0.290  Sum_probs=28.2

Q ss_pred             HHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhc
Q psy14697        111 QYFLQEIQLGETLLAAGDLDNGVEHLANALTVC  143 (175)
Q Consensus       111 ~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~Vc  143 (175)
                      ....--..+|+.+...|++++|..+|-+|+.++
T Consensus        44 ~~a~~~~~lg~~~~~~g~~~~A~~~~~~al~i~   76 (78)
T PF13424_consen   44 DTANTLNNLGECYYRLGDYEEALEYYQKALDIF   76 (78)
T ss_dssp             HHHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhh
Confidence            345566789999999999999999999999876


No 40 
>KOG1174|consensus
Probab=61.05  E-value=12  Score=36.26  Aligned_cols=39  Identities=21%  Similarity=0.232  Sum_probs=33.5

Q ss_pred             HHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHh
Q psy14697        116 EIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQ  154 (175)
Q Consensus       116 eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQ  154 (175)
                      ++.+|.-+.+++++.+|.+||+.||..=||-+.-+.=+|
T Consensus       474 H~~Lgd~~~A~Ne~Q~am~~y~~ALr~dP~~~~sl~Gl~  512 (564)
T KOG1174|consen  474 HNHLGDIMRAQNEPQKAMEYYYKALRQDPKSKRTLRGLR  512 (564)
T ss_pred             HHHHHHHHHHhhhHHHHHHHHHHHHhcCccchHHHHHHH
Confidence            478999999999999999999999999999887666554


No 41 
>PF08572 PRP3:  pre-mRNA processing factor 3 (PRP3);  InterPro: IPR013881  Pre-mRNA processing factor 3 (PRP3) is a U4/U6-associated splicing factor. The human PRP3 has been implicated in autosomal retinitis pigmentosa []. 
Probab=60.61  E-value=25  Score=30.23  Aligned_cols=44  Identities=16%  Similarity=0.325  Sum_probs=26.2

Q ss_pred             HHHhHHHHHHhHHHH--HHh--hcCCCCC----------------CCChHHHHHHHHHHHHhHh
Q psy14697         78 KEKLKERRRKNRELL--QQK--NKRGIPD----------------LKDHSAVQQYFLQEIQLGE  121 (175)
Q Consensus        78 RKkLrerRrk~~~~~--~~a--~~~~~p~----------------~~d~e~~E~fFl~eV~lGE  121 (175)
                      |||||++||+.+..+  ++.  ++..-|.                -.||.++|.-.-+||+.-.
T Consensus       146 rKKlRr~rR~e~~kEkq~kIrlGL~ppP~PKVKlSNLMrVL~~eAV~DPT~vE~~Vr~Q~eeR~  209 (223)
T PF08572_consen  146 RKKLRRQRRQEKQKEKQDKIRLGLEPPPPPKVKLSNLMRVLGNEAVQDPTKVEAKVRKQMEERQ  209 (223)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHcCCCCCCCCcccHHHHHHHhhcchhcCcHHHHHHHHHHHHHHH
Confidence            789998776654332  222  2222121                3678888888888877543


No 42 
>cd08323 CARD_APAF1 Caspase activation and recruitment domain similar to that found in Apoptotic Protease-Activating Factor 1. Caspase activation and recruitment domain (CARD) similar to that found in apoptotic protease-activating factor 1 (APAF-1), which is an activator of caspase-9. APAF-1 contains WD-40 repeats, a CARD, and an ATPase domain. Upon stimulation, APAF-1, together with caspase-9, forms the heptameric 'apoptosome', which leads to the processing and activation of caspase-9, starting a caspase cascade which leads to apoptosis. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and DED (Death Effect
Probab=60.48  E-value=20  Score=26.43  Aligned_cols=40  Identities=23%  Similarity=0.301  Sum_probs=29.9

Q ss_pred             HHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhhcCC
Q psy14697        116 EIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQTLP  158 (175)
Q Consensus       116 eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLP  158 (175)
                      -.++=..|...|+  .|..+|++||.=.++| .|-.+++..+|
T Consensus        47 a~~Lld~L~trG~--~Af~~F~~aL~~~~~~-~La~lL~~~~~   86 (86)
T cd08323          47 AVMLINMILTKDN--HAYVSFYNALLHEGYK-DLALLLHDAIP   86 (86)
T ss_pred             HHHHHHHHHhcCH--HHHHHHHHHHHhcCCh-HHHHHHhccCC
Confidence            3445566677887  4899999999965555 78888888776


No 43 
>PF12868 DUF3824:  Domain of unknwon function (DUF3824);  InterPro: IPR024436 This repeating domain is proline-rich but its function is unknown.
Probab=59.75  E-value=2.9  Score=33.62  Aligned_cols=18  Identities=17%  Similarity=0.040  Sum_probs=8.1

Q ss_pred             hhHHHHHHHHHHHHHhhh
Q psy14697         46 GNKVVTAVAVGAAVVVGY   63 (175)
Q Consensus        46 ~~~~~~~agvaga~flGY   63 (175)
                      +.+.++++|+||+++.+.
T Consensus         7 R~la~~aLaAAG~G~AA~   24 (137)
T PF12868_consen    7 RDLAEAALAAAGAGYAAH   24 (137)
T ss_pred             HHHhHHHHHHHHHHHHHH
Confidence            344444445555444443


No 44 
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=59.38  E-value=27  Score=27.11  Aligned_cols=36  Identities=19%  Similarity=0.260  Sum_probs=23.1

Q ss_pred             HHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHH
Q psy14697        116 EIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLG  151 (175)
Q Consensus       116 eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~  151 (175)
                      ...+|..+...|++++|..++-+|+...|+-...+.
T Consensus        75 ~~~la~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~  110 (172)
T PRK02603         75 LYNMGIIYASNGEHDKALEYYHQALELNPKQPSALN  110 (172)
T ss_pred             HHHHHHHHHHcCCHHHHHHHHHHHHHhCcccHHHHH
Confidence            345566666667777777777777777665444433


No 45 
>KOG3119|consensus
Probab=59.11  E-value=5.2  Score=34.97  Aligned_cols=28  Identities=21%  Similarity=0.502  Sum_probs=23.1

Q ss_pred             heeccCCCCChhhHHHhHHHHHHhHHHH
Q psy14697         65 LYFDKKRRSDPLFKEKLKERRRKNRELL   92 (175)
Q Consensus        65 IYFD~KRR~dP~FRKkLrerRrk~~~~~   92 (175)
                      -||+++|||+..-||...++|+++.+.+
T Consensus       194 ~y~err~rNN~A~~kSR~~~k~~~~e~~  221 (269)
T KOG3119|consen  194 EYKERRRRNNEAVRKSRDKRKQKEDEMA  221 (269)
T ss_pred             HHHHHHHhhhHHHHHhhhhHHHHHHHHH
Confidence            5999999999999999887777766544


No 46 
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=59.02  E-value=8.1  Score=36.76  Aligned_cols=41  Identities=20%  Similarity=0.370  Sum_probs=33.4

Q ss_pred             HHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChH
Q psy14697        107 SAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPN  147 (175)
Q Consensus       107 e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~  147 (175)
                      ++...+....+.+|-.|...|++++|+.+|-+||..-|+..
T Consensus        69 ~~dP~~a~a~~NLG~AL~~lGryeEAIa~f~rALeL~Pd~a  109 (453)
T PLN03098         69 EADVKTAEDAVNLGLSLFSKGRVKDALAQFETALELNPNPD  109 (453)
T ss_pred             cCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCch
Confidence            34445555668889999999999999999999999988865


No 47 
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=58.85  E-value=20  Score=27.51  Aligned_cols=38  Identities=21%  Similarity=0.195  Sum_probs=29.6

Q ss_pred             CCCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697        102 DLKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQP  146 (175)
Q Consensus       102 ~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP  146 (175)
                      +|.|++..       ..+|..+...|++++|..+|-.||.+.|+=
T Consensus        88 ~p~~~~a~-------~~lg~~l~~~g~~~eAi~~~~~Al~~~p~~  125 (144)
T PRK15359         88 DASHPEPV-------YQTGVCLKMMGEPGLAREAFQTAIKMSYAD  125 (144)
T ss_pred             CCCCcHHH-------HHHHHHHHHcCCHHHHHHHHHHHHHhCCCC
Confidence            45665554       467888888899999999999999988863


No 48 
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=58.69  E-value=10  Score=23.16  Aligned_cols=30  Identities=20%  Similarity=0.276  Sum_probs=25.2

Q ss_pred             HHHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697        116 EIQLGETLLAAGDLDNGVEHLANALTVCGQ  145 (175)
Q Consensus       116 eV~lGE~L~a~G~~~eaa~hf~nAl~Vcpq  145 (175)
                      ...+|..+...|++++|..+|.+++..+|+
T Consensus        71 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~  100 (100)
T cd00189          71 YYNLGLAYYKLGKYEEALEAYEKALELDPN  100 (100)
T ss_pred             HHHHHHHHHHHHhHHHHHHHHHHHHccCCC
Confidence            356778888889999999999999988874


No 49 
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=57.64  E-value=47  Score=34.01  Aligned_cols=48  Identities=10%  Similarity=0.079  Sum_probs=35.9

Q ss_pred             CCCChHHHHHHHH---------HHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHH
Q psy14697        102 DLKDHSAVQQYFL---------QEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQL  149 (175)
Q Consensus       102 ~~~d~e~~E~fFl---------~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~L  149 (175)
                      .|.+++..+....         +.++++..+...|++++|+.+|-+++.--|.+.++
T Consensus        92 ~P~~~~~~~~~~~~~~~~~~~~~~l~~A~ll~~~g~~~eA~~~~~~~l~~~p~~~~l  148 (1157)
T PRK11447         92 APDSNAYRSSRTTMLLSTPEGRQALQQARLLATTGRTEEALASYDKLFNGAPPELDL  148 (1157)
T ss_pred             CCCChHHHHHHHHHHhcCCchhhHHHHHHHHHhCCCHHHHHHHHHHHccCCCCChHH
Confidence            4666665443322         34888999999999999999999999987766554


No 50 
>KOG0624|consensus
Probab=57.40  E-value=20  Score=34.10  Aligned_cols=47  Identities=28%  Similarity=0.593  Sum_probs=39.0

Q ss_pred             CCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhh
Q psy14697        103 LKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQ  155 (175)
Q Consensus       103 ~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQq  155 (175)
                      ..+++++|+    +.++|.+|+++|.+.+|+.||-.||.  ++|..-+.||++
T Consensus        32 ~~~~advek----hlElGk~lla~~Q~sDALt~yHaAve--~dp~~Y~aifrR   78 (504)
T KOG0624|consen   32 TASPADVEK----HLELGKELLARGQLSDALTHYHAAVE--GDPNNYQAIFRR   78 (504)
T ss_pred             cCCHHHHHH----HHHHHHHHHHhhhHHHHHHHHHHHHc--CCchhHHHHHHH
Confidence            346667765    78999999999999999999999865  678888888876


No 51 
>PF07721 TPR_4:  Tetratricopeptide repeat;  InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=56.93  E-value=14  Score=20.82  Aligned_cols=21  Identities=38%  Similarity=0.547  Sum_probs=17.5

Q ss_pred             HHHhHhHHhhcCChhhHHHHH
Q psy14697        116 EIQLGETLLAAGDLDNGVEHL  136 (175)
Q Consensus       116 eV~lGE~L~a~G~~~eaa~hf  136 (175)
                      .+.+|+.+..+|+.++|..++
T Consensus         4 ~~~la~~~~~~G~~~eA~~~l   24 (26)
T PF07721_consen    4 RLALARALLAQGDPDEAERLL   24 (26)
T ss_pred             HHHHHHHHHHcCCHHHHHHHH
Confidence            357899999999999988765


No 52 
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=56.23  E-value=16  Score=25.18  Aligned_cols=30  Identities=17%  Similarity=0.271  Sum_probs=27.4

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQP  146 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP  146 (175)
                      ..+|..+...|++++|..+|-.++..+|+.
T Consensus        43 ~~l~~~~~~~~~~~~A~~~~~~~~~~~p~~   72 (119)
T TIGR02795        43 YWLGEAYYAQGKYADAAKAFLAVVKKYPKS   72 (119)
T ss_pred             HHHHHHHHhhccHHHHHHHHHHHHHHCCCC
Confidence            558999999999999999999999999884


No 53 
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=55.04  E-value=29  Score=30.02  Aligned_cols=56  Identities=18%  Similarity=0.320  Sum_probs=39.4

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCC----hHHHHH---HHhh-c---CCHHHHHHHHHhcCCC
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQ----PNQLLG---VLQQ-T---LPPNVFNALIEKLPPA  172 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcpq----P~~LL~---IyQq-T---LPp~Vf~~Li~kLp~~  172 (175)
                      ..+|+.+...|++++|+.+|.+++.-||+    |.-|+.   +|++ -   --..+|+-+++..|..
T Consensus       184 y~LG~~y~~~g~~~~A~~~f~~vv~~yP~s~~~~dAl~klg~~~~~~g~~~~A~~~~~~vi~~yP~s  250 (263)
T PRK10803        184 YWLGQLNYNKGKKDDAAYYFASVVKNYPKSPKAADAMFKVGVIMQDKGDTAKAKAVYQQVIKKYPGT  250 (263)
T ss_pred             HHHHHHHHHcCCHHHHHHHHHHHHHHCCCCcchhHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCC
Confidence            57899999999999999999999999997    555543   3321 1   1124566677776653


No 54 
>PF13414 TPR_11:  TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=54.42  E-value=15  Score=23.75  Aligned_cols=28  Identities=25%  Similarity=0.458  Sum_probs=24.1

Q ss_pred             HHhHhHHhhcC-ChhhHHHHHHHHHHhcC
Q psy14697        117 IQLGETLLAAG-DLDNGVEHLANALTVCG  144 (175)
Q Consensus       117 V~lGE~L~a~G-~~~eaa~hf~nAl~Vcp  144 (175)
                      ..+|..+...| ++++|..+|-+||++-|
T Consensus        41 ~~~g~~~~~~~~~~~~A~~~~~~al~l~P   69 (69)
T PF13414_consen   41 YNLGLAYMKLGKDYEEAIEDFEKALKLDP   69 (69)
T ss_dssp             HHHHHHHHHTTTHHHHHHHHHHHHHHHST
T ss_pred             HHHHHHHHHhCccHHHHHHHHHHHHHcCc
Confidence            56788888899 69999999999998765


No 55 
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=53.87  E-value=29  Score=24.98  Aligned_cols=38  Identities=24%  Similarity=0.476  Sum_probs=31.2

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHh
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQ  154 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQ  154 (175)
                      ..+|..+...|++++|...|-.++.+.|+........+
T Consensus        89 ~~la~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~  126 (135)
T TIGR02552        89 FHAAECLLALGEPESALKALDLAIEICGENPEYSELKE  126 (135)
T ss_pred             HHHHHHHHHcCCHHHHHHHHHHHHHhccccchHHHHHH
Confidence            44777888889999999999999999998877655544


No 56 
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=53.87  E-value=32  Score=28.28  Aligned_cols=36  Identities=19%  Similarity=0.365  Sum_probs=31.0

Q ss_pred             CCCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcC
Q psy14697        102 DLKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCG  144 (175)
Q Consensus       102 ~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~Vcp  144 (175)
                      +|+||+..+       .+|+-++.-|+.++|...|..||..|.
T Consensus        99 ~~ddp~~~~-------~ag~c~L~lG~~~~A~~aF~~Ai~~~~  134 (157)
T PRK15363         99 KIDAPQAPW-------AAAECYLACDNVCYAIKALKAVVRICG  134 (157)
T ss_pred             CCCCchHHH-------HHHHHHHHcCCHHHHHHHHHHHHHHhc
Confidence            467777664       489999999999999999999999994


No 57 
>PF13424 TPR_12:  Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=53.47  E-value=17  Score=24.18  Aligned_cols=25  Identities=20%  Similarity=0.364  Sum_probs=17.6

Q ss_pred             HhHhHHhhcCChhhHHHHHHHHHHh
Q psy14697        118 QLGETLLAAGDLDNGVEHLANALTV  142 (175)
Q Consensus       118 ~lGE~L~a~G~~~eaa~hf~nAl~V  142 (175)
                      .+|..+...|++++|..+|-+|+.+
T Consensus        10 ~la~~~~~~~~~~~A~~~~~~al~~   34 (78)
T PF13424_consen   10 NLARVYRELGRYDEALDYYEKALDI   34 (78)
T ss_dssp             HHHHHHHHTT-HHHHHHHHHHHHHH
T ss_pred             HHHHHHHHcCCHHHHHHHHHHHHHH
Confidence            4677777777777788877777765


No 58 
>PF11118 DUF2627:  Protein of unknown function (DUF2627);  InterPro: IPR020138 This entry represents uncharacterised membrane proteins with no known function.
Probab=51.70  E-value=17  Score=26.94  Aligned_cols=36  Identities=19%  Similarity=0.205  Sum_probs=21.9

Q ss_pred             hhhhHHHHHHHHHHHHHh-hhhheeccCCCC-ChhhHH
Q psy14697         44 ISGNKVVTAVAVGAAVVV-GYCLYFDKKRRS-DPLFKE   79 (175)
Q Consensus        44 ~~~~~~~~~agvaga~fl-GYcIYFD~KRR~-dP~FRK   79 (175)
                      |..-++++..-++|.+|+ ||-.|=|+||.. .|-|||
T Consensus        40 wlqfl~G~~lf~~G~~Fi~GfI~~RDRKrnkV~pRF~k   77 (77)
T PF11118_consen   40 WLQFLAGLLLFAIGVGFIAGFILHRDRKRNKVQPRFRK   77 (77)
T ss_pred             HHHHHHHHHHHHHHHHHHHhHhheeeccccccchhhcC
Confidence            333445555555555444 688999999964 455553


No 59 
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=51.41  E-value=15  Score=32.63  Aligned_cols=54  Identities=22%  Similarity=0.355  Sum_probs=38.2

Q ss_pred             hHhHHhhcCChhhHHHHHHHHHHhcCCh----HHHHHHH-------hhcCCHHHHHHHHHhcCCC
Q psy14697        119 LGETLLAAGDLDNGVEHLANALTVCGQP----NQLLGVL-------QQTLPPNVFNALIEKLPPA  172 (175)
Q Consensus       119 lGE~L~a~G~~~eaa~hf~nAl~VcpqP----~~LL~Iy-------QqTLPp~Vf~~Li~kLp~~  172 (175)
                      |||.+.++|++++|+.-|.++++=||+-    .-||++=       |+..--.+|+-|+.+.|.+
T Consensus       184 LGe~~y~qg~y~~Aa~~f~~~~k~~P~s~KApdallKlg~~~~~l~~~d~A~atl~qv~k~YP~t  248 (262)
T COG1729         184 LGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLGVSLGRLGNTDEACATLQQVIKRYPGT  248 (262)
T ss_pred             HHHHHHhcccchHHHHHHHHHHHhCCCCCCChHHHHHHHHHHHHhcCHHHHHHHHHHHHHHCCCC
Confidence            8999999999999999999999999974    4444321       1223334555666666543


No 60 
>PF06522 B12D:  NADH-ubiquinone reductase complex 1 MLRQ subunit;  InterPro: IPR010530 The MLRQ subunit of mitochondrial NADH-ubiquinone reductase complex I is nuclear [] and is found in plants [], insects, fungi and higher metazoans []. It appears to act within the membrane and, in mammals, is highly expressed in muscle and neural tissue, indicative of a role in ATP generation [].
Probab=50.77  E-value=17  Score=25.82  Aligned_cols=16  Identities=38%  Similarity=0.480  Sum_probs=9.0

Q ss_pred             heeccCCCCChhhHHHhH
Q psy14697         65 LYFDKKRRSDPLFKEKLK   82 (175)
Q Consensus        65 IYFD~KRR~dP~FRKkLr   82 (175)
                      |.+|+++|.+  --.+++
T Consensus        34 V~~~k~~~~~--pw~~~~   49 (73)
T PF06522_consen   34 VRWNKKNRPE--PWEKYK   49 (73)
T ss_pred             eEEEecCCcC--hhhhcC
Confidence            3578888743  334444


No 61 
>PRK12370 invasion protein regulator; Provisional
Probab=50.28  E-value=37  Score=31.81  Aligned_cols=29  Identities=21%  Similarity=0.366  Sum_probs=15.8

Q ss_pred             HhHhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697        118 QLGETLLAAGDLDNGVEHLANALTVCGQP  146 (175)
Q Consensus       118 ~lGE~L~a~G~~~eaa~hf~nAl~VcpqP  146 (175)
                      .+|..+...|++++|..+|-+|+..-|.+
T Consensus       377 ~lg~~l~~~G~~~eAi~~~~~Al~l~P~~  405 (553)
T PRK12370        377 YYGWNLFMAGQLEEALQTINECLKLDPTR  405 (553)
T ss_pred             HHHHHHHHCCCHHHHHHHHHHHHhcCCCC
Confidence            34555555555555555555555555553


No 62 
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=49.82  E-value=31  Score=26.50  Aligned_cols=29  Identities=14%  Similarity=0.239  Sum_probs=17.5

Q ss_pred             hHhHHhhcCChhhHHHHHHHHHHhcCChH
Q psy14697        119 LGETLLAAGDLDNGVEHLANALTVCGQPN  147 (175)
Q Consensus       119 lGE~L~a~G~~~eaa~hf~nAl~VcpqP~  147 (175)
                      +|-.+...|++++|..+|.+|+..-++|.
T Consensus        41 ~g~~~~~~g~~~~A~~~~~~al~l~~~~~   69 (168)
T CHL00033         41 DGMSAQSEGEYAEALQNYYEAMRLEIDPY   69 (168)
T ss_pred             HHHHHHHcCCHHHHHHHHHHHHhccccch
Confidence            45555556666666666666666655543


No 63 
>PF06365 CD34_antigen:  CD34/Podocalyxin family;  InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=49.70  E-value=10  Score=32.47  Aligned_cols=19  Identities=21%  Similarity=-0.029  Sum_probs=15.5

Q ss_pred             HHHHHhhhhheeccCCCCC
Q psy14697         56 GAAVVVGYCLYFDKKRRSD   74 (175)
Q Consensus        56 aga~flGYcIYFD~KRR~d   74 (175)
                      +..++++|.+||=++||+-
T Consensus       112 lLla~~~~~~Y~~~~Rrs~  130 (202)
T PF06365_consen  112 LLLAILLGAGYCCHQRRSW  130 (202)
T ss_pred             HHHHHHHHHHHHhhhhccC
Confidence            5567788999999999984


No 64 
>PF00731 AIRC:  AIR carboxylase;  InterPro: IPR000031 Phosphoribosylaminoimidazole carboxylase is a fusion protein in plants and fungi, but consists of two non-interacting proteins in bacteria, PurK and PurE. PurK, N5-carboxyaminoimidazole ribonucleotide (N5_CAIR) synthetase, catalyzes the conversion of 5-aminoimidazole ribonucleotide (AIR), ATP, and bicarbonate to N5-CAIR, ADP, and Pi. PurE converts N5-CAIR to CAIR, the sixth step of de novo purine biosynthesis. In the presence of high concentrations of bicarbonate, PurE is reported able to convert AIR to CAIR directly and without ATP. Some members of this family contain two copies of this domain []. The crystal structure of PurE indicates a unique quaternary structure that confirms the octameric nature of the enzyme [].; GO: 0004638 phosphoribosylaminoimidazole carboxylase activity, 0006189 'de novo' IMP biosynthetic process; PDB: 3TRH_O 2YWX_A 2NSL_A 1D7A_A 2NSJ_A 1QCZ_A 2ATE_A 2NSH_A 3RG8_C 3RGG_D ....
Probab=48.56  E-value=7.8  Score=31.55  Aligned_cols=20  Identities=40%  Similarity=0.594  Sum_probs=15.7

Q ss_pred             CCCChhhHHHhHHHHHHhHH
Q psy14697         71 RRSDPLFKEKLKERRRKNRE   90 (175)
Q Consensus        71 RR~dP~FRKkLrerRrk~~~   90 (175)
                      +.+||+.++||++.|+++++
T Consensus       129 a~~d~~l~~kl~~~~~~~~~  148 (150)
T PF00731_consen  129 ALKDPELREKLRAYREKMKE  148 (150)
T ss_dssp             HTT-HHHHHHHHHHHHHHHH
T ss_pred             hcCCHHHHHHHHHHHHHHHc
Confidence            46899999999998877653


No 65 
>PF05942 PaREP1:  Archaeal PaREP1/PaREP8 family;  InterPro: IPR010268 This family consists of several archaeal PaREP1 proteins, the function of the family is unknown.; PDB: 2Q00_A 2JPU_A.
Probab=48.39  E-value=26  Score=26.39  Aligned_cols=28  Identities=29%  Similarity=0.437  Sum_probs=22.8

Q ss_pred             HHHHHhHhHHhhcCChhhHHHHHHHHHH
Q psy14697        114 LQEIQLGETLLAAGDLDNGVEHLANALT  141 (175)
Q Consensus       114 l~eV~lGE~L~a~G~~~eaa~hf~nAl~  141 (175)
                      ..++.+++.++..|+..+|+++.++|..
T Consensus         7 ~~~l~~A~e~L~~G~~rqAaeK~~~A~~   34 (115)
T PF05942_consen    7 EKELEEADELLEKGDLRQAAEKAFKAVE   34 (115)
T ss_dssp             HHHHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence            4567889999999999999998888776


No 66 
>PF08066 PMC2NT:  PMC2NT (NUC016) domain;  InterPro: IPR012588  Exosomes are nano-compartments that function in the degradation or processing of RNA (including mRNA, rRNA, snRNA and snoRNA) [, ]. Exosomes occur in both archaea and eukaryotes, and have a similar overall structure to each other and to bacterial/organelle PNPases (polynucleotide phosphorylases; 2.7.7.8 from EC) [], consisting of a barrel structure composed of a hexameric ring of PH domains that act as a degradation chamber, and an S1-domain/KH-domain containing cap that binds the RNA substrate (and sometimes accessory proteins) in order to regulate and restrict entry into the degradation chamber []. There are two types of exosomes in eukaryotes, cytoplasmic exosomes that are responsible for 3'-5' exoribonuclease degradation of mRNAs, and nuclear exosomes that degrade pre-mRNAs (such as nonsense transcripts) and degrade rRNAs, snRNAs and snoRNAs. Unstructured RNA substrates feed in through the pore made by the S1 domains, are degraded by the PH domain ring, and exit as nucleotides via the PH pore at the opposite end of the barrel [, ].  There are several accessory proteins that help degrade, unwind or polyadenylate RNA substrate before they enter the exosome. This entry represents the N-terminal domain of Rrp6 (exosome component 10 in humans), a nuclear exosome accessory factor that interacts with the bottom of the hexameric PH-ring opposite the cap. Rrp6 functions as a hydrolytic exonuclease, and is homologous to RNase-D in Escherichia coli. More information about these proteins can be found at Protein of the Month: RNA Exosomes [].; GO: 0006396 RNA processing, 0000176 nuclear exosome (RNase complex)
Probab=48.32  E-value=34  Score=24.97  Aligned_cols=58  Identities=26%  Similarity=0.381  Sum_probs=34.2

Q ss_pred             eccCCCCChhhHHHhHHHHHHhHHH-----HHHhhcCCCCCCCChHHHHHHHHHHHHhHhHHh
Q psy14697         67 FDKKRRSDPLFKEKLKERRRKNREL-----LQQKNKRGIPDLKDHSAVQQYFLQEIQLGETLL  124 (175)
Q Consensus        67 FD~KRR~dP~FRKkLrerRrk~~~~-----~~~a~~~~~p~~~d~e~~E~fFl~eV~lGE~L~  124 (175)
                      |||-|--||+|++.+.+..++-...     +........+.+.|.+.++..|-.=+..=+.|+
T Consensus        16 ~~Fy~s~dp~f~~~ld~~s~rll~l~n~ll~~~~~~~~~~~~~~~~d~d~~~~~vvd~~D~Ll   78 (91)
T PF08066_consen   16 YDFYRSFDPEFAESLDEQSQRLLSLINSLLKSAGSKSNISSPDDVDDVDERWDSVVDVNDSLL   78 (91)
T ss_pred             HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCCccccHHHHHHHHHHHHHHHH
Confidence            5677888999999999876554322     112233344455566777776644333333333


No 67 
>KOG4555|consensus
Probab=47.36  E-value=19  Score=30.13  Aligned_cols=29  Identities=31%  Similarity=0.455  Sum_probs=22.8

Q ss_pred             HHhhcCChhhHHHHHHHHHHhcCChHHHH
Q psy14697        122 TLLAAGDLDNGVEHLANALTVCGQPNQLL  150 (175)
Q Consensus       122 ~L~a~G~~~eaa~hf~nAl~VcpqP~~LL  150 (175)
                      .|.-.|+.++|.+.|.+||.|||+-..-.
T Consensus        52 alaE~g~Ld~AlE~F~qal~l~P~raSay   80 (175)
T KOG4555|consen   52 ALAEAGDLDGALELFGQALCLAPERASAY   80 (175)
T ss_pred             HHHhccchHHHHHHHHHHHHhcccchHhh
Confidence            34446899999999999999998755443


No 68 
>KOG0553|consensus
Probab=46.86  E-value=27  Score=31.80  Aligned_cols=37  Identities=24%  Similarity=0.303  Sum_probs=33.7

Q ss_pred             HHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChH
Q psy14697        111 QYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPN  147 (175)
Q Consensus       111 ~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~  147 (175)
                      .|+-.-..+|=.++++|++++|++.|-|||-+.|.=.
T Consensus       147 ~yskay~RLG~A~~~~gk~~~A~~aykKaLeldP~Ne  183 (304)
T KOG0553|consen  147 HYSKAYGRLGLAYLALGKYEEAIEAYKKALELDPDNE  183 (304)
T ss_pred             HHHHHHHHHHHHHHccCcHHHHHHHHHhhhccCCCcH
Confidence            6777778999999999999999999999999999865


No 69 
>PRK11234 nfrB bacteriophage N4 adsorption protein B; Provisional
Probab=46.82  E-value=42  Score=33.51  Aligned_cols=57  Identities=14%  Similarity=0.242  Sum_probs=37.5

Q ss_pred             ChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHH---hhcCCHHHHH
Q psy14697        105 DHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVL---QQTLPPNVFN  163 (175)
Q Consensus       105 d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~Iy---QqTLPp~Vf~  163 (175)
                      +.+-.++|+..|.++||.|+..|...+++  |.+||.-+..-.+-|+-|   +..+.++..+
T Consensus       639 ~~~~~~~~~~~~~~lGe~L~~~g~i~~~~--l~~aL~~~~~~~~~lG~~Lv~~g~it~~~l~  698 (727)
T PRK11234        639 AGEIWRQYVPHQFLFAEILTTLGHIPRSA--INVLLLRHERSSLPLGKFLVTEGVISQETLD  698 (727)
T ss_pred             chhhhhhhhhhhhhHHHHHHHcCCCCHHH--HHHHHHHhcccCCcHHHHHHHcCCCCHHHHH
Confidence            33456899999999999999999876443  567777665544444433   2344444433


No 70 
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=45.92  E-value=27  Score=32.76  Aligned_cols=37  Identities=19%  Similarity=0.363  Sum_probs=31.6

Q ss_pred             HHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHH
Q psy14697        116 EIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGV  152 (175)
Q Consensus       116 eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~I  152 (175)
                      .+.+|+.+...|++++|+..|-+|+.+.++-.++++.
T Consensus       545 ~~~la~~~~~~g~~~eAi~~~e~A~~l~~~~~e~~~a  581 (615)
T TIGR00990       545 VATMAQLLLQQGDVDEALKLFERAAELARTEGELVQA  581 (615)
T ss_pred             HHHHHHHHHHccCHHHHHHHHHHHHHHhccHHHHHHH
Confidence            3578999999999999999999999999887776443


No 71 
>PRK09720 cybC cytochrome b562; Provisional
Probab=45.03  E-value=88  Score=24.17  Aligned_cols=60  Identities=12%  Similarity=0.179  Sum_probs=36.8

Q ss_pred             CChhhHHHhHHHHHHhHHHHHHhhcCCC-CCCCChHHHH------HHHHHHHHhHhHHhhcCChhhHH
Q psy14697         73 SDPLFKEKLKERRRKNRELLQQKNKRGI-PDLKDHSAVQ------QYFLQEIQLGETLLAAGDLDNGV  133 (175)
Q Consensus        73 ~dP~FRKkLrerRrk~~~~~~~a~~~~~-p~~~d~e~~E------~fFl~eV~lGE~L~a~G~~~eaa  133 (175)
                      +..+|++.|-+=|.....+++. -.+++ --+.|.+++.      .-+..||.....|..+|+++||-
T Consensus        16 ~a~~~k~al~~mr~aA~~Ak~~-~P~kL~~~~~ds~e~K~y~~Gld~lI~qID~A~~La~~GkL~eAK   82 (100)
T PRK09720         16 NAAQVKDALTKMRAAALDAQKA-TPPKLEDKSPDSPEMKDFRHGFDILVGQIDDALKLANEGKVKEAQ   82 (100)
T ss_pred             CHHHHHHHHHHHHHHHHHHhhc-CChhhccCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHH
Confidence            4457888887766554443321 11111 0123444453      45899999999999999988773


No 72 
>PF03704 BTAD:  Bacterial transcriptional activator domain;  InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=44.82  E-value=32  Score=25.64  Aligned_cols=32  Identities=25%  Similarity=0.356  Sum_probs=17.1

Q ss_pred             HHHHHHhHhHHhhcCChhhHHHHHHHHHHhcC
Q psy14697        113 FLQEIQLGETLLAAGDLDNGVEHLANALTVCG  144 (175)
Q Consensus       113 Fl~eV~lGE~L~a~G~~~eaa~hf~nAl~Vcp  144 (175)
                      |.+.+..|+.....|+.+.+..++.+|+..|.
T Consensus         6 F~~~~~~a~~~~~~~~~~~~~~~~~~al~ly~   37 (146)
T PF03704_consen    6 FEALVREARAAARAGDPEEAIELLEEALALYR   37 (146)
T ss_dssp             HHHHHHHHHHHHHTT-HHHHHHHHHHHHTT--
T ss_pred             HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHhC
Confidence            44455556665555666666666666666654


No 73 
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=43.68  E-value=1.9e+02  Score=24.39  Aligned_cols=42  Identities=10%  Similarity=-0.011  Sum_probs=33.7

Q ss_pred             CCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697        103 LKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQ  148 (175)
Q Consensus       103 ~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~  148 (175)
                      |.++...+.    +..+|..+...+++++|+.+|-+.|..+|+-.+
T Consensus        63 P~s~~a~~a----~l~la~ayy~~~~y~~A~~~~e~fi~~~P~~~~  104 (243)
T PRK10866         63 PFGPYSQQV----QLDLIYAYYKNADLPLAQAAIDRFIRLNPTHPN  104 (243)
T ss_pred             CCChHHHHH----HHHHHHHHHhcCCHHHHHHHHHHHHHhCcCCCc
Confidence            344444443    678999999999999999999999999998644


No 74 
>PF12895 Apc3:  Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=43.23  E-value=37  Score=23.08  Aligned_cols=27  Identities=22%  Similarity=0.262  Sum_probs=20.9

Q ss_pred             HHHHHHhHhHHhhcCChhhHHHHHHHH
Q psy14697        113 FLQEIQLGETLLAAGDLDNGVEHLANA  139 (175)
Q Consensus       113 Fl~eV~lGE~L~a~G~~~eaa~hf~nA  139 (175)
                      +.....+|..+...|++++|+.||-+|
T Consensus        58 ~~~~~l~a~~~~~l~~y~eAi~~l~~~   84 (84)
T PF12895_consen   58 PDIHYLLARCLLKLGKYEEAIKALEKA   84 (84)
T ss_dssp             HHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHhCCHHHHHHHHhcC
Confidence            444456799999999999999999876


No 75 
>PLN02539 glucose-6-phosphate 1-dehydrogenase
Probab=41.67  E-value=73  Score=30.59  Aligned_cols=115  Identities=16%  Similarity=0.340  Sum_probs=61.7

Q ss_pred             ccCCCCccchhhhhhhhHHHHHHHHHHHHHh---hh-hheeccCCCCChhhHHHhHHHHHHhHHHHHHhhcCCCCCCCCh
Q psy14697         31 IGPSGRLMSPMELISGNKVVTAVAVGAAVVV---GY-CLYFDKKRRSDPLFKEKLKERRRKNRELLQQKNKRGIPDLKDH  106 (175)
Q Consensus        31 ~~~~~~~~~~~~m~~~~~~~~~agvaga~fl---GY-cIYFD~KRR~dP~FRKkLrerRrk~~~~~~~a~~~~~p~~~d~  106 (175)
                      .|.||+|-      .|.++-+.........+   .. .|-+.+..-+|.+||+++++.-+...         +    .+.
T Consensus        23 FGAtGDLa------~RKL~PaL~~L~~~~~lpp~~~~IiG~aR~~~s~e~fr~~v~~~l~~~~---------~----~~~   83 (491)
T PLN02539         23 LGASGDLA------KKKTFPALFNLYRQGFLPPDEVHIFGYARSKITDEELRDRIRGYLKDEK---------N----APA   83 (491)
T ss_pred             eCCccHHH------HhhHHHHHHHHHHcCCCCCCCcEEEEEECCCCCHHHHHHHHHHHHHhhc---------c----ccH
Confidence            47899987      23333333333333344   22 34578888899999999987543211         0    122


Q ss_pred             HHHHHHHHHHHHhHhHHhhcCChh--hHHHHHHHHHHhcC-------ChHHHHHHHhhcCCHHHHHHHHHhcCCC
Q psy14697        107 SAVQQYFLQEIQLGETLLAAGDLD--NGVEHLANALTVCG-------QPNQLLGVLQQTLPPNVFNALIEKLPPA  172 (175)
Q Consensus       107 e~~E~fFl~eV~lGE~L~a~G~~~--eaa~hf~nAl~Vcp-------qP~~LL~IyQqTLPp~Vf~~Li~kLp~~  172 (175)
                      +..++| ++.+.     -.+|+++  +.-.++.++|.=..       .+..  .||==.+||.+|..+++.|...
T Consensus        84 ~~~~~F-~~~~~-----Y~~~d~~~~e~y~~L~~~l~~~~~~~~~~~~~~~--rifYLA~PP~~f~~i~~~L~~~  150 (491)
T PLN02539         84 EAVSKF-LQLIK-----YVSGAYDSEEGFRRLDKEISEHEISKNSAEGSSR--RLFYLALPPSVYPPVCKMIKKC  150 (491)
T ss_pred             HHHHHH-HhhCe-----EEecCCCChHHHHHHHHHHHHHhhhccccCCCCc--eEEEEecChHHHHHHHHHHHHh
Confidence            334443 33221     1245543  44556666664110       0011  1344479999999998877544


No 76 
>PRK09400 secE preprotein translocase subunit SecE; Reviewed
Probab=41.21  E-value=30  Score=24.28  Aligned_cols=23  Identities=17%  Similarity=0.268  Sum_probs=17.3

Q ss_pred             hhhhHHHHHHHHHHHHHhhhhhe
Q psy14697         44 ISGNKVVTAVAVGAAVVVGYCLY   66 (175)
Q Consensus        44 ~~~~~~~~~agvaga~flGYcIY   66 (175)
                      ..+.+..+++|.+..+++||.||
T Consensus        32 f~~ia~~~~iG~~i~G~iGf~Ik   54 (61)
T PRK09400         32 FLLVAKVTGLGILLIGLIGFIIY   54 (61)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHH
Confidence            34455667778888899999987


No 77 
>KOG0543|consensus
Probab=40.96  E-value=82  Score=29.72  Aligned_cols=41  Identities=24%  Similarity=0.343  Sum_probs=34.7

Q ss_pred             CCCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHH
Q psy14697        102 DLKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQL  149 (175)
Q Consensus       102 ~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~L  149 (175)
                      +|++..++       -..|+.++..|+++.|..-|-+|+++.|+-.+.
T Consensus       287 ~~~N~KAL-------yRrG~A~l~~~e~~~A~~df~ka~k~~P~Nka~  327 (397)
T KOG0543|consen  287 DPNNVKAL-------YRRGQALLALGEYDLARDDFQKALKLEPSNKAA  327 (397)
T ss_pred             CCCchhHH-------HHHHHHHHhhccHHHHHHHHHHHHHhCCCcHHH
Confidence            56776666       357999999999999999999999999987543


No 78 
>smart00386 HAT HAT (Half-A-TPR) repeats. Present in several RNA-binding proteins. Structurally and sequentially thought to be similar to TPRs.
Probab=40.79  E-value=51  Score=17.64  Aligned_cols=27  Identities=19%  Similarity=0.196  Sum_probs=19.7

Q ss_pred             CChhhHHHHHHHHHHhcCChHHHHHHH
Q psy14697        127 GDLDNGVEHLANALTVCGQPNQLLGVL  153 (175)
Q Consensus       127 G~~~eaa~hf~nAl~VcpqP~~LL~Iy  153 (175)
                      |+.+.+..-|.+|+..||+..++-.-|
T Consensus         1 ~~~~~~r~i~e~~l~~~~~~~~~W~~y   27 (33)
T smart00386        1 GDIERARKIYERALEKFPKSVELWLKY   27 (33)
T ss_pred             CcHHHHHHHHHHHHHHCCCChHHHHHH
Confidence            456677788899999998776665443


No 79 
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=40.23  E-value=53  Score=25.14  Aligned_cols=30  Identities=17%  Similarity=0.251  Sum_probs=19.1

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQP  146 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP  146 (175)
                      ..+|..+...|++++|..+|-+|+..-|+-
T Consensus        62 ~~lg~~~~~~g~~~~A~~~y~~Al~l~p~~   91 (144)
T PRK15359         62 IALAGTWMMLKEYTTAINFYGHALMLDASH   91 (144)
T ss_pred             HHHHHHHHHHhhHHHHHHHHHHHHhcCCCC
Confidence            446666666666666666666666665544


No 80 
>PLN03088 SGT1,  suppressor of G2 allele of SKP1; Provisional
Probab=39.86  E-value=49  Score=29.44  Aligned_cols=33  Identities=9%  Similarity=0.145  Sum_probs=25.9

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHH
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQL  149 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~L  149 (175)
                      ..+|..+...|++++|+..+-+||.+.|+-.+.
T Consensus        40 ~~~a~~~~~~g~~~eAl~~~~~Al~l~P~~~~a   72 (356)
T PLN03088         40 ADRAQANIKLGNFTEAVADANKAIELDPSLAKA   72 (356)
T ss_pred             HHHHHHHHHcCCHHHHHHHHHHHHHhCcCCHHH
Confidence            567888888888888888888998888864443


No 81 
>PF12301 CD99L2:  CD99 antigen like protein 2;  InterPro: IPR022078  This family of proteins is found in eukaryotes. Proteins in this family are typically between 165 and 237 amino acids in length. CD99L2 and CD99 are involved in trans-endothelial migration of neutrophils in vitro and in the recruitment of neutrophils into inflamed peritoneum. 
Probab=39.57  E-value=16  Score=30.35  Aligned_cols=32  Identities=22%  Similarity=0.125  Sum_probs=18.4

Q ss_pred             hhhhhhhhHHHHHHHHHHHHHhhhhheeccCC
Q psy14697         40 PMELISGNKVVTAVAVGAAVVVGYCLYFDKKR   71 (175)
Q Consensus        40 ~~~m~~~~~~~~~agvaga~flGYcIYFD~KR   71 (175)
                      -+++.++++.+|+.+++.|++-+-.=||=||+
T Consensus       109 ~~~~~~g~IaGIvsav~valvGAvsSyiaYqk  140 (169)
T PF12301_consen  109 DGEAEAGTIAGIVSAVVVALVGAVSSYIAYQK  140 (169)
T ss_pred             ccCcccchhhhHHHHHHHHHHHHHHHHHHHHh
Confidence            34556777777776666654444444555543


No 82 
>TIGR00327 secE_euk_arch protein translocase SEC61 complex gamma subunit, archaeal and eukaryotic. This model describes archaeal SEC61-like and eukaryotic SEC61 but not bacterial secE proteins, for which a Pfam pfam00584 (SecE) has been created.
Probab=39.42  E-value=33  Score=24.19  Aligned_cols=24  Identities=17%  Similarity=0.246  Sum_probs=17.8

Q ss_pred             hhhhHHHHHHHHHHHHHhhhhhee
Q psy14697         44 ISGNKVVTAVAVGAAVVVGYCLYF   67 (175)
Q Consensus        44 ~~~~~~~~~agvaga~flGYcIYF   67 (175)
                      ..+.+..+++|.+..+++||.||.
T Consensus        28 f~~iak~t~iG~~i~G~IGf~Ikl   51 (61)
T TIGR00327        28 YLKVAKVTGIGIIIVGIIGYIIKI   51 (61)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHH
Confidence            345556677788888999999873


No 83 
>PF09976 TPR_21:  Tetratricopeptide repeat;  InterPro: IPR018704  This domain, found in various hypothetical prokaryotic proteins, has no known function. 
Probab=39.26  E-value=25  Score=26.56  Aligned_cols=23  Identities=39%  Similarity=0.564  Sum_probs=21.3

Q ss_pred             HhHhHHhhcCChhhHHHHHHHHH
Q psy14697        118 QLGETLLAAGDLDNGVEHLANAL  140 (175)
Q Consensus       118 ~lGE~L~a~G~~~eaa~hf~nAl  140 (175)
                      -+|+-+.++|+.++|..+|-+||
T Consensus       123 ~~Gdi~~~~g~~~~A~~~y~~Al  145 (145)
T PF09976_consen  123 LLGDIYLAQGDYDEARAAYQKAL  145 (145)
T ss_pred             HHHHHHHHCCCHHHHHHHHHHhC
Confidence            59999999999999999998886


No 84 
>KOG4516|consensus
Probab=39.17  E-value=9.4  Score=30.25  Aligned_cols=51  Identities=35%  Similarity=0.463  Sum_probs=27.1

Q ss_pred             ccccccccccccccccccCCCCccchhhhhhhhHHHHHHHHHHHHHhhhhheeccCC
Q psy14697         15 CWFGDWCGYFGLWSGRIGPSGRLMSPMELISGNKVVTAVAVGAAVVVGYCLYFDKKR   71 (175)
Q Consensus        15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~~~agvaga~flGYcIYFD~KR   71 (175)
                      .|----||..|+-|+-+-.-|+=---..-+.+.++.+++++++    ||  |||.||
T Consensus        27 ~~~~~a~g~~g~~s~~~~N~~~rkP~~~gi~~~ll~i~a~~~A----Gy--y~~~~r   77 (118)
T KOG4516|consen   27 IWNPIACGVAGVGSAIFINWGFRKPVFSGIQKHLLFIAAGVGA----GY--YFDQKR   77 (118)
T ss_pred             CcchhhHhhhHHHHHHHHhhhhcCchHHHHHHHHHHHHHHHhh----hH--HHHHHH
Confidence            4555558888876664432222111111244666666666654    66  677665


No 85 
>PF03605 DcuA_DcuB:  Anaerobic c4-dicarboxylate membrane transporter;  InterPro: IPR004668 These proteins are members of the C4-Dicarboxylate Uptake (Dcu) family. Most proteins in this family are predicted to have 12 GES predicted transmembrane regions; however the one member whose membrane topology has been experimentally determined has 10 transmembrane regions, with both the N- and C-termini localized to the periplasm []. The DcuA and DcuB proteins are involved in the transport of aspartate, malate, fumarate and succinate in many species [, , ], and are thought to function as antiporters with any two of these substrates. Since DcuA is encoded in an operon with the gene for aspartase, and DcuB is encoded in an operon with the gene for fumarase, their physiological functions may be to catalyze aspartate:fumarate and fumarate:malate exchange during the anaerobic utilization of aspartate and fumarate, respectively []. The Escherichia coli DcuA and DcuB proteins have very different expression patterns []. DcuA is constitutively expressed; DcuB is strongly induced anaerobically by FNR and C4-dicarboxylates, while it is repressed by nitrate and subject to CRP-mediated catabolite repression.; GO: 0015556 C4-dicarboxylate transmembrane transporter activity, 0015740 C4-dicarboxylate transport, 0016021 integral to membrane
Probab=39.05  E-value=41  Score=31.32  Aligned_cols=22  Identities=23%  Similarity=0.320  Sum_probs=17.8

Q ss_pred             hheeccCCCCChhhHHHhHHHH
Q psy14697         64 CLYFDKKRRSDPLFKEKLKERR   85 (175)
Q Consensus        64 cIYFD~KRR~dP~FRKkLrerR   85 (175)
                      +-..+...-.||+|++||++-.
T Consensus       184 ~~~~GkeL~~Dp~yq~rl~~g~  205 (364)
T PF03605_consen  184 SSRRGKELDDDPEYQERLADGL  205 (364)
T ss_pred             HHhcCCccccCHHHHHHHhccc
Confidence            4467888899999999998643


No 86 
>COG5343 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=38.87  E-value=32  Score=30.31  Aligned_cols=24  Identities=29%  Similarity=0.443  Sum_probs=19.1

Q ss_pred             HHHHHHHHHhhhhheeccCCCCCh
Q psy14697         52 AVAVGAAVVVGYCLYFDKKRRSDP   75 (175)
Q Consensus        52 ~agvaga~flGYcIYFD~KRR~dP   75 (175)
                      +.++++++.++|++-|+.+|+..|
T Consensus       108 ataavAvlal~~a~~~s~~~~~~P  131 (240)
T COG5343         108 ATAAVAVLALALALTFSLRRGTTP  131 (240)
T ss_pred             HHHHHHHHHHHHccccccccCCCc
Confidence            333344578899999999999999


No 87 
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=38.52  E-value=67  Score=30.99  Aligned_cols=33  Identities=12%  Similarity=0.097  Sum_probs=27.2

Q ss_pred             HHHhHhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697        116 EIQLGETLLAAGDLDNGVEHLANALTVCGQPNQ  148 (175)
Q Consensus       116 eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~  148 (175)
                      ...+|+.+...|++++|+.+|-+|+...|+..+
T Consensus       113 ~~~la~~l~~~g~~~~Ai~~l~~Al~l~P~~~~  145 (656)
T PRK15174        113 VLLVASVLLKSKQYATVADLAEQAWLAFSGNSQ  145 (656)
T ss_pred             HHHHHHHHHHcCCHHHHHHHHHHHHHhCCCcHH
Confidence            356788889999999999999999998777544


No 88 
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=37.85  E-value=65  Score=24.95  Aligned_cols=27  Identities=7%  Similarity=0.051  Sum_probs=12.5

Q ss_pred             HhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697        120 GETLLAAGDLDNGVEHLANALTVCGQP  146 (175)
Q Consensus       120 GE~L~a~G~~~eaa~hf~nAl~VcpqP  146 (175)
                      |..+...|+...+..++..|+..+.+-
T Consensus       113 g~~~~~~g~~~~a~~~~~~A~~~~~~A  139 (172)
T PRK02603        113 AVIYHKRGEKAEEAGDQDEAEALFDKA  139 (172)
T ss_pred             HHHHHHcCChHhHhhCHHHHHHHHHHH
Confidence            444444444444444455555544433


No 89 
>PTZ00459 mucin-associated surface protein (MASP); Provisional
Probab=37.60  E-value=19  Score=31.87  Aligned_cols=10  Identities=10%  Similarity=0.291  Sum_probs=6.7

Q ss_pred             hhhhhhhhHH
Q psy14697         40 PMELISGNKV   49 (175)
Q Consensus        40 ~~~m~~~~~~   49 (175)
                      |+|||++|.|
T Consensus         1 MaMmMTGRVL   10 (291)
T PTZ00459          1 MAMMMTGRVL   10 (291)
T ss_pred             CccchhchHH
Confidence            4566787766


No 90 
>TIGR01495 ETRAMP Plasmodium ring stage membrane protein ETRAMP. These genes have been shown to be found in the sub-telomeric regions of both P. falciparum and P. yoelii chromosomes.
Probab=37.56  E-value=36  Score=25.33  Aligned_cols=16  Identities=38%  Similarity=0.684  Sum_probs=9.0

Q ss_pred             HHhhhhheeccCCCCC
Q psy14697         59 VVVGYCLYFDKKRRSD   74 (175)
Q Consensus        59 ~flGYcIYFD~KRR~d   74 (175)
                      ..+|.-+|+-.|.+.|
T Consensus        67 t~~G~g~y~~~kk~~~   82 (85)
T TIGR01495        67 AGVGLGYYYKKKNKDD   82 (85)
T ss_pred             HHHHHhhhhhcccccc
Confidence            4556666765555443


No 91 
>KOG1174|consensus
Probab=37.15  E-value=54  Score=31.87  Aligned_cols=74  Identities=18%  Similarity=0.187  Sum_probs=47.2

Q ss_pred             HHHHHHhhhhheeccCCCCChhhHHHhHHHHHHhHHHHHHhhcCCCCCCCChHHHHHHHHHHHHhHhHHhhcCChhhHHH
Q psy14697         55 VGAAVVVGYCLYFDKKRRSDPLFKEKLKERRRKNRELLQQKNKRGIPDLKDHSAVQQYFLQEIQLGETLLAAGDLDNGVE  134 (175)
Q Consensus        55 vaga~flGYcIYFD~KRR~dP~FRKkLrerRrk~~~~~~~a~~~~~p~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~  134 (175)
                      ++.--||.-|+-||.||     |++-|-=-.|.           -.-++.+   .|.+    |-.|-.|++.|.+++|+.
T Consensus       299 ta~~wfV~~~~l~~~K~-----~~rAL~~~eK~-----------I~~~~r~---~~al----ilKG~lL~~~~R~~~A~I  355 (564)
T KOG1174|consen  299 TASHWFVHAQLLYDEKK-----FERALNFVEKC-----------IDSEPRN---HEAL----ILKGRLLIALERHTQAVI  355 (564)
T ss_pred             chhhhhhhhhhhhhhhh-----HHHHHHHHHHH-----------hccCccc---chHH----HhccHHHHhccchHHHHH
Confidence            44456888899999875     44444311111           1112333   3444    457999999999999999


Q ss_pred             HHHHHHHhcCChHHHHHHHh
Q psy14697        135 HLANALTVCGQPNQLLGVLQ  154 (175)
Q Consensus       135 hf~nAl~VcpqP~~LL~IyQ  154 (175)
                      ||--|+.+-|--   |.+||
T Consensus       356 aFR~Aq~Lap~r---L~~Y~  372 (564)
T KOG1174|consen  356 AFRTAQMLAPYR---LEIYR  372 (564)
T ss_pred             HHHHHHhcchhh---HHHHH
Confidence            999999886543   34554


No 92 
>PF09976 TPR_21:  Tetratricopeptide repeat;  InterPro: IPR018704  This domain, found in various hypothetical prokaryotic proteins, has no known function. 
Probab=36.82  E-value=63  Score=24.34  Aligned_cols=33  Identities=30%  Similarity=0.467  Sum_probs=27.1

Q ss_pred             HHHHHhHhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697        114 LQEIQLGETLLAAGDLDNGVEHLANALTVCGQP  146 (175)
Q Consensus       114 l~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP  146 (175)
                      +-.+.+++.+...|++++|...|-.++.-.+.|
T Consensus        49 ~A~l~lA~~~~~~g~~~~A~~~l~~~~~~~~d~   81 (145)
T PF09976_consen   49 LAALQLAKAAYEQGDYDEAKAALEKALANAPDP   81 (145)
T ss_pred             HHHHHHHHHHHHCCCHHHHHHHHHHHHhhCCCH
Confidence            345678888999999999999999988877665


No 93 
>PF15147 DUF4578:  Domain of unknown function (DUF4578)
Probab=36.59  E-value=13  Score=29.68  Aligned_cols=11  Identities=64%  Similarity=1.534  Sum_probs=7.9

Q ss_pred             CCCcccccCcc
Q psy14697          4 CGSSWVLPSSF   14 (175)
Q Consensus         4 ~~~~~~~~~~~   14 (175)
                      ||-||--||.|
T Consensus         7 CggSWScPstf   17 (127)
T PF15147_consen    7 CGGSWSCPSTF   17 (127)
T ss_pred             cCCCcCCchhH
Confidence            77777777765


No 94 
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=36.58  E-value=80  Score=23.42  Aligned_cols=29  Identities=21%  Similarity=0.269  Sum_probs=19.9

Q ss_pred             HhHhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697        118 QLGETLLAAGDLDNGVEHLANALTVCGQP  146 (175)
Q Consensus       118 ~lGE~L~a~G~~~eaa~hf~nAl~VcpqP  146 (175)
                      .+|..+...|++++|...|-+|+...|+.
T Consensus        70 ~la~~~~~~~~~~~A~~~~~~al~~~~~~   98 (234)
T TIGR02521        70 ALALYYQQLGELEKAEDSFRRALTLNPNN   98 (234)
T ss_pred             HHHHHHHHcCCHHHHHHHHHHHHhhCCCC
Confidence            45666667777777777777777766654


No 95 
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=35.69  E-value=70  Score=27.17  Aligned_cols=39  Identities=13%  Similarity=0.082  Sum_probs=27.4

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhh
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQ  155 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQq  155 (175)
                      ..+|+.+..+|++++|...|.+++...|+-..+...+..
T Consensus       286 ~~la~~~~~~g~~~~A~~~l~~~l~~~P~~~~~~~l~~~  324 (389)
T PRK11788        286 LALAQLLEEQEGPEAAQALLREQLRRHPSLRGFHRLLDY  324 (389)
T ss_pred             HHHHHHHHHhCCHHHHHHHHHHHHHhCcCHHHHHHHHHH
Confidence            445777777788888888888888887777666555543


No 96 
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=35.38  E-value=80  Score=23.41  Aligned_cols=29  Identities=34%  Similarity=0.398  Sum_probs=20.6

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQ  145 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcpq  145 (175)
                      ..+|..+...|++++|...|-+++...|+
T Consensus        35 ~~la~~~~~~~~~~~A~~~~~~~l~~~p~   63 (234)
T TIGR02521        35 VQLALGYLEQGDLEVAKENLDKALEHDPD   63 (234)
T ss_pred             HHHHHHHHHCCCHHHHHHHHHHHHHhCcc
Confidence            34677777777777777777777776654


No 97 
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=34.41  E-value=93  Score=22.29  Aligned_cols=29  Identities=10%  Similarity=0.084  Sum_probs=17.3

Q ss_pred             HhHhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697        118 QLGETLLAAGDLDNGVEHLANALTVCGQP  146 (175)
Q Consensus       118 ~lGE~L~a~G~~~eaa~hf~nAl~VcpqP  146 (175)
                      .+|..+...|++++|+..|-.++...|..
T Consensus        56 ~la~~~~~~~~~~~A~~~~~~~~~~~p~~   84 (135)
T TIGR02552        56 GLAACCQMLKEYEEAIDAYALAAALDPDD   84 (135)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhcCCCC
Confidence            45566666666666666666666665543


No 98 
>PRK12489 anaerobic C4-dicarboxylate transporter; Reviewed
Probab=34.38  E-value=47  Score=31.41  Aligned_cols=22  Identities=36%  Similarity=0.575  Sum_probs=19.1

Q ss_pred             hhheeccCCCCChhhHHHhHHH
Q psy14697         63 YCLYFDKKRRSDPLFKEKLKER   84 (175)
Q Consensus        63 YcIYFD~KRR~dP~FRKkLrer   84 (175)
                      |.-.+.+|-+.||+|.+++++.
T Consensus       191 ~~~~~gk~l~~Dp~~q~r~~~~  212 (443)
T PRK12489        191 WSLRRGKDLDKDPEFQERLKDP  212 (443)
T ss_pred             HHHHhcCccccCHHHHHHhhcc
Confidence            5666899999999999999875


No 99 
>KOG1125|consensus
Probab=34.06  E-value=69  Score=31.61  Aligned_cols=59  Identities=29%  Similarity=0.361  Sum_probs=43.8

Q ss_pred             CCCCCCChHHHHHHHHHHH-----------H--hHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhhcC
Q psy14697         99 GIPDLKDHSAVQQYFLQEI-----------Q--LGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQTL  157 (175)
Q Consensus        99 ~~p~~~d~e~~E~fFl~eV-----------~--lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQqTL  157 (175)
                      ..++..+...++++|++=+           |  ||=..-..|+++.|+.||--||.|-|+=.-|-+-|=-||
T Consensus       403 s~~~~~~l~~i~~~fLeaa~~~~~~~DpdvQ~~LGVLy~ls~efdraiDcf~~AL~v~Pnd~~lWNRLGAtL  474 (579)
T KOG1125|consen  403 SFLDSSHLAHIQELFLEAARQLPTKIDPDVQSGLGVLYNLSGEFDRAVDCFEAALQVKPNDYLLWNRLGATL  474 (579)
T ss_pred             CCCCHHHHHHHHHHHHHHHHhCCCCCChhHHhhhHHHHhcchHHHHHHHHHHHHHhcCCchHHHHHHhhHHh
Confidence            3566777789999999853           3  333333468899999999999999998877766655554


No 100
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=34.02  E-value=1.2e+02  Score=24.16  Aligned_cols=29  Identities=7%  Similarity=0.159  Sum_probs=26.4

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQ  145 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcpq  145 (175)
                      ..+|+.+..+|++++|...|.+++.-||+
T Consensus       170 ~~~a~~~~~~g~~~~A~~~~~~al~~~p~  198 (235)
T TIGR03302       170 LYVARFYLKRGAYVAAINRFETVVENYPD  198 (235)
T ss_pred             HHHHHHHHHcCChHHHHHHHHHHHHHCCC
Confidence            46788889999999999999999999986


No 101
>PF06305 DUF1049:  Protein of unknown function (DUF1049);  InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=33.44  E-value=44  Score=22.39  Aligned_cols=7  Identities=0%  Similarity=0.453  Sum_probs=2.8

Q ss_pred             HHHhHHH
Q psy14697         78 KEKLKER   84 (175)
Q Consensus        78 RKkLrer   84 (175)
                      |+++|+.
T Consensus        47 r~~~~~~   53 (68)
T PF06305_consen   47 RRRIRRL   53 (68)
T ss_pred             HHHHHHH
Confidence            3444433


No 102
>PF13371 TPR_9:  Tetratricopeptide repeat
Probab=33.27  E-value=54  Score=21.24  Aligned_cols=31  Identities=23%  Similarity=0.291  Sum_probs=23.4

Q ss_pred             HhHHhhcCChhhHHHHHHHHHHhcCChHHHH
Q psy14697        120 GETLLAAGDLDNGVEHLANALTVCGQPNQLL  150 (175)
Q Consensus       120 GE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL  150 (175)
                      .+.++.++++++|..++-+++..-|+...+.
T Consensus         2 ~~~~~~~~~~~~A~~~~~~~l~~~p~~~~~~   32 (73)
T PF13371_consen    2 KQIYLQQEDYEEALEVLERALELDPDDPELW   32 (73)
T ss_pred             HHHHHhCCCHHHHHHHHHHHHHhCcccchhh
Confidence            3556778888899999999988877755443


No 103
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=32.87  E-value=1e+02  Score=26.03  Aligned_cols=33  Identities=9%  Similarity=0.008  Sum_probs=29.0

Q ss_pred             HhHhHHhhcCChhhHHHHHHHHHHhcCChHHHH
Q psy14697        118 QLGETLLAAGDLDNGVEHLANALTVCGQPNQLL  150 (175)
Q Consensus       118 ~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL  150 (175)
                      ..|+.+...|++++|+..|-..+..+|.+...-
T Consensus        37 ~~A~~~~~~g~y~~Ai~~f~~l~~~yP~s~~a~   69 (243)
T PRK10866         37 ATAQQKLQDGNWKQAITQLEALDNRYPFGPYSQ   69 (243)
T ss_pred             HHHHHHHHCCCHHHHHHHHHHHHHhCCCChHHH
Confidence            568888899999999999999999999986554


No 104
>PF11712 Vma12:  Endoplasmic reticulum-based factor for assembly of V-ATPase;  InterPro: IPR021013 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include:   F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP.   V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins [].  The yeast vacuolar proton-translocating ATPase (V-ATPase) is the best characterised member of the V-ATPase family. A total of thirteen genes are required for encoding the subunits of the enzyme complex itself and an additional three for providing factors necessary for the assembly of the whole. Vma12 is one of these latter, all three of which are localised to the endoplasmic reticulum []. 
Probab=32.73  E-value=2e+02  Score=22.48  Aligned_cols=28  Identities=29%  Similarity=0.483  Sum_probs=18.9

Q ss_pred             heeccC---CCCChhhHHHhHHHHHHhHHHH
Q psy14697         65 LYFDKK---RRSDPLFKEKLKERRRKNRELL   92 (175)
Q Consensus        65 IYFD~K---RR~dP~FRKkLrerRrk~~~~~   92 (175)
                      |||.-+   -|.+|||++++.+=|+++++.+
T Consensus         9 v~~p~~~~~~~~s~E~~a~le~Lr~~~ee~e   39 (142)
T PF11712_consen    9 VYFPPPPPKPRPSPELKARLERLRAEQEERE   39 (142)
T ss_pred             EEeCCCCCCCCCCHHHHHHHHHHHHHHHHHH
Confidence            455543   3449999999988776666544


No 105
>COG2443 Sss1 Preprotein translocase subunit Sss1 [Intracellular trafficking and secretion]
Probab=32.59  E-value=48  Score=23.83  Aligned_cols=22  Identities=18%  Similarity=0.386  Sum_probs=15.8

Q ss_pred             hhhHHHHHHHHHHHHHhhhhhe
Q psy14697         45 SGNKVVTAVAVGAAVVVGYCLY   66 (175)
Q Consensus        45 ~~~~~~~~agvaga~flGYcIY   66 (175)
                      +..+...++|....+++||.||
T Consensus        34 ~~~aKi~~~Gi~liG~IGfiI~   55 (65)
T COG2443          34 SKIAKITGLGILLIGIIGFIIY   55 (65)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            3444556677777789999997


No 106
>PRK11189 lipoprotein NlpI; Provisional
Probab=32.51  E-value=1e+02  Score=26.37  Aligned_cols=28  Identities=29%  Similarity=0.298  Sum_probs=24.5

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcC
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCG  144 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~Vcp  144 (175)
                      .-+|..+...|++++|+.+|-+|+...|
T Consensus       240 ~~Lg~~~~~~g~~~~A~~~~~~Al~~~~  267 (296)
T PRK11189        240 FYLAKYYLSLGDLDEAAALFKLALANNV  267 (296)
T ss_pred             HHHHHHHHHCCCHHHHHHHHHHHHHhCC
Confidence            4568888999999999999999998775


No 107
>PF13838 Clathrin_H_link:  Clathrin-H-link; PDB: 2XZG_A 3GD1_I 1BPO_C 1C9I_B 1C9L_A.
Probab=32.22  E-value=33  Score=24.51  Aligned_cols=27  Identities=19%  Similarity=0.480  Sum_probs=19.3

Q ss_pred             HHHHHHHHHhHhHHhhcCChhhHHHHHHHH
Q psy14697        110 QQYFLQEIQLGETLLAAGDLDNGVEHLANA  139 (175)
Q Consensus       110 E~fFl~eV~lGE~L~a~G~~~eaa~hf~nA  139 (175)
                      +..|.++.   ++|+++|++++||.+=+|+
T Consensus         6 d~l~~~~F---~~l~~~g~y~eAA~~AA~s   32 (66)
T PF13838_consen    6 DDLYVQQF---NELFSQGQYEEAAKVAANS   32 (66)
T ss_dssp             HHHHHHHH---HHHHHTT-HHHHHHHHHHS
T ss_pred             HHHHHHHH---HHHHHcCCHHHHHHHHHhC
Confidence            34455444   6789999999999987776


No 108
>PF06024 DUF912:  Nucleopolyhedrovirus protein of unknown function (DUF912);  InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=32.06  E-value=54  Score=24.61  Aligned_cols=17  Identities=18%  Similarity=0.542  Sum_probs=9.9

Q ss_pred             HHHHHHHHHHhhhhhee
Q psy14697         51 TAVAVGAAVVVGYCLYF   67 (175)
Q Consensus        51 ~~agvaga~flGYcIYF   67 (175)
                      +.++++..+++=|+|||
T Consensus        67 ~lls~v~IlVily~IyY   83 (101)
T PF06024_consen   67 SLLSFVCILVILYAIYY   83 (101)
T ss_pred             HHHHHHHHHHHHhhheE
Confidence            33344444556699986


No 109
>PF08359 TetR_C_4:  YsiA-like protein, C-terminal region;  InterPro: IPR013570 The members of this family are thought to be TetR-type (tetracycline resistance) transcriptional regulators that bear particular similarity to YsiA (P94548 from SWISSPROT). This entry represents the C-terminal domain.; PDB: 1VI0_B.
Probab=31.82  E-value=1.8e+02  Score=21.26  Aligned_cols=73  Identities=16%  Similarity=0.206  Sum_probs=41.3

Q ss_pred             eeccCCCCChhhHHHhHHHHHHhHHH-----HHHhhcCCCCCCCChHHHHHHHHHHHHh-HhHHhh-cCC--hhhHHHHH
Q psy14697         66 YFDKKRRSDPLFKEKLKERRRKNREL-----LQQKNKRGIPDLKDHSAVQQYFLQEIQL-GETLLA-AGD--LDNGVEHL  136 (175)
Q Consensus        66 YFD~KRR~dP~FRKkLrerRrk~~~~-----~~~a~~~~~p~~~d~e~~E~fFl~eV~l-GE~L~a-~G~--~~eaa~hf  136 (175)
                      +..--|.+++++|+++++.+++..+.     ++..+...+....|++..-.+|.--+.. =..+.. .++  .++-++.+
T Consensus        47 ~~~e~~~~~~~~~~~~~~~~~~~~~~i~~iI~eG~~~Gefr~d~d~~~~~~~i~G~i~~~v~~w~~~~~~~~l~~~~~~~  126 (133)
T PF08359_consen   47 LSLELRQSNEELRKKINEIRREYLRIIEEIIEEGQEQGEFRKDLDPELAARFIFGMINHTVLDWVLNDGKFDLEDIAEEL  126 (133)
T ss_dssp             HHCTTS-SSHHHHHHHHHHHHHHHHHHHHHHHHHHHTTSB-TT--HHHHHHHHHHHHHHHHHHHHHTTT-S-CGGGHHHH
T ss_pred             hHHHHhhCCHHHHHHHHHHHHHHHHHHHHHHHHHHHcCccCCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHH
Confidence            34567888999999999876655432     2223344555557888888888877763 233443 343  34445544


Q ss_pred             HH
Q psy14697        137 AN  138 (175)
Q Consensus       137 ~n  138 (175)
                      .+
T Consensus       127 ~~  128 (133)
T PF08359_consen  127 MD  128 (133)
T ss_dssp             HH
T ss_pred             HH
Confidence            43


No 110
>KOG0547|consensus
Probab=31.51  E-value=70  Score=31.57  Aligned_cols=41  Identities=15%  Similarity=0.332  Sum_probs=35.7

Q ss_pred             hHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCC-h
Q psy14697        106 HSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQ-P  146 (175)
Q Consensus       106 ~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~Vcpq-P  146 (175)
                      +|++.+|=.+==..|.....+|+|++|+.....||..||. |
T Consensus       108 ~e~~~k~A~~lK~~GN~~f~~kkY~eAIkyY~~AI~l~p~ep  149 (606)
T KOG0547|consen  108 KEERLKYAAALKTKGNKFFRNKKYDEAIKYYTQAIELCPDEP  149 (606)
T ss_pred             hHHHHHHHHHHHhhhhhhhhcccHHHHHHHHHHHHhcCCCCc
Confidence            3566777777778899999999999999999999999999 6


No 111
>PF11846 DUF3366:  Domain of unknown function (DUF3366);  InterPro: IPR021797  This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 200 amino acids in length. 
Probab=31.39  E-value=2.7e+02  Score=22.13  Aligned_cols=27  Identities=22%  Similarity=0.211  Sum_probs=22.6

Q ss_pred             hHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697        119 LGETLLAAGDLDNGVEHLANALTVCGQ  145 (175)
Q Consensus       119 lGE~L~a~G~~~eaa~hf~nAl~Vcpq  145 (175)
                      +...+...|+.++|......|...||+
T Consensus       150 ~a~~l~~~G~~~eA~~~~~~~~~lyP~  176 (193)
T PF11846_consen  150 YALALALLGDPEEARQWLARARRLYPA  176 (193)
T ss_pred             HHHHHHHcCCHHHHHHHHHHHHHhCCc
Confidence            466677789999999999999999993


No 112
>PF12487 DUF3703:  Protein of unknown function (DUF3703) ;  InterPro: IPR022172  This family of proteins is found in bacteria. Proteins in this family are typically between 113 and 135 amino acids in length. 
Probab=31.31  E-value=88  Score=24.52  Aligned_cols=41  Identities=34%  Similarity=0.506  Sum_probs=33.7

Q ss_pred             HHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697        107 SAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQ  148 (175)
Q Consensus       107 e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~  148 (175)
                      +++..+|..|++......++|++..|-.|+-+|=.+ +||.-
T Consensus         3 ~~~~~~~~~el~~a~~a~~~~~~~~aw~hLErAHIl-gQ~~~   43 (112)
T PF12487_consen    3 KALRPAYDAELEAAREALAAGDLQQAWRHLERAHIL-GQPYP   43 (112)
T ss_pred             HHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHhhHHh-cCCch
Confidence            467889999999999999999999999999998544 44433


No 113
>KOG2769|consensus
Probab=31.29  E-value=1.2e+02  Score=29.70  Aligned_cols=44  Identities=16%  Similarity=0.283  Sum_probs=26.4

Q ss_pred             HHHhHHHHHHhHHHHH--Hh--hcCCCC----------------CCCChHHHHHHHHHHHHhHh
Q psy14697         78 KEKLKERRRKNRELLQ--QK--NKRGIP----------------DLKDHSAVQQYFLQEIQLGE  121 (175)
Q Consensus        78 RKkLrerRrk~~~~~~--~a--~~~~~p----------------~~~d~e~~E~fFl~eV~lGE  121 (175)
                      |||||++||+.+..++  +-  ++..-|                -..||..+|+-.-.||..=+
T Consensus       293 rKKLRRQ~R~ea~KEkqekIrLGL~~ppePKVKiSNLMrVLgsEAiqDPTK~E~~VR~Q~aeR~  356 (522)
T KOG2769|consen  293 RKKLRRQRRKEARKEKQEKIRLGLEPPPEPKVKLSNLMRVLGSEAIQDPTKLEAEVRDQMAERQ  356 (522)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcchHHHHHHHhhhccccCcHHHHHHHHHHHHHHH
Confidence            7899987776654321  21  222212                13788888888877776544


No 114
>PF06703 SPC25:  Microsomal signal peptidase 25 kDa subunit (SPC25);  InterPro: IPR009582 This family consists of several microsomal signal peptidase 25 kDa subunit proteins. Translocation of polypeptide chains across the endoplasmic reticulum (ER) membrane is triggered by signal sequences. Subsequently, signal recognition particle interacts with its membrane receptor and the ribosome-bound nascent chain is targeted to the ER where it is transferred into a protein-conducting channel. At some point, a second signal sequence recognition event takes place in the membrane and translocation of the nascent chain through the membrane occurs. The signal sequence of most secretory and membrane proteins is cleaved off at this stage. Cleavage occurs by the signal peptidase complex (SPC) as soon as the lumenal domain of the translocating polypeptide is large enough to expose its cleavage site to the enzyme. The signal peptidase complex is possibly also involved in proteolytic events in the ER membrane other than the processing of the signal sequence, for example the further digestion of the cleaved signal peptide or the degradation of membrane proteins. Mammalian signal peptidase is as a complex of five different polypeptide chains [].; GO: 0008233 peptidase activity, 0006465 signal peptide processing, 0005787 signal peptidase complex, 0016021 integral to membrane
Probab=31.07  E-value=34  Score=27.11  Aligned_cols=21  Identities=24%  Similarity=0.558  Sum_probs=15.2

Q ss_pred             HHHHHHHHHHhhhhheeccCC
Q psy14697         51 TAVAVGAAVVVGYCLYFDKKR   71 (175)
Q Consensus        51 ~~agvaga~flGYcIYFD~KR   71 (175)
                      ++.|.++.++.|+|.|||||-
T Consensus        30 L~lg~~a~~iA~~a~~~d~~~   50 (162)
T PF06703_consen   30 LALGYLAVIIAGFAFFYDYKY   50 (162)
T ss_pred             HHHHHHHHHHHHHHHHhhhcC
Confidence            455556666778999999954


No 115
>PRK14748 kdpF potassium-transporting ATPase subunit F; Provisional
Probab=31.00  E-value=54  Score=20.23  Aligned_cols=14  Identities=21%  Similarity=0.475  Sum_probs=8.8

Q ss_pred             HHHHHHHHhhhhhe
Q psy14697         53 VAVGAAVVVGYCLY   66 (175)
Q Consensus        53 agvaga~flGYcIY   66 (175)
                      .++...+.+||-||
T Consensus         8 G~ilv~lLlgYLvy   21 (29)
T PRK14748          8 GVLLVFLLLGYLVY   21 (29)
T ss_pred             HHHHHHHHHHHHHH
Confidence            33444567888877


No 116
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=30.95  E-value=2.6e+02  Score=24.95  Aligned_cols=32  Identities=19%  Similarity=0.253  Sum_probs=28.8

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQ  148 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~  148 (175)
                      +.+|..++.++++++|.++|-+++..-|++..
T Consensus       332 l~lgrl~~~~~~~~~A~~~le~al~~~P~~~~  363 (398)
T PRK10747        332 STLGQLLMKHGEWQEASLAFRAALKQRPDAYD  363 (398)
T ss_pred             HHHHHHHHHCCCHHHHHHHHHHHHhcCCCHHH
Confidence            67899999999999999999999999888877


No 117
>PF08802 CytB6-F_Fe-S:  Cytochrome B6-F complex Fe-S subunit ;  InterPro: IPR014909 The cytochrome b6-f complex mediates electron transfer between photosystem II (PSII) and photosystem I (PSI), cyclic electron flow around PSI, and state transitions. The cytochrome b6-f complex has 4 large subunits, these are: cytochrome b6, subunit IV (17 kDa polypeptide, PetD), cytochrome f and the Rieske protein, while the 4 small subunits are: PetG, PetL, PetM and PetN. The complex functions as a dimer.  This protein corresponds to the alpha helical transmembrane domain of the cytochrome b6-f complex Rieske iron-sulphur subunit. ; GO: 0009496 plastoquinol-plastocyanin reductase activity, 0051537 2 iron, 2 sulfur cluster binding, 0055114 oxidation-reduction process, 0042651 thylakoid membrane; PDB: 1Q90_R 1VF5_D 2E75_D 2E74_D 2E76_D 2D2C_Q 2ZT9_D.
Probab=30.78  E-value=54  Score=21.35  Aligned_cols=22  Identities=9%  Similarity=0.048  Sum_probs=11.6

Q ss_pred             hhHHHHHHHHHHHHHhhhhhee
Q psy14697         46 GNKVVTAVAVGAAVVVGYCLYF   67 (175)
Q Consensus        46 ~~~~~~~agvaga~flGYcIYF   67 (175)
                      ..+.++++++++.....|.-||
T Consensus        14 ~ll~Gava~~a~~~lyP~~~ff   35 (39)
T PF08802_consen   14 LLLGGAVAVPAGGMLYPYVKFF   35 (39)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHhhHHHHHHHHhhhheeEe
Confidence            3344455555555555565554


No 118
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=30.55  E-value=2.2e+02  Score=22.57  Aligned_cols=32  Identities=9%  Similarity=0.201  Sum_probs=28.4

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQ  148 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~  148 (175)
                      ..+|+.+...|++++|...|-+++..+|+...
T Consensus        74 ~~la~~~~~~~~~~~A~~~~~~~l~~~p~~~~  105 (235)
T TIGR03302        74 LDLAYAYYKSGDYAEAIAAADRFIRLHPNHPD  105 (235)
T ss_pred             HHHHHHHHhcCCHHHHHHHHHHHHHHCcCCCc
Confidence            56799999999999999999999999996554


No 119
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=30.35  E-value=1.4e+02  Score=30.94  Aligned_cols=95  Identities=14%  Similarity=0.114  Sum_probs=60.9

Q ss_pred             ccCC-CCChhhHHHhHHHHHHhHH-HH--HHhhcCCCCCCCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhc
Q psy14697         68 DKKR-RSDPLFKEKLKERRRKNRE-LL--QQKNKRGIPDLKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVC  143 (175)
Q Consensus        68 D~KR-R~dP~FRKkLrerRrk~~~-~~--~~a~~~~~p~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~Vc  143 (175)
                      |++| ++++.|.+-++...+..-. +-  =+....  -+|++++..       ..+++.++..|+.++|..++-+|+..-
T Consensus        38 ~~~~~~~~~~f~~a~~~~~~Gd~~~A~~~l~~Al~--~dP~n~~~~-------~~LA~~yl~~g~~~~A~~~~~kAv~ld  108 (987)
T PRK09782         38 DYRHFVIYPRLDKALKAQKNNDEATAIREFEYIHQ--QVPDNIPLT-------LYLAEAYRHFGHDDRARLLLEDQLKRH  108 (987)
T ss_pred             hHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHH--hCCCCHHHH-------HHHHHHHHHCCCHHHHHHHHHHHHhcC
Confidence            5555 6778888877765432221 11  011222  245666655       788999999999999999999999998


Q ss_pred             CChHHHHHHHh--hcCCH--HHHHHHHHhcCC
Q psy14697        144 GQPNQLLGVLQ--QTLPP--NVFNALIEKLPP  171 (175)
Q Consensus       144 pqP~~LL~IyQ--qTLPp--~Vf~~Li~kLp~  171 (175)
                      |....+...+-  +--|+  ++|+-+++.-|.
T Consensus       109 P~n~~~~~~La~i~~~~kA~~~ye~l~~~~P~  140 (987)
T PRK09782        109 PGDARLERSLAAIPVEVKSVTTVEELLAQQKA  140 (987)
T ss_pred             cccHHHHHHHHHhccChhHHHHHHHHHHhCCC
Confidence            87665554221  22333  557777776664


No 120
>PF09164 VitD-bind_III:  Vitamin D binding protein, domain III;  InterPro: IPR015247 This domain is predominantly found in Vitamin D binding proteins, and adopts a multihelical structure. It is required for formation of an actin 'clamp', allowing the protein to bind to actin []. ; PDB: 1MA9_A 1KW2_A 1KXP_D 1J7E_A 1J78_A 1LOT_A.
Probab=30.30  E-value=40  Score=24.51  Aligned_cols=21  Identities=29%  Similarity=0.393  Sum_probs=15.8

Q ss_pred             ccCCCCChhhHHHhHHHHHHh
Q psy14697         68 DKKRRSDPLFKEKLKERRRKN   88 (175)
Q Consensus        68 D~KRR~dP~FRKkLrerRrk~   88 (175)
                      ||.--.=-||||||.++-+++
T Consensus         5 dYse~tFtEyKKrL~e~l~~k   25 (68)
T PF09164_consen    5 DYSENTFTEYKKRLAERLRAK   25 (68)
T ss_dssp             TTTTS-HHHHHHHHHHHHHHH
T ss_pred             hhhhccHHHHHHHHHHHHHHH
Confidence            666667789999999877653


No 121
>PRK01001 putative inner membrane protein translocase component YidC; Provisional
Probab=30.23  E-value=65  Score=32.93  Aligned_cols=75  Identities=17%  Similarity=0.220  Sum_probs=38.2

Q ss_pred             hhHHHHHHHHHHHHHhhhhheeccCCCCChhhHHHhHHHHHHhHHHHHH----hhcCCCCCCCChHHHHHHHHHHHHhHh
Q psy14697         46 GNKVVTAVAVGAAVVVGYCLYFDKKRRSDPLFKEKLKERRRKNRELLQQ----KNKRGIPDLKDHSAVQQYFLQEIQLGE  121 (175)
Q Consensus        46 ~~~~~~~agvaga~flGYcIYFD~KRR~dP~FRKkLrerRrk~~~~~~~----a~~~~~p~~~d~e~~E~fFl~eV~lGE  121 (175)
                      ++++..+..+ +++|+|--+||-|+   |-.=.+.++++.|+..+.+.+    -+.+..|-.++.++-|.==-..+..|.
T Consensus         3 ~r~~~~v~~~-~~~~~~~~~~f~~~---~~~~~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   78 (795)
T PRK01001          3 KRTLLFVSLV-SAAFLGCQIFFGYR---DFRSCQDLAEKQRAISEQILAATESVGLSVVPWTTSLEEEESKNQYAVRVGD   78 (795)
T ss_pred             ceeehHHHHH-HHHHHHHHHHhhcc---cHHHHHHHHHHHHHHHHHHHHhhccccccccccccccchhhhhhhhhhhccc
Confidence            4444334333 35699999999997   333345566654443322211    234445555555443333333455565


Q ss_pred             HHh
Q psy14697        122 TLL  124 (175)
Q Consensus       122 ~L~  124 (175)
                      .|+
T Consensus        79 ~~~   81 (795)
T PRK01001         79 RLL   81 (795)
T ss_pred             eEE
Confidence            554


No 122
>KOG0548|consensus
Probab=29.77  E-value=2.3e+02  Score=27.90  Aligned_cols=29  Identities=21%  Similarity=0.371  Sum_probs=24.7

Q ss_pred             HhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697        120 GETLLAAGDLDNGVEHLANALTVCGQPNQ  148 (175)
Q Consensus       120 GE~L~a~G~~~eaa~hf~nAl~VcpqP~~  148 (175)
                      |-.....|++.+|+.|+..||+-=|+-..
T Consensus       365 Gne~Fk~gdy~~Av~~YteAIkr~P~Da~  393 (539)
T KOG0548|consen  365 GNEAFKKGDYPEAVKHYTEAIKRDPEDAR  393 (539)
T ss_pred             HHHHHhccCHHHHHHHHHHHHhcCCchhH
Confidence            99999999999999999999986654433


No 123
>PF04654 DUF599:  Protein of unknown function, DUF599;  InterPro: IPR006747 This family includes several uncharacterised proteins.
Probab=29.12  E-value=70  Score=26.97  Aligned_cols=32  Identities=28%  Similarity=0.528  Sum_probs=23.6

Q ss_pred             HHHHHHhhhhheeccCCCCChhhHHHhHHHHH
Q psy14697         55 VGAAVVVGYCLYFDKKRRSDPLFKEKLKERRR   86 (175)
Q Consensus        55 vaga~flGYcIYFD~KRR~dP~FRKkLrerRr   86 (175)
                      ...+..+||-.|.+|+-|.+|--....++.|+
T Consensus         6 ~~l~~~~gY~~~l~~~~~~~pt~~~~~~~~R~   37 (216)
T PF04654_consen    6 LFLLCWVGYHLYLERRVRRRPTLSGIMNQYRR   37 (216)
T ss_pred             HHHHHHHHHHHHHhhhccCCChHHHHHHHHHH
Confidence            33356889999999999999955555555444


No 124
>KOG0909|consensus
Probab=29.04  E-value=55  Score=31.59  Aligned_cols=47  Identities=19%  Similarity=0.196  Sum_probs=39.0

Q ss_pred             hhcCChhhHHHHHHHHHHhcCChHHHHHHHhhcCC-HHHHHHHHHhcCC
Q psy14697        124 LAAGDLDNGVEHLANALTVCGQPNQLLGVLQQTLP-PNVFNALIEKLPP  171 (175)
Q Consensus       124 ~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLP-p~Vf~~Li~kLp~  171 (175)
                      .+.+.+.+.+....+.+.||.+| .|..+=..|+| +++++...++++.
T Consensus        84 ~~~~~Fle~l~~~sqhvlvYEDP-~lq~~Als~iPvdEL~~kA~ekla~  131 (500)
T KOG0909|consen   84 RATNAFLEVLRSYSQHVLVYEDP-VLQAKALSTIPVDELKEKASEKLAK  131 (500)
T ss_pred             hcchHHHHHHHHhccCceeecCH-HHHHHHHhcCCHHHHHHHHHHhhcc
Confidence            34455677888889999999999 67777788999 8999999999884


No 125
>PLN03088 SGT1,  suppressor of G2 allele of SKP1; Provisional
Probab=28.98  E-value=1e+02  Score=27.48  Aligned_cols=37  Identities=5%  Similarity=0.196  Sum_probs=30.8

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHH
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVL  153 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~Iy  153 (175)
                      ..+|..+...|++++|..+|-+|+..-|+-.++...+
T Consensus        74 ~~lg~~~~~lg~~~eA~~~~~~al~l~P~~~~~~~~l  110 (356)
T PLN03088         74 LRKGTACMKLEEYQTAKAALEKGASLAPGDSRFTKLI  110 (356)
T ss_pred             HHHHHHHHHhCCHHHHHHHHHHHHHhCCCCHHHHHHH
Confidence            4679999999999999999999999998766654443


No 126
>KOG3498|consensus
Probab=28.93  E-value=65  Score=23.37  Aligned_cols=22  Identities=14%  Similarity=0.199  Sum_probs=16.6

Q ss_pred             hhhhHHHHHHHHHHHHHhhhhh
Q psy14697         44 ISGNKVVTAVAVGAAVVVGYCL   65 (175)
Q Consensus        44 ~~~~~~~~~agvaga~flGYcI   65 (175)
                      .++...+.+.|.+..+|+||.|
T Consensus        33 f~ki~~~~aiGf~~mG~iGf~v   54 (67)
T KOG3498|consen   33 FTKIAKATAIGFVIMGFIGFFV   54 (67)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            4555666777777788999987


No 127
>KOG0772|consensus
Probab=28.24  E-value=59  Score=32.14  Aligned_cols=57  Identities=16%  Similarity=0.361  Sum_probs=46.8

Q ss_pred             HHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhhcCCHHHHHHH
Q psy14697        107 SAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQTLPPNVFNAL  165 (175)
Q Consensus       107 e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLPp~Vf~~L  165 (175)
                      -.+..|...||.+-......-|.-||+...+++-.  .+|.=+-..|.||-|+.||+-+
T Consensus       568 g~L~sy~vk~iglak~~~~d~DpREAILKyAk~A~--~nP~~iapaYskTQPk~IF~~~  624 (641)
T KOG0772|consen  568 GLLSSYLVKQIGLAKNFMDDEDPREAILKYAKVAE--ENPYFIAPAYSKTQPKTIFQEV  624 (641)
T ss_pred             ccHHHHHHhhcccccccccccCHHHHHHHHHhhhh--hCccccchhhhccCCchhhccc
Confidence            46899999999998887777778899998888765  4565555799999999999865


No 128
>PRK02898 cobalt transport protein CbiN; Provisional
Probab=27.97  E-value=38  Score=26.17  Aligned_cols=26  Identities=27%  Similarity=0.318  Sum_probs=18.4

Q ss_pred             hHHHHHHHHHHHHHhhhhheeccCCC
Q psy14697         47 NKVVTAVAVGAAVVVGYCLYFDKKRR   72 (175)
Q Consensus        47 ~~~~~~agvaga~flGYcIYFD~KRR   72 (175)
                      +++.+.=++.|++++||++-+-+-||
T Consensus        68 SLLFaLQAAiGAgiIgY~lG~~~gr~   93 (100)
T PRK02898         68 SLLFALQAALGAGIIGYILGYYKGRS   93 (100)
T ss_pred             HHHHHHHHHHhhhhhheeeeehhhhh
Confidence            45556667777899999987655443


No 129
>COG1687 AzlD Predicted branched-chain amino acid permeases (azaleucine resistance) [Amino acid transport and metabolism]
Probab=27.82  E-value=68  Score=25.14  Aligned_cols=24  Identities=38%  Similarity=0.778  Sum_probs=21.4

Q ss_pred             cCChHHHHHHHhhcCCHHHHHHHH
Q psy14697        143 CGQPNQLLGVLQQTLPPNVFNALI  166 (175)
Q Consensus       143 cpqP~~LL~IyQqTLPp~Vf~~Li  166 (175)
                      -.+|.|+.+.+.+++|+.|+-+|.
T Consensus        29 ~~~ppq~v~~lgk~lP~~v~~~Lv   52 (106)
T COG1687          29 SGRPPQFVGYLGKVLPASVMGMLV   52 (106)
T ss_pred             CCCchHHHHHHHHhcCHHHHHHHH
Confidence            357899999999999999999884


No 130
>PF09716 ETRAMP:  Malarial early transcribed membrane protein (ETRAMP);  InterPro: IPR006389 These sequences represent a family of proteins from the malaria parasite Plasmodium falciparum, several of which have been shown to be expressed specifically in the ring stage as well as the rodent parasite Plasmodium yoelii []. A homologue from Plasmodium chabaudi was localized to the parasitophorous vacuole membrane []. Members have an initial hydrophobic, Phe/Tyr-rich stretch long enough to span the membrane, a highly charged region rich in Lys, a second putative transmembrane region, and a second highly charged, low complexity sequence region. Some members have up to 100 residues of additional C-terminal sequence. These genes have been shown to be found in the sub-telomeric regions of both Plasmodium falciparum and P. yoelii chromosomes.
Probab=27.76  E-value=59  Score=23.78  Aligned_cols=14  Identities=36%  Similarity=0.470  Sum_probs=8.7

Q ss_pred             HHHHHhhhhheecc
Q psy14697         56 GAAVVVGYCLYFDK   69 (175)
Q Consensus        56 aga~flGYcIYFD~   69 (175)
                      +.++.+||-+|.-+
T Consensus        68 li~~~~G~g~y~~~   81 (84)
T PF09716_consen   68 LIATALGYGYYKKK   81 (84)
T ss_pred             HHHHHHHHHHHhhc
Confidence            33456688888543


No 131
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=27.60  E-value=2.9e+02  Score=22.40  Aligned_cols=31  Identities=13%  Similarity=0.190  Sum_probs=24.9

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCChH
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQPN  147 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~  147 (175)
                      ..+|-.+...|++++|+.++-+++..-|.=.
T Consensus       148 ~~LA~~~~~~g~~~~Ai~~~~~aL~l~~~~~  178 (198)
T PRK10370        148 MLLASDAFMQADYAQAIELWQKVLDLNSPRV  178 (198)
T ss_pred             HHHHHHHHHcCCHHHHHHHHHHHHhhCCCCc
Confidence            3457777788999999999999998877643


No 132
>PF13431 TPR_17:  Tetratricopeptide repeat
Probab=27.50  E-value=81  Score=18.75  Aligned_cols=25  Identities=20%  Similarity=0.369  Sum_probs=18.8

Q ss_pred             CCCChHHHHHHHHHHHHhHhHHhhcCChhhHH
Q psy14697        102 DLKDHSAVQQYFLQEIQLGETLLAAGDLDNGV  133 (175)
Q Consensus       102 ~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa  133 (175)
                      +|+|+++.       -.+|..+...|+.++|.
T Consensus         9 ~P~n~~a~-------~nla~~~~~~g~~~~A~   33 (34)
T PF13431_consen    9 NPNNAEAY-------NNLANLYLNQGDYEEAI   33 (34)
T ss_pred             CCCCHHHH-------HHHHHHHHHCcCHHhhc
Confidence            46776665       35899999999988774


No 133
>PF12688 TPR_5:  Tetratrico peptide repeat
Probab=27.24  E-value=1.3e+02  Score=23.12  Aligned_cols=31  Identities=32%  Similarity=0.338  Sum_probs=27.3

Q ss_pred             HHHHhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697        115 QEIQLGETLLAAGDLDNGVEHLANALTVCGQ  145 (175)
Q Consensus       115 ~eV~lGE~L~a~G~~~eaa~hf~nAl~Vcpq  145 (175)
                      -.|++|..+...|++++|+.-|-+++.-.|.
T Consensus        40 a~i~lastlr~LG~~deA~~~L~~~~~~~p~   70 (120)
T PF12688_consen   40 ALIQLASTLRNLGRYDEALALLEEALEEFPD   70 (120)
T ss_pred             HHHHHHHHHHHcCCHHHHHHHHHHHHHHCCC
Confidence            4568999999999999999999999988887


No 134
>PF12072 DUF3552:  Domain of unknown function (DUF3552);  InterPro: IPR022711  This presumed domain is functionally uncharacterised. This domain is found in bacteria, archaea and eukaryotes. This domain is about 200 amino acids in length. This domain is found associated with PF00013 from PFAM, PF01966 from PFAM. This domain has a single completely conserved residue A that may be functionally important. ; GO: 0008663 2',3'-cyclic-nucleotide 2'-phosphodiesterase activity
Probab=27.20  E-value=1.7e+02  Score=24.23  Aligned_cols=16  Identities=25%  Similarity=0.505  Sum_probs=9.5

Q ss_pred             HHHHHHHHHHHhhhhh
Q psy14697         50 VTAVAVGAAVVVGYCL   65 (175)
Q Consensus        50 ~~~agvaga~flGYcI   65 (175)
                      .++++++.++++||.+
T Consensus         5 ~~i~~~~vG~~~G~~~   20 (201)
T PF12072_consen    5 IAIVALIVGIGIGYLV   20 (201)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3444444456788888


No 135
>PF10828 DUF2570:  Protein of unknown function (DUF2570);  InterPro: IPR022538 This entry is represented by Bacteriophage IME08, pseT.3. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.  This is a family of proteins with unknown function. 
Probab=27.15  E-value=2.7e+02  Score=20.91  Aligned_cols=100  Identities=21%  Similarity=0.268  Sum_probs=50.7

Q ss_pred             HHHHHHHHHHHHhhhhheeccCCCCChhhHHHhHHHHHHhHHH-HHHhhcCCCCCCCChHHHHHHHHHHHHhHhHHhhcC
Q psy14697         49 VVTAVAVGAAVVVGYCLYFDKKRRSDPLFKEKLKERRRKNREL-LQQKNKRGIPDLKDHSAVQQYFLQEIQLGETLLAAG  127 (175)
Q Consensus        49 ~~~~agvaga~flGYcIYFD~KRR~dP~FRKkLrerRrk~~~~-~~~a~~~~~p~~~d~e~~E~fFl~eV~lGE~L~a~G  127 (175)
                      +.+++++...+..||..|--+       .-.+|+.+-+++++. +..+..  .-.+.+..+-++..+++.+.=|..+.+ 
T Consensus         5 ~~~~l~~lvl~L~~~l~~qs~-------~i~~L~a~n~~q~~tI~qq~~~--~~~L~~~~~~~r~~~~~~~~~~qq~r~-   74 (110)
T PF10828_consen    5 IYIALAVLVLGLGGWLWYQSQ-------RIDRLRAENKAQAQTIQQQEDA--NQELKAQLQQNRQAVEEQQKREQQLRQ-   74 (110)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH-
Confidence            344445555556667666432       234455443333322 111111  011233344567777777777776654 


Q ss_pred             ChhhHHHHHHHHHHhcCChHHHHHHHhhcCCHHHHHHH
Q psy14697        128 DLDNGVEHLANALTVCGQPNQLLGVLQQTLPPNVFNAL  165 (175)
Q Consensus       128 ~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLPp~Vf~~L  165 (175)
                      +.++..+-+-+||+==  |     -+...+|.+|.+.|
T Consensus        75 ~~e~~~e~ik~~lk~d--~-----Ca~~~~P~~V~d~L  105 (110)
T PF10828_consen   75 QSEERRESIKTALKDD--P-----CANTAVPDAVIDSL  105 (110)
T ss_pred             HHHHHHHHHHHHHccC--c-----cccCCCCHHHHHHH
Confidence            4456677777777621  1     23345777776654


No 136
>PF13429 TPR_15:  Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=27.05  E-value=96  Score=25.65  Aligned_cols=36  Identities=19%  Similarity=0.227  Sum_probs=27.9

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHH
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGV  152 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~I  152 (175)
                      +..|+.+...|+.++|...+-+||..-|+-.+++.-
T Consensus       150 ~~~a~~~~~~G~~~~A~~~~~~al~~~P~~~~~~~~  185 (280)
T PF13429_consen  150 LALAEIYEQLGDPDKALRDYRKALELDPDDPDARNA  185 (280)
T ss_dssp             HHHHHHHHHCCHHHHHHHHHHHHHHH-TT-HHHHHH
T ss_pred             HHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHHHH
Confidence            456788888999999999999999999985555543


No 137
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=26.92  E-value=1.1e+02  Score=29.37  Aligned_cols=38  Identities=13%  Similarity=0.050  Sum_probs=30.5

Q ss_pred             CCCChHHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhc
Q psy14697        102 DLKDHSAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVC  143 (175)
Q Consensus       102 ~~~d~e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~Vc  143 (175)
                      +|+++++...|    ..+|-.+...|+.++|+.+|-+||..+
T Consensus       105 ~Pd~aeA~~A~----yNLAcaya~LGr~dEAla~LrrALels  142 (453)
T PLN03098        105 NPNPDEAQAAY----YNKACCHAYREEGKKAADCLRTALRDY  142 (453)
T ss_pred             CCCchHHHHHH----HHHHHHHHHcCCHHHHHHHHHHHHHhc
Confidence            45666553333    678999999999999999999999985


No 138
>PF04415 DUF515:  Protein of unknown function (DUF515)    ;  InterPro: IPR007509 This is a family of hypothetical archaeal proteins.
Probab=26.90  E-value=1.3e+02  Score=28.64  Aligned_cols=67  Identities=13%  Similarity=0.180  Sum_probs=35.3

Q ss_pred             hhHHHHHHHHHHHHHhhhhheeccCCCCChhhHHHhHHHHHHhHHHHHHhhcCCCCCCCChHHHHHHHHHHHHhH
Q psy14697         46 GNKVVTAVAVGAAVVVGYCLYFDKKRRSDPLFKEKLKERRRKNRELLQQKNKRGIPDLKDHSAVQQYFLQEIQLG  120 (175)
Q Consensus        46 ~~~~~~~agvaga~flGYcIYFD~KRR~dP~FRKkLrerRrk~~~~~~~a~~~~~p~~~d~e~~E~fFl~eV~lG  120 (175)
                      +.+++++.-+...+|++|.+|+..+=|--   ++ |.. .|+++ .++-...-+-+.|+|++..  -|..||+.+
T Consensus        31 riiiGa~Vl~iIii~~~~~~Y~~~~~~~~---~e-Le~-aK~~k-I~evN~~F~g~l~nD~~K~--~li~qI~~A   97 (416)
T PF04415_consen   31 RIIIGAAVLIIIIIFIVYNIYYFLQNQPA---QE-LEN-AKAAK-INEVNKYFKGELPNDPQKI--ILINQIQSA   97 (416)
T ss_pred             hhhhhhhhHhHHHHHHHHHHHHHhhhhHH---HH-HHH-HHHHH-HHHHHHHhcCcCCCChHHH--HHHHHHHhc
Confidence            34454444555568899999998875432   22 332 11111 1111111222567888743  467777665


No 139
>PRK11462 putative transporter; Provisional
Probab=26.53  E-value=58  Score=29.42  Aligned_cols=23  Identities=17%  Similarity=0.332  Sum_probs=16.3

Q ss_pred             heeccCCCCChhhHHHhHHHHHH
Q psy14697         65 LYFDKKRRSDPLFKEKLKERRRK   87 (175)
Q Consensus        65 IYFD~KRR~dP~FRKkLrerRrk   87 (175)
                      -||+--|+...+.+++|++||++
T Consensus       426 ~~y~l~~~~~~~i~~~l~~~~~~  448 (460)
T PRK11462        426 RYYSLTTHNLKTVMEQLAQGKRR  448 (460)
T ss_pred             HhccCCHHHHHHHHHHHHHHHhh
Confidence            35677677788888888766543


No 140
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=26.44  E-value=2.2e+02  Score=29.33  Aligned_cols=27  Identities=22%  Similarity=0.236  Sum_probs=12.9

Q ss_pred             hHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697        119 LGETLLAAGDLDNGVEHLANALTVCGQ  145 (175)
Q Consensus       119 lGE~L~a~G~~~eaa~hf~nAl~Vcpq  145 (175)
                      +|..+..+|++++|..+|-+|+..-|.
T Consensus       391 Lg~~~~~~g~~~eA~~~y~~aL~~~p~  417 (1157)
T PRK11447        391 LGDVAMARKDYAAAERYYQQALRMDPG  417 (1157)
T ss_pred             HHHHHHHCCCHHHHHHHHHHHHHhCCC
Confidence            444444444444444444444444444


No 141
>PF07862 Nif11:  Nitrogen fixation protein of unknown function;  InterPro: IPR012903 This domain is found in the cyanobacteria, and the nitrogen-fixing proteobacterium Azotobacter vinelandii and may be involved in nitrogen fixation, but no role has been assigned []. 
Probab=25.66  E-value=41  Score=21.60  Aligned_cols=12  Identities=50%  Similarity=0.872  Sum_probs=10.7

Q ss_pred             CCChhhHHHhHH
Q psy14697         72 RSDPLFKEKLKE   83 (175)
Q Consensus        72 R~dP~FRKkLre   83 (175)
                      ++||+||.+|+.
T Consensus        14 ~~d~~l~~~l~~   25 (49)
T PF07862_consen   14 KSDPELREQLKA   25 (49)
T ss_pred             hcCHHHHHHHHh
Confidence            499999999986


No 142
>KOG3289|consensus
Probab=25.18  E-value=31  Score=29.62  Aligned_cols=54  Identities=22%  Similarity=0.268  Sum_probs=34.2

Q ss_pred             cccccccccccccCCCCccc----------------hhhhhhhhHHHHHHHHHHHHHhhhhheeccCCCCChh
Q psy14697         20 WCGYFGLWSGRIGPSGRLMS----------------PMELISGNKVVTAVAVGAAVVVGYCLYFDKKRRSDPL   76 (175)
Q Consensus        20 ~~~~~~~~~~~~~~~~~~~~----------------~~~m~~~~~~~~~agvaga~flGYcIYFD~KRR~dP~   76 (175)
                      -|+.+||.-|+++++|.-..                |+| +.-+.+=.=..-.|..++||  ||-.-|-+|-.
T Consensus        23 h~aVnGLLla~~~~kg~~v~itdcVPLfH~~laLaPmlE-vAl~lId~~~~~~GlviaGy--y~Ane~~~D~s   92 (199)
T KOG3289|consen   23 HAAVNGLLLAPATGKGECVEITDCVPLFHSHLALAPMLE-VALNLIDVWGAQAGLVIAGY--YHANERVNDQS   92 (199)
T ss_pred             ccceeeEEEeccCCCCCeEEEEecchhhccccccccHHH-HHHHHHHHHHHhcCeEEEEE--eecCCCccccC
Confidence            47889999999999997543                444 22222222222344556666  88888888744


No 143
>TIGR01162 purE phosphoribosylaminoimidazole carboxylase, PurE protein. Phosphoribosylaminoimidazole carboxylase is a fusion protein in plants and fungi, but consists of two non-interacting proteins in bacteria, PurK and PurE. This model represents PurK, an N5-CAIR mutase.
Probab=25.08  E-value=90  Score=25.73  Aligned_cols=20  Identities=45%  Similarity=0.609  Sum_probs=16.9

Q ss_pred             CCCChhhHHHhHHHHHHhHH
Q psy14697         71 RRSDPLFKEKLKERRRKNRE   90 (175)
Q Consensus        71 RR~dP~FRKkLrerRrk~~~   90 (175)
                      +.+||+.++||++.|+++++
T Consensus       127 ~~~d~~l~~kl~~~r~~~~~  146 (156)
T TIGR01162       127 GIKDPELAEKLKEYRENQKE  146 (156)
T ss_pred             cCCCHHHHHHHHHHHHHHHH
Confidence            57899999999998877664


No 144
>KOG2072|consensus
Probab=24.83  E-value=3.4e+02  Score=28.47  Aligned_cols=89  Identities=17%  Similarity=0.259  Sum_probs=48.0

Q ss_pred             hhheeccCCCCChhhHHHhHHHHHHhHHHHHHhhcCCCCCCCChHHHHHHHHHHHHhHhHHhhcCChh------hHHHHH
Q psy14697         63 YCLYFDKKRRSDPLFKEKLKERRRKNRELLQQKNKRGIPDLKDHSAVQQYFLQEIQLGETLLAAGDLD------NGVEHL  136 (175)
Q Consensus        63 YcIYFD~KRR~dP~FRKkLrerRrk~~~~~~~a~~~~~p~~~d~e~~E~fFl~eV~lGE~L~a~G~~~------eaa~hf  136 (175)
                      +|.  -|+|++  +|||=--.=|..-....+....+..++++|||.+|.|.=.-.++=..-+..+-+.      |-+.|+
T Consensus       186 FCL--kYqRkt--EFRrLCe~LR~HL~~i~k~~nq~~~v~Ln~~Etlql~LDtRf~QLdvAi~lELWQEAyrSiEDIhgL  261 (988)
T KOG2072|consen  186 FCL--KYQRKT--EFRRLCELLRMHLDNINKHQNQSTRVDLNDPETLQLYLDTRFQQLDVAIELELWQEAYRSIEDIHGL  261 (988)
T ss_pred             HHH--HHhhhH--HHHHHHHHHHHHHHHHHHhhccCcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            665  456664  8887433223332233333445778899999999988655444333333322211      223344


Q ss_pred             HHHHHhcCChHHHHHHHhh
Q psy14697        137 ANALTVCGQPNQLLGVLQQ  155 (175)
Q Consensus       137 ~nAl~VcpqP~~LL~IyQq  155 (175)
                      ..--+=-|.|.-|-+-|++
T Consensus       262 m~lSKrtPkp~~laNYY~K  280 (988)
T KOG2072|consen  262 MKLSKRTPKPSTLANYYEK  280 (988)
T ss_pred             HHHhcCCCchhHHHHHHHH
Confidence            4444455666666555554


No 145
>PRK00888 ftsB cell division protein FtsB; Reviewed
Probab=24.70  E-value=1.4e+02  Score=22.63  Aligned_cols=29  Identities=14%  Similarity=0.237  Sum_probs=14.9

Q ss_pred             HHHhhhhhee-ccCCCCChhhHHHhHHHHH
Q psy14697         58 AVVVGYCLYF-DKKRRSDPLFKEKLKERRR   86 (175)
Q Consensus        58 a~flGYcIYF-D~KRR~dP~FRKkLrerRr   86 (175)
                      .++++|-+.| |+-.++--+.++++.+.++
T Consensus        12 l~~l~y~l~~g~~G~~~~~~l~~q~~~~~~   41 (105)
T PRK00888         12 LVWLQYSLWFGKNGILDYWRVNDQVAAQQQ   41 (105)
T ss_pred             HHHHHHHHhccCCcHHHHHHHHHHHHHHHH
Confidence            3567777766 3333344445555554333


No 146
>COG1575 MenA 1,4-dihydroxy-2-naphthoate octaprenyltransferase [Coenzyme metabolism]
Probab=24.45  E-value=21  Score=32.27  Aligned_cols=15  Identities=40%  Similarity=0.656  Sum_probs=12.4

Q ss_pred             eeccCCCCC---hhhHHH
Q psy14697         66 YFDKKRRSD---PLFKEK   80 (175)
Q Consensus        66 YFD~KRR~d---P~FRKk   80 (175)
                      ||||||-+|   |+.+++
T Consensus        66 Y~D~~KG~D~~g~~~~~~   83 (303)
T COG1575          66 YFDYKKGTDTHGPDRLKQ   83 (303)
T ss_pred             hhhhhcCCCCCCCccccc
Confidence            999999999   666554


No 147
>PF14938 SNAP:  Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=24.43  E-value=63  Score=27.43  Aligned_cols=28  Identities=21%  Similarity=0.312  Sum_probs=20.8

Q ss_pred             HHhHhHHhhc-CChhhHHHHHHHHHHhcC
Q psy14697        117 IQLGETLLAA-GDLDNGVEHLANALTVCG  144 (175)
Q Consensus       117 V~lGE~L~a~-G~~~eaa~hf~nAl~Vcp  144 (175)
                      ..+|+.+-.. |++++|++++-+|+.+|.
T Consensus       118 ~~lA~~ye~~~~d~e~Ai~~Y~~A~~~y~  146 (282)
T PF14938_consen  118 KELAEIYEEQLGDYEKAIEYYQKAAELYE  146 (282)
T ss_dssp             HHHHHHHCCTT--HHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Confidence            5677777777 788888888888888774


No 148
>PF02526 GBP_repeat:  Glycophorin-binding protein;  InterPro: IPR003681 The glycophorin-binding protein contains a tandem repeat. The repeated sequence determines the binding domain for an erythrocyte receptor binding protein of Plasmodium falciparum, the malarial parasite []. Erythrocyte invasion by the malarial merozoite is a receptor-mediated process, an obligatory step in the development of the parasite. The P. falciparum protein binds to the erythrocyte receptor glycophorin.
Probab=24.25  E-value=41  Score=21.71  Aligned_cols=10  Identities=30%  Similarity=0.833  Sum_probs=8.4

Q ss_pred             CChhhHHHhH
Q psy14697         73 SDPLFKEKLK   82 (175)
Q Consensus        73 ~dP~FRKkLr   82 (175)
                      .||+|||-|.
T Consensus        15 adpeyrkh~~   24 (38)
T PF02526_consen   15 ADPEYRKHLN   24 (38)
T ss_pred             cCHHHHHHHH
Confidence            5999999875


No 149
>PRK15058 cytochrome b562; Provisional
Probab=24.05  E-value=2.7e+02  Score=22.35  Aligned_cols=22  Identities=14%  Similarity=0.212  Sum_probs=19.1

Q ss_pred             HHHHHHHhHhHHhhcCChhhHH
Q psy14697        112 YFLQEIQLGETLLAAGDLDNGV  133 (175)
Q Consensus       112 fFl~eV~lGE~L~a~G~~~eaa  133 (175)
                      -+..||..-+.|+.+|+.++|-
T Consensus        89 ~Li~qID~a~~la~~GkL~eAK  110 (128)
T PRK15058         89 ILVGQIDGALKLANEGKVKEAQ  110 (128)
T ss_pred             HHHHHHHHHHHHHhCCCHHHHH
Confidence            3788999999999999998774


No 150
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=23.94  E-value=5.2e+02  Score=23.03  Aligned_cols=34  Identities=15%  Similarity=0.123  Sum_probs=22.7

Q ss_pred             HhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHH
Q psy14697        120 GETLLAAGDLDNGVEHLANALTVCGQPNQLLGVL  153 (175)
Q Consensus       120 GE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~Iy  153 (175)
                      ++..+.+|++++|..++-+++...|+..+.+..+
T Consensus       160 a~l~l~~g~~~~Al~~l~~~~~~~P~~~~al~ll  193 (398)
T PRK10747        160 VRIQLARNENHAARHGVDKLLEVAPRHPEVLRLA  193 (398)
T ss_pred             HHHHHHCCCHHHHHHHHHHHHhcCCCCHHHHHHH
Confidence            5555667777777777777777777666444433


No 151
>smart00745 MIT Microtubule Interacting and Trafficking molecule domain.
Probab=23.90  E-value=1e+02  Score=21.06  Aligned_cols=20  Identities=15%  Similarity=0.298  Sum_probs=13.9

Q ss_pred             hhhHHHHHHHHHHhcCChHH
Q psy14697        129 LDNGVEHLANALTVCGQPNQ  148 (175)
Q Consensus       129 ~~eaa~hf~nAl~VcpqP~~  148 (175)
                      |.+|+++|-+|+.+.++|..
T Consensus        31 Y~~a~e~l~~~~~~~~~~~~   50 (77)
T smart00745       31 YKKAIEYLLEGIKVESDSKR   50 (77)
T ss_pred             HHHHHHHHHHHhccCCCHHH
Confidence            45667777788888777644


No 152
>PF02234 CDI:  Cyclin-dependent kinase inhibitor;  InterPro: IPR003175 Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase.; GO: 0004861 cyclin-dependent protein kinase inhibitor activity, 0007050 cell cycle arrest, 0005634 nucleus; PDB: 1H27_E 1JSU_C.
Probab=23.80  E-value=58  Score=21.79  Aligned_cols=22  Identities=23%  Similarity=0.439  Sum_probs=14.9

Q ss_pred             CCChHHHHHHHHHHHHhHhHHh
Q psy14697        103 LKDHSAVQQYFLQEIQLGETLL  124 (175)
Q Consensus       103 ~~d~e~~E~fFl~eV~lGE~L~  124 (175)
                      |.|+++++++|.++.+.-.+-.
T Consensus         6 p~d~~e~~~~~~~~l~~~~e~~   27 (51)
T PF02234_consen    6 PVDHEELERFFQEELQEQREEF   27 (51)
T ss_dssp             ---HHHHHHHHHHHHTTTTHHH
T ss_pred             CCCHHHHHHHHHHHHHHHHHHH
Confidence            7899999999999877654443


No 153
>PF13480 Acetyltransf_6:  Acetyltransferase (GNAT) domain
Probab=23.74  E-value=73  Score=22.68  Aligned_cols=37  Identities=11%  Similarity=0.228  Sum_probs=19.8

Q ss_pred             hhHHHhHHHHHHhHHHHHHhhcCCCCCCCChHHHHHHHHHH
Q psy14697         76 LFKEKLKERRRKNRELLQQKNKRGIPDLKDHSAVQQYFLQE  116 (175)
Q Consensus        76 ~FRKkLrerRrk~~~~~~~a~~~~~p~~~d~e~~E~fFl~e  116 (175)
                      .+|+++|+++|+.++    .+.-.+-..+++++.++++-.-
T Consensus         2 k~r~~~rr~~r~~~~----~g~~~~~~~~~~~~~~~~~~~~   38 (142)
T PF13480_consen    2 KFRKKIRRAIRRAEK----LGGVRFEVATDPADLEAFYELY   38 (142)
T ss_pred             cHHHHHHHHHHHHHh----cCCEEEEEeCCHHHHHHHHHHH
Confidence            467888877766543    1211112235667776655443


No 154
>COG1453 Predicted oxidoreductases of the aldo/keto reductase family [General function prediction only]
Probab=23.58  E-value=48  Score=31.20  Aligned_cols=49  Identities=16%  Similarity=0.318  Sum_probs=36.1

Q ss_pred             eeccCCCCChhhHHHhHHHHHHhHHHHHHhhcCCCC--CCCChHHHHHHHHHHHHh
Q psy14697         66 YFDKKRRSDPLFKEKLKERRRKNRELLQQKNKRGIP--DLKDHSAVQQYFLQEIQL  119 (175)
Q Consensus        66 YFD~KRR~dP~FRKkLrerRrk~~~~~~~a~~~~~p--~~~d~e~~E~fFl~eV~l  119 (175)
                      |-+|--.+.|-++|-|+++-|.+-.     -++++|  ...|.++++++|-+|.+.
T Consensus        55 ~~Yh~g~sE~~lgkaL~~~~Rekv~-----LaTKlp~~~~~~~edm~r~fneqLek  105 (391)
T COG1453          55 WPYHGGESEEFLGKALKDGYREKVK-----LATKLPSWPVKDREDMERIFNEQLEK  105 (391)
T ss_pred             ccccCCCchHHHHHHhhhcccceEE-----EEeecCCccccCHHHHHHHHHHHHHH
Confidence            4456677888888889887643322     236777  789999999999999753


No 155
>PF00646 F-box:  F-box domain;  InterPro: IPR001810 The F-box domain was first described as a sequence motif found in cyclin-F that interacts with the protein SKP1 [, ]. This relatively conserved structural motif is present in numerous proteins and serves as a link between a target protein and a ubiquitin-conjugating enzyme. The SCF complex (e.g., Skp1-Cullin-F-box) plays a similar role as an E3 ligase in the ubiquitin protein degradation pathway [, ]. Different F-box proteins as a part of SCF complex recruit particular substrates for ubiquitination through specific protein-protein interaction domains.  Many mammalian F-box domains contain leucine-rich or WD-40 repeats (IPR001680 from INTERPRO). However, several F-box proteins either have other previously described domains such as Sec7 domain found in FBS protein or do not contain defined protein-protein interaction domains or motifs.; GO: 0005515 protein binding; PDB: 2E32_A 2E31_A 3V7D_B 1NEX_B 3MKS_D 3L2O_B.
Probab=23.58  E-value=62  Score=19.78  Aligned_cols=16  Identities=25%  Similarity=0.671  Sum_probs=11.6

Q ss_pred             cCCHHHHHHHHHhcCC
Q psy14697        156 TLPPNVFNALIEKLPP  171 (175)
Q Consensus       156 TLPp~Vf~~Li~kLp~  171 (175)
                      -||++|...++..|+.
T Consensus         5 ~LP~~il~~Il~~l~~   20 (48)
T PF00646_consen    5 DLPDEILQEILSYLDP   20 (48)
T ss_dssp             HS-HHHHHHHHHTS-H
T ss_pred             HCCHHHHHHHHHHCcH
Confidence            4899999999888874


No 156
>PF04341 DUF485:  Protein of unknown function, DUF485;  InterPro: IPR007436 This family includes several putative integral membrane proteins.
Probab=23.42  E-value=65  Score=23.59  Aligned_cols=18  Identities=33%  Similarity=0.518  Sum_probs=13.1

Q ss_pred             CCChhhHHHhHHHHHHhH
Q psy14697         72 RSDPLFKEKLKERRRKNR   89 (175)
Q Consensus        72 R~dP~FRKkLrerRrk~~   89 (175)
                      .+||+|++=.|+||+-.-
T Consensus         2 ~~~p~f~~L~r~r~r~~~   19 (91)
T PF04341_consen    2 LRSPEFQELVRRRRRLAW   19 (91)
T ss_pred             CCCHHHHHHHHHHHHHHH
Confidence            479999987777666543


No 157
>PF06864 PAP_PilO:  Pilin accessory protein (PilO);  InterPro: IPR009663 This family consists of several enterobacterial PilO proteins. The function of PilO is unknown although it has been suggested that it is a cytoplasmic protein in the absence of other Pil proteins, but PilO protein is translocated to the outer membrane in the presence of other Pil proteins. Alternatively, PilO protein may form a complex with other Pil protein(s). PilO has been predicted to function as a component of the pilin transport apparatus and thin-pilus basal body []. This family does not seem to be related to IPR007445 from INTERPRO.
Probab=23.31  E-value=1.5e+02  Score=27.14  Aligned_cols=9  Identities=11%  Similarity=0.235  Sum_probs=5.2

Q ss_pred             HHHHHHHHH
Q psy14697        133 VEHLANALT  141 (175)
Q Consensus       133 a~hf~nAl~  141 (175)
                      +..|..+++
T Consensus       274 v~dF~~~~~  282 (414)
T PF06864_consen  274 VADFIRRVR  282 (414)
T ss_pred             HHHHHHHHH
Confidence            456666554


No 158
>COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown]
Probab=23.29  E-value=1.4e+02  Score=26.49  Aligned_cols=42  Identities=26%  Similarity=0.399  Sum_probs=34.3

Q ss_pred             HHHHHhHhHHhhcCChhhHHHHHHHHHH--hcCChHHHHHHHhh
Q psy14697        114 LQEIQLGETLLAAGDLDNGVEHLANALT--VCGQPNQLLGVLQQ  155 (175)
Q Consensus       114 l~eV~lGE~L~a~G~~~eaa~hf~nAl~--VcpqP~~LL~IyQq  155 (175)
                      ...+.+|+.|..-|++-||+.||..|+.  .-..|+-||+.=|-
T Consensus        90 qnr~rLa~al~elGr~~EA~~hy~qalsG~fA~d~a~lLglA~A  133 (251)
T COG4700          90 QNRYRLANALAELGRYHEAVPHYQQALSGIFAHDAAMLLGLAQA  133 (251)
T ss_pred             HHHHHHHHHHHHhhhhhhhHHHHHHHhccccCCCHHHHHHHHHH
Confidence            3568999999999999999999999997  45677777766553


No 159
>PF08631 SPO22:  Meiosis protein SPO22/ZIP4 like;  InterPro: IPR013940  SPO22 is a meiosis-specific protein with similarity to phospholipase A2, involved in completion of nuclear divisions during meiosis; induced early in meiosis []. It is also involved in sporulation [].
Probab=23.27  E-value=1.5e+02  Score=25.25  Aligned_cols=46  Identities=17%  Similarity=0.292  Sum_probs=38.1

Q ss_pred             CCChHHHHHHHHHHHHhHhHHhhcC-ChhhHHHHHHHHHHhcCChHH
Q psy14697        103 LKDHSAVQQYFLQEIQLGETLLAAG-DLDNGVEHLANALTVCGQPNQ  148 (175)
Q Consensus       103 ~~d~e~~E~fFl~eV~lGE~L~a~G-~~~eaa~hf~nAl~VcpqP~~  148 (175)
                      ..||+..+.--.---..|..+..++ ++++|+.=+-+|+.+|.+|.+
T Consensus        25 ~~~~~~~~~La~~~yn~G~~l~~~~~~~~~a~~wL~~a~~~l~~~~~   71 (278)
T PF08631_consen   25 SLDPDMAEELARVCYNIGKSLLSKKDKYEEAVKWLQRAYDILEKPGK   71 (278)
T ss_pred             cCCcHHHHHHHHHHHHHHHHHHHcCCChHHHHHHHHHHHHHHHhhhh
Confidence            4566667766666778999999999 999999999999999988744


No 160
>PF10883 DUF2681:  Protein of unknown function (DUF2681);  InterPro: IPR020274 This entry contains membrane proteins with no known function.
Probab=23.10  E-value=75  Score=23.88  Aligned_cols=9  Identities=22%  Similarity=0.715  Sum_probs=4.4

Q ss_pred             HHHhhhhhe
Q psy14697         58 AVVVGYCLY   66 (175)
Q Consensus        58 a~flGYcIY   66 (175)
                      ++++||..|
T Consensus        15 ~~i~~y~~~   23 (87)
T PF10883_consen   15 ALILAYLWW   23 (87)
T ss_pred             HHHHHHHHH
Confidence            345555543


No 161
>PF01034 Syndecan:  Syndecan domain;  InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains:   A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains;  A transmembrane region;  A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins.    The proteins known to belong to this family are:    Syndecan 1.  Syndecan 2 or fibroglycan.  Syndecan 3 or neuroglycan or N-syndecan.  Syndecan 4 or amphiglycan or ryudocan.  Drosophila syndecan.   Caenorhabditis elegans probable syndecan (F57C7.3).    Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=23.04  E-value=29  Score=24.87  Aligned_cols=9  Identities=11%  Similarity=0.582  Sum_probs=0.0

Q ss_pred             HHHhhhhhe
Q psy14697         58 AVVVGYCLY   66 (175)
Q Consensus        58 a~flGYcIY   66 (175)
                      .+++.+.||
T Consensus        26 ilLIlf~iy   34 (64)
T PF01034_consen   26 ILLILFLIY   34 (64)
T ss_dssp             ---------
T ss_pred             HHHHHHHHH
Confidence            356667777


No 162
>PF13525 YfiO:  Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=22.71  E-value=94  Score=25.10  Aligned_cols=28  Identities=14%  Similarity=0.233  Sum_probs=15.3

Q ss_pred             HhHhHHhhcCChhhHHHHHHHHHHhcCC
Q psy14697        118 QLGETLLAAGDLDNGVEHLANALTVCGQ  145 (175)
Q Consensus       118 ~lGE~L~a~G~~~eaa~hf~nAl~Vcpq  145 (175)
                      +.|..++..|++.+|+..|...+.-||.
T Consensus        10 ~~a~~~~~~g~y~~Ai~~f~~l~~~~P~   37 (203)
T PF13525_consen   10 QKALEALQQGDYEEAIKLFEKLIDRYPN   37 (203)
T ss_dssp             HHHHHHHHCT-HHHHHHHHHHHHHH-TT
T ss_pred             HHHHHHHHCCCHHHHHHHHHHHHHHCCC
Confidence            4455555666666666666666665554


No 163
>KOG1130|consensus
Probab=22.66  E-value=1.8e+02  Score=28.51  Aligned_cols=60  Identities=23%  Similarity=0.279  Sum_probs=42.3

Q ss_pred             HHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHH--------------HHHHhhcCCHHHHHHHHHhc
Q psy14697        110 QQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQL--------------LGVLQQTLPPNVFNALIEKL  169 (175)
Q Consensus       110 E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~L--------------L~IyQqTLPp~Vf~~Li~kL  169 (175)
                      ++--++=...||.|+..|+...|+.-|-.||.|=..--.+              |.=|.|.|-=+.+++.++++
T Consensus        14 ~~SCleLalEGERLck~gdcraGv~ff~aA~qvGTeDl~tLSAIYsQLGNAyfyL~DY~kAl~yH~hDltlar~   87 (639)
T KOG1130|consen   14 DRSCLELALEGERLCKMGDCRAGVDFFKAALQVGTEDLSTLSAIYSQLGNAYFYLKDYEKALKYHTHDLTLARL   87 (639)
T ss_pred             hhHHHHHHHHHHHHHhccchhhhHHHHHHHHHhcchHHHHHHHHHHHhcchhhhHhhHHHHHhhhhhhHHHHHH
Confidence            3334555778999999999999999999999886554443              44455566555666655543


No 164
>PF14407 Frankia_peptide:  Ribosomally synthesized peptide prototyped by Frankia Franean1_4349.
Probab=22.61  E-value=45  Score=23.77  Aligned_cols=12  Identities=25%  Similarity=0.528  Sum_probs=10.0

Q ss_pred             CChhhHHHhHHH
Q psy14697         73 SDPLFKEKLKER   84 (175)
Q Consensus        73 ~dP~FRKkLrer   84 (175)
                      +||+||++|.-.
T Consensus        11 ~D~~FRqqllad   22 (61)
T PF14407_consen   11 TDEAFRQQLLAD   22 (61)
T ss_pred             cCHHHHHHHhcC
Confidence            699999998753


No 165
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=22.48  E-value=1.7e+02  Score=24.80  Aligned_cols=25  Identities=24%  Similarity=0.472  Sum_probs=16.4

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHH
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALT  141 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~  141 (175)
                      ..+|..+..+|++++|..++-+++.
T Consensus        73 ~~la~~~~~~g~~~~A~~~~~~~l~   97 (389)
T PRK11788         73 LALGNLFRRRGEVDRAIRIHQNLLS   97 (389)
T ss_pred             HHHHHHHHHcCcHHHHHHHHHHHhc
Confidence            3456666667777777776666665


No 166
>PF12777 MT:  Microtubule-binding stalk of dynein motor;  InterPro: IPR024743 The 380 kDa motor unit of dynein belongs to the AAA class of chaperone-like ATPases. The core of the 380 kDa motor unit contains a concatenated chain of six AAA modules (D1-6), of which four correspond to the ATP binding sites with P-loop signatures, and two are modules in which the P loop has been lost in evolution. This domain occurs between D4 and D5 and includes the two predicted alpha-helical coiled coil segments that form the stalk supporting the ATP-sensitive microtubule binding component [].; PDB: 3VKH_A 3VKG_A 3ERR_A 4AKI_A 4AI6_B 4AKH_A 4AKG_A 3QMZ_A 2RR7_A.
Probab=22.46  E-value=68  Score=28.61  Aligned_cols=23  Identities=26%  Similarity=0.453  Sum_probs=18.0

Q ss_pred             HHHHHhhhhheeccCCCCChhhHHHhHHH
Q psy14697         56 GAAVVVGYCLYFDKKRRSDPLFKEKLKER   84 (175)
Q Consensus        56 aga~flGYcIYFD~KRR~dP~FRKkLrer   84 (175)
                      .+++|++||=+||      ++||++|-++
T Consensus       316 laaa~isY~G~f~------~~~R~~l~~~  338 (344)
T PF12777_consen  316 LAAAFISYLGPFT------PEYRQELLKK  338 (344)
T ss_dssp             HHHHHHHCCCCTS------HHHHHHHHHH
T ss_pred             HHHHHHHHcCCCC------HHHHHHHHHH
Confidence            3467999999996      6888888653


No 167
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=22.43  E-value=2.1e+02  Score=28.44  Aligned_cols=36  Identities=3%  Similarity=-0.092  Sum_probs=26.4

Q ss_pred             HHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHH
Q psy14697        115 QEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLL  150 (175)
Q Consensus       115 ~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL  150 (175)
                      .+..+|+.|...|++++|.+.|-++++--|++.+.+
T Consensus       156 ~~~~~a~~l~~~g~~~~A~~~y~~~~~~~p~~~~~~  191 (694)
T PRK15179        156 EILLEAKSWDEIGQSEQADACFERLSRQHPEFENGY  191 (694)
T ss_pred             HHHHHHHHHHHhcchHHHHHHHHHHHhcCCCcHHHH
Confidence            346788888889999999999999998333333443


No 168
>PHA03065 Hypothetical protein; Provisional
Probab=22.41  E-value=87  Score=29.93  Aligned_cols=20  Identities=35%  Similarity=0.501  Sum_probs=15.4

Q ss_pred             CCChhhHHHhHHHHHHhHHH
Q psy14697         72 RSDPLFKEKLKERRRKNREL   91 (175)
Q Consensus        72 R~dP~FRKkLrerRrk~~~~   91 (175)
                      |.+=.-|..||+|||+..+.
T Consensus        75 RG~I~IK~~lReKRr~a~~~   94 (438)
T PHA03065         75 RGSIPIKQSLREKRRKASKN   94 (438)
T ss_pred             cCccchhhHHHHHHHHHHHH
Confidence            56778899999998876543


No 169
>PRK10153 DNA-binding transcriptional activator CadC; Provisional
Probab=22.31  E-value=4.6e+02  Score=25.04  Aligned_cols=77  Identities=9%  Similarity=-0.040  Sum_probs=0.0

Q ss_pred             HHHHHHHHhhhhheeccCCCCChhhHHHhHHHHHHhHHHHHHhhcCCCCCCCChHHHHHHHHHHHHhHhHHhhcCChhhH
Q psy14697         53 VAVGAAVVVGYCLYFDKKRRSDPLFKEKLKERRRKNRELLQQKNKRGIPDLKDHSAVQQYFLQEIQLGETLLAAGDLDNG  132 (175)
Q Consensus        53 agvaga~flGYcIYFD~KRR~dP~FRKkLrerRrk~~~~~~~a~~~~~p~~~d~e~~E~fFl~eV~lGE~L~a~G~~~ea  132 (175)
                      +-+-+...++|.+-+.+..         +.+....+.....+.......++.++..-+..       |-....+|++++|
T Consensus       376 a~a~A~la~~~~~~~~~~~---------~~~~~l~~a~~~~~~a~al~~~~~~~~~~~al-------a~~~~~~g~~~~A  439 (517)
T PRK10153        376 TYAQAEKALADIVRHSQQP---------LDEKQLAALSTELDNIVALPELNVLPRIYEIL-------AVQALVKGKTDEA  439 (517)
T ss_pred             HHHHHHHHHHHHHHHhcCC---------ccHHHHHHHHHHHHHhhhcccCcCChHHHHHH-------HHHHHhcCCHHHH


Q ss_pred             HHHHHHHHHhcCC
Q psy14697        133 VEHLANALTVCGQ  145 (175)
Q Consensus       133 a~hf~nAl~Vcpq  145 (175)
                      ..||-+|+..=|+
T Consensus       440 ~~~l~rAl~L~ps  452 (517)
T PRK10153        440 YQAINKAIDLEMS  452 (517)
T ss_pred             HHHHHHHHHcCCC


No 170
>PF14964 DUF4507:  Domain of unknown function (DUF4507)
Probab=22.25  E-value=84  Score=29.15  Aligned_cols=30  Identities=33%  Similarity=0.618  Sum_probs=26.2

Q ss_pred             hhhHHHHHHHHHH-------hcCChHHHHHHHhhcCCH
Q psy14697        129 LDNGVEHLANALT-------VCGQPNQLLGVLQQTLPP  159 (175)
Q Consensus       129 ~~eaa~hf~nAl~-------VcpqP~~LL~IyQqTLPp  159 (175)
                      ++.|+++|+.|+.       +|+++.+|+..++ +||+
T Consensus       325 ~~laldRlAQalQVamasgcl~~n~~~L~~~l~-~LP~  361 (362)
T PF14964_consen  325 IQLALDRLAQALQVAMASGCLYGNRDDLLALLS-QLPH  361 (362)
T ss_pred             HHHHHHHHHHHHHHHHHhCcccCCHHHHHHHHH-hCCC
Confidence            6789999999997       5899999999998 6664


No 171
>PRK14750 kdpF potassium-transporting ATPase subunit F; Provisional
Probab=22.13  E-value=1e+02  Score=19.06  Aligned_cols=15  Identities=27%  Similarity=0.529  Sum_probs=9.9

Q ss_pred             HHHHHHHHHhhhhhe
Q psy14697         52 AVAVGAAVVVGYCLY   66 (175)
Q Consensus        52 ~agvaga~flGYcIY   66 (175)
                      +.++.....+||-+|
T Consensus         7 ~g~llv~lLl~YLvY   21 (29)
T PRK14750          7 CGALLVLLLLGYLVY   21 (29)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            344444578888887


No 172
>KOG0624|consensus
Probab=22.08  E-value=1.5e+02  Score=28.51  Aligned_cols=68  Identities=18%  Similarity=0.222  Sum_probs=52.1

Q ss_pred             CCCCCCh-HHHHHHHHHHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHHHHhhcCCHHHHHHHHH
Q psy14697        100 IPDLKDH-SAVQQYFLQEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLGVLQQTLPPNVFNALIE  167 (175)
Q Consensus       100 ~p~~~d~-e~~E~fFl~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLPp~Vf~~Li~  167 (175)
                      ++|++.+ +.+-.|--.-+|.|-.|+.+|++|+|..-|-.-|.--|+-..-+..+.++.|-+-...|.+
T Consensus        92 l~Dl~rVlelKpDF~~ARiQRg~vllK~Gele~A~~DF~~vl~~~~s~~~~~eaqskl~~~~e~~~l~~  160 (504)
T KOG0624|consen   92 LQDLSRVLELKPDFMAARIQRGVVLLKQGELEQAEADFDQVLQHEPSNGLVLEAQSKLALIQEHWVLVQ  160 (504)
T ss_pred             hhhHHHHHhcCccHHHHHHHhchhhhhcccHHHHHHHHHHHHhcCCCcchhHHHHHHHHhHHHHHHHHH
Confidence            4444443 2234566677999999999999999999999999999988888888888877655444443


No 173
>TIGR03798 ocin_TIGR03798 bacteriocin propeptide, TIGR03798 family. This model describes a conserved, fairly long (about 65 residue) propeptide region for a family of putative microcins, that is, bacteriocins of small size. Members of the seed alignment tend to have the Gly-Gly motif as the last two residues of the matched region. This is a cleavage site for a combination processing/export ABC transporter with a peptidase domain.
Probab=21.95  E-value=40  Score=23.03  Aligned_cols=15  Identities=47%  Similarity=0.747  Sum_probs=12.7

Q ss_pred             cCCCCChhhHHHhHH
Q psy14697         69 KKRRSDPLFKEKLKE   83 (175)
Q Consensus        69 ~KRR~dP~FRKkLre   83 (175)
                      .+-++||+||++|+.
T Consensus         9 ~~~~~d~~L~~~l~~   23 (64)
T TIGR03798         9 EKVKTDPDLREKLKA   23 (64)
T ss_pred             HHHHcCHHHHHHHHH
Confidence            356799999999997


No 174
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=21.80  E-value=1.7e+02  Score=26.72  Aligned_cols=34  Identities=35%  Similarity=0.453  Sum_probs=29.6

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHH
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLL  150 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL  150 (175)
                      ..+|..+..+|++++|..+|.+++...|.+.+.+
T Consensus        60 ~~l~~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~   93 (899)
T TIGR02917        60 FLLGKIYLALGDYAAAEKELRKALSLGYPKNQVL   93 (899)
T ss_pred             HHHHHHHHHcCCHHHHHHHHHHHHHcCCChhhhH
Confidence            4589999999999999999999999988776544


No 175
>TIGR03644 marine_trans_1 probable ammonium transporter, marine subtype. Members of this protein family are well conserved subclass of putative ammonimum transporters, belonging to the much broader set of ammonium/methylammonium transporter described by TIGR00836. Species with this transporter tend to be marine bacteria. Partial phylogenetic profiling (PPP) picks a member of this protein family as the single best-scoring protein vs. a reference profile for the marine environment Genome Property for a large number of different query genomes. This finding by PPP suggests that this transporter family represents an important adaptation to the marine environment.
Probab=21.56  E-value=97  Score=28.83  Aligned_cols=22  Identities=18%  Similarity=0.143  Sum_probs=13.8

Q ss_pred             hhhhHHHHHHHHHHHHHhhhhh
Q psy14697         44 ISGNKVVTAVAVGAAVVVGYCL   65 (175)
Q Consensus        44 ~~~~~~~~~agvaga~flGYcI   65 (175)
                      ..+|.++.++|...+.++.|..
T Consensus       243 ~~NT~la~a~g~l~~~~~~~~~  264 (404)
T TIGR03644       243 FANTNAAAAGGAIAALLLTKLL  264 (404)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            5666666666666666666654


No 176
>smart00299 CLH Clathrin heavy chain repeat homology.
Probab=21.44  E-value=2.4e+02  Score=20.86  Aligned_cols=39  Identities=13%  Similarity=0.392  Sum_probs=30.3

Q ss_pred             cCChhhHHHHHHHHHHhcCChHHHHHHHhhcCCHHHHHHHHH
Q psy14697        126 AGDLDNGVEHLANALTVCGQPNQLLGVLQQTLPPNVFNALIE  167 (175)
Q Consensus       126 ~G~~~eaa~hf~nAl~VcpqP~~LL~IyQqTLPp~Vf~~Li~  167 (175)
                      .|++++|+.+.-+.+   .++...++..++...++++..++.
T Consensus        95 ~~~~~~Al~~~l~~~---~d~~~a~~~~~~~~~~~lw~~~~~  133 (140)
T smart00299       95 DGNFKDAIVTLIEHL---GNYEKAIEYFVKQNNPELWAEVLK  133 (140)
T ss_pred             hcCHHHHHHHHHHcc---cCHHHHHHHHHhCCCHHHHHHHHH
Confidence            466777777655444   679999999999999999888765


No 177
>PF12569 NARP1:  NMDA receptor-regulated protein 1 ;  InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=21.43  E-value=1.7e+02  Score=28.05  Aligned_cols=23  Identities=30%  Similarity=0.342  Sum_probs=15.4

Q ss_pred             hHhHHhhcCChhhHHHHHHHHHH
Q psy14697        119 LGETLLAAGDLDNGVEHLANALT  141 (175)
Q Consensus       119 lGE~L~a~G~~~eaa~hf~nAl~  141 (175)
                      .||.|++.-+..+.|.+|.+=|.
T Consensus       462 ~GekL~~t~dPLe~A~kfl~pL~  484 (517)
T PF12569_consen  462 LGEKLLKTEDPLEEAMKFLKPLL  484 (517)
T ss_pred             cHHHHhcCCcHHHHHHHHHHHHH
Confidence            58888887776555666665443


No 178
>cd02577 PSTD1 PSTD1: Pseudouridine synthase, a subgroup of the TruD family. This group consists of several hypothetical archeal pseudouridine synthases assigned to the TruD family of psuedouridine synthases.  Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi).  The TruD family is comprised of proteins related to Escherichia coli TruD.
Probab=21.23  E-value=5.4e+02  Score=23.14  Aligned_cols=15  Identities=27%  Similarity=0.381  Sum_probs=11.3

Q ss_pred             HHhhhhheeccCCCC
Q psy14697         59 VVVGYCLYFDKKRRS   73 (175)
Q Consensus        59 ~flGYcIYFD~KRR~   73 (175)
                      .--|+-=||++||=.
T Consensus       133 ~~~GfpNYfG~QRFG  147 (319)
T cd02577         133 EKGGFPNYFGSQRFG  147 (319)
T ss_pred             HhcCCCccccccccC
Confidence            355778899999854


No 179
>COG1930 CbiN ABC-type cobalt transport system, periplasmic component [Inorganic ion transport and metabolism]
Probab=21.00  E-value=34  Score=26.41  Aligned_cols=33  Identities=24%  Similarity=0.408  Sum_probs=22.4

Q ss_pred             CCCCccchhhhhhhhHHHHHHHHHHHHHhhhhheeccCCCC
Q psy14697         33 PSGRLMSPMELISGNKVVTAVAVGAAVVVGYCLYFDKKRRS   73 (175)
Q Consensus        33 ~~~~~~~~~~m~~~~~~~~~agvaga~flGYcIYFD~KRR~   73 (175)
                      |||.+=        +++...=++.|++++||.+-+++-||.
T Consensus        60 pSGEIE--------SLLFslQaaiGa~IIgY~lGyyrgk~~   92 (97)
T COG1930          60 PSGEIE--------SLLFSLQAAIGAGIIGYFLGYYRGKGQ   92 (97)
T ss_pred             CCccHH--------HHHHHHHHHhcceeeeeeeeeecchhh
Confidence            666654        344455566678899999988876543


No 180
>KOG0425|consensus
Probab=20.71  E-value=73  Score=26.85  Aligned_cols=20  Identities=20%  Similarity=0.363  Sum_probs=15.0

Q ss_pred             CCCCChhhHHHhHHHHHHhH
Q psy14697         70 KRRSDPLFKEKLKERRRKNR   89 (175)
Q Consensus        70 KRR~dP~FRKkLrerRrk~~   89 (175)
                      -|-|+-|||||+++.-++.+
T Consensus       145 ~Ren~~EykkkV~r~vr~s~  164 (171)
T KOG0425|consen  145 WRENPEEYKKKVRRCVRRSQ  164 (171)
T ss_pred             HhhCHHHHHHHHHHHHHHHH
Confidence            47889999999997544433


No 181
>KOG3060|consensus
Probab=20.70  E-value=1.1e+02  Score=27.71  Aligned_cols=37  Identities=24%  Similarity=0.359  Sum_probs=31.3

Q ss_pred             HhHhHHhhcCC---hhhHHHHHHHHHHhcC-ChHHHHHHHh
Q psy14697        118 QLGETLLAAGD---LDNGVEHLANALTVCG-QPNQLLGVLQ  154 (175)
Q Consensus       118 ~lGE~L~a~G~---~~eaa~hf~nAl~Vcp-qP~~LL~IyQ  154 (175)
                      .++|.+-.+|.   ++-+-.|+.+||+.+| +-..|.+||-
T Consensus       193 rlae~~Yt~gg~eN~~~arkyy~~alkl~~~~~ral~GI~l  233 (289)
T KOG3060|consen  193 RLAEVLYTQGGAENLELARKYYERALKLNPKNLRALFGIYL  233 (289)
T ss_pred             HHHHHHHHHhhHHHHHHHHHHHHHHHHhChHhHHHHHHHHH
Confidence            67898888874   6678889999999999 8888888874


No 182
>PF11817 Foie-gras_1:  Foie gras liver health family 1;  InterPro: IPR021773  Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats. 
Probab=20.54  E-value=96  Score=26.12  Aligned_cols=30  Identities=20%  Similarity=0.381  Sum_probs=26.4

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCCh
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQP  146 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP  146 (175)
                      ++++|++...|++++|+..|-.+...|.+.
T Consensus       182 ~~~A~ey~~~g~~~~A~~~l~~~~~~yr~e  211 (247)
T PF11817_consen  182 LEMAEEYFRLGDYDKALKLLEPAASSYRRE  211 (247)
T ss_pred             HHHHHHHHHCCCHHHHHHHHHHHHHHHHhC
Confidence            689999999999999999999998877653


No 183
>PF02158 Neuregulin:  Neuregulin family;  InterPro: IPR002154 Neuregulins are a sub-family of EGF-like molecules that have been shown to play multiple essential roles in vertebrate embryogenesis including: cardiac development, Schwann cell and oligodendrocyte differentiation, some aspects of neuronal development, as well as the formation of neuromuscular synapses [, ]. Included in the family are heregulin; neu differentiation factor; acetylcholine receptor synthesis stimulator; glial growth factor; and sensory and motor-neuron derived factor []. Multiple family members are generated by alternate splicing or by use of several cell type-specific transcription initiation sites. In general, they bind to and activate the erbB family of receptor tyrosine kinases (erbB2 (HER2), erbB3 (HER3), and erbB4 (HER4)), functioning both as heterodimers and homodimers.  The transmembrane forms of neuregulin 1 (NRG1) are present within synaptic vesicles, including those containing glutamate []. After exocytosis, NRG1 is in the presynaptic membrane, where the ectodomain of NRG1 may be cleaved off. The ectodomain then migrates across the synaptic cleft and binds to and activates a member of the EGF-receptor family on the postsynaptic membrane. This has been shown to increase the expression of certain glutamate-receptor subunits. NRG1 appears to signal for glutamate-receptor subunit expression, localisation, and /or phosphorylation facilitating subsequent glutamate transmission.   The NRG1 gene has been identified as a potential gene determining susceptibility to schizophrenia by a combination of genetic linkage and association approaches []. ; GO: 0005102 receptor binding, 0009790 embryo development; PDB: 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=20.43  E-value=34  Score=32.30  Aligned_cols=33  Identities=24%  Similarity=0.365  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHhhhh-heeccCCCCChhhHHHhHHHHHHh
Q psy14697         51 TAVAVGAAVVVGYC-LYFDKKRRSDPLFKEKLKERRRKN   88 (175)
Q Consensus        51 ~~agvaga~flGYc-IYFD~KRR~dP~FRKkLrerRrk~   88 (175)
                      |+.|+..++++.-+ .||--|++     ||||+++.|+.
T Consensus        15 IcvaLlVVGi~Cvv~aYCKTKKQ-----Rkklh~hLkqs   48 (404)
T PF02158_consen   15 ICVALLVVGIVCVVDAYCKTKKQ-----RKKLHEHLKQS   48 (404)
T ss_dssp             ---------------------------------------
T ss_pred             hhHHHHHHHHHHHHHHHHHhHHH-----HHHHHHHHhhh
Confidence            44444445556666 78888877     78888765543


No 184
>PRK12370 invasion protein regulator; Provisional
Probab=20.42  E-value=1.5e+02  Score=27.72  Aligned_cols=35  Identities=11%  Similarity=-0.095  Sum_probs=29.7

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhcCChHHHHH
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVCGQPNQLLG  151 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~LL~  151 (175)
                      ..+|..+...|++++|..+|-+|+...|+..+...
T Consensus       342 ~~lg~~~~~~g~~~~A~~~~~~Al~l~P~~~~a~~  376 (553)
T PRK12370        342 GLLGLINTIHSEYIVGSLLFKQANLLSPISADIKY  376 (553)
T ss_pred             HHHHHHHHHccCHHHHHHHHHHHHHhCCCCHHHHH
Confidence            46788888999999999999999999998766543


No 185
>PF08690 GET2:  GET complex subunit GET2;  InterPro: IPR014802 This family corresponds to the GET complex subunit GET2. The GET complex is involved in the retrieval of ER resident proteins from the Golgi []. ; PDB: 3SJD_D 3ZS9_C.
Probab=20.25  E-value=69  Score=28.74  Aligned_cols=13  Identities=31%  Similarity=0.618  Sum_probs=8.8

Q ss_pred             HHHhHHHHHHhHH
Q psy14697         78 KEKLKERRRKNRE   90 (175)
Q Consensus        78 RKkLrerRrk~~~   90 (175)
                      |||||++||+++-
T Consensus         7 krRLrRERReAKi   19 (302)
T PF08690_consen    7 KRRLRRERREAKI   19 (302)
T ss_dssp             HHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHH
Confidence            6778777766653


No 186
>KOG4162|consensus
Probab=20.12  E-value=1.4e+02  Score=30.74  Aligned_cols=27  Identities=30%  Similarity=0.373  Sum_probs=23.9

Q ss_pred             HHhHhHHhhcCChhhHHHHHHHHHHhc
Q psy14697        117 IQLGETLLAAGDLDNGVEHLANALTVC  143 (175)
Q Consensus       117 V~lGE~L~a~G~~~eaa~hf~nAl~Vc  143 (175)
                      -++||.+-++|+...|+++|.-|+..=
T Consensus       758 ~~LG~v~k~~Gd~~~Aaecf~aa~qLe  784 (799)
T KOG4162|consen  758 YYLGEVFKKLGDSKQAAECFQAALQLE  784 (799)
T ss_pred             HHHHHHHHHccchHHHHHHHHHHHhhc
Confidence            368999999999999999999998753


No 187
>PF13512 TPR_18:  Tetratricopeptide repeat
Probab=20.04  E-value=1.4e+02  Score=24.07  Aligned_cols=34  Identities=9%  Similarity=0.223  Sum_probs=29.6

Q ss_pred             HHHHhHhHHhhcCChhhHHHHHHHHHHhcCChHH
Q psy14697        115 QEIQLGETLLAAGDLDNGVEHLANALTVCGQPNQ  148 (175)
Q Consensus       115 ~eV~lGE~L~a~G~~~eaa~hf~nAl~VcpqP~~  148 (175)
                      -|+.+|.....++++++|+.-+-+-|+..|+-.+
T Consensus        49 AqL~l~yayy~~~~y~~A~a~~~rFirLhP~hp~   82 (142)
T PF13512_consen   49 AQLDLAYAYYKQGDYEEAIAAYDRFIRLHPTHPN   82 (142)
T ss_pred             HHHHHHHHHHHccCHHHHHHHHHHHHHhCCCCCC
Confidence            3688999999999999999999999999987543


Done!