Query         psy2692
Match_columns 133
No_of_seqs    105 out of 168
Neff          5.5 
Searched_HMMs 46136
Date          Fri Aug 16 22:46:23 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy2692.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2692hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF06957 COPI_C:  Coatomer (COP 100.0 8.8E-49 1.9E-53  335.9  11.6  126    7-132   124-261 (422)
  2 KOG0292|consensus              100.0 2.1E-36 4.5E-41  274.5  11.0  124    8-132   913-1048(1202)
  3 PF07719 TPR_2:  Tetratricopept  88.1    0.94   2E-05   24.2   3.3   28   79-106     5-32  (34)
  4 PF13432 TPR_16:  Tetratricopep  86.7     0.6 1.3E-05   28.8   2.2   29   79-107     1-29  (65)
  5 PF04212 MIT:  MIT (microtubule  86.2     4.3 9.4E-05   26.0   6.2   53   76-130     6-59  (69)
  6 PF00515 TPR_1:  Tetratricopept  85.5     1.7 3.7E-05   23.5   3.5   28   79-106     5-32  (34)
  7 PF13174 TPR_6:  Tetratricopept  84.5     1.7 3.6E-05   22.9   3.1   29   78-106     3-31  (33)
  8 PF13371 TPR_9:  Tetratricopept  83.8     2.5 5.5E-05   26.3   4.2   41   78-123    32-72  (73)
  9 KOG0624|consensus               81.5       6 0.00013   34.9   6.8   92   10-107    47-138 (504)
 10 cd02684 MIT_2 MIT: domain cont  81.3     6.3 0.00014   26.3   5.5   59   71-130     2-60  (75)
 11 TIGR02552 LcrH_SycD type III s  79.9     7.4 0.00016   26.7   5.8   29   78-106    88-116 (135)
 12 cd02680 MIT_calpain7_2 MIT: do  79.5      14 0.00031   24.9   6.8   37   71-107     2-38  (75)
 13 cd02678 MIT_VPS4 MIT: domain c  79.3      11 0.00023   24.9   6.1   57   73-130     4-60  (75)
 14 cd02656 MIT MIT: domain contai  79.3     9.4  0.0002   24.8   5.8   55   75-130     6-60  (75)
 15 PF13414 TPR_11:  TPR repeat; P  76.1     3.9 8.5E-05   25.1   3.1   29   78-106     6-34  (69)
 16 smart00745 MIT Microtubule Int  76.1      14 0.00031   23.9   6.0   56   74-130     7-62  (77)
 17 smart00028 TPR Tetratricopepti  73.9     4.9 0.00011   19.2   2.6   26   79-104     5-30  (34)
 18 cd02677 MIT_SNX15 MIT: domain   72.5      21 0.00046   23.8   6.2   57   73-130     4-60  (75)
 19 PF13812 PPR_3:  Pentatricopept  72.1     4.3 9.4E-05   21.5   2.3   20    9-28      9-28  (34)
 20 PRK15331 chaperone protein Sic  71.6     7.6 0.00016   30.1   4.3   34   70-103    32-65  (165)
 21 PRK10370 formate-dependent nit  70.9     9.5 0.00021   29.4   4.8   92    9-107    81-176 (198)
 22 cd02681 MIT_calpain7_1 MIT: do  70.9      14 0.00031   24.9   5.0   53   77-130     8-61  (76)
 23 PF13428 TPR_14:  Tetratricopep  70.6     8.6 0.00019   22.2   3.5   28   79-106     5-32  (44)
 24 PF14559 TPR_19:  Tetratricopep  70.3       4 8.7E-05   25.0   2.1   23   85-107     1-23  (68)
 25 KOG0543|consensus               68.3      10 0.00023   33.2   4.9   47   81-127   214-260 (397)
 26 smart00668 CTLH C-terminal to   68.0     8.7 0.00019   23.1   3.3   38    9-46      9-53  (58)
 27 cd03598 CLECT_EMBP_like C-type  67.9     2.9 6.4E-05   28.9   1.2   42   82-124     3-45  (117)
 28 cd02682 MIT_AAA_Arch MIT: doma  67.6      18 0.00039   24.5   5.0   52   78-130     9-60  (75)
 29 cd02683 MIT_1 MIT: domain cont  67.4      17 0.00036   24.4   4.8   54   76-130     7-60  (77)
 30 PF04097 Nic96:  Nup93/Nic96;    66.4      15 0.00032   33.5   5.6   82   11-106   437-535 (613)
 31 PF14276 DUF4363:  Domain of un  64.9      46   0.001   23.4   7.3   61   72-132    25-90  (121)
 32 PRK09720 cybC cytochrome b562;  63.9      14  0.0003   26.6   4.0   28   71-98     58-85  (100)
 33 PF09976 TPR_21:  Tetratricopep  62.8      25 0.00053   25.1   5.3   90    9-102    56-145 (145)
 34 PF07721 TPR_4:  Tetratricopept  61.3     9.8 0.00021   19.9   2.3   19    8-26      8-26  (26)
 35 PF13525 YfiO:  Outer membrane   61.2      14  0.0003   28.3   4.0   31   78-108     8-38  (203)
 36 PLN03098 LPA1 LOW PSII ACCUMUL  61.0      18 0.00038   32.4   5.0   37   71-107    71-107 (453)
 37 cd00189 TPR Tetratricopeptide   60.9      12 0.00025   21.9   2.8   28   79-106     4-31  (100)
 38 PLN03088 SGT1,  suppressor of   60.1      33 0.00071   28.8   6.3  109    8-130     9-117 (356)
 39 KOG4648|consensus               57.3      15 0.00032   32.6   3.8   51   81-131   103-166 (536)
 40 PF13176 TPR_7:  Tetratricopept  56.9      19 0.00041   20.0   3.1   22   81-102     5-26  (36)
 41 PRK15363 pathogenicity island   56.5      26 0.00057   26.9   4.7   40   69-108    28-68  (157)
 42 PF13181 TPR_8:  Tetratricopept  56.1      25 0.00055   18.5   3.5   27   80-106     6-32  (34)
 43 TIGR02795 tol_pal_ybgF tol-pal  55.5      24 0.00051   23.1   3.9   21    9-29     10-30  (119)
 44 PF13374 TPR_10:  Tetratricopep  55.4      22 0.00048   19.2   3.2   25   80-104     7-31  (42)
 45 PF03986 Autophagy_N:  Autophag  55.3     3.9 8.5E-05   31.0   0.0   13    5-17     27-39  (145)
 46 PF13432 TPR_16:  Tetratricopep  55.0      32 0.00068   20.8   4.2   58    9-106     5-62  (65)
 47 TIGR03504 FimV_Cterm FimV C-te  54.3      14  0.0003   22.4   2.4   25    8-32      6-30  (44)
 48 COG1729 Uncharacterized protei  53.7      15 0.00032   30.6   3.1   31   78-108   144-174 (262)
 49 PRK15058 cytochrome b562; Prov  53.4      24 0.00051   26.4   3.9   88   12-101    20-116 (128)
 50 PF07361 Cytochrom_B562:  Cytoc  53.3      18 0.00039   25.6   3.1   31   71-101    61-91  (103)
 51 TIGR00756 PPR pentatricopeptid  53.3      16 0.00036   18.7   2.4   19   10-28      9-27  (35)
 52 KOG0553|consensus               53.1      28 0.00061   29.6   4.7   89    9-108    89-182 (304)
 53 PF05538 Campylo_MOMP:  Campylo  52.8     4.6  0.0001   35.7   0.0   37    1-48      1-37  (431)
 54 PF13424 TPR_12:  Tetratricopep  52.4      55  0.0012   20.4   5.2   26   79-104     9-34  (78)
 55 PF10579 Rapsyn_N:  Rapsyn N-te  51.8      28 0.00062   24.0   3.8   32   75-106     6-37  (80)
 56 KOG1840|consensus               51.8      34 0.00073   30.9   5.3  110    8-117   206-325 (508)
 57 PF09577 Spore_YpjB:  Sporulati  50.9      81  0.0018   25.7   7.0   40   72-111     2-41  (232)
 58 PF12895 Apc3:  Anaphase-promot  50.1      13 0.00029   23.9   1.9   78   14-100     2-83  (84)
 59 cd03600 CLECT_thrombomodulin_l  50.0      15 0.00033   26.2   2.4   42   82-125     6-48  (141)
 60 PF13041 PPR_2:  PPR repeat fam  49.4      16 0.00035   21.4   2.1   20   85-104    13-32  (50)
 61 TIGR03302 OM_YfiO outer membra  48.1      28  0.0006   26.3   3.7   32   75-106    33-64  (235)
 62 TIGR02795 tol_pal_ybgF tol-pal  48.1      37  0.0008   22.1   3.9   25   80-104     7-31  (119)
 63 PF09976 TPR_21:  Tetratricopep  47.0      30 0.00066   24.7   3.6   30   76-105    49-78  (145)
 64 PF12854 PPR_1:  PPR repeat      47.0      23 0.00049   19.7   2.3   17   10-26     16-32  (34)
 65 PRK15359 type III secretion sy  46.0      95  0.0021   22.3   6.1   30   78-107    95-124 (144)
 66 PF00059 Lectin_C:  Lectin C-ty  44.7      43 0.00093   21.3   3.8   33   90-124     4-36  (105)
 67 KOG2168|consensus               44.2      83  0.0018   30.3   6.8   93    9-109   643-740 (835)
 68 PRK10866 outer membrane biogen  44.1      37  0.0008   27.1   4.0   32   77-108    34-65  (243)
 69 PF10938 YfdX:  YfdX protein;    43.5      45 0.00098   25.1   4.2   40   72-111   114-153 (155)
 70 PF07720 TPR_3:  Tetratricopept  43.2      24 0.00052   20.3   2.1   20   80-99      6-25  (36)
 71 KOG3807|consensus               43.0      24 0.00052   31.3   2.9   37   74-112   276-312 (556)
 72 TIGR02552 LcrH_SycD type III s  42.7      59  0.0013   22.1   4.4   32   75-106    17-48  (135)
 73 PLN03088 SGT1,  suppressor of   41.0      38 0.00083   28.4   3.8   29   78-106     5-33  (356)
 74 PF01535 PPR:  PPR repeat;  Int  40.0      30 0.00066   17.5   2.1   19   85-103    10-28  (31)
 75 PRK15359 type III secretion sy  39.7      35 0.00076   24.6   3.0   28   80-107    29-56  (144)
 76 PF14561 TPR_20:  Tetratricopep  39.6 1.2E+02  0.0026   20.6   5.5   34   73-106    20-53  (90)
 77 cd05492 Bromo_ZMYND11 Bromodom  39.3 1.3E+02  0.0029   21.4   5.9   56   69-128    46-101 (109)
 78 PF13512 TPR_18:  Tetratricopep  38.7      50  0.0011   24.9   3.7   34   78-111    13-46  (142)
 79 TIGR02917 PEP_TPR_lipo putativ  38.1 1.5E+02  0.0032   25.9   7.0   20   10-29     65-84  (899)
 80 PF05347 Complex1_LYR:  Complex  37.6      50  0.0011   20.0   3.1   16  115-130    42-57  (59)
 81 KOG0545|consensus               36.5      32 0.00069   29.2   2.5   33   78-110   181-213 (329)
 82 PF13512 TPR_18:  Tetratricopep  35.0      67  0.0015   24.2   3.9   32   76-107    48-79  (142)
 83 KOG4234|consensus               34.8      44 0.00096   27.6   3.1   31   80-110   100-130 (271)
 84 KOG3677|consensus               34.8      49  0.0011   29.8   3.6   49   79-127   276-334 (525)
 85 cd03594 CLECT_REG-1_like C-typ  33.7      37  0.0008   23.2   2.2   43   82-124    12-55  (129)
 86 COG1484 DnaC DNA replication p  33.0 1.1E+02  0.0024   24.6   5.2   55   69-129   138-199 (254)
 87 PF02064 MAS20:  MAS20 protein   33.0      88  0.0019   22.9   4.2   31   76-106    64-94  (121)
 88 PF09613 HrpB1_HrpK:  Bacterial  32.6      88  0.0019   24.1   4.3   42   78-127    47-88  (160)
 89 PRK10803 tol-pal system protei  31.3      79  0.0017   25.7   4.1   33   76-108   143-176 (263)
 90 KOG2376|consensus               31.2      78  0.0017   29.6   4.3   50   78-127   178-227 (652)
 91 PF12335 SBF2:  Myotubularin pr  30.8   1E+02  0.0022   24.9   4.6   52   55-106     1-54  (225)
 92 KOG4281|consensus               30.8      73  0.0016   26.1   3.7   45   73-117     5-49  (236)
 93 KOG0553|consensus               30.8      69  0.0015   27.3   3.7   38   74-111    79-117 (304)
 94 PF13525 YfiO:  Outer membrane   30.5      85  0.0018   23.9   4.0   31   78-108    45-75  (203)
 95 KOG0739|consensus               30.3 1.3E+02  0.0028   26.4   5.3   58   72-130     7-65  (439)
 96 cd00037 CLECT C-type lectin (C  29.7      46   0.001   21.1   2.1   35   88-124     9-43  (116)
 97 COG4989 Predicted oxidoreducta  29.4      97  0.0021   26.2   4.3   47   14-61    135-186 (298)
 98 cd05508 Bromo_RACK7 Bromodomai  29.3 1.9E+02  0.0042   20.0   5.3   56   69-128    42-97  (99)
 99 PF07304 SRA1:  Steroid recepto  29.3 2.4E+02  0.0052   21.3   6.2   47   75-128    90-140 (157)
100 PF10255 Paf67:  RNA polymerase  28.6      83  0.0018   27.6   4.0   50   78-127   167-226 (404)
101 PRK10049 pgaA outer membrane p  28.6 3.3E+02  0.0072   25.2   8.0   99    8-108   279-392 (765)
102 cd06131 DNA_pol_III_epsilon_Ec  28.4      48   0.001   24.0   2.1   28    2-29     53-80  (167)
103 TIGR00990 3a0801s09 mitochondr  28.3      58  0.0012   29.0   3.0   28   79-106   131-158 (615)
104 PRK11788 tetratricopeptide rep  27.6 2.1E+02  0.0046   23.0   6.0   20   10-29    116-135 (389)
105 PF14769 CLAMP:  Flagellar C1a   27.5 1.6E+02  0.0034   20.3   4.6   21   87-107    34-54  (101)
106 PF09670 Cas_Cas02710:  CRISPR-  27.4 1.8E+02  0.0039   24.9   5.7   45   77-129   133-177 (379)
107 PF10516 SHNi-TPR:  SHNi-TPR;    27.3      71  0.0015   18.7   2.3   21   84-104    10-30  (38)
108 COG4063 MtrA Tetrahydromethano  27.1      77  0.0017   25.8   3.2   38   88-128   115-152 (238)
109 KOG0543|consensus               26.3 2.1E+02  0.0045   25.3   5.9   51   75-129   291-341 (397)
110 TIGR02878 spore_ypjB sporulati  26.2      79  0.0017   26.0   3.2   32   73-104     3-34  (233)
111 cd03602 CLECT_1 C-type lectin   26.2 1.3E+02  0.0028   20.0   3.9   36   87-124     8-43  (108)
112 PHA02841 hypothetical protein;  26.1      35 0.00077   23.9   1.0   33   21-55      6-39  (103)
113 PRK10866 outer membrane biogen  26.1 1.6E+02  0.0035   23.4   4.9   32   77-108    71-102 (243)
114 PRK11788 tetratricopeptide rep  26.1 3.5E+02  0.0075   21.7   7.1   21   10-30     44-64  (389)
115 CHL00033 ycf3 photosystem I as  25.7      98  0.0021   22.4   3.4   29   78-106    38-66  (168)
116 PRK09637 RNA polymerase sigma   25.6      84  0.0018   23.4   3.1   32   90-121     2-33  (181)
117 PRK10564 maltose regulon perip  24.7      65  0.0014   27.4   2.5   22    9-30    265-288 (303)
118 PRK09591 celC cellobiose phosp  24.3      80  0.0017   22.5   2.6   30    7-45     26-55  (104)
119 COG0329 DapA Dihydrodipicolina  24.2   3E+02  0.0066   22.7   6.4   81   14-111   167-250 (299)
120 PF04048 Sec8_exocyst:  Sec8 ex  24.1 1.4E+02  0.0031   21.8   4.1   55   72-130    45-102 (142)
121 cd00215 PTS_IIA_lac PTS_IIA, P  23.9      83  0.0018   22.1   2.6   29    8-45     22-50  (97)
122 PF06580 His_kinase:  Histidine  23.9 2.2E+02  0.0048   18.7   4.6   54   73-130    12-67  (82)
123 PRK10747 putative protoheme IX  23.4 2.4E+02  0.0053   23.7   5.8   89    8-107   270-360 (398)
124 PRK02603 photosystem I assembl  23.3 1.1E+02  0.0024   22.3   3.3   28   77-104    37-64  (172)
125 cd06130 DNA_pol_III_epsilon_li  22.9 1.1E+02  0.0023   21.6   3.1   28    2-29     49-76  (156)
126 cd03591 CLECT_collectin_like C  22.8 1.2E+02  0.0027   20.4   3.3   35   88-124    10-44  (114)
127 COG5379 BtaA S-adenosylmethion  22.7 1.5E+02  0.0032   25.9   4.3   43    3-50    186-228 (414)
128 TIGR00823 EIIA-LAC phosphotran  22.7      90   0.002   22.0   2.6   20    8-27     24-43  (99)
129 KOG0548|consensus               22.7   2E+02  0.0044   26.4   5.3   63   10-106   307-389 (539)
130 cd07957 Anticodon_Ia_Met Antic  22.5 2.5E+02  0.0054   18.8   6.0   57   74-130    44-107 (129)
131 smart00034 CLECT C-type lectin  22.5 1.4E+02  0.0031   19.3   3.5   41   83-125    13-54  (126)
132 KOG0547|consensus               22.3      78  0.0017   29.2   2.7   27   80-106   120-146 (606)
133 PF00140 Sigma70_r1_2:  Sigma-7  22.3 1.7E+02  0.0036   16.7   4.1   19   92-110     1-19  (37)
134 TIGR02521 type_IV_pilW type IV  22.2 1.4E+02  0.0031   21.0   3.6   29   78-106    34-62  (234)
135 cd03603 CLECT_VCBS A bacterial  22.1 1.4E+02   0.003   20.6   3.4   36   87-124     8-43  (118)
136 cd06266 RNaseH_typeII Ribonucl  22.0      56  0.0012   25.1   1.5   29    5-33    132-160 (193)
137 PF11817 Foie-gras_1:  Foie gra  21.9      85  0.0018   24.9   2.6   20    8-27    185-204 (247)
138 PF12793 SgrR_N:  Sugar transpo  21.8      89  0.0019   22.4   2.5   18    8-25     77-94  (115)
139 PF02255 PTS_IIA:  PTS system,   21.6      96  0.0021   21.6   2.5   20    8-27     21-40  (96)
140 COG1283 NptA Na+/phosphate sym  21.6 1.5E+02  0.0033   27.1   4.4   40   90-129   370-412 (533)
141 KOG0482|consensus               21.4 3.6E+02  0.0078   25.4   6.7   81   31-128   550-643 (721)
142 TIGR03302 OM_YfiO outer membra  21.2 2.3E+02   0.005   21.2   4.8   29   78-106    73-101 (235)
143 PRK11447 cellulose synthase su  21.1 2.7E+02  0.0058   27.1   6.2   25   82-106   276-300 (1157)
144 PF03705 CheR_N:  CheR methyltr  21.0 1.1E+02  0.0024   18.2   2.5   41   16-57      4-44  (57)
145 cd03590 CLECT_DC-SIGN_like C-t  21.0 1.4E+02  0.0029   20.1   3.2   40   83-124    13-53  (126)
146 cd06144 REX4_like DEDDh 3'-5'   20.9 1.2E+02  0.0025   22.1   3.0   29    1-29     48-76  (152)
147 cd03592 CLECT_selectins_like C  20.9 1.4E+02  0.0031   20.0   3.3   35   88-124     9-43  (115)
148 PF10607 CLTH:  CTLH/CRA C-term  20.8 3.1E+02  0.0067   19.3   7.5   19    9-27      9-27  (145)
149 PLN00416 carbonate dehydratase  20.8 1.6E+02  0.0034   24.3   4.0   73   13-101     2-74  (258)
150 KOG2076|consensus               20.6 1.4E+02  0.0031   29.0   4.1   40   72-111   135-175 (895)
151 KOG0980|consensus               20.5      91   0.002   30.4   2.8   40   20-59    208-262 (980)
152 PF05053 Menin:  Menin;  InterP  20.5 2.3E+02  0.0049   26.5   5.2   49   80-130   324-372 (618)
153 KOG1173|consensus               20.4 1.6E+02  0.0036   27.3   4.3   27   82-108   496-522 (611)
154 cd05513 Bromo_brd7_like Bromod  20.2 3.1E+02  0.0066   19.0   7.1   88   35-126     1-94  (98)
155 PF03564 DUF1759:  Protein of u  20.1      71  0.0015   22.7   1.7   80   13-108    52-132 (145)
156 PF04184 ST7:  ST7 protein;  In  20.0 4.8E+02    0.01   24.0   7.1   27   82-108   266-292 (539)

No 1  
>PF06957 COPI_C:  Coatomer (COPI) alpha subunit C-terminus;  InterPro: IPR010714 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits.  This entry represents the C terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit [, ]. This domain is found along with the IPR006692 from INTERPRO domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0005515 protein binding, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat; PDB: 3MKR_B 3MV2_E 3MKQ_B 3MV3_A.
Probab=100.00  E-value=8.8e-49  Score=335.88  Aligned_cols=126  Identities=43%  Similarity=0.713  Sum_probs=110.8

Q ss_pred             ChHHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCC------------CCChHH
Q psy2692           7 TSESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKV------------DRSSID   74 (133)
Q Consensus         7 ~~~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~------------~~~l~~   74 (133)
                      .++.|||||+||+||||||||+|||||+|||++|+++|.+||+|+|++|++|||..|++|||            ++++++
T Consensus       124 laadhvAAGsFetAm~LLnrQiGivnF~PLk~~Fl~~y~~s~~~l~~~~~~p~l~~~~~r~~~~~~~~~~lP~i~~~l~~  203 (422)
T PF06957_consen  124 LAADHVAAGSFETAMQLLNRQIGIVNFEPLKPLFLEVYQASRTYLPALPSLPPLPSYIRRNWDESNPKNGLPAIPLSLSS  203 (422)
T ss_dssp             SHHHHHHCT-HHHHHHHHHHHC-B---GGGHHHHHHHHCCTEEEE-SSTTTS-EEEEEBCTTTTSSSCCG-BB----HHH
T ss_pred             cHHHHHHhCCHHHHHHHHHHHhCccccHHHHHHHHHHHHhhceecccCCCCCCccccccCCccccccccCCCcCcCCHHH
Confidence            47889999999999999999999999999999999999999999999999999999999997            568999


Q ss_pred             HHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhhhc
Q psy2692          75 LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYIVG  132 (133)
Q Consensus        75 L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYila  132 (133)
                      |.++||.|||+||+|||+||++.||+|||+|||++|+|++|++|++|+|+||||||+|
T Consensus       204 L~~~Lk~gyk~~t~gKF~eA~~~Fr~iL~~i~l~vv~~~~E~~e~~eli~icrEYilg  261 (422)
T PF06957_consen  204 LEERLKEGYKLFTAGKFEEAIEIFRSILHSIPLLVVESREEEDEAKELIEICREYILG  261 (422)
T ss_dssp             HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHC--BSSCHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHhcCCHHHHHHHHHHHHHHhheeeecCHHHHHHHHHHHHHHHHHHHH
Confidence            9999999999999999999999999999999999999999999999999999999997


No 2  
>KOG0292|consensus
Probab=100.00  E-value=2.1e-36  Score=274.53  Aligned_cols=124  Identities=48%  Similarity=0.743  Sum_probs=118.5

Q ss_pred             hHHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCC------------CCChHHH
Q psy2692           8 SESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKV------------DRSSIDL   75 (133)
Q Consensus         8 ~~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~------------~~~l~~L   75 (133)
                      ++.+||||+||+||++|++|+|++||.|+|.+|+++|.+||+|++++|++| +.+++-|||            +.+++++
T Consensus       913 ~adhvaAGsf~tA~~lL~dqvgvv~f~p~Kt~fl~iy~~sR~~l~~~~~~~-~~~~~~R~~se~~~~~~~P~v~~~l~~l  991 (1202)
T KOG0292|consen  913 AADHVAAGSFETAMRLLHDQVGVVNFGPLKTHFLKIYAGSRTYLRATPCLP-VSLYPVRNWSETSSKQGLPAVGFKLSQL  991 (1202)
T ss_pred             hhhhhhcCchHHHHHHHHhhhcceecccHHhhhhhhccccceeccCCCCcc-cccccccccccchhhccCCcccccHHHH
Confidence            678999999999999999999999999999999999999999999999999 666777775            6779999


Q ss_pred             HHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhhhc
Q psy2692          76 IAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYIVG  132 (133)
Q Consensus        76 ~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYila  132 (133)
                      ..+|++|||+||.|||.||+++||+||++||+++|+|++|++|++|+|.||||||+|
T Consensus       992 ~~kl~~gy~ltt~gKf~eAie~Frsii~~i~l~vvd~~~e~aea~~li~i~~eYi~g 1048 (1202)
T KOG0292|consen  992 NKKLQKGYKLTTEGKFGEAIEKFRSIIYSIPLLVVDSKEEEAEADELIKICREYIVG 1048 (1202)
T ss_pred             HHHHHHHHhhhccCcHHHHHHHHHHHHhheeEEEecchhhHHHHHHHHHHHHHHHhh
Confidence            999999999999999999999999999999999999999999999999999999997


No 3  
>PF07719 TPR_2:  Tetratricopeptide repeat;  InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=88.05  E-value=0.94  Score=24.21  Aligned_cols=28  Identities=21%  Similarity=0.195  Sum_probs=22.9

Q ss_pred             HHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          79 IQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      ...|.-++..|++++|+..|+..+..-|
T Consensus         5 ~~lg~~~~~~~~~~~A~~~~~~al~l~p   32 (34)
T PF07719_consen    5 YYLGQAYYQLGNYEEAIEYFEKALELDP   32 (34)
T ss_dssp             HHHHHHHHHTT-HHHHHHHHHHHHHHST
T ss_pred             HHHHHHHHHhCCHHHHHHHHHHHHHHCc
Confidence            3568889999999999999999987644


No 4  
>PF13432 TPR_16:  Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=86.73  E-value=0.6  Score=28.80  Aligned_cols=29  Identities=21%  Similarity=0.218  Sum_probs=25.1

Q ss_pred             HHHHHHhcccCCHHHHHHHHHHHHHHccc
Q psy2692          79 IQQCYQLTTSGKFVDAIEKLRNVLICIPL  107 (133)
Q Consensus        79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l  107 (133)
                      +..|..++..|+|++|+..|+.++..-|-
T Consensus         1 ~~~a~~~~~~g~~~~A~~~~~~~l~~~P~   29 (65)
T PF13432_consen    1 YALARALYQQGDYDEAIAAFEQALKQDPD   29 (65)
T ss_dssp             HHHHHHHHHCTHHHHHHHHHHHHHCCSTT
T ss_pred             ChHHHHHHHcCCHHHHHHHHHHHHHHCCC
Confidence            45788899999999999999999987764


No 5  
>PF04212 MIT:  MIT (microtubule interacting and transport) domain;  InterPro: IPR007330 The MIT domain is found in vacuolar sorting proteins, spastin (probable ATPase involved in the assembly or function of nuclear protein complexes), and a sorting nexin, which may play a role in intracellular trafficking.; PDB: 2DL1_A 2JQK_A 1WR0_A 2CPT_A 2JQH_A 2V6Y_A 2JQ9_A 2K3W_A 1YXR_A 3EAB_E ....
Probab=86.24  E-value=4.3  Score=26.01  Aligned_cols=53  Identities=19%  Similarity=0.338  Sum_probs=38.9

Q ss_pred             HHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccC-hHHHHHHHHHHHHHHHhh
Q psy2692          76 IAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTT-KKEIEEAKQIINICKEYI  130 (133)
Q Consensus        76 ~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~-~~e~~e~~eli~icrEYi  130 (133)
                      ...++.|-+.=..|++++|++.+++-+..+.-..-.. .++..++  +-..+.+|+
T Consensus         6 ~~~~~~Av~~D~~g~~~~A~~~Y~~ai~~l~~~~~~~~~~~~~~~--l~~k~~~yl   59 (69)
T PF04212_consen    6 IELIKKAVEADEAGNYEEALELYKEAIEYLMQALKSESNPERRQA--LRQKMKEYL   59 (69)
T ss_dssp             HHHHHHHHHHHHTTSHHHHHHHHHHHHHHHHHHHHHSTTHHHHHH--HHHHHHHHH
T ss_pred             HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHhccCCCHHHHHH--HHHHHHHHH
Confidence            3457888888889999999999999888776654443 3333333  778888886


No 6  
>PF00515 TPR_1:  Tetratricopeptide repeat;  InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=85.54  E-value=1.7  Score=23.49  Aligned_cols=28  Identities=18%  Similarity=0.189  Sum_probs=23.0

Q ss_pred             HHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          79 IQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      ...|.-.+..|++++|+..|+..|..-|
T Consensus         5 ~~~g~~~~~~~~~~~A~~~~~~al~~~p   32 (34)
T PF00515_consen    5 YNLGNAYFQLGDYEEALEYYQRALELDP   32 (34)
T ss_dssp             HHHHHHHHHTT-HHHHHHHHHHHHHHST
T ss_pred             HHHHHHHHHhCCchHHHHHHHHHHHHCc
Confidence            4678888999999999999999987644


No 7  
>PF13174 TPR_6:  Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=84.51  E-value=1.7  Score=22.95  Aligned_cols=29  Identities=17%  Similarity=0.236  Sum_probs=24.1

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      .++.|.-....|++++|++.|+.++..-|
T Consensus         3 ~~~~a~~~~~~g~~~~A~~~~~~~~~~~P   31 (33)
T PF13174_consen    3 LYRLARCYYKLGDYDEAIEYFQRLIKRYP   31 (33)
T ss_dssp             HHHHHHHHHHHCHHHHHHHHHHHHHHHST
T ss_pred             HHHHHHHHHHccCHHHHHHHHHHHHHHCc
Confidence            45667777778999999999999998765


No 8  
>PF13371 TPR_9:  Tetratricopeptide repeat
Probab=83.77  E-value=2.5  Score=26.28  Aligned_cols=41  Identities=20%  Similarity=0.353  Sum_probs=32.1

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHH
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQII  123 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli  123 (133)
                      ....|.-++..|+|++|+..|..++..-|     +..+...++..|
T Consensus        32 ~~~~a~~~~~~g~~~~A~~~l~~~l~~~p-----~~~~~~~~~a~l   72 (73)
T PF13371_consen   32 WLQRARCLFQLGRYEEALEDLERALELSP-----DDPDARALRAML   72 (73)
T ss_pred             hHHHHHHHHHhccHHHHHHHHHHHHHHCC-----CcHHHHHHHHhc
Confidence            45678889999999999999999998877     455555555443


No 9  
>KOG0624|consensus
Probab=81.54  E-value=6  Score=34.94  Aligned_cols=92  Identities=15%  Similarity=0.132  Sum_probs=61.8

Q ss_pred             HHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhcccC
Q psy2692          10 SINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTTSG   89 (133)
Q Consensus        10 ~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t~g   89 (133)
                      ++.|.|+|..|+.--+.-|   +-.|  .....+|-++.+|+.-=-+-+.+. -+.|-...+.+..-.++|.|.=+.+.|
T Consensus        47 ~lla~~Q~sDALt~yHaAv---e~dp--~~Y~aifrRaT~yLAmGksk~al~-Dl~rVlelKpDF~~ARiQRg~vllK~G  120 (504)
T KOG0624|consen   47 ELLARGQLSDALTHYHAAV---EGDP--NNYQAIFRRATVYLAMGKSKAALQ-DLSRVLELKPDFMAARIQRGVVLLKQG  120 (504)
T ss_pred             HHHHhhhHHHHHHHHHHHH---cCCc--hhHHHHHHHHHHHhhhcCCccchh-hHHHHHhcCccHHHHHHHhchhhhhcc
Confidence            6789999999988777654   4444  344555556666653222222211 001111345566677999999999999


Q ss_pred             CHHHHHHHHHHHHHHccc
Q psy2692          90 KFVDAIEKLRNVLICIPL  107 (133)
Q Consensus        90 KF~eAl~~Fr~iL~~i~l  107 (133)
                      ++++|..-|+.+|..-|-
T Consensus       121 ele~A~~DF~~vl~~~~s  138 (504)
T KOG0624|consen  121 ELEQAEADFDQVLQHEPS  138 (504)
T ss_pred             cHHHHHHHHHHHHhcCCC
Confidence            999999999999998874


No 10 
>cd02684 MIT_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with an n-terminal serine/threonine kinase domain. The molecular function of the MIT domain is unclear.
Probab=81.29  E-value=6.3  Score=26.32  Aligned_cols=59  Identities=14%  Similarity=0.107  Sum_probs=42.1

Q ss_pred             ChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692          71 SSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI  130 (133)
Q Consensus        71 ~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi  130 (133)
                      +++.-.+.+++|-..=+.|++++|+..+.+-|......+-...++ +.-.-+-..+.||+
T Consensus         2 ~l~~Ai~lv~~Av~~D~~g~y~eA~~lY~~ale~~~~~~k~e~~~-~~k~~lr~k~~eyl   60 (75)
T cd02684           2 SLEKAIALVVQAVKKDQRGDAAAALSLYCSALQYFVPALHYETDA-QRKEALRQKVLQYV   60 (75)
T ss_pred             cHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHhhCCCH-HHHHHHHHHHHHHH
Confidence            345556788999999999999999999999998887765332222 22223666777775


No 11 
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=79.92  E-value=7.4  Score=26.74  Aligned_cols=29  Identities=14%  Similarity=0.028  Sum_probs=24.9

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      ....|+-+...|++++|+..|+.++..-|
T Consensus        88 ~~~la~~~~~~g~~~~A~~~~~~al~~~p  116 (135)
T TIGR02552        88 YFHAAECLLALGEPESALKALDLAIEICG  116 (135)
T ss_pred             HHHHHHHHHHcCCHHHHHHHHHHHHHhcc
Confidence            45667778889999999999999999876


No 12 
>cd02680 MIT_calpain7_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=79.46  E-value=14  Score=24.92  Aligned_cols=37  Identities=14%  Similarity=0.026  Sum_probs=32.0

Q ss_pred             ChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHccc
Q psy2692          71 SSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPL  107 (133)
Q Consensus        71 ~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l  107 (133)
                      .++.-+..+++|...=++|..++|+..|.+.+-....
T Consensus         2 ~l~kai~Lv~~A~~eD~~gny~eA~~lY~~ale~~~~   38 (75)
T cd02680           2 DLERAHFLVTQAFDEDEKGNAEEAIELYTEAVELCIN   38 (75)
T ss_pred             CHHHHHHHHHHHHHhhHhhhHHHHHHHHHHHHHHHHH
Confidence            3455667788998888999999999999999999888


No 13 
>cd02678 MIT_VPS4 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in intracellular protein transport proteins of the AAA-ATPase family. The molecular function of the MIT domain is unclear.
Probab=79.32  E-value=11  Score=24.85  Aligned_cols=57  Identities=16%  Similarity=0.297  Sum_probs=41.3

Q ss_pred             HHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692          73 IDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI  130 (133)
Q Consensus        73 ~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi  130 (133)
                      +.-...+++|-+.=+.|++++|+..+.+.+..+...+-...++ .....+-..|.||+
T Consensus         4 ~~A~~l~~~Av~~D~~g~y~eA~~~Y~~aie~l~~~~k~e~~~-~~k~~~~~k~~eyl   60 (75)
T cd02678           4 QKAIELVKKAIEEDNAGNYEEALRLYQHALEYFMHALKYEKNP-KSKESIRAKCTEYL   60 (75)
T ss_pred             HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHhhCCCH-HHHHHHHHHHHHHH
Confidence            3345577888888899999999999999998887765332222 33344667777775


No 14 
>cd02656 MIT MIT: domain contained within Microtubule Interacting and Trafficking molecules. The MIT domain is found in sorting nexins, the nuclear thiol protease PalBH, the AAA protein spastin and archaebacterial proteins with similar domain architecture, vacuolar sorting proteins and others. The molecular function of the MIT domain is unclear.
Probab=79.27  E-value=9.4  Score=24.82  Aligned_cols=55  Identities=20%  Similarity=0.248  Sum_probs=40.0

Q ss_pred             HHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692          75 LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI  130 (133)
Q Consensus        75 L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi  130 (133)
                      -.+.++.|-+.=+.|++++|+..+.+.+..+....-.+.+ ...-..+-..|.+|+
T Consensus         6 a~~l~~~Av~~D~~g~~~~Al~~Y~~a~e~l~~~~~~~~~-~~~k~~l~~k~~~yl   60 (75)
T cd02656           6 AKELIKQAVKEDEDGNYEEALELYKEALDYLLQALKAEKE-PKLRKLLRKKVKEYL   60 (75)
T ss_pred             HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHhccCCC-HHHHHHHHHHHHHHH
Confidence            3446778888888999999999999999888776533322 333444777788876


No 15 
>PF13414 TPR_11:  TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=76.10  E-value=3.9  Score=25.15  Aligned_cols=29  Identities=17%  Similarity=0.162  Sum_probs=25.7

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      -...|..++..|++++|+..|...|..-|
T Consensus         6 ~~~~g~~~~~~~~~~~A~~~~~~ai~~~p   34 (69)
T PF13414_consen    6 WYNLGQIYFQQGDYEEAIEYFEKAIELDP   34 (69)
T ss_dssp             HHHHHHHHHHTTHHHHHHHHHHHHHHHST
T ss_pred             HHHHHHHHHHcCCHHHHHHHHHHHHHcCC
Confidence            45779999999999999999999998765


No 16 
>smart00745 MIT Microtubule Interacting and Trafficking molecule domain.
Probab=76.08  E-value=14  Score=23.87  Aligned_cols=56  Identities=11%  Similarity=0.216  Sum_probs=38.7

Q ss_pred             HHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692          74 DLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI  130 (133)
Q Consensus        74 ~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi  130 (133)
                      .-.+.++.|-+.=+.|+.++|+..+++-+..+.-..-.. .+...-..+-..|++|+
T Consensus         7 ~A~~li~~Av~~d~~g~~~eAl~~Y~~a~e~l~~~~~~~-~~~~~~~~~~~k~~eyl   62 (77)
T smart00745        7 KAKELISKALKADEAGDYEEALELYKKAIEYLLEGIKVE-SDSKRREAVKAKAAEYL   62 (77)
T ss_pred             HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHhccC-CCHHHHHHHHHHHHHHH
Confidence            344567788888889999999999999888877644222 22233334667777775


No 17 
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=73.88  E-value=4.9  Score=19.15  Aligned_cols=26  Identities=15%  Similarity=0.147  Sum_probs=21.1

Q ss_pred             HHHHHHhcccCCHHHHHHHHHHHHHH
Q psy2692          79 IQQCYQLTTSGKFVDAIEKLRNVLIC  104 (133)
Q Consensus        79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~  104 (133)
                      .+.|.-+...|++++|+..|+..+..
T Consensus         5 ~~~a~~~~~~~~~~~a~~~~~~~~~~   30 (34)
T smart00028        5 YNLGNAYLKLGDYDEALEYYEKALEL   30 (34)
T ss_pred             HHHHHHHHHHhhHHHHHHHHHHHHcc
Confidence            35567778889999999999988754


No 18 
>cd02677 MIT_SNX15 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in sorting nexin 15 and related proteins. The molecular function of the MIT domain is unclear.
Probab=72.48  E-value=21  Score=23.85  Aligned_cols=57  Identities=16%  Similarity=0.109  Sum_probs=40.1

Q ss_pred             HHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692          73 IDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI  130 (133)
Q Consensus        73 ~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi  130 (133)
                      +.-.+.++.|-..=+.|++++|+..+++-|......+-.. .+.+.-.-+-..+.+|+
T Consensus         4 ~~A~~l~~~Ave~d~~~~y~eA~~~Y~~~i~~~~~~~k~e-~~~~~k~~ir~K~~eYl   60 (75)
T cd02677           4 EQAAELIRLALEKEEEGDYEAAFEFYRAGVDLLLKGVQGD-SSPERREAVKRKIAEYL   60 (75)
T ss_pred             HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhccC-CCHHHHHHHHHHHHHHH
Confidence            3345678889999999999999999999998877754332 22222223566677775


No 19 
>PF13812 PPR_3:  Pentatricopeptide repeat domain
Probab=72.14  E-value=4.3  Score=21.47  Aligned_cols=20  Identities=15%  Similarity=0.280  Sum_probs=16.8

Q ss_pred             HHHHhccCHHHHHHHHHhhc
Q psy2692           9 ESINKSSSFETAFKLLHDQI   28 (133)
Q Consensus         9 ~~~~AAGsFe~A~~LLnrQi   28 (133)
                      ..++.+|+++.|++++++-.
T Consensus         9 ~a~~~~g~~~~a~~~~~~M~   28 (34)
T PF13812_consen    9 RACAKAGDPDAALQLFDEMK   28 (34)
T ss_pred             HHHHHCCCHHHHHHHHHHHH
Confidence            35788999999999998754


No 20 
>PRK15331 chaperone protein SicA; Provisional
Probab=71.56  E-value=7.6  Score=30.14  Aligned_cols=34  Identities=15%  Similarity=0.151  Sum_probs=28.3

Q ss_pred             CChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHH
Q psy2692          70 RSSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLI  103 (133)
Q Consensus        70 ~~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~  103 (133)
                      ++-+.|..--..||.++..||+.||...||-+..
T Consensus        32 is~~~le~iY~~Ay~~y~~Gk~~eA~~~F~~L~~   65 (165)
T PRK15331         32 IPQDMMDGLYAHAYEFYNQGRLDEAETFFRFLCI   65 (165)
T ss_pred             CCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH
Confidence            3445566677999999999999999999998765


No 21 
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=70.91  E-value=9.5  Score=29.37  Aligned_cols=92  Identities=9%  Similarity=-0.018  Sum_probs=57.5

Q ss_pred             HHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCC----CCCCccccCCCCCCCChHHHHHHHHHHHH
Q psy2692           9 ESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLP----LLAPVTAFPTRKVDRSSIDLIAKIQQCYQ   84 (133)
Q Consensus         9 ~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p----~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk   84 (133)
                      ..+.+.|+++.|...+++=+-   ..|-.+.+...+..+...-.+-.    ....+.-.+.+    ..++.......|.-
T Consensus        81 ~~~~~~g~~~~A~~a~~~Al~---l~P~~~~~~~~lA~aL~~~~g~~~~~~A~~~l~~al~~----dP~~~~al~~LA~~  153 (198)
T PRK10370         81 EYYLWRNDYDNALLAYRQALQ---LRGENAELYAALATVLYYQAGQHMTPQTREMIDKALAL----DANEVTALMLLASD  153 (198)
T ss_pred             HHHHHCCCHHHHHHHHHHHHH---hCCCCHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHh----CCCChhHHHHHHHH
Confidence            356789999999999997653   44655555555544332222210    01111111111    22233445667888


Q ss_pred             hcccCCHHHHHHHHHHHHHHccc
Q psy2692          85 LTTSGKFVDAIEKLRNVLICIPL  107 (133)
Q Consensus        85 ~~t~gKF~eAl~~Fr~iL~~i~l  107 (133)
                      ++..|++++|+..|+.++...|=
T Consensus       154 ~~~~g~~~~Ai~~~~~aL~l~~~  176 (198)
T PRK10370        154 AFMQADYAQAIELWQKVLDLNSP  176 (198)
T ss_pred             HHHcCCHHHHHHHHHHHHhhCCC
Confidence            89999999999999999999886


No 22 
>cd02681 MIT_calpain7_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=70.85  E-value=14  Score=24.90  Aligned_cols=53  Identities=15%  Similarity=0.194  Sum_probs=37.1

Q ss_pred             HHHHHHHHhcccCCHHHHHHHHHHHHHHcccc-cccChHHHHHHHHHHHHHHHhh
Q psy2692          77 AKIQQCYQLTTSGKFVDAIEKLRNVLICIPLL-VVTTKKEIEEAKQIINICKEYI  130 (133)
Q Consensus        77 ~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~-vv~~~~e~~e~~eli~icrEYi  130 (133)
                      +-++.|-..=+.|++++|+..+++-+-.+... ..++..+.. ...+-..|.||+
T Consensus         8 ~~a~~Ave~D~~g~y~eA~~~Y~~aie~l~~~~~~~~~n~~~-k~~ir~K~~eYl   61 (76)
T cd02681           8 QFARLAVQRDQEGRYSEAVFYYKEAAQLLIYAEMAGTLNDSH-LKTIQEKSNEYL   61 (76)
T ss_pred             HHHHHHHHHHHccCHHHHHHHHHHHHHHHHHHHHhcCCChHH-HHHHHHHHHHHH
Confidence            35678888889999999999999998887664 233232222 223567777775


No 23 
>PF13428 TPR_14:  Tetratricopeptide repeat
Probab=70.56  E-value=8.6  Score=22.25  Aligned_cols=28  Identities=21%  Similarity=0.165  Sum_probs=23.8

Q ss_pred             HHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          79 IQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      +.-|.-+...|++++|+..|+.++..-|
T Consensus         5 ~~la~~~~~~G~~~~A~~~~~~~l~~~P   32 (44)
T PF13428_consen    5 LALARAYRRLGQPDEAERLLRRALALDP   32 (44)
T ss_pred             HHHHHHHHHcCCHHHHHHHHHHHHHHCc
Confidence            4456677889999999999999998776


No 24 
>PF14559 TPR_19:  Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=70.35  E-value=4  Score=24.96  Aligned_cols=23  Identities=35%  Similarity=0.420  Sum_probs=19.2

Q ss_pred             hcccCCHHHHHHHHHHHHHHccc
Q psy2692          85 LTTSGKFVDAIEKLRNVLICIPL  107 (133)
Q Consensus        85 ~~t~gKF~eAl~~Fr~iL~~i~l  107 (133)
                      ++..|++++|+..|+.++..-|-
T Consensus         1 ll~~~~~~~A~~~~~~~l~~~p~   23 (68)
T PF14559_consen    1 LLKQGDYDEAIELLEKALQRNPD   23 (68)
T ss_dssp             HHHTTHHHHHHHHHHHHHHHTTT
T ss_pred             ChhccCHHHHHHHHHHHHHHCCC
Confidence            35679999999999999988774


No 25 
>KOG0543|consensus
Probab=68.29  E-value=10  Score=33.23  Aligned_cols=47  Identities=17%  Similarity=0.199  Sum_probs=40.0

Q ss_pred             HHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHH
Q psy2692          81 QCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICK  127 (133)
Q Consensus        81 ~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icr  127 (133)
                      .|=-+|+.|||..|...|+.++..+.--.-.+.+|.+++.++.-.|-
T Consensus       214 ~Gn~~fK~gk~~~A~~~Yerav~~l~~~~~~~~ee~~~~~~~k~~~~  260 (397)
T KOG0543|consen  214 RGNVLFKEGKFKLAKKRYERAVSFLEYRRSFDEEEQKKAEALKLACH  260 (397)
T ss_pred             hhhHHHhhchHHHHHHHHHHHHHHhhccccCCHHHHHHHHHHHHHHh
Confidence            37789999999999999999999998776677788888888776664


No 26 
>smart00668 CTLH C-terminal to LisH motif. Alpha-helical motif of unknown function.
Probab=67.97  E-value=8.7  Score=23.13  Aligned_cols=38  Identities=18%  Similarity=0.177  Sum_probs=32.2

Q ss_pred             HHHHhccCHHHHHHHHHhhcC-------cccccccHHHHHHHHhh
Q psy2692           9 ESINKSSSFETAFKLLHDQIG-------VKNFLPYRQMFLNSYLQ   46 (133)
Q Consensus         9 ~~~~AAGsFe~A~~LLnrQiG-------ivNf~PLk~~F~~~y~~   46 (133)
                      .+.+..|+++.|+..+++-..       -..|.-.++.|++....
T Consensus         9 ~~~i~~g~~~~a~~~~~~~~~~l~~~~~~l~f~L~~q~~lell~~   53 (58)
T smart00668        9 RELILKGDWDEALEWLSSLKPPLLERNSKLEFELRKQKFLELVRQ   53 (58)
T ss_pred             HHHHHcCCHHHHHHHHHHcCHHHhccCCCchhHHHHHHHHHHHHc
Confidence            357889999999999988876       67888899999988754


No 27 
>cd03598 CLECT_EMBP_like C-type lectin-like domain (CTLD) of the type found in the human proteins, eosinophil major basic protein (EMBP) and prepro major basic protein homolog (MBPH). CLECT_EMBP_like: C-type lectin-like domain (CTLD) of the type found in the human proteins, eosinophil major basic protein (EMBP) and prepro major basic protein homolog (MBPH).  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  Eosinophils and basophils carry out various functions in allergic, parasitic, and inflammatory diseases.  EMBP is stored in eosinophil crystalloid granules and is released upon degranulation.  EMBP is also expressed in basophils.  The proform of EMBP is expressed in placental X cells and breast tissue and increases significantly during human pregnancy.  EMBP has cytotoxic properties and damages bacteria and mammalian cells, in vitro, as well as, helminth parasites.  EMBP deposition has been observed in the inflamed tissue of all
Probab=67.88  E-value=2.9  Score=28.93  Aligned_cols=42  Identities=17%  Similarity=0.217  Sum_probs=34.9

Q ss_pred             HHHhcc-cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692          82 CYQLTT-SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN  124 (133)
Q Consensus        82 gyk~~t-~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~  124 (133)
                      +|+.++ ...|.||...+++. +-.-|+.++|.+|.+.+.+++.
T Consensus         3 Cy~~~~~~~t~~~A~~~C~~~-~g~~La~i~s~~e~~~l~~~~~   45 (117)
T cd03598           3 CYRFVKSPRTFRDAQVICRRC-YRGNLASIHSFAFNYRVQRLVS   45 (117)
T ss_pred             eEEEecCCCCHHHHHHHhhcC-CCceEeeecChhHhHHHHHHHh
Confidence            466664 58899999999987 6677899999999999988874


No 28 
>cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear.
Probab=67.60  E-value=18  Score=24.49  Aligned_cols=52  Identities=13%  Similarity=0.135  Sum_probs=34.8

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI  130 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi  130 (133)
                      -++.|-++=+.|+++||+..++.-+-.+.=++-...++.. ..-+-+.+.||.
T Consensus         9 ~a~~AVe~D~~gr~~eAi~~Y~~aIe~L~q~~~~~pD~~~-k~~yr~ki~eY~   60 (75)
T cd02682           9 YAINAVKAEKEGNAEDAITNYKKAIEVLSQIVKNYPDSPT-RLIYEQMINEYK   60 (75)
T ss_pred             HHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHhCCChHH-HHHHHHHHHHHH
Confidence            4678888889999999999999877666554444333332 222555666664


No 29 
>cd02683 MIT_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with unknown function, co-occuring with an as yet undescribed domain. The molecular function of the MIT domain is unclear.
Probab=67.42  E-value=17  Score=24.40  Aligned_cols=54  Identities=17%  Similarity=0.272  Sum_probs=38.9

Q ss_pred             HHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692          76 IAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI  130 (133)
Q Consensus        76 ~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi  130 (133)
                      .+.++.|-..=+.|++++|+..+.+-|..+...+-... +...-.-+-..+.||+
T Consensus         7 ~~l~~~Ave~D~~g~y~eAl~~Y~~aie~l~~~lk~e~-d~~~k~~~r~ki~eY~   60 (77)
T cd02683           7 KEVLKRAVELDQEGRFQEALVCYQEGIDLLMQVLKGTK-DEAKKKNLRQKISEYM   60 (77)
T ss_pred             HHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHhhCC-CHHHHHHHHHHHHHHH
Confidence            45678888889999999999999999988877643332 2222333666777775


No 30 
>PF04097 Nic96:  Nup93/Nic96;  InterPro: IPR007231 Nup93/Nic96 is a component of the nuclear pore complex. It is required for the correct assembly of the nuclear pore complex []. In Saccharomyces cerevisiae, Nic96 has been shown to be involved in the distribution and cellular concentration of the GTPase Gsp1 []. The structure of Nic96 has revealed a mostly alpha helical structure [].; GO: 0006810 transport, 0005643 nuclear pore; PDB: 2QX5_B 2RFO_A.
Probab=66.36  E-value=15  Score=33.50  Aligned_cols=82  Identities=16%  Similarity=0.361  Sum_probs=47.6

Q ss_pred             HHhccCHHHHHHHHHhhcCcc---------------cccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCCCCChHH-
Q psy2692          11 INKSSSFETAFKLLHDQIGVK---------------NFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSID-   74 (133)
Q Consensus        11 ~~AAGsFe~A~~LLnrQiGiv---------------Nf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~-   74 (133)
                      |-.||++|.+++++|+++|-+               +...+-..+++.|.........+.          +.   ..+. 
T Consensus       437 y~La~~~d~vl~lln~~Ls~~l~~~~~~~~~~s~~~~l~~la~~i~~~y~~~~~~~~~~~----------~~---~~~t~  503 (613)
T PF04097_consen  437 YHLAEEYDKVLSLLNRLLSQVLSQPSSSSLSDSERERLIELAKEILERYKSNPHISSKVS----------RK---NRETF  503 (613)
T ss_dssp             HHHTT-HHHHHHHHHHHHHHHHHCSSTSSSSSTTTTSHHHHHHHHHHHHTTSHHHHTTS-----------HH---HHHHH
T ss_pred             HHHHhhHHHHHHHHHHHHHHHHcCccccccccchhhhHHHHHHHHHHHHHhCcchHhhcc----------HH---HHHHH
Confidence            456899999999999997643               222233444455544433322200          00   1222 


Q ss_pred             -HHHHHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          75 -LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        75 -L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                       +.-+|.+.+.++.+|+.++|++.+.++ ..+|
T Consensus       504 ~~Ll~L~~ff~~~~~g~~~~AL~~i~~L-~liP  535 (613)
T PF04097_consen  504 QLLLDLAEFFDLYHAGQYEQALDIIEKL-DLIP  535 (613)
T ss_dssp             HHHHHHHHHHHHHHTT-HHHHHHHHHHT-T-S-
T ss_pred             HHHHHHHHHHHHHHcCCHHHHHHHHHhC-CCCC
Confidence             333688999999999999999877653 4566


No 31 
>PF14276 DUF4363:  Domain of unknown function (DUF4363)
Probab=64.92  E-value=46  Score=23.44  Aligned_cols=61  Identities=11%  Similarity=0.129  Sum_probs=50.2

Q ss_pred             hHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHc-----ccccccChHHHHHHHHHHHHHHHhhhc
Q psy2692          72 SIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICI-----PLLVVTTKKEIEEAKQIINICKEYIVG  132 (133)
Q Consensus        72 l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i-----~l~vv~~~~e~~e~~eli~icrEYila  132 (133)
                      -+++.+.+....+.+..++.++|...+..+....     .+...-+.+|.+++..-+...+.||-+
T Consensus        25 ~~~i~~~l~~i~~~i~~~dW~~A~~~~~~l~~~W~k~~~~~~~~~~h~eid~i~~sl~rl~~~i~~   90 (121)
T PF14276_consen   25 TDSIEEQLEQIEEAIENEDWEKAYKETEELEKEWDKNKKRWSILIEHQEIDNIDISLARLKGYIEA   90 (121)
T ss_pred             HHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHhhchheeeeecHHHHHHHHHHHHHHHHHHHC
Confidence            4568889999999999999999999988887653     223336799999999999999999854


No 32 
>PRK09720 cybC cytochrome b562; Provisional
Probab=63.90  E-value=14  Score=26.57  Aligned_cols=28  Identities=21%  Similarity=0.264  Sum_probs=25.2

Q ss_pred             ChHHHHHHHHHHHHhcccCCHHHHHHHH
Q psy2692          71 SSIDLIAKIQQCYQLTTSGKFVDAIEKL   98 (133)
Q Consensus        71 ~l~~L~~~Lk~gyk~~t~gKF~eAl~~F   98 (133)
                      +++.|+..+..+-++...||++||...=
T Consensus        58 Gld~lI~qID~A~~La~~GkL~eAK~~a   85 (100)
T PRK09720         58 GFDILVGQIDDALKLANEGKVKEAQAAA   85 (100)
T ss_pred             HHHHHHHHHHHHHHHHHcCCHHHHHHHH
Confidence            6889999999999999999999998543


No 33 
>PF09976 TPR_21:  Tetratricopeptide repeat;  InterPro: IPR018704  This domain, found in various hypothetical prokaryotic proteins, has no known function. 
Probab=62.82  E-value=25  Score=25.14  Aligned_cols=90  Identities=11%  Similarity=0.054  Sum_probs=46.3

Q ss_pred             HHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhccc
Q psy2692           9 ESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTTS   88 (133)
Q Consensus         9 ~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t~   88 (133)
                      ..++..|+|+.|...|++=+.-..=.++++.. .+.. +++.+..-..-..+.. +.....-.+..... .-.|--++..
T Consensus        56 ~~~~~~g~~~~A~~~l~~~~~~~~d~~l~~~a-~l~L-A~~~~~~~~~d~Al~~-L~~~~~~~~~~~~~-~~~Gdi~~~~  131 (145)
T PF09976_consen   56 KAAYEQGDYDEAKAALEKALANAPDPELKPLA-RLRL-ARILLQQGQYDEALAT-LQQIPDEAFKALAA-ELLGDIYLAQ  131 (145)
T ss_pred             HHHHHCCCHHHHHHHHHHHHhhCCCHHHHHHH-HHHH-HHHHHHcCCHHHHHHH-HHhccCcchHHHHH-HHHHHHHHHC
Confidence            35667899999999999977654222344333 2211 1222111111000000 00000111222222 2467778889


Q ss_pred             CCHHHHHHHHHHHH
Q psy2692          89 GKFVDAIEKLRNVL  102 (133)
Q Consensus        89 gKF~eAl~~Fr~iL  102 (133)
                      |++++|+..|+..|
T Consensus       132 g~~~~A~~~y~~Al  145 (145)
T PF09976_consen  132 GDYDEARAAYQKAL  145 (145)
T ss_pred             CCHHHHHHHHHHhC
Confidence            99999999998753


No 34 
>PF07721 TPR_4:  Tetratricopeptide repeat;  InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=61.33  E-value=9.8  Score=19.89  Aligned_cols=19  Identities=16%  Similarity=0.202  Sum_probs=16.0

Q ss_pred             hHHHHhccCHHHHHHHHHh
Q psy2692           8 SESINKSSSFETAFKLLHD   26 (133)
Q Consensus         8 ~~~~~AAGsFe~A~~LLnr   26 (133)
                      +..+...|+++.|.+++.+
T Consensus         8 a~~~~~~G~~~eA~~~l~~   26 (26)
T PF07721_consen    8 ARALLAQGDPDEAERLLRR   26 (26)
T ss_pred             HHHHHHcCCHHHHHHHHhC
Confidence            4568899999999999864


No 35 
>PF13525 YfiO:  Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=61.16  E-value=14  Score=28.29  Aligned_cols=31  Identities=16%  Similarity=0.253  Sum_probs=24.0

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLL  108 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~  108 (133)
                      ..+.|...+..|++.+|+..|..++..-|-.
T Consensus         8 lY~~a~~~~~~g~y~~Ai~~f~~l~~~~P~s   38 (203)
T PF13525_consen    8 LYQKALEALQQGDYEEAIKLFEKLIDRYPNS   38 (203)
T ss_dssp             HHHHHHHHHHCT-HHHHHHHHHHHHHH-TTS
T ss_pred             HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCC
Confidence            5578888888888888888888888887764


No 36 
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=61.02  E-value=18  Score=32.38  Aligned_cols=37  Identities=16%  Similarity=0.185  Sum_probs=30.8

Q ss_pred             ChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHccc
Q psy2692          71 SSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPL  107 (133)
Q Consensus        71 ~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l  107 (133)
                      ..++-....+.|..++..|++++|+..|+..|..-|=
T Consensus        71 dP~~a~a~~NLG~AL~~lGryeEAIa~f~rALeL~Pd  107 (453)
T PLN03098         71 DVKTAEDAVNLGLSLFSKGRVKDALAQFETALELNPN  107 (453)
T ss_pred             CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCC
Confidence            4455666788899999999999999999999887664


No 37 
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=60.85  E-value=12  Score=21.91  Aligned_cols=28  Identities=18%  Similarity=0.200  Sum_probs=19.8

Q ss_pred             HHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          79 IQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      .+.|.-.+..|++++|+..|+.++..-|
T Consensus         4 ~~~a~~~~~~~~~~~A~~~~~~~~~~~~   31 (100)
T cd00189           4 LNLGNLYYKLGDYDEALEYYEKALELDP   31 (100)
T ss_pred             HHHHHHHHHHhcHHHHHHHHHHHHhcCC
Confidence            3455666677888888888888776554


No 38 
>PLN03088 SGT1,  suppressor of G2 allele of SKP1; Provisional
Probab=60.10  E-value=33  Score=28.83  Aligned_cols=109  Identities=13%  Similarity=0.058  Sum_probs=59.3

Q ss_pred             hHHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhcc
Q psy2692           8 SESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTT   87 (133)
Q Consensus         8 ~~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t   87 (133)
                      +.+....|+|+.|+.++++-+..- ..-..-++    ..+.+++-.-..--.+ ....+......+.....++.|.-+..
T Consensus         9 a~~a~~~~~~~~Ai~~~~~Al~~~-P~~~~a~~----~~a~~~~~~g~~~eAl-~~~~~Al~l~P~~~~a~~~lg~~~~~   82 (356)
T PLN03088          9 AKEAFVDDDFALAVDLYTQAIDLD-PNNAELYA----DRAQANIKLGNFTEAV-ADANKAIELDPSLAKAYLRKGTACMK   82 (356)
T ss_pred             HHHHHHcCCHHHHHHHHHHHHHhC-CCCHHHHH----HHHHHHHHcCCHHHHH-HHHHHHHHhCcCCHHHHHHHHHHHHH
Confidence            457788999999999999887642 22111111    1122221110000000 00000000011112234566777888


Q ss_pred             cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692          88 SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI  130 (133)
Q Consensus        88 ~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi  130 (133)
                      .|++++|+..|+..+..-|--        .++...+..|.+.+
T Consensus        83 lg~~~eA~~~~~~al~l~P~~--------~~~~~~l~~~~~kl  117 (356)
T PLN03088         83 LEEYQTAKAALEKGASLAPGD--------SRFTKLIKECDEKI  117 (356)
T ss_pred             hCCHHHHHHHHHHHHHhCCCC--------HHHHHHHHHHHHHH
Confidence            999999999999999987642        23566677776554


No 39 
>KOG4648|consensus
Probab=57.31  E-value=15  Score=32.62  Aligned_cols=51  Identities=20%  Similarity=0.265  Sum_probs=37.9

Q ss_pred             HHHHhcccCCHHHHHHHHHHHHHHcccccccC-------------hHHHHHHHHHHHHHHHhhh
Q psy2692          81 QCYQLTTSGKFVDAIEKLRNVLICIPLLVVTT-------------KKEIEEAKQIINICKEYIV  131 (133)
Q Consensus        81 ~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~-------------~~e~~e~~eli~icrEYil  131 (133)
                      .|-++|+.|||+||+++|-.-+-.-|-..|--             ..-+.++...|.+-|-|+-
T Consensus       103 ~GN~yFKQgKy~EAIDCYs~~ia~~P~NpV~~~NRA~AYlk~K~FA~AE~DC~~AiaLd~~Y~K  166 (536)
T KOG4648|consen  103 RGNTYFKQGKYEEAIDCYSTAIAVYPHNPVYHINRALAYLKQKSFAQAEEDCEAAIALDKLYVK  166 (536)
T ss_pred             hhhhhhhccchhHHHHHhhhhhccCCCCccchhhHHHHHHHHHHHHHHHHhHHHHHHhhHHHHH
Confidence            48999999999999999999998888444431             1234456667777777764


No 40 
>PF13176 TPR_7:  Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=56.93  E-value=19  Score=19.98  Aligned_cols=22  Identities=23%  Similarity=0.253  Sum_probs=17.1

Q ss_pred             HHHHhcccCCHHHHHHHHHHHH
Q psy2692          81 QCYQLTTSGKFVDAIEKLRNVL  102 (133)
Q Consensus        81 ~gyk~~t~gKF~eAl~~Fr~iL  102 (133)
                      -|--....|++++|++.|+..|
T Consensus         5 Lg~~~~~~g~~~~Ai~~y~~aL   26 (36)
T PF13176_consen    5 LGRIYRQQGDYEKAIEYYEQAL   26 (36)
T ss_dssp             HHHHHHHCT-HHHHHHHHHHHH
T ss_pred             HHHHHHHcCCHHHHHHHHHHHH
Confidence            3445678999999999999966


No 41 
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=56.50  E-value=26  Score=26.86  Aligned_cols=40  Identities=18%  Similarity=0.113  Sum_probs=31.5

Q ss_pred             CCC-hHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692          69 DRS-SIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLL  108 (133)
Q Consensus        69 ~~~-l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~  108 (133)
                      .++ -+.|..--.-||.+...||+.||...|+-+...=|-.
T Consensus        28 ~~~~~~~l~~lY~~A~~ly~~G~l~~A~~~f~~L~~~Dp~~   68 (157)
T PRK15363         28 DDDVTQPLNTLYRYAMQLMEVKEFAGAARLFQLLTIYDAWS   68 (157)
T ss_pred             CCChHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCccc
Confidence            445 3345556688999999999999999999988776654


No 42 
>PF13181 TPR_8:  Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=56.10  E-value=25  Score=18.47  Aligned_cols=27  Identities=19%  Similarity=0.230  Sum_probs=21.5

Q ss_pred             HHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          80 QQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        80 k~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      ..|.-....|++++|++.|+..+..-|
T Consensus         6 ~lg~~y~~~~~~~~A~~~~~~a~~~~~   32 (34)
T PF13181_consen    6 NLGKIYEQLGDYEEALEYFEKALELNP   32 (34)
T ss_dssp             HHHHHHHHTTSHHHHHHHHHHHHHHHT
T ss_pred             HHHHHHHHcCCHHHHHHHHHHHHhhCC
Confidence            456666779999999999999887543


No 43 
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=55.45  E-value=24  Score=23.07  Aligned_cols=21  Identities=10%  Similarity=0.180  Sum_probs=16.1

Q ss_pred             HHHHhccCHHHHHHHHHhhcC
Q psy2692           9 ESINKSSSFETAFKLLHDQIG   29 (133)
Q Consensus         9 ~~~~AAGsFe~A~~LLnrQiG   29 (133)
                      ..+...|+++.|+..+.+-+.
T Consensus        10 ~~~~~~~~~~~A~~~~~~~~~   30 (119)
T TIGR02795        10 LLVLKAGDYADAIQAFQAFLK   30 (119)
T ss_pred             HHHHHcCCHHHHHHHHHHHHH
Confidence            456778999999988877653


No 44 
>PF13374 TPR_10:  Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=55.43  E-value=22  Score=19.19  Aligned_cols=25  Identities=20%  Similarity=0.349  Sum_probs=18.6

Q ss_pred             HHHHHhcccCCHHHHHHHHHHHHHH
Q psy2692          80 QQCYQLTTSGKFVDAIEKLRNVLIC  104 (133)
Q Consensus        80 k~gyk~~t~gKF~eAl~~Fr~iL~~  104 (133)
                      +-|.-+...|++++|+..++.++..
T Consensus         7 ~la~~~~~~g~~~~A~~~~~~al~~   31 (42)
T PF13374_consen    7 NLANAYRAQGRYEEALELLEEALEI   31 (42)
T ss_dssp             HHHHHHHHCT-HHHHHHHHHHHHHH
T ss_pred             HHHHHHHhhhhcchhhHHHHHHHHH
Confidence            4455566779999999999988754


No 45 
>PF03986 Autophagy_N:  Autophagocytosis associated protein (Atg3), N-terminal domain ;  InterPro: IPR007134 Proteins in this entry belong to the Atg3 group of proteins and the Atg3 conjugation enzymes. Autophagy is a degradative transport pathway that delivers cytosolic proteins to the lysosome (vacuole) [] and is induced by starvation []. Cytosolic proteins appear inside the vacuole enclosed in autophagic vesicles. Autophagy significantly differs from other transport pathways by using double membrane layered transport intermediates, called autophagosomes [, ]. The breakdown of vesicular transport intermediates is a unique feature of autophagy []. Autophagy can also function in the elimination of invading bacteria and antigens []. Atg3 is the E2 enzyme for the LC3 lipidation process []. It is essential for autophagocytosis. The super protein complex, the Atg16L complex, consists of multiple Atg12-Atg5 conjugates. Atg16L has an E3-like role in the LC3 lipidation reaction. The activated intermediate, LC3-Atg3 (E2), is recruited to the site where the lipidation takes place [].  Atg3 catalyses the conjugation of Atg8 and phosphatidylethanolamine (PE). Atg3 has an alpha/beta-fold, and its core region is topologically similar to canonical E2 enzymes. Atg3 has two regions inserted in the core region and another with a long alpha-helical structure that protrudes from the core region as far as 30 A []. It interacts with atg8 through an intermediate thioester bond between Cys-288 and the C-terminal Gly of atg8. It also interacts with the C-terminal region of the E1-like atg7 enzyme. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the lysosome/vacuole. Atg3 is a ubiquitin like modifier that is topologically similar to the canonical E2 enzyme []. It catalyses the conjugation of Atg8 and phosphatidylethanolamine []. This domain is the N-terminal of Atg3 while the C-terminal is represented by IPR007135 from INTERPRO.; PDB: 3T7G_C 2DYT_A.
Probab=55.32  E-value=3.9  Score=30.99  Aligned_cols=13  Identities=15%  Similarity=0.350  Sum_probs=9.6

Q ss_pred             cCChHHHHhccCH
Q psy2692           5 KVTSESINKSSSF   17 (133)
Q Consensus         5 ~~~~~~~~AAGsF   17 (133)
                      .+||+++|+||+|
T Consensus        27 ~iTPeEFV~AGD~   39 (145)
T PF03986_consen   27 VITPEEFVAAGDY   39 (145)
T ss_dssp             ---HHHHHHHHHH
T ss_pred             eeCHHHHHHhhhH
Confidence            4799999999997


No 46 
>PF13432 TPR_16:  Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=54.96  E-value=32  Score=20.76  Aligned_cols=58  Identities=14%  Similarity=0.191  Sum_probs=40.3

Q ss_pred             HHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhccc
Q psy2692           9 ESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTTS   88 (133)
Q Consensus         9 ~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t~   88 (133)
                      ..+...|+|+.|...+++-+..-                       |.                 ........|.-+...
T Consensus         5 ~~~~~~g~~~~A~~~~~~~l~~~-----------------------P~-----------------~~~a~~~lg~~~~~~   44 (65)
T PF13432_consen    5 RALYQQGDYDEAIAAFEQALKQD-----------------------PD-----------------NPEAWYLLGRILYQQ   44 (65)
T ss_dssp             HHHHHCTHHHHHHHHHHHHHCCS-----------------------TT-----------------HHHHHHHHHHHHHHT
T ss_pred             HHHHHcCCHHHHHHHHHHHHHHC-----------------------CC-----------------CHHHHHHHHHHHHHc
Confidence            45778888888888887653221                       11                 122345666677889


Q ss_pred             CCHHHHHHHHHHHHHHcc
Q psy2692          89 GKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        89 gKF~eAl~~Fr~iL~~i~  106 (133)
                      |++++|+..|+.++..-|
T Consensus        45 g~~~~A~~~~~~a~~~~P   62 (65)
T PF13432_consen   45 GRYDEALAYYERALELDP   62 (65)
T ss_dssp             T-HHHHHHHHHHHHHHST
T ss_pred             CCHHHHHHHHHHHHHHCc
Confidence            999999999999987655


No 47 
>TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
Probab=54.32  E-value=14  Score=22.40  Aligned_cols=25  Identities=20%  Similarity=0.225  Sum_probs=20.8

Q ss_pred             hHHHHhccCHHHHHHHHHhhcCccc
Q psy2692           8 SESINKSSSFETAFKLLHDQIGVKN   32 (133)
Q Consensus         8 ~~~~~AAGsFe~A~~LLnrQiGivN   32 (133)
                      +..|+..|+.|+|-.+|+.-+.--|
T Consensus         6 A~ayie~Gd~e~Ar~lL~evl~~~~   30 (44)
T TIGR03504         6 ARAYIEMGDLEGARELLEEVIEEGD   30 (44)
T ss_pred             HHHHHHcCChHHHHHHHHHHHHcCC
Confidence            4679999999999999998875444


No 48 
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=53.75  E-value=15  Score=30.57  Aligned_cols=31  Identities=16%  Similarity=0.244  Sum_probs=28.3

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLL  108 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~  108 (133)
                      .-+.||+++..|||.+|...|+.-+...|=-
T Consensus       144 ~Y~~A~~~~ksgdy~~A~~~F~~fi~~YP~s  174 (262)
T COG1729         144 LYNAALDLYKSGDYAEAEQAFQAFIKKYPNS  174 (262)
T ss_pred             HHHHHHHHHHcCCHHHHHHHHHHHHHcCCCC
Confidence            6799999999999999999999999988743


No 49 
>PRK15058 cytochrome b562; Provisional
Probab=53.44  E-value=24  Score=26.39  Aligned_cols=88  Identities=18%  Similarity=0.218  Sum_probs=54.8

Q ss_pred             HhccCHHHHHHHHHhhcCcccc----cccHHHHHHHHhhhhhhccCCC----CCCCccccCCCC-CCCChHHHHHHHHHH
Q psy2692          12 NKSSSFETAFKLLHDQIGVKNF----LPYRQMFLNSYLQSRTAYSTLP----LLAPVTAFPTRK-VDRSSIDLIAKIQQC   82 (133)
Q Consensus        12 ~AAGsFe~A~~LLnrQiGivNf----~PLk~~F~~~y~~s~~~~~~~p----~lp~l~~~~~rn-~~~~l~~L~~~Lk~g   82 (133)
                      +-|.+-+.-|+-+++.++.++=    +-+|.-....-..+--+-.+.|    +.||=.  .... ---+++.|+..|..+
T Consensus        20 a~Aa~l~~~M~~m~~~~k~~~~A~~a~~~k~al~~mr~aa~~Ak~~~P~kl~~k~~ds--~e~K~Y~~G~d~Li~qID~a   97 (128)
T PRK15058         20 SFAADLEDNMETLNDNLKVVEKTDNAAEVKDALTKMRAAALDAQKATPPKLEDKAPDS--PEMKDFRHGFDILVGQIDGA   97 (128)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHhccCCchhhccCCCC--HHHHHHHHHHHHHHHHHHHH
Confidence            3455789999999999987754    4455544444433333332222    222210  0000 022688899999999


Q ss_pred             HHhcccCCHHHHHHHHHHH
Q psy2692          83 YQLTTSGKFVDAIEKLRNV  101 (133)
Q Consensus        83 yk~~t~gKF~eAl~~Fr~i  101 (133)
                      -++..+||++||...-+.+
T Consensus        98 ~~la~~GkL~eAK~~a~~l  116 (128)
T PRK15058         98 LKLANEGKVKEAQAAAEQL  116 (128)
T ss_pred             HHHHhCCCHHHHHHHHHHH
Confidence            9999999999998665443


No 50 
>PF07361 Cytochrom_B562:  Cytochrome b562;  InterPro: IPR009155 Cytochrome b562 is a haem-containing protein that is expressed in the periplasm of Escherichia coli. In b-type cytochromes, the haem atom is not covalently attached to the polypeptide. Cytochrome b562 has a four-helical bundle structure that is structurally similar to that found in members of the cytochrome c family (IPR002321 from INTERPRO). Cytochrome b562 has a reduction potential of 167 mV, which sets the energy yield possible in metabolism and is also a key determinant of the rate at which redox reactions proceed [].; GO: 0005506 iron ion binding, 0009055 electron carrier activity, 0020037 heme binding, 0042597 periplasmic space; PDB: 4ER9_A 3IQ6_G 2QLA_B 3FOO_A 3M79_C 256B_A 3NMI_F 3HNK_A 3NMK_D 2BC5_A ....
Probab=53.31  E-value=18  Score=25.56  Aligned_cols=31  Identities=26%  Similarity=0.373  Sum_probs=27.7

Q ss_pred             ChHHHHHHHHHHHHhcccCCHHHHHHHHHHH
Q psy2692          71 SSIDLIAKIQQCYQLTTSGKFVDAIEKLRNV  101 (133)
Q Consensus        71 ~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~i  101 (133)
                      +++.|+..+..+-+++.+|++++|....+.+
T Consensus        61 Gl~~li~~id~a~~~~~~G~l~~AK~~l~~l   91 (103)
T PF07361_consen   61 GLDKLIDQIDKAEALAEAGKLDEAKAALKKL   91 (103)
T ss_dssp             HHHHHHHHHHHHHHHHHTTHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence            6888999999999999999999999877665


No 51 
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=53.29  E-value=16  Score=18.73  Aligned_cols=19  Identities=21%  Similarity=0.461  Sum_probs=16.3

Q ss_pred             HHHhccCHHHHHHHHHhhc
Q psy2692          10 SINKSSSFETAFKLLHDQI   28 (133)
Q Consensus        10 ~~~AAGsFe~A~~LLnrQi   28 (133)
                      .++.+|+++.|++++++-.
T Consensus         9 ~~~~~~~~~~a~~~~~~M~   27 (35)
T TIGR00756         9 GLCKAGRVEEALELFKEML   27 (35)
T ss_pred             HHHHCCCHHHHHHHHHHHH
Confidence            5788999999999998753


No 52 
>KOG0553|consensus
Probab=53.10  E-value=28  Score=29.58  Aligned_cols=89  Identities=13%  Similarity=0.140  Sum_probs=56.7

Q ss_pred             HHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccC----CC-CCCCccccCCCCCCCChHHHHHHHHHHH
Q psy2692           9 ESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYST----LP-LLAPVTAFPTRKVDRSSIDLIAKIQQCY   83 (133)
Q Consensus         9 ~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~----~p-~lp~l~~~~~rn~~~~l~~L~~~Lk~gy   83 (133)
                      -++.+.|+|+.|.+...+-|-+.   |==+.|-.-=.++++.+..    +- +...+        .++..+-..-.+-|.
T Consensus        89 N~~m~~~~Y~eAv~kY~~AI~l~---P~nAVyycNRAAAy~~Lg~~~~AVkDce~Al--------~iDp~yskay~RLG~  157 (304)
T KOG0553|consen   89 NKLMKNKDYQEAVDKYTEAIELD---PTNAVYYCNRAAAYSKLGEYEDAVKDCESAL--------SIDPHYSKAYGRLGL  157 (304)
T ss_pred             HHHHHhhhHHHHHHHHHHHHhcC---CCcchHHHHHHHHHHHhcchHHHHHHHHHHH--------hcChHHHHHHHHHHH
Confidence            36788999999999999986554   4333333222333333321    11 01111        223344444567788


Q ss_pred             HhcccCCHHHHHHHHHHHHHHcccc
Q psy2692          84 QLTTSGKFVDAIEKLRNVLICIPLL  108 (133)
Q Consensus        84 k~~t~gKF~eAl~~Fr~iL~~i~l~  108 (133)
                      -+...||+.+|++.|+..|+.=|--
T Consensus       158 A~~~~gk~~~A~~aykKaLeldP~N  182 (304)
T KOG0553|consen  158 AYLALGKYEEAIEAYKKALELDPDN  182 (304)
T ss_pred             HHHccCcHHHHHHHHHhhhccCCCc
Confidence            8899999999999999999887754


No 53 
>PF05538 Campylo_MOMP:  Campylobacter major outer membrane protein;  InterPro: IPR008439 Campylobacter are Gram-negative, spiral, microaerophilic bacteria. Campylobacter jejuni is one of the main causative agents of food poisoning in the developed world. This family consists of Campylobacter major outer membrane proteins. The major outer membrane protein (MOMP), a putative porin and a multifunction surface protein of Campylobacter jejuni, may play an important role in the adaptation of the organism to various host environments [].
Probab=52.76  E-value=4.6  Score=35.74  Aligned_cols=37  Identities=19%  Similarity=0.213  Sum_probs=32.7

Q ss_pred             CccccCChHHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhh
Q psy2692           1 MNLHKVTSESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSR   48 (133)
Q Consensus         1 ~~~~~~~~~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~   48 (133)
                      |.|-|+|-.+.||+|+|.+|           |-.||.+.++.|-.+-.
T Consensus         1 MKl~KlSLaAavA~gafStA-----------~A~PLeEaiKdVdvSG~   37 (431)
T PF05538_consen    1 MKLVKLSLAAAVALGAFSTA-----------SATPLEEAIKDVDVSGF   37 (431)
T ss_pred             CchHHHHHHHHHHhhhhhhc-----------cCCcHHHHhcccccceE
Confidence            88999999999999999987           77899999999976543


No 54 
>PF13424 TPR_12:  Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=52.43  E-value=55  Score=20.36  Aligned_cols=26  Identities=12%  Similarity=0.150  Sum_probs=21.5

Q ss_pred             HHHHHHhcccCCHHHHHHHHHHHHHH
Q psy2692          79 IQQCYQLTTSGKFVDAIEKLRNVLIC  104 (133)
Q Consensus        79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~  104 (133)
                      ..-|.-+...|++++|++.|+..+..
T Consensus         9 ~~la~~~~~~~~~~~A~~~~~~al~~   34 (78)
T PF13424_consen    9 NNLARVYRELGRYDEALDYYEKALDI   34 (78)
T ss_dssp             HHHHHHHHHTT-HHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence            45566777999999999999999988


No 55 
>PF10579 Rapsyn_N:  Rapsyn N-terminal myristoylation and linker region;  InterPro: IPR019568  Neuromuscular junction formation relies upon the clustering of acetylcholine receptors and other proteins in the muscle membrane. Rapsyn is a peripheral membrane protein that is selectively concentrated at the neuromuscular junction and is essential for the formation of synaptic acetylcholine receptor aggregates. Acetylcholine receptors fail to aggregate beneath nerve terminals in mice where rapsyn has been knocked out. The N-terminal six amino acids of rapsyn are its myristoylation site, and myristoylation is necessary for the targeting of the protein to the membrane []. ; GO: 0008270 zinc ion binding, 0033130 acetylcholine receptor binding, 0007268 synaptic transmission, 0005856 cytoskeleton, 0030054 cell junction, 0045211 postsynaptic membrane
Probab=51.79  E-value=28  Score=24.01  Aligned_cols=32  Identities=22%  Similarity=0.242  Sum_probs=28.8

Q ss_pred             HHHHHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          75 LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        75 L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      -...+.+|.|+...++=++|+..++++|..++
T Consensus         6 ak~~ie~GlkLY~~~~~~~Al~~W~~aL~k~~   37 (80)
T PF10579_consen    6 AKQQIEKGLKLYHQNETQQALQKWRKALEKIT   37 (80)
T ss_pred             HHHHHHHHHHHhccchHHHHHHHHHHHHhhcC
Confidence            34589999999999999999999999998875


No 56 
>KOG1840|consensus
Probab=51.77  E-value=34  Score=30.90  Aligned_cols=110  Identities=14%  Similarity=0.006  Sum_probs=56.1

Q ss_pred             hHHHHhccCHHHHHHHHHhhcCcc-----cccccHHH----HHHHHhhhhhhccCCCC-CCCccccCCCCCCCChHHHHH
Q psy2692           8 SESINKSSSFETAFKLLHDQIGVK-----NFLPYRQM----FLNSYLQSRTAYSTLPL-LAPVTAFPTRKVDRSSIDLIA   77 (133)
Q Consensus         8 ~~~~~AAGsFe~A~~LLnrQiGiv-----Nf~PLk~~----F~~~y~~s~~~~~~~p~-lp~l~~~~~rn~~~~l~~L~~   77 (133)
                      +..++..|+||.|.++-.+-+-++     .-.|.-..    |=.+|....-+..+++- -+.+..+..---..+...-.-
T Consensus       206 a~~y~~~g~~e~A~~l~k~Al~~l~k~~G~~hl~va~~l~~~a~~y~~~~k~~eAv~ly~~AL~i~e~~~G~~h~~va~~  285 (508)
T KOG1840|consen  206 AEMYAVQGRLEKAEPLCKQALRILEKTSGLKHLVVASMLNILALVYRSLGKYDEAVNLYEEALTIREEVFGEDHPAVAAT  285 (508)
T ss_pred             HHHHHHhccHHHHHHHHHHHHHHHHHccCccCHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHhcCCCCHHHHHH
Confidence            467888999998776655544331     22232222    33344444444444442 222221110000112222222


Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHH
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIE  117 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~  117 (133)
                      ..+-|+-+-..|||.||...+.+.+-+.-=+-..+..|+.
T Consensus       286 l~nLa~ly~~~GKf~EA~~~~e~Al~I~~~~~~~~~~~v~  325 (508)
T KOG1840|consen  286 LNNLAVLYYKQGKFAEAEEYCERALEIYEKLLGASHPEVA  325 (508)
T ss_pred             HHHHHHHHhccCChHHHHHHHHHHHHHHHHhhccChHHHH
Confidence            3344666699999999999999887665443334444443


No 57 
>PF09577 Spore_YpjB:  Sporulation protein YpjB (SpoYpjB);  InterPro: IPR014231 Proteins in thie entry, typified by YpjB, are restricted to a subset of the endospore-forming bacteria which includes Bacillus species, but not species. In Bacillus subtilis, ypjB was found to be part of the sigma-E regulon []. Sigma-E is a sporulation sigma factor that regulates expression in the mother cell compartment. Null mutants of ypjB show a sporulation defect, but this gene is not, however, a part of the endospore formation minimal gene set.
Probab=50.92  E-value=81  Score=25.66  Aligned_cols=40  Identities=18%  Similarity=0.111  Sum_probs=31.0

Q ss_pred             hHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHccccccc
Q psy2692          72 SIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVT  111 (133)
Q Consensus        72 l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~  111 (133)
                      +..|...--+.|++++.||++||......+--..+-....
T Consensus         2 w~eLd~~sd~~lqlvk~~~yeeA~q~l~~fs~~f~~~~~~   41 (232)
T PF09577_consen    2 WKELDQLSDEALQLVKQGKYEEAKQLLEYFSEQFTSVDFK   41 (232)
T ss_pred             hHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHhhcccc
Confidence            3456666788999999999999999888777666665553


No 58 
>PF12895 Apc3:  Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=50.06  E-value=13  Score=23.95  Aligned_cols=78  Identities=18%  Similarity=0.260  Sum_probs=39.8

Q ss_pred             ccCHHHHHHHHHhhcCcccccccHHHH----HHHHhhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhcccC
Q psy2692          14 SSSFETAFKLLHDQIGVKNFLPYRQMF----LNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTTSG   89 (133)
Q Consensus        14 AGsFe~A~~LLnrQiGivNf~PLk~~F----~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t~g   89 (133)
                      .|+++.|..+.++=+..-...| .+.+    -.+|....-+-.++..+-.+        ..+........-.|.-++..|
T Consensus         2 ~~~y~~Ai~~~~k~~~~~~~~~-~~~~~~~la~~~~~~~~y~~A~~~~~~~--------~~~~~~~~~~~l~a~~~~~l~   72 (84)
T PF12895_consen    2 QGNYENAIKYYEKLLELDPTNP-NSAYLYNLAQCYFQQGKYEEAIELLQKL--------KLDPSNPDIHYLLARCLLKLG   72 (84)
T ss_dssp             TT-HHHHHHHHHHHHHHHCGTH-HHHHHHHHHHHHHHTTHHHHHHHHHHCH--------THHHCHHHHHHHHHHHHHHTT
T ss_pred             CccHHHHHHHHHHHHHHCCCCh-hHHHHHHHHHHHHHCCCHHHHHHHHHHh--------CCCCCCHHHHHHHHHHHHHhC
Confidence            3778888888888776665543 2221    12333332222222211110        111112333333477888899


Q ss_pred             CHHHHHHHHHH
Q psy2692          90 KFVDAIEKLRN  100 (133)
Q Consensus        90 KF~eAl~~Fr~  100 (133)
                      +++||++.|..
T Consensus        73 ~y~eAi~~l~~   83 (84)
T PF12895_consen   73 KYEEAIKALEK   83 (84)
T ss_dssp             -HHHHHHHHHH
T ss_pred             CHHHHHHHHhc
Confidence            99999998875


No 59 
>cd03600 CLECT_thrombomodulin_like C-type lectin-like domain (CTLD) of the type found in human thrombomodulin(TM), Endosialin, C14orf27, and C1qR. CLECT_thrombomodulin_like: C-type lectin-like domain (CTLD) of the type found in human thrombomodulin(TM), Endosialin, C14orf27, and C1qR.  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  In these thrombomodulin-like proteins the residues involved in coordinating Ca2+ in the classical MBP-A CTLD are not conserved.  TM exerts anti-fibrinolytic and anti-inflammatory activity.  TM also regulates blood coagulation in the anticoagulant protein C pathway.  In this pathway, the procoagulant properties of thrombin (T) are lost when it binds TM.  TM also plays a key role in tumor biology.  It is expressed on endothelial cells and on several type of tumor cell including squamous cell carcinoma.  Loss of TM expression correlates with advanced stage and poor prognosis.  Loss of function of TM func
Probab=50.03  E-value=15  Score=26.17  Aligned_cols=42  Identities=19%  Similarity=0.296  Sum_probs=34.5

Q ss_pred             HHHhcc-cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHH
Q psy2692          82 CYQLTT-SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINI  125 (133)
Q Consensus        82 gyk~~t-~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~i  125 (133)
                      +|+.++ ...|.||...++..  -.-|.+++|.+|.+.+++++..
T Consensus         6 Cy~~~~~~~sw~~A~~~C~~~--gg~La~i~s~~E~~~v~~~l~~   48 (141)
T cd03600           6 CYTLHPQKLTFLEAQRSCIEL--GGNLATVRSGEEADVVSLLLAA   48 (141)
T ss_pred             eEEEeCCccCHHHHHHHHHhh--CCEeeecCCHHHHHHHHHHHhh
Confidence            566665 48899999999984  6788999999999999888754


No 60 
>PF13041 PPR_2:  PPR repeat family 
Probab=49.44  E-value=16  Score=21.40  Aligned_cols=20  Identities=20%  Similarity=0.391  Sum_probs=17.0

Q ss_pred             hcccCCHHHHHHHHHHHHHH
Q psy2692          85 LTTSGKFVDAIEKLRNVLIC  104 (133)
Q Consensus        85 ~~t~gKF~eAl~~Fr~iL~~  104 (133)
                      +.+.|++++|.+.|++....
T Consensus        13 ~~~~~~~~~a~~l~~~M~~~   32 (50)
T PF13041_consen   13 YCKAGKFEEALKLFKEMKKR   32 (50)
T ss_pred             HHHCcCHHHHHHHHHHHHHc
Confidence            46789999999999998754


No 61 
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=48.15  E-value=28  Score=26.34  Aligned_cols=32  Identities=16%  Similarity=0.220  Sum_probs=23.1

Q ss_pred             HHHHHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          75 LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        75 L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      ....+..|..++..|+|++|+..|+.++..-|
T Consensus        33 ~~~~~~~g~~~~~~~~~~~A~~~~~~~~~~~p   64 (235)
T TIGR03302        33 AEELYEEAKEALDSGDYTEAIKYFEALESRYP   64 (235)
T ss_pred             HHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCC
Confidence            44467777777777888888887777776655


No 62 
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=48.07  E-value=37  Score=22.11  Aligned_cols=25  Identities=20%  Similarity=0.380  Sum_probs=12.2

Q ss_pred             HHHHHhcccCCHHHHHHHHHHHHHH
Q psy2692          80 QQCYQLTTSGKFVDAIEKLRNVLIC  104 (133)
Q Consensus        80 k~gyk~~t~gKF~eAl~~Fr~iL~~  104 (133)
                      +.|..++..|++++|++.|..++..
T Consensus         7 ~~~~~~~~~~~~~~A~~~~~~~~~~   31 (119)
T TIGR02795         7 DAALLVLKAGDYADAIQAFQAFLKK   31 (119)
T ss_pred             HHHHHHHHcCCHHHHHHHHHHHHHH
Confidence            3444444455555555555555443


No 63 
>PF09976 TPR_21:  Tetratricopeptide repeat;  InterPro: IPR018704  This domain, found in various hypothetical prokaryotic proteins, has no known function. 
Probab=47.02  E-value=30  Score=24.67  Aligned_cols=30  Identities=20%  Similarity=0.243  Sum_probs=25.4

Q ss_pred             HHHHHHHHHhcccCCHHHHHHHHHHHHHHc
Q psy2692          76 IAKIQQCYQLTTSGKFVDAIEKLRNVLICI  105 (133)
Q Consensus        76 ~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i  105 (133)
                      ...|..|-..+..|+|++|+..|+.++-..
T Consensus        49 ~A~l~lA~~~~~~g~~~~A~~~l~~~~~~~   78 (145)
T PF09976_consen   49 LAALQLAKAAYEQGDYDEAKAALEKALANA   78 (145)
T ss_pred             HHHHHHHHHHHHCCCHHHHHHHHHHHHhhC
Confidence            346778888899999999999999998754


No 64 
>PF12854 PPR_1:  PPR repeat
Probab=46.98  E-value=23  Score=19.68  Aligned_cols=17  Identities=24%  Similarity=0.487  Sum_probs=14.8

Q ss_pred             HHHhccCHHHHHHHHHh
Q psy2692          10 SINKSSSFETAFKLLHD   26 (133)
Q Consensus        10 ~~~AAGsFe~A~~LLnr   26 (133)
                      .+-.+|.+|.|++++.+
T Consensus        16 ~~Ck~G~~~~A~~l~~~   32 (34)
T PF12854_consen   16 GYCKAGRVDEAFELFDE   32 (34)
T ss_pred             HHHHCCCHHHHHHHHHh
Confidence            46789999999999976


No 65 
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=46.01  E-value=95  Score=22.31  Aligned_cols=30  Identities=17%  Similarity=-0.057  Sum_probs=25.9

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHccc
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPL  107 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l  107 (133)
                      ..+-|.-+.+.|++++|+..|+..+..-|-
T Consensus        95 ~~~lg~~l~~~g~~~eAi~~~~~Al~~~p~  124 (144)
T PRK15359         95 VYQTGVCLKMMGEPGLAREAFQTAIKMSYA  124 (144)
T ss_pred             HHHHHHHHHHcCCHHHHHHHHHHHHHhCCC
Confidence            467788888999999999999999987764


No 66 
>PF00059 Lectin_C:  Lectin C-type domain;  InterPro: IPR001304 Lectins occur in plants, animals, bacteria and viruses. Initially described for their carbohydrate-binding activity [], they are now recognised as a more diverse group of proteins, some of which are involved in protein-protein, protein-lipid or protein-nucleic acid interactions []. There are at least twelve structural families of lectins:   C-type lectins, which are Ca+-dependent.  S-type (galectins), a widespread family of glycan-binding proteins []. I-type, which have an immunoglobulin-like fold and can recognise sialic acids, other sugars and glycosaminoglycans []. P-type, which bind phosphomannosyl receptors []. Pentraxins []. (Trout) egg lectins. Calreticulin and calnexin, which act as molecular chaperones of the endoplasmic reticulum []. ERGIC-53 and VIP-36 []. Discoidins []. Eel aggutinins (fucolectins) []. Annexin lectins []. Fibrinogen-type lectins, which includes ficolins, tachylectins 5A and 5B, and Limax flavus (Spotted garden slug) agglutinin (these proteins have clear distinctions from one another, but they share a homologous fibrinogen-like domain used for carbohydrate binding). Also unclassified orphan lectins, including amphoterin, Cel-II, complement factor H, thrombospondin, sailic acid-binding lectins, adherence lectin, and cytokins (such as tumour necrosis factor and several interleukins).   C-type lectins can be further divided into seven subgroups based on additional non-lectin domains and gene structure: (I) hyalectans, (II) asialoglycoprotein receptors, (III) collectins, (IV) selectins, (V) NK group transmembrane receptors, (VI) macrophage mannose receptors, and (VII) simple (single domain) lectins []. Therefore, lectins are a diverse group of proteins, both in terms of structure and activity. Carbohydrate binding ability may have evolved independently and sporadically in numerous unrelated families, where each evolved a structure that was conserved to fulfil some other activity and function. In general, animal lectins act as recognition molecules within the immune system, their functions involving defence against pathogens, cell trafficking, immune regulation and the prevention of autoimmunity [].; GO: 0005488 binding; PDB: 1T8D_A 2H2T_B 1T8C_A 2H2R_A 1TN3_A 1RJH_A 1HTN_A 3G8K_B 2E3X_B 1UMR_D ....
Probab=44.68  E-value=43  Score=21.33  Aligned_cols=33  Identities=9%  Similarity=0.160  Sum_probs=27.9

Q ss_pred             CHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692          90 KFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN  124 (133)
Q Consensus        90 KF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~  124 (133)
                      +|.+|...+++  +..-|++++|.+|.+-++..+.
T Consensus         4 ~~~~A~~~C~~--~~~~L~~i~~~~e~~~i~~~~~   36 (105)
T PF00059_consen    4 TWEEAQQYCQS--MGAHLASINSEEENDFIQSQLK   36 (105)
T ss_dssp             EHHHHHHHHHH--TTSEEB-GSSHHHHHHHHHHHH
T ss_pred             CHHHHHHHHhc--CCCEEeEeCCHHHhhhhhhccc
Confidence            58999999997  8888999999999888888765


No 67 
>KOG2168|consensus
Probab=44.22  E-value=83  Score=30.32  Aligned_cols=93  Identities=15%  Similarity=0.216  Sum_probs=57.2

Q ss_pred             HHHHhccCHHHHHHHHHhhcCcccccc-----cHHHHHHHHhhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHH
Q psy2692           9 ESINKSSSFETAFKLLHDQIGVKNFLP-----YRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCY   83 (133)
Q Consensus         9 ~~~~AAGsFe~A~~LLnrQiGivNf~P-----Lk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gy   83 (133)
                      +=|--||+||.|+.++|+-++=+=..|     .+...-.+-.+-..-+-..|+.+...       .++--.|.-++-..+
T Consensus       643 ~LY~lag~yd~al~link~LS~~l~~~~~~~~n~erl~~La~~~~~~y~~~~~~~~~~-------~~~t~~lLl~~~~~f  715 (835)
T KOG2168|consen  643 LLYHLAGDYDKALELINKLLSQVLHSPTLGQSNKERLGDLALSMNDIYESNKGDSAKV-------VVKTLSLLLDLVSFF  715 (835)
T ss_pred             HHHHHhhhhhHHHHHHHHHHHHHHhhcccCCcchhhHHHHHHHHHHHHHhccCcchhh-------HHHHHHHHHHHHHHH
Confidence            335679999999999999887655554     34433333333333333333333211       112223455667788


Q ss_pred             HhcccCCHHHHHHHHHHHHHHccccc
Q psy2692          84 QLTTSGKFVDAIEKLRNVLICIPLLV  109 (133)
Q Consensus        84 k~~t~gKF~eAl~~Fr~iL~~i~l~v  109 (133)
                      ..-..|++++|++..+. |+.||+..
T Consensus       716 ~~y~~~~~e~aL~~le~-l~LiP~~~  740 (835)
T KOG2168|consen  716 DLYHNGEWEEALSILEH-LDLIPLDP  740 (835)
T ss_pred             HHHhhhHHHHHHHHHHH-HhccCCCh
Confidence            88889999999998765 56788743


No 68 
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=44.09  E-value=37  Score=27.08  Aligned_cols=32  Identities=19%  Similarity=0.158  Sum_probs=27.4

Q ss_pred             HHHHHHHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692          77 AKIQQCYQLTTSGKFVDAIEKLRNVLICIPLL  108 (133)
Q Consensus        77 ~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~  108 (133)
                      .....|.+.+..|++.+|++.|+.++..-|..
T Consensus        34 ~~Y~~A~~~~~~g~y~~Ai~~f~~l~~~yP~s   65 (243)
T PRK10866         34 EIYATAQQKLQDGNWKQAITQLEALDNRYPFG   65 (243)
T ss_pred             HHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCC
Confidence            35588999999999999999999999988854


No 69 
>PF10938 YfdX:  YfdX protein;  InterPro: IPR021236  YfdX is a protein found in Proteobacteria of unknown function. The protein coding for this gene is regulated by EvgA in Escherichia coli []. ; PDB: 3DZA_C.
Probab=43.47  E-value=45  Score=25.12  Aligned_cols=40  Identities=23%  Similarity=0.228  Sum_probs=32.4

Q ss_pred             hHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHccccccc
Q psy2692          72 SIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVT  111 (133)
Q Consensus        72 l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~  111 (133)
                      +..-...++.|-++..+||..||......++-.+.+..++
T Consensus       114 L~~~~~av~~A~~ll~~~k~~eA~~aL~~A~~~lv~~s~~  153 (155)
T PF10938_consen  114 LAQTPAAVKQAAALLDEGKYYEANAALKQALDGLVFDSVD  153 (155)
T ss_dssp             HHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHT-EEEEEE
T ss_pred             HHhhHHHHHHHHHHHHCCCHHHHHHHHHHHhcCeEEEEee
Confidence            4555668999999999999999999999999888876553


No 70 
>PF07720 TPR_3:  Tetratricopeptide repeat;  InterPro: IPR011716 This entry includes tetratricopeptide-like repeats found in the LcrH/SycD-like chaperones [].; PDB: 3KS2_O 3GZ2_A 3GZ1_A 3GYZ_A 4AM9_A 2VGX_A 2VGY_A.
Probab=43.22  E-value=24  Score=20.34  Aligned_cols=20  Identities=25%  Similarity=0.473  Sum_probs=16.1

Q ss_pred             HHHHHhcccCCHHHHHHHHH
Q psy2692          80 QQCYQLTTSGKFVDAIEKLR   99 (133)
Q Consensus        80 k~gyk~~t~gKF~eAl~~Fr   99 (133)
                      --||.+-..||+++|++.|.
T Consensus         6 ~~a~~~y~~~ky~~A~~~~~   25 (36)
T PF07720_consen    6 GLAYNFYQKGKYDEAIHFFQ   25 (36)
T ss_dssp             HHHHHHHHTT-HHHHHHHHH
T ss_pred             HHHHHHHHHhhHHHHHHHHH
Confidence            45888999999999999944


No 71 
>KOG3807|consensus
Probab=42.99  E-value=24  Score=31.27  Aligned_cols=37  Identities=19%  Similarity=0.468  Sum_probs=29.5

Q ss_pred             HHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccC
Q psy2692          74 DLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTT  112 (133)
Q Consensus        74 ~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~  112 (133)
                      .+..+|.-+-  -+.||..||+..||++..-.|++.+-|
T Consensus       276 YIKRRLAMCA--RklGrlrEA~K~~RDL~ke~pl~t~ln  312 (556)
T KOG3807|consen  276 YIKRRLAMCA--RKLGRLREAVKIMRDLMKEFPLLTMLN  312 (556)
T ss_pred             HHHHHHHHHH--HHhhhHHHHHHHHHHHhhhccHHHHHH
Confidence            3455665554  467999999999999999999987766


No 72 
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=42.72  E-value=59  Score=22.08  Aligned_cols=32  Identities=19%  Similarity=0.313  Sum_probs=25.3

Q ss_pred             HHHHHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          75 LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        75 L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      .......|+.+...|++++|+..|+.++..-|
T Consensus        17 ~~~~~~~a~~~~~~~~~~~A~~~~~~~~~~~p   48 (135)
T TIGR02552        17 LEQIYALAYNLYQQGRYDEALKLFQLLAAYDP   48 (135)
T ss_pred             HHHHHHHHHHHHHcccHHHHHHHHHHHHHhCC
Confidence            44467788888888999999999988877544


No 73 
>PLN03088 SGT1,  suppressor of G2 allele of SKP1; Provisional
Probab=41.04  E-value=38  Score=28.40  Aligned_cols=29  Identities=10%  Similarity=0.077  Sum_probs=22.5

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      .+++|-+++..|+|.+|+..|+..|..-|
T Consensus         5 l~~~a~~a~~~~~~~~Ai~~~~~Al~~~P   33 (356)
T PLN03088          5 LEDKAKEAFVDDDFALAVDLYTQAIDLDP   33 (356)
T ss_pred             HHHHHHHHHHcCCHHHHHHHHHHHHHhCC
Confidence            45678888888888888888888887655


No 74 
>PF01535 PPR:  PPR repeat;  InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) [].  The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=40.01  E-value=30  Score=17.46  Aligned_cols=19  Identities=21%  Similarity=0.361  Sum_probs=15.4

Q ss_pred             hcccCCHHHHHHHHHHHHH
Q psy2692          85 LTTSGKFVDAIEKLRNVLI  103 (133)
Q Consensus        85 ~~t~gKF~eAl~~Fr~iL~  103 (133)
                      +...|++++|.+.|+.+..
T Consensus        10 ~~~~~~~~~a~~~~~~M~~   28 (31)
T PF01535_consen   10 YCKMGQFEEALEVFDEMRE   28 (31)
T ss_pred             HHccchHHHHHHHHHHHhH
Confidence            3568999999999998753


No 75 
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=39.66  E-value=35  Score=24.62  Aligned_cols=28  Identities=14%  Similarity=0.290  Sum_probs=23.8

Q ss_pred             HHHHHhcccCCHHHHHHHHHHHHHHccc
Q psy2692          80 QQCYQLTTSGKFVDAIEKLRNVLICIPL  107 (133)
Q Consensus        80 k~gyk~~t~gKF~eAl~~Fr~iL~~i~l  107 (133)
                      +.|+.++..|++++|+..|+..+..-|-
T Consensus        29 ~~g~~~~~~g~~~~A~~~~~~al~~~P~   56 (144)
T PRK15359         29 ASGYASWQEGDYSRAVIDFSWLVMAQPW   56 (144)
T ss_pred             HHHHHHHHcCCHHHHHHHHHHHHHcCCC
Confidence            5688999999999999999998877663


No 76 
>PF14561 TPR_20:  Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=39.64  E-value=1.2e+02  Score=20.62  Aligned_cols=34  Identities=21%  Similarity=0.247  Sum_probs=26.5

Q ss_pred             HHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          73 IDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        73 ~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      +++..++.-|-.++-.|++++|++.+-.++..-+
T Consensus        20 ~D~~ar~~lA~~~~~~g~~e~Al~~Ll~~v~~dr   53 (90)
T PF14561_consen   20 DDLDARYALADALLAAGDYEEALDQLLELVRRDR   53 (90)
T ss_dssp             T-HHHHHHHHHHHHHTT-HHHHHHHHHHHHCC-T
T ss_pred             CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCc
Confidence            4566678889999999999999999998887643


No 77 
>cd05492 Bromo_ZMYND11 Bromodomain; ZMYND11_like sub-family. ZMYND11 or BS69 is a ubiquitously expressed nuclear protein that has been shown to associate with chromatin. It interacts with chromatin remodeling factors and might play a role in chromatin remodeling and gene expression. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=39.29  E-value=1.3e+02  Score=21.43  Aligned_cols=56  Identities=9%  Similarity=0.023  Sum_probs=47.0

Q ss_pred             CCChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHH
Q psy2692          69 DRSSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKE  128 (133)
Q Consensus        69 ~~~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrE  128 (133)
                      |..++.+..+|+.| +   =..++|-..-++-|.+.....--.+..+...+++++.-|++
T Consensus        46 PmDL~tI~~kl~~~-~---Y~s~~ef~~Dv~LI~~N~~~yNg~~s~~~~~A~~l~~d~~~  101 (109)
T cd05492          46 HLDVADIQEKINSE-K---YTSLEEFKADALLLLHNTAIFHGADSEQYDAARWLYRDTCH  101 (109)
T ss_pred             CCcHHHHHHHHHcC-C---CCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH
Confidence            88999999999887 2   33477778888888899888888888999999999999975


No 78 
>PF13512 TPR_18:  Tetratricopeptide repeat
Probab=38.67  E-value=50  Score=24.91  Aligned_cols=34  Identities=18%  Similarity=0.230  Sum_probs=28.2

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHccccccc
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVT  111 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~  111 (133)
                      ..++|.+++..|++.+|++.|..|.-..|+---.
T Consensus        13 ly~~a~~~l~~~~Y~~A~~~le~L~~ryP~g~ya   46 (142)
T PF13512_consen   13 LYQEAQEALQKGNYEEAIKQLEALDTRYPFGEYA   46 (142)
T ss_pred             HHHHHHHHHHhCCHHHHHHHHHHHHhcCCCCccc
Confidence            4588888899999999999999999888886443


No 79 
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=38.08  E-value=1.5e+02  Score=25.93  Aligned_cols=20  Identities=15%  Similarity=0.096  Sum_probs=15.5

Q ss_pred             HHHhccCHHHHHHHHHhhcC
Q psy2692          10 SINKSSSFETAFKLLHDQIG   29 (133)
Q Consensus        10 ~~~AAGsFe~A~~LLnrQiG   29 (133)
                      .+...|+|+.|+..+++-+.
T Consensus        65 ~~~~~g~~~~A~~~~~~~~~   84 (899)
T TIGR02917        65 IYLALGDYAAAEKELRKALS   84 (899)
T ss_pred             HHHHcCCHHHHHHHHHHHHH
Confidence            46678889999988887654


No 80 
>PF05347 Complex1_LYR:  Complex 1 protein (LYR family);  InterPro: IPR008011 This family of short proteins includes proteins from the NADH-ubiquinone oxidoreductase complex I. The family includes the B14 subunit from bovine NADH-ubiquinone oxidoreductase B14 subunit Q02366 from SWISSPROT, and the B22 subunit from the human enzyme Q9Y6M9 from SWISSPROT. The family has been named LYR after a highly conserved tripeptide motif close to the N terminus of these proteins.  Members of this family also found in yeast which do contain this complex. In these organisms they are believed to be be required for iron-sulphur custer biogenesis.
Probab=37.57  E-value=50  Score=20.04  Aligned_cols=16  Identities=13%  Similarity=0.298  Sum_probs=10.7

Q ss_pred             HHHHHHHHHHHHHHhh
Q psy2692         115 EIEEAKQIINICKEYI  130 (133)
Q Consensus       115 e~~e~~eli~icrEYi  130 (133)
                      +.+++.+++..|++|+
T Consensus        42 d~~~I~~~l~~g~~~l   57 (59)
T PF05347_consen   42 DPEKIEELLKKGEEEL   57 (59)
T ss_pred             CHHHHHHHHHHHHHHh
Confidence            4556677777777764


No 81 
>KOG0545|consensus
Probab=36.50  E-value=32  Score=29.21  Aligned_cols=33  Identities=27%  Similarity=0.452  Sum_probs=27.4

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHcccccc
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVV  110 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv  110 (133)
                      .=++|-++|+.|+|.||...+|+.+-++--+..
T Consensus       181 l~q~GN~lfk~~~ykEA~~~YreAi~~l~~L~l  213 (329)
T KOG0545|consen  181 LHQEGNRLFKLGRYKEASSKYREAIICLRNLQL  213 (329)
T ss_pred             HHHhhhhhhhhccHHHHHHHHHHHHHHHHHHHh
Confidence            347899999999999999999998877654443


No 82 
>PF13512 TPR_18:  Tetratricopeptide repeat
Probab=35.03  E-value=67  Score=24.24  Aligned_cols=32  Identities=19%  Similarity=0.342  Sum_probs=28.2

Q ss_pred             HHHHHHHHHhcccCCHHHHHHHHHHHHHHccc
Q psy2692          76 IAKIQQCYQLTTSGKFVDAIEKLRNVLICIPL  107 (133)
Q Consensus        76 ~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l  107 (133)
                      ..+|..||-.+..|++++|+..+.+.|..=|-
T Consensus        48 qAqL~l~yayy~~~~y~~A~a~~~rFirLhP~   79 (142)
T PF13512_consen   48 QAQLDLAYAYYKQGDYEEAIAAYDRFIRLHPT   79 (142)
T ss_pred             HHHHHHHHHHHHccCHHHHHHHHHHHHHhCCC
Confidence            45899999999999999999999998877663


No 83 
>KOG4234|consensus
Probab=34.84  E-value=44  Score=27.62  Aligned_cols=31  Identities=19%  Similarity=0.284  Sum_probs=27.9

Q ss_pred             HHHHHhcccCCHHHHHHHHHHHHHHcccccc
Q psy2692          80 QQCYQLTTSGKFVDAIEKLRNVLICIPLLVV  110 (133)
Q Consensus        80 k~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv  110 (133)
                      ++|-++|..|.|.+|.+.|...|-++|-+..
T Consensus       100 ~EGN~~F~ngdyeeA~skY~~Ale~cp~~~~  130 (271)
T KOG4234|consen  100 KEGNELFKNGDYEEANSKYQEALESCPSTST  130 (271)
T ss_pred             HHHHHhhhcccHHHHHHHHHHHHHhCccccH
Confidence            5699999999999999999999999987643


No 84 
>KOG3677|consensus
Probab=34.80  E-value=49  Score=29.84  Aligned_cols=49  Identities=22%  Similarity=0.361  Sum_probs=35.7

Q ss_pred             HHHHHHhcccCCHHHHHHHHHHHHHHcccc----------cccChHHHHHHHHHHHHHH
Q psy2692          79 IQQCYQLTTSGKFVDAIEKLRNVLICIPLL----------VVTTKKEIEEAKQIINICK  127 (133)
Q Consensus        79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~----------vv~~~~e~~e~~eli~icr  127 (133)
                      -+.||...--|+|.||+..|-+||..|-=+          .-...++.+++.+++.||-
T Consensus       276 Y~VGFayLmmrryadai~~F~niLlyIqrtks~~~~~~y~~d~inKq~eqm~~llai~l  334 (525)
T KOG3677|consen  276 YQVGFAYLMMRRYADAIRVFLNILLYIQRTKSMFSRTTYQYDMINKQNEQMHHLLAICL  334 (525)
T ss_pred             eehhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhhhHhhhhhhHHHHHHHHHHHH
Confidence            456788888899999999999999875211          1113566777888888884


No 85 
>cd03594 CLECT_REG-1_like C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2). CLECT_REG-1_like: C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2).  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  REG-1 is a proliferating factor which participates in various kinds of tissue regeneration including pancreatic beta-cell regeneration, regeneration of intestinal mucosa, regeneration of motor neurons, and perhaps in tissue regeneration of damaged heart.  REG-1 may play a role on the pathophysiology of Alzheimer's disease and in the development of gastric cancers.  Its expression is correlated with reduced survival from early-stage colorectal cancer.  REG-1 also binds and aggregates 
Probab=33.74  E-value=37  Score=23.21  Aligned_cols=43  Identities=16%  Similarity=0.130  Sum_probs=33.1

Q ss_pred             HHHhc-ccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692          82 CYQLT-TSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN  124 (133)
Q Consensus        82 gyk~~-t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~  124 (133)
                      +|+++ +...+.||...+++.-...-|++++|.+|.+-+..++.
T Consensus        12 Cy~~~~~~~tw~~A~~~C~~~~~g~~La~i~s~~e~~~l~~~~~   55 (129)
T cd03594          12 CYGYFRQPLSWSDAELFCQKYGPGAHLASIHSPAEAAAIASLIS   55 (129)
T ss_pred             eeeEeccCcCHHHHHHHHHhcCCCceEcccCCHHHHHHHHHHHH
Confidence            56555 46789999999988632466888899999888887764


No 86 
>COG1484 DnaC DNA replication protein [DNA replication, recombination, and repair]
Probab=33.04  E-value=1.1e+02  Score=24.58  Aligned_cols=55  Identities=25%  Similarity=0.335  Sum_probs=40.6

Q ss_pred             CCChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccC-------hHHHHHHHHHHHHHHHh
Q psy2692          69 DRSSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTT-------KKEIEEAKQIINICKEY  129 (133)
Q Consensus        69 ~~~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~-------~~e~~e~~eli~icrEY  129 (133)
                      -++..++..+|+.++..   |..+   ..+...+..+++++.|+       +.+.+.+-|+|.-.+++
T Consensus       138 f~~~~el~~~Lk~~~~~---~~~~---~~l~~~l~~~dlLIiDDlG~~~~~~~~~~~~~q~I~~r~~~  199 (254)
T COG1484         138 FITAPDLLSKLKAAFDE---GRLE---EKLLRELKKVDLLIIDDIGYEPFSQEEADLLFQLISRRYES  199 (254)
T ss_pred             EEEHHHHHHHHHHHHhc---CchH---HHHHHHhhcCCEEEEecccCccCCHHHHHHHHHHHHHHHhh
Confidence            55788899999999987   4333   34556677889999984       55667888888777664


No 87 
>PF02064 MAS20:  MAS20 protein import receptor;  InterPro: IPR002056 Virtually all mitochondrial precursors are imported via the same mechanism []: precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 []. MAS20 is also known as TOM20.; GO: 0006605 protein targeting, 0006886 intracellular protein transport, 0005742 mitochondrial outer membrane translocase complex; PDB: 3AX3_A 3AWR_B 2V1S_A 3AX5_C 3AX2_C 1OM2_A 2V1T_B.
Probab=33.01  E-value=88  Score=22.93  Aligned_cols=31  Identities=29%  Similarity=0.443  Sum_probs=25.9

Q ss_pred             HHHHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          76 IAKIQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        76 ~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      .+.++.|=.+.+.|+..+|..+|-+.|...|
T Consensus        64 l~qV~lGE~L~~~G~~~~aa~hf~nAl~V~~   94 (121)
T PF02064_consen   64 LQQVQLGEQLLAQGDYEEAAEHFYNALKVCP   94 (121)
T ss_dssp             HHHHHHHHHHHHTT-HHHHHHHHHHHHHTSS
T ss_pred             HHHHHHHHHHHhCCCHHHHHHHHHHHHHhCC
Confidence            4468889999999999999999999887655


No 88 
>PF09613 HrpB1_HrpK:  Bacterial type III secretion protein (HrpB1_HrpK);  InterPro: IPR013394  This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=32.55  E-value=88  Score=24.12  Aligned_cols=42  Identities=21%  Similarity=0.347  Sum_probs=32.7

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHH
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICK  127 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icr  127 (133)
                      .+=+|.-.+..|.+.||+..||++-..-|-...        ++.++..|-
T Consensus        47 ~~~~~~l~i~r~~w~dA~rlLr~l~~~~~~~p~--------~kALlA~CL   88 (160)
T PF09613_consen   47 DLFDGWLHIVRGDWDDALRLLRELEERAPGFPY--------AKALLALCL   88 (160)
T ss_pred             HHHHHHHHHHhCCHHHHHHHHHHHhccCCCChH--------HHHHHHHHH
Confidence            345688999999999999999999777765543        466777774


No 89 
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=31.34  E-value=79  Score=25.72  Aligned_cols=33  Identities=18%  Similarity=0.136  Sum_probs=25.7

Q ss_pred             HHHHHHHHHh-cccCCHHHHHHHHHHHHHHcccc
Q psy2692          76 IAKIQQCYQL-TTSGKFVDAIEKLRNVLICIPLL  108 (133)
Q Consensus        76 ~~~Lk~gyk~-~t~gKF~eAl~~Fr~iL~~i~l~  108 (133)
                      ......|+.+ +..|++.+|+..|+.++...|=-
T Consensus       143 ~~~Y~~A~~l~~~~~~y~~Ai~af~~fl~~yP~s  176 (263)
T PRK10803        143 NTDYNAAIALVQDKSRQDDAIVAFQNFVKKYPDS  176 (263)
T ss_pred             HHHHHHHHHHHHhcCCHHHHHHHHHHHHHHCcCC
Confidence            3357778876 56789999999999999888753


No 90 
>KOG2376|consensus
Probab=31.18  E-value=78  Score=29.57  Aligned_cols=50  Identities=22%  Similarity=0.211  Sum_probs=37.5

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHH
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICK  127 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icr  127 (133)
                      -.+.|..+.+.||..+|++..+..+...-=.--++..++++.++=|.+||
T Consensus       178 ~yN~Ac~~i~~gky~qA~elL~kA~~~~~e~l~~~d~~eEeie~el~~Ir  227 (652)
T KOG2376|consen  178 LYNTACILIENGKYNQAIELLEKALRICREKLEDEDTNEEEIEEELNPIR  227 (652)
T ss_pred             HHHHHHHHHhcccHHHHHHHHHHHHHHHHHhhcccccchhhHHHHHHHHH
Confidence            45889999999999999999999876654433344444666777777776


No 91 
>PF12335 SBF2:  Myotubularin protein ;  InterPro: IPR022096  This domain family is found in eukaryotes, and is approximately 220 amino acids in length. The family is found in association with PF02141 from PFAM, PF03456 from PFAM, PF03455 from PFAM. This family is the middle region of SBF2, a member of the myotubularin family. Myotubularin-related proteins have been suggested to work in phosphoinositide-mediated signalling events that may also convey control of myelination. Mutations of SBF2 are implicated in Charcot-Marie-Tooth disease. 
Probab=30.84  E-value=1e+02  Score=24.94  Aligned_cols=52  Identities=19%  Similarity=0.323  Sum_probs=37.5

Q ss_pred             CCCCCccccCCCCC--CCChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          55 PLLAPVTAFPTRKV--DRSSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        55 p~lp~l~~~~~rn~--~~~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      |..||+...+.++.  ..+-+...+-|+.+..+.=.||..||-+.|-.++..+-
T Consensus         1 ~~~p~~~~~~~~~~~~~~~s~rrlevlr~ci~~if~~k~~e~~k~~~av~~~lk   54 (225)
T PF12335_consen    1 PMGPPVVSIVDKNGNVFANSARRLEVLRNCISFIFDNKILEARKSLPAVLRALK   54 (225)
T ss_pred             CCCCCchhHHhcccchhhhHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHc
Confidence            55677766666653  22344566778999999999999999999877666543


No 92 
>KOG4281|consensus
Probab=30.83  E-value=73  Score=26.13  Aligned_cols=45  Identities=22%  Similarity=0.312  Sum_probs=36.3

Q ss_pred             HHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHH
Q psy2692          73 IDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIE  117 (133)
Q Consensus        73 ~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~  117 (133)
                      ..|.+-.|+.|+-..-|-|+|+++..|++|-.+-=-.|.-++|..
T Consensus         5 q~l~ntck~~fs~~~pg~~~~nlqkLr~vLd~lkp~Dvgl~~~~~   49 (236)
T KOG4281|consen    5 QRLFNTCKEVFSTGGPGPFEENLQKLRSVLDELKPEDVGLEPEMQ   49 (236)
T ss_pred             HHHHHHHHHHhcCCCCCccHHHHHHHHHHHhhCCHhhcCcChhhh
Confidence            457778899998888889999999999999988776666555543


No 93 
>KOG0553|consensus
Probab=30.82  E-value=69  Score=27.27  Aligned_cols=38  Identities=16%  Similarity=0.250  Sum_probs=32.4

Q ss_pred             HHHHHH-HHHHHhcccCCHHHHHHHHHHHHHHccccccc
Q psy2692          74 DLIAKI-QQCYQLTTSGKFVDAIEKLRNVLICIPLLVVT  111 (133)
Q Consensus        74 ~L~~~L-k~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~  111 (133)
                      .+.++| .+|=++...++|++|+..+-..|..-|--.|-
T Consensus        79 ~~AE~LK~eGN~~m~~~~Y~eAv~kY~~AI~l~P~nAVy  117 (304)
T KOG0553|consen   79 ALAESLKNEGNKLMKNKDYQEAVDKYTEAIELDPTNAVY  117 (304)
T ss_pred             HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCCcchH
Confidence            466677 46999999999999999999999988876654


No 94 
>PF13525 YfiO:  Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=30.55  E-value=85  Score=23.89  Aligned_cols=31  Identities=16%  Similarity=0.313  Sum_probs=25.8

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLL  108 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~  108 (133)
                      .+..|+-.+..|++.+|+..|.+.+..-|=.
T Consensus        45 ~l~la~a~y~~~~y~~A~~~~~~fi~~yP~~   75 (203)
T PF13525_consen   45 QLMLAYAYYKQGDYEEAIAAYERFIKLYPNS   75 (203)
T ss_dssp             HHHHHHHHHHTT-HHHHHHHHHHHHHH-TT-
T ss_pred             HHHHHHHHHHcCCHHHHHHHHHHHHHHCCCC
Confidence            6788999999999999999999999988864


No 95 
>KOG0739|consensus
Probab=30.26  E-value=1.3e+02  Score=26.41  Aligned_cols=58  Identities=16%  Similarity=0.197  Sum_probs=42.1

Q ss_pred             hHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHH-HHHHHHHhh
Q psy2692          72 SIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQ-IINICKEYI  130 (133)
Q Consensus        72 l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~e-li~icrEYi  130 (133)
                      +..-++..++|-..=+++..+||+..|++.|....... ..+...+.+++ +-..|.||+
T Consensus         7 l~kaI~lv~kA~~eD~a~nY~eA~~lY~~aleYF~~~l-KYE~~~~kaKd~IraK~~EYL   65 (439)
T KOG0739|consen    7 LQKAIDLVKKAIDEDNAKNYEEALRLYQNALEYFLHAL-KYEANNKKAKDSIRAKFTEYL   65 (439)
T ss_pred             HHHHHHHHHHHhhhcchhchHHHHHHHHHHHHHHHHHH-HhhhcChhHHHHHHHHHHHHH
Confidence            44456677888899999999999999999987766543 33434445566 446788886


No 96 
>cd00037 CLECT C-type lectin (CTL)/C-type lectin-like (CTLD) domain. CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs.  Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice.  Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis;  P (platlet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initia
Probab=29.74  E-value=46  Score=21.07  Aligned_cols=35  Identities=14%  Similarity=0.109  Sum_probs=28.6

Q ss_pred             cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692          88 SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN  124 (133)
Q Consensus        88 ~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~  124 (133)
                      ...|++|.+.+++.=  .-|+.+++.+|.+-+.+.+.
T Consensus         9 ~~~~~~A~~~C~~~~--~~L~~~~~~~e~~~i~~~~~   43 (116)
T cd00037           9 KLTWEEAQEYCRSLG--GHLASIHSEEENDFLASLLK   43 (116)
T ss_pred             ccCHHHHHHHHHHcC--CEEcccCCHHHHHHHHHHHh
Confidence            678999999999854  67888899888888877664


No 97 
>COG4989 Predicted oxidoreductase [General function prediction only]
Probab=29.40  E-value=97  Score=26.23  Aligned_cols=47  Identities=32%  Similarity=0.458  Sum_probs=34.0

Q ss_pred             ccCHHHHHHHHHhh-----cCcccccccHHHHHHHHhhhhhhccCCCCCCCcc
Q psy2692          14 SSSFETAFKLLHDQ-----IGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVT   61 (133)
Q Consensus        14 AGsFe~A~~LLnrQ-----iGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~   61 (133)
                      +..--.|++.|+++     +|+-||.|-.=.+++.|..-....-++- +.|+.
T Consensus       135 ~eeVAeAf~~L~~sGKVr~fGVSNf~p~Q~~LL~s~l~~~LvtNQle-lS~~~  186 (298)
T COG4989         135 AEEVAEAFTHLHKSGKVRHFGVSNFNPAQFELLQSRLPFTLVTNQLE-LSPLH  186 (298)
T ss_pred             HHHHHHHHHHHHhcCCeeeeecCCCCHHHHHHHHHhccchhhhccee-ecccc
Confidence            34445788999865     8999999999999998887765544443 44443


No 98 
>cd05508 Bromo_RACK7 Bromodomain, RACK7_like subfamily. RACK7 (also called human protein kinase C-binding protein) was identified as a potential tumor suppressor genes, it shares domain architecture with BS69/ZMYND11; both have been implicated in the regulation of cellular proliferation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=29.32  E-value=1.9e+02  Score=20.02  Aligned_cols=56  Identities=16%  Similarity=0.151  Sum_probs=44.0

Q ss_pred             CCChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHH
Q psy2692          69 DRSSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKE  128 (133)
Q Consensus        69 ~~~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrE  128 (133)
                      |..++.+..+|+.|.    =...+|-..-++-|........-++..-...++++.+.|+.
T Consensus        42 PmDL~tI~~kl~~~~----Y~s~~ef~~Dv~LI~~Na~~YN~~~s~i~~~A~~l~~~~~~   97 (99)
T cd05508          42 PMDLSTLEKNVRKKA----YGSTDAFLADAKWILHNAIIYNGGDHKLTQAAKAIVKICEQ   97 (99)
T ss_pred             CCCHHHHHHHHhcCC----CCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHh
Confidence            889999999997664    23466667777888888888777777888888999999874


No 99 
>PF07304 SRA1:  Steroid receptor RNA activator (SRA1);  InterPro: IPR009917 This entry consists of several hypothetical mammalian steroid receptor RNA activator proteins. The SRA-RNAs encode stable proteins that are widely expressed and upregulated in breast cancer cell lines. SRA-RNA is a steroid receptor co-activator which acts as a functional RNA. This domain is also found at the C terminus of Sec31, a component of the coat protein complex II (COPII, which promotes the formation of transport vesicles from the endoplasmic reticulum (ER). COPII has two main functions, the physical deformation of the endoplasmic reticulum membrane into vesicles and the selection of cargo molecules. ; PDB: 2YRU_A.
Probab=29.32  E-value=2.4e+02  Score=21.26  Aligned_cols=47  Identities=17%  Similarity=0.237  Sum_probs=34.5

Q ss_pred             HHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHH----HHHHHHHHHHH
Q psy2692          75 LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIE----EAKQIINICKE  128 (133)
Q Consensus        75 L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~----e~~eli~icrE  128 (133)
                      +.++|.+=.++..+++|+.|.++.++|+-       +.-+|..    -+|+||.++|.
T Consensus        90 v~~~L~~L~~aL~~~d~~~A~~Ih~~L~t-------~h~~E~~~WmvGVKRLI~~~r~  140 (157)
T PF07304_consen   90 VVDKLHQLAQALQARDYDAADEIHVDLMT-------DHVDECGNWMVGVKRLIAMARN  140 (157)
T ss_dssp             HHHHHHHHHHHHHHT-HHHHHHHHHHHHH-------SSHHHHTTTHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHcCCHHHHHHHHHHHHh-------ccHHHhhhHHHHHHHHHHHHHh
Confidence            55677788889999999999999988763       4344443    47888888875


No 100
>PF10255 Paf67:  RNA polymerase I-associated factor PAF67;  InterPro: IPR019382  RNA polymerase I is a multi-subunit enzyme and its transcription competence is dependent on the presence of PAF67 []. 
Probab=28.59  E-value=83  Score=27.65  Aligned_cols=50  Identities=18%  Similarity=0.294  Sum_probs=36.5

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHccccc--ccC--------hHHHHHHHHHHHHHH
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIPLLV--VTT--------KKEIEEAKQIINICK  127 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~v--v~~--------~~e~~e~~eli~icr  127 (133)
                      --..||..+=-+++.||+..|.+||..+-=.-  -.+        .+-.+++-.++.||-
T Consensus       167 ~YyvGFaylMlrRY~DAir~f~~iL~yi~r~k~~~~~~~~q~d~i~K~~eqMyaLlAic~  226 (404)
T PF10255_consen  167 YYYVGFAYLMLRRYADAIRTFSQILLYIQRTKNQYHQRSYQYDQINKKNEQMYALLAICL  226 (404)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccchhhHHHhHHHHHHHHHHHHH
Confidence            34678999999999999999999998765443  000        145566777888874


No 101
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=28.59  E-value=3.3e+02  Score=25.22  Aligned_cols=99  Identities=15%  Similarity=0.050  Sum_probs=50.1

Q ss_pred             hHHHHhccCHHHHHHHHHhhcCccccc------ccHHHHH-----HHHhhhhhhccCCCCC-CC---ccccCCCCCCCCh
Q psy2692           8 SESINKSSSFETAFKLLHDQIGVKNFL------PYRQMFL-----NSYLQSRTAYSTLPLL-AP---VTAFPTRKVDRSS   72 (133)
Q Consensus         8 ~~~~~AAGsFe~A~~LLnrQiGivNf~------PLk~~F~-----~~y~~s~~~~~~~p~l-p~---l~~~~~rn~~~~l   72 (133)
                      +..+...|+++.|.+.+++=+-.-...      ....++.     +-|..+...+..+... |+   +...+.++ | +-
T Consensus       279 a~~yl~~g~~e~A~~~l~~~l~~~p~~~~~~~~~~~~L~~a~~~~g~~~eA~~~l~~~~~~~P~~~~~~~~~~~~-p-~~  356 (765)
T PRK10049        279 ASAYLKLHQPEKAQSILTELFYHPETIADLSDEELADLFYSLLESENYPGALTVTAHTINNSPPFLRLYGSPTSI-P-ND  356 (765)
T ss_pred             HHHHHhcCCcHHHHHHHHHHhhcCCCCCCCChHHHHHHHHHHHhcccHHHHHHHHHHHhhcCCceEeecCCCCCC-C-Cc
Confidence            346788999999999998744322111      1122222     1122233333322222 21   11111111 1 11


Q ss_pred             HHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692          73 IDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLL  108 (133)
Q Consensus        73 ~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~  108 (133)
                      ..+...+..|.-+...|++++|+..|+.++...|--
T Consensus       357 ~~~~a~~~~a~~l~~~g~~~eA~~~l~~al~~~P~n  392 (765)
T PRK10049        357 DWLQGQSLLSQVAKYSNDLPQAEMRARELAYNAPGN  392 (765)
T ss_pred             hHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCC
Confidence            123334455566667788888888888888776653


No 102
>cd06131 DNA_pol_III_epsilon_Ecoli_like DEDDh 3'-5' exonuclease domain of the epsilon subunit of Escherichia coli DNA polymerase III and similar proteins. This subfamily is composed of the epsilon subunit of Escherichia coli DNA polymerase III (Pol III) and similar proteins. Pol III is the main DNA replicating enzyme in bacteria, functioning as the chromosomal replicase. It is a holoenzyme complex of ten different subunits, three of which (alpha, epsilon, and theta) compose the catalytic core. The Pol III epsilon subunit, encoded by the dnaQ gene, is a DEDDh-type 3'-5' exonuclease which is responsible for the proofreading activity of the polymerase, increasing the fidelity of DNA synthesis. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The epsilon 
Probab=28.38  E-value=48  Score=24.04  Aligned_cols=28  Identities=21%  Similarity=0.296  Sum_probs=24.7

Q ss_pred             ccccCChHHHHhccCHHHHHHHHHhhcC
Q psy2692           2 NLHKVTSESINKSSSFETAFKLLHDQIG   29 (133)
Q Consensus         2 ~~~~~~~~~~~AAGsFe~A~~LLnrQiG   29 (133)
                      .+|.+|++.+..++.|+.+++-+...+|
T Consensus        53 ~ihGIt~e~l~~~~~~~~v~~~l~~~l~   80 (167)
T cd06131          53 KVHGITDEFLADKPKFAEIADEFLDFIR   80 (167)
T ss_pred             HHhCCCHHHHhcCCCHHHHHHHHHHHHC
Confidence            4799999999999999999998887765


No 103
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=28.27  E-value=58  Score=28.96  Aligned_cols=28  Identities=18%  Similarity=0.297  Sum_probs=25.5

Q ss_pred             HHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          79 IQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        79 Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      ...|..++..|+|++|+..|...|...|
T Consensus       131 k~~G~~~~~~~~~~~Ai~~y~~al~~~p  158 (615)
T TIGR00990       131 KEKGNKAYRNKDFNKAIKLYSKAIECKP  158 (615)
T ss_pred             HHHHHHHHHcCCHHHHHHHHHHHHhcCC
Confidence            4679999999999999999999998777


No 104
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=27.59  E-value=2.1e+02  Score=22.97  Aligned_cols=20  Identities=15%  Similarity=0.172  Sum_probs=13.4

Q ss_pred             HHHhccCHHHHHHHHHhhcC
Q psy2692          10 SINKSSSFETAFKLLHDQIG   29 (133)
Q Consensus        10 ~~~AAGsFe~A~~LLnrQiG   29 (133)
                      .+...|+|+.|++.+++-+.
T Consensus       116 ~~~~~g~~~~A~~~~~~~l~  135 (389)
T PRK11788        116 DYLKAGLLDRAEELFLQLVD  135 (389)
T ss_pred             HHHHCCCHHHHHHHHHHHHc
Confidence            34566777777777777654


No 105
>PF14769 CLAMP:  Flagellar C1a complex subunit C1a-32
Probab=27.46  E-value=1.6e+02  Score=20.28  Aligned_cols=21  Identities=10%  Similarity=0.241  Sum_probs=18.2

Q ss_pred             ccCCHHHHHHHHHHHHHHccc
Q psy2692          87 TSGKFVDAIEKLRNVLICIPL  107 (133)
Q Consensus        87 t~gKF~eAl~~Fr~iL~~i~l  107 (133)
                      ....++|+++.|+++|...-.
T Consensus        34 ~~~~~~~~~~~fk~~l~~~sv   54 (101)
T PF14769_consen   34 KGMSLEDSFKYFKELLLRHSV   54 (101)
T ss_pred             ccCCHHHHHHHHHHHHHHhcc
Confidence            889999999999999966544


No 106
>PF09670 Cas_Cas02710:  CRISPR-associated protein (Cas_Cas02710)
Probab=27.42  E-value=1.8e+02  Score=24.88  Aligned_cols=45  Identities=16%  Similarity=0.301  Sum_probs=33.8

Q ss_pred             HHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHh
Q psy2692          77 AKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEY  129 (133)
Q Consensus        77 ~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEY  129 (133)
                      ..++.+-.++..+.|..|...|..+...+     +.+++   .+-+-.+|+=|
T Consensus       133 ~~~~~a~~l~n~~~y~aA~~~l~~l~~rl-----~~~~~---~~~~~~l~~~y  177 (379)
T PF09670_consen  133 REWRRAKELFNRYDYGAAARILEELLRRL-----PGREE---YQRYKDLCEGY  177 (379)
T ss_pred             HHHHHHHHHHhcCCHHHHHHHHHHHHHhC-----Cchhh---HHHHHHHHHHH
Confidence            36789999999999999999999999872     33333   44566666654


No 107
>PF10516 SHNi-TPR:  SHNi-TPR;  InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B.  This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat []. 
Probab=27.26  E-value=71  Score=18.66  Aligned_cols=21  Identities=24%  Similarity=0.216  Sum_probs=16.8

Q ss_pred             HhcccCCHHHHHHHHHHHHHH
Q psy2692          84 QLTTSGKFVDAIEKLRNVLIC  104 (133)
Q Consensus        84 k~~t~gKF~eAl~~Fr~iL~~  104 (133)
                      =....++|++|+.-|++.|-.
T Consensus        10 isle~e~f~qA~~D~~~aL~i   30 (38)
T PF10516_consen   10 ISLENENFEQAIEDYEKALEI   30 (38)
T ss_pred             HHHHhccHHHHHHHHHHHHHH
Confidence            356788999999999988753


No 108
>COG4063 MtrA Tetrahydromethanopterin S-methyltransferase, subunit A [Coenzyme metabolism]
Probab=27.15  E-value=77  Score=25.75  Aligned_cols=38  Identities=18%  Similarity=0.277  Sum_probs=25.9

Q ss_pred             cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHH
Q psy2692          88 SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKE  128 (133)
Q Consensus        88 ~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrE  128 (133)
                      .|-=.||++.||+.+-.+-++.++|.++   ++..++-|++
T Consensus       115 ENi~~eaveRfqqqvelvdlid~eD~~~---I~~~v~ecv~  152 (238)
T COG4063         115 ENIPDEAVERFQQQVELVDLIDVEDPDE---ITAKVEECVE  152 (238)
T ss_pred             hcCCHHHHHHHHHHeeeehhcccCCHHH---HHHHHHHHHh
Confidence            3445799999998887777777766444   4455555554


No 109
>KOG0543|consensus
Probab=26.28  E-value=2.1e+02  Score=25.33  Aligned_cols=51  Identities=20%  Similarity=0.207  Sum_probs=41.4

Q ss_pred             HHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHh
Q psy2692          75 LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEY  129 (133)
Q Consensus        75 L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEY  129 (133)
                      .....+.|--+.+.|.+++|...|+.++..-|-    |++-..|+..+-...|+|
T Consensus       291 ~KALyRrG~A~l~~~e~~~A~~df~ka~k~~P~----Nka~~~el~~l~~k~~~~  341 (397)
T KOG0543|consen  291 VKALYRRGQALLALGEYDLARDDFQKALKLEPS----NKAARAELIKLKQKIREY  341 (397)
T ss_pred             hhHHHHHHHHHHhhccHHHHHHHHHHHHHhCCC----cHHHHHHHHHHHHHHHHH
Confidence            344567888999999999999999999998884    466667777777777776


No 110
>TIGR02878 spore_ypjB sporulation protein YpjB. Members of this protein, YpjB, family are restricted to a subset of endospore-forming bacteria, including Bacillus species but not CLostridium or some others. In Bacillus subtilis, ypjB was found to be part of the sigma-E regulon, where sigma-E is a sporulation sigma factor that regulates expression in the mother cell compartment. Null mutants of ypjB show a sporulation defect. This protein family is not, however, a part of the endospore formation minimal gene set.
Probab=26.23  E-value=79  Score=25.96  Aligned_cols=32  Identities=22%  Similarity=0.169  Sum_probs=24.1

Q ss_pred             HHHHHHHHHHHHhcccCCHHHHHHHHHHHHHH
Q psy2692          73 IDLIAKIQQCYQLTTSGKFVDAIEKLRNVLIC  104 (133)
Q Consensus        73 ~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~  104 (133)
                      ..|.+...++.++++.|+|+||.......=-.
T Consensus         3 ~eL~~lsD~~LQl~K~~~yeeAkqvL~yFs~~   34 (233)
T TIGR02878         3 EELNDLSDQVLQLTKQARYEEAKQVLEYFSKQ   34 (233)
T ss_pred             hhHhhhhHHHHHHHHhhhHHHHHHHHHHHHHH
Confidence            45666678899999999999998765544333


No 111
>cd03602 CLECT_1 C-type lectin (CTL)/C-type lectin-like (CTLD) domain subgroup 1; a subgroup of protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. CLECT_1: C-type lectin (CTL)/C-type lectin-like (CTLD) domain subgroup 1; a subgroup of protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces including CaCO3 and ice.  Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions.  CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers from which ligand-binding sites project in differe
Probab=26.17  E-value=1.3e+02  Score=20.00  Aligned_cols=36  Identities=14%  Similarity=0.132  Sum_probs=29.8

Q ss_pred             ccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692          87 TSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN  124 (133)
Q Consensus        87 t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~  124 (133)
                      ....+.||...++..  -.-|++++|++|.+.++.++.
T Consensus         8 ~~~~w~~A~~~C~~~--g~~La~i~s~~e~~~l~~~~~   43 (108)
T cd03602           8 ESKTWSEAQQYCREN--YTDLATVQNQEDNALLSNLSR   43 (108)
T ss_pred             cccCHHHHHHHHHHH--CCccCeecCHHHHHHHHHHHh
Confidence            467899999999985  567888899999998888774


No 112
>PHA02841 hypothetical protein; Provisional
Probab=26.12  E-value=35  Score=23.93  Aligned_cols=33  Identities=18%  Similarity=0.411  Sum_probs=25.7

Q ss_pred             HHHHHhhcCcccccccHHHHHHHHhhhhhh-ccCCC
Q psy2692          21 FKLLHDQIGVKNFLPYRQMFLNSYLQSRTA-YSTLP   55 (133)
Q Consensus        21 ~~LLnrQiGivNf~PLk~~F~~~y~~s~~~-~~~~p   55 (133)
                      .++.+|.+...||  +|.-|...|..|+-- +.++.
T Consensus         6 ir~IKRKI~sLNF--~KK~FKK~YS~ckeigvtsin   39 (103)
T PHA02841          6 VKSIKRKIYSLNF--FKKTFKKTYSPCKEIGVTSIN   39 (103)
T ss_pred             HHHHHHHHHHHHH--HHHHHHHhhChHhHhCeeeec
Confidence            5788999999998  799999999888743 33333


No 113
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=26.06  E-value=1.6e+02  Score=23.38  Aligned_cols=32  Identities=9%  Similarity=0.106  Sum_probs=28.2

Q ss_pred             HHHHHHHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692          77 AKIQQCYQLTTSGKFVDAIEKLRNVLICIPLL  108 (133)
Q Consensus        77 ~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~  108 (133)
                      ..+..|+-.+..|++++|+..|++.+..-|=-
T Consensus        71 a~l~la~ayy~~~~y~~A~~~~e~fi~~~P~~  102 (243)
T PRK10866         71 VQLDLIYAYYKNADLPLAQAAIDRFIRLNPTH  102 (243)
T ss_pred             HHHHHHHHHHhcCCHHHHHHHHHHHHHhCcCC
Confidence            36788999999999999999999999887753


No 114
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=26.06  E-value=3.5e+02  Score=21.73  Aligned_cols=21  Identities=14%  Similarity=0.186  Sum_probs=15.8

Q ss_pred             HHHhccCHHHHHHHHHhhcCc
Q psy2692          10 SINKSSSFETAFKLLHDQIGV   30 (133)
Q Consensus        10 ~~~AAGsFe~A~~LLnrQiGi   30 (133)
                      .+...|+++.|...+.+-+..
T Consensus        44 ~~~~~~~~~~A~~~~~~al~~   64 (389)
T PRK11788         44 NFLLNEQPDKAIDLFIEMLKV   64 (389)
T ss_pred             HHHhcCChHHHHHHHHHHHhc
Confidence            445678888888888887765


No 115
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=25.73  E-value=98  Score=22.39  Aligned_cols=29  Identities=7%  Similarity=0.166  Sum_probs=24.4

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      .+..|.-....|++++|+..|+..+...+
T Consensus        38 ~~~~g~~~~~~g~~~~A~~~~~~al~l~~   66 (168)
T CHL00033         38 YYRDGMSAQSEGEYAEALQNYYEAMRLEI   66 (168)
T ss_pred             HHHHHHHHHHcCCHHHHHHHHHHHHhccc
Confidence            56778888889999999999999987643


No 116
>PRK09637 RNA polymerase sigma factor SigZ; Provisional
Probab=25.60  E-value=84  Score=23.41  Aligned_cols=32  Identities=13%  Similarity=0.162  Sum_probs=21.9

Q ss_pred             CHHHHHHHHHHHHHHcccccccChHHHHHHHH
Q psy2692          90 KFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQ  121 (133)
Q Consensus        90 KF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~e  121 (133)
                      ||++..+.|...++.+....+.++++.+++-|
T Consensus         2 ~~~~l~~~y~~~l~~~~~~~~~~~~~aeDi~Q   33 (181)
T PRK09637          2 KLESIWSEYKAQLKAFLHSRVSNEADVDDLLQ   33 (181)
T ss_pred             cHHHHHHHHHHHHHHHHHHHhCChhhHHHHHH
Confidence            56777777777777777766666666666544


No 117
>PRK10564 maltose regulon periplasmic protein; Provisional
Probab=24.70  E-value=65  Score=27.40  Aligned_cols=22  Identities=27%  Similarity=0.517  Sum_probs=17.8

Q ss_pred             HHHHhccCHHHHHHHHHh--hcCc
Q psy2692           9 ESINKSSSFETAFKLLHD--QIGV   30 (133)
Q Consensus         9 ~~~~AAGsFe~A~~LLnr--QiGi   30 (133)
                      .+.|++||.+.||+||++  .+|+
T Consensus       265 ~~AVk~gDi~KAL~LldEAe~LG~  288 (303)
T PRK10564        265 KQAVKKGDVDKALKLLDEAERLGS  288 (303)
T ss_pred             HHHHHcCCHHHHHHHHHHHHHhCC
Confidence            356899999999999985  4554


No 118
>PRK09591 celC cellobiose phosphotransferase system IIA component; Reviewed
Probab=24.30  E-value=80  Score=22.48  Aligned_cols=30  Identities=13%  Similarity=0.179  Sum_probs=23.1

Q ss_pred             ChHHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHh
Q psy2692           7 TSESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYL   45 (133)
Q Consensus         7 ~~~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~   45 (133)
                      .+...+..|+|+.|-.+|.+-         +..|.+.+.
T Consensus        26 eAl~~ak~gdf~~A~~~l~eA---------~~~l~~AH~   55 (104)
T PRK09591         26 EAFAAMREGNFDLAEQKLNQS---------NEELLEAHH   55 (104)
T ss_pred             HHHHHHHcCCHHHHHHHHHHH---------HHHHHHHHH
Confidence            356788999999999999886         555666553


No 119
>COG0329 DapA Dihydrodipicolinate synthase/N-acetylneuraminate lyase [Amino acid transport and metabolism / Cell envelope biogenesis, outer membrane]
Probab=24.24  E-value=3e+02  Score=22.67  Aligned_cols=81  Identities=17%  Similarity=0.154  Sum_probs=50.7

Q ss_pred             ccCHHHHHHHHHhhc--CcccccccHHHHHHHH-hhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhcccCC
Q psy2692          14 SSSFETAFKLLHDQI--GVKNFLPYRQMFLNSY-LQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTTSGK   90 (133)
Q Consensus        14 AGsFe~A~~LLnrQi--GivNf~PLk~~F~~~y-~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t~gK   90 (133)
                      .|+++.-.++...--  ...-|...=..+.... .+++.++++..+.-|                 +...+=|+++.+|+
T Consensus       167 ~gd~~~~~~~~~~~~~~~f~v~~G~d~~~~~~~~~G~~G~is~~~N~~p-----------------~~~~~l~~~~~~g~  229 (299)
T COG0329         167 SGDLDRLEEIIAALGDRDFIVLSGDDELALPALLLGADGVISVTANVAP-----------------ELAVELYRAAKAGD  229 (299)
T ss_pred             CcCHHHHHHHHHhcCccCeeEEeCchHHHHHHHhCCCCeEEecccccCH-----------------HHHHHHHHHHHcCC
Confidence            455555544433322  2344445555555444 366666666665444                 24566778888999


Q ss_pred             HHHHHHHHHHHHHHccccccc
Q psy2692          91 FVDAIEKLRNVLICIPLLVVT  111 (133)
Q Consensus        91 F~eAl~~Fr~iL~~i~l~vv~  111 (133)
                      +++|....+.++..+.++...
T Consensus       230 ~~~A~~l~~~l~~l~~~~~~~  250 (299)
T COG0329         230 IEEARELQDRLLPLIRLLFRE  250 (299)
T ss_pred             HHHHHHHHHHHHHHHHHHhcC
Confidence            999999999998877666554


No 120
>PF04048 Sec8_exocyst:  Sec8 exocyst complex component specific domain;  InterPro: IPR007191 Sec8 is a component of the exocyst complex involved in the docking of exocystic vesicles with a fusion site on the plasma membrane. The exocyst complex is composed of Sec3, Sec5, Sec6, Sec8, Sec10, Sec15, Exo70 and Exo84.; GO: 0006904 vesicle docking involved in exocytosis, 0015031 protein transport, 0000145 exocyst
Probab=24.12  E-value=1.4e+02  Score=21.82  Aligned_cols=55  Identities=20%  Similarity=0.303  Sum_probs=39.9

Q ss_pred             hHHHHHHHHHHHHhccc---CCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692          72 SIDLIAKIQQCYQLTTS---GKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI  130 (133)
Q Consensus        72 l~~L~~~Lk~gyk~~t~---gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi  130 (133)
                      +..+.+++..+.+.+-.   ..|..++..|..|+..+-    ++++.+.++++-+.-|+.=+
T Consensus        45 f~~~~~~~~~~L~~vV~eh~q~Fn~sI~sy~~i~~~i~----~sq~~i~~lK~~L~~ak~~L  102 (142)
T PF04048_consen   45 FEELKKRIEKALQEVVNEHYQGFNSSIGSYSQILSSIS----ESQERIRELKESLQEAKSLL  102 (142)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHH
Confidence            34444455555555543   357889999999888764    67899999999999998754


No 121
>cd00215 PTS_IIA_lac PTS_IIA, PTS system, lactose/cellobiose specific IIA subunit. The bacterial phosphoenolpyruvate: sugar phosphotransferase system (PTS) is a multi-protein system involved in the regulation of a variety of metabolic and transcriptional processes. This family is one of four structurally and functionally distinct group IIA PTS system cytoplasmic enzymes, necessary for the uptake of carbohydrates across the cytoplasmic membrane and their phosphorylation. This family of proteins normally function as a homotrimer, stabilized by a centrally located metal ion. Separation into subunits is thought to occur after phosphorylation.
Probab=23.94  E-value=83  Score=22.08  Aligned_cols=29  Identities=17%  Similarity=0.184  Sum_probs=22.5

Q ss_pred             hHHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHh
Q psy2692           8 SESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYL   45 (133)
Q Consensus         8 ~~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~   45 (133)
                      +...+..|+|+.|-.+|++-         +..|.+.+.
T Consensus        22 Al~~a~~g~fe~A~~~l~ea---------~~~l~~AH~   50 (97)
T cd00215          22 ALKAAKEGDFAEAEELLEEA---------NDSLNEAHH   50 (97)
T ss_pred             HHHHHHcCCHHHHHHHHHHH---------HHHHHHHHH
Confidence            56788999999999999886         555555553


No 122
>PF06580 His_kinase:  Histidine kinase;  InterPro: IPR010559 Two-component signal transduction systems enable bacteria to sense, respond, and adapt to a wide range of environments, stressors, and growth conditions []. Some bacteria can contain up to as many as 200 two-component systems that need tight regulation to prevent unwanted cross-talk []. These pathways have been adapted to response to a wide variety of stimuli, including nutrients, cellular redox state, changes in osmolarity, quorum signals, antibiotics, and more []. Two-component systems are comprised of a sensor histidine kinase (HK) and its cognate response regulator (RR) []. The HK catalyses its own auto-phosphorylation followed by the transfer of the phosphoryl group to the receiver domain on RR; phosphorylation of the RR usually activates an attached output domain, which can then effect changes in cellular physiology, often by regulating gene expression. Some HK are bifunctional, catalysing both the phosphorylation and dephosphorylation of their cognate RR. The input stimuli can regulate either the kinase or phosphatase activity of the bifunctional HK. A variant of the two-component system is the phospho-relay system. Here a hybrid HK auto-phosphorylates and then transfers the phosphoryl group to an internal receiver domain, rather than to a separate RR protein. The phosphoryl group is then shuttled to histidine phosphotransferase (HPT) and subsequently to a terminal RR, which can evoke the desired response [, ]. Signal transducing histidine kinases are the key elements in two-component signal transduction systems, which control complex processes such as the initiation of development in microorganisms [, ]. Examples of histidine kinases are EnvZ, which plays a central role in osmoregulation [], and CheA, which plays a central role in the chemotaxis system []. Histidine kinases usually have an N-terminal ligand-binding domain and a C-terminal kinase domain, but other domains may also be present. The kinase domain is responsible for the autophosphorylation of the histidine with ATP, the phosphotransfer from the kinase to an aspartate of the response regulator, and (with bifunctional enzymes) the phosphotransfer from aspartyl phosphate back to ADP or to water []. The kinase core has a unique fold, distinct from that of the Ser/Thr/Tyr kinase superfamily.  HKs can be roughly divided into two classes: orthodox and hybrid kinases [, ]. Most orthodox HKs, typified by the Escherichia coli EnvZ protein, function as periplasmic membrane receptors and have a signal peptide and transmembrane segment(s) that separate the protein into a periplasmic N-terminal sensing domain and a highly conserved cytoplasmic C-terminal kinase core. Members of this family, however, have an integral membrane sensor domain. Not all orthodox kinases are membrane bound, e.g., the nitrogen regulatory kinase NtrB (GlnL) is a soluble cytoplasmic HK []. Hybrid kinases contain multiple phosphodonor and phosphoacceptor sites and use multi-step phospho-relay schemes instead of promoting a single phosphoryl transfer. In addition to the sensor domain and kinase core, they contain a CheY-like receiver domain and a His-containing phosphotransfer (HPt) domain. This family represents a region within bacterial histidine kinase enzymes. Two-component signal transduction systems such as those mediated by histidine kinase are integral parts of bacterial cellular regulatory processes, and are used to regulate the expression of genes involved in virulence. Members of this family often contain IPR003594 from INTERPRO and/or IPR003660 from INTERPRO.; GO: 0000155 two-component sensor activity, 0000160 two-component signal transduction system (phosphorelay), 0016021 integral to membrane
Probab=23.88  E-value=2.2e+02  Score=18.74  Aligned_cols=54  Identities=17%  Similarity=0.200  Sum_probs=27.6

Q ss_pred             HHHHHHHHHHHHhcccC--CHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692          73 IDLIAKIQQCYQLTTSG--KFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI  130 (133)
Q Consensus        73 ~~L~~~Lk~gyk~~t~g--KF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi  130 (133)
                      ..|.+-|+.-+.+...+  +-++++..|-+.++...    +++++.-.+++=++.|+.|+
T Consensus        12 HFl~NtLn~I~~l~~~~~~~~~~~i~~ls~~lRy~l----~~~~~~v~l~~El~~i~~Yl   67 (82)
T PF06580_consen   12 HFLFNTLNSISWLARIDPEKASEMILSLSDLLRYSL----SSKEEFVTLEEELEFIENYL   67 (82)
T ss_pred             HHHHHHHHHHHHHHHcCHHHHHHHHHHHHHHHHHHh----CCCCCeeeHHHHHHHHHHHH
Confidence            34555555555554444  44556666666666655    33333333444445555543


No 123
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=23.44  E-value=2.4e+02  Score=23.69  Aligned_cols=89  Identities=15%  Similarity=0.121  Sum_probs=54.4

Q ss_pred             hHHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCC--CCCCccccCCCCCCCChHHHHHHHHHHHHh
Q psy2692           8 SESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLP--LLAPVTAFPTRKVDRSSIDLIAKIQQCYQL   85 (133)
Q Consensus         8 ~~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p--~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~   85 (133)
                      +..++..|+.+.|..+|.+.+. .+..|   ....+|.....   +-+  -+.-+..++.+.    .++-.-.+-.|.=+
T Consensus       270 A~~l~~~g~~~~A~~~L~~~l~-~~~~~---~l~~l~~~l~~---~~~~~al~~~e~~lk~~----P~~~~l~l~lgrl~  338 (398)
T PRK10747        270 AEHLIECDDHDTAQQIILDGLK-RQYDE---RLVLLIPRLKT---NNPEQLEKVLRQQIKQH----GDTPLLWSTLGQLL  338 (398)
T ss_pred             HHHHHHCCCHHHHHHHHHHHHh-cCCCH---HHHHHHhhccC---CChHHHHHHHHHHHhhC----CCCHHHHHHHHHHH
Confidence            4568899999999999999987 45544   33333333211   111  011111112111    12223356668888


Q ss_pred             cccCCHHHHHHHHHHHHHHccc
Q psy2692          86 TTSGKFVDAIEKLRNVLICIPL  107 (133)
Q Consensus        86 ~t~gKF~eAl~~Fr~iL~~i~l  107 (133)
                      ...|++.+|.+.|+..+..-|-
T Consensus       339 ~~~~~~~~A~~~le~al~~~P~  360 (398)
T PRK10747        339 MKHGEWQEASLAFRAALKQRPD  360 (398)
T ss_pred             HHCCCHHHHHHHHHHHHhcCCC
Confidence            8999999999999999887554


No 124
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=23.29  E-value=1.1e+02  Score=22.30  Aligned_cols=28  Identities=14%  Similarity=0.296  Sum_probs=22.6

Q ss_pred             HHHHHHHHhcccCCHHHHHHHHHHHHHH
Q psy2692          77 AKIQQCYQLTTSGKFVDAIEKLRNVLIC  104 (133)
Q Consensus        77 ~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~  104 (133)
                      .....|..+...|++++|+..|+..+..
T Consensus        37 ~~~~lg~~~~~~g~~~~A~~~~~~al~~   64 (172)
T PRK02603         37 VYYRDGMSAQADGEYAEALENYEEALKL   64 (172)
T ss_pred             HHHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence            3567788888889999999999988754


No 125
>cd06130 DNA_pol_III_epsilon_like an uncharacterized bacterial subgroup of the DEDDh 3'-5' exonuclease domain family with similarity to the epsilon subunit of DNA polymerase III. This subfamily is composed of uncharacterized bacterial proteins with similarity to the epsilon subunit of DNA polymerase III (Pol III), a multisubunit polymerase which is the main DNA replicating enzyme in bacteria, functioning as the chromosomal replicase. The Pol III holoenzyme is a complex of ten different subunits, three of which (alpha, epsilon, and theta) compose the catalytic core. The Pol III epsilon subunit, encoded by the dnaQ gene, is a DEDDh-type 3'-5' exonuclease which is responsible for the proofreading activity of the polymerase, increasing the fidelity of DNA synthesis. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that ser
Probab=22.85  E-value=1.1e+02  Score=21.64  Aligned_cols=28  Identities=18%  Similarity=0.477  Sum_probs=25.0

Q ss_pred             ccccCChHHHHhccCHHHHHHHHHhhcC
Q psy2692           2 NLHKVTSESINKSSSFETAFKLLHDQIG   29 (133)
Q Consensus         2 ~~~~~~~~~~~AAGsFe~A~~LLnrQiG   29 (133)
                      ++|.+|.+.+..++.|+.++.-+.+-++
T Consensus        49 ~i~GIt~e~l~~~~~~~~v~~~l~~~l~   76 (156)
T cd06130          49 AIHGITPEDVADAPTFPEVWPEIKPFLG   76 (156)
T ss_pred             cccCcCHHHHhcCCCHHHHHHHHHHHhC
Confidence            5899999999999999999998888766


No 126
>cd03591 CLECT_collectin_like C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1). CLECT_collectin_like: C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1).  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CTLDs of these collectins bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, or apoptotic cells) and mediate functions associated with killing and phagocytosis.  MBPs recognize high mannose oligosaccharides in a calcium dependent manner, bind to a broad range of pathogens, and trigger cell killing by activating the complement pathway.  MBP also acts directly as an opsonin.  SP-A and SP-D in addition to functioning as host defense components, a
Probab=22.82  E-value=1.2e+02  Score=20.40  Aligned_cols=35  Identities=14%  Similarity=0.062  Sum_probs=29.6

Q ss_pred             cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692          88 SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN  124 (133)
Q Consensus        88 ~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~  124 (133)
                      .-.+.+|...+++.  -.-|++++|.+|.+.+.+++.
T Consensus        10 ~~~w~~A~~~C~~~--g~~La~i~s~~e~~~l~~~~~   44 (114)
T cd03591          10 EKNFDDAQKLCSEA--GGTLAMPRNAAENAAIASYVK   44 (114)
T ss_pred             eeCHHHHHHHHhhc--CCEEecCCCHHHHHHHHHHHh
Confidence            45799999999984  677888999999999988875


No 127
>COG5379 BtaA S-adenosylmethionine:diacylglycerol 3-amino-3-carboxypropyl transferase [Lipid metabolism]
Probab=22.69  E-value=1.5e+02  Score=25.91  Aligned_cols=43  Identities=19%  Similarity=0.313  Sum_probs=38.0

Q ss_pred             cccCChHHHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhh
Q psy2692           3 LHKVTSESINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTA   50 (133)
Q Consensus         3 ~~~~~~~~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~   50 (133)
                      +|.|.+.+++++|+.+.--|.+++.     |.|+...-.--+.+||-+
T Consensus       186 ~~Gvn~~~m~~~r~lgeQR~ff~~k-----~ap~f~r~vl~wlt~rks  228 (414)
T COG5379         186 LHGVNPQAMFKARDLGEQRQFFDRK-----LAPLFDRPVLRWLTSRKS  228 (414)
T ss_pred             hhCCCHHHHHhhcchhHHHHHHHhh-----hhhhhhHHHHHHHhcccc
Confidence            7999999999999999999999886     789999888888887655


No 128
>TIGR00823 EIIA-LAC phosphotransferase system enzyme II, lactose-specific, factor III. operon. While the Lac permeases consist of two polypeptide chains (IIA and IICB), the Chb permease of E. coli consists of three (IIA, IIB and IIC). In B. subtilis, a PTS permease similar to the Chb permease of E. coli is believed to transport lichenan (a b-1,3;1,4 glucan) degradation products, oligosaccharides of 2-4 glucose units. This model is specific for the IIA subunit of the Lac PTS family.
Probab=22.66  E-value=90  Score=21.97  Aligned_cols=20  Identities=20%  Similarity=0.257  Sum_probs=17.5

Q ss_pred             hHHHHhccCHHHHHHHHHhh
Q psy2692           8 SESINKSSSFETAFKLLHDQ   27 (133)
Q Consensus         8 ~~~~~AAGsFe~A~~LLnrQ   27 (133)
                      +...+..|+|+.|-++|+.-
T Consensus        24 Al~~a~~gdfe~A~~~l~eA   43 (99)
T TIGR00823        24 ALKAAKAGDFAKARALVEQA   43 (99)
T ss_pred             HHHHHHcCCHHHHHHHHHHH
Confidence            56788999999999999876


No 129
>KOG0548|consensus
Probab=22.65  E-value=2e+02  Score=26.41  Aligned_cols=63  Identities=13%  Similarity=0.163  Sum_probs=0.0

Q ss_pred             HHHhccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCC--------------------C
Q psy2692          10 SINKSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKV--------------------D   69 (133)
Q Consensus        10 ~~~AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~--------------------~   69 (133)
                      +++..|+++.|...++|                                  .+..+|++                    .
T Consensus       307 a~~k~~~~~~ai~~~~k----------------------------------aLte~Rt~~~ls~lk~~Ek~~k~~e~~a~  352 (539)
T KOG0548|consen  307 AYTKREDYEGAIKYYQK----------------------------------ALTEHRTPDLLSKLKEAEKALKEAERKAY  352 (539)
T ss_pred             hhhhHHhHHHHHHHHHH----------------------------------HhhhhcCHHHHHHHHHHHHHHHHHHHHHh


Q ss_pred             CChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          70 RSSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        70 ~~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      ++..-=.+.=..|+-+|+.|+|++|+..+-..|..-|
T Consensus       353 ~~pe~A~e~r~kGne~Fk~gdy~~Av~~YteAIkr~P  389 (539)
T KOG0548|consen  353 INPEKAEEEREKGNEAFKKGDYPEAVKHYTEAIKRDP  389 (539)
T ss_pred             hChhHHHHHHHHHHHHHhccCHHHHHHHHHHHHhcCC


No 130
>cd07957 Anticodon_Ia_Met Anticodon-binding domain of methionyl tRNA synthetases. This domain is found in methionyl tRNA synthetases (MetRS), which belong to the class Ia aminoacyl tRNA synthetases. It lies C-terminal to the catalytic core domain, and recognizes and specifically binds to the tRNA anticodon (CAU). MetRS catalyzes the transfer of methionine to the 3'-end of its tRNA.
Probab=22.46  E-value=2.5e+02  Score=18.81  Aligned_cols=57  Identities=11%  Similarity=0.155  Sum_probs=32.9

Q ss_pred             HHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccccccCh------H-HHHHHHHHHHHHHHhh
Q psy2692          74 DLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTK------K-EIEEAKQIINICKEYI  130 (133)
Q Consensus        74 ~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~------~-e~~e~~eli~icrEYi  130 (133)
                      .+.+.+++.-+.+...+|.+|+....+.+..+=-..-..+      + +.++.+..+..|.+.+
T Consensus        44 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~n~y~~~~kpw~~~~~~~~~~~~~~l~~~~~~l  107 (129)
T cd07957          44 EAEELLEEVAEAMEELEFRKALEEIMELARAANKYIDETAPWKLAKEEDPERLATVLYVLLELL  107 (129)
T ss_pred             HHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHhhhccCCCHHHHccCCHHHHHHHHHHHHHHH
Confidence            3444556666666788999999988877744322221110      1 2334555666666543


No 131
>smart00034 CLECT C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
Probab=22.46  E-value=1.4e+02  Score=19.32  Aligned_cols=41  Identities=15%  Similarity=0.236  Sum_probs=32.4

Q ss_pred             HHhcc-cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHH
Q psy2692          83 YQLTT-SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINI  125 (133)
Q Consensus        83 yk~~t-~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~i  125 (133)
                      |+.+. ...+.+|...+++.  ..-|+.++|.+|.+.+.+++..
T Consensus        13 y~~~~~~~~~~~A~~~C~~~--~~~La~i~~~~e~~~i~~~~~~   54 (126)
T smart00034       13 YKFSTEKKTWADAQAFCQSL--GAHLASIHSEAENDFVASLLKN   54 (126)
T ss_pred             EEEECCccCHHHHHHHHHhc--CCEEcccCCHHHHHHHHHHHHh
Confidence            33443 47899999999998  5778888999999988887764


No 132
>KOG0547|consensus
Probab=22.34  E-value=78  Score=29.17  Aligned_cols=27  Identities=15%  Similarity=0.228  Sum_probs=0.0

Q ss_pred             HHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          80 QQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        80 k~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      ..|-+.|+.||+.+||..+...|-..|
T Consensus       120 ~~GN~~f~~kkY~eAIkyY~~AI~l~p  146 (606)
T KOG0547|consen  120 TKGNKFFRNKKYDEAIKYYTQAIELCP  146 (606)
T ss_pred             hhhhhhhhcccHHHHHHHHHHHHhcCC


No 133
>PF00140 Sigma70_r1_2:  Sigma-70 factor, region 1.2;  InterPro: IPR009042 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes.  With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ]. ; GO: 0003677 DNA binding, 0003700 sequence-specific DNA binding transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, DNA-dependent, 0006355 regulation of transcription, DNA-dependent; PDB: 1SMY_F 1IW7_P 1SIG_A 3IYD_F 2BE5_F 2A6E_F 2CW0_F 2A69_P 2A6H_P 3DXJ_P ....
Probab=22.31  E-value=1.7e+02  Score=16.71  Aligned_cols=19  Identities=21%  Similarity=0.242  Sum_probs=15.4

Q ss_pred             HHHHHHHHHHHHHcccccc
Q psy2692          92 VDAIEKLRNVLICIPLLVV  110 (133)
Q Consensus        92 ~eAl~~Fr~iL~~i~l~vv  110 (133)
                      +|++..|-+-+..+|++..
T Consensus         1 ~D~l~~Yl~ei~~~~LLt~   19 (37)
T PF00140_consen    1 SDSLRLYLKEIGRYPLLTA   19 (37)
T ss_dssp             HHHHHHHHHHHHHS-EETT
T ss_pred             CcHHHHHHHHHcCCCCCCH
Confidence            5889999999999999944


No 134
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=22.20  E-value=1.4e+02  Score=21.03  Aligned_cols=29  Identities=24%  Similarity=0.261  Sum_probs=22.2

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      ..+.|.-++..|++++|+..|+..+..-|
T Consensus        34 ~~~la~~~~~~~~~~~A~~~~~~~l~~~p   62 (234)
T TIGR02521        34 RVQLALGYLEQGDLEVAKENLDKALEHDP   62 (234)
T ss_pred             HHHHHHHHHHCCCHHHHHHHHHHHHHhCc
Confidence            55667777888888888888888876654


No 135
>cd03603 CLECT_VCBS A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. CLECT_VCBS: A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces including CaCO3 and ice.  Bacterial CTLDs within this group are functionally uncharacterized.  Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions.  CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose.  CTLDs associate with each other through several different surface
Probab=22.08  E-value=1.4e+02  Score=20.59  Aligned_cols=36  Identities=8%  Similarity=-0.031  Sum_probs=30.2

Q ss_pred             ccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692          87 TSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN  124 (133)
Q Consensus        87 t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~  124 (133)
                      +...|.||...+++.  -.-|++++|.+|.+.+.+++.
T Consensus         8 ~~~sw~~A~~~C~~~--g~~La~I~s~~E~~fv~~~~~   43 (118)
T cd03603           8 GGMTWEAAQTLAESL--GGHLVTINSAEENDWLLSNFG   43 (118)
T ss_pred             CCcCHHHHHHHHHHc--CCEEcccCCHHHHHHHHHHhc
Confidence            367899999999985  678888999999988887764


No 136
>cd06266 RNaseH_typeII Ribonuclease H type II. Ribonuclease H (RNase H) is classified into two families, type I (prokaryotic RNase HI, eukaryotic RNase H1 and viral RNase H) and type II (prokaryotic RNase HII and HIII, and eukaryotic RNase H2/HII). RNase H endonucleolytically hydrolyzes an RNA strand when it is annealed to a complementary DNA strand in the presence of divalent cations, in DNA replication and repair. The enzyme can be found in bacteria, archaea, and eukaryotes. Most prokaryotic and eukaryotic genomes contain multiple RNase H genes, however, no prokaryotic genome contained the combination of only RNase HI and HIII. Despite a lack of evidence for homology from sequence comparisons, type I and type II RNase H share a common fold and similar steric configurations of the four acidic active-site residues, suggesting identical or very similar catalytic mechanisms. It appears that type I and type II RNases H also have overlapping functions in cells, as over-expression of Escheri
Probab=22.01  E-value=56  Score=25.06  Aligned_cols=29  Identities=17%  Similarity=0.177  Sum_probs=25.1

Q ss_pred             cCChHHHHhccCHHHHHHHHHhhcCcccc
Q psy2692           5 KVTSESINKSSSFETAFKLLHDQIGVKNF   33 (133)
Q Consensus         5 ~~~~~~~~AAGsFe~A~~LLnrQiGivNf   33 (133)
                      -|.++|.+|--.+|..|+.|+++++..+|
T Consensus       132 ~VaAASIiAKv~rd~~m~~l~~~~~~~~~  160 (193)
T cd06266         132 SVAAASILAKVTRDRLMEELDEEYPGYGF  160 (193)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHCCCCCC
Confidence            47889999999999999999999886444


No 137
>PF11817 Foie-gras_1:  Foie gras liver health family 1;  InterPro: IPR021773  Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats. 
Probab=21.91  E-value=85  Score=24.92  Aligned_cols=20  Identities=25%  Similarity=0.428  Sum_probs=17.7

Q ss_pred             hHHHHhccCHHHHHHHHHhh
Q psy2692           8 SESINKSSSFETAFKLLHDQ   27 (133)
Q Consensus         8 ~~~~~AAGsFe~A~~LLnrQ   27 (133)
                      +.++..+|+++.|.++|.+-
T Consensus       185 A~ey~~~g~~~~A~~~l~~~  204 (247)
T PF11817_consen  185 AEEYFRLGDYDKALKLLEPA  204 (247)
T ss_pred             HHHHHHCCCHHHHHHHHHHH
Confidence            57899999999999999875


No 138
>PF12793 SgrR_N:  Sugar transport-related sRNA regulator N-term
Probab=21.84  E-value=89  Score=22.43  Aligned_cols=18  Identities=28%  Similarity=0.549  Sum_probs=16.2

Q ss_pred             hHHHHhccCHHHHHHHHH
Q psy2692           8 SESINKSSSFETAFKLLH   25 (133)
Q Consensus         8 ~~~~~AAGsFe~A~~LLn   25 (133)
                      +.+++..|+++.|+++|.
T Consensus        77 ~~~~l~~g~~~~a~~ll~   94 (115)
T PF12793_consen   77 AEELLEQGKYEQALQLLD   94 (115)
T ss_pred             HHHHHHcCCHHHHHHHHH
Confidence            467889999999999998


No 139
>PF02255 PTS_IIA:  PTS system, Lactose/Cellobiose specific IIA subunit;  InterPro: IPR003188 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) [, ] is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC []. These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII).  The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site.  An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose [, , , ].  The lactose/cellobiose-specific family are one of four structurally and functionally distinct group IIA PTS system enzymes. This family of proteins normally function as a homotrimer, stabilised by a centrally located metal ion []. Separation into subunits is thought to occur after phosphorylation.; GO: 0005351 sugar:hydrogen symporter activity, 0006810 transport, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0016020 membrane; PDB: 3L8R_D 2E2A_B 1E2A_C 3K1S_C 2LRK_C 2LRL_A 2WY2_A 1WCR_A 2WWV_C.
Probab=21.63  E-value=96  Score=21.60  Aligned_cols=20  Identities=25%  Similarity=0.375  Sum_probs=16.2

Q ss_pred             hHHHHhccCHHHHHHHHHhh
Q psy2692           8 SESINKSSSFETAFKLLHDQ   27 (133)
Q Consensus         8 ~~~~~AAGsFe~A~~LLnrQ   27 (133)
                      +.+.+..|+|+.|-.+|++-
T Consensus        21 Al~~a~~~~fe~A~~~l~~a   40 (96)
T PF02255_consen   21 ALKAAREGDFEEAEELLKEA   40 (96)
T ss_dssp             HHHHHHTT-HHHHHHHHHHH
T ss_pred             HHHHHHhCCHHHHHHHHHHH
Confidence            56778899999999999876


No 140
>COG1283 NptA Na+/phosphate symporter [Inorganic ion transport and metabolism]
Probab=21.58  E-value=1.5e+02  Score=27.08  Aligned_cols=40  Identities=18%  Similarity=0.224  Sum_probs=28.0

Q ss_pred             CHHHHHHHHHHHHHHccccccc---ChHHHHHHHHHHHHHHHh
Q psy2692          90 KFVDAIEKLRNVLICIPLLVVT---TKKEIEEAKQIINICKEY  129 (133)
Q Consensus        90 KF~eAl~~Fr~iL~~i~l~vv~---~~~e~~e~~eli~icrEY  129 (133)
                      |.+|+++....-++....-+-.   |++|.+...++++.||++
T Consensus       370 ~~e~~vd~~~~~Ik~YL~~ls~~~Lse~es~r~~~iid~a~~l  412 (533)
T COG1283         370 KLEDAVDRLYEEIKLYLARLSKEGLSEEESRRWAEIIDAAINL  412 (533)
T ss_pred             HHHHHHHHHHHHHHHHHHHhccccCCHHHHHHHHHHHHHHHhH
Confidence            4566666666666554433222   688999999999999986


No 141
>KOG0482|consensus
Probab=21.43  E-value=3.6e+02  Score=25.37  Aligned_cols=81  Identities=17%  Similarity=0.280  Sum_probs=46.2

Q ss_pred             ccccccHHHHHHHH-hhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhccc------------CCHHHHHHH
Q psy2692          31 KNFLPYRQMFLNSY-LQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTTS------------GKFVDAIEK   97 (133)
Q Consensus        31 vNf~PLk~~F~~~y-~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t~------------gKF~eAl~~   97 (133)
                      -+|+||-+-.|.-| ..+|.+-|-.|                 ++|.+.+-.||--...            -+-=-++-.
T Consensus       550 ~~fepl~~~~mR~yI~~ak~~~P~vp-----------------~~l~dyi~~AYv~~Rrea~~~~~~t~ttpRtLL~IlR  612 (721)
T KOG0482|consen  550 LDFEPLDPNLMRRYISLAKRKNPVVP-----------------EALADYITGAYVELRREARSSKDFTYTTPRTLLGILR  612 (721)
T ss_pred             ccCCCCCHHHHHHHHHHHhhcCCCCC-----------------HHHHHHHHHHHHHHHHHhhccCCCcccCHHHHHHHHH
Confidence            57899999998888 45666655544                 2344444444432221            111122223


Q ss_pred             HHHHHHHcccccccChHHHHHHHHHHHHHHH
Q psy2692          98 LRNVLICIPLLVVTTKKEIEEAKQIINICKE  128 (133)
Q Consensus        98 Fr~iL~~i~l~vv~~~~e~~e~~eli~icrE  128 (133)
                      .-..|-.+-|...-++++++|+-.|++..++
T Consensus       613 ls~AlarLRls~~V~~~DV~EALRLme~sK~  643 (721)
T KOG0482|consen  613 LSTALARLRLSDSVEEDDVNEALRLMEMSKD  643 (721)
T ss_pred             HHHHHHHhhhccccchhhHHHHHHHHHhhhc
Confidence            3333444555666678888888888887765


No 142
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=21.23  E-value=2.3e+02  Score=21.21  Aligned_cols=29  Identities=21%  Similarity=0.418  Sum_probs=25.3

Q ss_pred             HHHHHHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          78 KIQQCYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        78 ~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      .+..|.-++..|++++|+..|+.++..-|
T Consensus        73 ~~~la~~~~~~~~~~~A~~~~~~~l~~~p  101 (235)
T TIGR03302        73 QLDLAYAYYKSGDYAEAIAAADRFIRLHP  101 (235)
T ss_pred             HHHHHHHHHhcCCHHHHHHHHHHHHHHCc
Confidence            46678888899999999999999998776


No 143
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=21.15  E-value=2.7e+02  Score=27.14  Aligned_cols=25  Identities=24%  Similarity=0.228  Sum_probs=18.9

Q ss_pred             HHHhcccCCHHHHHHHHHHHHHHcc
Q psy2692          82 CYQLTTSGKFVDAIEKLRNVLICIP  106 (133)
Q Consensus        82 gyk~~t~gKF~eAl~~Fr~iL~~i~  106 (133)
                      |..+...|++++|+..|+.+|..-|
T Consensus       276 G~~~~~~g~~~~A~~~l~~aL~~~P  300 (1157)
T PRK11447        276 GLAAVDSGQGGKAIPELQQAVRANP  300 (1157)
T ss_pred             HHHHHHCCCHHHHHHHHHHHHHhCC
Confidence            6666777888888888888877655


No 144
>PF03705 CheR_N:  CheR methyltransferase, all-alpha domain;  InterPro: IPR022641  CheR proteins are part of the chemotaxis signaling mechanism which methylates the chemotaxis receptor at specific glutamate residues. This entry refers to the N-terminal domain of the CherR-type MCP methyltransferases, which are found in bacteria, archaea and green plants. This entry is found in association with PF01739 from PFAM.  Methyl transfer from the ubiquitous S-adenosyl-L-methionine (AdoMet) to either nitrogen, oxygen or carbon atoms is frequently employed in diverse organisms ranging from bacteria to plants and mammals. The reaction is catalysed by methyltransferases (Mtases) and modifies DNA, RNA, proteins and small molecules, such as catechol for regulatory purposes. The various aspects of the role of DNA methylation in prokaryotic restriction-modification systems and in a number of cellular processes in eukaryotes including gene regulation and differentiation is well documented. Three classes of DNA Mtases transfer the methyl group from AdoMet to the target base to form either N-6-methyladenine, or N-4-methylcytosine, or C-5- methylcytosine. In C-5-cytosine Mtases, ten conserved motifs are arranged in the same order []. Motif I (a glycine-rich or closely related consensus sequence; FAGxGG in M.HhaI []), shared by other AdoMet-Mtases [], is part of the cofactor binding site and motif IV (PCQ) is part of the catalytic site. In contrast, sequence comparison among N-6-adenine and N-4-cytosine Mtases indicated two of the conserved segments [], although more conserved segments may be present. One of them corresponds to motif I in C-5-cytosine Mtases, and the other is named (D/N/S)PP(Y/F). Crystal structures are known for a number of Mtases [, , , ]. The cofactor binding sites are almost identical and the essential catalytic amino acids coincide. The comparable protein folding and the existence of equivalent amino acids in similar secondary and tertiary positions indicate that many (if not all) AdoMet-Mtases have a common catalytic domain structure. This permits tertiary structure prediction of other DNA, RNA, protein, and small-molecule AdoMet-Mtases from their amino acid sequences []. Flagellated bacteria swim towards favourable chemicals and away from deleterious ones. Sensing of chemoeffector gradients involves chemotaxis receptors, transmembrane (TM) proteins that detect stimuli through their periplasmic domains and transduce the signals via their cytoplasmic domains []. Signalling outputs from these receptors are influenced both by the binding of the chemoeffector ligand to their periplasmic domains and by methylation of specific glutamate residues on their cytoplasmic domains. Methylation is catalysed by CheR, an S-adenosylmethionine-dependent methyltransferase [], which reversibly methylates specific glutamate residues within a coiled coil region, to form gamma-glutamyl methyl ester residues [, ]. The structure of the Salmonella typhimurium chemotaxis receptor methyltransferase CheR, bound to S-adenosylhomocysteine, has been determined to a resolution of 2.0 A []. The structure reveals CheR to be a two-domain protein, with a smaller N-terminal helical domain linked via a single polypeptide connection to a larger C-terminal alpha/beta domain. The C-terminal domain has the characteristics of a nucleotide-binding fold, with an insertion of a small anti-parallel beta-sheet subdomain. The S-adenosylhomocysteine-binding site is formed mainly by the large domain, with contributions from residues within the N-terminal domain and the linker region [].; PDB: 1AF7_A 1BC5_A.
Probab=20.98  E-value=1.1e+02  Score=18.25  Aligned_cols=41  Identities=24%  Similarity=0.420  Sum_probs=26.2

Q ss_pred             CHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCC
Q psy2692          16 SFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLL   57 (133)
Q Consensus        16 sFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~l   57 (133)
                      +|+.-..++++..|+ +|.+.|+.+++-=...+...-++++.
T Consensus         4 ~f~~~~~~i~~~~Gi-~l~~~K~~~l~rRl~~rm~~~~~~~~   44 (57)
T PF03705_consen    4 EFERFRELIYRRTGI-DLSEYKRSLLERRLARRMRALGLPSF   44 (57)
T ss_dssp             HHHHHHHHHHHHH------GGGHHHHHHHHHHHHHHHT---H
T ss_pred             HHHHHHHHHHHHHCC-CCchhhHHHHHHHHHHHHHHcCCCCH
Confidence            477778889999997 89999999888777777766666543


No 145
>cd03590 CLECT_DC-SIGN_like C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR).  This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human.  These proteins are type II membrane proteins with a CTLD ectodomain.  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium.  DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on path
Probab=20.98  E-value=1.4e+02  Score=20.09  Aligned_cols=40  Identities=15%  Similarity=0.311  Sum_probs=32.2

Q ss_pred             HHhcc-cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692          83 YQLTT-SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN  124 (133)
Q Consensus        83 yk~~t-~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~  124 (133)
                      |++++ ...+.+|...++..  ..-|++++|.+|.+-+.+++.
T Consensus        13 y~~~~~~~tw~~A~~~C~~~--g~~La~i~s~~e~~~l~~~~~   53 (126)
T cd03590          13 YFFSTEKKSWEESRQFCEDM--GAHLVIINSQEEQEFISKILS   53 (126)
T ss_pred             EEEeCCCcCHHHHHHHHHhC--CCEEEeeCCHHHHHHHHHHhC
Confidence            44444 57899999999874  788889999999999988874


No 146
>cd06144 REX4_like DEDDh 3'-5' exonuclease domain of RNA exonuclease 4, XPMC2, Interferon Stimulated Gene product of 20 kDa, and similar proteins. This subfamily is composed of RNA exonuclease 4 (REX4 or Rex4p), XPMC2, Interferon (IFN) Stimulated Gene product of 20 kDa (ISG20), and similar proteins. REX4 is involved in pre-rRNA processing. It controls the ratio between the two forms of 5.8S rRNA in yeast. XPMC2 is a Xenopus gene which was identified through its ability to correct a mitotic defect in fission yeast. The human homolog of XPMC2 (hPMC2) may be involved in angiotensin II-induced adrenal cell cycle progression and cell proliferation. ISG20 is an IFN-induced antiviral exonuclease with a strong preference for single-stranded RNA and minor activity towards single-stranded DNA. These proteins are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clus
Probab=20.91  E-value=1.2e+02  Score=22.09  Aligned_cols=29  Identities=14%  Similarity=0.292  Sum_probs=25.8

Q ss_pred             CccccCChHHHHhccCHHHHHHHHHhhcC
Q psy2692           1 MNLHKVTSESINKSSSFETAFKLLHDQIG   29 (133)
Q Consensus         1 ~~~~~~~~~~~~AAGsFe~A~~LLnrQiG   29 (133)
                      +.+|.+|++....|-+|+.++.-+.+-+|
T Consensus        48 ~~ihGIt~~~v~~a~~~~~~~~~l~~~l~   76 (152)
T cd06144          48 TAVSGIRPEHLKDAPDFEEVQKKVAELLK   76 (152)
T ss_pred             ccCCCCCHHHHcCCCCHHHHHHHHHHHhC
Confidence            36899999999999999999998888775


No 147
>cd03592 CLECT_selectins_like C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins:  P(platlet)-, E(endothelial)-, and L(leukocyte)- selectins (sels). CLECT_selectins_like: C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins:  P(platlet)-, E(endothelial)-, and L(leukocyte)- selectins (sels).  CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins.  P- E- and L-sels are cell adhesion receptors that mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration.  L- sel is expressed constitutively on most leukocytes.  P-sel is stored in the Weibel-Palade bodies of endothelial cells and in the alpha granules of platlets.  E- sels are present on endothelial cells.  Following platelet and/or endothelial cell activation P- sel is rapidly translocated to the cell surface and E-sel exp
Probab=20.90  E-value=1.4e+02  Score=20.03  Aligned_cols=35  Identities=11%  Similarity=0.095  Sum_probs=29.0

Q ss_pred             cCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHH
Q psy2692          88 SGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIIN  124 (133)
Q Consensus        88 ~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~  124 (133)
                      .-.|.||...+++.  -.-|+.++|.+|.+-+.+++.
T Consensus         9 ~~~w~~A~~~C~~~--g~~La~i~s~~e~~~i~~~~~   43 (115)
T cd03592           9 KMTFNEAVKYCKSR--GTDLVAIQNAEENALLNGFAL   43 (115)
T ss_pred             ccCHHHHHHHHHHc--CCeEeecCCHHHHHHHHHHHH
Confidence            46799999999986  678889999999988887653


No 148
>PF10607 CLTH:  CTLH/CRA C-terminal to LisH motif domain;  InterPro: IPR019589 This entry represents the CRA (or CT11-RanBPM) domain, which is a protein-protein interaction domain present in crown eukaryotes (plants, animals, fungi) and which is found in Ran-binding proteins such as Ran-binding protein 9 (RanBP9 or RanBPM) and RanBP10. RanBPM is a scaffolding protein important in regulating cellular function in both the immune system and the nervous system, and may act as an adapter protein to couple membrane receptors to intracellular signaling pathways. This domain is at the C terminus of the proteins and is the binding domain for the CRA motif, which is comprised of approximately 100 amino acids at the C-terminal of RanBPM. It was found to be important for the interaction of RanBPM with fragile X mental retardation protein (FMRP), but its functional significance has yet to be determined []. 
Probab=20.82  E-value=3.1e+02  Score=19.28  Aligned_cols=19  Identities=11%  Similarity=0.230  Sum_probs=16.6

Q ss_pred             HHHHhccCHHHHHHHHHhh
Q psy2692           9 ESINKSSSFETAFKLLHDQ   27 (133)
Q Consensus         9 ~~~~AAGsFe~A~~LLnrQ   27 (133)
                      .+.+..|+.+.|+..+++.
T Consensus         9 ~~~I~~g~i~~Ai~w~~~~   27 (145)
T PF10607_consen    9 RQAILNGDIDPAIEWLNEN   27 (145)
T ss_pred             HHHHHcCCHHHHHHHHHHc
Confidence            4567899999999999986


No 149
>PLN00416 carbonate dehydratase
Probab=20.80  E-value=1.6e+02  Score=24.26  Aligned_cols=73  Identities=16%  Similarity=0.213  Sum_probs=49.3

Q ss_pred             hccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhcccCCHH
Q psy2692          13 KSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTTSGKFV   92 (133)
Q Consensus        13 AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t~gKF~   92 (133)
                      |+.|++.+..-|++-+  -|.+-++..--..-..-.+++....              .+.+.-.++|.+|++-|..++++
T Consensus         2 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~al~~Ll~Gn~rF~~~~~~   65 (258)
T PLN00416          2 ATESYEAAIKGLNDLL--STKADLGNVAAAKIKALTAELKELD--------------SSNSDAIERIKTGFTQFKTEKYL   65 (258)
T ss_pred             CcccHHHHHHHHHhhc--ccccccchHHHHhHHHHHHHHHHhh--------------cCHHHHHHHHHHHHHHHHhcccc
Confidence            5678999999998853  2566666554443333333343333              23456677999999999999998


Q ss_pred             HHHHHHHHH
Q psy2692          93 DAIEKLRNV  101 (133)
Q Consensus        93 eAl~~Fr~i  101 (133)
                      +-.+.|+.+
T Consensus        66 ~~~~~~~~l   74 (258)
T PLN00416         66 KNSTLFNHL   74 (258)
T ss_pred             cCHHHHHhh
Confidence            877777764


No 150
>KOG2076|consensus
Probab=20.56  E-value=1.4e+02  Score=28.98  Aligned_cols=40  Identities=20%  Similarity=0.150  Sum_probs=34.3

Q ss_pred             hHH-HHHHHHHHHHhcccCCHHHHHHHHHHHHHHccccccc
Q psy2692          72 SID-LIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLLVVT  111 (133)
Q Consensus        72 l~~-L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~  111 (133)
                      ++- +...|.+|--+|--|.+++|...+++|+.+.|.....
T Consensus       135 l~~~l~~ll~eAN~lfarg~~eeA~~i~~EvIkqdp~~~~a  175 (895)
T KOG2076|consen  135 LAPELRQLLGEANNLFARGDLEEAEEILMEVIKQDPRNPIA  175 (895)
T ss_pred             cCHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHhCccchhh
Confidence            444 7778899999999999999999999999999986443


No 151
>KOG0980|consensus
Probab=20.49  E-value=91  Score=30.43  Aligned_cols=40  Identities=25%  Similarity=0.297  Sum_probs=29.6

Q ss_pred             HHHHHHhhcCcccccccHHHHHHHHhhhh---------------hhccCCCCCCC
Q psy2692          20 AFKLLHDQIGVKNFLPYRQMFLNSYLQSR---------------TAYSTLPLLAP   59 (133)
Q Consensus        20 A~~LLnrQiGivNf~PLk~~F~~~y~~s~---------------~~~~~~p~lp~   59 (133)
                      =|.-|++|+----.+|+|++|+..|..-+               ..+|.+|.-||
T Consensus       208 mlfkLHs~vp~dtLeghRdRf~~qf~rLk~FY~~~S~lqYfk~LI~IP~LP~~~P  262 (980)
T KOG0980|consen  208 MLFKLHSQVPPDTLEGHRDRFHTQFERLKQFYADCSNLQYFKRLIQIPTLPEDAP  262 (980)
T ss_pred             HHHHHHcCCCHHHhhhHHHHHHHHHHHHHHHHHhcchhHHHHHHhcCCCCCCCCc
Confidence            46678888887778999999998765433               34788886554


No 152
>PF05053 Menin:  Menin;  InterPro: IPR007747 MEN1, the gene responsible for multiple endocrine neoplasia type 1, is a tumour suppressor gene that encodes a protein called Menin which may be an atypical GTPase stimulated by nm23 [].; GO: 0005634 nucleus; PDB: 3RE2_A 3U84_B 3U86_A 3U88_B 3U85_A.
Probab=20.45  E-value=2.3e+02  Score=26.48  Aligned_cols=49  Identities=16%  Similarity=0.226  Sum_probs=38.9

Q ss_pred             HHHHHhcccCCHHHHHHHHHHHHHHcccccccChHHHHHHHHHHHHHHHhh
Q psy2692          80 QQCYQLTTSGKFVDAIEKLRNVLICIPLLVVTTKKEIEEAKQIINICKEYI  130 (133)
Q Consensus        80 k~gyk~~t~gKF~eAl~~Fr~iL~~i~l~vv~~~~e~~e~~eli~icrEYi  130 (133)
                      +-|| +...+++.||+...-+.-.-|-...- ++++++--||+++|+-|-|
T Consensus       324 ~gg~-~yR~~~~~eA~~~Wa~aa~Vi~~YnY-~reDeEiYKEfleIAneLi  372 (618)
T PF05053_consen  324 LGGY-YYRHKRYREALRSWAEAADVIRKYNY-SREDEEIYKEFLEIANELI  372 (618)
T ss_dssp             HHHH-HHHTT-HHHHHHHHHHHHHHHTTSB---GGGHHHHHHHHHHHHTHH
T ss_pred             hhhH-HHHHHHHHHHHHHHHHHHHHHHHccc-CccHHHHHHHHHHHHHHHH
Confidence            4466 78889999999999988777777755 7999999999999998865


No 153
>KOG1173|consensus
Probab=20.40  E-value=1.6e+02  Score=27.34  Aligned_cols=27  Identities=22%  Similarity=0.256  Sum_probs=16.0

Q ss_pred             HHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692          82 CYQLTTSGKFVDAIEKLRNVLICIPLL  108 (133)
Q Consensus        82 gyk~~t~gKF~eAl~~Fr~iL~~i~l~  108 (133)
                      ||-.--.|+|+-|++.|...|..-|.-
T Consensus       496 g~iy~llgnld~Aid~fhKaL~l~p~n  522 (611)
T KOG1173|consen  496 GYIYHLLGNLDKAIDHFHKALALKPDN  522 (611)
T ss_pred             HHHHHHhcChHHHHHHHHHHHhcCCcc
Confidence            455555566666666666666655543


No 154
>cd05513 Bromo_brd7_like Bromodomain, brd7_like subgroup. The BRD7 gene encodes a nuclear protein that has been shown to inhibit cell growth and the progression of the cell cycle by regulating cell-cycle genes at the transcriptional level. BRD7 has been identified as a gene involved in nasopharyngeal carcinoma. The protein interacts with acetylated histone H3 via its bromodomain. Bromodomains are 110 amino acid long domains that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=20.21  E-value=3.1e+02  Score=18.96  Aligned_cols=88  Identities=11%  Similarity=-0.001  Sum_probs=54.0

Q ss_pred             ccHHHHHHHHhhhhhhccCCCCCCCccccCCCC------CCCChHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692          35 PYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRK------VDRSSIDLIAKIQQCYQLTTSGKFVDAIEKLRNVLICIPLL  108 (133)
Q Consensus        35 PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn------~~~~l~~L~~~Lk~gyk~~t~gKF~eAl~~Fr~iL~~i~l~  108 (133)
                      ||+..+.++...-+..=.+.++.-|+....--+      .|..++.+..+|+.+.    =..+.+-..-++-|.......
T Consensus         1 ~l~~~l~~il~~l~~~~~~~~F~~PV~~~~~pdY~~vIk~PmDL~tI~~kl~~~~----Y~s~~~f~~D~~li~~Na~~y   76 (98)
T cd05513           1 PLQKALEQLIRQLQRKDPHGFFAFPVTDFIAPGYSSIIKHPMDFSTMKEKIKNND----YQSIEEFKDDFKLMCENAMKY   76 (98)
T ss_pred             CHHHHHHHHHHHHHcCCccccccCcCCccccccHHHHHcCccCHHHHHHHHhCCC----CCCHHHHHHHHHHHHHHHHHH
Confidence            777888777765555444555544432111001      0888999988888763    334666777777777776666


Q ss_pred             cccChHHHHHHHHHHHHH
Q psy2692         109 VVTTKKEIEEAKQIINIC  126 (133)
Q Consensus       109 vv~~~~e~~e~~eli~ic  126 (133)
                      --.+...-..+.++.+..
T Consensus        77 N~~~s~~~~~A~~L~~~~   94 (98)
T cd05513          77 NKPDTIYYKAAKKLLHSG   94 (98)
T ss_pred             CCCCCHHHHHHHHHHHhh
Confidence            656655556666665543


No 155
>PF03564 DUF1759:  Protein of unknown function (DUF1759);  InterPro: IPR005312 This is a small family of proteins of unknown function. 
Probab=20.15  E-value=71  Score=22.74  Aligned_cols=80  Identities=18%  Similarity=0.302  Sum_probs=42.2

Q ss_pred             hccCHHHHHHHHHhhcCcccccccHHHHHHHHhhhhhhccCCCCCCCccccCCCCCCCChHHHHHHHHHHHHhccc-CCH
Q psy2692          13 KSSSFETAFKLLHDQIGVKNFLPYRQMFLNSYLQSRTAYSTLPLLAPVTAFPTRKVDRSSIDLIAKIQQCYQLTTS-GKF   91 (133)
Q Consensus        13 AAGsFe~A~~LLnrQiGivNf~PLk~~F~~~y~~s~~~~~~~p~lp~l~~~~~rn~~~~l~~L~~~Lk~gyk~~t~-gKF   91 (133)
                      ...+|+.|++.|++++|=-  .-    ..+.+...-..+|.+.          .+.+..+..+.+.++......+. |.=
T Consensus        52 ~~~~Y~~a~~~L~~~yg~~--~~----i~~~~~~~l~~l~~~~----------~~d~~~L~~~~~~v~~~i~~L~~lg~~  115 (145)
T PF03564_consen   52 SEENYEEAWELLEERYGNP--RR----IIQALLEELRNLPPIS----------NDDPEALRSLVDKVNNCIRALKALGVN  115 (145)
T ss_pred             cchhhHHHHHHHHHHhCCc--hH----HHHHHHHHHhcccccc----------chhHHHHHHHHHHHHHHHHHHHHcCCC
Confidence            4688999999999999832  22    2223333333333311          11133566666666665554443 443


Q ss_pred             HHHHHHHHHHHHHcccc
Q psy2692          92 VDAIEKLRNVLICIPLL  108 (133)
Q Consensus        92 ~eAl~~Fr~iL~~i~l~  108 (133)
                      .+.-....-|+..+|.-
T Consensus       116 ~~~~~l~~~i~~KLp~~  132 (145)
T PF03564_consen  116 VDDPLLISIILSKLPPE  132 (145)
T ss_pred             CCCHHHHHHHHHHCCHH
Confidence            33333445555666653


No 156
>PF04184 ST7:  ST7 protein;  InterPro: IPR007311 The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain.
Probab=20.01  E-value=4.8e+02  Score=24.03  Aligned_cols=27  Identities=26%  Similarity=0.395  Sum_probs=22.3

Q ss_pred             HHHhcccCCHHHHHHHHHHHHHHcccc
Q psy2692          82 CYQLTTSGKFVDAIEKLRNVLICIPLL  108 (133)
Q Consensus        82 gyk~~t~gKF~eAl~~Fr~iL~~i~l~  108 (133)
                      |--+-+.||-.||++.||+++.-.|..
T Consensus       266 AmCarklGr~~EAIk~~rdLlke~p~~  292 (539)
T PF04184_consen  266 AMCARKLGRLREAIKMFRDLLKEFPNL  292 (539)
T ss_pred             HHHHHHhCChHHHHHHHHHHHhhCCcc
Confidence            334448999999999999999988864


Done!