Query         psy2980
Match_columns 92
No_of_seqs    115 out of 1475
Neff          10.9
Searched_HMMs 46136
Date          Fri Aug 16 22:29:15 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy2980.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2980hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4441|consensus               99.9 1.5E-22 3.3E-27  129.6   8.7   90    1-92    385-475 (571)
  2 KOG4441|consensus               99.9 2.6E-22 5.5E-27  128.5   8.7   91    1-92    337-427 (571)
  3 PHA02713 hypothetical protein;  99.9 9.8E-22 2.1E-26  125.8  10.8   91    1-92    308-398 (557)
  4 PHA02713 hypothetical protein;  99.9 8.4E-21 1.8E-25  121.6  10.1   81   12-92    431-512 (557)
  5 PHA03098 kelch-like protein; P  99.8   8E-20 1.7E-24  116.5  10.2   90    1-91    299-388 (534)
  6 PHA03098 kelch-like protein; P  99.8 1.5E-19 3.3E-24  115.3  10.0   90    1-91    347-436 (534)
  7 PHA02790 Kelch-like protein; P  99.8 2.6E-19 5.7E-24  113.2  10.5   85    1-92    323-407 (480)
  8 PHA02790 Kelch-like protein; P  99.8 6.4E-19 1.4E-23  111.4  10.8   87    1-92    276-362 (480)
  9 PLN02153 epithiospecifier prot  99.8 2.4E-18 5.2E-23  104.9  11.0   90    1-91     90-192 (341)
 10 TIGR03548 mutarot_permut cycli  99.8 2.4E-18 5.3E-23  104.2  10.5   89    1-91     77-170 (323)
 11 PLN02153 epithiospecifier prot  99.8 4.4E-18 9.6E-23  103.7  10.7   90    1-91     37-136 (341)
 12 PLN02193 nitrile-specifier pro  99.8   1E-17 2.2E-22  105.8  11.3   89    1-91    233-327 (470)
 13 TIGR03547 muta_rot_YjhT mutatr  99.8 6.2E-18 1.4E-22  103.1   9.8   91    1-91     68-199 (346)
 14 PF13964 Kelch_6:  Kelch motif   99.7 1.9E-17   4E-22   75.5   6.6   50   34-83      1-50  (50)
 15 PLN02193 nitrile-specifier pro  99.7 6.5E-17 1.4E-21  102.2  11.1   90    1-91    180-277 (470)
 16 PRK14131 N-acetylneuraminic ac  99.7 5.7E-17 1.2E-21  100.0   9.8   91    1-91     89-220 (376)
 17 KOG0379|consensus               99.7 5.3E-17 1.1E-21  102.8   9.5   92    1-92    127-224 (482)
 18 TIGR03548 mutarot_permut cycli  99.7 8.4E-17 1.8E-21   97.5   9.7   77    1-80    128-205 (323)
 19 TIGR03547 muta_rot_YjhT mutatr  99.7 5.6E-16 1.2E-20   94.6  10.0   84    1-89     22-114 (346)
 20 PF01344 Kelch_1:  Kelch motif;  99.6 7.8E-16 1.7E-20   69.2   4.6   47   34-80      1-47  (47)
 21 KOG0379|consensus               99.6 1.1E-14 2.4E-19   92.3   9.2   91    1-91    178-274 (482)
 22 PRK14131 N-acetylneuraminic ac  99.6 5.9E-15 1.3E-19   91.2   7.2   92    1-92    226-343 (376)
 23 KOG4693|consensus               99.6 4.8E-15   1E-19   87.1   5.0   91    1-91     93-190 (392)
 24 PF13415 Kelch_3:  Galactose ox  99.6 2.4E-14 5.2E-19   64.9   6.2   48   44-91      1-49  (49)
 25 KOG4693|consensus               99.6 2.2E-14 4.7E-19   84.4   7.3   91    1-91    196-301 (392)
 26 PF07646 Kelch_2:  Kelch motif;  99.5 1.3E-13 2.9E-18   62.4   6.3   47   34-80      1-49  (49)
 27 PF13418 Kelch_4:  Galactose ox  99.5 5.6E-14 1.2E-18   63.6   4.4   47   34-80      1-48  (49)
 28 smart00612 Kelch Kelch domain.  99.5 1.2E-13 2.7E-18   61.7   4.7   46   46-92      1-46  (47)
 29 KOG1230|consensus               99.4 7.5E-13 1.6E-17   81.6   7.0   78   13-90    154-239 (521)
 30 KOG1230|consensus               99.4   1E-12 2.2E-17   81.0   7.0   79   13-91     98-186 (521)
 31 smart00612 Kelch Kelch domain.  99.2 2.2E-11 4.7E-16   54.3   4.1   44    1-45      4-47  (47)
 32 PF13415 Kelch_3:  Galactose ox  99.2 2.5E-11 5.3E-16   54.9   4.1   43    1-43      6-49  (49)
 33 KOG4152|consensus               99.2 1.2E-10 2.6E-15   74.0   6.8   90    1-92    220-332 (830)
 34 PF13964 Kelch_6:  Kelch motif   99.2 4.5E-11 9.9E-16   54.2   3.4   35    1-35     16-50  (50)
 35 PF13854 Kelch_5:  Kelch motif   99.1 3.8E-10 8.1E-15   49.4   5.5   40   31-70      1-41  (42)
 36 PLN02772 guanylate kinase       99.0 6.4E-09 1.4E-13   64.7   7.7   60   33-92     23-85  (398)
 37 COG3055 Uncharacterized protei  98.9 3.4E-09 7.4E-14   64.6   5.4   90    1-90     97-225 (381)
 38 PF01344 Kelch_1:  Kelch motif;  98.8 1.5E-09 3.3E-14   48.4   1.7   32    1-32     16-47  (47)
 39 KOG4152|consensus               98.8 1.6E-09 3.5E-14   69.1   2.1   89    1-91     47-145 (830)
 40 COG3055 Uncharacterized protei  98.7 4.5E-08 9.7E-13   59.8   5.9   75   15-89     60-142 (381)
 41 PF13418 Kelch_4:  Galactose ox  98.7 7.4E-09 1.6E-13   46.6   1.2   32    1-32     17-48  (49)
 42 PF07646 Kelch_2:  Kelch motif;  98.5 3.2E-07   7E-12   41.2   3.3   32    1-32     16-49  (49)
 43 PLN02772 guanylate kinase       98.4 1.8E-06 3.9E-11   54.1   7.2   54    1-54     39-96  (398)
 44 KOG2437|consensus               98.3 2.5E-07 5.3E-12   59.2   0.9   90    1-91    277-382 (723)
 45 KOG2437|consensus               97.7 0.00013 2.8E-09   47.3   5.7   65   13-77    346-420 (723)
 46 TIGR01640 F_box_assoc_1 F-box   97.3   0.004 8.7E-08   36.4   8.1   68   13-81     70-138 (230)
 47 PF12768 Rax2:  Cortical protei  97.0  0.0067 1.5E-07   36.9   7.1   76    2-78      3-81  (281)
 48 PF07250 Glyoxal_oxid_N:  Glyox  96.1   0.044 9.5E-07   32.8   6.2   74   14-90     47-125 (243)
 49 PF07893 DUF1668:  Protein of u  95.9    0.11 2.5E-06   32.5   7.4   71   14-84     87-171 (342)
 50 PF07250 Glyoxal_oxid_N:  Glyox  95.6   0.039 8.5E-07   33.0   4.6   50    2-54     83-138 (243)
 51 TIGR01640 F_box_assoc_1 F-box   95.1    0.33 7.2E-06   28.4   7.3   65   13-78     14-87  (230)
 52 PF03089 RAG2:  Recombination a  93.1    0.37   8E-06   29.8   4.7   54    1-54     43-111 (337)
 53 PF07893 DUF1668:  Protein of u  93.0    0.84 1.8E-05   28.7   6.4   54   22-81    157-221 (342)
 54 PF08268 FBA_3:  F-box associat  91.8     1.2 2.5E-05   23.8   8.3   66   11-76     18-87  (129)
 55 PF05096 Glu_cyclase_2:  Glutam  88.6       4 8.6E-05   25.0   6.8   52   13-70     68-119 (264)
 56 COG4257 Vgb Streptogramin lyas  88.3     2.8   6E-05   26.2   5.3   59   13-77    254-313 (353)
 57 smart00284 OLF Olfactomedin-li  80.8       7 0.00015   23.9   4.7   45   31-81     70-114 (255)
 58 PF03089 RAG2:  Recombination a  77.8     5.6 0.00012   24.9   3.7   41   14-54    132-174 (337)
 59 KOG2321|consensus               76.9       7 0.00015   26.8   4.2   53   12-72    154-208 (703)
 60 PF02191 OLF:  Olfactomedin-lik  74.4     9.7 0.00021   23.1   4.1   46   30-81     64-109 (250)
 61 PF06433 Me-amine-dh_H:  Methyl  73.4      14  0.0003   23.7   4.7   56   10-73    266-324 (342)
 62 KOG0316|consensus               71.4      18 0.00038   22.3   4.6   52   14-71    124-175 (307)
 63 PF03178 CPSF_A:  CPSF A subuni  68.3      26 0.00056   21.7   6.6   71   13-88    107-178 (321)
 64 PF14830 Haemocyan_bet_s:  Haem  65.3     8.5 0.00018   20.2   2.2   28   42-69     34-61  (103)
 65 PF14157 YmzC:  YmzC-like prote  61.5      17 0.00037   17.2   2.7   15   14-28     42-56  (63)
 66 PF05096 Glu_cyclase_2:  Glutam  61.3      37 0.00079   21.0   6.3   50   38-91     48-98  (264)
 67 PF07443 HARP:  HepA-related pr  59.2     4.7  0.0001   18.6   0.6   13   16-28     21-33  (55)
 68 PF11958 DUF3472:  Domain of un  58.1      35 0.00075   19.7   4.1   23   14-36     81-103 (184)
 69 PF12768 Rax2:  Cortical protei  58.0      43 0.00094   20.8   6.1   63   11-78     62-130 (281)
 70 PF06058 DCP1:  Dcp1-like decap  57.4      15 0.00033   19.7   2.5   17   60-76     28-44  (122)
 71 KOG2394|consensus               55.0      18  0.0004   24.8   2.9   13   43-55    342-354 (636)
 72 cd01206 Homer Homer type EVH1   53.9      33 0.00072   18.3   5.1   20   11-30      9-29  (111)
 73 PF08450 SGL:  SMP-30/Gluconola  51.8      48   0.001   19.5   7.5   56   12-79     21-78  (246)
 74 PF03178 CPSF_A:  CPSF A subuni  49.7      61  0.0013   20.1   6.0   60   13-80     62-127 (321)
 75 PF03088 Str_synth:  Strictosid  48.0      26 0.00056   17.8   2.3   20   10-29     34-53  (89)
 76 COG2380 Uncharacterized protei  45.9      28  0.0006   21.7   2.5   21   11-31     20-40  (327)
 77 cd01207 Ena-Vasp Enabled-VASP-  45.1      49  0.0011   17.6   5.6   18   13-30      9-26  (111)
 78 PRK11138 outer membrane biogen  42.3      91   0.002   20.0   7.4   57   13-76     79-147 (394)
 79 PRK11408 hypothetical protein;  40.6      68  0.0015   18.0   3.4   30   47-77     34-63  (145)
 80 PF13570 PQQ_3:  PQQ-like domai  40.4      31 0.00067   14.0   4.8   26   38-70     15-40  (40)
 81 PF07734 FBA_1:  F-box associat  40.1      67  0.0015   17.8   4.7   40   41-81      2-41  (164)
 82 PF03835 Rad4:  Rad4 transgluta  39.5      46 0.00099   18.2   2.6   21   12-32     36-56  (145)
 83 PF10902 DUF2693:  Protein of u  38.2      46   0.001   16.7   2.2   20   57-76     45-64  (83)
 84 PRK09511 nirD nitrite reductas  37.7      55  0.0012   17.0   2.7   32   16-49     75-106 (108)
 85 PF08662 eIF2A:  Eukaryotic tra  36.5      87  0.0019   18.0   4.2   25   44-71    111-135 (194)
 86 PF00614 PLDc:  Phospholipase D  35.7      35 0.00075   13.2   1.4   16   39-54      8-23  (28)
 87 KOG2055|consensus               34.5 1.5E+02  0.0033   20.2   4.9   27   47-79    272-298 (514)
 88 KOG2958|consensus               33.0      35 0.00076   21.6   1.6   20   11-30      9-28  (354)
 89 PRK00178 tolB translocation pr  33.0 1.4E+02   0.003   19.4   7.2   57   14-76    268-326 (430)
 90 PRK05137 tolB translocation pr  32.9 1.4E+02  0.0031   19.5   7.5   60   13-77    226-286 (435)
 91 PRK04792 tolB translocation pr  32.8 1.5E+02  0.0032   19.6   7.2   59   13-77    242-302 (448)
 92 PF09272 Hepsin-SRCR:  Hepsin,   31.7      70  0.0015   16.9   2.3   20   60-79     11-30  (110)
 93 PF14298 DUF4374:  Domain of un  31.5 1.7E+02  0.0036   19.8   6.8   61   12-74    366-429 (435)
 94 cd00837 EVH1 EVH1 (Enabled, Va  31.5      62  0.0013   16.7   2.2   17   60-76      8-24  (104)
 95 KOG0179|consensus               31.2      35 0.00075   20.5   1.3   19    1-21    129-147 (235)
 96 PF13952 DUF4216:  Domain of un  30.8      75  0.0016   15.6   3.6   19   57-75     54-73  (75)
 97 cd03476 Rieske_ArOX_small Smal  30.6      95  0.0021   16.7   4.3   35   16-51     78-115 (126)
 98 TIGR02800 propeller_TolB tol-p  30.3 1.5E+02  0.0033   18.9   7.6   60   13-77    214-274 (417)
 99 PF08950 DUF1861:  Protein of u  30.0 1.5E+02  0.0032   18.8   4.1   42   11-53     52-95  (298)
100 cd03529 Rieske_NirD Assimilato  29.6      86  0.0019   15.9   2.8    6   18-23     73-78  (103)
101 TIGR03074 PQQ_membr_DH membran  29.1 2.3E+02   0.005   20.7   7.9   58   13-76    641-719 (764)
102 PF05237 MoeZ_MoeB:  MoeZ/MoeB   29.1      81  0.0018   15.5   2.8   19   12-30     53-71  (84)
103 PF00568 WH1:  WH1 domain;  Int  29.0      88  0.0019   16.3   2.6   16   13-28     16-32  (111)
104 KOG1898|consensus               28.6 2.8E+02   0.006   21.4   6.0   31   60-90    997-1027(1205)
105 PF04096 Nucleoporin2:  Nucleop  28.0      86  0.0019   17.4   2.5   16   12-27    121-136 (141)
106 KOG0294|consensus               27.6 1.8E+02  0.0039   18.9   4.8   27   39-71     47-73  (362)
107 PF10946 DUF2625:  Protein of u  27.3 1.3E+02  0.0028   18.0   3.2   30   47-77    111-140 (208)
108 PF10282 Lactonase:  Lactonase,  27.3 1.7E+02  0.0036   18.5   6.6   76    2-80      4-83  (345)
109 PF09910 DUF2139:  Uncharacteri  27.2 1.8E+02  0.0039   18.8   6.5   17   57-73    169-185 (339)
110 PF14741 GH114_assoc:  N-termin  27.1   1E+02  0.0022   17.0   2.6   20   14-33     64-84  (129)
111 TIGR02377 MocE_fam_FeS Rieske   27.0      98  0.0021   15.7   3.4   13   15-27     26-38  (101)
112 PRK01029 tolB translocation pr  26.5   2E+02  0.0042   19.0   7.3   59   14-78    352-412 (428)
113 KOG2111|consensus               26.4 1.9E+02  0.0041   18.8   6.5   51    1-51     63-119 (346)
114 PF13360 PQQ_2:  PQQ-like domai  26.1 1.4E+02   0.003   17.2   8.3   55   14-77      4-64  (238)
115 PF03022 MRJP:  Major royal jel  25.4 1.8E+02  0.0038   18.1   9.3   62   12-79     33-107 (287)
116 TIGR02378 nirD_assim_sml nitri  24.5 1.1E+02  0.0024   15.5   2.7    7   17-23     73-79  (105)
117 PF07453 NUMOD1:  NUMOD1 domain  24.1      68  0.0015   12.8   1.9   12   14-25      2-13  (37)
118 PF09816 EAF:  RNA polymerase I  23.9 1.2E+02  0.0027   15.8   3.5   28   47-74     65-92  (109)
119 PRK09965 3-phenylpropionate di  23.2 1.2E+02  0.0026   15.5   3.7   10   40-49     90-99  (106)
120 TIGR02658 TTQ_MADH_Hv methylam  22.8 2.3E+02  0.0049   18.5   6.0   54   11-72    277-333 (352)
121 cd03583 NTR_complement_C3 NTR/  22.8 1.5E+02  0.0034   16.5   3.1   36   44-79     86-121 (149)
122 PF12904 Collagen_bind_2:  Puta  22.7 1.3E+02  0.0027   15.5   2.5   18   13-30     49-66  (93)
123 TIGR03352 VI_chp_3 type VI sec  22.6 1.4E+02   0.003   16.6   2.7   23   63-85    113-135 (146)
124 cd03584 NTR_complement_C4 NTR/  22.5 1.6E+02  0.0034   16.5   3.2   36   44-79     90-125 (153)
125 PF15232 DUF4585:  Domain of un  22.4 1.1E+02  0.0024   15.1   2.0   15   16-30     32-46  (75)
126 cd03548 Rieske_RO_Alpha_OMO_CA  22.3 1.5E+02  0.0032   16.1   4.5   37   16-52     81-124 (136)
127 KOG4499|consensus               21.9 2.2E+02  0.0047   17.9   4.1   39   11-51    231-273 (310)
128 PRK04043 tolB translocation pr  21.8 2.5E+02  0.0054   18.6   7.2   63   13-80    213-276 (419)
129 TIGR03032 conserved hypothetic  21.2 2.5E+02  0.0054   18.3   4.0   33   42-80    210-242 (335)
130 PRK04922 tolB translocation pr  21.1 2.5E+02  0.0055   18.4   7.2   58   14-76    229-287 (433)
131 KOG2445|consensus               21.0 2.5E+02  0.0054   18.3   4.7   22   10-31    198-219 (361)
132 PF15525 DUF4652:  Domain of un  20.9   2E+02  0.0044   17.1   8.2   68   10-79     85-158 (200)
133 PF08309 LVIVD:  LVIVD repeat;   20.4      95  0.0021   13.2   4.5   26   37-69      4-29  (42)

No 1  
>KOG4441|consensus
Probab=99.88  E-value=1.5e-22  Score=129.56  Aligned_cols=90  Identities=33%  Similarity=0.760  Sum_probs=81.2

Q ss_pred             CccccCC-CCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCC
Q psy2980           1 MLRGWHA-STENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHL   79 (92)
Q Consensus         1 v~GG~~~-~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~   79 (92)
                      ++||.++ ..+++  +++|||.+++|..+++|+.+|.++++++++++||++||.++....++++++|||.+++|+.+++|
T Consensus       385 avGG~dg~~~l~s--vE~YDp~~~~W~~va~m~~~r~~~gv~~~~g~iYi~GG~~~~~~~l~sve~YDP~t~~W~~~~~M  462 (571)
T KOG4441|consen  385 AVGGFDGEKSLNS--VECYDPVTNKWTPVAPMLTRRSGHGVAVLGGKLYIIGGGDGSSNCLNSVECYDPETNTWTLIAPM  462 (571)
T ss_pred             EEecccccccccc--EEEecCCCCcccccCCCCcceeeeEEEEECCEEEEEcCcCCCccccceEEEEcCCCCceeecCCc
Confidence            4799883 34555  99999999999999999999999999999999999999987444899999999999999999999


Q ss_pred             CCCccceeeEEcC
Q psy2980          80 PSARSWLGCVPLQ   92 (92)
Q Consensus        80 ~~~~~~~~~~~~~   92 (92)
                      +.+|..+++++++
T Consensus       463 ~~~R~~~g~a~~~  475 (571)
T KOG4441|consen  463 NTRRSGFGVAVLN  475 (571)
T ss_pred             ccccccceEEEEC
Confidence            9999999998764


No 2  
>KOG4441|consensus
Probab=99.88  E-value=2.6e-22  Score=128.54  Aligned_cols=91  Identities=33%  Similarity=0.644  Sum_probs=82.4

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP   80 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~   80 (92)
                      ++||.+......+.+++||+.+++|..+++|+.+|..++++++++.||++||.++ ...++++++|||.+++|..+++|+
T Consensus       337 v~GG~~~~~~~l~~ve~YD~~~~~W~~~a~M~~~R~~~~v~~l~g~iYavGG~dg-~~~l~svE~YDp~~~~W~~va~m~  415 (571)
T KOG4441|consen  337 VVGGYDSGSDRLSSVERYDPRTNQWTPVAPMNTKRSDFGVAVLDGKLYAVGGFDG-EKSLNSVECYDPVTNKWTPVAPML  415 (571)
T ss_pred             EEccccCCCcccceEEEecCCCCceeccCCccCccccceeEEECCEEEEEecccc-ccccccEEEecCCCCcccccCCCC
Confidence            5799874344555699999999999999999999999999999999999999986 677999999999999999999999


Q ss_pred             CCccceeeEEcC
Q psy2980          81 SARSWLGCVPLQ   92 (92)
Q Consensus        81 ~~~~~~~~~~~~   92 (92)
                      .+|..+++++++
T Consensus       416 ~~r~~~gv~~~~  427 (571)
T KOG4441|consen  416 TRRSGHGVAVLG  427 (571)
T ss_pred             cceeeeEEEEEC
Confidence            999999998864


No 3  
>PHA02713 hypothetical protein; Provisional
Probab=99.88  E-value=9.8e-22  Score=125.79  Aligned_cols=91  Identities=31%  Similarity=0.586  Sum_probs=79.5

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP   80 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~   80 (92)
                      |+||.+......+++++||+.+++|..+++||.+|..+++++++++||++||.++ ....+++++|||.+++|..+++||
T Consensus       308 viGG~~~~~~~~~~v~~Yd~~~n~W~~~~~m~~~R~~~~~~~~~g~IYviGG~~~-~~~~~sve~Ydp~~~~W~~~~~mp  386 (557)
T PHA02713        308 IAGGYNFNNPSLNKVYKINIENKIHVELPPMIKNRCRFSLAVIDDTIYAIGGQNG-TNVERTIECYTMGDDKWKMLPDMP  386 (557)
T ss_pred             EEcCCCCCCCccceEEEEECCCCeEeeCCCCcchhhceeEEEECCEEEEECCcCC-CCCCceEEEEECCCCeEEECCCCC
Confidence            5788753333456799999999999999999999999999999999999999865 445789999999999999999999


Q ss_pred             CCccceeeEEcC
Q psy2980          81 SARSWLGCVPLQ   92 (92)
Q Consensus        81 ~~~~~~~~~~~~   92 (92)
                      .+|..+++++++
T Consensus       387 ~~r~~~~~~~~~  398 (557)
T PHA02713        387 IALSSYGMCVLD  398 (557)
T ss_pred             cccccccEEEEC
Confidence            999999888764


No 4  
>PHA02713 hypothetical protein; Provisional
Probab=99.85  E-value=8.4e-21  Score=121.55  Aligned_cols=81  Identities=23%  Similarity=0.384  Sum_probs=72.1

Q ss_pred             cceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCC-CceEEcCCCCCCccceeeEE
Q psy2980          12 TNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDN-DSWEIMSHLPSARSWLGCVP   90 (92)
Q Consensus        12 ~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~-~~W~~~~~~~~~~~~~~~~~   90 (92)
                      .+++++|||.+++|..+++|+.+|..+++++++++||++||.++.....+.+++|||.+ ++|..+++||.+|..+++++
T Consensus       431 ~~~ve~YDP~td~W~~v~~m~~~r~~~~~~~~~~~IYv~GG~~~~~~~~~~ve~Ydp~~~~~W~~~~~m~~~r~~~~~~~  510 (557)
T PHA02713        431 SNKVIRYDTVNNIWETLPNFWTGTIRPGVVSHKDDIYVVCDIKDEKNVKTCIFRYNTNTYNGWELITTTESRLSALHTIL  510 (557)
T ss_pred             cceEEEECCCCCeEeecCCCCcccccCcEEEECCEEEEEeCCCCCCccceeEEEecCCCCCCeeEccccCcccccceeEE
Confidence            45699999999999999999999999999999999999999864333346789999999 89999999999999999988


Q ss_pred             cC
Q psy2980          91 LQ   92 (92)
Q Consensus        91 ~~   92 (92)
                      ++
T Consensus       511 ~~  512 (557)
T PHA02713        511 HD  512 (557)
T ss_pred             EC
Confidence            64


No 5  
>PHA03098 kelch-like protein; Provisional
Probab=99.83  E-value=8e-20  Score=116.54  Aligned_cols=90  Identities=21%  Similarity=0.385  Sum_probs=80.0

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP   80 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~   80 (92)
                      |+||.+......+++++||+.+++|..+++||.+|..+++++++++||++||... ....+++++||+.+++|+.+++||
T Consensus       299 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~R~~~~~~~~~~~lyv~GG~~~-~~~~~~v~~yd~~~~~W~~~~~lp  377 (534)
T PHA03098        299 FIGGMNKNNLSVNSVVSYDTKTKSWNKVPELIYPRKNPGVTVFNNRIYVIGGIYN-SISLNTVESWKPGESKWREEPPLI  377 (534)
T ss_pred             EECCCcCCCCeeccEEEEeCCCCeeeECCCCCcccccceEEEECCEEEEEeCCCC-CEecceEEEEcCCCCceeeCCCcC
Confidence            5788875555556799999999999999999999999999999999999999874 556889999999999999999999


Q ss_pred             CCccceeeEEc
Q psy2980          81 SARSWLGCVPL   91 (92)
Q Consensus        81 ~~~~~~~~~~~   91 (92)
                      .+|..++++++
T Consensus       378 ~~r~~~~~~~~  388 (534)
T PHA03098        378 FPRYNPCVVNV  388 (534)
T ss_pred             cCCccceEEEE
Confidence            99999888765


No 6  
>PHA03098 kelch-like protein; Provisional
Probab=99.82  E-value=1.5e-19  Score=115.27  Aligned_cols=90  Identities=24%  Similarity=0.477  Sum_probs=77.9

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP   80 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~   80 (92)
                      ++||.+. ....+++++||+.+++|..++++|.+|..+++++++++||++||.......++++++||+.+++|..++++|
T Consensus       347 v~GG~~~-~~~~~~v~~yd~~~~~W~~~~~lp~~r~~~~~~~~~~~iYv~GG~~~~~~~~~~v~~yd~~t~~W~~~~~~p  425 (534)
T PHA03098        347 VIGGIYN-SISLNTVESWKPGESKWREEPPLIFPRYNPCVVNVNNLIYVIGGISKNDELLKTVECFSLNTNKWSKGSPLP  425 (534)
T ss_pred             EEeCCCC-CEecceEEEEcCCCCceeeCCCcCcCCccceEEEECCEEEEECCcCCCCcccceEEEEeCCCCeeeecCCCC
Confidence            4788763 234556999999999999999999999999999999999999997543556789999999999999999999


Q ss_pred             CCccceeeEEc
Q psy2980          81 SARSWLGCVPL   91 (92)
Q Consensus        81 ~~~~~~~~~~~   91 (92)
                      .+|..++++++
T Consensus       426 ~~r~~~~~~~~  436 (534)
T PHA03098        426 ISHYGGCAIYH  436 (534)
T ss_pred             ccccCceEEEE
Confidence            99998887765


No 7  
>PHA02790 Kelch-like protein; Provisional
Probab=99.82  E-value=2.6e-19  Score=113.16  Aligned_cols=85  Identities=19%  Similarity=0.341  Sum_probs=73.5

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP   80 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~   80 (92)
                      ++||.+.  .  +.+++||+.+++|..+++||.+|..+++++++++||++||...   ..+.+++|||.+++|+.+++|+
T Consensus       323 viGG~~~--~--~sve~ydp~~n~W~~~~~l~~~r~~~~~~~~~g~IYviGG~~~---~~~~ve~ydp~~~~W~~~~~m~  395 (480)
T PHA02790        323 VVGGLPN--P--TSVERWFHGDAAWVNMPSLLKPRCNPAVASINNVIYVIGGHSE---TDTTTEYLLPNHDQWQFGPSTY  395 (480)
T ss_pred             EECCcCC--C--CceEEEECCCCeEEECCCCCCCCcccEEEEECCEEEEecCcCC---CCccEEEEeCCCCEEEeCCCCC
Confidence            4688642  2  3499999999999999999999999999999999999999753   2367899999999999999999


Q ss_pred             CCccceeeEEcC
Q psy2980          81 SARSWLGCVPLQ   92 (92)
Q Consensus        81 ~~~~~~~~~~~~   92 (92)
                      .+|..+++++++
T Consensus       396 ~~r~~~~~~~~~  407 (480)
T PHA02790        396 YPHYKSCALVFG  407 (480)
T ss_pred             CccccceEEEEC
Confidence            999998887653


No 8  
>PHA02790 Kelch-like protein; Provisional
Probab=99.81  E-value=6.4e-19  Score=111.41  Aligned_cols=87  Identities=23%  Similarity=0.397  Sum_probs=75.3

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP   80 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~   80 (92)
                      ++||.+. ....+.+++||+.+++|..+++|+.+|..+++++++++||++||.+.    .+++++|||.+++|..+++||
T Consensus       276 viGG~~~-~~~~~~v~~Ydp~~~~W~~~~~m~~~r~~~~~v~~~~~iYviGG~~~----~~sve~ydp~~n~W~~~~~l~  350 (480)
T PHA02790        276 LIGGWMN-NEIHNNAIAVNYISNNWIPIPPMNSPRLYASGVPANNKLYVVGGLPN----PTSVERWFHGDAAWVNMPSLL  350 (480)
T ss_pred             EEcCCCC-CCcCCeEEEEECCCCEEEECCCCCchhhcceEEEECCEEEEECCcCC----CCceEEEECCCCeEEECCCCC
Confidence            4788753 23445699999999999999999999999999999999999999753    367899999999999999999


Q ss_pred             CCccceeeEEcC
Q psy2980          81 SARSWLGCVPLQ   92 (92)
Q Consensus        81 ~~~~~~~~~~~~   92 (92)
                      .+|..+++++++
T Consensus       351 ~~r~~~~~~~~~  362 (480)
T PHA02790        351 KPRCNPAVASIN  362 (480)
T ss_pred             CCCcccEEEEEC
Confidence            999998887753


No 9  
>PLN02153 epithiospecifier protein
Probab=99.79  E-value=2.4e-18  Score=104.88  Aligned_cols=90  Identities=19%  Similarity=0.389  Sum_probs=72.7

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeCCC-----CCCcccceEEEECCEEEEEcCcCCCC-----CccceEEEEeCCC
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKSPM-----KEKRYRPGIAVIDGKIYVLGGEEGWD-----GYHDSIECYDVDN   70 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~-----~~~~~~~~~~~~~~~iy~~GG~~~~~-----~~~~~~~~y~~~~   70 (92)
                      ||||.+... ..+++++||+.+++|..++++     |.+|..+++++.+++||++||.....     ..++++++||+.+
T Consensus        90 v~GG~~~~~-~~~~v~~yd~~t~~W~~~~~~~~~~~p~~R~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~v~~yd~~~  168 (341)
T PLN02153         90 IFGGRDEKR-EFSDFYSYDTVKNEWTFLTKLDEEGGPEARTFHSMASDENHVYVFGGVSKGGLMKTPERFRTIEAYNIAD  168 (341)
T ss_pred             EECCCCCCC-ccCcEEEEECCCCEEEEeccCCCCCCCCCceeeEEEEECCEEEEECCccCCCccCCCcccceEEEEECCC
Confidence            589976432 345699999999999999876     78899999999999999999985311     2357899999999


Q ss_pred             CceEEcCCCC---CCccceeeEEc
Q psy2980          71 DSWEIMSHLP---SARSWLGCVPL   91 (92)
Q Consensus        71 ~~W~~~~~~~---~~~~~~~~~~~   91 (92)
                      ++|..++++.   .+|..++++++
T Consensus       169 ~~W~~l~~~~~~~~~r~~~~~~~~  192 (341)
T PLN02153        169 GKWVQLPDPGENFEKRGGAGFAVV  192 (341)
T ss_pred             CeEeeCCCCCCCCCCCCcceEEEE
Confidence            9999998764   67888877654


No 10 
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=99.79  E-value=2.4e-18  Score=104.17  Aligned_cols=89  Identities=20%  Similarity=0.285  Sum_probs=74.2

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeE----EeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEc
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTW----EYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIM   76 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W----~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~   76 (92)
                      ++||.+.. ...+++++||+.+++|    ..++++|.+|..+++++++++||++||... ....+++++||+.+++|+++
T Consensus        77 viGG~~~~-~~~~~v~~~d~~~~~w~~~~~~~~~lp~~~~~~~~~~~~~~iYv~GG~~~-~~~~~~v~~yd~~~~~W~~~  154 (323)
T TIGR03548        77 YIGGSNSS-ERFSSVYRITLDESKEELICETIGNLPFTFENGSACYKDGTLYVGGGNRN-GKPSNKSYLFNLETQEWFEL  154 (323)
T ss_pred             EEcCCCCC-CCceeEEEEEEcCCceeeeeeEcCCCCcCccCceEEEECCEEEEEeCcCC-CccCceEEEEcCCCCCeeEC
Confidence            47887632 2456799999999987    778999999999999999999999999754 45579999999999999999


Q ss_pred             CCCCC-CccceeeEEc
Q psy2980          77 SHLPS-ARSWLGCVPL   91 (92)
Q Consensus        77 ~~~~~-~~~~~~~~~~   91 (92)
                      ++||. +|..+.++++
T Consensus       155 ~~~p~~~r~~~~~~~~  170 (323)
T TIGR03548       155 PDFPGEPRVQPVCVKL  170 (323)
T ss_pred             CCCCCCCCCcceEEEE
Confidence            99884 7777776654


No 11 
>PLN02153 epithiospecifier protein
Probab=99.78  E-value=4.4e-18  Score=103.70  Aligned_cols=90  Identities=17%  Similarity=0.380  Sum_probs=72.0

Q ss_pred             CccccCCC-CCCcceEEEeeCCCCeEEeeCCCC-CCc---ccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEE
Q psy2980           1 MLRGWHAS-TENTNRVECYHIAENTWEYKSPMK-EKR---YRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEI   75 (92)
Q Consensus         1 v~GG~~~~-~~~~~~v~~yd~~~~~W~~~~~~~-~~~---~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~   75 (92)
                      |+||.... ....+++++||+.+++|..+++++ .++   ..+++++++++||++||... ...++++++||+.+++|+.
T Consensus        37 v~GG~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyv~GG~~~-~~~~~~v~~yd~~t~~W~~  115 (341)
T PLN02153         37 SFGGELKPNEHIDKDLYVFDFNTHTWSIAPANGDVPRISCLGVRMVAVGTKLYIFGGRDE-KREFSDFYSYDTVKNEWTF  115 (341)
T ss_pred             EECCccCCCCceeCcEEEEECCCCEEEEcCccCCCCCCccCceEEEEECCEEEEECCCCC-CCccCcEEEEECCCCEEEE
Confidence            57887532 223457999999999999987653 344   36788899999999999865 4557899999999999999


Q ss_pred             cCCC-----CCCccceeeEEc
Q psy2980          76 MSHL-----PSARSWLGCVPL   91 (92)
Q Consensus        76 ~~~~-----~~~~~~~~~~~~   91 (92)
                      +++|     |.+|..|+++++
T Consensus       116 ~~~~~~~~~p~~R~~~~~~~~  136 (341)
T PLN02153        116 LTKLDEEGGPEARTFHSMASD  136 (341)
T ss_pred             eccCCCCCCCCCceeeEEEEE
Confidence            9877     788998888765


No 12 
>PLN02193 nitrile-specifier protein
Probab=99.77  E-value=1e-17  Score=105.77  Aligned_cols=89  Identities=21%  Similarity=0.376  Sum_probs=74.2

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeCCC---CCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKSPM---KEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMS   77 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~---~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~   77 (92)
                      |+||.+.. ...+++++||+.+++|.+++++   |.+|..+++++.+++||++||.+. ...+++++.||+.+++|+.++
T Consensus       233 vfGG~~~~-~~~ndv~~yD~~t~~W~~l~~~~~~P~~R~~h~~~~~~~~iYv~GG~~~-~~~~~~~~~yd~~t~~W~~~~  310 (470)
T PLN02193        233 VFGGRDAS-RQYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHSMAADEENVYVFGGVSA-TARLKTLDSYNIVDKKWFHCS  310 (470)
T ss_pred             EECCCCCC-CCCccEEEEECCCCEEEEcCcCCCCCCCccceEEEEECCEEEEECCCCC-CCCcceEEEEECCCCEEEeCC
Confidence            57887632 2345699999999999999877   789999999999999999999865 445789999999999999986


Q ss_pred             C---CCCCccceeeEEc
Q psy2980          78 H---LPSARSWLGCVPL   91 (92)
Q Consensus        78 ~---~~~~~~~~~~~~~   91 (92)
                      +   ++.+|..++++++
T Consensus       311 ~~~~~~~~R~~~~~~~~  327 (470)
T PLN02193        311 TPGDSFSIRGGAGLEVV  327 (470)
T ss_pred             CCCCCCCCCCCcEEEEE
Confidence            4   5678888877664


No 13 
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=99.77  E-value=6.2e-18  Score=103.14  Aligned_cols=91  Identities=15%  Similarity=0.226  Sum_probs=69.3

Q ss_pred             CccccCCCC-----CCcceEEEeeCCCCeEEeeC-CCCCCcccceEE-EECCEEEEEcCcCCCC----------------
Q psy2980           1 MLRGWHAST-----ENTNRVECYHIAENTWEYKS-PMKEKRYRPGIA-VIDGKIYVLGGEEGWD----------------   57 (92)
Q Consensus         1 v~GG~~~~~-----~~~~~v~~yd~~~~~W~~~~-~~~~~~~~~~~~-~~~~~iy~~GG~~~~~----------------   57 (92)
                      |+||.+...     ...+++++||+.+++|..++ .+|.++..++++ +++++||++||.+...                
T Consensus        68 v~GG~~~~~~~~~~~~~~~v~~Yd~~~~~W~~~~~~~p~~~~~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~  147 (346)
T TIGR03547        68 VFGGIGKANSEGSPQVFDDVYRYDPKKNSWQKLDTRSPVGLLGASGFSLHNGQAYFTGGVNKNIFDGYFADLSAADKDSE  147 (346)
T ss_pred             EEeCCCCCCCCCcceecccEEEEECCCCEEecCCCCCCCcccceeEEEEeCCEEEEEcCcChHHHHHHHhhHhhcCccch
Confidence            578875321     13456999999999999997 456666666665 6899999999975310                


Q ss_pred             -----------------CccceEEEEeCCCCceEEcCCCCC-CccceeeEEc
Q psy2980          58 -----------------GYHDSIECYDVDNDSWEIMSHLPS-ARSWLGCVPL   91 (92)
Q Consensus        58 -----------------~~~~~~~~y~~~~~~W~~~~~~~~-~~~~~~~~~~   91 (92)
                                       ...+++++|||.+++|+.+++||. +|..++++++
T Consensus       148 ~~~~~~~~~~~~~~~~~~~~~~v~~YDp~t~~W~~~~~~p~~~r~~~~~~~~  199 (346)
T TIGR03547       148 PKDKLIAAYFSQPPEDYFWNKNVLSYDPSTNQWRNLGENPFLGTAGSAIVHK  199 (346)
T ss_pred             hhhhhHHHHhCCChhHcCccceEEEEECCCCceeECccCCCCcCCCceEEEE
Confidence                             124789999999999999999996 6777777665


No 14 
>PF13964 Kelch_6:  Kelch motif
Probab=99.74  E-value=1.9e-17  Score=75.49  Aligned_cols=50  Identities=36%  Similarity=0.798  Sum_probs=46.0

Q ss_pred             CcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCCCCc
Q psy2980          34 KRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLPSAR   83 (92)
Q Consensus        34 ~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~~~~   83 (92)
                      +|..+++++++++||++||........+++++||+.+++|+++++||.+|
T Consensus         1 pR~~~s~v~~~~~iyv~GG~~~~~~~~~~v~~yd~~t~~W~~~~~mp~pR   50 (50)
T PF13964_consen    1 PRYGHSAVVVGGKIYVFGGYDNSGKYSNDVERYDPETNTWEQLPPMPTPR   50 (50)
T ss_pred             CCccCEEEEECCEEEEECCCCCCCCccccEEEEcCCCCcEEECCCCCCCC
Confidence            58889999999999999999765678999999999999999999999887


No 15 
>PLN02193 nitrile-specifier protein
Probab=99.74  E-value=6.5e-17  Score=102.18  Aligned_cols=90  Identities=14%  Similarity=0.397  Sum_probs=71.5

Q ss_pred             CccccCCCC-CCcceEEEeeCCCCeEEeeCC---CCC-CcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEE
Q psy2980           1 MLRGWHAST-ENTNRVECYHIAENTWEYKSP---MKE-KRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEI   75 (92)
Q Consensus         1 v~GG~~~~~-~~~~~v~~yd~~~~~W~~~~~---~~~-~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~   75 (92)
                      |+||..... ...+++++||+.+++|..++.   +|. .|..+++++++++||++||... ...++++++||+.+++|++
T Consensus       180 v~GG~~~~~~~~~~~v~~yD~~~~~W~~~~~~g~~P~~~~~~~~~v~~~~~lYvfGG~~~-~~~~ndv~~yD~~t~~W~~  258 (470)
T PLN02193        180 SFGGEFTPNQPIDKHLYVFDLETRTWSISPATGDVPHLSCLGVRMVSIGSTLYVFGGRDA-SRQYNGFYSFDTTTNEWKL  258 (470)
T ss_pred             EECCcCCCCCCeeCcEEEEECCCCEEEeCCCCCCCCCCcccceEEEEECCEEEEECCCCC-CCCCccEEEEECCCCEEEE
Confidence            578875322 233569999999999998864   333 2457788899999999999865 4568999999999999999


Q ss_pred             cCCC---CCCccceeeEEc
Q psy2980          76 MSHL---PSARSWLGCVPL   91 (92)
Q Consensus        76 ~~~~---~~~~~~~~~~~~   91 (92)
                      +.++   |.+|..|+++++
T Consensus       259 l~~~~~~P~~R~~h~~~~~  277 (470)
T PLN02193        259 LTPVEEGPTPRSFHSMAAD  277 (470)
T ss_pred             cCcCCCCCCCccceEEEEE
Confidence            9888   889999988765


No 16 
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=99.73  E-value=5.7e-17  Score=100.04  Aligned_cols=91  Identities=16%  Similarity=0.219  Sum_probs=69.5

Q ss_pred             CccccCC-C----CCCcceEEEeeCCCCeEEeeCC-CCCCcccceEEE-ECCEEEEEcCcCCC-----------------
Q psy2980           1 MLRGWHA-S----TENTNRVECYHIAENTWEYKSP-MKEKRYRPGIAV-IDGKIYVLGGEEGW-----------------   56 (92)
Q Consensus         1 v~GG~~~-~----~~~~~~v~~yd~~~~~W~~~~~-~~~~~~~~~~~~-~~~~iy~~GG~~~~-----------------   56 (92)
                      |+||... .    ....+++++||+.+++|..++. +|.++..+++++ .+++||++||....                 
T Consensus        89 V~GG~~~~~~~~~~~~~~~v~~YD~~~n~W~~~~~~~p~~~~~~~~~~~~~~~IYv~GG~~~~~~~~~~~d~~~~~~~~~  168 (376)
T PRK14131         89 VFGGIGKTNSEGSPQVFDDVYKYDPKTNSWQKLDTRSPVGLAGHVAVSLHNGKAYITGGVNKNIFDGYFEDLAAAGKDKT  168 (376)
T ss_pred             EEcCCCCCCCCCceeEcccEEEEeCCCCEEEeCCCCCCCcccceEEEEeeCCEEEEECCCCHHHHHHHHhhhhhcccchh
Confidence            5788753 1    1234679999999999999985 466666777666 89999999997521                 


Q ss_pred             ----------------CCccceEEEEeCCCCceEEcCCCCC-CccceeeEEc
Q psy2980          57 ----------------DGYHDSIECYDVDNDSWEIMSHLPS-ARSWLGCVPL   91 (92)
Q Consensus        57 ----------------~~~~~~~~~y~~~~~~W~~~~~~~~-~~~~~~~~~~   91 (92)
                                      ....+++++||+.+++|..++++|. +|..++++++
T Consensus       169 ~~~~i~~~~~~~~~~~~~~~~~v~~YD~~t~~W~~~~~~p~~~~~~~a~v~~  220 (376)
T PRK14131        169 PKDKINDAYFDKKPEDYFFNKEVLSYDPSTNQWKNAGESPFLGTAGSAVVIK  220 (376)
T ss_pred             hhhhhHHHHhcCChhhcCcCceEEEEECCCCeeeECCcCCCCCCCcceEEEE
Confidence                            0124789999999999999999996 6777777654


No 17 
>KOG0379|consensus
Probab=99.73  E-value=5.3e-17  Score=102.80  Aligned_cols=92  Identities=22%  Similarity=0.379  Sum_probs=79.7

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeC---CCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEc-
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKS---PMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIM-   76 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~---~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~-   76 (92)
                      +|||.+....+.++++.||+.|++|..+.   .+|.+|.+|++++.++++|++||........+++++||+.+.+|+++ 
T Consensus       127 lfGG~~~~~~~~~~l~~~d~~t~~W~~l~~~~~~P~~r~~Hs~~~~g~~l~vfGG~~~~~~~~ndl~i~d~~~~~W~~~~  206 (482)
T KOG0379|consen  127 LFGGTDKKYRNLNELHSLDLSTRTWSLLSPTGDPPPPRAGHSATVVGTKLVVFGGIGGTGDSLNDLHIYDLETSTWSELD  206 (482)
T ss_pred             EEccccCCCCChhheEeccCCCCcEEEecCcCCCCCCcccceEEEECCEEEEECCccCcccceeeeeeeccccccceecc
Confidence            58998754556678999999999998885   46889999999999999999999977555899999999999999987 


Q ss_pred             --CCCCCCccceeeEEcC
Q psy2980          77 --SHLPSARSWLGCVPLQ   92 (92)
Q Consensus        77 --~~~~~~~~~~~~~~~~   92 (92)
                        .+.|.||..|++++++
T Consensus       207 ~~g~~P~pR~gH~~~~~~  224 (482)
T KOG0379|consen  207 TQGEAPSPRYGHAMVVVG  224 (482)
T ss_pred             cCCCCCCCCCCceEEEEC
Confidence              4668899999998763


No 18 
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=99.72  E-value=8.4e-17  Score=97.51  Aligned_cols=77  Identities=16%  Similarity=0.295  Sum_probs=62.5

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeCCCC-CCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCC
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKSPMK-EKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHL   79 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~-~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~   79 (92)
                      |+||... ....+++++||+.+++|..++++| .+|..+.+++++++||++||.+.  ....++++||+.+++|+.+++|
T Consensus       128 v~GG~~~-~~~~~~v~~yd~~~~~W~~~~~~p~~~r~~~~~~~~~~~iYv~GG~~~--~~~~~~~~yd~~~~~W~~~~~~  204 (323)
T TIGR03548       128 VGGGNRN-GKPSNKSYLFNLETQEWFELPDFPGEPRVQPVCVKLQNELYVFGGGSN--IAYTDGYKYSPKKNQWQKVADP  204 (323)
T ss_pred             EEeCcCC-CccCceEEEEcCCCCCeeECCCCCCCCCCcceEEEECCEEEEEcCCCC--ccccceEEEecCCCeeEECCCC
Confidence            4788642 223566999999999999999887 47888888899999999999853  2345789999999999999876


Q ss_pred             C
Q psy2980          80 P   80 (92)
Q Consensus        80 ~   80 (92)
                      +
T Consensus       205 ~  205 (323)
T TIGR03548       205 T  205 (323)
T ss_pred             C
Confidence            3


No 19 
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=99.69  E-value=5.6e-16  Score=94.63  Aligned_cols=84  Identities=21%  Similarity=0.312  Sum_probs=66.8

Q ss_pred             CccccCCCCCCcceEEEeeC--CCCeEEeeCCCC-CCcccceEEEECCEEEEEcCcCCCC-----CccceEEEEeCCCCc
Q psy2980           1 MLRGWHASTENTNRVECYHI--AENTWEYKSPMK-EKRYRPGIAVIDGKIYVLGGEEGWD-----GYHDSIECYDVDNDS   72 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~--~~~~W~~~~~~~-~~~~~~~~~~~~~~iy~~GG~~~~~-----~~~~~~~~y~~~~~~   72 (92)
                      |+||..     .+.+++||+  .+++|..+++|| .+|..+++++++++||++||.....     ..++++++|||.+++
T Consensus        22 v~GG~~-----~~~~~~~d~~~~~~~W~~l~~~p~~~R~~~~~~~~~~~iYv~GG~~~~~~~~~~~~~~~v~~Yd~~~~~   96 (346)
T TIGR03547        22 VGLGSA-----GTSWYKLDLKKPSKGWQKIADFPGGPRNQAVAAAIDGKLYVFGGIGKANSEGSPQVFDDVYRYDPKKNS   96 (346)
T ss_pred             EEcccc-----CCeeEEEECCCCCCCceECCCCCCCCcccceEEEECCEEEEEeCCCCCCCCCcceecccEEEEECCCCE
Confidence            356753     245899996  578899999998 5899999999999999999985311     257899999999999


Q ss_pred             eEEcC-CCCCCccceeeE
Q psy2980          73 WEIMS-HLPSARSWLGCV   89 (92)
Q Consensus        73 W~~~~-~~~~~~~~~~~~   89 (92)
                      |+.++ ++|.+|..++++
T Consensus        97 W~~~~~~~p~~~~~~~~~  114 (346)
T TIGR03547        97 WQKLDTRSPVGLLGASGF  114 (346)
T ss_pred             EecCCCCCCCcccceeEE
Confidence            99997 566777666555


No 20 
>PF01344 Kelch_1:  Kelch motif;  InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=99.63  E-value=7.8e-16  Score=69.20  Aligned_cols=47  Identities=34%  Similarity=0.698  Sum_probs=43.1

Q ss_pred             CcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980          34 KRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP   80 (92)
Q Consensus        34 ~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~   80 (92)
                      +|..+++++++++||++||.......++++++||+.+++|+.+++||
T Consensus         1 pR~~~~~~~~~~~iyv~GG~~~~~~~~~~v~~yd~~~~~W~~~~~mp   47 (47)
T PF01344_consen    1 PRSGHAAVVVGNKIYVIGGYDGNNQPTNSVEVYDPETNTWEELPPMP   47 (47)
T ss_dssp             -BBSEEEEEETTEEEEEEEBESTSSBEEEEEEEETTTTEEEEEEEES
T ss_pred             CCccCEEEEECCEEEEEeeecccCceeeeEEEEeCCCCEEEEcCCCC
Confidence            57889999999999999999876788999999999999999999886


No 21 
>KOG0379|consensus
Probab=99.60  E-value=1.1e-14  Score=92.35  Aligned_cols=91  Identities=23%  Similarity=0.462  Sum_probs=78.0

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeC---CCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKS---PMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMS   77 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~---~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~   77 (92)
                      ||||.+......+++++||+++.+|.++.   ..|.||.+|++++.+++++++||.+.....+++++.+|..+.+|+.+.
T Consensus       178 vfGG~~~~~~~~ndl~i~d~~~~~W~~~~~~g~~P~pR~gH~~~~~~~~~~v~gG~~~~~~~l~D~~~ldl~~~~W~~~~  257 (482)
T KOG0379|consen  178 VFGGIGGTGDSLNDLHIYDLETSTWSELDTQGEAPSPRYGHAMVVVGNKLLVFGGGDDGDVYLNDVHILDLSTWEWKLLP  257 (482)
T ss_pred             EECCccCcccceeeeeeeccccccceecccCCCCCCCCCCceEEEECCeEEEEeccccCCceecceEeeecccceeeecc
Confidence            68999865556777999999999999985   457899999999999999999998744778999999999999999764


Q ss_pred             ---CCCCCccceeeEEc
Q psy2980          78 ---HLPSARSWLGCVPL   91 (92)
Q Consensus        78 ---~~~~~~~~~~~~~~   91 (92)
                         ..|.+|+.|..+..
T Consensus       258 ~~g~~p~~R~~h~~~~~  274 (482)
T KOG0379|consen  258 TGGDLPSPRSGHSLTVS  274 (482)
T ss_pred             ccCCCCCCcceeeeEEE
Confidence               56889999887753


No 22 
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=99.59  E-value=5.9e-15  Score=91.18  Aligned_cols=92  Identities=17%  Similarity=0.189  Sum_probs=67.2

Q ss_pred             CccccCCCCCCcceEE--EeeCCCCeEEeeCCCCCCccc--------ceEEEECCEEEEEcCcCCCC-------------
Q psy2980           1 MLRGWHASTENTNRVE--CYHIAENTWEYKSPMKEKRYR--------PGIAVIDGKIYVLGGEEGWD-------------   57 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~--~yd~~~~~W~~~~~~~~~~~~--------~~~~~~~~~iy~~GG~~~~~-------------   57 (92)
                      |+||.......+.+++  .||+++++|..+++||.+|..        +.+++++++||++||.....             
T Consensus       226 v~GG~~~~~~~~~~~~~~~~~~~~~~W~~~~~~p~~~~~~~~~~~~~~~a~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~  305 (376)
T PRK14131        226 LINGEIKPGLRTDAVKQGKFTGNNLKWQKLPDLPPAPGGSSQEGVAGAFAGYSNGVLLVAGGANFPGARENYQNGKLYAH  305 (376)
T ss_pred             EEeeeECCCcCChhheEEEecCCCcceeecCCCCCCCcCCcCCccceEeceeECCEEEEeeccCCCCChhhhhcCCcccc
Confidence            4788654334444454  557789999999999877632        22567899999999975311             


Q ss_pred             ---CccceEEEEeCCCCceEEcCCCCCCccceeeEEcC
Q psy2980          58 ---GYHDSIECYDVDNDSWEIMSHLPSARSWLGCVPLQ   92 (92)
Q Consensus        58 ---~~~~~~~~y~~~~~~W~~~~~~~~~~~~~~~~~~~   92 (92)
                         .....+++||+.+++|+.+++||.+|..+++++++
T Consensus       306 ~~~~~~~~~e~yd~~~~~W~~~~~lp~~r~~~~av~~~  343 (376)
T PRK14131        306 EGLKKSWSDEIYALVNGKWQKVGELPQGLAYGVSVSWN  343 (376)
T ss_pred             cCCcceeehheEEecCCcccccCcCCCCccceEEEEeC
Confidence               01235789999999999999999999988877653


No 23 
>KOG4693|consensus
Probab=99.57  E-value=4.8e-15  Score=87.09  Aligned_cols=91  Identities=16%  Similarity=0.293  Sum_probs=74.2

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeC---CCCCCcccceEEEECCEEEEEcCcCC-CCCccceEEEEeCCCCceEEc
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKS---PMKEKRYRPGIAVIDGKIYVLGGEEG-WDGYHDSIECYDVDNDSWEIM   76 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~---~~~~~~~~~~~~~~~~~iy~~GG~~~-~~~~~~~~~~y~~~~~~W~~~   76 (92)
                      |+||.+.....++.+++||++++.|.+..   ..|..|.+|++|++++..|+|||+.. ....+++++.+|..+-+|+.+
T Consensus        93 vWGGRND~egaCN~Ly~fDp~t~~W~~p~v~G~vPgaRDGHsAcV~gn~MyiFGGye~~a~~FS~d~h~ld~~TmtWr~~  172 (392)
T KOG4693|consen   93 VWGGRNDDEGACNLLYEFDPETNVWKKPEVEGFVPGARDGHSACVWGNQMYIFGGYEEDAQRFSQDTHVLDFATMTWREM  172 (392)
T ss_pred             EEcCccCcccccceeeeeccccccccccceeeecCCccCCceeeEECcEEEEecChHHHHHhhhccceeEeccceeeeeh
Confidence            57998865666777999999999997764   47889999999999999999999943 467889999999999999998


Q ss_pred             CCC---CCCccceeeEEc
Q psy2980          77 SHL---PSARSWLGCVPL   91 (92)
Q Consensus        77 ~~~---~~~~~~~~~~~~   91 (92)
                      ...   |.=|..|+++++
T Consensus       173 ~Tkg~PprwRDFH~a~~~  190 (392)
T KOG4693|consen  173 HTKGDPPRWRDFHTASVI  190 (392)
T ss_pred             hccCCCchhhhhhhhhhc
Confidence            654   333556666554


No 24 
>PF13415 Kelch_3:  Galactose oxidase, central domain
Probab=99.57  E-value=2.4e-14  Score=64.87  Aligned_cols=48  Identities=25%  Similarity=0.522  Sum_probs=43.5

Q ss_pred             CCEEEEEcCcC-CCCCccceEEEEeCCCCceEEcCCCCCCccceeeEEc
Q psy2980          44 DGKIYVLGGEE-GWDGYHDSIECYDVDNDSWEIMSHLPSARSWLGCVPL   91 (92)
Q Consensus        44 ~~~iy~~GG~~-~~~~~~~~~~~y~~~~~~W~~~~~~~~~~~~~~~~~~   91 (92)
                      ++++|++||.. .....+++++.||+.+++|++++++|.+|..|+++++
T Consensus         1 g~~~~vfGG~~~~~~~~~nd~~~~~~~~~~W~~~~~~P~~R~~h~~~~i   49 (49)
T PF13415_consen    1 GNKLYVFGGYDDDGGTRLNDVWVFDLDTNTWTRIGDLPPPRSGHTATVI   49 (49)
T ss_pred             CCEEEEECCcCCCCCCEecCEEEEECCCCEEEECCCCCCCccceEEEEC
Confidence            47899999997 3477899999999999999999999999999999875


No 25 
>KOG4693|consensus
Probab=99.56  E-value=2.2e-14  Score=84.40  Aligned_cols=91  Identities=22%  Similarity=0.456  Sum_probs=75.0

Q ss_pred             CccccCCC--------CCCcceEEEeeCCCCeEEeeCC---CCCCcccceEEEECCEEEEEcCcCC-CCCccceEEEEeC
Q psy2980           1 MLRGWHAS--------TENTNRVECYHIAENTWEYKSP---MKEKRYRPGIAVIDGKIYVLGGEEG-WDGYHDSIECYDV   68 (92)
Q Consensus         1 v~GG~~~~--------~~~~~~v~~yd~~~~~W~~~~~---~~~~~~~~~~~~~~~~iy~~GG~~~-~~~~~~~~~~y~~   68 (92)
                      ||||...+        ...++++..+|..|..|...+.   .|.+|..|+..++++++|+|||+++ .+.-.+++++|||
T Consensus       196 iFGGR~D~~gpfHs~~e~Yc~~i~~ld~~T~aW~r~p~~~~~P~GRRSHS~fvYng~~Y~FGGYng~ln~HfndLy~FdP  275 (392)
T KOG4693|consen  196 IFGGRSDESGPFHSIHEQYCDTIMALDLATGAWTRTPENTMKPGGRRSHSTFVYNGKMYMFGGYNGTLNVHFNDLYCFDP  275 (392)
T ss_pred             EeccccccCCCccchhhhhcceeEEEeccccccccCCCCCcCCCcccccceEEEcceEEEecccchhhhhhhcceeeccc
Confidence            57886532        2456789999999999988864   4788999999999999999999976 3567899999999


Q ss_pred             CCCceEEcC---CCCCCccceeeEEc
Q psy2980          69 DNDSWEIMS---HLPSARSWLGCVPL   91 (92)
Q Consensus        69 ~~~~W~~~~---~~~~~~~~~~~~~~   91 (92)
                      ++..|+.+.   .-|.+|+.+.+++.
T Consensus       276 ~t~~W~~I~~~Gk~P~aRRRqC~~v~  301 (392)
T KOG4693|consen  276 KTSMWSVISVRGKYPSARRRQCSVVS  301 (392)
T ss_pred             ccchheeeeccCCCCCcccceeEEEE
Confidence            999999885   45788888877765


No 26 
>PF07646 Kelch_2:  Kelch motif;  InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=99.51  E-value=1.3e-13  Score=62.40  Aligned_cols=47  Identities=30%  Similarity=0.630  Sum_probs=42.0

Q ss_pred             CcccceEEEECCEEEEEcCc--CCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980          34 KRYRPGIAVIDGKIYVLGGE--EGWDGYHDSIECYDVDNDSWEIMSHLP   80 (92)
Q Consensus        34 ~~~~~~~~~~~~~iy~~GG~--~~~~~~~~~~~~y~~~~~~W~~~~~~~   80 (92)
                      +|..|++++++++||++||+  .......++++.||+.+++|+.+++|+
T Consensus         1 ~r~~hs~~~~~~kiyv~GG~~~~~~~~~~~~v~~~d~~t~~W~~~~~~g   49 (49)
T PF07646_consen    1 PRYGHSAVVLDGKIYVFGGYGTDNGGSSSNDVWVFDTETNQWTELSPMG   49 (49)
T ss_pred             CccceEEEEECCEEEEECCcccCCCCcccceeEEEECCCCEEeecCCCC
Confidence            57889999999999999999  344678999999999999999998875


No 27 
>PF13418 Kelch_4:  Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=99.50  E-value=5.6e-14  Score=63.63  Aligned_cols=47  Identities=26%  Similarity=0.630  Sum_probs=31.8

Q ss_pred             CcccceEEEE-CCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980          34 KRYRPGIAVI-DGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP   80 (92)
Q Consensus        34 ~~~~~~~~~~-~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~   80 (92)
                      ||..|+++.. ++.||++||.+.....+++++.||+.+++|++++++|
T Consensus         1 pR~~h~~~~~~~~~i~v~GG~~~~~~~~~d~~~~d~~~~~W~~~~~~P   48 (49)
T PF13418_consen    1 PRYGHSAVSIGDNSIYVFGGRDSSGSPLNDLWIFDIETNTWTRLPSMP   48 (49)
T ss_dssp             --BS-EEEEE-TTEEEEE--EEE-TEE---EEEEETTTTEEEE--SS-
T ss_pred             CcceEEEEEEeCCeEEEECCCCCCCcccCCEEEEECCCCEEEECCCCC
Confidence            5888999988 5899999999875678999999999999999998887


No 28 
>smart00612 Kelch Kelch domain.
Probab=99.48  E-value=1.2e-13  Score=61.69  Aligned_cols=46  Identities=35%  Similarity=0.695  Sum_probs=41.0

Q ss_pred             EEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCCCCccceeeEEcC
Q psy2980          46 KIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLPSARSWLGCVPLQ   92 (92)
Q Consensus        46 ~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~~~~~~~~~~~~~   92 (92)
                      +||++||... ....+++++||+.+++|+.+++|+.+|..+++++++
T Consensus         1 ~iyv~GG~~~-~~~~~~v~~yd~~~~~W~~~~~~~~~r~~~~~~~~~   46 (47)
T smart00612        1 KIYVVGGFDG-GQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVIN   46 (47)
T ss_pred             CEEEEeCCCC-CceeeeEEEECCCCCeEccCCCCCCccccceEEEeC
Confidence            4899999865 567899999999999999999999999999988864


No 29 
>KOG1230|consensus
Probab=99.43  E-value=7.5e-13  Score=81.56  Aligned_cols=78  Identities=15%  Similarity=0.245  Sum_probs=66.3

Q ss_pred             ceEEEeeCCCCeEEeeC--CCCCCcccceEEEECCEEEEEcCcCC---CCCccceEEEEeCCCCceEEcCC-C--CCCcc
Q psy2980          13 NRVECYHIAENTWEYKS--PMKEKRYRPGIAVIDGKIYVLGGEEG---WDGYHDSIECYDVDNDSWEIMSH-L--PSARS   84 (92)
Q Consensus        13 ~~v~~yd~~~~~W~~~~--~~~~~~~~~~~~~~~~~iy~~GG~~~---~~~~~~~~~~y~~~~~~W~~~~~-~--~~~~~   84 (92)
                      +++|.||.++++|.++.  .-|.+|.+|.+++...+|++|||..+   ...+++++++||..+-+|.++.+ -  |.||+
T Consensus       154 kD~W~fd~~trkweql~~~g~PS~RSGHRMvawK~~lilFGGFhd~nr~y~YyNDvy~FdLdtykW~Klepsga~PtpRS  233 (521)
T KOG1230|consen  154 KDLWLFDLKTRKWEQLEFGGGPSPRSGHRMVAWKRQLILFGGFHDSNRDYIYYNDVYAFDLDTYKWSKLEPSGAGPTPRS  233 (521)
T ss_pred             hheeeeeeccchheeeccCCCCCCCccceeEEeeeeEEEEcceecCCCceEEeeeeEEEeccceeeeeccCCCCCCCCCC
Confidence            56999999999999995  35889999999999999999999843   34589999999999999999854 2  77888


Q ss_pred             ceeeEE
Q psy2980          85 WLGCVP   90 (92)
Q Consensus        85 ~~~~~~   90 (92)
                      ++.+.+
T Consensus       234 Gcq~~v  239 (521)
T KOG1230|consen  234 GCQFSV  239 (521)
T ss_pred             cceEEe
Confidence            776544


No 30 
>KOG1230|consensus
Probab=99.42  E-value=1e-12  Score=80.98  Aligned_cols=79  Identities=20%  Similarity=0.325  Sum_probs=64.9

Q ss_pred             ceEEEeeCCCCeEEee--CCCCCCcccceEEEEC-CEEEEEcCcCC-CCC----ccceEEEEeCCCCceEEc--CCCCCC
Q psy2980          13 NRVECYHIAENTWEYK--SPMKEKRYRPGIAVID-GKIYVLGGEEG-WDG----YHDSIECYDVDNDSWEIM--SHLPSA   82 (92)
Q Consensus        13 ~~v~~yd~~~~~W~~~--~~~~~~~~~~~~~~~~-~~iy~~GG~~~-~~~----~~~~~~~y~~~~~~W~~~--~~~~~~   82 (92)
                      ++++.||+++++|..+  |+.|.||..|.++++. +.+|++||.-. .++    -.++++.||..+++|+++  +.-|.+
T Consensus        98 ndLy~Yn~k~~eWkk~~spn~P~pRsshq~va~~s~~l~~fGGEfaSPnq~qF~HYkD~W~fd~~trkweql~~~g~PS~  177 (521)
T KOG1230|consen   98 NDLYSYNTKKNEWKKVVSPNAPPPRSSHQAVAVPSNILWLFGGEFASPNQEQFHHYKDLWLFDLKTRKWEQLEFGGGPSP  177 (521)
T ss_pred             eeeeEEeccccceeEeccCCCcCCCccceeEEeccCeEEEeccccCCcchhhhhhhhheeeeeeccchheeeccCCCCCC
Confidence            5699999999999888  4568899998888875 88999999732 122    368999999999999998  466899


Q ss_pred             ccceeeEEc
Q psy2980          83 RSWLGCVPL   91 (92)
Q Consensus        83 ~~~~~~~~~   91 (92)
                      |++|-+++-
T Consensus       178 RSGHRMvaw  186 (521)
T KOG1230|consen  178 RSGHRMVAW  186 (521)
T ss_pred             CccceeEEe
Confidence            999988763


No 31 
>smart00612 Kelch Kelch domain.
Probab=99.23  E-value=2.2e-11  Score=54.25  Aligned_cols=44  Identities=30%  Similarity=0.545  Sum_probs=36.7

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEECC
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVIDG   45 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~   45 (92)
                      |+||.+. ....+++++||+.+++|..+++|+.+|..++++++++
T Consensus         4 v~GG~~~-~~~~~~v~~yd~~~~~W~~~~~~~~~r~~~~~~~~~g   47 (47)
T smart00612        4 VVGGFDG-GQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING   47 (47)
T ss_pred             EEeCCCC-CceeeeEEEECCCCCeEccCCCCCCccccceEEEeCC
Confidence            5799864 3345679999999999999999999999998887764


No 32 
>PF13415 Kelch_3:  Galactose oxidase, central domain
Probab=99.23  E-value=2.5e-11  Score=54.89  Aligned_cols=43  Identities=21%  Similarity=0.360  Sum_probs=37.0

Q ss_pred             CccccC-CCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEE
Q psy2980           1 MLRGWH-ASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVI   43 (92)
Q Consensus         1 v~GG~~-~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~   43 (92)
                      |+||.+ ......+++++||+.+++|++++++|.+|..|+++++
T Consensus         6 vfGG~~~~~~~~~nd~~~~~~~~~~W~~~~~~P~~R~~h~~~~i   49 (49)
T PF13415_consen    6 VFGGYDDDGGTRLNDVWVFDLDTNTWTRIGDLPPPRSGHTATVI   49 (49)
T ss_pred             EECCcCCCCCCEecCEEEEECCCCEEEECCCCCCCccceEEEEC
Confidence            689988 3455667899999999999999999999999988763


No 33 
>KOG4152|consensus
Probab=99.18  E-value=1.2e-10  Score=74.03  Aligned_cols=90  Identities=22%  Similarity=0.324  Sum_probs=72.0

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeC---CCCCCcccceEEEECCEEEEEcCcC-------------CCCCccceEE
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKS---PMKEKRYRPGIAVIDGKIYVLGGEE-------------GWDGYHDSIE   64 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~---~~~~~~~~~~~~~~~~~iy~~GG~~-------------~~~~~~~~~~   64 (92)
                      |+||..+-.+  .++|.+|+++-+|.+..   -.|.||..|++.++++++|+|||.-             .+....+++-
T Consensus       220 vyGGM~G~RL--gDLW~Ldl~Tl~W~kp~~~G~~PlPRSLHsa~~IGnKMyvfGGWVPl~~~~~~~~~hekEWkCTssl~  297 (830)
T KOG4152|consen  220 VYGGMSGCRL--GDLWTLDLDTLTWNKPSLSGVAPLPRSLHSATTIGNKMYVFGGWVPLVMDDVKVATHEKEWKCTSSLA  297 (830)
T ss_pred             EEcccccccc--cceeEEecceeecccccccCCCCCCcccccceeecceeEEecceeeeeccccccccccceeeecccee
Confidence            4788765444  44999999999998874   3578999999999999999999961             1234678899


Q ss_pred             EEeCCCCceEEcC-------CCCCCccceeeEEcC
Q psy2980          65 CYDVDNDSWEIMS-------HLPSARSWLGCVPLQ   92 (92)
Q Consensus        65 ~y~~~~~~W~~~~-------~~~~~~~~~~~~~~~   92 (92)
                      ++|..+..|..+.       ..|.+|++|.+++++
T Consensus       298 clNldt~~W~tl~~d~~ed~tiPR~RAGHCAvAig  332 (830)
T KOG4152|consen  298 CLNLDTMAWETLLMDTLEDNTIPRARAGHCAVAIG  332 (830)
T ss_pred             eeeecchheeeeeeccccccccccccccceeEEec
Confidence            9999999999762       268899999888764


No 34 
>PF13964 Kelch_6:  Kelch motif
Probab=99.16  E-value=4.5e-11  Score=54.19  Aligned_cols=35  Identities=37%  Similarity=0.625  Sum_probs=30.1

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCc
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKSPMKEKR   35 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~   35 (92)
                      |+||.......++++++||+.+++|+++++||.+|
T Consensus        16 v~GG~~~~~~~~~~v~~yd~~t~~W~~~~~mp~pR   50 (50)
T PF13964_consen   16 VFGGYDNSGKYSNDVERYDPETNTWEQLPPMPTPR   50 (50)
T ss_pred             EECCCCCCCCccccEEEEcCCCCcEEECCCCCCCC
Confidence            58998754556777999999999999999999887


No 35 
>PF13854 Kelch_5:  Kelch motif
Probab=99.13  E-value=3.8e-10  Score=49.39  Aligned_cols=40  Identities=25%  Similarity=0.581  Sum_probs=35.5

Q ss_pred             CCCCcccceEEEECCEEEEEcCcCC-CCCccceEEEEeCCC
Q psy2980          31 MKEKRYRPGIAVIDGKIYVLGGEEG-WDGYHDSIECYDVDN   70 (92)
Q Consensus        31 ~~~~~~~~~~~~~~~~iy~~GG~~~-~~~~~~~~~~y~~~~   70 (92)
                      +|.+|..|++++++++||++||... ....+++++++|..+
T Consensus         1 ~P~~R~~hs~~~~~~~iyi~GG~~~~~~~~~~d~~~l~l~s   41 (42)
T PF13854_consen    1 IPSPRYGHSAVVVGNNIYIFGGYSGNNNSYSNDLYVLDLPS   41 (42)
T ss_pred             CCCCccceEEEEECCEEEEEcCccCCCCCEECcEEEEECCC
Confidence            4788999999999999999999983 577899999999865


No 36 
>PLN02772 guanylate kinase
Probab=98.96  E-value=6.4e-09  Score=64.68  Aligned_cols=60  Identities=20%  Similarity=0.283  Sum_probs=52.3

Q ss_pred             CCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEc---CCCCCCccceeeEEcC
Q psy2980          33 EKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIM---SHLPSARSWLGCVPLQ   92 (92)
Q Consensus        33 ~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~---~~~~~~~~~~~~~~~~   92 (92)
                      .++..++++++++++|++||.++.....+.+++||+.+.+|..-   .+.|.+|.+|++|+++
T Consensus        23 ~~~~~~tav~igdk~yv~GG~~d~~~~~~~v~i~D~~t~~W~~P~V~G~~P~~r~GhSa~v~~   85 (398)
T PLN02772         23 KPKNRETSVTIGDKTYVIGGNHEGNTLSIGVQILDKITNNWVSPIVLGTGPKPCKGYSAVVLN   85 (398)
T ss_pred             CCCCcceeEEECCEEEEEcccCCCccccceEEEEECCCCcEecccccCCCCCCCCcceEEEEC
Confidence            46788999999999999999887555789999999999999864   5779999999999873


No 37 
>COG3055 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=98.92  E-value=3.4e-09  Score=64.57  Aligned_cols=90  Identities=19%  Similarity=0.320  Sum_probs=65.5

Q ss_pred             CccccCCCC----CCcceEEEeeCCCCeEEeeCC-CCCCcccceEEEECC-EEEEEcCcCCC--C---------------
Q psy2980           1 MLRGWHAST----ENTNRVECYHIAENTWEYKSP-MKEKRYRPGIAVIDG-KIYVLGGEEGW--D---------------   57 (92)
Q Consensus         1 v~GG~~~~~----~~~~~v~~yd~~~~~W~~~~~-~~~~~~~~~~~~~~~-~iy~~GG~~~~--~---------------   57 (92)
                      +|||.....    ...+++++|||.+|+|+++.. .|.....++++.+++ +||++||++..  +               
T Consensus        97 vFgG~Gk~~~~~~~~~nd~Y~y~p~~nsW~kl~t~sP~gl~G~~~~~~~~~~i~f~GGvn~~if~~yf~dv~~a~~d~~~  176 (381)
T COG3055          97 VFGGYGKSVSSSPQVFNDAYRYDPSTNSWHKLDTRSPTGLVGASTFSLNGTKIYFFGGVNQNIFNGYFEDVGAAGKDKEA  176 (381)
T ss_pred             EeeccccCCCCCceEeeeeEEecCCCChhheeccccccccccceeEecCCceEEEEccccHHhhhhhHHhhhhhcccHHH
Confidence            467765222    233669999999999999974 577778888888887 89999998521  0               


Q ss_pred             ----------------CccceEEEEeCCCCceEEcCCCCCCccceeeEE
Q psy2980          58 ----------------GYHDSIECYDVDNDSWEIMSHLPSARSWLGCVP   90 (92)
Q Consensus        58 ----------------~~~~~~~~y~~~~~~W~~~~~~~~~~~~~~~~~   90 (92)
                                      .....+..|+|++++|+.+...|......++++
T Consensus       177 ~~~i~~~yf~~~~~dy~~n~ev~sy~p~~n~W~~~G~~pf~~~aGsa~~  225 (381)
T COG3055         177 VDKIIAHYFDKKAEDYFFNKEVLSYDPSTNQWRNLGENPFYGNAGSAVV  225 (381)
T ss_pred             HHHHHHHHhCCCHHHhcccccccccccccchhhhcCcCcccCccCccee
Confidence                            134678889999999999998776544434433


No 38 
>PF01344 Kelch_1:  Kelch motif;  InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=98.85  E-value=1.5e-09  Score=48.42  Aligned_cols=32  Identities=38%  Similarity=0.696  Sum_probs=28.1

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeCCCC
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKSPMK   32 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~   32 (92)
                      |+||.+.....++++++||+.+++|..+++||
T Consensus        16 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~mp   47 (47)
T PF01344_consen   16 VIGGYDGNNQPTNSVEVYDPETNTWEELPPMP   47 (47)
T ss_dssp             EEEEBESTSSBEEEEEEEETTTTEEEEEEEES
T ss_pred             EEeeecccCceeeeEEEEeCCCCEEEEcCCCC
Confidence            58999876677888999999999999999876


No 39 
>KOG4152|consensus
Probab=98.84  E-value=1.6e-09  Score=69.06  Aligned_cols=89  Identities=16%  Similarity=0.253  Sum_probs=69.4

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEee---CCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYK---SPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMS   77 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~---~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~   77 (92)
                      ||||-+  ....+++..||..+++|..-   .++|.+...++.+..+.+||+|||.-..+.+.+++|-..-..-.|+++.
T Consensus        47 iFGGGN--EGiiDELHvYNTatnqWf~PavrGDiPpgcAA~GfvcdGtrilvFGGMvEYGkYsNdLYELQasRWeWkrlk  124 (830)
T KOG4152|consen   47 IFGGGN--EGIIDELHVYNTATNQWFAPAVRGDIPPGCAAFGFVCDGTRILVFGGMVEYGKYSNDLYELQASRWEWKRLK  124 (830)
T ss_pred             EecCCc--ccchhhhhhhccccceeecchhcCCCCCchhhcceEecCceEEEEccEeeeccccchHHHhhhhhhhHhhcC
Confidence            578854  33556799999999999654   4788899999999999999999998777888999866655555677763


Q ss_pred             -------CCCCCccceeeEEc
Q psy2980          78 -------HLPSARSWLGCVPL   91 (92)
Q Consensus        78 -------~~~~~~~~~~~~~~   91 (92)
                             +.|-+|.+|+...+
T Consensus       125 p~~p~nG~pPCPRlGHSFsl~  145 (830)
T KOG4152|consen  125 PKTPKNGPPPCPRLGHSFSLV  145 (830)
T ss_pred             CCCCCCCCCCCCccCceeEEe
Confidence                   35778999987654


No 40 
>COG3055 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=98.74  E-value=4.5e-08  Score=59.84  Aligned_cols=75  Identities=21%  Similarity=0.348  Sum_probs=55.6

Q ss_pred             EEEeeCCC--CeEEeeCCCCC-CcccceEEEECCEEEEEcCcCCC----CCccceEEEEeCCCCceEEcCCC-CCCccce
Q psy2980          15 VECYHIAE--NTWEYKSPMKE-KRYRPGIAVIDGKIYVLGGEEGW----DGYHDSIECYDVDNDSWEIMSHL-PSARSWL   86 (92)
Q Consensus        15 v~~yd~~~--~~W~~~~~~~~-~~~~~~~~~~~~~iy~~GG~~~~----~~~~~~~~~y~~~~~~W~~~~~~-~~~~~~~   86 (92)
                      .+..|++.  ..|.++...|- +|-....++++++||+|||....    .+..+++++|||++++|+++... |..-..+
T Consensus        60 fy~ldL~~~~k~W~~~a~FpG~~rnqa~~a~~~~kLyvFgG~Gk~~~~~~~~~nd~Y~y~p~~nsW~kl~t~sP~gl~G~  139 (381)
T COG3055          60 FYVLDLKKPGKGWTKIADFPGGARNQAVAAVIGGKLYVFGGYGKSVSSSPQVFNDAYRYDPSTNSWHKLDTRSPTGLVGA  139 (381)
T ss_pred             ceehhhhcCCCCceEcccCCCcccccchheeeCCeEEEeeccccCCCCCceEeeeeEEecCCCChhheeccccccccccc
Confidence            56666653  47999998874 67778888899999999998432    34679999999999999998533 4443333


Q ss_pred             eeE
Q psy2980          87 GCV   89 (92)
Q Consensus        87 ~~~   89 (92)
                      .++
T Consensus       140 ~~~  142 (381)
T COG3055         140 STF  142 (381)
T ss_pred             eeE
Confidence            333


No 41 
>PF13418 Kelch_4:  Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=98.69  E-value=7.4e-09  Score=46.60  Aligned_cols=32  Identities=25%  Similarity=0.402  Sum_probs=20.1

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeCCCC
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKSPMK   32 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~   32 (92)
                      |+||.+......++++.||+.+++|.+++++|
T Consensus        17 v~GG~~~~~~~~~d~~~~d~~~~~W~~~~~~P   48 (49)
T PF13418_consen   17 VFGGRDSSGSPLNDLWIFDIETNTWTRLPSMP   48 (49)
T ss_dssp             EE--EEE-TEE---EEEEETTTTEEEE--SS-
T ss_pred             EECCCCCCCcccCCEEEEECCCCEEEECCCCC
Confidence            58998866567777999999999999998776


No 42 
>PF07646 Kelch_2:  Kelch motif;  InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=98.46  E-value=3.2e-07  Score=41.22  Aligned_cols=32  Identities=25%  Similarity=0.509  Sum_probs=26.1

Q ss_pred             Ccccc--CCCCCCcceEEEeeCCCCeEEeeCCCC
Q psy2980           1 MLRGW--HASTENTNRVECYHIAENTWEYKSPMK   32 (92)
Q Consensus         1 v~GG~--~~~~~~~~~v~~yd~~~~~W~~~~~~~   32 (92)
                      |+||.  +.....++++++||+++++|..++.++
T Consensus        16 v~GG~~~~~~~~~~~~v~~~d~~t~~W~~~~~~g   49 (49)
T PF07646_consen   16 VFGGYGTDNGGSSSNDVWVFDTETNQWTELSPMG   49 (49)
T ss_pred             EECCcccCCCCcccceeEEEECCCCEEeecCCCC
Confidence            58998  344566678999999999999998764


No 43 
>PLN02772 guanylate kinase
Probab=98.43  E-value=1.8e-06  Score=54.07  Aligned_cols=54  Identities=15%  Similarity=0.165  Sum_probs=43.6

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeC---CCCCCcccceEEEE-CCEEEEEcCcC
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKS---PMKEKRYRPGIAVI-DGKIYVLGGEE   54 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~---~~~~~~~~~~~~~~-~~~iy~~GG~~   54 (92)
                      ||||.+.....++.+++||..+++|..-.   ..|.+|.+|+++++ +++|+++++..
T Consensus        39 v~GG~~d~~~~~~~v~i~D~~t~~W~~P~V~G~~P~~r~GhSa~v~~~~rilv~~~~~   96 (398)
T PLN02772         39 VIGGNHEGNTLSIGVQILDKITNNWVSPIVLGTGPKPCKGYSAVVLNKDRILVIKKGS   96 (398)
T ss_pred             EEcccCCCccccceEEEEECCCCcEecccccCCCCCCCCcceEEEECCceEEEEeCCC
Confidence            58887654446778999999999997764   57899999999998 57888888653


No 44 
>KOG2437|consensus
Probab=98.28  E-value=2.5e-07  Score=59.19  Aligned_cols=90  Identities=18%  Similarity=0.316  Sum_probs=66.0

Q ss_pred             CccccCCCCCCcceEEEeeCCCCeEEeeC---CCCCCcccceEEEE--CCEEEEEcCcCC-----CCCccceEEEEeCCC
Q psy2980           1 MLRGWHASTENTNRVECYHIAENTWEYKS---PMKEKRYRPGIAVI--DGKIYVLGGEEG-----WDGYHDSIECYDVDN   70 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~W~~~~---~~~~~~~~~~~~~~--~~~iy~~GG~~~-----~~~~~~~~~~y~~~~   70 (92)
                      +.||+++- ....++|.|+...+.|..+-   ..|-.|..|..+..  +.++|++|-+-+     .....++++.||..+
T Consensus       277 LYGGWdG~-~~l~DFW~Y~v~e~~W~~iN~~t~~PG~RsCHRMVid~S~~KLYLlG~Y~~sS~r~~~s~RsDfW~FDi~~  355 (723)
T KOG2437|consen  277 LYGGWDGT-QDLADFWAYSVKENQWTCINRDTEGPGARSCHRMVIDISRRKLYLLGRYLDSSVRNSKSLRSDFWRFDIDT  355 (723)
T ss_pred             EecCcccc-hhHHHHHhhcCCcceeEEeecCCCCCcchhhhhhhhhhhHhHHhhhhhccccccccccccccceEEEecCC
Confidence            35888742 23344999999999998883   46788888877775  458999997732     124578999999999


Q ss_pred             CceEEcCC------CCCCccceeeEEc
Q psy2980          71 DSWEIMSH------LPSARSWLGCVPL   91 (92)
Q Consensus        71 ~~W~~~~~------~~~~~~~~~~~~~   91 (92)
                      +.|.-+.-      -|...+.|.+++.
T Consensus       356 ~~W~~ls~dt~~dGGP~~vfDHqM~Vd  382 (723)
T KOG2437|consen  356 NTWMLLSEDTAADGGPKLVFDHQMCVD  382 (723)
T ss_pred             ceeEEecccccccCCcceeecceeeEe
Confidence            99998742      2556677777764


No 45 
>KOG2437|consensus
Probab=97.74  E-value=0.00013  Score=47.27  Aligned_cols=65  Identities=20%  Similarity=0.376  Sum_probs=51.1

Q ss_pred             ceEEEeeCCCCeEEeeCC------CCCCcccceEEEECCE--EEEEcCcCC--CCCccceEEEEeCCCCceEEcC
Q psy2980          13 NRVECYHIAENTWEYKSP------MKEKRYRPGIAVIDGK--IYVLGGEEG--WDGYHDSIECYDVDNDSWEIMS   77 (92)
Q Consensus        13 ~~v~~yd~~~~~W~~~~~------~~~~~~~~~~~~~~~~--iy~~GG~~~--~~~~~~~~~~y~~~~~~W~~~~   77 (92)
                      +++|.||..++.|..+.-      =|...+.|..++..++  ||++||+.-  +......++.||.....|+.+.
T Consensus       346 sDfW~FDi~~~~W~~ls~dt~~dGGP~~vfDHqM~Vd~~k~~iyVfGGr~~~~~e~~f~GLYaf~~~~~~w~~l~  420 (723)
T KOG2437|consen  346 SDFWRFDIDTNTWMLLSEDTAADGGPKLVFDHQMCVDSEKHMIYVFGGRILTCNEPQFSGLYAFNCQCQTWKLLR  420 (723)
T ss_pred             cceEEEecCCceeEEecccccccCCcceeecceeeEecCcceEEEecCeeccCCCccccceEEEecCCccHHHHH
Confidence            479999999999988842      1444577888888776  999999842  2346788999999999998764


No 46 
>TIGR01640 F_box_assoc_1 F-box protein interaction domain. This model describes a large family of plant domains, with several hundred members in Arabidopsis thaliana. Most examples are found C-terminal to an F-box (pfam00646), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes. Some members have two copies of this domain.
Probab=97.34  E-value=0.004  Score=36.41  Aligned_cols=68  Identities=18%  Similarity=0.300  Sum_probs=44.7

Q ss_pred             ceEEEeeCCCCeEEeeCCCCCC-cccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCCC
Q psy2980          13 NRVECYHIAENTWEYKSPMKEK-RYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLPS   81 (92)
Q Consensus        13 ~~v~~yd~~~~~W~~~~~~~~~-~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~~   81 (92)
                      ..+++|+..+++|+.+...+.. ......+.+++.+|-+..... ......+..||..+++|+...++|.
T Consensus        70 ~~~~Vys~~~~~Wr~~~~~~~~~~~~~~~v~~~G~lyw~~~~~~-~~~~~~IvsFDl~~E~f~~~i~~P~  138 (230)
T TIGR01640        70 SEHQVYTLGSNSWRTIECSPPHHPLKSRGVCINGVLYYLAYTLK-TNPDYFIVSFDVSSERFKEFIPLPC  138 (230)
T ss_pred             ccEEEEEeCCCCccccccCCCCccccCCeEEECCEEEEEEEECC-CCCcEEEEEEEcccceEeeeeecCc
Confidence            4689999999999998743321 112226678888888764321 1112379999999999996224443


No 47 
>PF12768 Rax2:  Cortical protein marker for cell polarity
Probab=97.05  E-value=0.0067  Score=36.92  Aligned_cols=76  Identities=13%  Similarity=0.209  Sum_probs=51.5

Q ss_pred             ccccC--CCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEEE-CCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCC
Q psy2980           2 LRGWH--ASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVI-DGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSH   78 (92)
Q Consensus         2 ~GG~~--~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~   78 (92)
                      +||--  ...+++..+..||+.+.+|.....--.+.. ..+... ++++|+.|-..........+-.||..+.+|..+..
T Consensus         3 VGG~F~~aGsL~C~~lC~yd~~~~qW~~~g~~i~G~V-~~l~~~~~~~Llv~G~ft~~~~~~~~la~yd~~~~~w~~~~~   81 (281)
T PF12768_consen    3 VGGSFTSAGSLPCPGLCLYDTDNSQWSSPGNGISGTV-TDLQWASNNQLLVGGNFTLNGTNSSNLATYDFKNQTWSSLGG   81 (281)
T ss_pred             EeeecCCCCCcCCCEEEEEECCCCEeecCCCCceEEE-EEEEEecCCEEEEEEeeEECCCCceeEEEEecCCCeeeecCC
Confidence            66733  233467789999999999999875322221 222223 67888887554323346778999999999988865


No 48 
>PF07250 Glyoxal_oxid_N:  Glyoxal oxidase N-terminus;  InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium [].
Probab=96.15  E-value=0.044  Score=32.81  Aligned_cols=74  Identities=16%  Similarity=0.243  Sum_probs=48.0

Q ss_pred             eEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCC----CceEEcC-CCCCCccceee
Q psy2980          14 RVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDN----DSWEIMS-HLPSARSWLGC   88 (92)
Q Consensus        14 ~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~----~~W~~~~-~~~~~~~~~~~   88 (92)
                      .-..||+.++++..+.-..-.-....+..-++.+.+.||..+   -.+.+..|+|..    ..|.+.. .|..+|=..++
T Consensus        47 ~s~~yD~~tn~~rpl~v~td~FCSgg~~L~dG~ll~tGG~~~---G~~~ir~~~p~~~~~~~~w~e~~~~m~~~RWYpT~  123 (243)
T PF07250_consen   47 HSVEYDPNTNTFRPLTVQTDTFCSGGAFLPDGRLLQTGGDND---GNKAIRIFTPCTSDGTCDWTESPNDMQSGRWYPTA  123 (243)
T ss_pred             EEEEEecCCCcEEeccCCCCCcccCcCCCCCCCEEEeCCCCc---cccceEEEecCCCCCCCCceECcccccCCCccccc
Confidence            357899999999888643222222223334778999999743   245677788865    5798875 57777766555


Q ss_pred             EE
Q psy2980          89 VP   90 (92)
Q Consensus        89 ~~   90 (92)
                      ..
T Consensus       124 ~~  125 (243)
T PF07250_consen  124 TT  125 (243)
T ss_pred             eE
Confidence            44


No 49 
>PF07893 DUF1668:  Protein of unknown function (DUF1668);  InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function. 
Probab=95.85  E-value=0.11  Score=32.50  Aligned_cols=71  Identities=15%  Similarity=0.222  Sum_probs=46.3

Q ss_pred             eEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCc----cceEEE--EeC--------CCCceEEcCCC
Q psy2980          14 RVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGY----HDSIEC--YDV--------DNDSWEIMSHL   79 (92)
Q Consensus        14 ~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~----~~~~~~--y~~--------~~~~W~~~~~~   79 (92)
                      ....||+.+..-...|.+..+.....++.+++.||++.........    ...++.  |++        ..-.|+.+++.
T Consensus        87 ~t~vyDt~t~av~~~P~l~~pk~~pisv~VG~~LY~m~~~~~~~~~~~~~~~~FE~l~~~~~~~~~~~~~~w~W~~LP~P  166 (342)
T PF07893_consen   87 RTLVYDTDTRAVATGPRLHSPKRCPISVSVGDKLYAMDRSPFPEPAGRPDFPCFEALVYRPPPDDPSPEESWSWRSLPPP  166 (342)
T ss_pred             CeEEEECCCCeEeccCCCCCCCcceEEEEeCCeEEEeeccCccccccCccceeEEEeccccccccccCCCcceEEcCCCC
Confidence            4789999999988888887777666677789999999876431111    013333  442        22258888776


Q ss_pred             CCCcc
Q psy2980          80 PSARS   84 (92)
Q Consensus        80 ~~~~~   84 (92)
                      |..+.
T Consensus       167 Pf~~~  171 (342)
T PF07893_consen  167 PFVRD  171 (342)
T ss_pred             Ccccc
Confidence            65443


No 50 
>PF07250 Glyoxal_oxid_N:  Glyoxal oxidase N-terminus;  InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium [].
Probab=95.62  E-value=0.039  Score=32.99  Aligned_cols=50  Identities=18%  Similarity=0.479  Sum_probs=37.4

Q ss_pred             ccccCCCCCCcceEEEeeCCC----CeEEeeC-CCCCCcccceEEEE-CCEEEEEcCcC
Q psy2980           2 LRGWHASTENTNRVECYHIAE----NTWEYKS-PMKEKRYRPGIAVI-DGKIYVLGGEE   54 (92)
Q Consensus         2 ~GG~~~~~~~~~~v~~yd~~~----~~W~~~~-~~~~~~~~~~~~~~-~~~iy~~GG~~   54 (92)
                      .||... ...  .+..|++.+    ..|.+.+ .|..+|...++..+ +++++++||..
T Consensus        83 tGG~~~-G~~--~ir~~~p~~~~~~~~w~e~~~~m~~~RWYpT~~~L~DG~vlIvGG~~  138 (243)
T PF07250_consen   83 TGGDND-GNK--AIRIFTPCTSDGTCDWTESPNDMQSGRWYPTATTLPDGRVLIVGGSN  138 (243)
T ss_pred             eCCCCc-ccc--ceEEEecCCCCCCCCceECcccccCCCccccceECCCCCEEEEeCcC
Confidence            566642 333  378888876    5798876 58999999888886 67899999974


No 51 
>TIGR01640 F_box_assoc_1 F-box protein interaction domain. This model describes a large family of plant domains, with several hundred members in Arabidopsis thaliana. Most examples are found C-terminal to an F-box (pfam00646), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes. Some members have two copies of this domain.
Probab=95.09  E-value=0.33  Score=28.40  Aligned_cols=65  Identities=14%  Similarity=0.301  Sum_probs=39.0

Q ss_pred             ceEEEeeCCCCeEEeeCCCCCCcccc----eEEEEC----C-EEEEEcCcCCCCCccceEEEEeCCCCceEEcCC
Q psy2980          13 NRVECYHIAENTWEYKSPMKEKRYRP----GIAVID----G-KIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSH   78 (92)
Q Consensus        13 ~~v~~yd~~~~~W~~~~~~~~~~~~~----~~~~~~----~-~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~   78 (92)
                      ..+..+||.|++|..+|.++.++...    ....++    + ++..+..... ......++.|+..++.|+.+..
T Consensus        14 ~~~~V~NP~T~~~~~LP~~~~~~~~~~~~~~~~G~d~~~~~YKVv~~~~~~~-~~~~~~~~Vys~~~~~Wr~~~~   87 (230)
T TIGR01640        14 KRLVVWNPSTGQSRWLPTPKSRRSNKESDTYFLGYDPIEKQYKVLCFSDRSG-NRNQSEHQVYTLGSNSWRTIEC   87 (230)
T ss_pred             CcEEEECCCCCCEEecCCCCCcccccccceEEEeecccCCcEEEEEEEeecC-CCCCccEEEEEeCCCCcccccc
Confidence            45899999999999998654421111    111122    1 2333322111 1234578899999999999864


No 52 
>PF03089 RAG2:  Recombination activating protein 2;  InterPro: IPR004321 The variable portion of the genes encoding immunoglobulins and T cell receptors are assembled from component V, D, and J DNA segments by a site-specific recombination reaction termed V(D)J recombination. V(D)J recombination is targeted to specific sites on the chromosome by recombination signal sequences (RSSs) that flank antigen receptor gene segments. The RSS consists of a conserved heptamer (consensus, 5'-CACAGTG-3') and nonamer (consensus, 5'-ACAAAAACC-3') separated by a spacer of either 12 or 23 bp. Efficient recombination occurs between a 12-RSS and a 23-RSS, a restriction known as the 12/23 rule. V(D)J recombination can be divided into two phases, DNA cleavage and DNA joining. DNA cleavage requires two lymphocyte-specific factors, the products of the recombination activating genes, RAG1 and RAG2, which together recognise the RSSs and create double strand breaks at the RSS-coding segment junctions []. RAG-mediated DNA cleavage occurs in a synaptic complex termed the paired complex, which is constituted from two distinct RSS-RAG complexes, a 12-SC and a 23-SC (where SC stands for signal complex). The DNA cleavage reaction involves two distinct enzymatic steps, initial nicking that creates a 3'-OH between a coding segment and its RSS, followed by hairpin formation in which the newly created 3'-OH attacks a phosphodiester bond on the opposite DNA strand. This generates a blunt, 5' phosphorylated signal end containing all of the RSS elements, and a covalently sealed hairpin coding end.  The second phase of V(D)J recombination, in which broken DNA fragments are processed and joined, is less well characterised. Signal ends are typically joined precisely to form a signal joint, whereas joining of the coding ends requires the hairpin structure to be opened and typically involves nucleotide addition and deletion before formation of the coding joint. The factors involved in these processes include ubiquitously expressed proteins involved in the repair of DNA double strand breaks by nonhomologous end joining, terminal deoxynucleotidyl transferase, and Artemis protein. In addition to their critical roles in RSS recognition and DNA cleavage, the RAG proteins may perform two distinct types of functions in the postcleavage phase of V(D)J. A structural function has been inferred from the finding that, after DNA cleavage in vitro, the DNA ends remain associated with the RAG proteins in a "four end" complex known as the cleaved signal complex. After release of the coding ends in vitro, and after coding joint formation in vivo, the RAG proteins remain in a stable signal end complex (SEC) containing the two signal ends. These postcleavage complexes may serve as essential scaffolds for the second phase of the reaction, with the RAG proteins acting to organise the DNA processing and joining events.  The second type of RAG protein-mediated postcleavage activity is the catalysis of phosphodiester bond hydrolysis and strand transfer reactions. The RAG proteins are capable of opening hairpin coding ends in vitro. The RAG proteins also show 3' flap endonuclease activity that may contribute to coding end processing/joining and can utilise the 3' OH group on the signal ends to attack hairpin coding ends (forming hybrid or open/shut joints) or virtually any DNA duplex (forming a transposition product).; GO: 0003677 DNA binding, 0006310 DNA recombination, 0005634 nucleus
Probab=93.09  E-value=0.37  Score=29.79  Aligned_cols=54  Identities=19%  Similarity=0.330  Sum_probs=34.6

Q ss_pred             CccccCCCCCCcceEEEeeCCCCe--------EE---eeCCCCCCcccceEEEEC--C--EEEEEcCcC
Q psy2980           1 MLRGWHASTENTNRVECYHIAENT--------WE---YKSPMKEKRYRPGIAVID--G--KIYVLGGEE   54 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~--------W~---~~~~~~~~~~~~~~~~~~--~--~iy~~GG~~   54 (92)
                      |-||...+..-+++++........        ..   .+..+|.+|++|++-++.  +  ...+|||..
T Consensus        43 IHGGrTPNNElS~~LY~ls~~s~~cNkK~tl~C~EKeLvGdvP~aRYGHt~~vV~SrGKta~VlFGGRS  111 (337)
T PF03089_consen   43 IHGGRTPNNELSSSLYILSVDSRGCNKKVTLCCQEKELVGDVPEARYGHTINVVHSRGKTACVLFGGRS  111 (337)
T ss_pred             ecCCcCCCcccccceEEEEeecCCCCceeEEEEecceecCCCCcccccceEEEEEECCcEEEEEECCcc
Confidence            457877555556667776654332        21   125689999999887753  2  378889874


No 53 
>PF07893 DUF1668:  Protein of unknown function (DUF1668);  InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function. 
Probab=93.02  E-value=0.84  Score=28.74  Aligned_cols=54  Identities=24%  Similarity=0.409  Sum_probs=34.8

Q ss_pred             CCeEEeeCCCCCCccc-------ceEEEE-CCEEEEE-cCcCCCCCccceEEEEeCCCCceEEcC--CCCC
Q psy2980          22 ENTWEYKSPMKEKRYR-------PGIAVI-DGKIYVL-GGEEGWDGYHDSIECYDVDNDSWEIMS--HLPS   81 (92)
Q Consensus        22 ~~~W~~~~~~~~~~~~-------~~~~~~-~~~iy~~-GG~~~~~~~~~~~~~y~~~~~~W~~~~--~~~~   81 (92)
                      .-.|+.+|++|.....       .+-+++ +..||+- -+.      ....++||+.+.+|++..  .||+
T Consensus       157 ~w~W~~LP~PPf~~~~~~~~~~i~sYavv~g~~I~vS~~~~------~~GTysfDt~~~~W~~~GdW~LPF  221 (342)
T PF07893_consen  157 SWSWRSLPPPPFVRDRRYSDYRITSYAVVDGRTIFVSVNGR------RWGTYSFDTESHEWRKHGDWMLPF  221 (342)
T ss_pred             cceEEcCCCCCccccCCcccceEEEEEEecCCeEEEEecCC------ceEEEEEEcCCcceeeccceecCc
Confidence            3368999876654432       244455 6667773 221      135899999999999986  4553


No 54 
>PF08268 FBA_3:  F-box associated domain;  InterPro: IPR013187 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=91.77  E-value=1.2  Score=23.80  Aligned_cols=66  Identities=9%  Similarity=0.186  Sum_probs=42.3

Q ss_pred             CcceEEEeeCCCCeEEeeCCC---CCCcccceEEEECCEEEEEcCcCCCCCccceEEEE-eCCCCceEEc
Q psy2980          11 NTNRVECYHIAENTWEYKSPM---KEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECY-DVDNDSWEIM   76 (92)
Q Consensus        11 ~~~~v~~yd~~~~~W~~~~~~---~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y-~~~~~~W~~~   76 (92)
                      ....+.+||..+.+|..++.+   ........++.++++|-++.-........-+++.. |-.+.+|.+.
T Consensus        18 ~~~~IvsFDv~~E~f~~i~~P~~~~~~~~~~~L~~~~G~L~~v~~~~~~~~~~~~iWvLeD~~k~~Wsk~   87 (129)
T PF08268_consen   18 DNNVIVSFDVRSEKFRFIKLPEDPYSSDCSSTLIEYKGKLALVSYNDQGEPDSIDIWVLEDYEKQEWSKK   87 (129)
T ss_pred             CCcEEEEEEcCCceEEEEEeeeeeccccCccEEEEeCCeEEEEEecCCCCcceEEEEEeeccccceEEEE
Confidence            345699999999999988653   23445567778899987775332211112345544 3456789865


No 55 
>PF05096 Glu_cyclase_2:  Glutamine cyclotransferase;  InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=88.64  E-value=4  Score=25.04  Aligned_cols=52  Identities=15%  Similarity=0.291  Sum_probs=37.3

Q ss_pred             ceEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCC
Q psy2980          13 NRVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDN   70 (92)
Q Consensus        13 ~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~   70 (92)
                      +.+..||+.+++-....++|..-.+=+++.++++||..--.      ....+.||+.+
T Consensus        68 S~l~~~d~~tg~~~~~~~l~~~~FgEGit~~~d~l~qLTWk------~~~~f~yd~~t  119 (264)
T PF05096_consen   68 SSLRKVDLETGKVLQSVPLPPRYFGEGITILGDKLYQLTWK------EGTGFVYDPNT  119 (264)
T ss_dssp             EEEEEEETTTSSEEEEEE-TTT--EEEEEEETTEEEEEESS------SSEEEEEETTT
T ss_pred             EEEEEEECCCCcEEEEEECCccccceeEEEECCEEEEEEec------CCeEEEEcccc
Confidence            46999999999877666777766777899999999998543      34566677653


No 56 
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=88.34  E-value=2.8  Score=26.17  Aligned_cols=59  Identities=8%  Similarity=0.203  Sum_probs=38.9

Q ss_pred             ceEEEeeCCCCeEEeeCCC-CCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980          13 NRVECYHIAENTWEYKSPM-KEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMS   77 (92)
Q Consensus        13 ~~v~~yd~~~~~W~~~~~~-~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~   77 (92)
                      ..+.+|||.+.+|.+-+-+ ..+|....-+--.+++++-      ....+.+.+|||.+.+...++
T Consensus       254 g~l~rfdPs~~sW~eypLPgs~arpys~rVD~~grVW~s------ea~agai~rfdpeta~ftv~p  313 (353)
T COG4257         254 GSLHRFDPSVTSWIEYPLPGSKARPYSMRVDRHGRVWLS------EADAGAIGRFDPETARFTVLP  313 (353)
T ss_pred             ceeeEeCcccccceeeeCCCCCCCcceeeeccCCcEEee------ccccCceeecCcccceEEEec
Confidence            4589999999999877532 2334332222234567763      223678999999999888763


No 57 
>smart00284 OLF Olfactomedin-like domains.
Probab=80.79  E-value=7  Score=23.89  Aligned_cols=45  Identities=16%  Similarity=0.294  Sum_probs=32.3

Q ss_pred             CCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCCC
Q psy2980          31 MKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLPS   81 (92)
Q Consensus        31 ~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~~   81 (92)
                      +|.+-.+.+.+++++.+|.--      .....+.+||..+.+-.....+|.
T Consensus        70 Lp~~~~GtG~VVYngslYY~~------~~s~~iiKydL~t~~v~~~~~Lp~  114 (255)
T smart00284       70 LPHAGQGTGVVVYNGSLYFNK------FNSHDICRFDLTTETYQKEPLLNG  114 (255)
T ss_pred             CCCccccccEEEECceEEEEe------cCCccEEEEECCCCcEEEEEecCc
Confidence            555566778889999998863      235789999999987654444543


No 58 
>PF03089 RAG2:  Recombination activating protein 2;  InterPro: IPR004321 The variable portion of the genes encoding immunoglobulins and T cell receptors are assembled from component V, D, and J DNA segments by a site-specific recombination reaction termed V(D)J recombination. V(D)J recombination is targeted to specific sites on the chromosome by recombination signal sequences (RSSs) that flank antigen receptor gene segments. The RSS consists of a conserved heptamer (consensus, 5'-CACAGTG-3') and nonamer (consensus, 5'-ACAAAAACC-3') separated by a spacer of either 12 or 23 bp. Efficient recombination occurs between a 12-RSS and a 23-RSS, a restriction known as the 12/23 rule. V(D)J recombination can be divided into two phases, DNA cleavage and DNA joining. DNA cleavage requires two lymphocyte-specific factors, the products of the recombination activating genes, RAG1 and RAG2, which together recognise the RSSs and create double strand breaks at the RSS-coding segment junctions []. RAG-mediated DNA cleavage occurs in a synaptic complex termed the paired complex, which is constituted from two distinct RSS-RAG complexes, a 12-SC and a 23-SC (where SC stands for signal complex). The DNA cleavage reaction involves two distinct enzymatic steps, initial nicking that creates a 3'-OH between a coding segment and its RSS, followed by hairpin formation in which the newly created 3'-OH attacks a phosphodiester bond on the opposite DNA strand. This generates a blunt, 5' phosphorylated signal end containing all of the RSS elements, and a covalently sealed hairpin coding end.  The second phase of V(D)J recombination, in which broken DNA fragments are processed and joined, is less well characterised. Signal ends are typically joined precisely to form a signal joint, whereas joining of the coding ends requires the hairpin structure to be opened and typically involves nucleotide addition and deletion before formation of the coding joint. The factors involved in these processes include ubiquitously expressed proteins involved in the repair of DNA double strand breaks by nonhomologous end joining, terminal deoxynucleotidyl transferase, and Artemis protein. In addition to their critical roles in RSS recognition and DNA cleavage, the RAG proteins may perform two distinct types of functions in the postcleavage phase of V(D)J. A structural function has been inferred from the finding that, after DNA cleavage in vitro, the DNA ends remain associated with the RAG proteins in a "four end" complex known as the cleaved signal complex. After release of the coding ends in vitro, and after coding joint formation in vivo, the RAG proteins remain in a stable signal end complex (SEC) containing the two signal ends. These postcleavage complexes may serve as essential scaffolds for the second phase of the reaction, with the RAG proteins acting to organise the DNA processing and joining events.  The second type of RAG protein-mediated postcleavage activity is the catalysis of phosphodiester bond hydrolysis and strand transfer reactions. The RAG proteins are capable of opening hairpin coding ends in vitro. The RAG proteins also show 3' flap endonuclease activity that may contribute to coding end processing/joining and can utilise the 3' OH group on the signal ends to attack hairpin coding ends (forming hybrid or open/shut joints) or virtually any DNA duplex (forming a transposition product).; GO: 0003677 DNA binding, 0006310 DNA recombination, 0005634 nucleus
Probab=77.83  E-value=5.6  Score=24.92  Aligned_cols=41  Identities=12%  Similarity=0.142  Sum_probs=27.4

Q ss_pred             eEEEeeCCCCeE--EeeCCCCCCcccceEEEECCEEEEEcCcC
Q psy2980          14 RVECYHIAENTW--EYKSPMKEKRYRPGIAVIDGKIYVLGGEE   54 (92)
Q Consensus        14 ~v~~yd~~~~~W--~~~~~~~~~~~~~~~~~~~~~iy~~GG~~   54 (92)
                      .|+..|++-.-.  ..+|.+.-+...|.+.+-++.+|++||-.
T Consensus       132 ~VfLiDleFGC~tah~lpEl~dG~SFHvslar~D~VYilGGHs  174 (337)
T PF03089_consen  132 QVFLIDLEFGCCTAHTLPELQDGQSFHVSLARNDCVYILGGHS  174 (337)
T ss_pred             eEEEEeccccccccccchhhcCCeEEEEEEecCceEEEEccEE
Confidence            356666654433  33455555667777778899999999963


No 59 
>KOG2321|consensus
Probab=76.93  E-value=7  Score=26.82  Aligned_cols=53  Identities=13%  Similarity=0.225  Sum_probs=32.7

Q ss_pred             cceEEEeeCCCCeEEeeCCCCCCcccceEEEEC--CEEEEEcCcCCCCCccceEEEEeCCCCc
Q psy2980          12 TNRVECYHIAENTWEYKSPMKEKRYRPGIAVID--GKIYVLGGEEGWDGYHDSIECYDVDNDS   72 (92)
Q Consensus        12 ~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~--~~iy~~GG~~~~~~~~~~~~~y~~~~~~   72 (92)
                      ..+|+++|.+.++|..--....+..+  ++.++  ..|+.+|+.+      ..++.|||..+.
T Consensus       154 g~evYRlNLEqGrfL~P~~~~~~~lN--~v~in~~hgLla~Gt~~------g~VEfwDpR~ks  208 (703)
T KOG2321|consen  154 GSEVYRLNLEQGRFLNPFETDSGELN--VVSINEEHGLLACGTED------GVVEFWDPRDKS  208 (703)
T ss_pred             CcceEEEEccccccccccccccccce--eeeecCccceEEecccC------ceEEEecchhhh
Confidence            45699999999988543332222222  33332  3477888754      378888887653


No 60 
>PF02191 OLF:  Olfactomedin-like domain;  InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium []. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma []. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins [].; GO: 0005515 protein binding
Probab=74.44  E-value=9.7  Score=23.12  Aligned_cols=46  Identities=22%  Similarity=0.232  Sum_probs=31.8

Q ss_pred             CCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCCC
Q psy2980          30 PMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLPS   81 (92)
Q Consensus        30 ~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~~   81 (92)
                      .+|.+-.+.+.+++++.+|.--      ...+.+.+||..++.-.....+|.
T Consensus        64 ~Lp~~~~GtG~vVYngslYY~~------~~s~~IvkydL~t~~v~~~~~L~~  109 (250)
T PF02191_consen   64 KLPYPWQGTGHVVYNGSLYYNK------YNSRNIVKYDLTTRSVVARRELPG  109 (250)
T ss_pred             EEeceeccCCeEEECCcEEEEe------cCCceEEEEECcCCcEEEEEECCc
Confidence            3455556677888899888863      247899999999887553334443


No 61 
>PF06433 Me-amine-dh_H:  Methylamine dehydrogenase heavy chain (MADH);  InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO).  RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor  MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=73.38  E-value=14  Score=23.67  Aligned_cols=56  Identities=18%  Similarity=0.297  Sum_probs=30.2

Q ss_pred             CCcceEEEeeCCCCeEEeeCCCCCCcccceEEEE-CC--EEEEEcCcCCCCCccceEEEEeCCCCce
Q psy2980          10 ENTNRVECYHIAENTWEYKSPMKEKRYRPGIAVI-DG--KIYVLGGEEGWDGYHDSIECYDVDNDSW   73 (92)
Q Consensus        10 ~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~--~iy~~GG~~~~~~~~~~~~~y~~~~~~W   73 (92)
                      .+..+||.||+++.+  ++..++......++.+- ++  .||..-+.      -..++.||..+.+-
T Consensus       266 dpgteVWv~D~~t~k--rv~Ri~l~~~~~Si~Vsqd~~P~L~~~~~~------~~~l~v~D~~tGk~  324 (342)
T PF06433_consen  266 DPGTEVWVYDLKTHK--RVARIPLEHPIDSIAVSQDDKPLLYALSAG------DGTLDVYDAATGKL  324 (342)
T ss_dssp             S-EEEEEEEETTTTE--EEEEEEEEEEESEEEEESSSS-EEEEEETT------TTEEEEEETTT--E
T ss_pred             CCceEEEEEECCCCe--EEEEEeCCCccceEEEccCCCcEEEEEcCC------CCeEEEEeCcCCcE
Confidence            356789999999886  22222222222234443 33  45554221      35799999988754


No 62 
>KOG0316|consensus
Probab=71.39  E-value=18  Score=22.29  Aligned_cols=52  Identities=15%  Similarity=0.268  Sum_probs=36.3

Q ss_pred             eEEEeeCCCCeEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCC
Q psy2980          14 RVECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDND   71 (92)
Q Consensus        14 ~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~   71 (92)
                      ++.++|...++..++.-+...+.......+.+..++.|..++      .+..||....
T Consensus       124 s~r~wDCRS~s~ePiQildea~D~V~Si~v~~heIvaGS~DG------tvRtydiR~G  175 (307)
T KOG0316|consen  124 SVRLWDCRSRSFEPIQILDEAKDGVSSIDVAEHEIVAGSVDG------TVRTYDIRKG  175 (307)
T ss_pred             eeEEEEcccCCCCccchhhhhcCceeEEEecccEEEeeccCC------cEEEEEeecc
Confidence            467777777777777666667777777778888877776554      5667777554


No 63 
>PF03178 CPSF_A:  CPSF A subunit region;  InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=68.30  E-value=26  Score=21.71  Aligned_cols=71  Identities=17%  Similarity=0.158  Sum_probs=42.4

Q ss_pred             ceEEEeeCCCCe-EEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCCCCccceee
Q psy2980          13 NRVECYHIAENT-WEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLPSARSWLGC   88 (92)
Q Consensus        13 ~~v~~yd~~~~~-W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~~~~~~~~~   88 (92)
                      +.+..|+...++ +.....+..+-...++.+.++.|++---.     ..-.+..|+...++-..++.-..++...++
T Consensus       107 ~~l~v~~l~~~~~l~~~~~~~~~~~i~sl~~~~~~I~vgD~~-----~sv~~~~~~~~~~~l~~va~d~~~~~v~~~  178 (321)
T PF03178_consen  107 NKLYVYDLDNSKTLLKKAFYDSPFYITSLSVFKNYILVGDAM-----KSVSLLRYDEENNKLILVARDYQPRWVTAA  178 (321)
T ss_dssp             TEEEEEEEETTSSEEEEEEE-BSSSEEEEEEETTEEEEEESS-----SSEEEEEEETTTE-EEEEEEESS-BEEEEE
T ss_pred             CEEEEEEccCcccchhhheecceEEEEEEeccccEEEEEEcc-----cCEEEEEEEccCCEEEEEEecCCCccEEEE
Confidence            357888888777 88887665555666777788866553221     134455677766666666554444443333


No 64 
>PF14830 Haemocyan_bet_s:  Haemocyanin beta-sandwich; PDB: 1JS8_B 3QJO_A 1LNL_A.
Probab=65.31  E-value=8.5  Score=20.16  Aligned_cols=28  Identities=32%  Similarity=0.506  Sum_probs=14.6

Q ss_pred             EECCEEEEEcCcCCCCCccceEEEEeCC
Q psy2980          42 VIDGKIYVLGGEEGWDGYHDSIECYDVD   69 (92)
Q Consensus        42 ~~~~~iy~~GG~~~~~~~~~~~~~y~~~   69 (92)
                      ...+.+++.||...+.=....++.||+.
T Consensus        34 ~~AG~F~vLGG~~EMpW~FdRlykydIT   61 (103)
T PF14830_consen   34 HKAGTFFVLGGEKEMPWAFDRLYKYDIT   61 (103)
T ss_dssp             EEEEEEEE---TTS---B-SS-EEEE-H
T ss_pred             eEccEEEEcCCCccCccccCccchhhHH
Confidence            5567899999987655567888888873


No 65 
>PF14157 YmzC:  YmzC-like protein; PDB: 3KVP_E.
Probab=61.47  E-value=17  Score=17.23  Aligned_cols=15  Identities=20%  Similarity=0.206  Sum_probs=12.0

Q ss_pred             eEEEeeCCCCeEEee
Q psy2980          14 RVECYHIAENTWEYK   28 (92)
Q Consensus        14 ~v~~yd~~~~~W~~~   28 (92)
                      +++.||+++|.-..+
T Consensus        42 KIfkyd~~tNei~L~   56 (63)
T PF14157_consen   42 KIFKYDEDTNEITLK   56 (63)
T ss_dssp             EEEEEETTTTEEEEE
T ss_pred             EEEEeCCCCCeEEEE
Confidence            589999999986544


No 66 
>PF05096 Glu_cyclase_2:  Glutamine cyclotransferase;  InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=61.27  E-value=37  Score=21.03  Aligned_cols=50  Identities=24%  Similarity=0.264  Sum_probs=35.2

Q ss_pred             ceEEE-ECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCCCCccceeeEEc
Q psy2980          38 PGIAV-IDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLPSARSWLGCVPL   91 (92)
Q Consensus        38 ~~~~~-~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~~~~~~~~~~~~   91 (92)
                      .+... .++.+|---|..+    .+.+..+|+.+.+-....++|..-++-|.+.+
T Consensus        48 QGL~~~~~g~LyESTG~yG----~S~l~~~d~~tg~~~~~~~l~~~~FgEGit~~   98 (264)
T PF05096_consen   48 QGLEFLDDGTLYESTGLYG----QSSLRKVDLETGKVLQSVPLPPRYFGEGITIL   98 (264)
T ss_dssp             EEEEEEETTEEEEEECSTT----EEEEEEEETTTSSEEEEEE-TTT--EEEEEEE
T ss_pred             ccEEecCCCEEEEeCCCCC----cEEEEEEECCCCcEEEEEECCccccceeEEEE
Confidence            34444 5678887766544    68899999999988877888887777777765


No 67 
>PF07443 HARP:  HepA-related protein (HARP);  InterPro: IPR010003 This entry represents a conserved region approximately 60 residues long within eukaryotic HepA-related protein (HARP). This exhibits single-stranded DNA-dependent ATPase activity, and is ubiquitously expressed in human and mouse tissues []. Family members may contain more than one copy of this region.; GO: 0004386 helicase activity, 0005524 ATP binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0016568 chromatin modification, 0005634 nucleus
Probab=59.23  E-value=4.7  Score=18.57  Aligned_cols=13  Identities=15%  Similarity=0.626  Sum_probs=10.8

Q ss_pred             EEeeCCCCeEEee
Q psy2980          16 ECYHIAENTWEYK   28 (92)
Q Consensus        16 ~~yd~~~~~W~~~   28 (92)
                      ..||+.|..|.-.
T Consensus        21 r~YD~~Tr~W~F~   33 (55)
T PF07443_consen   21 RNYDPKTRKWNFS   33 (55)
T ss_pred             cccCccceeeeee
Confidence            6799999999654


No 68 
>PF11958 DUF3472:  Domain of unknown function (DUF3472);  InterPro: IPR021862  This presumed domain is functionally uncharacterised. This domain is found in bacteria, eukaryotes and viruses. This domain is typically between 174 to 190 amino acids in length. This domain has a single completely conserved residue G that may be functionally important. 
Probab=58.11  E-value=35  Score=19.74  Aligned_cols=23  Identities=9%  Similarity=0.144  Sum_probs=17.8

Q ss_pred             eEEEeeCCCCeEEeeCCCCCCcc
Q psy2980          14 RVECYHIAENTWEYKSPMKEKRY   36 (92)
Q Consensus        14 ~v~~yd~~~~~W~~~~~~~~~~~   36 (92)
                      ..+.+|+.+++|..|..+..|..
T Consensus        81 ~~~v~d~~t~~~~~Igs~~~p~~  103 (184)
T PF11958_consen   81 TGYVTDPTTGKWTLIGSIKRPKT  103 (184)
T ss_pred             EEEEEECCCCcEEEEEEEEEcCc
Confidence            35788999999999987655554


No 69 
>PF12768 Rax2:  Cortical protein marker for cell polarity
Probab=57.97  E-value=43  Score=20.83  Aligned_cols=63  Identities=16%  Similarity=0.296  Sum_probs=38.2

Q ss_pred             CcceEEEeeCCCCeEEeeCC-----CCCCcccceEEEE-CCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCC
Q psy2980          11 NTNRVECYHIAENTWEYKSP-----MKEKRYRPGIAVI-DGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSH   78 (92)
Q Consensus        11 ~~~~v~~yd~~~~~W~~~~~-----~~~~~~~~~~~~~-~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~   78 (92)
                      ....+-.||..+.+|..++.     +|.+......... .+.+++.|....   -..-+..|  .-.+|..+..
T Consensus        62 ~~~~la~yd~~~~~w~~~~~~~s~~ipgpv~a~~~~~~d~~~~~~aG~~~~---g~~~l~~~--dGs~W~~i~~  130 (281)
T PF12768_consen   62 NSSNLATYDFKNQTWSSLGGGSSNSIPGPVTALTFISNDGSNFWVAGRSAN---GSTFLMKY--DGSSWSSIGS  130 (281)
T ss_pred             CceeEEEEecCCCeeeecCCcccccCCCcEEEEEeeccCCceEEEeceecC---CCceEEEE--cCCceEeccc
Confidence            34458999999999988865     2333322222222 345777775421   24456677  4568988865


No 70 
>PF06058 DCP1:  Dcp1-like decapping family;  InterPro: IPR010334 An essential step in mRNA turnover is decapping. In yeast, two proteins have been identified that are essential for decapping, Dcp1 (this family) and Dcp2 (IPR007722 from INTERPRO). The precise role of these proteins in the decapping reaction has not been established. Evidence suggests that the Dcp1 may enhance the function of Dcp2 [].; PDB: 1Q67_A 2QKM_C 2QKL_A.
Probab=57.43  E-value=15  Score=19.71  Aligned_cols=17  Identities=24%  Similarity=0.554  Sum_probs=14.5

Q ss_pred             cceEEEEeCCCCceEEc
Q psy2980          60 HDSIECYDVDNDSWEIM   76 (92)
Q Consensus        60 ~~~~~~y~~~~~~W~~~   76 (92)
                      .-.++.||+.+++|.+.
T Consensus        28 ~v~vY~f~~~~~~W~K~   44 (122)
T PF06058_consen   28 HVVVYKFDHETNEWEKT   44 (122)
T ss_dssp             EEEEEEEETTTTEEEEE
T ss_pred             eEEEEeecCCCCcEeec
Confidence            55788999999999975


No 71 
>KOG2394|consensus
Probab=54.98  E-value=18  Score=24.76  Aligned_cols=13  Identities=54%  Similarity=0.825  Sum_probs=10.8

Q ss_pred             ECCEEEEEcCcCC
Q psy2980          43 IDGKIYVLGGEEG   55 (92)
Q Consensus        43 ~~~~iy~~GG~~~   55 (92)
                      -+++.++.||.++
T Consensus       342 PDGKyIvtGGEDD  354 (636)
T KOG2394|consen  342 PDGKYIVTGGEDD  354 (636)
T ss_pred             CCccEEEecCCcc
Confidence            4888999999875


No 72 
>cd01206 Homer Homer type EVH1 domain. Homer type EVH1 domain. Homer is a synaptic scaffolding protein, involved in neuronal signaling. It contains an  EVH1 domain, which binds to both neurotransmitter receptors, such as the metabotropic glutamate receptor (mGluR) and to other scaffolding proteins via PPXXF motifs, in order to target them to the synaptic junction. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
Probab=53.88  E-value=33  Score=18.25  Aligned_cols=20  Identities=15%  Similarity=0.089  Sum_probs=15.8

Q ss_pred             CcceEEEeeCCCC-eEEeeCC
Q psy2980          11 NTNRVECYHIAEN-TWEYKSP   30 (92)
Q Consensus        11 ~~~~v~~yd~~~~-~W~~~~~   30 (92)
                      ....|..||+.++ .|....+
T Consensus         9 arA~V~~yd~~tKk~WvPs~~   29 (111)
T cd01206           9 TRAHVFQIDPKTKKNWIPASK   29 (111)
T ss_pred             eeeEEEEECCCCcceeEeCCC
Confidence            3456999999875 9988865


No 73 
>PF08450 SGL:  SMP-30/Gluconolaconase/LRE-like region;  InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=51.78  E-value=48  Score=19.51  Aligned_cols=56  Identities=21%  Similarity=0.270  Sum_probs=34.6

Q ss_pred             cceEEEeeCCCCeEEeeCCCCCCcccceEEEE--CCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCC
Q psy2980          12 TNRVECYHIAENTWEYKSPMKEKRYRPGIAVI--DGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHL   79 (92)
Q Consensus        12 ~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~--~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~   79 (92)
                      ..+++++|+.+.+-..+....    ..+++..  ++.+|+...        ..+..+|+.+.+++.+...
T Consensus        21 ~~~i~~~~~~~~~~~~~~~~~----~~G~~~~~~~g~l~v~~~--------~~~~~~d~~~g~~~~~~~~   78 (246)
T PF08450_consen   21 GGRIYRVDPDTGEVEVIDLPG----PNGMAFDRPDGRLYVADS--------GGIAVVDPDTGKVTVLADL   78 (246)
T ss_dssp             TTEEEEEETTTTEEEEEESSS----EEEEEEECTTSEEEEEET--------TCEEEEETTTTEEEEEEEE
T ss_pred             CCEEEEEECCCCeEEEEecCC----CceEEEEccCCEEEEEEc--------CceEEEecCCCcEEEEeec
Confidence            356888888887765543322    2334444  577777653        2345568888888877554


No 74 
>PF03178 CPSF_A:  CPSF A subunit region;  InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=49.68  E-value=61  Score=20.08  Aligned_cols=60  Identities=20%  Similarity=0.323  Sum_probs=38.0

Q ss_pred             ceEEEeeCCCC-----eEEeeCCCCCCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCc-eEEcCCCC
Q psy2980          13 NRVECYHIAEN-----TWEYKSPMKEKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDS-WEIMSHLP   80 (92)
Q Consensus        13 ~~v~~yd~~~~-----~W~~~~~~~~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~-W~~~~~~~   80 (92)
                      -.+..|+..+.     ++..+...+....-.+++.+++.+.+..|        +.+..|+...++ +...+.+.
T Consensus        62 Gri~v~~i~~~~~~~~~l~~i~~~~~~g~V~ai~~~~~~lv~~~g--------~~l~v~~l~~~~~l~~~~~~~  127 (321)
T PF03178_consen   62 GRILVFEISESPENNFKLKLIHSTEVKGPVTAICSFNGRLVVAVG--------NKLYVYDLDNSKTLLKKAFYD  127 (321)
T ss_dssp             EEEEEEEECSS-----EEEEEEEEEESS-EEEEEEETTEEEEEET--------TEEEEEEEETTSSEEEEEEE-
T ss_pred             cEEEEEEEEcccccceEEEEEEEEeecCcceEhhhhCCEEEEeec--------CEEEEEEccCcccchhhheec
Confidence            56889998874     66666544444444667778888555443        467777777766 77665443


No 75 
>PF03088 Str_synth:  Strictosidine synthase;  InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=48.02  E-value=26  Score=17.77  Aligned_cols=20  Identities=15%  Similarity=0.041  Sum_probs=14.4

Q ss_pred             CCcceEEEeeCCCCeEEeeC
Q psy2980          10 ENTNRVECYHIAENTWEYKS   29 (92)
Q Consensus        10 ~~~~~v~~yd~~~~~W~~~~   29 (92)
                      .++-.+..|||.+++...+.
T Consensus        34 ~~~GRll~ydp~t~~~~vl~   53 (89)
T PF03088_consen   34 RPTGRLLRYDPSTKETTVLL   53 (89)
T ss_dssp             ---EEEEEEETTTTEEEEEE
T ss_pred             CCCcCEEEEECCCCeEEEeh
Confidence            35567999999999986664


No 76 
>COG2380 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=45.86  E-value=28  Score=21.68  Aligned_cols=21  Identities=14%  Similarity=0.420  Sum_probs=16.7

Q ss_pred             CcceEEEeeCCCCeEEeeCCC
Q psy2980          11 NTNRVECYHIAENTWEYKSPM   31 (92)
Q Consensus        11 ~~~~v~~yd~~~~~W~~~~~~   31 (92)
                      .+.....+|.+|++|..++.+
T Consensus        20 ~~f~~~~~~vEtdew~~i~~~   40 (327)
T COG2380          20 KPFAEESFDVETDEWEAIPTL   40 (327)
T ss_pred             ccccceeeecccccccccCcc
Confidence            344578899999999999754


No 77 
>cd01207 Ena-Vasp Enabled-VASP-type homology (EVH1) domain. Enabled-VASP-type homology (EVH1) domain. The EVH1 domain binds to other proteins at proline rich sequences. It is found in proteins involved in cytoskeletal reorganization such as Enabled and VASP. Ena-VASP type EVH1 domains specifically recognize FPPPP motifs in the focal adhesion proteins zyxin and vinculin, and the ActA surface protein of Listeria monocytogenes.  It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
Probab=45.13  E-value=49  Score=17.62  Aligned_cols=18  Identities=17%  Similarity=0.294  Sum_probs=15.3

Q ss_pred             ceEEEeeCCCCeEEeeCC
Q psy2980          13 NRVECYHIAENTWEYKSP   30 (92)
Q Consensus        13 ~~v~~yd~~~~~W~~~~~   30 (92)
                      +.|..||+.++.|.....
T Consensus         9 A~Vm~~d~~tk~W~P~~~   26 (111)
T cd01207           9 ASVMVYDDSNKKWVPAGG   26 (111)
T ss_pred             EEeeEEcCCCCcEEcCCC
Confidence            468999999999988755


No 78 
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=42.32  E-value=91  Score=19.98  Aligned_cols=57  Identities=16%  Similarity=0.396  Sum_probs=33.2

Q ss_pred             ceEEEeeCCCC--eEEeeCCCC--------CCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCC--ceEEc
Q psy2980          13 NRVECYHIAEN--TWEYKSPMK--------EKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDND--SWEIM   76 (92)
Q Consensus        13 ~~v~~yd~~~~--~W~~~~~~~--------~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~--~W~~~   76 (92)
                      ..+.+||..+.  .|+.-..-.        ..+...+.++.++.+|+...       ...++++|..+.  .|+.-
T Consensus        79 g~l~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~v~~~~v~v~~~-------~g~l~ald~~tG~~~W~~~  147 (394)
T PRK11138         79 GLVKALDADTGKEIWSVDLSEKDGWFSKNKSALLSGGVTVAGGKVYIGSE-------KGQVYALNAEDGEVAWQTK  147 (394)
T ss_pred             CeEEEEECCCCcEeeEEcCCCcccccccccccccccccEEECCEEEEEcC-------CCEEEEEECCCCCCccccc
Confidence            35889998765  486542211        11223345667888887532       236888887553  57654


No 79 
>PRK11408 hypothetical protein; Provisional
Probab=40.55  E-value=68  Score=18.00  Aligned_cols=30  Identities=17%  Similarity=0.326  Sum_probs=20.9

Q ss_pred             EEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980          47 IYVLGGEEGWDGYHDSIECYDVDNDSWEIMS   77 (92)
Q Consensus        47 iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~   77 (92)
                      +|.+.|..- ......++.|-|.+-+|+.+.
T Consensus        34 ~FAiNgG~l-g~~~G~v~Y~aPDtL~WE~l~   63 (145)
T PRK11408         34 FFALNGGGL-GEDVGEVYYFAPDTLKWESLE   63 (145)
T ss_pred             eEEeccCcc-CCCCCeEEEeCCCccceeccC
Confidence            455543221 445788999999999999763


No 80 
>PF13570 PQQ_3:  PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=40.37  E-value=31  Score=14.02  Aligned_cols=26  Identities=27%  Similarity=0.450  Sum_probs=15.6

Q ss_pred             ceEEEECCEEEEEcCcCCCCCccceEEEEeCCC
Q psy2980          38 PGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDN   70 (92)
Q Consensus        38 ~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~   70 (92)
                      .+.++.++.+|+...       -..++++|+.+
T Consensus        15 ~~~~v~~g~vyv~~~-------dg~l~ald~~t   40 (40)
T PF13570_consen   15 SSPAVAGGRVYVGTG-------DGNLYALDAAT   40 (40)
T ss_dssp             S--EECTSEEEEE-T-------TSEEEEEETT-
T ss_pred             cCCEEECCEEEEEcC-------CCEEEEEeCCC
Confidence            334667888888753       34788888753


No 81 
>PF07734 FBA_1:  F-box associated;  InterPro: IPR006527 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=40.08  E-value=67  Score=17.82  Aligned_cols=40  Identities=20%  Similarity=0.337  Sum_probs=23.8

Q ss_pred             EEECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCCC
Q psy2980          41 AVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLPS   81 (92)
Q Consensus        41 ~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~~   81 (92)
                      +.+++.+|=+.-... ......+.+||..+++.++..++|.
T Consensus         2 V~vnG~~hW~~~~~~-~~~~~~IlsFDl~~E~F~~~~~lP~   41 (164)
T PF07734_consen    2 VFVNGALHWLAYDEN-NDEKDFILSFDLSTEKFGRSLPLPF   41 (164)
T ss_pred             EEECCEEEeeEEecC-CCCceEEEEEeccccccCCEECCCC
Confidence            345677766654322 1112279999999999944445543


No 82 
>PF03835 Rad4:  Rad4 transglutaminase-like domain;  InterPro: IPR018325 RAD4/Xp-C proteins contain an ancient transglutaminase fold that is also found in peptide-N-glycanases (PNGases), which remove glycans from glycoproteins during their degradation. The PNGases retain the catalytic triad that is typical of this fold and are predicted to have a reaction mechanism similar to that involved in transglutamination. In contrast, the RAD4/Xp-C proteins are predicted to be inactive and are likely to only possess the interaction function in DNA repair []. ; GO: 0003684 damaged DNA binding, 0006289 nucleotide-excision repair, 0005634 nucleus; PDB: 2QSG_A 2QSF_A 2QSH_A 1X3W_A 1X3Z_A 3ESW_A.
Probab=39.47  E-value=46  Score=18.25  Aligned_cols=21  Identities=29%  Similarity=0.439  Sum_probs=16.8

Q ss_pred             cceEEEeeCCCCeEEeeCCCC
Q psy2980          12 TNRVECYHIAENTWEYKSPMK   32 (92)
Q Consensus        12 ~~~v~~yd~~~~~W~~~~~~~   32 (92)
                      .--+|+|++.+++|..+-++.
T Consensus        36 ~~W~EV~~~~~~rWI~VDp~~   56 (145)
T PF03835_consen   36 NFWVEVYSPEEKRWIHVDPVV   56 (145)
T ss_dssp             CEEEEEEETTTTEEEEEETTT
T ss_pred             eEEEEEEecCCCeEEEeeeec
Confidence            335899999999999996554


No 83 
>PF10902 DUF2693:  Protein of unknown function (DUF2693);  InterPro: IPR024401 This family of proteins is found in bacteria and bacteriophages. Its function is unknown. 
Probab=38.19  E-value=46  Score=16.75  Aligned_cols=20  Identities=20%  Similarity=0.579  Sum_probs=16.2

Q ss_pred             CCccceEEEEeCCCCceEEc
Q psy2980          57 DGYHDSIECYDVDNDSWEIM   76 (92)
Q Consensus        57 ~~~~~~~~~y~~~~~~W~~~   76 (92)
                      .+....+..||...+.|+..
T Consensus        45 kes~~s~~yfDve~~~WRSF   64 (83)
T PF10902_consen   45 KESTTSVRYFDVEKKGWRSF   64 (83)
T ss_pred             cCCcceEEEEEeccCceeee
Confidence            44567799999999999854


No 84 
>PRK09511 nirD nitrite reductase small subunit; Provisional
Probab=37.74  E-value=55  Score=16.98  Aligned_cols=32  Identities=9%  Similarity=0.087  Sum_probs=14.5

Q ss_pred             EEeeCCCCeEEeeCCCCCCcccceEEEECCEEEE
Q psy2980          16 ECYHIAENTWEYKSPMKEKRYRPGIAVIDGKIYV   49 (92)
Q Consensus        16 ~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~iy~   49 (92)
                      ++||..|.+....+.  .+...+.+-+-++.||+
T Consensus        75 ~~Fdl~TG~~~~~~~--~~l~typV~ve~g~V~v  106 (108)
T PRK09511         75 QRFRLSDGLCMEDEQ--FSVKHYDARVKDGVVQL  106 (108)
T ss_pred             CEEECCCcccCCCCC--ccEeeEeEEEECCEEEE
Confidence            556666665433221  12222334444666655


No 85 
>PF08662 eIF2A:  Eukaryotic translation initiation factor eIF2A;  InterPro: IPR013979  This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins. 
Probab=36.46  E-value=87  Score=18.04  Aligned_cols=25  Identities=32%  Similarity=0.611  Sum_probs=15.5

Q ss_pred             CCEEEEEcCcCCCCCccceEEEEeCCCC
Q psy2980          44 DGKIYVLGGEEGWDGYHDSIECYDVDND   71 (92)
Q Consensus        44 ~~~iy~~GG~~~~~~~~~~~~~y~~~~~   71 (92)
                      ++.+.+++|..+   ...++..||..+.
T Consensus       111 ~G~~l~~~g~~n---~~G~l~~wd~~~~  135 (194)
T PF08662_consen  111 DGRFLVLAGFGN---LNGDLEFWDVRKK  135 (194)
T ss_pred             CCCEEEEEEccC---CCcEEEEEECCCC
Confidence            566777777643   2246778888643


No 86 
>PF00614 PLDc:  Phospholipase D Active site motif;  InterPro: IPR001736 Phosphatidylcholine-hydrolysing phospholipase D (PLD) isoforms are activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic acid from phosphatidylcholine, which may be essential for the formation of certain types of transport vesicles or may be constitutive vesicular transport to signal transduction pathways. PC-hydrolysing PLD is a homologue of cardiolipin synthase, phosphatidylserine synthase, bacterial PLDs, and viral proteins. Each of these appears to possess a domain duplication which is apparent by the presence of two motifs containing well-conserved histidine, lysine, and/or asparagine residues which may contribute to the active site aspartic acid. An Escherichia coli endonuclease (nuc) and similar proteins appear to be PLD homologues but possess only one of these motifs [, , , ].; GO: 0003824 catalytic activity, 0008152 metabolic process; PDB: 3HSI_C.
Probab=35.69  E-value=35  Score=13.21  Aligned_cols=16  Identities=25%  Similarity=0.644  Sum_probs=8.9

Q ss_pred             eEEEECCEEEEEcCcC
Q psy2980          39 GIAVIDGKIYVLGGEE   54 (92)
Q Consensus        39 ~~~~~~~~iy~~GG~~   54 (92)
                      -.+++++.+.++||.+
T Consensus         8 K~~vvD~~~a~vGg~n   23 (28)
T PF00614_consen    8 KFVVVDDRVAFVGGAN   23 (28)
T ss_dssp             -EEEETTTEEEEE---
T ss_pred             EEEEEcCCEEEECcee
Confidence            3566777777777754


No 87 
>KOG2055|consensus
Probab=34.47  E-value=1.5e+02  Score=20.22  Aligned_cols=27  Identities=11%  Similarity=0.249  Sum_probs=19.8

Q ss_pred             EEEEcCcCCCCCccceEEEEeCCCCceEEcCCC
Q psy2980          47 IYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHL   79 (92)
Q Consensus        47 iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~   79 (92)
                      ..+++|.      .+-++.||..+.+-.++.++
T Consensus       272 ~i~~s~r------rky~ysyDle~ak~~k~~~~  298 (514)
T KOG2055|consen  272 VIFTSGR------RKYLYSYDLETAKVTKLKPP  298 (514)
T ss_pred             EEEeccc------ceEEEEeeccccccccccCC
Confidence            5555554      45789999998888887655


No 88 
>KOG2958|consensus
Probab=33.02  E-value=35  Score=21.63  Aligned_cols=20  Identities=25%  Similarity=0.478  Sum_probs=15.4

Q ss_pred             CcceEEEeeCCCCeEEeeCC
Q psy2980          11 NTNRVECYHIAENTWEYKSP   30 (92)
Q Consensus        11 ~~~~v~~yd~~~~~W~~~~~   30 (92)
                      +...-.+|||.+++|-.+.+
T Consensus         9 ~q~~hrRynPltd~wVlvSp   28 (354)
T KOG2958|consen    9 NQHSHRRYNPLTDEWVLVSP   28 (354)
T ss_pred             ccCchhccCCccceeEEech
Confidence            34457899999999987743


No 89 
>PRK00178 tolB translocation protein TolB; Provisional
Probab=32.95  E-value=1.4e+02  Score=19.36  Aligned_cols=57  Identities=12%  Similarity=0.085  Sum_probs=30.9

Q ss_pred             eEEEeeCCCCeEEeeCCCCCCcccceEE-EEC-CEEEEEcCcCCCCCccceEEEEeCCCCceEEc
Q psy2980          14 RVECYHIAENTWEYKSPMKEKRYRPGIA-VID-GKIYVLGGEEGWDGYHDSIECYDVDNDSWEIM   76 (92)
Q Consensus        14 ~v~~yd~~~~~W~~~~~~~~~~~~~~~~-~~~-~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~   76 (92)
                      +++.+|..+.....+...+.  ...... .-+ ..|++.....+    ...++.+|..+..++++
T Consensus       268 ~Iy~~d~~~~~~~~lt~~~~--~~~~~~~spDg~~i~f~s~~~g----~~~iy~~d~~~g~~~~l  326 (430)
T PRK00178        268 EIYVMDLASRQLSRVTNHPA--IDTEPFWGKDGRTLYFTSDRGG----KPQIYKVNVNGGRAERV  326 (430)
T ss_pred             eEEEEECCCCCeEEcccCCC--CcCCeEECCCCCEEEEEECCCC----CceEEEEECCCCCEEEe
Confidence            58888998888777654221  111111 123 34544432111    34678888777766655


No 90 
>PRK05137 tolB translocation protein TolB; Provisional
Probab=32.88  E-value=1.4e+02  Score=19.47  Aligned_cols=60  Identities=17%  Similarity=0.210  Sum_probs=31.0

Q ss_pred             ceEEEeeCCCCeEEeeCCCCCCcccceEEEECC-EEEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980          13 NRVECYHIAENTWEYKSPMKEKRYRPGIAVIDG-KIYVLGGEEGWDGYHDSIECYDVDNDSWEIMS   77 (92)
Q Consensus        13 ~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~   77 (92)
                      ..++.+|+.+.+...+...+...... ...-++ .|++....++    ..+++.+|..+..-+++.
T Consensus       226 ~~i~~~dl~~g~~~~l~~~~g~~~~~-~~SPDG~~la~~~~~~g----~~~Iy~~d~~~~~~~~Lt  286 (435)
T PRK05137        226 PRVYLLDLETGQRELVGNFPGMTFAP-RFSPDGRKVVMSLSQGG----NTDIYTMDLRSGTTTRLT  286 (435)
T ss_pred             CEEEEEECCCCcEEEeecCCCcccCc-EECCCCCEEEEEEecCC----CceEEEEECCCCceEEcc
Confidence            35888888888777665443221111 111233 3433332221    346777777666655554


No 91 
>PRK04792 tolB translocation protein TolB; Provisional
Probab=32.77  E-value=1.5e+02  Score=19.64  Aligned_cols=59  Identities=12%  Similarity=0.120  Sum_probs=31.4

Q ss_pred             ceEEEeeCCCCeEEeeCCCCCCcccceEEE-EC-CEEEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980          13 NRVECYHIAENTWEYKSPMKEKRYRPGIAV-ID-GKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMS   77 (92)
Q Consensus        13 ~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~-~~-~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~   77 (92)
                      ..++.+|+.+++-..+...+...  ...+. -+ ..|++....++    ...++.+|..+.+.+++.
T Consensus       242 ~~L~~~dl~tg~~~~lt~~~g~~--~~~~wSPDG~~La~~~~~~g----~~~Iy~~dl~tg~~~~lt  302 (448)
T PRK04792        242 AEIFVQDIYTQVREKVTSFPGIN--GAPRFSPDGKKLALVLSKDG----QPEIYVVDIATKALTRIT  302 (448)
T ss_pred             cEEEEEECCCCCeEEecCCCCCc--CCeeECCCCCEEEEEEeCCC----CeEEEEEECCCCCeEECc
Confidence            35788888777665555433211  11222 23 34544432222    356888888777776654


No 92 
>PF09272 Hepsin-SRCR:  Hepsin, SRCR;  InterPro: IPR015352 This entry represents the extracellular domain of the serine protease hepsin. The domain is formed primarily by three elements of regular secondary structure: a 12-residue alpha helix, a twisted five-stranded antiparallel beta sheet, and a second, two-stranded, antiparallel sheet. The two beta-sheets lie at roughly right angles to each other, with the helix nestled between the two, adopting an SRCR fold. The exact function of this domain has not been identified, though it probably may serve to orient the protease domain or place it in the vicinity of its substrate []. ; GO: 0004252 serine-type endopeptidase activity, 0070008 serine-type exopeptidase activity; PDB: 3T2N_B 1Z8G_A 1P57_A 1O5E_L 1O5F_L.
Probab=31.69  E-value=70  Score=16.94  Aligned_cols=20  Identities=10%  Similarity=0.521  Sum_probs=14.8

Q ss_pred             cceEEEEeCCCCceEEcCCC
Q psy2980          60 HDSIECYDVDNDSWEIMSHL   79 (92)
Q Consensus        60 ~~~~~~y~~~~~~W~~~~~~   79 (92)
                      -..+..||..+++|+.+...
T Consensus        11 D~rL~vfD~te~~WRllCss   30 (110)
T PF09272_consen   11 DQRLMVFDSTEGTWRLLCSS   30 (110)
T ss_dssp             T-BEEEEETTTTEEEEEB--
T ss_pred             cceEEEEeccCCEEEEEeec
Confidence            45688899999999988643


No 93 
>PF14298 DUF4374:  Domain of unknown function (DUF4374)
Probab=31.49  E-value=1.7e+02  Score=19.79  Aligned_cols=61  Identities=16%  Similarity=0.277  Sum_probs=36.4

Q ss_pred             cceEEEeeCCCCeEEeeCCCCCC---cccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCCceE
Q psy2980          12 TNRVECYHIAENTWEYKSPMKEK---RYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWE   74 (92)
Q Consensus        12 ~~~v~~yd~~~~~W~~~~~~~~~---~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~   74 (92)
                      ..++-+||..+.+...+..+|..   .......+-++.+|+--.-.  .....-++.+||.+.+-.
T Consensus       366 ~~~laI~d~~~kt~t~V~glP~~~is~~~~~~~ve~G~aYi~Vtt~--~g~~~~IY~iDp~TatAt  429 (435)
T PF14298_consen  366 AKKLAIFDVSNKTFTWVTGLPADLISGFGNAPYVENGKAYIPVTTE--DGSDPYIYKIDPATATAT  429 (435)
T ss_pred             cceEEEEEccCceeEEeccCChhhccccccceEeeCCEEEEEEeec--CCCceeEEEEcCcccccc
Confidence            34577888888877777666654   22223344566666653221  111357999999877543


No 94 
>cd00837 EVH1 EVH1 (Enabled, Vasp-Homology) or WASP Homology (WH1) domain. EVH1 (Enabled, Vasp-Homology) or WASP Homology (WH1) domain. The EVH1 domain binds to other proteins at proline rich sequences in either FPPPP or PPXXF motifs. It is found in the cytoskeletal reorganization proteins Enabled VASP, and WASP, and in the synaptic scaffolding protein Homer. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
Probab=31.48  E-value=62  Score=16.70  Aligned_cols=17  Identities=12%  Similarity=0.173  Sum_probs=14.6

Q ss_pred             cceEEEEeCCCCceEEc
Q psy2980          60 HDSIECYDVDNDSWEIM   76 (92)
Q Consensus        60 ~~~~~~y~~~~~~W~~~   76 (92)
                      ...++.+|+.++.|...
T Consensus         8 ~a~v~~~~~~~~~W~~~   24 (104)
T cd00837           8 VAQVYTADPSTGKWVPA   24 (104)
T ss_pred             EEEEEEECCCCCceEEC
Confidence            56788999999999987


No 95 
>KOG0179|consensus
Probab=31.20  E-value=35  Score=20.47  Aligned_cols=19  Identities=21%  Similarity=0.333  Sum_probs=13.8

Q ss_pred             CccccCCCCCCcceEEEeeCC
Q psy2980           1 MLRGWHASTENTNRVECYHIA   21 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~   21 (92)
                      |++|.+.++...  |+.|||.
T Consensus       129 ilaGiDeeGKG~--VySyDPv  147 (235)
T KOG0179|consen  129 ILAGIDEEGKGA--VYSYDPV  147 (235)
T ss_pred             eeecccccCcee--EEeecCC
Confidence            467887666555  8999875


No 96 
>PF13952 DUF4216:  Domain of unknown function (DUF4216)
Probab=30.80  E-value=75  Score=15.57  Aligned_cols=19  Identities=11%  Similarity=0.298  Sum_probs=11.7

Q ss_pred             CCccceEEEEeCC-CCceEE
Q psy2980          57 DGYHDSIECYDVD-NDSWEI   75 (92)
Q Consensus        57 ~~~~~~~~~y~~~-~~~W~~   75 (92)
                      .+...-.+.-||. ...|..
T Consensus        54 sqa~QVfYv~dp~~~~~w~v   73 (75)
T PF13952_consen   54 SQAKQVFYVPDPSSDKKWHV   73 (75)
T ss_pred             hheeEEEEEeCCCCCCCcEE
Confidence            4445566667777 666764


No 97 
>cd03476 Rieske_ArOX_small Small subunit of Arsenite oxidase (ArOX) family, Rieske domain; ArOX is a molybdenum/iron protein involved in the detoxification of arsenic, oxidizing it to arsenate. It consists of two subunits, a large subunit similar to members of the DMSO reductase family of molybdenum enzymes and a small subunit with a Rieske-type [2Fe-2S] cluster. The large subunit of ArOX contains the molybdenum site at which the oxidation of arsenite occurs. The small subunit contains a domain homologous to the Rieske domains of the cytochrome bc(1) and cytochrome b6f complexes as well as naphthalene 1,2-dioxygenase. The Rieske domain is a [2Fe-2S] cluster binding domain involved in electron transfer.
Probab=30.61  E-value=95  Score=16.73  Aligned_cols=35  Identities=14%  Similarity=0.166  Sum_probs=17.6

Q ss_pred             EEeeCCC-CeEEeeCCCCCCcccceEEEE--CCEEEEEc
Q psy2980          16 ECYHIAE-NTWEYKSPMKEKRYRPGIAVI--DGKIYVLG   51 (92)
Q Consensus        16 ~~yd~~~-~~W~~~~~~~~~~~~~~~~~~--~~~iy~~G   51 (92)
                      +.||+.+ ..-...|. +.+...+.+.+-  ++.||+.|
T Consensus        78 ~~Fdl~tgG~~~~gPa-~~~L~~ypv~ve~~~g~V~~~~  115 (126)
T cd03476          78 SQFDPARGGQMVSGQA-TQNLPQIVLEYDEASGDIYAVG  115 (126)
T ss_pred             CEEeCCCCCeEEcCCC-CCCCCeEEEEEECCCCEEEEEE
Confidence            4566653 24433332 223333445555  67788876


No 98 
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=30.34  E-value=1.5e+02  Score=18.93  Aligned_cols=60  Identities=17%  Similarity=0.160  Sum_probs=31.7

Q ss_pred             ceEEEeeCCCCeEEeeCCCCCCcccceEEEECC-EEEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980          13 NRVECYHIAENTWEYKSPMKEKRYRPGIAVIDG-KIYVLGGEEGWDGYHDSIECYDVDNDSWEIMS   77 (92)
Q Consensus        13 ~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~   77 (92)
                      ..++.+|..+++-..+...+..... ....-++ .|++.....+    ...++.+|..+...+++.
T Consensus       214 ~~i~v~d~~~g~~~~~~~~~~~~~~-~~~spDg~~l~~~~~~~~----~~~i~~~d~~~~~~~~l~  274 (417)
T TIGR02800       214 PEIYVQDLATGQREKVASFPGMNGA-PAFSPDGSKLAVSLSKDG----NPDIYVMDLDGKQLTRLT  274 (417)
T ss_pred             cEEEEEECCCCCEEEeecCCCCccc-eEECCCCCEEEEEECCCC----CccEEEEECCCCCEEECC
Confidence            4588889887766555443322211 1112233 4554432222    356888888777666553


No 99 
>PF08950 DUF1861:  Protein of unknown function (DUF1861);  InterPro: IPR015045 This hypothetical protein, found in bacteria and in the eukaryote Leishmania, has no known function. ; PDB: 2B4W_A.
Probab=30.01  E-value=1.5e+02  Score=18.80  Aligned_cols=42  Identities=17%  Similarity=0.379  Sum_probs=25.9

Q ss_pred             CcceEEEeeCC-CCeEEeeCCCC-CCcccceEEEECCEEEEEcCc
Q psy2980          11 NTNRVECYHIA-ENTWEYKSPMK-EKRYRPGIAVIDGKIYVLGGE   53 (92)
Q Consensus        11 ~~~~v~~yd~~-~~~W~~~~~~~-~~~~~~~~~~~~~~iy~~GG~   53 (92)
                      ..+.|..|.-. .++|..++.-+ ......-.+.+++.+ ++||.
T Consensus        52 e~S~V~fF~e~g~~~w~~v~~~~~~~LqDPF~t~I~gel-ifGGv   95 (298)
T PF08950_consen   52 EHSEVRFFEETGKDEWTPVEGAPVFQLQDPFVTRIQGEL-IFGGV   95 (298)
T ss_dssp             S--EEEEEEEEETTEEEE-TT---BS-EEEEEEEETTEE-EEEEE
T ss_pred             hccEEEEEEEeCCCeEEECCCcceEEecCcceeeECCEE-EEeeE
Confidence            34567777655 88999998744 455566677888988 45554


No 100
>cd03529 Rieske_NirD Assimilatory nitrite reductase (NirD) family, Rieske domain; Assimilatory nitrate and nitrite reductases convert nitrate through nitrite to ammonium. Members include bacterial and fungal proteins. The bacterial NirD contains a single Rieske domain while fungal proteins have a C-terminal Rieske domain in addition to several other domains. The fungal NirD is involved in nutrient acquisition, functioning at the soil/fungus interface to control nutrient exchange between the fungus and the host plant. The Rieske domain is a [2Fe-2S] cluster binding domain involved in electron transfer. The Rieske [2Fe-2S] cluster is liganded to two histidine and two cysteine residues present in conserved sequences called Rieske motifs. In this family, only a few members contain these residues. Other members may have lost the ability to bind the Rieske [2Fe-2S] cluster.
Probab=29.55  E-value=86  Score=15.88  Aligned_cols=6  Identities=0%  Similarity=0.423  Sum_probs=2.3

Q ss_pred             eeCCCC
Q psy2980          18 YHIAEN   23 (92)
Q Consensus        18 yd~~~~   23 (92)
                      ||+.+.
T Consensus        73 Fdl~tG   78 (103)
T cd03529          73 FSLKTG   78 (103)
T ss_pred             EEcCCC
Confidence            333333


No 101
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absense of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and and other polyol (sugar alcohol) dehydrogenases.
Probab=29.15  E-value=2.3e+02  Score=20.67  Aligned_cols=58  Identities=10%  Similarity=0.349  Sum_probs=33.3

Q ss_pred             ceEEEeeCCCCe--EEeeCC---------------CC--CCcccceEEEECCEEEEEcCcCCCCCccceEEEEeCCCC--
Q psy2980          13 NRVECYHIAENT--WEYKSP---------------MK--EKRYRPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVDND--   71 (92)
Q Consensus        13 ~~v~~yd~~~~~--W~~~~~---------------~~--~~~~~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~--   71 (92)
                      -.+..+|+.|.+  |+.--.               ++  .+..+..+++-++.+|+-|..+      ..+..||.++.  
T Consensus       641 G~l~AiDl~tGk~~W~~~~g~~~~~~p~~~~~~~~~~~g~p~~gG~l~TagglvF~~gt~d------~~l~A~D~~tGk~  714 (764)
T TIGR03074       641 GYMAAIDLKTGKVVWQHPNGTVRDTGPMGIRMPLPIPIGVPTLGGPLATAGGLVFIGATQD------NYLRAYDLSTGKE  714 (764)
T ss_pred             EEEEEEECCCCcEeeeeECCccccccccccccccccccCCcccCCcEEEcCCEEEEEeCCC------CEEEEEECCCCce
Confidence            458889988775  755420               01  1222333555566666533322      47899998664  


Q ss_pred             ceEEc
Q psy2980          72 SWEIM   76 (92)
Q Consensus        72 ~W~~~   76 (92)
                      -|+..
T Consensus       715 lW~~~  719 (764)
T TIGR03074       715 LWKAR  719 (764)
T ss_pred             eeEee
Confidence            48754


No 102
>PF05237 MoeZ_MoeB:  MoeZ/MoeB domain;  InterPro: IPR007901 This putative domain is found in the MoeZ protein and the MoeB protein. The domain has two CXXC motifs that are only partly conserved. MoeZ is necessary for the synthesis of pyridine-2,6-bis(thiocarboxylic acid), a small secreted metabolite that has a high affinity for transition metals, increases iron uptake efficiency by 20% in Pseudomonas stutzeri, has the ability to reduce both soluble and mineral forms of iron, and has antimicrobial activity towards several species of bacteria. MoeB is the molybdopterin synthase activating enzyme in the molybdopterin cofactor biosynthesis pathway. Both these enzymes are members of a superfamily consisting of related but structurally distinct proteins that are members of pathways involved in the transfer of sulphur-containing moieties to metabolites [] and both also contain the UBA/THIF-type NAD/FAD binding fold (IPR000594 from INTERPRO). ; PDB: 1JWA_B 1JW9_B 1JWB_B 1ZKM_D 1ZUD_3 1ZFN_D.
Probab=29.13  E-value=81  Score=15.46  Aligned_cols=19  Identities=0%  Similarity=0.034  Sum_probs=15.6

Q ss_pred             cceEEEeeCCCCeEEeeCC
Q psy2980          12 TNRVECYHIAENTWEYKSP   30 (92)
Q Consensus        12 ~~~v~~yd~~~~~W~~~~~   30 (92)
                      ..++..||..+.+|+.+.-
T Consensus        53 ~~~l~~~D~~~~~~~~i~~   71 (84)
T PF05237_consen   53 SGKLLTIDLLNMSFRSIRI   71 (84)
T ss_dssp             BTEEEEEETTTTEEEEEE-
T ss_pred             hhheeeEECCCCeEEEEec
Confidence            4469999999999998863


No 103
>PF00568 WH1:  WH1 domain;  InterPro: IPR000697 The EVH1 (WH1, RanBP1-WASP) domain is found in multi-domain proteins implicated in a diverse range of signalling, nuclear transport and cytoskeletal events. This domain of around 115 amino acids is present in species ranging from yeast to mammals. Many EVH1-containing proteins associate with actin-based structures and play a role in cytoskeletal organisation. EVH1 domains recognise and bind the proline-rich motif FPPPP with low-affinity, further interactions then form between flanking residues [][].  WASP family proteins contain a EVH1 (WH1) in their N-terminals which bind proline-rich sequences in the WASP interacting protein. Proteins of the RanBP1 family contain a WH1 domain in their N-terminal region, which seems to bind a different sequence motif present in the C-terminal part of RanGTP protein [,].  Tertiary structure of the WH1 domain of the Mena protein revealed structure similarities with the pleckstrin homology (PH) domain. The overall fold consists of a compact parallel beta-sandwich, closed along one edge by a long alpha-helix. A highly conserved cluster of three surface-exposed aromatic side-chains forms the recognition site for the molecules target ligands. [].; GO: 0005515 protein binding; PDB: 1I2H_A 1DDV_A 1DDW_A 1EGX_A 3SYX_A 1TJ6_B 1XOD_B 1EVH_A 1I7A_B 2JP2_A ....
Probab=29.04  E-value=88  Score=16.27  Aligned_cols=16  Identities=13%  Similarity=0.104  Sum_probs=13.4

Q ss_pred             ceEEEeeCCCCe-EEee
Q psy2980          13 NRVECYHIAENT-WEYK   28 (92)
Q Consensus        13 ~~v~~yd~~~~~-W~~~   28 (92)
                      ..|+.+|+.+++ |...
T Consensus        16 A~v~~~~p~~~~~W~~~   32 (111)
T PF00568_consen   16 AQVYQADPDTKRQWSPV   32 (111)
T ss_dssp             EEEEEEETTTSESEEES
T ss_pred             EEEEEEEcCCCCcEeeC
Confidence            467899999888 9986


No 104
>KOG1898|consensus
Probab=28.56  E-value=2.8e+02  Score=21.39  Aligned_cols=31  Identities=10%  Similarity=0.083  Sum_probs=21.7

Q ss_pred             cceEEEEeCCCCceEEcCCCCCCccceeeEE
Q psy2980          60 HDSIECYDVDNDSWEIMSHLPSARSWLGCVP   90 (92)
Q Consensus        60 ~~~~~~y~~~~~~W~~~~~~~~~~~~~~~~~   90 (92)
                      .-.+++|++..|+......-|.+|....+..
T Consensus       997 SV~~~~y~~~~n~l~~fadD~~pR~Vt~~~~ 1027 (1205)
T KOG1898|consen  997 SVHFVRYRREDNQLIVFADDPVPRHVTALEL 1027 (1205)
T ss_pred             eEEEEEEecCCCeEEEEeCCCccceeeEEEE
Confidence            4456778888888888777777776554443


No 105
>PF04096 Nucleoporin2:  Nucleoporin autopeptidase;  InterPro: IPR007230 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This group of autocatalytic serine endopeptidases belong to MEROPS peptidase family S59 (clan SP). The nuclear pore complex protein plays a role in bidirectional transport across the nucleoporin complex in nucleocytoplasmic transport. The mammalian nuclear pore complex (NPC) is comprised of approximately 50 unique proteins, collectively known as nucleoporins. A number of the peptides are synthesised as precursors and undergo self-catalyzed cleavage.  The proteolytic cleavage site of yeast Nup145p has been mapped upstream of an evolutionary conserved serine residue. Cleavage occurs at the same site when a precursor is artificially expressed in Escherichia coli. A hydroxyl-containing residue is critical for the reaction, although a thiol-containing residue offers an acceptable replacement. In vitro kinetics experiments using a purified precursor molecule demonstrate that the cleavage is self-catalyzed and that the catalytic domain lies within the N-terminal moiety. Taken altogether, the data are consistent with a proteolytic mechanism involving an N>O acyl rearrangement and a subsequent ester intermediate uncovered in other self-processing proteins []. Nup98 is a component of the nuclear pore that plays its primary role in the export of RNAs. Nup98 is expressed in two forms, derived from alternate mRNA splicing. Both forms are processed into two peptides through autoproteolysis mediated by the C-terminal domain of hNup98. The three-dimensional structure of the C-terminal domain reveals a novel protein fold, and thus a new class of autocatalytic proteases. The structure further reveals that the suggested nucleoporin RNA binding motif is unlikely to bind to RNA []. The following nucleoporins share an ~150-residue C-terminal domain responsible for NPC targeting [, ]:  Vertebrate Nup98, a component of the nuclear pore that plays its primary role in the export of RNAs.  Yeast Nup100, plays an important role in several nuclear export and import pathways including poly(A)+ RNA and protein transport.  Yeast Nup116, involved in mRNA export and protein transport.  Yeast Nup145, involved in nuclear poly(A)+ RNA and tRNA export.  The NUP C-terminal domains of Nup98 and Nup145 possess peptidase S59 autoproteolytic activity. The autoproteolytic sites of Nup98 and Nup145 each occur immediately C-terminal to the NUP C-terminal domain. Thus, although this domain occurs in the middle of each precursor polypeptide, it winds up at the C-terminal end of the N-terminal cleavage product. Cleavage of the peptide chains are necessary for the proper targeting to the nuclear pore [, ]. The NUP C-terminal domain adopts a predominantly beta-strand structure. The molecule consists of a six-stranded beta-sheet sandwiched against a two-stranded beta-sheet and flanked by alpha-helical regions. The N-terminal helical region consists of two short helices, whereas the stretch on the opposite side of molecule consists of a single, longer helix [, ].; GO: 0006810 transport, 0005643 nuclear pore; PDB: 3PBP_E 2AIV_A 1KO6_A 2Q5X_A 2Q5Y_C 3KEP_A 3KES_A 3NF5_B 3TKN_I.
Probab=28.05  E-value=86  Score=17.36  Aligned_cols=16  Identities=19%  Similarity=0.482  Sum_probs=13.0

Q ss_pred             cceEEEeeCCCCeEEe
Q psy2980          12 TNRVECYHIAENTWEY   27 (92)
Q Consensus        12 ~~~v~~yd~~~~~W~~   27 (92)
                      ..++..||+.+.+|.-
T Consensus       121 ~~~FvsYd~~tG~W~F  136 (141)
T PF04096_consen  121 GAEFVSYDPETGTWVF  136 (141)
T ss_dssp             TSEEEEEETTTTEEEE
T ss_pred             CCEEEEEeCCCcEEEE
Confidence            3577889999999964


No 106
>KOG0294|consensus
Probab=27.57  E-value=1.8e+02  Score=18.91  Aligned_cols=27  Identities=19%  Similarity=0.427  Sum_probs=16.0

Q ss_pred             eEEEECCEEEEEcCcCCCCCccceEEEEeCCCC
Q psy2980          39 GIAVIDGKIYVLGGEEGWDGYHDSIECYDVDND   71 (92)
Q Consensus        39 ~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~   71 (92)
                      .++++++...+-||.+      .+++.||..++
T Consensus        47 tavAVs~~~~aSGssD------etI~IYDm~k~   73 (362)
T KOG0294|consen   47 TALAVSGPYVASGSSD------ETIHIYDMRKR   73 (362)
T ss_pred             eEEEecceeEeccCCC------CcEEEEeccch
Confidence            3444566444445443      37888888765


No 107
>PF10946 DUF2625:  Protein of unknown function DUF2625;  InterPro: IPR021239  Some members in this family of proteins are annotated as ybfG however currently no function is known. 
Probab=27.33  E-value=1.3e+02  Score=18.04  Aligned_cols=30  Identities=13%  Similarity=0.314  Sum_probs=20.4

Q ss_pred             EEEEcCcCCCCCccceEEEEeCCCCceEEcC
Q psy2980          47 IYVLGGEEGWDGYHDSIECYDVDNDSWEIMS   77 (92)
Q Consensus        47 iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~   77 (92)
                      +|.+.|..- ......++.|.|.+-+|+.+.
T Consensus       111 ~FAiNgG~l-g~d~G~v~YfAPDtL~WE~l~  140 (208)
T PF10946_consen  111 FFAINGGGL-GEDPGKVYYFAPDTLEWEPLE  140 (208)
T ss_pred             EEEEccCCc-CCCCCeEEEECCCcccccccC
Confidence            455533221 345778999999999999773


No 108
>PF10282 Lactonase:  Lactonase, 7-bladed beta-propeller;  InterPro: IPR019405  6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types.  This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=27.31  E-value=1.7e+02  Score=18.50  Aligned_cols=76  Identities=14%  Similarity=0.255  Sum_probs=38.4

Q ss_pred             ccccCCCCCCcceEEEeeCCCCeEEeeCCCCCCcccceEEE--ECCEEEEEcCcCCCCCccceEEEEeCCC--CceEEcC
Q psy2980           2 LRGWHASTENTNRVECYHIAENTWEYKSPMKEKRYRPGIAV--IDGKIYVLGGEEGWDGYHDSIECYDVDN--DSWEIMS   77 (92)
Q Consensus         2 ~GG~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~--~~~~iy~~GG~~~~~~~~~~~~~y~~~~--~~W~~~~   77 (92)
                      +|++.......=.++.||..+.++..+......-.-.-++.  -++.||+.....   .....+..|....  .+.+.+.
T Consensus         4 vgsy~~~~~~gI~~~~~d~~~g~l~~~~~~~~~~~Ps~l~~~~~~~~LY~~~e~~---~~~g~v~~~~i~~~~g~L~~~~   80 (345)
T PF10282_consen    4 VGSYTNGKGGGIYVFRFDEETGTLTLVQTVAEGENPSWLAVSPDGRRLYVVNEGS---GDSGGVSSYRIDPDTGTLTLLN   80 (345)
T ss_dssp             EEECCSSSSTEEEEEEEETTTTEEEEEEEEEESSSECCEEE-TTSSEEEEEETTS---STTTEEEEEEEETTTTEEEEEE
T ss_pred             EEcCCCCCCCcEEEEEEcCCCCCceEeeeecCCCCCceEEEEeCCCEEEEEEccc---cCCCCEEEEEECCCcceeEEee
Confidence            56665322222125666678899987764322111111333  356788885432   1244566666544  4666654


Q ss_pred             CCC
Q psy2980          78 HLP   80 (92)
Q Consensus        78 ~~~   80 (92)
                      ..+
T Consensus        81 ~~~   83 (345)
T PF10282_consen   81 SVP   83 (345)
T ss_dssp             EEE
T ss_pred             eec
Confidence            433


No 109
>PF09910 DUF2139:  Uncharacterized protein conserved in archaea (DUF2139);  InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=27.21  E-value=1.8e+02  Score=18.79  Aligned_cols=17  Identities=24%  Similarity=0.552  Sum_probs=14.7

Q ss_pred             CCccceEEEEeCCCCce
Q psy2980          57 DGYHDSIECYDVDNDSW   73 (92)
Q Consensus        57 ~~~~~~~~~y~~~~~~W   73 (92)
                      ..-.+.+.++|..+.+|
T Consensus       169 ~~g~~~i~~~Dli~~~~  185 (339)
T PF09910_consen  169 HKGVSGIHCLDLISGKW  185 (339)
T ss_pred             ccCCceEEEEEccCCeE
Confidence            44578899999999999


No 110
>PF14741 GH114_assoc:  N-terminal glycosyl-hydrolase-114-associated domain
Probab=27.06  E-value=1e+02  Score=16.95  Aligned_cols=20  Identities=15%  Similarity=0.415  Sum_probs=15.9

Q ss_pred             eEEEeeC-CCCeEEeeCCCCC
Q psy2980          14 RVECYHI-AENTWEYKSPMKE   33 (92)
Q Consensus        14 ~v~~yd~-~~~~W~~~~~~~~   33 (92)
                      .+..||. .+++|.++..+..
T Consensus        64 ~vd~~~~~~~~~w~~vGD~t~   84 (129)
T PF14741_consen   64 TVDMFDFDDTSTWVKVGDLTG   84 (129)
T ss_pred             EEEEEEccCCCeeEEeccccc
Confidence            4788888 8889999987643


No 111
>TIGR02377 MocE_fam_FeS Rieske [2Fe-2S] domain protein, MocE subfamily. This model describes a subfamily of the Rieske-like [2Fe-2S] family of ferredoxins that includes MocE, part of the rhizopine (3-O-methyl-scyllo-inosamine) catabolic cluster in Rhizobium. Members of this family are related to, yet distinct from, the small subunit of nitrite reductase [NAD(P)H].
Probab=26.97  E-value=98  Score=15.68  Aligned_cols=13  Identities=8%  Similarity=0.289  Sum_probs=5.2

Q ss_pred             EEEeeCCCCeEEe
Q psy2980          15 VECYHIAENTWEY   27 (92)
Q Consensus        15 v~~yd~~~~~W~~   27 (92)
                      +..|....++..-
T Consensus        26 i~l~r~~~g~~~A   38 (101)
T TIGR02377        26 FAIYRTPDDQYYA   38 (101)
T ss_pred             EEEEEeCCCEEEE
Confidence            4444443334433


No 112
>PRK01029 tolB translocation protein TolB; Provisional
Probab=26.54  E-value=2e+02  Score=19.02  Aligned_cols=59  Identities=8%  Similarity=0.064  Sum_probs=34.1

Q ss_pred             eEEEeeCCCCeEEeeCCCCCCcccceEE-EECC-EEEEEcCcCCCCCccceEEEEeCCCCceEEcCC
Q psy2980          14 RVECYHIAENTWEYKSPMKEKRYRPGIA-VIDG-KIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSH   78 (92)
Q Consensus        14 ~v~~yd~~~~~W~~~~~~~~~~~~~~~~-~~~~-~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~   78 (92)
                      .++.||+.+++...+...+  ....... .-++ .|++.....    ....++.+|....+.+.+..
T Consensus       352 ~I~v~dl~~g~~~~Lt~~~--~~~~~p~wSpDG~~L~f~~~~~----g~~~L~~vdl~~g~~~~Lt~  412 (428)
T PRK01029        352 QICVYDLATGRDYQLTTSP--ENKESPSWAIDSLHLVYSAGNS----NESELYLISLITKKTRKIVI  412 (428)
T ss_pred             EEEEEECCCCCeEEccCCC--CCccceEECCCCCEEEEEECCC----CCceEEEEECCCCCEEEeec
Confidence            5788898888877775432  1111122 2233 454443221    13678899998887777753


No 113
>KOG2111|consensus
Probab=26.38  E-value=1.9e+02  Score=18.77  Aligned_cols=51  Identities=22%  Similarity=0.203  Sum_probs=29.2

Q ss_pred             CccccCCCCCCcceEEEeeCCCCe-EEeeCC---CC--CCcccceEEEECCEEEEEc
Q psy2980           1 MLRGWHASTENTNRVECYHIAENT-WEYKSP---MK--EKRYRPGIAVIDGKIYVLG   51 (92)
Q Consensus         1 v~GG~~~~~~~~~~v~~yd~~~~~-W~~~~~---~~--~~~~~~~~~~~~~~iy~~G   51 (92)
                      ++||-+....+.++|..+|=...+ -.++.-   ..  .-|...-+++..++||++-
T Consensus        63 LVGGg~~pky~pNkviIWDD~k~~~i~el~f~~~I~~V~l~r~riVvvl~~~I~Vyt  119 (346)
T KOG2111|consen   63 LVGGGSRPKYPPNKVIIWDDLKERCIIELSFNSEIKAVKLRRDRIVVVLENKIYVYT  119 (346)
T ss_pred             EecCCCCCCCCCceEEEEecccCcEEEEEEeccceeeEEEcCCeEEEEecCeEEEEE
Confidence            467776666777789988843332 222221   11  1133456777788887763


No 114
>PF13360 PQQ_2:  PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=26.13  E-value=1.4e+02  Score=17.16  Aligned_cols=55  Identities=16%  Similarity=0.327  Sum_probs=33.6

Q ss_pred             eEEEeeCCCCe--EEeeCCCCCCcccce--EEEECCEEEEEcCcCCCCCccceEEEEeCCCC--ceEEcC
Q psy2980          14 RVECYHIAENT--WEYKSPMKEKRYRPG--IAVIDGKIYVLGGEEGWDGYHDSIECYDVDND--SWEIMS   77 (92)
Q Consensus        14 ~v~~yd~~~~~--W~~~~~~~~~~~~~~--~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~--~W~~~~   77 (92)
                      .+.++|+.+.+  |+.-..  .+.....  .+..++.+|+..+       ...++++|+.+.  .|+.-.
T Consensus         4 ~l~~~d~~tG~~~W~~~~~--~~~~~~~~~~~~~~~~v~~~~~-------~~~l~~~d~~tG~~~W~~~~   64 (238)
T PF13360_consen    4 TLSALDPRTGKELWSYDLG--PGIGGPVATAVPDGGRVYVASG-------DGNLYALDAKTGKVLWRFDL   64 (238)
T ss_dssp             EEEEEETTTTEEEEEEECS--SSCSSEEETEEEETTEEEEEET-------TSEEEEEETTTSEEEEEEEC
T ss_pred             EEEEEECCCCCEEEEEECC--CCCCCccceEEEeCCEEEEEcC-------CCEEEEEECCCCCEEEEeec
Confidence            47889987664  766321  1122222  3347888888732       457899998665  476654


No 115
>PF03022 MRJP:  Major royal jelly protein;  InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=25.36  E-value=1.8e+02  Score=18.12  Aligned_cols=62  Identities=15%  Similarity=0.290  Sum_probs=34.7

Q ss_pred             cceEEEeeCCCCeEEeeCCCC----CCccc-ceEEEEC-------CEEEEEcCcCCCCCccceEEEEeCCCC-ceEEcCC
Q psy2980          12 TNRVECYHIAENTWEYKSPMK----EKRYR-PGIAVID-------GKIYVLGGEEGWDGYHDSIECYDVDND-SWEIMSH   78 (92)
Q Consensus        12 ~~~v~~yd~~~~~W~~~~~~~----~~~~~-~~~~~~~-------~~iy~~GG~~~~~~~~~~~~~y~~~~~-~W~~~~~   78 (92)
                      ..++..||+.+++-.+.-.+|    .+... ..+++-.       ..+|+.-      .....+.+||..++ .|+.+..
T Consensus        33 ~pKLv~~Dl~t~~li~~~~~p~~~~~~~s~lndl~VD~~~~~~~~~~aYItD------~~~~glIV~dl~~~~s~Rv~~~  106 (287)
T PF03022_consen   33 PPKLVAFDLKTNQLIRRYPFPPDIAPPDSFLNDLVVDVRDGNCDDGFAYITD------SGGPGLIVYDLATGKSWRVLHN  106 (287)
T ss_dssp             --EEEEEETTTTCEEEEEE--CCCS-TCGGEEEEEEECTTTTS-SEEEEEEE------TTTCEEEEEETTTTEEEEEETC
T ss_pred             CcEEEEEECCCCcEEEEEECChHHcccccccceEEEEccCCCCcceEEEEeC------CCcCcEEEEEccCCcEEEEecC
Confidence            356999999999854332222    12222 2233322       3567763      22348999999886 5777655


Q ss_pred             C
Q psy2980          79 L   79 (92)
Q Consensus        79 ~   79 (92)
                      .
T Consensus       107 ~  107 (287)
T PF03022_consen  107 S  107 (287)
T ss_dssp             G
T ss_pred             C
Confidence            3


No 116
>TIGR02378 nirD_assim_sml nitrite reductase [NAD(P)H], small subunit. This model describes NirD, the small subunit of nitrite reductase [NAD(P)H] (the assimilatory nitrite reductase), which associates with NirB, the large subunit (TIGR02374). In a few bacteria such as Klebsiella pneumoniae and in Fungi, the two regions are fused.
Probab=24.50  E-value=1.1e+02  Score=15.47  Aligned_cols=7  Identities=0%  Similarity=0.468  Sum_probs=2.7

Q ss_pred             EeeCCCC
Q psy2980          17 CYHIAEN   23 (92)
Q Consensus        17 ~yd~~~~   23 (92)
                      .||+.+.
T Consensus        73 ~Fdl~tG   79 (105)
T TIGR02378        73 NFRLEDG   79 (105)
T ss_pred             EEEcCCc
Confidence            3443333


No 117
>PF07453 NUMOD1:  NUMOD1 domain;  InterPro: IPR010896 This helix-turn-helix-containing DNA-binding domain is found associated in homing nucleases [].
Probab=24.06  E-value=68  Score=12.84  Aligned_cols=12  Identities=25%  Similarity=0.343  Sum_probs=8.9

Q ss_pred             eEEEeeCCCCeE
Q psy2980          14 RVECYHIAENTW   25 (92)
Q Consensus        14 ~v~~yd~~~~~W   25 (92)
                      .|++||..++.-
T Consensus         2 ~V~~yd~~~~~i   13 (37)
T PF07453_consen    2 PVYVYDLNTNEI   13 (37)
T ss_pred             eEEEEECCCCeE
Confidence            478888887763


No 118
>PF09816 EAF:  RNA polymerase II transcription elongation factor;  InterPro: IPR019194  This entry represents the N-terminal domain of ELL-associated factor (Eaf) proteins, which act as transcriptional transactivators of ELL and ELL2 RNA Polymerase II (Pol II) transcriptional elongation factors [, , , ]. Eaf proteins form a stable heterodimer complex with ELL proteins to facilitate the binding of RNA polymerase II to activate transcription elongation. ELL and EAF1 are components of Cajal bodies, which have a role in leukemogenesis []. EAF1 also has the capacity to interact with ELL1 and ELL2. The N terminus of approx 120 of EAF1 has a region of high serine, aspartic acid, and glutamic acid residues [, ].
Probab=23.89  E-value=1.2e+02  Score=15.77  Aligned_cols=28  Identities=14%  Similarity=0.317  Sum_probs=19.1

Q ss_pred             EEEEcCcCCCCCccceEEEEeCCCCceE
Q psy2980          47 IYVLGGEEGWDGYHDSIECYDVDNDSWE   74 (92)
Q Consensus        47 iy~~GG~~~~~~~~~~~~~y~~~~~~W~   74 (92)
                      -|+|-|.........-+.+||+.++.+.
T Consensus        65 ~~~f~G~~~~~~~~ecVLifD~~~~~f~   92 (109)
T PF09816_consen   65 TYVFKGSQRPSKEKECVLIFDPETGEFV   92 (109)
T ss_pred             cEEEEeccCCCCCcEEEEEEECCCCEEE
Confidence            4677664333444678999999997654


No 119
>PRK09965 3-phenylpropionate dioxygenase ferredoxin subunit; Provisional
Probab=23.16  E-value=1.2e+02  Score=15.46  Aligned_cols=10  Identities=30%  Similarity=0.747  Sum_probs=4.5

Q ss_pred             EEEECCEEEE
Q psy2980          40 IAVIDGKIYV   49 (92)
Q Consensus        40 ~~~~~~~iy~   49 (92)
                      +.+.++.||+
T Consensus        90 v~v~~g~v~v   99 (106)
T PRK09965         90 VHVEGGDIFI   99 (106)
T ss_pred             EEEECCEEEE
Confidence            3334445544


No 120
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=22.85  E-value=2.3e+02  Score=18.47  Aligned_cols=54  Identities=9%  Similarity=0.087  Sum_probs=34.1

Q ss_pred             CcceEEEeeCCCCeEEeeCCCCCCcccceEEEEC--C-EEEEEcCcCCCCCccceEEEEeCCCCc
Q psy2980          11 NTNRVECYHIAENTWEYKSPMKEKRYRPGIAVID--G-KIYVLGGEEGWDGYHDSIECYDVDNDS   72 (92)
Q Consensus        11 ~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~--~-~iy~~GG~~~~~~~~~~~~~y~~~~~~   72 (92)
                      ..++|+.+|..+.  ..+..++.++...+++.-.  . .+|+..+      ..+++..+|..+.+
T Consensus       277 ~~~~V~ViD~~t~--kvi~~i~vG~~~~~iavS~Dgkp~lyvtn~------~s~~VsViD~~t~k  333 (352)
T TIGR02658       277 ASRFLFVVDAKTG--KRLRKIELGHEIDSINVSQDAKPLLYALST------GDKTLYIFDAETGK  333 (352)
T ss_pred             CCCEEEEEECCCC--eEEEEEeCCCceeeEEECCCCCeEEEEeCC------CCCcEEEEECcCCe
Confidence            3357999998754  4555555555555555543  3 5676643      35678899987763


No 121
>cd03583 NTR_complement_C3 NTR/C345C domain, complement C3 subfamily; The NTR domain found in complement C3 is also known as the C345C domain because it occurs at the C-terminus of complement C3, C4 and C5. Complement C3 plays a pivotal role in the activation of the complement systems, as all pathways (classical, alternative, and lectin) result in the processing of C3 by C3 convertase. The larger fragment, activated C3b, contains the NTR/C345C domain and binds covalently, via a reactive thioester, to cell surface carbohydrates including components of bacterial cell walls and immune aggregates. The smaller cleavage product, C3a, acts independently as a diffusible signal to mediate local inflammatory processes. The structure of C3 shows that the NTR/C345C domain is located in an exposed position relative to the rest of the molecule. The function of the domain in complement C3 is poorly understood.
Probab=22.83  E-value=1.5e+02  Score=16.54  Aligned_cols=36  Identities=19%  Similarity=0.179  Sum_probs=22.4

Q ss_pred             CCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCC
Q psy2980          44 DGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHL   79 (92)
Q Consensus        44 ~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~   79 (92)
                      .++-|++-|.+......+.-+.|=..+++|-+.=|.
T Consensus        86 ~gk~YLIMG~~~~~~~~~~~~~YvL~~~TWvE~wP~  121 (149)
T cd03583          86 EGKDYLIMGLSSDLWRIKDKYSYVIGKDTWIEYWPT  121 (149)
T ss_pred             CCCEEEEEeCCCCccccCCcEEEEeCCCeEEEECCC
Confidence            466777777653222223357888899999876433


No 122
>PF12904 Collagen_bind_2:  Putative collagen-binding domain of a collagenase ;  InterPro: IPR024749 This domain is likely to be the collagen-binding domain of a family of bacterial collagenase enzymes. The structure of one family member, Q8A905 from SWISSPROT, has been characterised. The domain occurs in the C-terminal region of the protein.; PDB: 3KZS_D.
Probab=22.70  E-value=1.3e+02  Score=15.45  Aligned_cols=18  Identities=6%  Similarity=0.143  Sum_probs=13.7

Q ss_pred             ceEEEeeCCCCeEEeeCC
Q psy2980          13 NRVECYHIAENTWEYKSP   30 (92)
Q Consensus        13 ~~v~~yd~~~~~W~~~~~   30 (92)
                      .+++.|||.+++...+..
T Consensus        49 ~~a~WfdPR~G~~~~~g~   66 (93)
T PF12904_consen   49 VKAWWFDPRTGKYTYIGE   66 (93)
T ss_dssp             EEEEEEETTT-BEEEEEE
T ss_pred             eeEEEEcCCCCCEEEeee
Confidence            468999999999887754


No 123
>TIGR03352 VI_chp_3 type VI secretion lipoprotein, VC_A0113 family. Work by Mougous, et al. (2006), describes IAHP-related loci as a type VI secretion system (PubMed:16763151). This protein family is associated with type VI secretion loci, although not treated explicitly by Mougous, et al.
Probab=22.55  E-value=1.4e+02  Score=16.60  Aligned_cols=23  Identities=22%  Similarity=0.634  Sum_probs=16.9

Q ss_pred             EEEEeCCCCceEEcCCCCCCccc
Q psy2980          63 IECYDVDNDSWEIMSHLPSARSW   85 (92)
Q Consensus        63 ~~~y~~~~~~W~~~~~~~~~~~~   85 (92)
                      ...+|+...+|+.+-+.+..+..
T Consensus       113 A~fr~~~~~~Wr~~~~~~~~~~~  135 (146)
T TIGR03352       113 AAYRDIDNAEWRVVYKVPPKGKY  135 (146)
T ss_pred             EEeecCCCCceEEEEecCCCCce
Confidence            34578888999998877766543


No 124
>cd03584 NTR_complement_C4 NTR/C345C domain, complement C4 subfamily; The NTR domain found in complement C4 is also known as the C345C domain because it occurs at the C-terminus of complement C3, C4 and C5. Complement C4 is a key player in the activation of the component classical pathway. C4 is cleaved by activated C1 to yield C4a anaphylatoxin, and the larger fragment C4b, an essential component of the C3- and C5-convertase enzymes. C4b binds covalently to the surface of pathogens through a reactive thioester. The role of the NTR/C345C domain in C4 (C4b) is unclear.
Probab=22.47  E-value=1.6e+02  Score=16.53  Aligned_cols=36  Identities=22%  Similarity=0.370  Sum_probs=21.4

Q ss_pred             CCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCC
Q psy2980          44 DGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHL   79 (92)
Q Consensus        44 ~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~   79 (92)
                      .++-|++-|.+......+.-+.|=..+++|-+.=|.
T Consensus        90 ~gk~YLImG~~~~~~~~~~~~~Y~L~~~tWvE~wP~  125 (153)
T cd03584          90 KGKEYLIMGKDGATSDSNGHMQYLLDSKTWVEKIPS  125 (153)
T ss_pred             CCCEEEEEcCCCCCcCcCCcEEEEeCCCceEEECCC
Confidence            455666666543222223347888899999876433


No 125
>PF15232 DUF4585:  Domain of unknown function (DUF4585)
Probab=22.43  E-value=1.1e+02  Score=15.12  Aligned_cols=15  Identities=0%  Similarity=-0.115  Sum_probs=11.9

Q ss_pred             EEeeCCCCeEEeeCC
Q psy2980          16 ECYHIAENTWEYKSP   30 (92)
Q Consensus        16 ~~yd~~~~~W~~~~~   30 (92)
                      ..||++|++.-++.-
T Consensus        32 ~lfDPETGqYVeV~i   46 (75)
T PF15232_consen   32 TLFDPETGQYVEVLI   46 (75)
T ss_pred             eeecCCCCcEEEEeC
Confidence            568999999887753


No 126
>cd03548 Rieske_RO_Alpha_OMO_CARDO Rieske non-heme iron oxygenase (RO) family, 2-Oxoquinoline 8-monooxygenase (OMO) and Carbazole 1,9a-dioxygenase (CARDO) subfamily, N-terminal Rieske domain of the oxygenase alpha subunit; ROs comprise a large class of aromatic ring-hydroxylating dioxygenases that enable microorganisms to tolerate and utilize aromatic compounds for growth. The oxygenase alpha subunit contains an N-terminal Rieske domain with an [2Fe-2S] cluster and a C-terminal catalytic domain with a mononuclear Fe(II) binding site. The Rieske [2Fe-2S] cluster accepts electrons from a reductase or ferredoxin component and transfers them to the mononuclear iron for catalysis. OMO catalyzes the NADH-dependent oxidation of the N-heterocyclic aromatic compound 2-oxoquinoline to 8-hydroxy-2-oxoquinoline, the second step in the bacterial degradation of quinoline. OMO consists of a reductase component (OMR) and  an oxygenase component (OMO) that together function to shuttle electrons from the
Probab=22.30  E-value=1.5e+02  Score=16.06  Aligned_cols=37  Identities=14%  Similarity=0.132  Sum_probs=22.2

Q ss_pred             EEeeCCCCeEEeeCCCCC-------CcccceEEEECCEEEEEcC
Q psy2980          16 ECYHIAENTWEYKSPMKE-------KRYRPGIAVIDGKIYVLGG   52 (92)
Q Consensus        16 ~~yd~~~~~W~~~~~~~~-------~~~~~~~~~~~~~iy~~GG   52 (92)
                      +.||+.+.+-..++..|.       ....+.+.+.++.||+.-+
T Consensus        81 w~Fdl~tG~~~~~~~~p~~~~~~~~~L~~ypv~~~~g~V~v~~~  124 (136)
T cd03548          81 WTYRLDDGKLVTILANPDDPLIGRTGLKTYPVEEAKGMIFVFVG  124 (136)
T ss_pred             cEEeCCCccEEEcccCCCccccccCCCceEeEEEECCEEEEEeC
Confidence            778988888776543321       1223445556777777644


No 127
>KOG4499|consensus
Probab=21.89  E-value=2.2e+02  Score=17.88  Aligned_cols=39  Identities=10%  Similarity=0.076  Sum_probs=24.0

Q ss_pred             CcceEEEeeCCCCeEEeeCCCCCCcccceEEEEC----CEEEEEc
Q psy2980          11 NTNRVECYHIAENTWEYKSPMKEKRYRPGIAVID----GKIYVLG   51 (92)
Q Consensus        11 ~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~----~~iy~~G   51 (92)
                      +...|.++||.+.+-..--.+|.++..  +|.++    +.+|+..
T Consensus       231 ng~~V~~~dp~tGK~L~eiklPt~qit--sccFgGkn~d~~yvT~  273 (310)
T KOG4499|consen  231 NGGTVQKVDPTTGKILLEIKLPTPQIT--SCCFGGKNLDILYVTT  273 (310)
T ss_pred             cCcEEEEECCCCCcEEEEEEcCCCceE--EEEecCCCccEEEEEe
Confidence            445688999998876544456766654  33333    3356654


No 128
>PRK04043 tolB translocation protein TolB; Provisional
Probab=21.80  E-value=2.5e+02  Score=18.56  Aligned_cols=63  Identities=16%  Similarity=0.215  Sum_probs=34.9

Q ss_pred             ceEEEeeCCCCeEEeeCCCCCCcccceEEEECC-EEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980          13 NRVECYHIAENTWEYKSPMKEKRYRPGIAVIDG-KIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP   80 (92)
Q Consensus        13 ~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~   80 (92)
                      ..++.+|+.+++=..+...+. ........-++ +|.+.-...+    ..+++.+|.....++++.+.+
T Consensus       213 ~~Iyv~dl~tg~~~~lt~~~g-~~~~~~~SPDG~~la~~~~~~g----~~~Iy~~dl~~g~~~~LT~~~  276 (419)
T PRK04043        213 PTLYKYNLYTGKKEKIASSQG-MLVVSDVSKDGSKLLLTMAPKG----QPDIYLYDTNTKTLTQITNYP  276 (419)
T ss_pred             CEEEEEECCCCcEEEEecCCC-cEEeeEECCCCCEEEEEEccCC----CcEEEEEECCCCcEEEcccCC
Confidence            368888888776666654221 11111122233 4544332221    468999998888888875443


No 129
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=21.16  E-value=2.5e+02  Score=18.28  Aligned_cols=33  Identities=18%  Similarity=0.411  Sum_probs=23.8

Q ss_pred             EECCEEEEEcCcCCCCCccceEEEEeCCCCceEEcCCCC
Q psy2980          42 VIDGKIYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHLP   80 (92)
Q Consensus        42 ~~~~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~~   80 (92)
                      -+++++|+.      .....++..+|+.+.+.+.+...|
T Consensus       210 WhdgrLwvl------dsgtGev~~vD~~~G~~e~Va~vp  242 (335)
T TIGR03032       210 WYQGKLWLL------NSGRGELGYVDPQAGKFQPVAFLP  242 (335)
T ss_pred             EeCCeEEEE------ECCCCEEEEEcCCCCcEEEEEECC
Confidence            456888887      334568888898888887776554


No 130
>PRK04922 tolB translocation protein TolB; Provisional
Probab=21.08  E-value=2.5e+02  Score=18.37  Aligned_cols=58  Identities=10%  Similarity=0.097  Sum_probs=28.9

Q ss_pred             eEEEeeCCCCeEEeeCCCCCCcccceEEEEC-CEEEEEcCcCCCCCccceEEEEeCCCCceEEc
Q psy2980          14 RVECYHIAENTWEYKSPMKEKRYRPGIAVID-GKIYVLGGEEGWDGYHDSIECYDVDNDSWEIM   76 (92)
Q Consensus        14 ~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~-~~iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~   76 (92)
                      .++.+|..+++-..+...+.... .....-+ ..|++....++    ..+++.+|..+..-+++
T Consensus       229 ~l~~~dl~~g~~~~l~~~~g~~~-~~~~SpDG~~l~~~~s~~g----~~~Iy~~d~~~g~~~~l  287 (433)
T PRK04922        229 AIYVQDLATGQRELVASFRGING-APSFSPDGRRLALTLSRDG----NPEIYVMDLGSRQLTRL  287 (433)
T ss_pred             EEEEEECCCCCEEEeccCCCCcc-CceECCCCCEEEEEEeCCC----CceEEEEECCCCCeEEC
Confidence            47888887777666654432111 1111122 34443322221    24677777776665554


No 131
>KOG2445|consensus
Probab=21.03  E-value=2.5e+02  Score=18.26  Aligned_cols=22  Identities=14%  Similarity=0.238  Sum_probs=15.8

Q ss_pred             CCcceEEEeeCCCCeEEeeCCC
Q psy2980          10 ENTNRVECYHIAENTWEYKSPM   31 (92)
Q Consensus        10 ~~~~~v~~yd~~~~~W~~~~~~   31 (92)
                      .+...++.|+-..++|..+..+
T Consensus       198 ~~~~~Iye~~e~~rKw~kva~L  219 (361)
T KOG2445|consen  198 LNKVKIYEYNENGRKWLKVAEL  219 (361)
T ss_pred             ccceEEEEecCCcceeeeehhc
Confidence            4455677788778899988654


No 132
>PF15525 DUF4652:  Domain of unknown function (DUF4652)
Probab=20.87  E-value=2e+02  Score=17.12  Aligned_cols=68  Identities=10%  Similarity=0.130  Sum_probs=40.2

Q ss_pred             CCcceEEEeeCCCCeEEeeC--CCC---CCcccceEEEECCE-EEEEcCcCCCCCccceEEEEeCCCCceEEcCCC
Q psy2980          10 ENTNRVECYHIAENTWEYKS--PMK---EKRYRPGIAVIDGK-IYVLGGEEGWDGYHDSIECYDVDNDSWEIMSHL   79 (92)
Q Consensus        10 ~~~~~v~~yd~~~~~W~~~~--~~~---~~~~~~~~~~~~~~-iy~~GG~~~~~~~~~~~~~y~~~~~~W~~~~~~   79 (92)
                      .-.-+++.+|..++.|..+.  +.+   .|.  ...-.-+.. ++++|...+.-+....++.|++.+..-..+-+.
T Consensus        85 EgiGkIYIkn~~~~~~~~L~i~~~~~k~sPK--~i~WiDD~~L~vIIG~a~GTvS~GGnLy~~nl~tg~~~~ly~~  158 (200)
T PF15525_consen   85 EGIGKIYIKNLNNNNWWSLQIDQNEEKYSPK--YIEWIDDNNLAVIIGYAHGTVSKGGNLYKYNLNTGNLTELYEW  158 (200)
T ss_pred             ccceeEEEEecCCCceEEEEecCcccccCCc--eeEEecCCcEEEEEccccceEccCCeEEEEEccCCceeEeeec
Confidence            34456899999888886552  221   233  223333444 455553333233456799999998887776554


No 133
>PF08309 LVIVD:  LVIVD repeat;  InterPro: IPR013211 This repeat is found in bacterial and archaeal cell surface proteins, many of which are hypothetical. The secondary structure corresponding to this repeat is predicted to comprise 4 beta-strands, which may associate to form a beta-propeller. The repeat copy number varies from 2-14. This repeat is sometimes found with the PKD domain IPR000601 from INTERPRO.
Probab=20.42  E-value=95  Score=13.21  Aligned_cols=26  Identities=23%  Similarity=0.392  Sum_probs=16.8

Q ss_pred             cceEEEECCEEEEEcCcCCCCCccceEEEEeCC
Q psy2980          37 RPGIAVIDGKIYVLGGEEGWDGYHDSIECYDVD   69 (92)
Q Consensus        37 ~~~~~~~~~~iy~~GG~~~~~~~~~~~~~y~~~   69 (92)
                      ...+++.++.+|+..+.       ..+..+|..
T Consensus         4 a~~v~v~g~yaYva~~~-------~Gl~IvDIS   29 (42)
T PF08309_consen    4 ARDVAVSGNYAYVADGN-------NGLVIVDIS   29 (42)
T ss_pred             EEEEEEECCEEEEEeCC-------CCEEEEECC
Confidence            45577788899988443       235666663


Done!