Query         psy6079
Match_columns 79
No_of_seqs    123 out of 1392
Neff          10.3
Searched_HMMs 46136
Date          Fri Aug 16 17:05:16 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy6079.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6079hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4441|consensus               99.8   2E-20 4.4E-25  118.8   8.6   75    3-79    399-476 (571)
  2 KOG4441|consensus               99.8 2.2E-20 4.8E-25  118.6   7.6   75    3-79    352-428 (571)
  3 PHA02713 hypothetical protein;  99.8 5.4E-19 1.2E-23  112.2   9.4   75    3-79    323-399 (557)
  4 PF13964 Kelch_6:  Kelch motif   99.8 6.7E-19 1.4E-23   80.2   6.3   49   19-69      1-50  (50)
  5 PHA02713 hypothetical protein;  99.8 2.1E-18 4.5E-23  109.5   8.5   74    4-79    436-513 (557)
  6 PHA02790 Kelch-like protein; P  99.8 1.3E-17 2.8E-22  104.5   9.2   73    3-79    334-408 (480)
  7 PHA02790 Kelch-like protein; P  99.7 1.4E-16 3.1E-21   99.8   9.1   72    3-79    290-363 (480)
  8 PHA03098 kelch-like protein; P  99.7 1.2E-16 2.6E-21  100.8   8.8   75    3-79    314-390 (534)
  9 PLN02153 epithiospecifier prot  99.7 4.6E-16   1E-20   93.9   9.5   75    3-79     53-138 (341)
 10 PF01344 Kelch_1:  Kelch motif;  99.7 8.6E-17 1.9E-21   72.2   4.1   46   19-66      1-47  (47)
 11 PHA03098 kelch-like protein; P  99.7 7.5E-16 1.6E-20   97.3   9.2   75    3-79    361-438 (534)
 12 PLN02153 epithiospecifier prot  99.7 1.1E-15 2.4E-20   92.2   9.4   75    3-79    104-194 (341)
 13 smart00612 Kelch Kelch domain.  99.7 3.6E-16 7.8E-21   69.8   5.3   47   33-79      1-47  (47)
 14 TIGR03548 mutarot_permut cycli  99.7 1.3E-15 2.8E-20   91.3   9.1   74    4-79     92-172 (323)
 15 TIGR03547 muta_rot_YjhT mutatr  99.7 1.6E-15 3.4E-20   91.6   8.8   75    3-79     88-201 (346)
 16 PLN02193 nitrile-specifier pro  99.6 3.4E-15 7.3E-20   93.5   9.7   75    3-79    196-279 (470)
 17 PLN02193 nitrile-specifier pro  99.6 4.2E-15 9.1E-20   93.1   9.7   75    3-79    247-329 (470)
 18 PF13415 Kelch_3:  Galactose ox  99.6   2E-15 4.3E-20   68.4   5.8   47   31-77      1-49  (49)
 19 TIGR03547 muta_rot_YjhT mutatr  99.6 2.1E-14 4.6E-19   86.7   8.9   74    3-78     32-118 (346)
 20 PF07646 Kelch_2:  Kelch motif;  99.6 1.3E-14 2.8E-19   65.7   6.1   46   19-66      1-49  (49)
 21 PRK14131 N-acetylneuraminic ac  99.6 3.9E-14 8.4E-19   86.6   8.8   75    3-79    109-222 (376)
 22 KOG0379|consensus               99.6 4.4E-14 9.4E-19   88.8   8.5   75    3-79    142-225 (482)
 23 TIGR03548 mutarot_permut cycli  99.5 3.4E-13 7.5E-18   80.9   8.7   67   11-79     52-124 (323)
 24 PF13418 Kelch_4:  Galactose ox  99.5 3.9E-14 8.4E-19   64.0   3.4   47   19-66      1-48  (49)
 25 KOG4693|consensus               99.5   2E-13 4.4E-18   80.2   6.8   75    3-79     47-140 (392)
 26 PRK14131 N-acetylneuraminic ac  99.5 5.6E-13 1.2E-17   81.6   8.9   74    3-78     53-139 (376)
 27 KOG4693|consensus               99.4 1.1E-12 2.3E-17   77.3   6.2   73    5-79    221-303 (392)
 28 KOG1230|consensus               99.3 7.8E-12 1.7E-16   76.9   6.9   72    4-77    158-240 (521)
 29 KOG0379|consensus               99.3 2.4E-11 5.1E-16   76.7   7.9   74    3-78    193-275 (482)
 30 PLN02772 guanylate kinase       99.1 4.3E-10 9.3E-15   69.3   7.6   61   17-79     22-86  (398)
 31 PF13854 Kelch_5:  Kelch motif   99.1 3.3E-10 7.2E-15   49.6   5.0   39   17-57      2-42  (42)
 32 KOG1230|consensus               99.1 5.1E-10 1.1E-14   69.1   6.7   73    3-76    101-185 (521)
 33 KOG4152|consensus               99.0 7.1E-10 1.5E-14   70.3   5.5   72    5-78    235-332 (830)
 34 COG3055 Uncharacterized protei  98.8 1.7E-08 3.6E-13   61.4   5.9   67   10-78     70-145 (381)
 35 COG3055 Uncharacterized protei  98.4   4E-07 8.7E-12   55.5   4.3   70    3-74    116-224 (381)
 36 KOG4152|consensus               98.1 5.3E-06 1.1E-10   53.3   4.0   64   10-76     17-89  (830)
 37 PF07250 Glyoxal_oxid_N:  Glyox  98.0 4.2E-05 9.1E-10   45.0   6.3   74    3-78     49-127 (243)
 38 smart00612 Kelch Kelch domain.  97.8 2.7E-05 5.9E-10   33.9   2.8   26    3-28     18-45  (47)
 39 KOG2437|consensus               97.5 9.5E-05 2.1E-09   47.6   3.0   66    9-74    238-318 (723)
 40 PF13415 Kelch_3:  Galactose ox  97.3 0.00035 7.7E-09   31.1   2.6   26    3-28     22-49  (49)
 41 KOG2437|consensus               97.1 0.00093   2E-08   43.3   4.1   58    4-63    350-420 (723)
 42 PF13964 Kelch_6:  Kelch motif   97.0 0.00064 1.4E-08   30.3   1.8   18    3-20     31-48  (50)
 43 PLN02772 guanylate kinase       97.0  0.0046 9.9E-08   39.0   6.0   51    3-56     54-109 (398)
 44 TIGR01640 F_box_assoc_1 F-box   96.4   0.048   1E-06   31.5   7.4   59    3-63     73-135 (230)
 45 PF01344 Kelch_1:  Kelch motif;  96.4   0.001 2.3E-08   29.0   0.3   17    3-19     31-47  (47)
 46 TIGR01640 F_box_assoc_1 F-box   95.9    0.15 3.2E-06   29.5   7.8   62    3-64     17-87  (230)
 47 PF13418 Kelch_4:  Galactose ox  95.8  0.0058 1.2E-07   26.9   1.3   17    3-19     32-48  (49)
 48 PF07646 Kelch_2:  Kelch motif;  95.4   0.013 2.8E-07   25.9   1.6   17    3-19     33-49  (49)
 49 PF12768 Rax2:  Cortical protei  95.2    0.16 3.4E-06   30.9   6.2   60    4-64     20-81  (281)
 50 PF07893 DUF1668:  Protein of u  89.9     3.3 7.2E-05   25.8   8.1   36    4-41     90-127 (342)
 51 PF07250 Glyoxal_oxid_N:  Glyox  88.5    0.95   2E-05   27.0   3.5   34   30-70    179-214 (243)
 52 COG4257 Vgb Streptogramin lyas  85.4     5.8 0.00013   24.7   5.6   55    3-63    257-313 (353)
 53 PF13570 PQQ_3:  PQQ-like domai  81.0     3.1 6.7E-05   17.2   4.0   37   11-56      4-40  (40)
 54 PF08268 FBA_3:  F-box associat  78.1     8.1 0.00018   20.3   6.1   33   31-64      5-37  (129)
 55 PF06058 DCP1:  Dcp1-like decap  69.0     6.8 0.00015   20.9   2.4   16   47-62     29-44  (122)
 56 PRK09511 nirD nitrite reductas  66.1      16 0.00035   18.9   3.4   31    4-36     76-106 (108)
 57 PF07443 HARP:  HepA-related pr  65.5     2.8 6.1E-05   19.3   0.5   12   50-61     21-32  (55)
 58 PF05096 Glu_cyclase_2:  Glutam  63.1      33 0.00071   21.0   6.4   46   31-79     55-100 (264)
 59 PF08450 SGL:  SMP-30/Gluconola  62.9      28 0.00061   20.2   5.0   54    3-65     25-78  (246)
 60 PF14830 Haemocyan_bet_s:  Haem  62.4     7.7 0.00017   20.2   1.8   24   31-54     36-60  (103)
 61 PF00614 PLDc:  Phospholipase D  61.9      10 0.00022   14.8   2.0   16   24-41      8-23  (28)
 62 PF14157 YmzC:  YmzC-like prote  61.5      14 0.00031   17.5   2.4   34   24-62     23-56  (63)
 63 PF03089 RAG2:  Recombination a  55.2      25 0.00054   22.1   3.3   19   21-41    156-174 (337)
 64 PRK11408 hypothetical protein;  54.9      30 0.00065   19.2   3.3   29   34-63     34-63  (145)
 65 PF13806 Rieske_2:  Rieske-like  54.1      20 0.00043   18.4   2.5   31    4-36     73-103 (104)
 66 COG4880 Secreted protein conta  51.4      18 0.00039   24.1   2.4   29   25-64    571-599 (603)
 67 KOG0278|consensus               44.1      41 0.00089   20.8   3.0   30   23-57    227-256 (334)
 68 TIGR00547 lolA periplasmic cha  43.8      55  0.0012   19.0   3.5   24   31-61     77-100 (204)
 69 cd01207 Ena-Vasp Enabled-VASP-  43.7      30 0.00064   18.3   2.2   18   47-64      9-26  (111)
 70 PF03835 Rad4:  Rad4 transgluta  43.6      34 0.00073   18.6   2.5   18   49-66     39-56  (145)
 71 KOG2321|consensus               42.4      53  0.0012   22.7   3.5   38   17-60    131-168 (703)
 72 PF03088 Str_synth:  Strictosid  42.4      47   0.001   16.7   2.9   33    4-37     41-73  (89)
 73 cd05398 NT_ClassII-CCAase Nucl  41.5      21 0.00044   19.4   1.4   11   31-41     16-26  (139)
 74 cd03583 NTR_complement_C3 NTR/  40.9      52  0.0011   18.2   2.9   32   31-62     86-118 (149)
 75 smart00155 PLDc Phospholipase   40.6      26 0.00056   13.2   2.7   19   21-41      5-23  (28)
 76 PF10946 DUF2625:  Protein of u  40.1      57  0.0012   19.3   3.1   18   46-63    123-140 (208)
 77 KOG2315|consensus               39.4 1.3E+02  0.0027   20.8   4.8   49    4-57    298-346 (566)
 78 cd03584 NTR_complement_C4 NTR/  38.8      64  0.0014   17.9   3.1   32   31-62     90-122 (153)
 79 cd01206 Homer Homer type EVH1   38.1      42 0.00091   17.8   2.2   18   47-64     11-29  (111)
 80 PF07893 DUF1668:  Protein of u  37.8   1E+02  0.0023   19.4   6.6   41   31-77     76-116 (342)
 81 PF10902 DUF2693:  Protein of u  36.7      53  0.0012   16.4   2.3   18   45-62     47-64  (83)
 82 PF11958 DUF3472:  Domain of un  36.5      36 0.00077   19.5   1.9   24   47-70     80-103 (184)
 83 PF13540 RCC1_2:  Regulator of   35.7      34 0.00074   13.2   2.5   16   23-39     10-25  (30)
 84 PF06335 DUF1054:  Protein of u  35.5      71  0.0015   18.7   3.0   34    4-39     59-97  (194)
 85 cd00837 EVH1 EVH1 (Enabled, Va  34.5      52  0.0011   16.8   2.2   16   47-62      9-24  (104)
 86 PF06739 SBBP:  Beta-propeller   33.2      33 0.00072   14.1   1.2   11   31-41     23-33  (38)
 87 PF11090 DUF2833:  Protein of u  31.9      77  0.0017   16.0   2.6   13   30-42      7-19  (86)
 88 KOG0294|consensus               31.9 1.4E+02  0.0031   19.1   4.2   23   31-58     52-74  (362)
 89 KOG1324|consensus               31.8      30 0.00066   20.0   1.1    9   32-40    111-119 (190)
 90 KOG2705|consensus               31.6      28 0.00061   22.4   1.1   18   43-60    304-321 (471)
 91 smart00120 HX Hemopexin-like r  30.2      51  0.0011   13.5   4.0   19   31-56      9-27  (45)
 92 COG2380 Uncharacterized protei  29.7      56  0.0012   20.3   2.1   19   47-65     22-40  (327)
 93 PF14741 GH114_assoc:  N-termin  29.5   1E+02  0.0022   16.9   2.8   22   45-66     61-83  (129)
 94 KOG2006|consensus               27.9      92   0.002   23.0   3.0   53    6-66    321-380 (1023)
 95 PF00186 DHFR_1:  Dihydrofolate  27.6      39 0.00085   18.9   1.1    9   32-40     92-100 (161)
 96 COG4334 Uncharacterized protei  27.2      76  0.0016   16.9   2.0   14   25-40     41-54  (131)
 97 KOG0282|consensus               27.1 1.9E+02   0.004   19.7   4.1   12   31-42    310-322 (503)
 98 PLN03215 ascorbic acid mannose  27.0 1.4E+02  0.0031   19.4   3.6   27    9-37    189-215 (373)
 99 KOG2967|consensus               26.9      39 0.00084   21.3   1.1   13   31-43    200-212 (314)
100 PLN03113 DNA ligase 1; Provisi  26.6 1.9E+02  0.0042   20.7   4.3   56    3-63    561-624 (744)
101 COG0262 FolA Dihydrofolate red  26.5      49  0.0011   18.5   1.4    9   32-40     96-104 (167)
102 PF00415 RCC1:  Regulator of ch  26.3      52  0.0011   13.8   1.3    9   31-39      1-9   (51)
103 TIGR02608 delta_60_rpt delta-6  26.1      80  0.0017   14.4   3.9   26   30-55     10-37  (55)
104 PF10049 DUF2283:  Protein of u  26.0      38 0.00083   14.9   0.8   11    1-11      1-11  (50)
105 smart00284 OLF Olfactomedin-li  25.9 1.7E+02  0.0036   18.0   5.2   54    3-60    156-213 (255)
106 cd03474 Rieske_T4moC Toluene-4  25.2 1.1E+02  0.0023   15.5   3.5    6   31-36     92-97  (108)
107 TIGR03352 VI_chp_3 type VI sec  24.9 1.3E+02  0.0029   16.5   3.7   24   49-72    113-136 (146)
108 PF00352 TBP:  Transcription fa  24.6   1E+02  0.0022   15.1   2.3   10   31-40     57-66  (86)
109 PRK10769 folA dihydrofolate re  24.5      55  0.0012   18.3   1.3    9   32-40     89-97  (159)
110 COG4946 Uncharacterized protei  24.2 1.2E+02  0.0026   20.7   2.9   17   47-63    287-303 (668)
111 PF00045 Hemopexin:  Hemopexin;  23.9      74  0.0016   13.2   4.1   18   31-55      9-26  (45)
112 PF15494 SRCR_2:  Scavenger rec  23.1 1.2E+02  0.0025   15.2   2.3   17   47-63      4-20  (98)
113 smart00461 WH1 WASP homology r  23.1 1.2E+02  0.0026   15.6   2.4   17   47-63     11-27  (106)
114 PF12118 SprA-related:  SprA-re  22.8      52  0.0011   20.5   1.1   10   30-39    182-191 (294)
115 PF06462 Hyd_WA:  Propeller;  I  22.7      68  0.0015   12.7   1.1   12    4-15     20-31  (32)
116 cd03475 Rieske_SoxF_SoxL SoxF   22.2 1.7E+02  0.0037   16.8   4.4    7   32-38    154-160 (171)
117 PRK09965 3-phenylpropionate di  22.1 1.3E+02  0.0027   15.3   3.9    6   31-36     94-99  (106)
118 PF04096 Nucleoporin2:  Nucleop  21.9 1.2E+02  0.0026   16.6   2.3   16   47-62    122-137 (141)
119 cd00094 HX Hemopexin-like repe  21.8 1.7E+02  0.0037   16.6   4.4   22   31-59    110-131 (194)
120 COG2131 ComEB Deoxycytidylate   21.7 1.6E+02  0.0035   16.8   2.8   25   18-44     25-49  (164)
121 PF01743 PolyA_pol:  Poly A pol  21.5      38 0.00082   17.9   0.3    7   34-40      1-7   (126)
122 PF12091 DUF3567:  Protein of u  21.3 1.3E+02  0.0029   15.2   3.0   39    1-39      1-45  (85)
123 PF00568 WH1:  WH1 domain;  Int  20.7 1.4E+02  0.0031   15.3   3.9   15   48-62     17-32  (111)
124 PF04122 CW_binding_2:  Putativ  20.4      79  0.0017   15.5   1.3   11   31-41     73-83  (92)
125 KOG2445|consensus               20.4 2.3E+02  0.0051   18.2   3.5   36   31-66    183-220 (361)

No 1  
>KOG4441|consensus
Probab=99.84  E-value=2e-20  Score=118.77  Aligned_cols=75  Identities=43%  Similarity=0.641  Sum_probs=69.6

Q ss_pred             ceecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCCCCC-CCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079           3 SSICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWS-YLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS   79 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~-~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~   79 (79)
                      .+|||.+++|+.+++|+  |.+++++++  +++||++||.++.. .++++++|||.+++|+.+++|+.+|..++++++++
T Consensus       399 E~YDp~~~~W~~va~m~~~r~~~gv~~~--~g~iYi~GG~~~~~~~l~sve~YDP~t~~W~~~~~M~~~R~~~g~a~~~~  476 (571)
T KOG4441|consen  399 ECYDPVTNKWTPVAPMLTRRSGHGVAVL--GGKLYIIGGGDGSSNCLNSVECYDPETNTWTLIAPMNTRRSGFGVAVLNG  476 (571)
T ss_pred             EEecCCCCcccccCCCCcceeeeEEEEE--CCEEEEEcCcCCCccccceEEEEcCCCCceeecCCcccccccceEEEECC
Confidence            48999999999999885  788999999  99999999988776 79999999999999999999999999999999875


No 2  
>KOG4441|consensus
Probab=99.83  E-value=2.2e-20  Score=118.60  Aligned_cols=75  Identities=43%  Similarity=0.685  Sum_probs=70.5

Q ss_pred             ceecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079           3 SSICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS   79 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~   79 (79)
                      -+||+++++|+.+++|.  |..++++++  ++.||++||+++...++++|+|||.+++|+.+++|+.+|..+++++++|
T Consensus       352 e~YD~~~~~W~~~a~M~~~R~~~~v~~l--~g~iYavGG~dg~~~l~svE~YDp~~~~W~~va~m~~~r~~~gv~~~~g  428 (571)
T KOG4441|consen  352 ERYDPRTNQWTPVAPMNTKRSDFGVAVL--DGKLYAVGGFDGEKSLNSVECYDPVTNKWTPVAPMLTRRSGHGVAVLGG  428 (571)
T ss_pred             EEecCCCCceeccCCccCccccceeEEE--CCEEEEEeccccccccccEEEecCCCCcccccCCCCcceeeeEEEEECC
Confidence            37999999999999886  889999999  9999999999998889999999999999999999999999999999875


No 3  
>PHA02713 hypothetical protein; Provisional
Probab=99.80  E-value=5.4e-19  Score=112.18  Aligned_cols=75  Identities=21%  Similarity=0.356  Sum_probs=67.9

Q ss_pred             ceecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079           3 SSICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS   79 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~   79 (79)
                      .+||+.+++|..+++|+  |..++++++  +++||++||.++....+++++|||.+++|..+++|+.+|..+++++++|
T Consensus       323 ~~Yd~~~n~W~~~~~m~~~R~~~~~~~~--~g~IYviGG~~~~~~~~sve~Ydp~~~~W~~~~~mp~~r~~~~~~~~~g  399 (557)
T PHA02713        323 YKINIENKIHVELPPMIKNRCRFSLAVI--DDTIYAIGGQNGTNVERTIECYTMGDDKWKMLPDMPIALSSYGMCVLDQ  399 (557)
T ss_pred             EEEECCCCeEeeCCCCcchhhceeEEEE--CCEEEEECCcCCCCCCceEEEEECCCCeEEECCCCCcccccccEEEECC
Confidence            47999999999999876  788889999  9999999998766667899999999999999999999999999988775


No 4  
>PF13964 Kelch_6:  Kelch motif
Probab=99.79  E-value=6.7e-19  Score=80.16  Aligned_cols=49  Identities=35%  Similarity=0.667  Sum_probs=45.2

Q ss_pred             ccceeeeEEECCCCeEEEEcCCCC-CCCCCeEEEEeCCCCcEEeccCCCCCC
Q psy6079          19 SYMCLGVGVLEPEGPLYAVGGHDG-WSYLNTTERFDPRLRQWSFIAPMLSPR   69 (79)
Q Consensus        19 ~r~~~~~~~~~~~~~iyv~GG~~~-~~~~~~~~~yd~~~~~W~~~~~~~~~r   69 (79)
                      ||.+++++++  +++|||+||..+ ....+++++||+++++|+.+++||.+|
T Consensus         1 pR~~~s~v~~--~~~iyv~GG~~~~~~~~~~v~~yd~~t~~W~~~~~mp~pR   50 (50)
T PF13964_consen    1 PRYGHSAVVV--GGKIYVFGGYDNSGKYSNDVERYDPETNTWEQLPPMPTPR   50 (50)
T ss_pred             CCccCEEEEE--CCEEEEECCCCCCCCccccEEEEcCCCCcEEECCCCCCCC
Confidence            5888999999  999999999887 566899999999999999999999987


No 5  
>PHA02713 hypothetical protein; Provisional
Probab=99.77  E-value=2.1e-18  Score=109.52  Aligned_cols=74  Identities=15%  Similarity=0.113  Sum_probs=65.2

Q ss_pred             eecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCCCCCC-CCeEEEEeCCC-CcEEeccCCCCCCCceeEEEECC
Q psy6079           4 SICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWSY-LNTTERFDPRL-RQWSFIAPMLSPRSTAGVAVLNS   79 (79)
Q Consensus         4 ~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~~-~~~~~~yd~~~-~~W~~~~~~~~~r~~~~~~~~~~   79 (79)
                      +|||.+++|+.+++|+  |..++++++  +++||++||.++... .+.+++|||.+ ++|+.+++|+.+|..+++++++|
T Consensus       436 ~YDP~td~W~~v~~m~~~r~~~~~~~~--~~~IYv~GG~~~~~~~~~~ve~Ydp~~~~~W~~~~~m~~~r~~~~~~~~~~  513 (557)
T PHA02713        436 RYDTVNNIWETLPNFWTGTIRPGVVSH--KDDIYVVCDIKDEKNVKTCIFRYNTNTYNGWELITTTESRLSALHTILHDN  513 (557)
T ss_pred             EECCCCCeEeecCCCCcccccCcEEEE--CCEEEEEeCCCCCCccceeEEEecCCCCCCeeEccccCcccccceeEEECC
Confidence            6999999999999885  788888999  999999999875433 35689999999 89999999999999999999875


No 6  
>PHA02790 Kelch-like protein; Provisional
Probab=99.75  E-value=1.3e-17  Score=104.47  Aligned_cols=73  Identities=22%  Similarity=0.296  Sum_probs=64.5

Q ss_pred             ceecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079           3 SSICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS   79 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~   79 (79)
                      .+||+.+++|+.+++|+  |..++++++  +++||++||..+.  .+++++|||++++|..+++|+.+|..+++++++|
T Consensus       334 e~ydp~~n~W~~~~~l~~~r~~~~~~~~--~g~IYviGG~~~~--~~~ve~ydp~~~~W~~~~~m~~~r~~~~~~~~~~  408 (480)
T PHA02790        334 ERWFHGDAAWVNMPSLLKPRCNPAVASI--NNVIYVIGGHSET--DTTTEYLLPNHDQWQFGPSTYYPHYKSCALVFGR  408 (480)
T ss_pred             EEEECCCCeEEECCCCCCCCcccEEEEE--CCEEEEecCcCCC--CccEEEEeCCCCEEEeCCCCCCccccceEEEECC
Confidence            47999999999999886  778888889  9999999997643  3679999999999999999999999998888775


No 7  
>PHA02790 Kelch-like protein; Provisional
Probab=99.70  E-value=1.4e-16  Score=99.78  Aligned_cols=72  Identities=22%  Similarity=0.356  Sum_probs=63.4

Q ss_pred             ceecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079           3 SSICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS   79 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~   79 (79)
                      .+|||.+++|..+++|+  |..++++++  +++||++||.++.   +++++||+.+++|..+++|+.+|..+++++++|
T Consensus       290 ~~Ydp~~~~W~~~~~m~~~r~~~~~v~~--~~~iYviGG~~~~---~sve~ydp~~n~W~~~~~l~~~r~~~~~~~~~g  363 (480)
T PHA02790        290 IAVNYISNNWIPIPPMNSPRLYASGVPA--NNKLYVVGGLPNP---TSVERWFHGDAAWVNMPSLLKPRCNPAVASINN  363 (480)
T ss_pred             EEEECCCCEEEECCCCCchhhcceEEEE--CCEEEEECCcCCC---CceEEEECCCCeEEECCCCCCCCcccEEEEECC
Confidence            36999999999999775  777888888  9999999997532   579999999999999999999999999988765


No 8  
>PHA03098 kelch-like protein; Provisional
Probab=99.70  E-value=1.2e-16  Score=100.81  Aligned_cols=75  Identities=27%  Similarity=0.361  Sum_probs=66.9

Q ss_pred             ceecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079           3 SSICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS   79 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~   79 (79)
                      .+||+.+++|..+++|+  |.+++++++  +++||++||..+....+++++||+.+++|..+++||.+|..+++++++|
T Consensus       314 ~~yd~~~~~W~~~~~~~~~R~~~~~~~~--~~~lyv~GG~~~~~~~~~v~~yd~~~~~W~~~~~lp~~r~~~~~~~~~~  390 (534)
T PHA03098        314 VSYDTKTKSWNKVPELIYPRKNPGVTVF--NNRIYVIGGIYNSISLNTVESWKPGESKWREEPPLIFPRYNPCVVNVNN  390 (534)
T ss_pred             EEEeCCCCeeeECCCCCcccccceEEEE--CCEEEEEeCCCCCEecceEEEEcCCCCceeeCCCcCcCCccceEEEECC
Confidence            47999999999999664  788899999  9999999998765567899999999999999999999999999887764


No 9  
>PLN02153 epithiospecifier protein
Probab=99.69  E-value=4.6e-16  Score=93.86  Aligned_cols=75  Identities=15%  Similarity=0.225  Sum_probs=63.2

Q ss_pred             ceecCCCCCeeecCCC---ccc---eeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCC-----CCCCCc
Q psy6079           3 SSICLHLVFWFPLLLL---SYM---CLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPM-----LSPRST   71 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~---~r~---~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~-----~~~r~~   71 (79)
                      .+||+.+++|+.++++   ||.   +++++++  +++||+|||..+...++++++||+++++|+.+++|     |.+|..
T Consensus        53 ~~yd~~~~~W~~~~~~~~~p~~~~~~~~~~~~--~~~iyv~GG~~~~~~~~~v~~yd~~t~~W~~~~~~~~~~~p~~R~~  130 (341)
T PLN02153         53 YVFDFNTHTWSIAPANGDVPRISCLGVRMVAV--GTKLYIFGGRDEKREFSDFYSYDTVKNEWTFLTKLDEEGGPEARTF  130 (341)
T ss_pred             EEEECCCCEEEEcCccCCCCCCccCceEEEEE--CCEEEEECCCCCCCccCcEEEEECCCCEEEEeccCCCCCCCCCcee
Confidence            4799999999998754   342   5777888  99999999987666678999999999999999887     789999


Q ss_pred             eeEEEECC
Q psy6079          72 AGVAVLNS   79 (79)
Q Consensus        72 ~~~~~~~~   79 (79)
                      |+++++++
T Consensus       131 ~~~~~~~~  138 (341)
T PLN02153        131 HSMASDEN  138 (341)
T ss_pred             eEEEEECC
Confidence            99887754


No 10 
>PF01344 Kelch_1:  Kelch motif;  InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=99.68  E-value=8.6e-17  Score=72.18  Aligned_cols=46  Identities=28%  Similarity=0.508  Sum_probs=41.6

Q ss_pred             ccceeeeEEECCCCeEEEEcCCCC-CCCCCeEEEEeCCCCcEEeccCCC
Q psy6079          19 SYMCLGVGVLEPEGPLYAVGGHDG-WSYLNTTERFDPRLRQWSFIAPML   66 (79)
Q Consensus        19 ~r~~~~~~~~~~~~~iyv~GG~~~-~~~~~~~~~yd~~~~~W~~~~~~~   66 (79)
                      ||.+++++++  +++|||+||..+ ....+++++||+++++|+.+++||
T Consensus         1 pR~~~~~~~~--~~~iyv~GG~~~~~~~~~~v~~yd~~~~~W~~~~~mp   47 (47)
T PF01344_consen    1 PRSGHAAVVV--GNKIYVIGGYDGNNQPTNSVEVYDPETNTWEELPPMP   47 (47)
T ss_dssp             -BBSEEEEEE--TTEEEEEEEBESTSSBEEEEEEEETTTTEEEEEEEES
T ss_pred             CCccCEEEEE--CCEEEEEeeecccCceeeeEEEEeCCCCEEEEcCCCC
Confidence            5888999999  999999999887 566999999999999999999886


No 11 
>PHA03098 kelch-like protein; Provisional
Probab=99.67  E-value=7.5e-16  Score=97.26  Aligned_cols=75  Identities=19%  Similarity=0.135  Sum_probs=64.5

Q ss_pred             ceecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCCCC-CCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079           3 SSICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGW-SYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS   79 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~-~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~   79 (79)
                      .+||+.+++|+.++++|  |..++++++  +++||++||.... ..++++++||+.+++|..+++||.+|..++++++++
T Consensus       361 ~~yd~~~~~W~~~~~lp~~r~~~~~~~~--~~~iYv~GG~~~~~~~~~~v~~yd~~t~~W~~~~~~p~~r~~~~~~~~~~  438 (534)
T PHA03098        361 ESWKPGESKWREEPPLIFPRYNPCVVNV--NNLIYVIGGISKNDELLKTVECFSLNTNKWSKGSPLPISHYGGCAIYHDG  438 (534)
T ss_pred             EEEcCCCCceeeCCCcCcCCccceEEEE--CCEEEEECCcCCCCcccceEEEEeCCCCeeeecCCCCccccCceEEEECC
Confidence            47999999999998664  788888888  9999999996533 346899999999999999999999999998887664


No 12 
>PLN02153 epithiospecifier protein
Probab=99.67  E-value=1.1e-15  Score=92.24  Aligned_cols=75  Identities=21%  Similarity=0.200  Sum_probs=61.6

Q ss_pred             ceecCCCCCeeecCCC-------ccceeeeEEECCCCeEEEEcCCCCCC------CCCeEEEEeCCCCcEEeccCCC---
Q psy6079           3 SSICLHLVFWFPLLLL-------SYMCLGVGVLEPEGPLYAVGGHDGWS------YLNTTERFDPRLRQWSFIAPML---   66 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~-------~r~~~~~~~~~~~~~iyv~GG~~~~~------~~~~~~~yd~~~~~W~~~~~~~---   66 (79)
                      .+||+.+++|+.++++       +|..+++++.  +++|||+||.....      .++++++||+++++|..++++.   
T Consensus       104 ~~yd~~t~~W~~~~~~~~~~~p~~R~~~~~~~~--~~~iyv~GG~~~~~~~~~~~~~~~v~~yd~~~~~W~~l~~~~~~~  181 (341)
T PLN02153        104 YSYDTVKNEWTFLTKLDEEGGPEARTFHSMASD--ENHVYVFGGVSKGGLMKTPERFRTIEAYNIADGKWVQLPDPGENF  181 (341)
T ss_pred             EEEECCCCEEEEeccCCCCCCCCCceeeEEEEE--CCEEEEECCccCCCccCCCcccceEEEEECCCCeEeeCCCCCCCC
Confidence            4799999999988743       5888888888  99999999976321      3578999999999999998764   


Q ss_pred             CCCCceeEEEECC
Q psy6079          67 SPRSTAGVAVLNS   79 (79)
Q Consensus        67 ~~r~~~~~~~~~~   79 (79)
                      .+|..++++++++
T Consensus       182 ~~r~~~~~~~~~~  194 (341)
T PLN02153        182 EKRGGAGFAVVQG  194 (341)
T ss_pred             CCCCcceEEEECC
Confidence            7888888877654


No 13 
>smart00612 Kelch Kelch domain.
Probab=99.66  E-value=3.6e-16  Score=69.78  Aligned_cols=47  Identities=43%  Similarity=0.800  Sum_probs=42.7

Q ss_pred             eEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079          33 PLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS   79 (79)
Q Consensus        33 ~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~   79 (79)
                      +||++||..+....+++++||+.+++|+.+++|+.+|..+++++++|
T Consensus         1 ~iyv~GG~~~~~~~~~v~~yd~~~~~W~~~~~~~~~r~~~~~~~~~g   47 (47)
T smart00612        1 KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING   47 (47)
T ss_pred             CEEEEeCCCCCceeeeEEEECCCCCeEccCCCCCCccccceEEEeCC
Confidence            48999998765568899999999999999999999999999999875


No 14 
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=99.66  E-value=1.3e-15  Score=91.32  Aligned_cols=74  Identities=19%  Similarity=0.179  Sum_probs=61.3

Q ss_pred             eecCCCCCe----eecCCCc--cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCC-CCCCceeEEE
Q psy6079           4 SICLHLVFW----FPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPML-SPRSTAGVAV   76 (79)
Q Consensus         4 ~~~~~~~~W----~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~-~~r~~~~~~~   76 (79)
                      +||+.+++|    +.++++|  |..++++++  +++||++||.......+++++||+.+++|+.+++|+ .+|..+++++
T Consensus        92 ~~d~~~~~w~~~~~~~~~lp~~~~~~~~~~~--~~~iYv~GG~~~~~~~~~v~~yd~~~~~W~~~~~~p~~~r~~~~~~~  169 (323)
T TIGR03548        92 RITLDESKEELICETIGNLPFTFENGSACYK--DGTLYVGGGNRNGKPSNKSYLFNLETQEWFELPDFPGEPRVQPVCVK  169 (323)
T ss_pred             EEEEcCCceeeeeeEcCCCCcCccCceEEEE--CCEEEEEeCcCCCccCceEEEEcCCCCCeeECCCCCCCCCCcceEEE
Confidence            688888887    6777665  677888888  999999999765555789999999999999999988 4788888777


Q ss_pred             ECC
Q psy6079          77 LNS   79 (79)
Q Consensus        77 ~~~   79 (79)
                      +++
T Consensus       170 ~~~  172 (323)
T TIGR03548       170 LQN  172 (323)
T ss_pred             ECC
Confidence            654


No 15 
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=99.65  E-value=1.6e-15  Score=91.61  Aligned_cols=75  Identities=13%  Similarity=0.118  Sum_probs=58.9

Q ss_pred             ceecCCCCCeeecC-CCcc--ceeeeE-EECCCCeEEEEcCCCCCC----------------------------------
Q psy6079           3 SSICLHLVFWFPLL-LLSY--MCLGVG-VLEPEGPLYAVGGHDGWS----------------------------------   44 (79)
Q Consensus         3 ~~~~~~~~~W~~~~-~~~r--~~~~~~-~~~~~~~iyv~GG~~~~~----------------------------------   44 (79)
                      .+||+.+++|+.++ ++||  .+++++ ++  +++||++||.+...                                  
T Consensus        88 ~~Yd~~~~~W~~~~~~~p~~~~~~~~~~~~--~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  165 (346)
T TIGR03547        88 YRYDPKKNSWQKLDTRSPVGLLGASGFSLH--NGQAYFTGGVNKNIFDGYFADLSAADKDSEPKDKLIAAYFSQPPEDYF  165 (346)
T ss_pred             EEEECCCCEEecCCCCCCCcccceeEEEEe--CCEEEEEcCcChHHHHHHHhhHhhcCccchhhhhhHHHHhCCChhHcC
Confidence            47999999999997 5565  344444 56  99999999976321                                  


Q ss_pred             CCCeEEEEeCCCCcEEeccCCCC-CCCceeEEEECC
Q psy6079          45 YLNTTERFDPRLRQWSFIAPMLS-PRSTAGVAVLNS   79 (79)
Q Consensus        45 ~~~~~~~yd~~~~~W~~~~~~~~-~r~~~~~~~~~~   79 (79)
                      ..+++++||+.+++|..+++||. +|..++++++++
T Consensus       166 ~~~~v~~YDp~t~~W~~~~~~p~~~r~~~~~~~~~~  201 (346)
T TIGR03547       166 WNKNVLSYDPSTNQWRNLGENPFLGTAGSAIVHKGN  201 (346)
T ss_pred             ccceEEEEECCCCceeECccCCCCcCCCceEEEECC
Confidence            13789999999999999999986 688888877764


No 16 
>PLN02193 nitrile-specifier protein
Probab=99.64  E-value=3.4e-15  Score=93.48  Aligned_cols=75  Identities=21%  Similarity=0.344  Sum_probs=63.2

Q ss_pred             ceecCCCCCeeecCC---Cc---cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCC---CCCCCcee
Q psy6079           3 SSICLHLVFWFPLLL---LS---YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPM---LSPRSTAG   73 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~---~~---r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~---~~~r~~~~   73 (79)
                      .+||+.+++|+.+++   +|   |..++++++  +++|||+||.+....++++++||+.+++|..++++   |.+|..|+
T Consensus       196 ~~yD~~~~~W~~~~~~g~~P~~~~~~~~~v~~--~~~lYvfGG~~~~~~~ndv~~yD~~t~~W~~l~~~~~~P~~R~~h~  273 (470)
T PLN02193        196 YVFDLETRTWSISPATGDVPHLSCLGVRMVSI--GSTLYVFGGRDASRQYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHS  273 (470)
T ss_pred             EEEECCCCEEEeCCCCCCCCCCcccceEEEEE--CCEEEEECCCCCCCCCccEEEEECCCCEEEEcCcCCCCCCCccceE
Confidence            479999999998763   33   356778888  99999999988766689999999999999999888   78999999


Q ss_pred             EEEECC
Q psy6079          74 VAVLNS   79 (79)
Q Consensus        74 ~~~~~~   79 (79)
                      +++.++
T Consensus       274 ~~~~~~  279 (470)
T PLN02193        274 MAADEE  279 (470)
T ss_pred             EEEECC
Confidence            887653


No 17 
>PLN02193 nitrile-specifier protein
Probab=99.63  E-value=4.2e-15  Score=93.06  Aligned_cols=75  Identities=20%  Similarity=0.254  Sum_probs=64.2

Q ss_pred             ceecCCCCCeeecCCC-----ccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccC---CCCCCCceeE
Q psy6079           3 SSICLHLVFWFPLLLL-----SYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAP---MLSPRSTAGV   74 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~-----~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~---~~~~r~~~~~   74 (79)
                      .+||+.+++|+.++++     +|..+++++.  +++|||+||.+....++++++||+.+++|..++.   ++.+|..+++
T Consensus       247 ~~yD~~t~~W~~l~~~~~~P~~R~~h~~~~~--~~~iYv~GG~~~~~~~~~~~~yd~~t~~W~~~~~~~~~~~~R~~~~~  324 (470)
T PLN02193        247 YSFDTTTNEWKLLTPVEEGPTPRSFHSMAAD--EENVYVFGGVSATARLKTLDSYNIVDKKWFHCSTPGDSFSIRGGAGL  324 (470)
T ss_pred             EEEECCCCEEEEcCcCCCCCCCccceEEEEE--CCEEEEECCCCCCCCcceEEEEECCCCEEEeCCCCCCCCCCCCCcEE
Confidence            4799999999998754     4888888888  9999999998876668899999999999999864   5678988888


Q ss_pred             EEECC
Q psy6079          75 AVLNS   79 (79)
Q Consensus        75 ~~~~~   79 (79)
                      +++++
T Consensus       325 ~~~~g  329 (470)
T PLN02193        325 EVVQG  329 (470)
T ss_pred             EEECC
Confidence            87653


No 18 
>PF13415 Kelch_3:  Galactose oxidase, central domain
Probab=99.63  E-value=2e-15  Score=68.37  Aligned_cols=47  Identities=34%  Similarity=0.527  Sum_probs=42.4

Q ss_pred             CCeEEEEcCCC--CCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEE
Q psy6079          31 EGPLYAVGGHD--GWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVL   77 (79)
Q Consensus        31 ~~~iyv~GG~~--~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~   77 (79)
                      +++|||+||..  ....++++++||+.+++|++++++|.+|..|+++++
T Consensus         1 g~~~~vfGG~~~~~~~~~nd~~~~~~~~~~W~~~~~~P~~R~~h~~~~i   49 (49)
T PF13415_consen    1 GNKLYVFGGYDDDGGTRLNDVWVFDLDTNTWTRIGDLPPPRSGHTATVI   49 (49)
T ss_pred             CCEEEEECCcCCCCCCEecCEEEEECCCCEEEECCCCCCCccceEEEEC
Confidence            47899999988  455689999999999999999999999999999874


No 19 
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=99.58  E-value=2.1e-14  Score=86.70  Aligned_cols=74  Identities=18%  Similarity=0.222  Sum_probs=57.2

Q ss_pred             ceecC--CCCCeeecCCCc---cceeeeEEECCCCeEEEEcCCCCCC------CCCeEEEEeCCCCcEEecc-CCCCCCC
Q psy6079           3 SSICL--HLVFWFPLLLLS---YMCLGVGVLEPEGPLYAVGGHDGWS------YLNTTERFDPRLRQWSFIA-PMLSPRS   70 (79)
Q Consensus         3 ~~~~~--~~~~W~~~~~~~---r~~~~~~~~~~~~~iyv~GG~~~~~------~~~~~~~yd~~~~~W~~~~-~~~~~r~   70 (79)
                      .+||+  .+++|..+++||   |..++++++  +++|||+||.....      .++++++||+.+++|+.++ +++.+|.
T Consensus        32 ~~~d~~~~~~~W~~l~~~p~~~R~~~~~~~~--~~~iYv~GG~~~~~~~~~~~~~~~v~~Yd~~~~~W~~~~~~~p~~~~  109 (346)
T TIGR03547        32 YKLDLKKPSKGWQKIADFPGGPRNQAVAAAI--DGKLYVFGGIGKANSEGSPQVFDDVYRYDPKKNSWQKLDTRSPVGLL  109 (346)
T ss_pred             EEEECCCCCCCceECCCCCCCCcccceEEEE--CCEEEEEeCCCCCCCCCcceecccEEEEECCCCEEecCCCCCCCccc
Confidence            35665  578999999764   677888889  99999999975321      3678999999999999997 4566676


Q ss_pred             ceeEE-EEC
Q psy6079          71 TAGVA-VLN   78 (79)
Q Consensus        71 ~~~~~-~~~   78 (79)
                      .++++ +++
T Consensus       110 ~~~~~~~~~  118 (346)
T TIGR03547       110 GASGFSLHN  118 (346)
T ss_pred             ceeEEEEeC
Confidence            66655 444


No 20 
>PF07646 Kelch_2:  Kelch motif;  InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=99.58  E-value=1.3e-14  Score=65.66  Aligned_cols=46  Identities=33%  Similarity=0.488  Sum_probs=40.4

Q ss_pred             ccceeeeEEECCCCeEEEEcCC---CCCCCCCeEEEEeCCCCcEEeccCCC
Q psy6079          19 SYMCLGVGVLEPEGPLYAVGGH---DGWSYLNTTERFDPRLRQWSFIAPML   66 (79)
Q Consensus        19 ~r~~~~~~~~~~~~~iyv~GG~---~~~~~~~~~~~yd~~~~~W~~~~~~~   66 (79)
                      ||.+++++++  +++||++||+   ......+++++||+++++|+.+++|+
T Consensus         1 ~r~~hs~~~~--~~kiyv~GG~~~~~~~~~~~~v~~~d~~t~~W~~~~~~g   49 (49)
T PF07646_consen    1 PRYGHSAVVL--DGKIYVFGGYGTDNGGSSSNDVWVFDTETNQWTELSPMG   49 (49)
T ss_pred             CccceEEEEE--CCEEEEECCcccCCCCcccceeEEEECCCCEEeecCCCC
Confidence            5788999999  9999999999   34455899999999999999998874


No 21 
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=99.56  E-value=3.9e-14  Score=86.63  Aligned_cols=75  Identities=15%  Similarity=0.118  Sum_probs=57.8

Q ss_pred             ceecCCCCCeeecCC-Ccc--ceeeeEE-ECCCCeEEEEcCCCCC----------------------------------C
Q psy6079           3 SSICLHLVFWFPLLL-LSY--MCLGVGV-LEPEGPLYAVGGHDGW----------------------------------S   44 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~-~~r--~~~~~~~-~~~~~~iyv~GG~~~~----------------------------------~   44 (79)
                      .+||+.+++|+.+++ +|+  .++++++ .  +++||++||....                                  .
T Consensus       109 ~~YD~~~n~W~~~~~~~p~~~~~~~~~~~~--~~~IYv~GG~~~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~  186 (376)
T PRK14131        109 YKYDPKTNSWQKLDTRSPVGLAGHVAVSLH--NGKAYITGGVNKNIFDGYFEDLAAAGKDKTPKDKINDAYFDKKPEDYF  186 (376)
T ss_pred             EEEeCCCCEEEeCCCCCCCcccceEEEEee--CCEEEEECCCCHHHHHHHHhhhhhcccchhhhhhhHHHHhcCChhhcC
Confidence            369999999999984 354  4455555 6  9999999997531                                  0


Q ss_pred             CCCeEEEEeCCCCcEEeccCCCC-CCCceeEEEECC
Q psy6079          45 YLNTTERFDPRLRQWSFIAPMLS-PRSTAGVAVLNS   79 (79)
Q Consensus        45 ~~~~~~~yd~~~~~W~~~~~~~~-~r~~~~~~~~~~   79 (79)
                      ..+++++||+.+++|..+++||. +|..++++++++
T Consensus       187 ~~~~v~~YD~~t~~W~~~~~~p~~~~~~~a~v~~~~  222 (376)
T PRK14131        187 FNKEVLSYDPSTNQWKNAGESPFLGTAGSAVVIKGN  222 (376)
T ss_pred             cCceEEEEECCCCeeeECCcCCCCCCCcceEEEECC
Confidence            14689999999999999999996 677777777654


No 22 
>KOG0379|consensus
Probab=99.55  E-value=4.4e-14  Score=88.82  Aligned_cols=75  Identities=24%  Similarity=0.272  Sum_probs=65.9

Q ss_pred             ceecCCCCCeeecC-----CCccceeeeEEECCCCeEEEEcCCCCCC-CCCeEEEEeCCCCcEEec---cCCCCCCCcee
Q psy6079           3 SSICLHLVFWFPLL-----LLSYMCLGVGVLEPEGPLYAVGGHDGWS-YLNTTERFDPRLRQWSFI---APMLSPRSTAG   73 (79)
Q Consensus         3 ~~~~~~~~~W~~~~-----~~~r~~~~~~~~~~~~~iyv~GG~~~~~-~~~~~~~yd~~~~~W~~~---~~~~~~r~~~~   73 (79)
                      |.||+.+.+|..+.     |.+|.+|++++.  ++++||+||..... ..+++++||+++.+|.++   ...|.||.+|+
T Consensus       142 ~~~d~~t~~W~~l~~~~~~P~~r~~Hs~~~~--g~~l~vfGG~~~~~~~~ndl~i~d~~~~~W~~~~~~g~~P~pR~gH~  219 (482)
T KOG0379|consen  142 HSLDLSTRTWSLLSPTGDPPPPRAGHSATVV--GTKLVVFGGIGGTGDSLNDLHIYDLETSTWSELDTQGEAPSPRYGHA  219 (482)
T ss_pred             EeccCCCCcEEEecCcCCCCCCcccceEEEE--CCEEEEECCccCcccceeeeeeeccccccceecccCCCCCCCCCCce
Confidence            67999999999986     556999999999  99999999987665 689999999999999988   45678999999


Q ss_pred             EEEECC
Q psy6079          74 VAVLNS   79 (79)
Q Consensus        74 ~~~~~~   79 (79)
                      +++.++
T Consensus       220 ~~~~~~  225 (482)
T KOG0379|consen  220 MVVVGN  225 (482)
T ss_pred             EEEECC
Confidence            988753


No 23 
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=99.49  E-value=3.4e-13  Score=80.94  Aligned_cols=67  Identities=12%  Similarity=0.079  Sum_probs=57.4

Q ss_pred             CeeecCCCc--cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcE----EeccCCCCCCCceeEEEECC
Q psy6079          11 FWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQW----SFIAPMLSPRSTAGVAVLNS   79 (79)
Q Consensus        11 ~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W----~~~~~~~~~r~~~~~~~~~~   79 (79)
                      +|..+++||  |..++++++  +++||++||..+...++++++||+.+++|    ..+++||.+|..++++++++
T Consensus        52 ~W~~~~~lp~~r~~~~~~~~--~~~lyviGG~~~~~~~~~v~~~d~~~~~w~~~~~~~~~lp~~~~~~~~~~~~~  124 (323)
T TIGR03548        52 KWVKDGQLPYEAAYGASVSV--ENGIYYIGGSNSSERFSSVYRITLDESKEELICETIGNLPFTFENGSACYKDG  124 (323)
T ss_pred             eEEEcccCCccccceEEEEE--CCEEEEEcCCCCCCCceeEEEEEEcCCceeeeeeEcCCCCcCccCceEEEECC
Confidence            799988665  666777888  99999999987766688999999999998    78899999999999888764


No 24 
>PF13418 Kelch_4:  Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=99.49  E-value=3.9e-14  Score=64.01  Aligned_cols=47  Identities=23%  Similarity=0.342  Sum_probs=30.1

Q ss_pred             ccceeeeEEECCCCeEEEEcCCCCC-CCCCeEEEEeCCCCcEEeccCCC
Q psy6079          19 SYMCLGVGVLEPEGPLYAVGGHDGW-SYLNTTERFDPRLRQWSFIAPML   66 (79)
Q Consensus        19 ~r~~~~~~~~~~~~~iyv~GG~~~~-~~~~~~~~yd~~~~~W~~~~~~~   66 (79)
                      ||.+|+++.+. +++|||+||.+.. ..++++++||+++++|++++++|
T Consensus         1 pR~~h~~~~~~-~~~i~v~GG~~~~~~~~~d~~~~d~~~~~W~~~~~~P   48 (49)
T PF13418_consen    1 PRYGHSAVSIG-DNSIYVFGGRDSSGSPLNDLWIFDIETNTWTRLPSMP   48 (49)
T ss_dssp             --BS-EEEEE--TTEEEEE--EEE-TEE---EEEEETTTTEEEE--SS-
T ss_pred             CcceEEEEEEe-CCeEEEECCCCCCCcccCCEEEEECCCCEEEECCCCC
Confidence            57888888872 4799999998876 46899999999999999998887


No 25 
>KOG4693|consensus
Probab=99.48  E-value=2e-13  Score=80.22  Aligned_cols=75  Identities=16%  Similarity=0.158  Sum_probs=63.5

Q ss_pred             ceecCCCCCeeecCC---------------CccceeeeEEECCCCeEEEEcCCCCC-CCCCeEEEEeCCCCcEEec---c
Q psy6079           3 SSICLHLVFWFPLLL---------------LSYMCLGVGVLEPEGPLYAVGGHDGW-SYLNTTERFDPRLRQWSFI---A   63 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~---------------~~r~~~~~~~~~~~~~iyv~GG~~~~-~~~~~~~~yd~~~~~W~~~---~   63 (79)
                      |.+|.++-+|++++|               ..|++++++.+  ++++|+.||+++. ...+.+++|||++++|.+.   .
T Consensus        47 H~lNa~~~RWtk~pp~~~ka~i~~~yp~VPyqRYGHtvV~y--~d~~yvWGGRND~egaCN~Ly~fDp~t~~W~~p~v~G  124 (392)
T KOG4693|consen   47 HVLNAENYRWTKMPPGITKATIESPYPAVPYQRYGHTVVEY--QDKAYVWGGRNDDEGACNLLYEFDPETNVWKKPEVEG  124 (392)
T ss_pred             EEeeccceeEEecCcccccccccCCCCccchhhcCceEEEE--cceEEEEcCccCcccccceeeeeccccccccccceee
Confidence            567888889999873               23788999988  9999999999874 4578999999999999976   5


Q ss_pred             CCCCCCCceeEEEECC
Q psy6079          64 PMLSPRSTAGVAVLNS   79 (79)
Q Consensus        64 ~~~~~r~~~~~~~~~~   79 (79)
                      ..|.+|.+|++|++++
T Consensus       125 ~vPgaRDGHsAcV~gn  140 (392)
T KOG4693|consen  125 FVPGARDGHSACVWGN  140 (392)
T ss_pred             ecCCccCCceeeEECc
Confidence            6788999999999864


No 26 
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=99.48  E-value=5.6e-13  Score=81.57  Aligned_cols=74  Identities=15%  Similarity=0.240  Sum_probs=56.9

Q ss_pred             ceecCC--CCCeeecCCC---ccceeeeEEECCCCeEEEEcCCCC-C-----CCCCeEEEEeCCCCcEEeccC-CCCCCC
Q psy6079           3 SSICLH--LVFWFPLLLL---SYMCLGVGVLEPEGPLYAVGGHDG-W-----SYLNTTERFDPRLRQWSFIAP-MLSPRS   70 (79)
Q Consensus         3 ~~~~~~--~~~W~~~~~~---~r~~~~~~~~~~~~~iyv~GG~~~-~-----~~~~~~~~yd~~~~~W~~~~~-~~~~r~   70 (79)
                      .+||+.  +++|..++++   +|.+++++++  +++|||+||... .     ..++++++||+.+++|..+++ ++.+|.
T Consensus        53 ~~~d~~~~~~~W~~l~~~p~~~r~~~~~v~~--~~~IYV~GG~~~~~~~~~~~~~~~v~~YD~~~n~W~~~~~~~p~~~~  130 (376)
T PRK14131         53 YKLDLNAPSKGWTKIAAFPGGPREQAVAAFI--DGKLYVFGGIGKTNSEGSPQVFDDVYKYDPKTNSWQKLDTRSPVGLA  130 (376)
T ss_pred             EEEECCCCCCCeEECCcCCCCCcccceEEEE--CCEEEEEcCCCCCCCCCceeEcccEEEEeCCCCEEEeCCCCCCCccc
Confidence            357775  4789999865   4778888888  999999999764 1     236789999999999999985 466666


Q ss_pred             ceeEEE-EC
Q psy6079          71 TAGVAV-LN   78 (79)
Q Consensus        71 ~~~~~~-~~   78 (79)
                      .+++++ .+
T Consensus       131 ~~~~~~~~~  139 (376)
T PRK14131        131 GHVAVSLHN  139 (376)
T ss_pred             ceEEEEeeC
Confidence            666655 44


No 27 
>KOG4693|consensus
Probab=99.40  E-value=1.1e-12  Score=77.25  Aligned_cols=73  Identities=25%  Similarity=0.293  Sum_probs=60.2

Q ss_pred             ecCCCCCeeecC-----CCccceeeeEEECCCCeEEEEcCCCCC--CCCCeEEEEeCCCCcEEecc---CCCCCCCceeE
Q psy6079           5 ICLHLVFWFPLL-----LLSYMCLGVGVLEPEGPLYAVGGHDGW--SYLNTTERFDPRLRQWSFIA---PMLSPRSTAGV   74 (79)
Q Consensus         5 ~~~~~~~W~~~~-----~~~r~~~~~~~~~~~~~iyv~GG~~~~--~~~~~~~~yd~~~~~W~~~~---~~~~~r~~~~~   74 (79)
                      +|..|+.|..-+     |-.|..++..+.  ++++|+|||+++.  ..++++++|||++..|..|.   .-|.+|...++
T Consensus       221 ld~~T~aW~r~p~~~~~P~GRRSHS~fvY--ng~~Y~FGGYng~ln~HfndLy~FdP~t~~W~~I~~~Gk~P~aRRRqC~  298 (392)
T KOG4693|consen  221 LDLATGAWTRTPENTMKPGGRRSHSTFVY--NGKMYMFGGYNGTLNVHFNDLYCFDPKTSMWSVISVRGKYPSARRRQCS  298 (392)
T ss_pred             EeccccccccCCCCCcCCCcccccceEEE--cceEEEecccchhhhhhhcceeecccccchheeeeccCCCCCcccceeE
Confidence            578899998876     223778889999  9999999999873  35899999999999999984   34788888888


Q ss_pred             EEECC
Q psy6079          75 AVLNS   79 (79)
Q Consensus        75 ~~~~~   79 (79)
                      ++.++
T Consensus       299 ~v~g~  303 (392)
T KOG4693|consen  299 VVSGG  303 (392)
T ss_pred             EEECC
Confidence            87654


No 28 
>KOG1230|consensus
Probab=99.33  E-value=7.8e-12  Score=76.87  Aligned_cols=72  Identities=24%  Similarity=0.257  Sum_probs=61.4

Q ss_pred             eecCCCCCeeecC----CCccceeeeEEECCCCeEEEEcCCCCCC----CCCeEEEEeCCCCcEEeccC---CCCCCCce
Q psy6079           4 SICLHLVFWFPLL----LLSYMCLGVGVLEPEGPLYAVGGHDGWS----YLNTTERFDPRLRQWSFIAP---MLSPRSTA   72 (79)
Q Consensus         4 ~~~~~~~~W~~~~----~~~r~~~~~~~~~~~~~iyv~GG~~~~~----~~~~~~~yd~~~~~W~~~~~---~~~~r~~~   72 (79)
                      .+|..+.+|+++.    |.||.+|.+++.  .++|++|||+.+..    +++++++||+.+-+|.++.+   -|.+|+++
T Consensus       158 ~fd~~trkweql~~~g~PS~RSGHRMvaw--K~~lilFGGFhd~nr~y~YyNDvy~FdLdtykW~Klepsga~PtpRSGc  235 (521)
T KOG1230|consen  158 LFDLKTRKWEQLEFGGGPSPRSGHRMVAW--KRQLILFGGFHDSNRDYIYYNDVYAFDLDTYKWSKLEPSGAGPTPRSGC  235 (521)
T ss_pred             eeeeccchheeeccCCCCCCCccceeEEe--eeeEEEEcceecCCCceEEeeeeEEEeccceeeeeccCCCCCCCCCCcc
Confidence            4788999999997    888999999999  99999999976533    48999999999999999844   27899988


Q ss_pred             eEEEE
Q psy6079          73 GVAVL   77 (79)
Q Consensus        73 ~~~~~   77 (79)
                      ...+.
T Consensus       236 q~~vt  240 (521)
T KOG1230|consen  236 QFSVT  240 (521)
T ss_pred             eEEec
Confidence            77654


No 29 
>KOG0379|consensus
Probab=99.29  E-value=2.4e-11  Score=76.65  Aligned_cols=74  Identities=26%  Similarity=0.336  Sum_probs=64.3

Q ss_pred             ceecCCCCCeeecC-----CCccceeeeEEECCCCeEEEEcCCC-CCCCCCeEEEEeCCCCcEEecc---CCCCCCCcee
Q psy6079           3 SSICLHLVFWFPLL-----LLSYMCLGVGVLEPEGPLYAVGGHD-GWSYLNTTERFDPRLRQWSFIA---PMLSPRSTAG   73 (79)
Q Consensus         3 ~~~~~~~~~W~~~~-----~~~r~~~~~~~~~~~~~iyv~GG~~-~~~~~~~~~~yd~~~~~W~~~~---~~~~~r~~~~   73 (79)
                      +.||+++.+|.++.     |.||.+|+++++  ++++++|||.. +...+++++++|..+.+|..++   ..|.+|..|.
T Consensus       193 ~i~d~~~~~W~~~~~~g~~P~pR~gH~~~~~--~~~~~v~gG~~~~~~~l~D~~~ldl~~~~W~~~~~~g~~p~~R~~h~  270 (482)
T KOG0379|consen  193 HIYDLETSTWSELDTQGEAPSPRYGHAMVVV--GNKLLVFGGGDDGDVYLNDVHILDLSTWEWKLLPTGGDLPSPRSGHS  270 (482)
T ss_pred             eeeccccccceecccCCCCCCCCCCceEEEE--CCeEEEEeccccCCceecceEeeecccceeeeccccCCCCCCcceee
Confidence            67999999999986     668999999999  99999999877 5667999999999999999664   5679999998


Q ss_pred             EEEEC
Q psy6079          74 VAVLN   78 (79)
Q Consensus        74 ~~~~~   78 (79)
                      .++.+
T Consensus       271 ~~~~~  275 (482)
T KOG0379|consen  271 LTVSG  275 (482)
T ss_pred             eEEEC
Confidence            87543


No 30 
>PLN02772 guanylate kinase
Probab=99.13  E-value=4.3e-10  Score=69.29  Aligned_cols=61  Identities=15%  Similarity=0.117  Sum_probs=52.3

Q ss_pred             CCccceeeeEEECCCCeEEEEcCCCCCC-CCCeEEEEeCCCCcEEec---cCCCCCCCceeEEEECC
Q psy6079          17 LLSYMCLGVGVLEPEGPLYAVGGHDGWS-YLNTTERFDPRLRQWSFI---APMLSPRSTAGVAVLNS   79 (79)
Q Consensus        17 ~~~r~~~~~~~~~~~~~iyv~GG~~~~~-~~~~~~~yd~~~~~W~~~---~~~~~~r~~~~~~~~~~   79 (79)
                      .-|+..++++++  ++++||+||.++.. ..+.+++||..+++|...   ...|.+|.+|+++++++
T Consensus        22 ~~~~~~~tav~i--gdk~yv~GG~~d~~~~~~~v~i~D~~t~~W~~P~V~G~~P~~r~GhSa~v~~~   86 (398)
T PLN02772         22 VKPKNRETSVTI--GDKTYVIGGNHEGNTLSIGVQILDKITNNWVSPIVLGTGPKPCKGYSAVVLNK   86 (398)
T ss_pred             CCCCCcceeEEE--CCEEEEEcccCCCccccceEEEEECCCCcEecccccCCCCCCCCcceEEEECC
Confidence            346888999999  99999999988754 578999999999999975   46689999999999864


No 31 
>PF13854 Kelch_5:  Kelch motif
Probab=99.12  E-value=3.3e-10  Score=49.56  Aligned_cols=39  Identities=23%  Similarity=0.169  Sum_probs=33.4

Q ss_pred             CCccceeeeEEECCCCeEEEEcCCCC--CCCCCeEEEEeCCCC
Q psy6079          17 LLSYMCLGVGVLEPEGPLYAVGGHDG--WSYLNTTERFDPRLR   57 (79)
Q Consensus        17 ~~~r~~~~~~~~~~~~~iyv~GG~~~--~~~~~~~~~yd~~~~   57 (79)
                      |.+|.+|+++++  +++||++||.+.  ...++++++||..+.
T Consensus         2 P~~R~~hs~~~~--~~~iyi~GG~~~~~~~~~~d~~~l~l~sf   42 (42)
T PF13854_consen    2 PSPRYGHSAVVV--GNNIYIFGGYSGNNNSYSNDLYVLDLPSF   42 (42)
T ss_pred             CCCccceEEEEE--CCEEEEEcCccCCCCCEECcEEEEECCCC
Confidence            567999999999  999999999883  556899999998763


No 32 
>KOG1230|consensus
Probab=99.09  E-value=5.1e-10  Score=69.08  Aligned_cols=73  Identities=18%  Similarity=0.153  Sum_probs=58.6

Q ss_pred             ceecCCCCCeeecC----CCccceeeeEEECCCCeEEEEcCCCCC------CCCCeEEEEeCCCCcEEec--cCCCCCCC
Q psy6079           3 SSICLHLVFWFPLL----LLSYMCLGVGVLEPEGPLYAVGGHDGW------SYLNTTERFDPRLRQWSFI--APMLSPRS   70 (79)
Q Consensus         3 ~~~~~~~~~W~~~~----~~~r~~~~~~~~~~~~~iyv~GG~~~~------~~~~~~~~yd~~~~~W~~~--~~~~~~r~   70 (79)
                      .+||.++++|..+.    |.||..|.+++++ .+.+|++||--..      ....++|.||..+++|+++  +.-|.+|+
T Consensus       101 y~Yn~k~~eWkk~~spn~P~pRsshq~va~~-s~~l~~fGGEfaSPnq~qF~HYkD~W~fd~~trkweql~~~g~PS~RS  179 (521)
T KOG1230|consen  101 YSYNTKKNEWKKVVSPNAPPPRSSHQAVAVP-SNILWLFGGEFASPNQEQFHHYKDLWLFDLKTRKWEQLEFGGGPSPRS  179 (521)
T ss_pred             eEEeccccceeEeccCCCcCCCccceeEEec-cCeEEEeccccCCcchhhhhhhhheeeeeeccchheeeccCCCCCCCc
Confidence            37899999999985    6779888888874 4799999993211      1378999999999999998  45689999


Q ss_pred             ceeEEE
Q psy6079          71 TAGVAV   76 (79)
Q Consensus        71 ~~~~~~   76 (79)
                      +|-.++
T Consensus       180 GHRMva  185 (521)
T KOG1230|consen  180 GHRMVA  185 (521)
T ss_pred             cceeEE
Confidence            997665


No 33 
>KOG4152|consensus
Probab=99.03  E-value=7.1e-10  Score=70.27  Aligned_cols=72  Identities=11%  Similarity=0.170  Sum_probs=58.4

Q ss_pred             ecCCCCCeeecC-----CCccceeeeEEECCCCeEEEEcCCC------------CC--CCCCeEEEEeCCCCcEEecc--
Q psy6079           5 ICLHLVFWFPLL-----LLSYMCLGVGVLEPEGPLYAVGGHD------------GW--SYLNTTERFDPRLRQWSFIA--   63 (79)
Q Consensus         5 ~~~~~~~W~~~~-----~~~r~~~~~~~~~~~~~iyv~GG~~------------~~--~~~~~~~~yd~~~~~W~~~~--   63 (79)
                      .|++|..|.+..     ||||.-|++.++  ++++|||||.-            +.  ...+++.++|..+..|+.+-  
T Consensus       235 Ldl~Tl~W~kp~~~G~~PlPRSLHsa~~I--GnKMyvfGGWVPl~~~~~~~~~hekEWkCTssl~clNldt~~W~tl~~d  312 (830)
T KOG4152|consen  235 LDLDTLTWNKPSLSGVAPLPRSLHSATTI--GNKMYVFGGWVPLVMDDVKVATHEKEWKCTSSLACLNLDTMAWETLLMD  312 (830)
T ss_pred             EecceeecccccccCCCCCCcccccceee--cceeEEecceeeeeccccccccccceeeeccceeeeeecchheeeeeec
Confidence            477888898764     999999999999  99999999941            11  12678889999999999872  


Q ss_pred             -----CCCCCCCceeEEEEC
Q psy6079          64 -----PMLSPRSTAGVAVLN   78 (79)
Q Consensus        64 -----~~~~~r~~~~~~~~~   78 (79)
                           ..|.+|.+|++++++
T Consensus       313 ~~ed~tiPR~RAGHCAvAig  332 (830)
T KOG4152|consen  313 TLEDNTIPRARAGHCAVAIG  332 (830)
T ss_pred             cccccccccccccceeEEec
Confidence                 368899999998875


No 34 
>COG3055 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=98.82  E-value=1.7e-08  Score=61.37  Aligned_cols=67  Identities=22%  Similarity=0.296  Sum_probs=50.4

Q ss_pred             CCeeecCCCc---cceeeeEEECCCCeEEEEcCCCCCC-----CCCeEEEEeCCCCcEEeccCC-CCCCCceeEEEEC
Q psy6079          10 VFWFPLLLLS---YMCLGVGVLEPEGPLYAVGGHDGWS-----YLNTTERFDPRLRQWSFIAPM-LSPRSTAGVAVLN   78 (79)
Q Consensus        10 ~~W~~~~~~~---r~~~~~~~~~~~~~iyv~GG~~~~~-----~~~~~~~yd~~~~~W~~~~~~-~~~r~~~~~~~~~   78 (79)
                      ..|+.++..|   |.+...+++  +++||+|||.....     .++++++|||.+++|+++... |+.-..+.++.++
T Consensus        70 k~W~~~a~FpG~~rnqa~~a~~--~~kLyvFgG~Gk~~~~~~~~~nd~Y~y~p~~nsW~kl~t~sP~gl~G~~~~~~~  145 (381)
T COG3055          70 KGWTKIADFPGGARNQAVAAVI--GGKLYVFGGYGKSVSSSPQVFNDAYRYDPSTNSWHKLDTRSPTGLVGASTFSLN  145 (381)
T ss_pred             CCceEcccCCCcccccchheee--CCeEEEeeccccCCCCCceEeeeeEEecCCCChhheeccccccccccceeEecC
Confidence            5899998544   899999999  99999999965322     278999999999999998543 3343444555544


No 35 
>COG3055 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=98.43  E-value=4e-07  Score=55.54  Aligned_cols=70  Identities=17%  Similarity=0.195  Sum_probs=50.7

Q ss_pred             ceecCCCCCeeecC-CCcc--ceeeeEEECCCC-eEEEEcCCCCC----------------------------------C
Q psy6079           3 SSICLHLVFWFPLL-LLSY--MCLGVGVLEPEG-PLYAVGGHDGW----------------------------------S   44 (79)
Q Consensus         3 ~~~~~~~~~W~~~~-~~~r--~~~~~~~~~~~~-~iyv~GG~~~~----------------------------------~   44 (79)
                      ++|||.+|+|.++. ..|+  .++.++.+  ++ +||++||++..                                  .
T Consensus       116 Y~y~p~~nsW~kl~t~sP~gl~G~~~~~~--~~~~i~f~GGvn~~if~~yf~dv~~a~~d~~~~~~i~~~yf~~~~~dy~  193 (381)
T COG3055         116 YRYDPSTNSWHKLDTRSPTGLVGASTFSL--NGTKIYFFGGVNQNIFNGYFEDVGAAGKDKEAVDKIIAHYFDKKAEDYF  193 (381)
T ss_pred             EEecCCCChhheeccccccccccceeEec--CCceEEEEccccHHhhhhhHHhhhhhcccHHHHHHHHHHHhCCCHHHhc
Confidence            58999999999998 3344  56666666  77 99999996321                                  0


Q ss_pred             CCCeEEEEeCCCCcEEeccCCC-CCCCceeE
Q psy6079          45 YLNTTERFDPRLRQWSFIAPML-SPRSTAGV   74 (79)
Q Consensus        45 ~~~~~~~yd~~~~~W~~~~~~~-~~r~~~~~   74 (79)
                      ....+..|+|.+++|+.....| .+++..+.
T Consensus       194 ~n~ev~sy~p~~n~W~~~G~~pf~~~aGsa~  224 (381)
T COG3055         194 FNKEVLSYDPSTNQWRNLGENPFYGNAGSAV  224 (381)
T ss_pred             ccccccccccccchhhhcCcCcccCccCcce
Confidence            1356778999999999997555 56665433


No 36 
>KOG4152|consensus
Probab=98.09  E-value=5.3e-06  Score=53.30  Aligned_cols=64  Identities=11%  Similarity=0.135  Sum_probs=49.5

Q ss_pred             CCeeecC------CCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEec---cCCCCCCCceeEEE
Q psy6079          10 VFWFPLL------LLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFI---APMLSPRSTAGVAV   76 (79)
Q Consensus        10 ~~W~~~~------~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~---~~~~~~r~~~~~~~   76 (79)
                      -+|.++.      |-||.+|.++++  .+.|.||||-++. ...++++||..+++|..-   .++|.+...|+++.
T Consensus        17 ~rWrrV~~~tGPvPrpRHGHRAVai--kELiviFGGGNEG-iiDELHvYNTatnqWf~PavrGDiPpgcAA~Gfvc   89 (830)
T KOG4152|consen   17 VRWRRVQQSTGPVPRPRHGHRAVAI--KELIVIFGGGNEG-IIDELHVYNTATNQWFAPAVRGDIPPGCAAFGFVC   89 (830)
T ss_pred             cceEEEecccCCCCCccccchheee--eeeEEEecCCccc-chhhhhhhccccceeecchhcCCCCCchhhcceEe
Confidence            4788875      445799999999  9999999986543 468899999999999864   45666666666654


No 37 
>PF07250 Glyoxal_oxid_N:  Glyoxal oxidase N-terminus;  InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium [].
Probab=98.00  E-value=4.2e-05  Score=45.00  Aligned_cols=74  Identities=20%  Similarity=0.231  Sum_probs=54.4

Q ss_pred             ceecCCCCCeeecCCCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCC----CcEEecc-CCCCCCCceeEEEE
Q psy6079           3 SSICLHLVFWFPLLLLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRL----RQWSFIA-PMLSPRSTAGVAVL   77 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~----~~W~~~~-~~~~~r~~~~~~~~   77 (79)
                      ..||+.+++++.+....-..++..++-.++.+.++||..+.  ...+..|++.+    ..|...+ .|..+|..+++..+
T Consensus        49 ~~yD~~tn~~rpl~v~td~FCSgg~~L~dG~ll~tGG~~~G--~~~ir~~~p~~~~~~~~w~e~~~~m~~~RWYpT~~~L  126 (243)
T PF07250_consen   49 VEYDPNTNTFRPLTVQTDTFCSGGAFLPDGRLLQTGGDNDG--NKAIRIFTPCTSDGTCDWTESPNDMQSGRWYPTATTL  126 (243)
T ss_pred             EEEecCCCcEEeccCCCCCcccCcCCCCCCCEEEeCCCCcc--ccceEEEecCCCCCCCCceECcccccCCCccccceEC
Confidence            57999999999987544333333333348899999997652  45678888876    6798775 58999999988876


Q ss_pred             C
Q psy6079          78 N   78 (79)
Q Consensus        78 ~   78 (79)
                      .
T Consensus       127 ~  127 (243)
T PF07250_consen  127 P  127 (243)
T ss_pred             C
Confidence            4


No 38 
>smart00612 Kelch Kelch domain.
Probab=97.82  E-value=2.7e-05  Score=33.86  Aligned_cols=26  Identities=23%  Similarity=0.240  Sum_probs=20.9

Q ss_pred             ceecCCCCCeeecCCCc--cceeeeEEE
Q psy6079           3 SSICLHLVFWFPLLLLS--YMCLGVGVL   28 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~~--r~~~~~~~~   28 (79)
                      .+||+.+++|+.+++|+  |..++++++
T Consensus        18 ~~yd~~~~~W~~~~~~~~~r~~~~~~~~   45 (47)
T smart00612       18 EVYDPETNKWTPLPSMPTPRSGHGVAVI   45 (47)
T ss_pred             EEECCCCCeEccCCCCCCccccceEEEe
Confidence            47999999999999776  666666665


No 39 
>KOG2437|consensus
Probab=97.53  E-value=9.5e-05  Score=47.61  Aligned_cols=66  Identities=18%  Similarity=0.154  Sum_probs=51.0

Q ss_pred             CCCeeecCC------------CccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEecc---CCCCCCCcee
Q psy6079           9 LVFWFPLLL------------LSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIA---PMLSPRSTAG   73 (79)
Q Consensus         9 ~~~W~~~~~------------~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~---~~~~~r~~~~   73 (79)
                      +.+|++.++            -.|.+|-++....++.+|+-||.++...+.++|.|+...++|..+.   ..|..|..|-
T Consensus       238 ~~~W~~i~~~~~~~~~~~~~p~~RgGHQMV~~~~~~CiYLYGGWdG~~~l~DFW~Y~v~e~~W~~iN~~t~~PG~RsCHR  317 (723)
T KOG2437|consen  238 KPRWSQIIPKSTKGDGEDNRPGMRGGHQMVIDVQTECVYLYGGWDGTQDLADFWAYSVKENQWTCINRDTEGPGARSCHR  317 (723)
T ss_pred             cccccccCchhhcccccccCccccCcceEEEeCCCcEEEEecCcccchhHHHHHhhcCCcceeEEeecCCCCCcchhhhh
Confidence            567887652            2378888888744558999999999888999999999999999873   3566777664


Q ss_pred             E
Q psy6079          74 V   74 (79)
Q Consensus        74 ~   74 (79)
                      .
T Consensus       318 M  318 (723)
T KOG2437|consen  318 M  318 (723)
T ss_pred             h
Confidence            4


No 40 
>PF13415 Kelch_3:  Galactose oxidase, central domain
Probab=97.28  E-value=0.00035  Score=31.07  Aligned_cols=26  Identities=12%  Similarity=-0.068  Sum_probs=20.7

Q ss_pred             ceecCCCCCeeecC--CCccceeeeEEE
Q psy6079           3 SSICLHLVFWFPLL--LLSYMCLGVGVL   28 (79)
Q Consensus         3 ~~~~~~~~~W~~~~--~~~r~~~~~~~~   28 (79)
                      ..||+.+++|++++  |.+|.+|+++++
T Consensus        22 ~~~~~~~~~W~~~~~~P~~R~~h~~~~i   49 (49)
T PF13415_consen   22 WVFDLDTNTWTRIGDLPPPRSGHTATVI   49 (49)
T ss_pred             EEEECCCCEEEECCCCCCCccceEEEEC
Confidence            47999999999998  445888887763


No 41 
>KOG2437|consensus
Probab=97.12  E-value=0.00093  Score=43.26  Aligned_cols=58  Identities=14%  Similarity=0.180  Sum_probs=44.6

Q ss_pred             eecCCCCCeeecC--------CCccceeeeEEECCCCe--EEEEcCCCC--C-CCCCeEEEEeCCCCcEEecc
Q psy6079           4 SICLHLVFWFPLL--------LLSYMCLGVGVLEPEGP--LYAVGGHDG--W-SYLNTTERFDPRLRQWSFIA   63 (79)
Q Consensus         4 ~~~~~~~~W~~~~--------~~~r~~~~~~~~~~~~~--iyv~GG~~~--~-~~~~~~~~yd~~~~~W~~~~   63 (79)
                      +||..++.|..+.        |-....|.+++.  .++  +||+||+.-  . ..+..++.||..-..|+...
T Consensus       350 ~FDi~~~~W~~ls~dt~~dGGP~~vfDHqM~Vd--~~k~~iyVfGGr~~~~~e~~f~GLYaf~~~~~~w~~l~  420 (723)
T KOG2437|consen  350 RFDIDTNTWMLLSEDTAADGGPKLVFDHQMCVD--SEKHMIYVFGGRILTCNEPQFSGLYAFNCQCQTWKLLR  420 (723)
T ss_pred             EEecCCceeEEecccccccCCcceeecceeeEe--cCcceEEEecCeeccCCCccccceEEEecCCccHHHHH
Confidence            5788899998886        222467888888  666  999999652  2 34788999999999998764


No 42 
>PF13964 Kelch_6:  Kelch motif
Probab=96.97  E-value=0.00064  Score=30.25  Aligned_cols=18  Identities=11%  Similarity=-0.011  Sum_probs=15.9

Q ss_pred             ceecCCCCCeeecCCCcc
Q psy6079           3 SSICLHLVFWFPLLLLSY   20 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~~r   20 (79)
                      .+||+++++|+.+++||.
T Consensus        31 ~~yd~~t~~W~~~~~mp~   48 (50)
T PF13964_consen   31 ERYDPETNTWEQLPPMPT   48 (50)
T ss_pred             EEEcCCCCcEEECCCCCC
Confidence            589999999999998873


No 43 
>PLN02772 guanylate kinase
Probab=96.97  E-value=0.0046  Score=38.96  Aligned_cols=51  Identities=6%  Similarity=-0.052  Sum_probs=37.1

Q ss_pred             ceecCCCCCeeecC-----CCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCC
Q psy6079           3 SSICLHLVFWFPLL-----LLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRL   56 (79)
Q Consensus         3 ~~~~~~~~~W~~~~-----~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~   56 (79)
                      +.||..+++|....     |.||.+|+++++. +++|+|+++....  ..++|.....|
T Consensus        54 ~i~D~~t~~W~~P~V~G~~P~~r~GhSa~v~~-~~rilv~~~~~~~--~~~~w~l~~~t  109 (398)
T PLN02772         54 QILDKITNNWVSPIVLGTGPKPCKGYSAVVLN-KDRILVIKKGSAP--DDSIWFLEVDT  109 (398)
T ss_pred             EEEECCCCcEecccccCCCCCCCCcceEEEEC-CceEEEEeCCCCC--ccceEEEEcCC
Confidence            46899999999875     7779999999883 5689999764332  25566555444


No 44 
>TIGR01640 F_box_assoc_1 F-box protein interaction domain. This model describes a large family of plant domains, with several hundred members in Arabidopsis thaliana. Most examples are found C-terminal to an F-box (pfam00646), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes. Some members have two copies of this domain.
Probab=96.43  E-value=0.048  Score=31.54  Aligned_cols=59  Identities=14%  Similarity=-0.013  Sum_probs=39.1

Q ss_pred             ceecCCCCCeeecCCCc---cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEe-cc
Q psy6079           3 SSICLHLVFWFPLLLLS---YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSF-IA   63 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~~---r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~-~~   63 (79)
                      ..|+..++.|..+...+   ......+.+  ++.+|-+...........+..||..+.+|.. ++
T Consensus        73 ~Vys~~~~~Wr~~~~~~~~~~~~~~~v~~--~G~lyw~~~~~~~~~~~~IvsFDl~~E~f~~~i~  135 (230)
T TIGR01640        73 QVYTLGSNSWRTIECSPPHHPLKSRGVCI--NGVLYYLAYTLKTNPDYFIVSFDVSSERFKEFIP  135 (230)
T ss_pred             EEEEeCCCCccccccCCCCccccCCeEEE--CCEEEEEEEECCCCCcEEEEEEEcccceEeeeee
Confidence            47888999999987322   111225667  9998887642221112268999999999995 53


No 45 
>PF01344 Kelch_1:  Kelch motif;  InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=96.38  E-value=0.001  Score=29.01  Aligned_cols=17  Identities=12%  Similarity=0.004  Sum_probs=15.0

Q ss_pred             ceecCCCCCeeecCCCc
Q psy6079           3 SSICLHLVFWFPLLLLS   19 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~~   19 (79)
                      .+||+++++|+.+++||
T Consensus        31 ~~yd~~~~~W~~~~~mp   47 (47)
T PF01344_consen   31 EVYDPETNTWEELPPMP   47 (47)
T ss_dssp             EEEETTTTEEEEEEEES
T ss_pred             EEEeCCCCEEEEcCCCC
Confidence            47999999999998875


No 46 
>TIGR01640 F_box_assoc_1 F-box protein interaction domain. This model describes a large family of plant domains, with several hundred members in Arabidopsis thaliana. Most examples are found C-terminal to an F-box (pfam00646), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes. Some members have two copies of this domain.
Probab=95.92  E-value=0.15  Score=29.50  Aligned_cols=62  Identities=8%  Similarity=-0.122  Sum_probs=35.9

Q ss_pred             ceecCCCCCeeecCCCcc----ceeeeEEECCCC-----eEEEEcCCCCCCCCCeEEEEeCCCCcEEeccC
Q psy6079           3 SSICLHLVFWFPLLLLSY----MCLGVGVLEPEG-----PLYAVGGHDGWSYLNTTERFDPRLRQWSFIAP   64 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~~r----~~~~~~~~~~~~-----~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~   64 (79)
                      ...||.|.+|..+|+.+.    .......+.++.     |+..+...........+++|+..++.|+.+..
T Consensus        17 ~V~NP~T~~~~~LP~~~~~~~~~~~~~~~~G~d~~~~~YKVv~~~~~~~~~~~~~~~Vys~~~~~Wr~~~~   87 (230)
T TIGR01640        17 VVWNPSTGQSRWLPTPKSRRSNKESDTYFLGYDPIEKQYKVLCFSDRSGNRNQSEHQVYTLGSNSWRTIEC   87 (230)
T ss_pred             EEECCCCCCEEecCCCCCcccccccceEEEeecccCCcEEEEEEEeecCCCCCccEEEEEeCCCCcccccc
Confidence            367999999999974331    111011121122     44444332111124578999999999999864


No 47 
>PF13418 Kelch_4:  Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=95.83  E-value=0.0058  Score=26.91  Aligned_cols=17  Identities=12%  Similarity=0.082  Sum_probs=12.4

Q ss_pred             ceecCCCCCeeecCCCc
Q psy6079           3 SSICLHLVFWFPLLLLS   19 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~~   19 (79)
                      .+||+++++|++++++|
T Consensus        32 ~~~d~~~~~W~~~~~~P   48 (49)
T PF13418_consen   32 WIFDIETNTWTRLPSMP   48 (49)
T ss_dssp             EEEETTTTEEEE--SS-
T ss_pred             EEEECCCCEEEECCCCC
Confidence            47999999999998776


No 48 
>PF07646 Kelch_2:  Kelch motif;  InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=95.37  E-value=0.013  Score=25.90  Aligned_cols=17  Identities=12%  Similarity=0.018  Sum_probs=14.8

Q ss_pred             ceecCCCCCeeecCCCc
Q psy6079           3 SSICLHLVFWFPLLLLS   19 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~~   19 (79)
                      +.||+++++|+.+++++
T Consensus        33 ~~~d~~t~~W~~~~~~g   49 (49)
T PF07646_consen   33 WVFDTETNQWTELSPMG   49 (49)
T ss_pred             EEEECCCCEEeecCCCC
Confidence            57999999999998775


No 49 
>PF12768 Rax2:  Cortical protein marker for cell polarity
Probab=95.19  E-value=0.16  Score=30.87  Aligned_cols=60  Identities=12%  Similarity=-0.008  Sum_probs=39.3

Q ss_pred             eecCCCCCeeecCCC-ccceeeeEEECCCCeEEEEcCCCCCC-CCCeEEEEeCCCCcEEeccC
Q psy6079           4 SICLHLVFWFPLLLL-SYMCLGVGVLEPEGPLYAVGGHDGWS-YLNTTERFDPRLRQWSFIAP   64 (79)
Q Consensus         4 ~~~~~~~~W~~~~~~-~r~~~~~~~~~~~~~iyv~GG~~~~~-~~~~~~~yd~~~~~W~~~~~   64 (79)
                      .||+...+|..+..- .-.-..+... -++++|+.|-+.-.. ....+..||.++.+|..++.
T Consensus        20 ~yd~~~~qW~~~g~~i~G~V~~l~~~-~~~~Llv~G~ft~~~~~~~~la~yd~~~~~w~~~~~   81 (281)
T PF12768_consen   20 LYDTDNSQWSSPGNGISGTVTDLQWA-SNNQLLVGGNFTLNGTNSSNLATYDFKNQTWSSLGG   81 (281)
T ss_pred             EEECCCCEeecCCCCceEEEEEEEEe-cCCEEEEEEeeEECCCCceeEEEEecCCCeeeecCC
Confidence            589999999998732 1111222222 256777777544333 35678999999999988865


No 50 
>PF07893 DUF1668:  Protein of unknown function (DUF1668);  InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function. 
Probab=89.88  E-value=3.3  Score=25.82  Aligned_cols=36  Identities=14%  Similarity=-0.101  Sum_probs=24.8

Q ss_pred             eecCCCCCeeecCCCc--cceeeeEEECCCCeEEEEcCCC
Q psy6079           4 SICLHLVFWFPLLLLS--YMCLGVGVLEPEGPLYAVGGHD   41 (79)
Q Consensus         4 ~~~~~~~~W~~~~~~~--r~~~~~~~~~~~~~iyv~GG~~   41 (79)
                      .||+++..-..+|.+.  .....++.+  +++||++....
T Consensus        90 vyDt~t~av~~~P~l~~pk~~pisv~V--G~~LY~m~~~~  127 (342)
T PF07893_consen   90 VYDTDTRAVATGPRLHSPKRCPISVSV--GDKLYAMDRSP  127 (342)
T ss_pred             EEECCCCeEeccCCCCCCCcceEEEEe--CCeEEEeeccC
Confidence            5888888877777554  233345556  88999997643


No 51 
>PF07250 Glyoxal_oxid_N:  Glyoxal oxidase N-terminus;  InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium [].
Probab=88.51  E-value=0.95  Score=27.03  Aligned_cols=34  Identities=18%  Similarity=0.401  Sum_probs=26.2

Q ss_pred             CCCeEEEEcCCCCCCCCCeEEEEeCCCCcE-EeccCCCC-CCC
Q psy6079          30 PEGPLYAVGGHDGWSYLNTTERFDPRLRQW-SFIAPMLS-PRS   70 (79)
Q Consensus        30 ~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W-~~~~~~~~-~r~   70 (79)
                      .+++||+++.       ....+||+.++++ ..+|.+|. +|.
T Consensus       179 PdG~lFi~an-------~~s~i~d~~~n~v~~~lP~lPg~~R~  214 (243)
T PF07250_consen  179 PDGNLFIFAN-------RGSIIYDYKTNTVVRTLPDLPGGPRN  214 (243)
T ss_pred             CCCCEEEEEc-------CCcEEEeCCCCeEEeeCCCCCCCcee
Confidence            4999999975       3457789999987 77888884 454


No 52 
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=85.37  E-value=5.8  Score=24.69  Aligned_cols=55  Identities=13%  Similarity=0.173  Sum_probs=33.6

Q ss_pred             ceecCCCCCeeecC--CCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEecc
Q psy6079           3 SSICLHLVFWFPLL--LLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIA   63 (79)
Q Consensus         3 ~~~~~~~~~W~~~~--~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~   63 (79)
                      ++|||....|..-+  ...-..+++.+-. .+.+++-     ....+.+.+|||++.+...++
T Consensus       257 ~rfdPs~~sW~eypLPgs~arpys~rVD~-~grVW~s-----ea~agai~rfdpeta~ftv~p  313 (353)
T COG4257         257 HRFDPSVTSWIEYPLPGSKARPYSMRVDR-HGRVWLS-----EADAGAIGRFDPETARFTVLP  313 (353)
T ss_pred             eEeCcccccceeeeCCCCCCCcceeeecc-CCcEEee-----ccccCceeecCcccceEEEec
Confidence            68999999998754  2222233444331 3345542     112467889999998877663


No 53 
>PF13570 PQQ_3:  PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=81.01  E-value=3.1  Score=17.18  Aligned_cols=37  Identities=16%  Similarity=0.074  Sum_probs=18.5

Q ss_pred             CeeecCCCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCC
Q psy6079          11 FWFPLLLLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRL   56 (79)
Q Consensus        11 ~W~~~~~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~   56 (79)
                      .|+.-.+- ....+.++.  ++.+|+.+.      ...++++|+++
T Consensus         4 ~W~~~~~~-~~~~~~~v~--~g~vyv~~~------dg~l~ald~~t   40 (40)
T PF13570_consen    4 LWSYDTGG-PIWSSPAVA--GGRVYVGTG------DGNLYALDAAT   40 (40)
T ss_dssp             EEEEE-SS----S--EEC--TSEEEEE-T------TSEEEEEETT-
T ss_pred             eEEEECCC-CcCcCCEEE--CCEEEEEcC------CCEEEEEeCCC
Confidence            46664322 222334455  778887654      35788888764


No 54 
>PF08268 FBA_3:  F-box associated domain;  InterPro: IPR013187 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=78.15  E-value=8.1  Score=20.33  Aligned_cols=33  Identities=30%  Similarity=0.301  Sum_probs=24.1

Q ss_pred             CCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccC
Q psy6079          31 EGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAP   64 (79)
Q Consensus        31 ~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~   64 (79)
                      +|-+|-..-. .......+-+||.++.+|..+..
T Consensus         5 nGvly~~a~~-~~~~~~~IvsFDv~~E~f~~i~~   37 (129)
T PF08268_consen    5 NGVLYWLAWS-EDSDNNVIVSFDVRSEKFRFIKL   37 (129)
T ss_pred             CcEEEeEEEE-CCCCCcEEEEEEcCCceEEEEEe
Confidence            7777776543 22235789999999999998854


No 55 
>PF06058 DCP1:  Dcp1-like decapping family;  InterPro: IPR010334 An essential step in mRNA turnover is decapping. In yeast, two proteins have been identified that are essential for decapping, Dcp1 (this family) and Dcp2 (IPR007722 from INTERPRO). The precise role of these proteins in the decapping reaction has not been established. Evidence suggests that the Dcp1 may enhance the function of Dcp2 [].; PDB: 1Q67_A 2QKM_C 2QKL_A.
Probab=69.01  E-value=6.8  Score=20.91  Aligned_cols=16  Identities=19%  Similarity=0.418  Sum_probs=14.0

Q ss_pred             CeEEEEeCCCCcEEec
Q psy6079          47 NTTERFDPRLRQWSFI   62 (79)
Q Consensus        47 ~~~~~yd~~~~~W~~~   62 (79)
                      ..++.||+.+++|++.
T Consensus        29 v~vY~f~~~~~~W~K~   44 (122)
T PF06058_consen   29 VVVYKFDHETNEWEKT   44 (122)
T ss_dssp             EEEEEEETTTTEEEEE
T ss_pred             EEEEeecCCCCcEeec
Confidence            5688899999999986


No 56 
>PRK09511 nirD nitrite reductase small subunit; Provisional
Probab=66.12  E-value=16  Score=18.86  Aligned_cols=31  Identities=13%  Similarity=-0.039  Sum_probs=12.6

Q ss_pred             eecCCCCCeeecCCCccceeeeEEECCCCeEEE
Q psy6079           4 SICLHLVFWFPLLLLSYMCLGVGVLEPEGPLYA   36 (79)
Q Consensus         4 ~~~~~~~~W~~~~~~~r~~~~~~~~~~~~~iyv   36 (79)
                      +||.+|++-...+..+..-+-+.+.  ++.|||
T Consensus        76 ~Fdl~TG~~~~~~~~~l~typV~ve--~g~V~v  106 (108)
T PRK09511         76 RFRLSDGLCMEDEQFSVKHYDARVK--DGVVQL  106 (108)
T ss_pred             EEECCCcccCCCCCccEeeEeEEEE--CCEEEE
Confidence            4455554443322222233333444  555554


No 57 
>PF07443 HARP:  HepA-related protein (HARP);  InterPro: IPR010003 This entry represents a conserved region approximately 60 residues long within eukaryotic HepA-related protein (HARP). This exhibits single-stranded DNA-dependent ATPase activity, and is ubiquitously expressed in human and mouse tissues []. Family members may contain more than one copy of this region.; GO: 0004386 helicase activity, 0005524 ATP binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0016568 chromatin modification, 0005634 nucleus
Probab=65.49  E-value=2.8  Score=19.28  Aligned_cols=12  Identities=42%  Similarity=1.326  Sum_probs=10.1

Q ss_pred             EEEeCCCCcEEe
Q psy6079          50 ERFDPRLRQWSF   61 (79)
Q Consensus        50 ~~yd~~~~~W~~   61 (79)
                      ..||+.|+.|.-
T Consensus        21 r~YD~~Tr~W~F   32 (55)
T PF07443_consen   21 RNYDPKTRKWNF   32 (55)
T ss_pred             cccCccceeeee
Confidence            468999999984


No 58 
>PF05096 Glu_cyclase_2:  Glutamine cyclotransferase;  InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=63.06  E-value=33  Score=21.04  Aligned_cols=46  Identities=20%  Similarity=0.317  Sum_probs=31.4

Q ss_pred             CCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEECC
Q psy6079          31 EGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVLNS   79 (79)
Q Consensus        31 ~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~   79 (79)
                      ++.+|.--|..+   .+.+..||+.+.+-....+++..-.+-+.+.++|
T Consensus        55 ~g~LyESTG~yG---~S~l~~~d~~tg~~~~~~~l~~~~FgEGit~~~d  100 (264)
T PF05096_consen   55 DGTLYESTGLYG---QSSLRKVDLETGKVLQSVPLPPRYFGEGITILGD  100 (264)
T ss_dssp             TTEEEEEECSTT---EEEEEEEETTTSSEEEEEE-TTT--EEEEEEETT
T ss_pred             CCEEEEeCCCCC---cEEEEEEECCCCcEEEEEECCccccceeEEEECC
Confidence            567777656544   3789999999998777777776666667777664


No 59 
>PF08450 SGL:  SMP-30/Gluconolaconase/LRE-like region;  InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=62.86  E-value=28  Score=20.21  Aligned_cols=54  Identities=17%  Similarity=0.137  Sum_probs=30.2

Q ss_pred             ceecCCCCCeeecCCCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCC
Q psy6079           3 SSICLHLVFWFPLLLLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPM   65 (79)
Q Consensus         3 ~~~~~~~~~W~~~~~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~   65 (79)
                      +++|+.+.+-.......  ..+++....++.+|+...       .....+|+.+.+++.+...
T Consensus        25 ~~~~~~~~~~~~~~~~~--~~G~~~~~~~g~l~v~~~-------~~~~~~d~~~g~~~~~~~~   78 (246)
T PF08450_consen   25 YRVDPDTGEVEVIDLPG--PNGMAFDRPDGRLYVADS-------GGIAVVDPDTGKVTVLADL   78 (246)
T ss_dssp             EEEETTTTEEEEEESSS--EEEEEEECTTSEEEEEET-------TCEEEEETTTTEEEEEEEE
T ss_pred             EEEECCCCeEEEEecCC--CceEEEEccCCEEEEEEc-------CceEEEecCCCcEEEEeec
Confidence            35566665544433222  222333213678888753       2346669999988877655


No 60 
>PF14830 Haemocyan_bet_s:  Haemocyanin beta-sandwich; PDB: 1JS8_B 3QJO_A 1LNL_A.
Probab=62.44  E-value=7.7  Score=20.21  Aligned_cols=24  Identities=17%  Similarity=0.451  Sum_probs=11.9

Q ss_pred             CCeEEEEcCCCCC-CCCCeEEEEeC
Q psy6079          31 EGPLYAVGGHDGW-SYLNTTERFDP   54 (79)
Q Consensus        31 ~~~iyv~GG~~~~-~~~~~~~~yd~   54 (79)
                      .+.++|.||..+. +.+..++.||.
T Consensus        36 AG~F~vLGG~~EMpW~FdRlykydI   60 (103)
T PF14830_consen   36 AGTFFVLGGEKEMPWAFDRLYKYDI   60 (103)
T ss_dssp             EEEEEE---TTS---B-SS-EEEE-
T ss_pred             ccEEEEcCCCccCccccCccchhhH
Confidence            4568999997764 34778888886


No 61 
>PF00614 PLDc:  Phospholipase D Active site motif;  InterPro: IPR001736 Phosphatidylcholine-hydrolysing phospholipase D (PLD) isoforms are activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic acid from phosphatidylcholine, which may be essential for the formation of certain types of transport vesicles or may be constitutive vesicular transport to signal transduction pathways. PC-hydrolysing PLD is a homologue of cardiolipin synthase, phosphatidylserine synthase, bacterial PLDs, and viral proteins. Each of these appears to possess a domain duplication which is apparent by the presence of two motifs containing well-conserved histidine, lysine, and/or asparagine residues which may contribute to the active site aspartic acid. An Escherichia coli endonuclease (nuc) and similar proteins appear to be PLD homologues but possess only one of these motifs [, , , ].; GO: 0003824 catalytic activity, 0008152 metabolic process; PDB: 3HSI_C.
Probab=61.89  E-value=10  Score=14.78  Aligned_cols=16  Identities=25%  Similarity=0.378  Sum_probs=9.4

Q ss_pred             eeEEECCCCeEEEEcCCC
Q psy6079          24 GVGVLEPEGPLYAVGGHD   41 (79)
Q Consensus        24 ~~~~~~~~~~iyv~GG~~   41 (79)
                      -..++  ++++..+||.+
T Consensus         8 K~~vv--D~~~a~vGg~n   23 (28)
T PF00614_consen    8 KFVVV--DDRVAFVGGAN   23 (28)
T ss_dssp             -EEEE--TTTEEEEE---
T ss_pred             EEEEE--cCCEEEECcee
Confidence            35567  88888888865


No 62 
>PF14157 YmzC:  YmzC-like protein; PDB: 3KVP_E.
Probab=61.55  E-value=14  Score=17.45  Aligned_cols=34  Identities=18%  Similarity=0.147  Sum_probs=19.4

Q ss_pred             eeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEec
Q psy6079          24 GVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFI   62 (79)
Q Consensus        24 ~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~   62 (79)
                      .++.+  .+.-|.+-|-.   ..-.++.||+++++-..+
T Consensus        23 ~m~ql--~~N~Fav~~e~---~~iKIfkyd~~tNei~L~   56 (63)
T PF14157_consen   23 NMTQL--EHNHFAVVDED---GQIKIFKYDEDTNEITLK   56 (63)
T ss_dssp             EEEE---STTEEEEE-ET---TEEEEEEEETTTTEEEEE
T ss_pred             ceEEe--cCCEEEEEecC---CeEEEEEeCCCCCeEEEE
Confidence            45555  44444443211   135789999999987654


No 63 
>PF03089 RAG2:  Recombination activating protein 2;  InterPro: IPR004321 The variable portion of the genes encoding immunoglobulins and T cell receptors are assembled from component V, D, and J DNA segments by a site-specific recombination reaction termed V(D)J recombination. V(D)J recombination is targeted to specific sites on the chromosome by recombination signal sequences (RSSs) that flank antigen receptor gene segments. The RSS consists of a conserved heptamer (consensus, 5'-CACAGTG-3') and nonamer (consensus, 5'-ACAAAAACC-3') separated by a spacer of either 12 or 23 bp. Efficient recombination occurs between a 12-RSS and a 23-RSS, a restriction known as the 12/23 rule. V(D)J recombination can be divided into two phases, DNA cleavage and DNA joining. DNA cleavage requires two lymphocyte-specific factors, the products of the recombination activating genes, RAG1 and RAG2, which together recognise the RSSs and create double strand breaks at the RSS-coding segment junctions []. RAG-mediated DNA cleavage occurs in a synaptic complex termed the paired complex, which is constituted from two distinct RSS-RAG complexes, a 12-SC and a 23-SC (where SC stands for signal complex). The DNA cleavage reaction involves two distinct enzymatic steps, initial nicking that creates a 3'-OH between a coding segment and its RSS, followed by hairpin formation in which the newly created 3'-OH attacks a phosphodiester bond on the opposite DNA strand. This generates a blunt, 5' phosphorylated signal end containing all of the RSS elements, and a covalently sealed hairpin coding end.  The second phase of V(D)J recombination, in which broken DNA fragments are processed and joined, is less well characterised. Signal ends are typically joined precisely to form a signal joint, whereas joining of the coding ends requires the hairpin structure to be opened and typically involves nucleotide addition and deletion before formation of the coding joint. The factors involved in these processes include ubiquitously expressed proteins involved in the repair of DNA double strand breaks by nonhomologous end joining, terminal deoxynucleotidyl transferase, and Artemis protein. In addition to their critical roles in RSS recognition and DNA cleavage, the RAG proteins may perform two distinct types of functions in the postcleavage phase of V(D)J. A structural function has been inferred from the finding that, after DNA cleavage in vitro, the DNA ends remain associated with the RAG proteins in a "four end" complex known as the cleaved signal complex. After release of the coding ends in vitro, and after coding joint formation in vivo, the RAG proteins remain in a stable signal end complex (SEC) containing the two signal ends. These postcleavage complexes may serve as essential scaffolds for the second phase of the reaction, with the RAG proteins acting to organise the DNA processing and joining events.  The second type of RAG protein-mediated postcleavage activity is the catalysis of phosphodiester bond hydrolysis and strand transfer reactions. The RAG proteins are capable of opening hairpin coding ends in vitro. The RAG proteins also show 3' flap endonuclease activity that may contribute to coding end processing/joining and can utilise the 3' OH group on the signal ends to attack hairpin coding ends (forming hybrid or open/shut joints) or virtually any DNA duplex (forming a transposition product).; GO: 0003677 DNA binding, 0006310 DNA recombination, 0005634 nucleus
Probab=55.20  E-value=25  Score=22.06  Aligned_cols=19  Identities=21%  Similarity=0.235  Sum_probs=14.6

Q ss_pred             ceeeeEEECCCCeEEEEcCCC
Q psy6079          21 MCLGVGVLEPEGPLYAVGGHD   41 (79)
Q Consensus        21 ~~~~~~~~~~~~~iyv~GG~~   41 (79)
                      ..|.+.+.  ++.+|++||..
T Consensus       156 SFHvslar--~D~VYilGGHs  174 (337)
T PF03089_consen  156 SFHVSLAR--NDCVYILGGHS  174 (337)
T ss_pred             EEEEEEec--CceEEEEccEE
Confidence            55666666  88999999954


No 64 
>PRK11408 hypothetical protein; Provisional
Probab=54.91  E-value=30  Score=19.24  Aligned_cols=29  Identities=24%  Similarity=0.528  Sum_probs=20.0

Q ss_pred             EEEE-cCCCCCCCCCeEEEEeCCCCcEEecc
Q psy6079          34 LYAV-GGHDGWSYLNTTERFDPRLRQWSFIA   63 (79)
Q Consensus        34 iyv~-GG~~~~~~~~~~~~yd~~~~~W~~~~   63 (79)
                      +|.+ ||.-+.. ...++.|-|++-+|+.+.
T Consensus        34 ~FAiNgG~lg~~-~G~v~Y~aPDtL~WE~l~   63 (145)
T PRK11408         34 FFALNGGGLGED-VGEVYYFAPDTLKWESLE   63 (145)
T ss_pred             eEEeccCccCCC-CCeEEEeCCCccceeccC
Confidence            4455 4433322 578999999999999763


No 65 
>PF13806 Rieske_2:  Rieske-like [2Fe-2S] domain; PDB: 2JO6_A 3C0D_A 3D89_A 2JZA_A.
Probab=54.08  E-value=20  Score=18.45  Aligned_cols=31  Identities=19%  Similarity=0.066  Sum_probs=13.9

Q ss_pred             eecCCCCCeeecCCCccceeeeEEECCCCeEEE
Q psy6079           4 SICLHLVFWFPLLLLSYMCLGVGVLEPEGPLYA   36 (79)
Q Consensus         4 ~~~~~~~~W~~~~~~~r~~~~~~~~~~~~~iyv   36 (79)
                      +||++|++....+..+..-+-+-+.  ++.|||
T Consensus        73 ~f~L~tG~~~~~~~~~l~~ypvrv~--~g~V~V  103 (104)
T PF13806_consen   73 RFDLRTGECLEDPDVSLRTYPVRVE--DGQVYV  103 (104)
T ss_dssp             EEETTTTEESSECSEBSBEEEEEEC--TTEEEE
T ss_pred             eEECCCcCcCCCCCCcEEeEEEEEE--CCEEEE
Confidence            4555555443333323333444444  555554


No 66 
>COG4880 Secreted protein containing C-terminal beta-propeller domain distantly related to WD-40 repeats [General function prediction only]
Probab=51.43  E-value=18  Score=24.06  Aligned_cols=29  Identities=28%  Similarity=0.544  Sum_probs=20.2

Q ss_pred             eEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccC
Q psy6079          25 VGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAP   64 (79)
Q Consensus        25 ~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~   64 (79)
                      +..+  ++.+|++||       +.++.+|  .|.|+.+..
T Consensus       571 A~fi--~dylY~vg~-------~ev~~ld--enswe~Vge  599 (603)
T COG4880         571 AFFI--KDYLYLVGG-------NEVWKLD--ENSWEVVGE  599 (603)
T ss_pred             eEEe--cceEEEecc-------ceeEEec--cchHhhhhh
Confidence            4455  888999987       5677775  456877643


No 67 
>KOG0278|consensus
Probab=44.09  E-value=41  Score=20.85  Aligned_cols=30  Identities=23%  Similarity=0.449  Sum_probs=18.1

Q ss_pred             eeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCC
Q psy6079          23 LGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLR   57 (79)
Q Consensus        23 ~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~   57 (79)
                      -.++.+.++..+||.||-+     -.++.||-.|.
T Consensus       227 V~SASL~P~k~~fVaGged-----~~~~kfDy~Tg  256 (334)
T KOG0278|consen  227 VESASLHPKKEFFVAGGED-----FKVYKFDYNTG  256 (334)
T ss_pred             cccccccCCCceEEecCcc-----eEEEEEeccCC
Confidence            3445555577899999843     23455555444


No 68 
>TIGR00547 lolA periplasmic chaperone LolA. This protein, LolA, is known so far only in the gamma and beta subdivisions of the Proteobacteria. The E. coli major outer lipoprotein (Lpp) of E. coli is released from the inner membrane as a complex with this chaperone in an energy-requiring process, and is then delivered to LolB for insertion into the outer membrane. LolA is involved in the delivery of lipoproteins generally, rather than just Lpp, and is an essential protein in E. coli, unlike Lpp itself.
Probab=43.80  E-value=55  Score=19.02  Aligned_cols=24  Identities=21%  Similarity=0.259  Sum_probs=16.6

Q ss_pred             CCeEEEEcCCCCCCCCCeEEEEeCCCCcEEe
Q psy6079          31 EGPLYAVGGHDGWSYLNTTERFDPRLRQWSF   61 (79)
Q Consensus        31 ~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~   61 (79)
                      .+.+.|..|       ..++.|||+.++...
T Consensus        77 ~~q~iVsdG-------~~vw~Ydp~leQvt~  100 (204)
T TIGR00547        77 DESIIISDG-------KTLWFYDPFVEQATA  100 (204)
T ss_pred             CceEEEECC-------CEEEEECCCCceeee
Confidence            445555554       578999998888653


No 69 
>cd01207 Ena-Vasp Enabled-VASP-type homology (EVH1) domain. Enabled-VASP-type homology (EVH1) domain. The EVH1 domain binds to other proteins at proline rich sequences. It is found in proteins involved in cytoskeletal reorganization such as Enabled and VASP. Ena-VASP type EVH1 domains specifically recognize FPPPP motifs in the focal adhesion proteins zyxin and vinculin, and the ActA surface protein of Listeria monocytogenes.  It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
Probab=43.71  E-value=30  Score=18.30  Aligned_cols=18  Identities=11%  Similarity=0.276  Sum_probs=14.7

Q ss_pred             CeEEEEeCCCCcEEeccC
Q psy6079          47 NTTERFDPRLRQWSFIAP   64 (79)
Q Consensus        47 ~~~~~yd~~~~~W~~~~~   64 (79)
                      ..+..||+.++.|.+...
T Consensus         9 A~Vm~~d~~tk~W~P~~~   26 (111)
T cd01207           9 ASVMVYDDSNKKWVPAGG   26 (111)
T ss_pred             EEeeEEcCCCCcEEcCCC
Confidence            467889999999988644


No 70 
>PF03835 Rad4:  Rad4 transglutaminase-like domain;  InterPro: IPR018325 RAD4/Xp-C proteins contain an ancient transglutaminase fold that is also found in peptide-N-glycanases (PNGases), which remove glycans from glycoproteins during their degradation. The PNGases retain the catalytic triad that is typical of this fold and are predicted to have a reaction mechanism similar to that involved in transglutamination. In contrast, the RAD4/Xp-C proteins are predicted to be inactive and are likely to only possess the interaction function in DNA repair []. ; GO: 0003684 damaged DNA binding, 0006289 nucleotide-excision repair, 0005634 nucleus; PDB: 2QSG_A 2QSF_A 2QSH_A 1X3W_A 1X3Z_A 3ESW_A.
Probab=43.57  E-value=34  Score=18.58  Aligned_cols=18  Identities=22%  Similarity=0.791  Sum_probs=15.2

Q ss_pred             EEEEeCCCCcEEeccCCC
Q psy6079          49 TERFDPRLRQWSFIAPML   66 (79)
Q Consensus        49 ~~~yd~~~~~W~~~~~~~   66 (79)
                      +|+|++..++|-.+.++.
T Consensus        39 ~EV~~~~~~rWI~VDp~~   56 (145)
T PF03835_consen   39 VEVYSPEEKRWIHVDPVV   56 (145)
T ss_dssp             EEEEETTTTEEEEEETTT
T ss_pred             EEEEecCCCeEEEeeeec
Confidence            688999999999886654


No 71 
>KOG2321|consensus
Probab=42.43  E-value=53  Score=22.73  Aligned_cols=38  Identities=16%  Similarity=0.198  Sum_probs=24.7

Q ss_pred             CCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEE
Q psy6079          17 LLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWS   60 (79)
Q Consensus        17 ~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~   60 (79)
                      .+|+.+.-++.-...-.||+.|.      ..+++++|.+...|-
T Consensus       131 RIP~~GRDm~y~~~scDly~~gs------g~evYRlNLEqGrfL  168 (703)
T KOG2321|consen  131 RIPKFGRDMKYHKPSCDLYLVGS------GSEVYRLNLEQGRFL  168 (703)
T ss_pred             ecCcCCccccccCCCccEEEeec------CcceEEEEccccccc
Confidence            34555555554433556888763      367899999988874


No 72 
>PF03088 Str_synth:  Strictosidine synthase;  InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=42.40  E-value=47  Score=16.71  Aligned_cols=33  Identities=15%  Similarity=-0.086  Sum_probs=17.8

Q ss_pred             eecCCCCCeeecCCCccceeeeEEECCCCeEEEE
Q psy6079           4 SICLHLVFWFPLLLLSYMCLGVGVLEPEGPLYAV   37 (79)
Q Consensus         4 ~~~~~~~~W~~~~~~~r~~~~~~~~~~~~~iyv~   37 (79)
                      +|||.+++.+.+.. .......+++..++...++
T Consensus        41 ~ydp~t~~~~vl~~-~L~fpNGVals~d~~~vlv   73 (89)
T PF03088_consen   41 RYDPSTKETTVLLD-GLYFPNGVALSPDESFVLV   73 (89)
T ss_dssp             EEETTTTEEEEEEE-EESSEEEEEE-TTSSEEEE
T ss_pred             EEECCCCeEEEehh-CCCccCeEEEcCCCCEEEE
Confidence            68888888877642 1223334444445554333


No 73 
>cd05398 NT_ClassII-CCAase Nucleotidyltransferase (NT) domain of ClassII CCA-adding enzymes. CCA-adding enzymes add the sequence [cytidine(C)-cytidine-adenosine (A)], one nucleotide at a time, onto the 3' end of tRNA, in a template-independent reaction. This Class II group is comprised mainly of eubacterial and eukaryotic enzymes and includes Bacillus stearothermophilus CCAase, Escherichia coli poly(A) polymerase I, human mitochondrial CCAase, and Saccharomyces cerevisiae CCAase (CCA1). CCA-adding enzymes have a single catalytic pocket, which recognizes both ATP and CTP substrates. Included in this subgroup are CC- and A-adding enzymes from various ancient species of bacteria such as Aquifex aeolicus; these enzymes collaborate to add CCA to tRNAs. This family belongs to the Pol beta-like NT superfamily. In the majority of enzymes in this superfamily, two carboxylates, Dx[D/E], together with a third more distal carboxylate, coordinate two divalent metal cations involved in a two-metal io
Probab=41.50  E-value=21  Score=19.36  Aligned_cols=11  Identities=36%  Similarity=0.395  Sum_probs=8.9

Q ss_pred             CCeEEEEcCCC
Q psy6079          31 EGPLYAVGGHD   41 (79)
Q Consensus        31 ~~~iyv~GG~~   41 (79)
                      +.++|++||+-
T Consensus        16 g~~~ylVGG~V   26 (139)
T cd05398          16 GYEAYLVGGAV   26 (139)
T ss_pred             CceEEEECChH
Confidence            66899999953


No 74 
>cd03583 NTR_complement_C3 NTR/C345C domain, complement C3 subfamily; The NTR domain found in complement C3 is also known as the C345C domain because it occurs at the C-terminus of complement C3, C4 and C5. Complement C3 plays a pivotal role in the activation of the complement systems, as all pathways (classical, alternative, and lectin) result in the processing of C3 by C3 convertase. The larger fragment, activated C3b, contains the NTR/C345C domain and binds covalently, via a reactive thioester, to cell surface carbohydrates including components of bacterial cell walls and immune aggregates. The smaller cleavage product, C3a, acts independently as a diffusible signal to mediate local inflammatory processes. The structure of C3 shows that the NTR/C345C domain is located in an exposed position relative to the rest of the molecule. The function of the domain in complement C3 is poorly understood.
Probab=40.86  E-value=52  Score=18.23  Aligned_cols=32  Identities=16%  Similarity=0.200  Sum_probs=21.5

Q ss_pred             CCeEEEEcCCCCCCC-CCeEEEEeCCCCcEEec
Q psy6079          31 EGPLYAVGGHDGWSY-LNTTERFDPRLRQWSFI   62 (79)
Q Consensus        31 ~~~iyv~GG~~~~~~-~~~~~~yd~~~~~W~~~   62 (79)
                      .++.|++-|.++... ...-+.|-..+++|-..
T Consensus        86 ~gk~YLIMG~~~~~~~~~~~~~YvL~~~TWvE~  118 (149)
T cd03583          86 EGKDYLIMGLSSDLWRIKDKYSYVIGKDTWIEY  118 (149)
T ss_pred             CCCEEEEEeCCCCccccCCcEEEEeCCCeEEEE
Confidence            467788877664432 23347888899999765


No 75 
>smart00155 PLDc Phospholipase D. Active site motifs. Phosphatidylcholine-hydrolyzing phospholipase D (PLD) isoforms are  activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic  acid from phosphatidylcholine, which may be essential for the formation  of certain types of transport vesicles or may be constitutive vesicular transport to signal transduction pathways.  PC-hydrolysing PLD is a homologue of cardiolipin synthase, phosphatidylserine synthase, bacterial PLDs, and viral proteins. Each of these appears to  possess a domain duplication which is apparent by the presence of two motifs containing well-conserved histidine, lysine, aspartic acid,  and/or asparagine residues which may contribute to the active site. An E. coli endonuclease (nuc) and similar proteins appear to be PLD homologues but possess only one of these motifs. The profile contained here represents only the putative active site regions, since an accurate multiple alignment of the repeat units has not be
Probab=40.59  E-value=26  Score=13.17  Aligned_cols=19  Identities=5%  Similarity=0.256  Sum_probs=12.9

Q ss_pred             ceeeeEEECCCCeEEEEcCCC
Q psy6079          21 MCLGVGVLEPEGPLYAVGGHD   41 (79)
Q Consensus        21 ~~~~~~~~~~~~~iyv~GG~~   41 (79)
                      .+.-..++  +++..++|+.+
T Consensus         5 ~H~K~~v~--D~~~~~iGs~N   23 (28)
T smart00155        5 LHTKLMIV--DDEIAYIGSAN   23 (28)
T ss_pred             EEeEEEEE--cCCEEEEeCcc
Confidence            34445667  88888888765


No 76 
>PF10946 DUF2625:  Protein of unknown function DUF2625;  InterPro: IPR021239  Some members in this family of proteins are annotated as ybfG however currently no function is known. 
Probab=40.06  E-value=57  Score=19.30  Aligned_cols=18  Identities=17%  Similarity=0.359  Sum_probs=15.7

Q ss_pred             CCeEEEEeCCCCcEEecc
Q psy6079          46 LNTTERFDPRLRQWSFIA   63 (79)
Q Consensus        46 ~~~~~~yd~~~~~W~~~~   63 (79)
                      ...++.|.|++-+|+.+.
T Consensus       123 ~G~v~YfAPDtL~WE~l~  140 (208)
T PF10946_consen  123 PGKVYYFAPDTLEWEPLE  140 (208)
T ss_pred             CCeEEEECCCcccccccC
Confidence            578999999999999763


No 77 
>KOG2315|consensus
Probab=39.38  E-value=1.3e+02  Score=20.75  Aligned_cols=49  Identities=16%  Similarity=0.120  Sum_probs=27.9

Q ss_pred             eecCCCCCeeecCCCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCC
Q psy6079           4 SICLHLVFWFPLLLLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLR   57 (79)
Q Consensus         4 ~~~~~~~~W~~~~~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~   57 (79)
                      .||++.+---.++.-||.. ..  ....+.|.++.|+.+.  -.+++++|..+.
T Consensus       298 ifnlr~~~v~df~egpRN~-~~--fnp~g~ii~lAGFGNL--~G~mEvwDv~n~  346 (566)
T KOG2315|consen  298 IFNLRGKPVFDFPEGPRNT-AF--FNPHGNIILLAGFGNL--PGDMEVWDVPNR  346 (566)
T ss_pred             EEcCCCCEeEeCCCCCccc-eE--ECCCCCEEEEeecCCC--CCceEEEeccch
Confidence            4555555555555445533 12  2226667777777653  367888887663


No 78 
>cd03584 NTR_complement_C4 NTR/C345C domain, complement C4 subfamily; The NTR domain found in complement C4 is also known as the C345C domain because it occurs at the C-terminus of complement C3, C4 and C5. Complement C4 is a key player in the activation of the component classical pathway. C4 is cleaved by activated C1 to yield C4a anaphylatoxin, and the larger fragment C4b, an essential component of the C3- and C5-convertase enzymes. C4b binds covalently to the surface of pathogens through a reactive thioester. The role of the NTR/C345C domain in C4 (C4b) is unclear.
Probab=38.80  E-value=64  Score=17.91  Aligned_cols=32  Identities=22%  Similarity=0.326  Sum_probs=20.7

Q ss_pred             CCeEEEEcCCCCCCC-CCeEEEEeCCCCcEEec
Q psy6079          31 EGPLYAVGGHDGWSY-LNTTERFDPRLRQWSFI   62 (79)
Q Consensus        31 ~~~iyv~GG~~~~~~-~~~~~~yd~~~~~W~~~   62 (79)
                      -++-|++-|.++... ...-+.|-..+++|-..
T Consensus        90 ~gk~YLImG~~~~~~~~~~~~~Y~L~~~tWvE~  122 (153)
T cd03584          90 KGKEYLIMGKDGATSDSNGHMQYLLDSKTWVEK  122 (153)
T ss_pred             CCCEEEEEcCCCCCcCcCCcEEEEeCCCceEEE
Confidence            556787766554322 23346788899999765


No 79 
>cd01206 Homer Homer type EVH1 domain. Homer type EVH1 domain. Homer is a synaptic scaffolding protein, involved in neuronal signaling. It contains an  EVH1 domain, which binds to both neurotransmitter receptors, such as the metabotropic glutamate receptor (mGluR) and to other scaffolding proteins via PPXXF motifs, in order to target them to the synaptic junction. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
Probab=38.14  E-value=42  Score=17.80  Aligned_cols=18  Identities=17%  Similarity=0.434  Sum_probs=13.7

Q ss_pred             CeEEEEeCCCC-cEEeccC
Q psy6079          47 NTTERFDPRLR-QWSFIAP   64 (79)
Q Consensus        47 ~~~~~yd~~~~-~W~~~~~   64 (79)
                      ..+..||+.++ .|.+..+
T Consensus        11 A~V~~yd~~tKk~WvPs~~   29 (111)
T cd01206          11 AHVFQIDPKTKKNWIPASK   29 (111)
T ss_pred             eEEEEECCCCcceeEeCCC
Confidence            56888999875 8987654


No 80 
>PF07893 DUF1668:  Protein of unknown function (DUF1668);  InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function. 
Probab=37.79  E-value=1e+02  Score=19.36  Aligned_cols=41  Identities=17%  Similarity=0.130  Sum_probs=29.7

Q ss_pred             CCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCCCCCCceeEEEE
Q psy6079          31 EGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPMLSPRSTAGVAVL   77 (79)
Q Consensus        31 ~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~~~r~~~~~~~~   77 (79)
                      +.+|++++..      ...-+||.++..-...|.+..+.....++.+
T Consensus        76 gskIv~~d~~------~~t~vyDt~t~av~~~P~l~~pk~~pisv~V  116 (342)
T PF07893_consen   76 GSKIVAVDQS------GRTLVYDTDTRAVATGPRLHSPKRCPISVSV  116 (342)
T ss_pred             CCeEEEEcCC------CCeEEEECCCCeEeccCCCCCCCcceEEEEe
Confidence            7788887543      3477899999998888888776665544443


No 81 
>PF10902 DUF2693:  Protein of unknown function (DUF2693);  InterPro: IPR024401 This family of proteins is found in bacteria and bacteriophages. Its function is unknown. 
Probab=36.72  E-value=53  Score=16.45  Aligned_cols=18  Identities=17%  Similarity=0.362  Sum_probs=15.1

Q ss_pred             CCCeEEEEeCCCCcEEec
Q psy6079          45 YLNTTERFDPRLRQWSFI   62 (79)
Q Consensus        45 ~~~~~~~yd~~~~~W~~~   62 (79)
                      ...++.+||.+.+.|...
T Consensus        47 s~~s~~yfDve~~~WRSF   64 (83)
T PF10902_consen   47 STTSVRYFDVEKKGWRSF   64 (83)
T ss_pred             CcceEEEEEeccCceeee
Confidence            467799999999999864


No 82 
>PF11958 DUF3472:  Domain of unknown function (DUF3472);  InterPro: IPR021862  This presumed domain is functionally uncharacterised. This domain is found in bacteria, eukaryotes and viruses. This domain is typically between 174 to 190 amino acids in length. This domain has a single completely conserved residue G that may be functionally important. 
Probab=36.50  E-value=36  Score=19.49  Aligned_cols=24  Identities=25%  Similarity=0.542  Sum_probs=18.8

Q ss_pred             CeEEEEeCCCCcEEeccCCCCCCC
Q psy6079          47 NTTERFDPRLRQWSFIAPMLSPRS   70 (79)
Q Consensus        47 ~~~~~yd~~~~~W~~~~~~~~~r~   70 (79)
                      -..+++|+.+++|..|+.+..|+.
T Consensus        80 ~~~~v~d~~t~~~~~Igs~~~p~~  103 (184)
T PF11958_consen   80 YTGYVTDPTTGKWTLIGSIKRPKT  103 (184)
T ss_pred             EEEEEEECCCCcEEEEEEEEEcCc
Confidence            356789999999999987765543


No 83 
>PF13540 RCC1_2:  Regulator of chromosome condensation (RCC1) repeat; PDB: 3QI0_D 1JTD_B 3QHY_B.
Probab=35.74  E-value=34  Score=13.15  Aligned_cols=16  Identities=19%  Similarity=0.443  Sum_probs=9.6

Q ss_pred             eeeEEECCCCeEEEEcC
Q psy6079          23 LGVGVLEPEGPLYAVGG   39 (79)
Q Consensus        23 ~~~~~~~~~~~iyv~GG   39 (79)
                      |++++.. ++++|.+|-
T Consensus        10 ht~al~~-~g~v~~wG~   25 (30)
T PF13540_consen   10 HTCALTS-DGEVYCWGD   25 (30)
T ss_dssp             EEEEEE--TTEEEEEE-
T ss_pred             EEEEEEc-CCCEEEEcC
Confidence            3444332 889999984


No 84 
>PF06335 DUF1054:  Protein of unknown function (DUF1054);  InterPro: IPR009403 This entry consists of several hypothetical bacterial proteins of unknown function.; PDB: 2A8E_A 3MQZ_A.
Probab=35.52  E-value=71  Score=18.71  Aligned_cols=34  Identities=9%  Similarity=0.105  Sum_probs=24.5

Q ss_pred             eecCCCCCeeecCCCcc-----ceeeeEEECCCCeEEEEcC
Q psy6079           4 SICLHLVFWFPLLLLSY-----MCLGVGVLEPEGPLYAVGG   39 (79)
Q Consensus         4 ~~~~~~~~W~~~~~~~r-----~~~~~~~~~~~~~iyv~GG   39 (79)
                      +-||..++|-...+-+|     .++.+...  ++.+||.=+
T Consensus        59 tvNPP~dTWvAf~~nKRGYK~~PHFqiGl~--~d~lFi~~a   97 (194)
T PF06335_consen   59 TVNPPKDTWVAFSPNKRGYKMLPHFQIGLW--DDHLFIWFA   97 (194)
T ss_dssp             SSS--SEEEEEEESSSS-STTS-EEEEEEE--TTEEEEEEE
T ss_pred             cCCCCccchhhccCCCCccccCCceEEeec--CceEEEEEE
Confidence            35788899999987776     67788888  888888654


No 85 
>cd00837 EVH1 EVH1 (Enabled, Vasp-Homology) or WASP Homology (WH1) domain. EVH1 (Enabled, Vasp-Homology) or WASP Homology (WH1) domain. The EVH1 domain binds to other proteins at proline rich sequences in either FPPPP or PPXXF motifs. It is found in the cytoskeletal reorganization proteins Enabled VASP, and WASP, and in the synaptic scaffolding protein Homer. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
Probab=34.51  E-value=52  Score=16.84  Aligned_cols=16  Identities=19%  Similarity=0.302  Sum_probs=13.3

Q ss_pred             CeEEEEeCCCCcEEec
Q psy6079          47 NTTERFDPRLRQWSFI   62 (79)
Q Consensus        47 ~~~~~yd~~~~~W~~~   62 (79)
                      ..++.+|+.++.|...
T Consensus         9 a~v~~~~~~~~~W~~~   24 (104)
T cd00837           9 AQVYTADPSTGKWVPA   24 (104)
T ss_pred             EEEEEECCCCCceEEC
Confidence            4678889999999886


No 86 
>PF06739 SBBP:  Beta-propeller repeat;  InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=33.22  E-value=33  Score=14.14  Aligned_cols=11  Identities=27%  Similarity=0.661  Sum_probs=8.0

Q ss_pred             CCeEEEEcCCC
Q psy6079          31 EGPLYAVGGHD   41 (79)
Q Consensus        31 ~~~iyv~GG~~   41 (79)
                      .+.+||.|-.+
T Consensus        23 ~GNiYv~G~T~   33 (38)
T PF06739_consen   23 NGNIYVTGYTN   33 (38)
T ss_pred             CCCEEEEEeec
Confidence            56799998543


No 87 
>PF11090 DUF2833:  Protein of unknown function (DUF2833);  InterPro: IPR020335 This entry contains proteins with no known function.
Probab=31.92  E-value=77  Score=16.03  Aligned_cols=13  Identities=38%  Similarity=0.825  Sum_probs=10.6

Q ss_pred             CCCeEEEEcCCCC
Q psy6079          30 PEGPLYAVGGHDG   42 (79)
Q Consensus        30 ~~~~iyv~GG~~~   42 (79)
                      +.+.++.+||..+
T Consensus         7 ~~g~~lAiGG~~g   19 (86)
T PF11090_consen    7 HKGRPLAIGGNNG   19 (86)
T ss_pred             cCCeEEEEccccC
Confidence            4889999999764


No 88 
>KOG0294|consensus
Probab=31.87  E-value=1.4e+02  Score=19.15  Aligned_cols=23  Identities=30%  Similarity=0.536  Sum_probs=14.8

Q ss_pred             CCeEEEEcCCCCCCCCCeEEEEeCCCCc
Q psy6079          31 EGPLYAVGGHDGWSYLNTTERFDPRLRQ   58 (79)
Q Consensus        31 ~~~iyv~GG~~~~~~~~~~~~yd~~~~~   58 (79)
                      ++ .|++.|.++    ..+.+||..++.
T Consensus        52 s~-~~~aSGssD----etI~IYDm~k~~   74 (362)
T KOG0294|consen   52 SG-PYVASGSSD----ETIHIYDMRKRK   74 (362)
T ss_pred             cc-eeEeccCCC----CcEEEEeccchh
Confidence            55 466655443    468899987765


No 89 
>KOG1324|consensus
Probab=31.77  E-value=30  Score=20.04  Aligned_cols=9  Identities=22%  Similarity=0.737  Sum_probs=7.4

Q ss_pred             CeEEEEcCC
Q psy6079          32 GPLYAVGGH   40 (79)
Q Consensus        32 ~~iyv~GG~   40 (79)
                      +++||+||.
T Consensus       111 e~vfvIGG~  119 (190)
T KOG1324|consen  111 EMVFVIGGS  119 (190)
T ss_pred             eEEEEEcCH
Confidence            579999993


No 90 
>KOG2705|consensus
Probab=31.61  E-value=28  Score=22.43  Aligned_cols=18  Identities=22%  Similarity=0.508  Sum_probs=14.6

Q ss_pred             CCCCCeEEEEeCCCCcEE
Q psy6079          43 WSYLNTTERFDPRLRQWS   60 (79)
Q Consensus        43 ~~~~~~~~~yd~~~~~W~   60 (79)
                      +...+.++.||..||+|-
T Consensus       304 ~~~~~~v~SFN~nTN~wv  321 (471)
T KOG2705|consen  304 HDYNSVVESFNCNTNTWV  321 (471)
T ss_pred             CccceeeeeeecChhHHH
Confidence            344678999999999995


No 91 
>smart00120 HX Hemopexin-like repeats. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix  metalloproteinases family (matrixins). The HX repeats of some matrixins  bind tissue inhibitor of metalloproteinases (TIMPs).
Probab=30.25  E-value=51  Score=13.46  Aligned_cols=19  Identities=37%  Similarity=0.655  Sum_probs=13.9

Q ss_pred             CCeEEEEcCCCCCCCCCeEEEEeCCC
Q psy6079          31 EGPLYAVGGHDGWSYLNTTERFDPRL   56 (79)
Q Consensus        31 ~~~iyv~GG~~~~~~~~~~~~yd~~~   56 (79)
                      ++++|++-|       +..+.||..+
T Consensus         9 ~~~~yfFkg-------~~yw~~~~~~   27 (45)
T smart00120        9 NGKTYFFKG-------DKYWRFDPKR   27 (45)
T ss_pred             CCeEEEEeC-------CEEEEEcCCc
Confidence            568999976       5678888654


No 92 
>COG2380 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.67  E-value=56  Score=20.28  Aligned_cols=19  Identities=26%  Similarity=0.536  Sum_probs=15.2

Q ss_pred             CeEEEEeCCCCcEEeccCC
Q psy6079          47 NTTERFDPRLRQWSFIAPM   65 (79)
Q Consensus        47 ~~~~~yd~~~~~W~~~~~~   65 (79)
                      .....+|.+|+.|+.++.+
T Consensus        22 f~~~~~~vEtdew~~i~~~   40 (327)
T COG2380          22 FAEESFDVETDEWEAIPTL   40 (327)
T ss_pred             ccceeeecccccccccCcc
Confidence            4567789999999998654


No 93 
>PF14741 GH114_assoc:  N-terminal glycosyl-hydrolase-114-associated domain
Probab=29.53  E-value=1e+02  Score=16.86  Aligned_cols=22  Identities=18%  Similarity=0.273  Sum_probs=17.7

Q ss_pred             CCCeEEEEeC-CCCcEEeccCCC
Q psy6079          45 YLNTTERFDP-RLRQWSFIAPML   66 (79)
Q Consensus        45 ~~~~~~~yd~-~~~~W~~~~~~~   66 (79)
                      ..-.++.||. .+++|..+.++.
T Consensus        61 ~~W~vd~~~~~~~~~w~~vGD~t   83 (129)
T PF14741_consen   61 MRWTVDMFDFDDTSTWVKVGDLT   83 (129)
T ss_pred             ceeEEEEEEccCCCeeEEecccc
Confidence            3567888999 899999998764


No 94 
>KOG2006|consensus
Probab=27.85  E-value=92  Score=23.03  Aligned_cols=53  Identities=17%  Similarity=0.218  Sum_probs=35.9

Q ss_pred             cCCCCCeeecC-CCc------cceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEEeccCCC
Q psy6079           6 CLHLVFWFPLL-LLS------YMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWSFIAPML   66 (79)
Q Consensus         6 ~~~~~~W~~~~-~~~------r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~~~~~~~   66 (79)
                      +....+|..+. |.+      ..++.+.-.  +++.+++.|..      -...|+..+++|...+...
T Consensus       321 ~~~~~~w~~~~~P~~y~ssnwpiry~s~~~--dg~~lavAgl~------gla~y~~~~~~Wk~f~~~s  380 (1023)
T KOG2006|consen  321 ITDIEKWDHLALPLTYTSSNWPIRYVSIDY--DGKHLAVAGLH------GLARYSLSSKRWKLFGDES  380 (1023)
T ss_pred             ccchhhhheeecccccccccCceeEEeecC--CccEEEEeecc------cceEEEeecceeecCCccc
Confidence            45667888876 333      133444444  88889888854      3567899999999876543


No 95 
>PF00186 DHFR_1:  Dihydrofolate reductase;  InterPro: IPR001796 Dihydrofolate reductase (DHFR) (1.5.1.3 from EC) catalyses the NADPH-dependent reduction of dihydrofolate to tetrahydrofolate, an essential step in de novo synthesis both of glycine and of purines and deoxythymidine phosphate (the precursors of DNA synthesis) [], and important also in the conversion of deoxyuridine monophosphate to deoxythymidine monophosphate. Although DHFR is found ubiquitously in prokaryotes and eukaryotes, and is found in all dividing cells, maintaining levels of fully reduced folate coenzymes, the catabolic steps are still not well understood []. Bacterial species possesses distinct DHFR enzymes (based on their pattern of binding diaminoheterocyclic molecules), but mammalian DHFRs are highly similar []. The active site is situated in the N-terminal half of the sequence, which includes a conserved Pro-Trp dipeptide; the tryptophan has been shown [] to be involved in the binding of substrate by the enzyme. Its central role in DNA precursor synthesis, coupled with its inhibition by antagonists such as trimethoprim and methotrexate, which are used as anti-bacterial or anti-cancer agents, has made DHFR a target of anticancer chemotherapy. However, resistance has developed against some drugs, as a result of changes in DHFR itself [].; GO: 0004146 dihydrofolate reductase activity, 0006545 glycine biosynthetic process, 0009165 nucleotide biosynthetic process, 0055114 oxidation-reduction process; PDB: 1ZDR_B 3SA2_B 3JWK_B 3E0B_A 3S9U_B 3FL9_H 3FL8_F 2QK8_A 3JW3_A 3SA1_B ....
Probab=27.56  E-value=39  Score=18.88  Aligned_cols=9  Identities=22%  Similarity=0.822  Sum_probs=6.5

Q ss_pred             CeEEEEcCC
Q psy6079          32 GPLYAVGGH   40 (79)
Q Consensus        32 ~~iyv~GG~   40 (79)
                      +.+||+||.
T Consensus        92 ~~i~ViGG~  100 (161)
T PF00186_consen   92 EEIFVIGGA  100 (161)
T ss_dssp             SEEEEEE-H
T ss_pred             CcEEEECCH
Confidence            579999993


No 96 
>COG4334 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.16  E-value=76  Score=16.92  Aligned_cols=14  Identities=29%  Similarity=0.458  Sum_probs=11.2

Q ss_pred             eEEECCCCeEEEEcCC
Q psy6079          25 VGVLEPEGPLYAVGGH   40 (79)
Q Consensus        25 ~~~~~~~~~iyv~GG~   40 (79)
                      ++.+  ++.+||-|+.
T Consensus        41 ~v~v--DgdLyVrgy~   54 (131)
T COG4334          41 FVYV--DGDLYVRGYR   54 (131)
T ss_pred             EEEE--CCceEEeecC
Confidence            4567  9999999874


No 97 
>KOG0282|consensus
Probab=27.12  E-value=1.9e+02  Score=19.65  Aligned_cols=12  Identities=33%  Similarity=0.803  Sum_probs=9.3

Q ss_pred             CC-eEEEEcCCCC
Q psy6079          31 EG-PLYAVGGHDG   42 (79)
Q Consensus        31 ~~-~iyv~GG~~~   42 (79)
                      ++ .+|++||.+.
T Consensus       310 d~~n~fl~G~sd~  322 (503)
T KOG0282|consen  310 DNQNIFLVGGSDK  322 (503)
T ss_pred             CCCcEEEEecCCC
Confidence            44 8999999764


No 98 
>PLN03215 ascorbic acid mannose pathway regulator 1; Provisional
Probab=27.01  E-value=1.4e+02  Score=19.38  Aligned_cols=27  Identities=22%  Similarity=0.192  Sum_probs=15.5

Q ss_pred             CCCeeecCCCccceeeeEEECCCCeEEEE
Q psy6079           9 LVFWFPLLLLSYMCLGVGVLEPEGPLYAV   37 (79)
Q Consensus         9 ~~~W~~~~~~~r~~~~~~~~~~~~~iyv~   37 (79)
                      .+.|+.+..+.-..--++..  +|++|++
T Consensus       189 ~~~Wt~l~~~~~~~~DIi~~--kGkfYAv  215 (373)
T PLN03215        189 GNVLKALKQMGYHFSDIIVH--KGQTYAL  215 (373)
T ss_pred             CCeeeEccCCCceeeEEEEE--CCEEEEE
Confidence            37899886533333334444  6666665


No 99 
>KOG2967|consensus
Probab=26.87  E-value=39  Score=21.30  Aligned_cols=13  Identities=23%  Similarity=0.409  Sum_probs=9.5

Q ss_pred             CCeEEEEcCCCCC
Q psy6079          31 EGPLYAVGGHDGW   43 (79)
Q Consensus        31 ~~~iyv~GG~~~~   43 (79)
                      .+++|||||.-+.
T Consensus       200 p~~vYiIGglVD~  212 (314)
T KOG2967|consen  200 PDKVYIIGGLVDK  212 (314)
T ss_pred             CCcEEEEEEEecC
Confidence            4579999996543


No 100
>PLN03113 DNA ligase 1; Provisional
Probab=26.64  E-value=1.9e+02  Score=20.70  Aligned_cols=56  Identities=13%  Similarity=0.141  Sum_probs=28.8

Q ss_pred             ceecCC--CCCeeecCCCccceeeeEEECCCCeEEEEcCCCCCC----CCCe--EEEEeCCCCcEEecc
Q psy6079           3 SSICLH--LVFWFPLLLLSYMCLGVGVLEPEGPLYAVGGHDGWS----YLNT--TERFDPRLRQWSFIA   63 (79)
Q Consensus         3 ~~~~~~--~~~W~~~~~~~r~~~~~~~~~~~~~iyv~GG~~~~~----~~~~--~~~yd~~~~~W~~~~   63 (79)
                      +.|-|.  .+.|-++.+-...+.+-     .-.+.|+||+.+..    ....  +-+||+.++.|..++
T Consensus       561 S~Y~pGkRs~~WlKlK~dy~~~~~d-----tlDlVvIGa~~G~GkR~g~~g~fLla~yd~~~~~~~~v~  624 (744)
T PLN03113        561 ATYEPSKRSNNWLKLKKDYMESIGD-----SLDLVPIAAFHGRGKRTGVYGAFLLACYDSNKEEFQSIC  624 (744)
T ss_pred             CCccCCCCCCCeEEEechhhccccc-----cccEEEEEEEeCCCCcCCccceEEEEEEcCCCCEEEEee
Confidence            356663  45788876432221110     11255566654321    1233  447999888887664


No 101
>COG0262 FolA Dihydrofolate reductase [Coenzyme metabolism]
Probab=26.55  E-value=49  Score=18.51  Aligned_cols=9  Identities=22%  Similarity=0.759  Sum_probs=7.3

Q ss_pred             CeEEEEcCC
Q psy6079          32 GPLYAVGGH   40 (79)
Q Consensus        32 ~~iyv~GG~   40 (79)
                      +.|||+||.
T Consensus        96 ~~i~IiGG~  104 (167)
T COG0262          96 EDIFIIGGG  104 (167)
T ss_pred             CeEEEEcCH
Confidence            569999993


No 102
>PF00415 RCC1:  Regulator of chromosome condensation (RCC1) repeat;  InterPro: IPR000408 The regulator of chromosome condensation (RCC1) [] is a eukaryotic protein which binds to chromatin and interacts with ran, a nuclear GTP-binding protein IPR002041 from INTERPRO, to promote the loss of bound GDP and the uptake of fresh GTP, thus acting as a guanine-nucleotide dissociation stimulator (GDS). The interaction of RCC1 with ran probably plays an important role in the regulation of gene expression. RCC1, known as PRP20 or SRM1 in yeast, pim1 in fission yeast and BJ1 in Drosophila, is a protein that contains seven tandem repeats of a domain of about 50 to 60 amino acids. As shown in the following schematic representation, the repeats make up the major part of the length of the protein. Outside the repeat region, there is just a small N-terminal domain of about 40 to 50 residues and, in the Drosophila protein only, a C-terminal domain of about 130 residues.  +----+-------+-------+-------+-------+-------+-------+-------+-------------+ |N-t.|Rpt. 1 |Rpt. 2 |Rpt. 3 |Rpt. 4 |Rpt. 5 |Rpt. 6 |Rpt. 7 | C-terminal | +----+-------+-------+-------+-------+-------+-------+-------+-------------+  The RCC1-type of repeat is also found in the X-linked retinitis pigmentosa GTPase regulator []. The RCC repeats form a beta-propeller structure.; PDB: 3MVD_L 3OF7_A 1I2M_D 1A12_B 3KCI_A 4DNV_B 4DNU_A 4D9S_B 4DNW_A.
Probab=26.29  E-value=52  Score=13.85  Aligned_cols=9  Identities=33%  Similarity=0.870  Sum_probs=6.9

Q ss_pred             CCeEEEEcC
Q psy6079          31 EGPLYAVGG   39 (79)
Q Consensus        31 ~~~iyv~GG   39 (79)
                      ++++|.+|-
T Consensus         1 dG~vy~wG~    9 (51)
T PF00415_consen    1 DGRVYSWGS    9 (51)
T ss_dssp             TSEEEEEEE
T ss_pred             CCcEEEEEC
Confidence            468899983


No 103
>TIGR02608 delta_60_rpt delta-60 repeat domain. This domain occurs in tandem repeats, as many as 13, in proteins from Bdellovibrio bacteriovorus, Azotobacter vinelandii, Geobacter sulfurreducens, Pirellula sp. 1, Myxococcus xanthus, and others, many of which are Deltaproteobacteria. The periodicity of the repeat ranges from about 57 to 61 amino acids, and a core region of about 54 is represented by this model and seed alignment.
Probab=26.05  E-value=80  Score=14.37  Aligned_cols=26  Identities=15%  Similarity=0.179  Sum_probs=15.0

Q ss_pred             CCCeEEEEcCCCC--CCCCCeEEEEeCC
Q psy6079          30 PEGPLYAVGGHDG--WSYLNTTERFDPR   55 (79)
Q Consensus        30 ~~~~iyv~GG~~~--~~~~~~~~~yd~~   55 (79)
                      .+++|++.|-...  ......+.+|++.
T Consensus        10 ~DGkIlv~G~~~~~~~~~~~~l~Rln~D   37 (55)
T TIGR02608        10 SDGKILVAGYVDNSSGNNDFVLARLNAD   37 (55)
T ss_pred             CCCcEEEEEEeecCCCcccEEEEEECCC
Confidence            4889998885432  1223346666654


No 104
>PF10049 DUF2283:  Protein of unknown function (DUF2283);  InterPro: IPR019270  Members of this family of hypothetical proteins have no known function. 
Probab=25.98  E-value=38  Score=14.88  Aligned_cols=11  Identities=9%  Similarity=-0.239  Sum_probs=7.3

Q ss_pred             CcceecCCCCC
Q psy6079           1 MQSSICLHLVF   11 (79)
Q Consensus         1 ~~~~~~~~~~~   11 (79)
                      |.-+||++++.
T Consensus         1 Mki~YD~~~D~   11 (50)
T PF10049_consen    1 MKIEYDPEADA   11 (50)
T ss_pred             CEeEEcCcCCE
Confidence            56677777664


No 105
>smart00284 OLF Olfactomedin-like domains.
Probab=25.90  E-value=1.7e+02  Score=17.97  Aligned_cols=54  Identities=11%  Similarity=0.060  Sum_probs=27.9

Q ss_pred             ceecCCCC----CeeecCCCccceeeeEEECCCCeEEEEcCCCCCCCCCeEEEEeCCCCcEE
Q psy6079           3 SSICLHLV----FWFPLLLLSYMCLGVGVLEPEGPLYAVGGHDGWSYLNTTERFDPRLRQWS   60 (79)
Q Consensus         3 ~~~~~~~~----~W~~~~~~~r~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~yd~~~~~W~   60 (79)
                      ++.||.+-    .|..--+-+..+ .+.++  =+.+|++-.... ....-.+.||..+++=+
T Consensus       156 SkLnp~tL~ve~tW~T~~~k~sa~-naFmv--CGvLY~~~s~~~-~~~~I~yayDt~t~~~~  213 (255)
T smart00284      156 SKLNPATLTIENTWITTYNKRSAS-NAFMI--CGILYVTRSLGS-KGEKVFYAYDTNTGKEG  213 (255)
T ss_pred             EeeCcccceEEEEEEcCCCccccc-ccEEE--eeEEEEEccCCC-CCcEEEEEEECCCCccc
Confidence            45666653    555522222222 23333  356888853221 12345778999988743


No 106
>cd03474 Rieske_T4moC Toluene-4-monooxygenase effector protein complex (T4mo), Rieske ferredoxin subunit; The Rieske domain is a [2Fe-2S] cluster binding domain involved in electron transfer. T4mo is a four-protein complex that catalyzes the NADH- and O2-dependent hydroxylation of toluene to form p-cresol. T4mo consists of an NADH oxidoreductase (T4moF), a diiron hydroxylase (T4moH), a catalytic effector protein (T4moD), and a Rieske ferredoxin (T4moC). T4moC contains a Rieske domain and functions as an obligate electron carrier between T4moF and T4moH. Rieske ferredoxins are found as subunits of membrane oxidase complexes, cis-dihydrodiol-forming aromatic dioxygenases, bacterial assimilatory nitrite reductases, and arsenite oxidase. Rieske ferredoxins are also found as soluble electron carriers in bacterial dioxygenase and monooxygenase complexes.
Probab=25.19  E-value=1.1e+02  Score=15.46  Aligned_cols=6  Identities=17%  Similarity=0.446  Sum_probs=2.5

Q ss_pred             CCeEEE
Q psy6079          31 EGPLYA   36 (79)
Q Consensus        31 ~~~iyv   36 (79)
                      ++.|||
T Consensus        92 ~g~v~v   97 (108)
T cd03474          92 GGDILV   97 (108)
T ss_pred             CCEEEE
Confidence            444443


No 107
>TIGR03352 VI_chp_3 type VI secretion lipoprotein, VC_A0113 family. Work by Mougous, et al. (2006), describes IAHP-related loci as a type VI secretion system (PubMed:16763151). This protein family is associated with type VI secretion loci, although not treated explicitly by Mougous, et al.
Probab=24.85  E-value=1.3e+02  Score=16.51  Aligned_cols=24  Identities=8%  Similarity=0.083  Sum_probs=17.7

Q ss_pred             EEEEeCCCCcEEeccCCCCCCCce
Q psy6079          49 TERFDPRLRQWSFIAPMLSPRSTA   72 (79)
Q Consensus        49 ~~~yd~~~~~W~~~~~~~~~r~~~   72 (79)
                      ...+|+...+|..+-+.+..+...
T Consensus       113 A~fr~~~~~~Wr~~~~~~~~~~~~  136 (146)
T TIGR03352       113 AAYRDIDNAEWRVVYKVPPKGKYK  136 (146)
T ss_pred             EEeecCCCCceEEEEecCCCCcee
Confidence            445788889999987777666443


No 108
>PF00352 TBP:  Transcription factor TFIID (or TATA-binding protein, TBP);  InterPro: IPR000814 The TATA-box binding protein (TBP) is required for the initiation of transcription by RNA polymerases I, II and III, from promoters with or without a TATA box [, ]. TBP associates with a host of factors, including the general transcription factors TFIIA, -B, -D, -E, and -H, to form huge multi-subunit pre-initiation complexes on the core promoter. Through its association with different transcription factors, TBP can initiate transcription from different RNA polymerases. There are several related TBPs, including TBP-like (TBPL) proteins []. The C-terminal core of TBP (~180 residues) is highly conserved and contains two 77-amino acid repeats that produce a saddle-shaped structure that straddles the DNA; this region binds to the TATA box and interacts with transcription factors and regulatory proteins []. By contrast, the N-terminal region varies in both length and sequence.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0006367 transcription initiation from RNA polymerase II promoter; PDB: 1D3U_A 1PCZ_B 1AIS_A 1NGM_A 1TBP_A 1TBA_B 1YTB_A 1RM1_A 1YTF_A 1NH2_A ....
Probab=24.64  E-value=1e+02  Score=15.07  Aligned_cols=10  Identities=20%  Similarity=0.494  Sum_probs=7.7

Q ss_pred             CCeEEEEcCC
Q psy6079          31 EGPLYAVGGH   40 (79)
Q Consensus        31 ~~~iyv~GG~   40 (79)
                      .|++.+.|..
T Consensus        57 sGki~itGak   66 (86)
T PF00352_consen   57 SGKIVITGAK   66 (86)
T ss_dssp             TSEEEEEEES
T ss_pred             CCEEEEEecC
Confidence            7888888863


No 109
>PRK10769 folA dihydrofolate reductase; Provisional
Probab=24.47  E-value=55  Score=18.26  Aligned_cols=9  Identities=22%  Similarity=0.575  Sum_probs=7.2

Q ss_pred             CeEEEEcCC
Q psy6079          32 GPLYAVGGH   40 (79)
Q Consensus        32 ~~iyv~GG~   40 (79)
                      +.|||+||.
T Consensus        89 ~~I~viGG~   97 (159)
T PRK10769         89 PEIMVIGGG   97 (159)
T ss_pred             CCEEEECcH
Confidence            469999993


No 110
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=24.17  E-value=1.2e+02  Score=20.74  Aligned_cols=17  Identities=12%  Similarity=0.198  Sum_probs=14.3

Q ss_pred             CeEEEEeCCCCcEEecc
Q psy6079          47 NTTERFDPRLRQWSFIA   63 (79)
Q Consensus        47 ~~~~~yd~~~~~W~~~~   63 (79)
                      .+++.|||.++.-+++.
T Consensus       287 GdIylydP~td~lekld  303 (668)
T COG4946         287 GDIYLYDPETDSLEKLD  303 (668)
T ss_pred             CcEEEeCCCcCcceeee
Confidence            57899999999988763


No 111
>PF00045 Hemopexin:  Hemopexin;  InterPro: IPR018487 Hemopexin (3.2.1.35 from EC) is a serum glycoprotein that binds haem and transports it to the liver for breakdown and iron recovery, after which the free hemopexin returns to the circulation []. Hemopexin prevents haem-mediated oxidative stress. Structurally hemopexin consists of two similar halves of approximately two hundred amino acid residues connected by a histidine-rich hinge region. Each half is itself formed by the repetition of a basic unit of some 35 to 45 residues. Hemopexin-like domains have been found in two other types of proteins, vitronectin [], a cell adhesion and spreading factor found in plasma and tissues, and matrixins MMP-1, MMP-2, MMP-3, MMP-9, MMP-10, MMP-11, MMP-12, MMP-14, MMP-15 and MMP-16, members of the matrix metalloproteinase family that cleave extracellular matrix constituents []. These zinc endopeptidases, which belong to MEROPS peptidase subfamily M10A, have a single hemopexin-like domain in their C-terminal section. It is suggested that the hemopexin domain facilitates binding to a variety of molecules and proteins, for example the HX repeats of some matrixins bind tissue inhibitor of metallopeptidases (TIMPs). This entry represents the repeat found in hempoxein and related domains.; PDB: 1SU3_B 2CLT_B 3V6N_B 3C7X_A 1GXD_A 1RTG_A 1CK7_A 1GEN_A 1ITV_A 2JXY_A ....
Probab=23.89  E-value=74  Score=13.24  Aligned_cols=18  Identities=39%  Similarity=0.556  Sum_probs=13.8

Q ss_pred             CCeEEEEcCCCCCCCCCeEEEEeCC
Q psy6079          31 EGPLYAVGGHDGWSYLNTTERFDPR   55 (79)
Q Consensus        31 ~~~iyv~GG~~~~~~~~~~~~yd~~   55 (79)
                      ++++|+|-|       +..++||..
T Consensus         9 ~g~~yfFkg-------~~ywr~~~~   26 (45)
T PF00045_consen    9 NGKTYFFKG-------NQYWRFDES   26 (45)
T ss_dssp             TTEEEEEET-------TEEEEEETT
T ss_pred             CCcEEEEEC-------CEEEEEcCc
Confidence            389999976       567888776


No 112
>PF15494 SRCR_2:  Scavenger receptor cysteine-rich domain
Probab=23.14  E-value=1.2e+02  Score=15.21  Aligned_cols=17  Identities=6%  Similarity=0.243  Sum_probs=14.0

Q ss_pred             CeEEEEeCCCCcEEecc
Q psy6079          47 NTTERFDPRLRQWSFIA   63 (79)
Q Consensus        47 ~~~~~yd~~~~~W~~~~   63 (79)
                      +.+++|...+..|..+.
T Consensus         4 s~LQV~~~~~~~W~~VC   20 (98)
T PF15494_consen    4 SLLQVYSAASGSWLPVC   20 (98)
T ss_pred             cEEEEEecCCCCEeeec
Confidence            56889999999999774


No 113
>smart00461 WH1 WASP homology region 1. Region of the Wiskott-Aldrich syndrome protein (WASp) that contains point mutations in the majority of patients with WAS. Unknown function. Ena-like WH1 domains bind polyproline-containing peptides, and that Homer contains a WH1 domain.
Probab=23.11  E-value=1.2e+02  Score=15.62  Aligned_cols=17  Identities=12%  Similarity=0.417  Sum_probs=13.5

Q ss_pred             CeEEEEeCCCCcEEecc
Q psy6079          47 NTTERFDPRLRQWSFIA   63 (79)
Q Consensus        47 ~~~~~yd~~~~~W~~~~   63 (79)
                      ..+..|+..++.|.+..
T Consensus        11 avV~~y~~~~~~W~~~~   27 (106)
T smart00461       11 AVVQLYDADTKKWVPTG   27 (106)
T ss_pred             EEEEEEeCCCCCeEECC
Confidence            46788999999998754


No 114
>PF12118 SprA-related:  SprA-related family;  InterPro: IPR021973  Most members of this uncharacterised protein family are annotated as being SprA-related. 
Probab=22.78  E-value=52  Score=20.54  Aligned_cols=10  Identities=70%  Similarity=1.361  Sum_probs=8.9

Q ss_pred             CCCeEEEEcC
Q psy6079          30 PEGPLYAVGG   39 (79)
Q Consensus        30 ~~~~iyv~GG   39 (79)
                      .|++.|.+||
T Consensus       182 pDg~~YavgG  191 (294)
T PF12118_consen  182 PDGKRYAVGG  191 (294)
T ss_pred             CCCceeeccc
Confidence            5889999998


No 115
>PF06462 Hyd_WA:  Propeller;  InterPro: IPR006624  Tectonins I and II are two dominant proteins in the nuclei and nuclear matrix from plasmodia of Physarum polycephalum (Slime mold) which encode 217 and 353 amino acids, respectively. Tectonin I is homologous to the C-terminal two-thirds of tectonin II. Both proteins contain six tandem repeats that are each 33-37 amino acids in length and define a new consensus sequence. Homologous repeats are found in L-6, a bacterial lipopolysaccharide-binding lectin from horseshoe crab hemocytes. The repetitive sequences of the tectonins and L-6 are reminiscent of the WD repeats of the beta-subunit of G proteins, suggesting that they form beta-propeller domains. The tectonins may be lectins that function as part of a transmembrane signalling complex during phagocytosis [].
Probab=22.70  E-value=68  Score=12.67  Aligned_cols=12  Identities=8%  Similarity=-0.006  Sum_probs=7.1

Q ss_pred             eecCCCCCeeec
Q psy6079           4 SICLHLVFWFPL   15 (79)
Q Consensus         4 ~~~~~~~~W~~~   15 (79)
                      ..+|....|..+
T Consensus        20 ~~~P~G~~W~~i   31 (32)
T PF06462_consen   20 PSNPEGTSWEHI   31 (32)
T ss_pred             CCCCCCCCcEEC
Confidence            345666667654


No 116
>cd03475 Rieske_SoxF_SoxL SoxF and SoxL family, Rieske domain; The Rieske domain is a [2Fe-2S] cluster binding domain involved in electron transfer. SoxF is a subunit of the terminal oxidase supercomplex SoxM in the plasma membrane of Sulfolobus acidocaldarius that combines features of a cytochrome bc(1) complex and a cytochrome. The Rieske domain of SoxF has a 12 residue insertion which is not found in eukaryotic and bacterial Rieske proteins and is thought to influence the redox properties of the iron-sulfur cluster. SoxL is a Rieske protein which may be part of an archaeal bc-complex homologue whose physiological function is still unknown. SoxL has two features not seen in other Rieske proteins; (i) a significantly greater distance between the two cluster-binding sites and  (ii) an unexpected Pro - Asp substitution at one of the cluster binding sites. SoxF and SoxL are found in archaea and in bacteria.
Probab=22.18  E-value=1.7e+02  Score=16.79  Aligned_cols=7  Identities=71%  Similarity=1.123  Sum_probs=4.8

Q ss_pred             CeEEEEc
Q psy6079          32 GPLYAVG   38 (79)
Q Consensus        32 ~~iyv~G   38 (79)
                      +.||++|
T Consensus       154 d~iyAvG  160 (171)
T cd03475         154 DDLYAVG  160 (171)
T ss_pred             CcEEEEe
Confidence            4677776


No 117
>PRK09965 3-phenylpropionate dioxygenase ferredoxin subunit; Provisional
Probab=22.12  E-value=1.3e+02  Score=15.25  Aligned_cols=6  Identities=17%  Similarity=0.678  Sum_probs=2.5

Q ss_pred             CCeEEE
Q psy6079          31 EGPLYA   36 (79)
Q Consensus        31 ~~~iyv   36 (79)
                      ++.|||
T Consensus        94 ~g~v~v   99 (106)
T PRK09965         94 GGDIFI   99 (106)
T ss_pred             CCEEEE
Confidence            444443


No 118
>PF04096 Nucleoporin2:  Nucleoporin autopeptidase;  InterPro: IPR007230 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This group of autocatalytic serine endopeptidases belong to MEROPS peptidase family S59 (clan SP). The nuclear pore complex protein plays a role in bidirectional transport across the nucleoporin complex in nucleocytoplasmic transport. The mammalian nuclear pore complex (NPC) is comprised of approximately 50 unique proteins, collectively known as nucleoporins. A number of the peptides are synthesised as precursors and undergo self-catalyzed cleavage.  The proteolytic cleavage site of yeast Nup145p has been mapped upstream of an evolutionary conserved serine residue. Cleavage occurs at the same site when a precursor is artificially expressed in Escherichia coli. A hydroxyl-containing residue is critical for the reaction, although a thiol-containing residue offers an acceptable replacement. In vitro kinetics experiments using a purified precursor molecule demonstrate that the cleavage is self-catalyzed and that the catalytic domain lies within the N-terminal moiety. Taken altogether, the data are consistent with a proteolytic mechanism involving an N>O acyl rearrangement and a subsequent ester intermediate uncovered in other self-processing proteins []. Nup98 is a component of the nuclear pore that plays its primary role in the export of RNAs. Nup98 is expressed in two forms, derived from alternate mRNA splicing. Both forms are processed into two peptides through autoproteolysis mediated by the C-terminal domain of hNup98. The three-dimensional structure of the C-terminal domain reveals a novel protein fold, and thus a new class of autocatalytic proteases. The structure further reveals that the suggested nucleoporin RNA binding motif is unlikely to bind to RNA []. The following nucleoporins share an ~150-residue C-terminal domain responsible for NPC targeting [, ]:  Vertebrate Nup98, a component of the nuclear pore that plays its primary role in the export of RNAs.  Yeast Nup100, plays an important role in several nuclear export and import pathways including poly(A)+ RNA and protein transport.  Yeast Nup116, involved in mRNA export and protein transport.  Yeast Nup145, involved in nuclear poly(A)+ RNA and tRNA export.  The NUP C-terminal domains of Nup98 and Nup145 possess peptidase S59 autoproteolytic activity. The autoproteolytic sites of Nup98 and Nup145 each occur immediately C-terminal to the NUP C-terminal domain. Thus, although this domain occurs in the middle of each precursor polypeptide, it winds up at the C-terminal end of the N-terminal cleavage product. Cleavage of the peptide chains are necessary for the proper targeting to the nuclear pore [, ]. The NUP C-terminal domain adopts a predominantly beta-strand structure. The molecule consists of a six-stranded beta-sheet sandwiched against a two-stranded beta-sheet and flanked by alpha-helical regions. The N-terminal helical region consists of two short helices, whereas the stretch on the opposite side of molecule consists of a single, longer helix [, ].; GO: 0006810 transport, 0005643 nuclear pore; PDB: 3PBP_E 2AIV_A 1KO6_A 2Q5X_A 2Q5Y_C 3KEP_A 3KES_A 3NF5_B 3TKN_I.
Probab=21.95  E-value=1.2e+02  Score=16.64  Aligned_cols=16  Identities=25%  Similarity=0.596  Sum_probs=12.7

Q ss_pred             CeEEEEeCCCCcEEec
Q psy6079          47 NTTERFDPRLRQWSFI   62 (79)
Q Consensus        47 ~~~~~yd~~~~~W~~~   62 (79)
                      ..+..||+.+..|.-.
T Consensus       122 ~~FvsYd~~tG~W~F~  137 (141)
T PF04096_consen  122 AEFVSYDPETGTWVFR  137 (141)
T ss_dssp             SEEEEEETTTTEEEEE
T ss_pred             CEEEEEeCCCcEEEEE
Confidence            4567899999999753


No 119
>cd00094 HX Hemopexin-like repeats.; Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs). This CD contains 4 instances of the repeat.
Probab=21.77  E-value=1.7e+02  Score=16.59  Aligned_cols=22  Identities=23%  Similarity=0.459  Sum_probs=15.3

Q ss_pred             CCeEEEEcCCCCCCCCCeEEEEeCCCCcE
Q psy6079          31 EGPLYAVGGHDGWSYLNTTERFDPRLRQW   59 (79)
Q Consensus        31 ~~~iyv~GG~~~~~~~~~~~~yd~~~~~W   59 (79)
                      ++++|++.|       +..+.||..+++.
T Consensus       110 ~~~~yfFkg-------~~y~ry~~~~~~v  131 (194)
T cd00094         110 NGKTYFFKG-------DKYWRYDEKTQKM  131 (194)
T ss_pred             CCEEEEEeC-------CEEEEEeCCCccc
Confidence            578999977       4567777655543


No 120
>COG2131 ComEB Deoxycytidylate deaminase [Nucleotide transport and metabolism]
Probab=21.74  E-value=1.6e+02  Score=16.80  Aligned_cols=25  Identities=12%  Similarity=0.110  Sum_probs=16.0

Q ss_pred             CccceeeeEEECCCCeEEEEcCCCCCC
Q psy6079          18 LSYMCLGVGVLEPEGPLYAVGGHDGWS   44 (79)
Q Consensus        18 ~~r~~~~~~~~~~~~~iyv~GG~~~~~   44 (79)
                      -+|...+++.+  .++-.|..|+++..
T Consensus        25 c~r~~VGAvIv--kd~rIiatGYNG~p   49 (164)
T COG2131          25 CPRRQVGAVIV--KDGRIIATGYNGAP   49 (164)
T ss_pred             CcccceeEEEE--eCCeEEEeecCCCC
Confidence            35666777777  55556666777643


No 121
>PF01743 PolyA_pol:  Poly A polymerase head domain;  InterPro: IPR002646 This group includes nucleic acid independent RNA polymerases, such as polynucleotide adenylyltransferase (2.7.7.19 from EC), which adds the poly (A) tail to mRNA. This group also includes the tRNA nucleotidyltransferase that adds the CCA to the 3' of the tRNA 2.7.7.25 from EC.; GO: 0003723 RNA binding, 0016779 nucleotidyltransferase activity, 0006396 RNA processing; PDB: 1VFG_A 3H38_A 3H3A_B 3H39_B 3H37_A 1MIY_A 1MIV_B 1MIW_B 1OU5_B 3AQN_A ....
Probab=21.50  E-value=38  Score=17.93  Aligned_cols=7  Identities=57%  Similarity=1.009  Sum_probs=4.8

Q ss_pred             EEEEcCC
Q psy6079          34 LYAVGGH   40 (79)
Q Consensus        34 iyv~GG~   40 (79)
                      +|++||+
T Consensus         1 ~ylVGG~    7 (126)
T PF01743_consen    1 AYLVGGA    7 (126)
T ss_dssp             EEEETHH
T ss_pred             CEEecHH
Confidence            4777774


No 122
>PF12091 DUF3567:  Protein of unknown function (DUF3567);  InterPro: IPR021951  This family of proteins is functionally uncharacterised. This protein is found in bacteria. Proteins in this family are about 90 amino acids in length. This protein has a conserved EIVDK sequence motif. 
Probab=21.33  E-value=1.3e+02  Score=15.17  Aligned_cols=39  Identities=10%  Similarity=-0.055  Sum_probs=20.2

Q ss_pred             CcceecCCCCCeeecC------CCccceeeeEEECCCCeEEEEcC
Q psy6079           1 MQSSICLHLVFWFPLL------LLSYMCLGVGVLEPEGPLYAVGG   39 (79)
Q Consensus         1 ~~~~~~~~~~~W~~~~------~~~r~~~~~~~~~~~~~iyv~GG   39 (79)
                      |+-.||...-.--.++      .+.|.++-.+=-..+..||+-|-
T Consensus         1 M~miYnSd~y~VV~~~~~~~~~~l~~gGyEIVDK~~~rEifi~G~   45 (85)
T PF12091_consen    1 MQMIYNSDNYCVVEFPPDAGHPALARGGYEIVDKNARREIFIDGS   45 (85)
T ss_pred             CceeecCCceEEEEecCCCCccchhcCCcEEeecCCCceEEeCcH
Confidence            5666776653333333      23355655554323456777654


No 123
>PF00568 WH1:  WH1 domain;  InterPro: IPR000697 The EVH1 (WH1, RanBP1-WASP) domain is found in multi-domain proteins implicated in a diverse range of signalling, nuclear transport and cytoskeletal events. This domain of around 115 amino acids is present in species ranging from yeast to mammals. Many EVH1-containing proteins associate with actin-based structures and play a role in cytoskeletal organisation. EVH1 domains recognise and bind the proline-rich motif FPPPP with low-affinity, further interactions then form between flanking residues [][].  WASP family proteins contain a EVH1 (WH1) in their N-terminals which bind proline-rich sequences in the WASP interacting protein. Proteins of the RanBP1 family contain a WH1 domain in their N-terminal region, which seems to bind a different sequence motif present in the C-terminal part of RanGTP protein [,].  Tertiary structure of the WH1 domain of the Mena protein revealed structure similarities with the pleckstrin homology (PH) domain. The overall fold consists of a compact parallel beta-sandwich, closed along one edge by a long alpha-helix. A highly conserved cluster of three surface-exposed aromatic side-chains forms the recognition site for the molecules target ligands. [].; GO: 0005515 protein binding; PDB: 1I2H_A 1DDV_A 1DDW_A 1EGX_A 3SYX_A 1TJ6_B 1XOD_B 1EVH_A 1I7A_B 2JP2_A ....
Probab=20.72  E-value=1.4e+02  Score=15.33  Aligned_cols=15  Identities=27%  Similarity=0.603  Sum_probs=10.8

Q ss_pred             eEEEEeCCCCc-EEec
Q psy6079          48 TTERFDPRLRQ-WSFI   62 (79)
Q Consensus        48 ~~~~yd~~~~~-W~~~   62 (79)
                      .++.+||.+++ |...
T Consensus        17 ~v~~~~p~~~~~W~~~   32 (111)
T PF00568_consen   17 QVYQADPDTKRQWSPV   32 (111)
T ss_dssp             EEEEEETTTSESEEES
T ss_pred             EEEEEEcCCCCcEeeC
Confidence            46677887777 8874


No 124
>PF04122 CW_binding_2:  Putative cell wall binding repeat 2;  InterPro: IPR007253 This repeat is found in multiple tandem copies in proteins including amidase enhancers [] and adhesins [].
Probab=20.38  E-value=79  Score=15.46  Aligned_cols=11  Identities=27%  Similarity=0.643  Sum_probs=8.0

Q ss_pred             CCeEEEEcCCC
Q psy6079          31 EGPLYAVGGHD   41 (79)
Q Consensus        31 ~~~iyv~GG~~   41 (79)
                      ..++|++||..
T Consensus        73 ~~~v~iiGg~~   83 (92)
T PF04122_consen   73 IKKVYIIGGEG   83 (92)
T ss_pred             CCEEEEECCCC
Confidence            35789999853


No 125
>KOG2445|consensus
Probab=20.36  E-value=2.3e+02  Score=18.22  Aligned_cols=36  Identities=28%  Similarity=0.457  Sum_probs=21.8

Q ss_pred             CCeEEEEcCCCCCCCCC--eEEEEeCCCCcEEeccCCC
Q psy6079          31 EGPLYAVGGHDGWSYLN--TTERFDPRLRQWSFIAPML   66 (79)
Q Consensus        31 ~~~iyv~GG~~~~~~~~--~~~~yd~~~~~W~~~~~~~   66 (79)
                      ...++++|--......+  .++.|+...++|.+++.++
T Consensus       183 ~~p~iAvgs~e~a~~~~~~~Iye~~e~~rKw~kva~L~  220 (361)
T KOG2445|consen  183 HEPLIAVGSDEDAPHLNKVKIYEYNENGRKWLKVAELP  220 (361)
T ss_pred             cCceEEEEcccCCccccceEEEEecCCcceeeeehhcC
Confidence            34566666433333333  4566777788999986553


Done!