Query         psy7050
Match_columns 61
No_of_seqs    137 out of 1598
Neff          10.0
Searched_HMMs 46136
Date          Fri Aug 16 16:52:36 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy7050.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7050hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PLN03134 glycine-rich RNA-bind  99.7 1.7E-16 3.7E-21   81.4   6.8   52   10-61     31-82  (144)
  2 KOG0149|consensus               99.7 1.6E-16 3.4E-21   85.9   6.1   50   12-61     11-60  (247)
  3 KOG4207|consensus               99.7 3.6E-16 7.8E-21   83.7   5.8   61    1-61      1-61  (256)
  4 TIGR01661 ELAV_HUD_SF ELAV/HuD  99.6 2.7E-15 5.8E-20   84.8   7.7   51   11-61    267-317 (352)
  5 TIGR01661 ELAV_HUD_SF ELAV/HuD  99.6 1.6E-15 3.4E-20   85.8   6.8   49   13-61      3-51  (352)
  6 TIGR01659 sex-lethal sex-letha  99.6 2.7E-15 5.8E-20   85.6   7.5   53    9-61    103-155 (346)
  7 TIGR01648 hnRNP-R-Q heterogene  99.6   2E-14 4.2E-19   86.1   7.4   58    3-61     43-105 (578)
  8 TIGR01659 sex-lethal sex-letha  99.6 1.6E-14 3.4E-19   82.5   6.6   52   10-61    190-241 (346)
  9 KOG0113|consensus               99.5 1.8E-14 3.9E-19   80.3   5.8   51   11-61     99-149 (335)
 10 KOG0122|consensus               99.5 2.1E-14 4.6E-19   78.2   5.4   53    9-61    185-237 (270)
 11 TIGR01645 half-pint poly-U bin  99.5 3.7E-14 8.1E-19   85.3   6.5   51   11-61    105-155 (612)
 12 KOG0126|consensus               99.5   3E-15 6.5E-20   78.9   1.3   52    9-60     31-82  (219)
 13 KOG0121|consensus               99.5 2.9E-14 6.2E-19   71.7   4.4   51   11-61     34-84  (153)
 14 KOG0117|consensus               99.5 7.1E-14 1.5E-18   81.2   6.0   51   11-61     81-131 (506)
 15 PF00076 RRM_1:  RNA recognitio  99.5 1.2E-13 2.7E-18   62.4   5.5   45   16-61      1-45  (70)
 16 KOG0144|consensus               99.5 1.1E-13 2.3E-18   80.3   5.3   51   11-61     32-82  (510)
 17 TIGR01622 SF-CC1 splicing fact  99.5 3.1E-13 6.7E-18   78.7   7.0   50   12-61     88-137 (457)
 18 TIGR01642 U2AF_lg U2 snRNP aux  99.5 4.3E-13 9.2E-18   78.9   7.3   50   12-61    294-343 (509)
 19 TIGR01645 half-pint poly-U bin  99.5 2.8E-13   6E-18   81.6   6.3   50   12-61    203-252 (612)
 20 KOG0125|consensus               99.4 2.3E-13   5E-18   76.8   5.1   51    9-61     92-142 (376)
 21 TIGR01628 PABP-1234 polyadenyl  99.4 4.6E-13   1E-17   79.8   6.5   48   14-61      1-48  (562)
 22 PF14259 RRM_6:  RNA recognitio  99.4   8E-13 1.7E-17   60.1   5.7   45   16-61      1-45  (70)
 23 PLN03213 repressor of silencin  99.4 5.2E-13 1.1E-17   79.0   6.2   46   12-61      9-54  (759)
 24 KOG0131|consensus               99.4 2.8E-13 6.2E-18   71.3   4.3   54    8-61      4-57  (203)
 25 PLN03120 nucleic acid binding   99.4 7.7E-13 1.7E-17   73.0   6.1   46   13-61      4-49  (260)
 26 PLN03121 nucleic acid binding   99.4   2E-12 4.3E-17   70.7   6.4   47   12-61      4-50  (243)
 27 TIGR01622 SF-CC1 splicing fact  99.4 2.1E-12 4.5E-17   75.3   6.4   49   13-61    186-234 (457)
 28 KOG0127|consensus               99.4 1.8E-12 3.9E-17   77.1   6.0   51   11-61    290-340 (678)
 29 KOG0145|consensus               99.4 2.5E-13 5.5E-18   75.1   2.2   60    2-61    116-175 (360)
 30 COG0724 RNA-binding proteins (  99.4 3.8E-12 8.2E-17   68.5   6.0   49   13-61    115-163 (306)
 31 KOG0130|consensus               99.3 2.5E-12 5.4E-17   65.4   4.6   55    7-61     66-120 (170)
 32 KOG0107|consensus               99.3 5.5E-12 1.2E-16   66.2   5.1   46   11-61      8-53  (195)
 33 TIGR01628 PABP-1234 polyadenyl  99.3 8.2E-12 1.8E-16   74.5   6.5   50   11-61    283-332 (562)
 34 KOG0148|consensus               99.3 6.6E-12 1.4E-16   69.6   4.7   51   10-60     59-109 (321)
 35 KOG4205|consensus               99.3 8.6E-12 1.9E-16   70.5   5.0   50   12-61      5-54  (311)
 36 KOG0111|consensus               99.3 2.2E-12 4.7E-17   70.1   1.9   52   10-61      7-58  (298)
 37 KOG0124|consensus               99.2 3.9E-12 8.4E-17   73.2   2.4   49   13-61    113-161 (544)
 38 KOG0108|consensus               99.2 1.8E-11 3.9E-16   71.7   4.9   48   14-61     19-66  (435)
 39 smart00362 RRM_2 RNA recogniti  99.2 5.9E-11 1.3E-15   53.0   5.5   45   15-61      1-45  (72)
 40 KOG0144|consensus               99.2 5.1E-12 1.1E-16   73.5   1.7   48   13-61    124-171 (510)
 41 KOG0145|consensus               99.2 6.9E-11 1.5E-15   65.6   5.9   53    9-61    274-326 (360)
 42 smart00360 RRM RNA recognition  99.2 9.6E-11 2.1E-15   52.1   4.8   44   18-61      1-44  (71)
 43 KOG0114|consensus               99.2 2.5E-10 5.4E-15   55.9   6.1   48   10-60     15-62  (124)
 44 KOG0148|consensus               99.1 1.6E-10 3.4E-15   64.3   5.3   46   10-61    161-206 (321)
 45 KOG0147|consensus               99.1 2.6E-11 5.7E-16   71.8   2.3   46   16-61    281-326 (549)
 46 KOG0127|consensus               99.1 1.8E-10 3.9E-15   68.8   4.7   49   13-61      5-53  (678)
 47 KOG0124|consensus               99.1 2.9E-10 6.2E-15   65.7   5.3   50   12-61    209-258 (544)
 48 TIGR01649 hnRNP-L_PTB hnRNP-L/  99.1 3.4E-10 7.4E-15   67.0   5.7   43   13-61      2-44  (481)
 49 KOG0105|consensus               99.1 4.2E-10 9.2E-15   60.0   4.9   48   11-61      4-51  (241)
 50 cd00590 RRM RRM (RNA recogniti  99.1 1.1E-09 2.5E-14   49.0   5.8   46   15-61      1-46  (74)
 51 TIGR01649 hnRNP-L_PTB hnRNP-L/  99.1 1.1E-09 2.4E-14   64.8   6.7   45   12-61    274-319 (481)
 52 KOG0109|consensus               99.0   7E-10 1.5E-14   62.3   3.4   35   14-48      3-37  (346)
 53 KOG0146|consensus               99.0 1.7E-09 3.7E-14   60.4   4.8   53    9-61    281-333 (371)
 54 KOG0415|consensus               98.9 2.7E-09 5.8E-14   61.5   3.8   48   14-61    240-287 (479)
 55 KOG4205|consensus               98.9 4.2E-09 9.2E-14   59.8   4.6   49   13-61     97-145 (311)
 56 KOG0146|consensus               98.8 3.1E-09 6.8E-14   59.4   3.2   49   12-61     18-66  (371)
 57 KOG0131|consensus               98.8 4.7E-09   1E-13   55.6   3.2   51   10-60     93-144 (203)
 58 KOG4208|consensus               98.8 2.5E-08 5.4E-13   53.7   5.4   53    9-61     45-98  (214)
 59 TIGR01642 U2AF_lg U2 snRNP aux  98.8 3.6E-08 7.7E-13   58.4   6.4   46   10-61    172-229 (509)
 60 KOG0110|consensus               98.8 6.2E-09 1.3E-13   63.6   3.2   49   13-61    613-661 (725)
 61 TIGR01648 hnRNP-R-Q heterogene  98.7 4.1E-08 8.9E-13   59.5   6.0   42   12-61    232-275 (578)
 62 KOG0153|consensus               98.7 3.1E-08 6.8E-13   56.7   4.7   48    8-61    223-270 (377)
 63 KOG0116|consensus               98.7 4.5E-08 9.8E-13   57.5   4.5   49   13-61    288-336 (419)
 64 PF04059 RRM_2:  RNA recognitio  98.7 2.1E-07 4.6E-12   45.1   6.1   47   14-60      2-50  (97)
 65 KOG4212|consensus               98.6 1.3E-07 2.8E-12   56.0   5.1   48   13-61     44-92  (608)
 66 KOG0117|consensus               98.6   1E-07 2.2E-12   56.2   4.0   40   13-60    259-298 (506)
 67 KOG0123|consensus               98.6 1.6E-07 3.5E-12   54.5   4.6   45   14-61     77-121 (369)
 68 KOG4661|consensus               98.5 1.8E-07   4E-12   56.9   4.5   49   13-61    405-453 (940)
 69 KOG4206|consensus               98.5 5.2E-07 1.1E-11   49.1   5.8   47   11-60      7-57  (221)
 70 KOG4209|consensus               98.5 1.2E-07 2.7E-12   52.0   3.3   51   11-61     99-149 (231)
 71 smart00361 RRM_1 RNA recogniti  98.5 3.2E-07   7E-12   41.8   3.9   35   27-61      2-43  (70)
 72 KOG0109|consensus               98.5 1.9E-07 4.2E-12   52.7   3.3   38   11-48     76-113 (346)
 73 KOG0132|consensus               98.5 2.7E-07 5.9E-12   57.3   4.1   43   13-61    421-463 (894)
 74 KOG0123|consensus               98.3 1.7E-06 3.6E-11   50.4   4.8   51   10-61    267-317 (369)
 75 KOG4454|consensus               98.2 7.8E-07 1.7E-11   48.7   2.1   50   10-61      6-55  (267)
 76 KOG4849|consensus               98.2 4.8E-07   1E-11   52.4   1.1   47   14-60     81-129 (498)
 77 KOG0533|consensus               98.2 5.9E-06 1.3E-10   45.8   5.1   49   11-60     81-129 (243)
 78 KOG4212|consensus               98.2   2E-06 4.4E-11   51.1   3.3   45    9-56    532-576 (608)
 79 KOG0226|consensus               98.1 2.9E-06 6.2E-11   47.3   3.2   50   11-60    188-237 (290)
 80 KOG4211|consensus               98.1 1.7E-05 3.7E-10   47.5   5.7   50    9-61      6-55  (510)
 81 KOG0120|consensus               98.1 6.1E-06 1.3E-10   49.6   3.7   53    9-61    285-337 (500)
 82 KOG1548|consensus               98.0 3.8E-05 8.3E-10   44.4   5.8   49   10-59    131-187 (382)
 83 KOG0110|consensus               98.0 1.4E-05   3E-10   49.5   4.0   48   14-61    516-566 (725)
 84 KOG4210|consensus               97.9 8.6E-06 1.9E-10   46.1   2.3   50   12-61    183-233 (285)
 85 KOG0151|consensus               97.8   4E-05 8.7E-10   47.9   4.5   39   11-49    172-210 (877)
 86 KOG0147|consensus               97.8 3.5E-06 7.5E-11   50.7  -0.3   51   11-61    177-227 (549)
 87 KOG0106|consensus               97.8 2.7E-05 5.9E-10   42.6   2.9   32   14-45      2-33  (216)
 88 KOG4660|consensus               97.8 4.9E-05 1.1E-09   46.0   4.0   45   12-61     74-118 (549)
 89 PF13893 RRM_5:  RNA recognitio  97.7 7.7E-05 1.7E-09   32.3   3.1   27   30-61      1-27  (56)
 90 KOG0129|consensus               97.7 0.00026 5.6E-09   42.8   5.8   52    9-60    366-418 (520)
 91 KOG0129|consensus               97.3 0.00059 1.3E-08   41.3   4.3   33   13-46    259-291 (520)
 92 KOG4211|consensus               97.3  0.0012 2.6E-08   39.9   5.2   49   12-61    102-151 (510)
 93 PF08777 RRM_3:  RNA binding mo  97.2 0.00049 1.1E-08   33.8   2.9   36   13-48      1-36  (105)
 94 KOG1995|consensus               97.2 0.00076 1.6E-08   39.2   3.9   52    9-60     62-121 (351)
 95 KOG1190|consensus               97.1  0.0007 1.5E-08   40.3   3.3   37   12-48     27-63  (492)
 96 KOG1457|consensus               97.1  0.0026 5.5E-08   35.5   5.2   51   11-61     32-83  (284)
 97 KOG3152|consensus               97.0   0.002 4.4E-08   36.3   3.9   39   12-50     73-111 (278)
 98 KOG1457|consensus               96.9   0.002 4.4E-08   35.8   3.5   33   11-43    208-240 (284)
 99 PF08675 RNA_bind:  RNA binding  96.8    0.01 2.2E-07   28.3   5.2   33   14-48     10-42  (87)
100 KOG0105|consensus               96.8  0.0061 1.3E-07   33.2   4.7   40    9-48    111-150 (241)
101 KOG4206|consensus               96.6   0.012 2.5E-07   32.5   5.1   46   11-61    144-189 (221)
102 KOG0115|consensus               96.3  0.0048   1E-07   34.8   2.6   35   14-48     32-66  (275)
103 KOG0128|consensus               96.3  0.0032   7E-08   40.3   2.1   46   14-60    737-782 (881)
104 KOG0128|consensus               95.8 0.00015 3.2E-09   46.0  -5.2   50   12-61    666-715 (881)
105 KOG1855|consensus               95.4   0.038 8.2E-07   33.4   3.8   37   12-48    230-266 (484)
106 KOG1365|consensus               95.3   0.024 5.2E-07   33.9   2.9   48   13-61    280-330 (508)
107 KOG2314|consensus               95.1    0.14 3.1E-06   32.2   5.8   50   11-61     56-111 (698)
108 PF14605 Nup35_RRM_2:  Nup53/35  94.6     0.1 2.2E-06   22.5   3.3   32   14-46      2-33  (53)
109 KOG1365|consensus               93.7    0.32 6.8E-06   29.4   4.9   46   14-60    162-211 (508)
110 KOG4676|consensus               93.1    0.12 2.7E-06   31.1   2.7   35   13-47      7-41  (479)
111 KOG0112|consensus               92.8   0.043 9.2E-07   35.8   0.6   37   12-48    371-407 (975)
112 KOG0106|consensus               92.3    0.13 2.8E-06   28.5   2.0   33   11-43     97-129 (216)
113 KOG1190|consensus               92.2    0.47   1E-05   28.9   4.2   38   13-50    297-335 (492)
114 KOG1456|consensus               91.0     1.2 2.6E-05   27.0   5.1   37   10-46     28-64  (494)
115 KOG4307|consensus               90.9    0.35 7.5E-06   31.3   3.0   51   10-61    431-482 (944)
116 KOG1456|consensus               90.8     1.7 3.6E-05   26.4   5.5   41    9-49    283-324 (494)
117 KOG0112|consensus               90.6    0.86 1.9E-05   30.2   4.5   36   10-45    452-487 (975)
118 KOG4307|consensus               90.0       1 2.2E-05   29.4   4.4   46   13-58    867-912 (944)
119 PF03467 Smg4_UPF3:  Smg-4/UPF3  89.6    0.74 1.6E-05   24.6   3.2   30   11-40      5-35  (176)
120 KOG2193|consensus               89.5    0.33 7.2E-06   29.7   2.0   24   14-37      2-25  (584)
121 PF09707 Cas_Cas2CT1978:  CRISP  87.2     1.5 3.3E-05   20.9   3.1   45   13-60     25-69  (86)
122 KOG4660|consensus               85.7    0.92   2E-05   28.4   2.4   47   14-60    389-436 (549)
123 COG5175 MOT2 Transcriptional r  85.6     1.1 2.4E-05   26.8   2.6   35   13-47    114-154 (480)
124 KOG2253|consensus               82.4     1.2 2.7E-05   28.5   2.0   35   10-44     37-71  (668)
125 KOG2416|consensus               80.8       1 2.2E-05   28.8   1.3   37    7-43    438-475 (718)
126 KOG4008|consensus               80.6       3 6.6E-05   23.7   2.9   32    9-40     36-67  (261)
127 PRK11558 putative ssRNA endonu  80.3     4.1 8.8E-05   20.0   3.0   45   13-60     27-71  (97)
128 KOG4410|consensus               79.3      11 0.00023   22.4   5.0   26   14-39    331-356 (396)
129 PF07292 NID:  Nmi/IFP 35 domai  78.4       6 0.00013   19.0   3.4   22   13-34     52-73  (88)
130 COG0030 KsgA Dimethyladenosine  78.1     2.6 5.6E-05   24.1   2.2   35   14-48     96-130 (259)
131 KOG4454|consensus               76.8    0.52 1.1E-05   26.6  -0.7   49   11-60     78-130 (267)
132 PF05172 Nup35_RRM:  Nup53/35/4  75.0     4.8  0.0001   19.7   2.5   30   14-44      7-36  (100)
133 PF11608 Limkain-b1:  Limkain b  74.9     6.1 0.00013   19.1   2.7   30   14-43      3-37  (90)
134 KOG4210|consensus               74.5     1.7 3.7E-05   25.1   1.0   49   12-60     87-135 (285)
135 PF10567 Nab6_mRNP_bdg:  RNA-re  73.4     3.2 6.9E-05   24.3   1.8   37   13-49     15-51  (309)
136 COG5584 Predicted small secret  69.5       6 0.00013   19.5   2.0   29   21-49     30-58  (103)
137 KOG2891|consensus               67.6     9.8 0.00021   22.5   3.0   35   12-46    148-194 (445)
138 PTZ00338 dimethyladenosine tra  66.1       7 0.00015   22.6   2.2   34   15-48    103-136 (294)
139 TIGR01873 cas_CT1978 CRISPR-as  65.6      14  0.0003   17.8   2.8   24   13-36     25-48  (87)
140 PRK15464 cold shock-like prote  64.3       4 8.6E-05   18.6   0.8    8   54-61     15-22  (70)
141 PF14893 PNMA:  PNMA             64.1     8.7 0.00019   22.8   2.4   26   10-35     15-40  (331)
142 PRK00274 ksgA 16S ribosomal RN  63.7     9.6 0.00021   21.6   2.4   34   15-48    107-140 (272)
143 PF10309 DUF2414:  Protein of u  63.2      13 0.00029   16.6   4.1   35   13-48      5-42  (62)
144 PF15513 DUF4651:  Domain of un  62.6      14  0.0003   16.7   4.0   17   28-44      9-25  (62)
145 PRK15463 cold shock-like prote  61.6     4.5 9.8E-05   18.4   0.8    8   54-61     15-22  (70)
146 COG0481 LepA Membrane GTPase L  59.5      15 0.00032   23.5   2.8   45   14-59    298-343 (603)
147 KOG2318|consensus               59.3      38 0.00082   22.1   4.5   36   10-45    171-211 (650)
148 PRK09507 cspE cold shock prote  59.1     4.4 9.6E-05   18.3   0.5    8   54-61     14-21  (69)
149 PRK10943 cold shock-like prote  58.2     4.6  0.0001   18.3   0.5    8   54-61     14-21  (69)
150 PRK10354 RNA chaperone/anti-te  55.4     5.7 0.00012   18.0   0.5    8   54-61     15-22  (70)
151 PRK09890 cold shock protein Cs  55.3     5.6 0.00012   18.0   0.5    8   54-61     15-22  (70)
152 cd00027 BRCT Breast Cancer Sup  53.3      18 0.00038   15.0   3.6   26   14-39      2-27  (72)
153 PF00398 RrnaAD:  Ribosomal RNA  52.9      25 0.00053   19.8   2.9   31   13-43     97-129 (262)
154 PF03468 XS:  XS domain;  Inter  50.4      33 0.00072   17.2   2.9   33   26-61     30-62  (116)
155 PF12157 DUF3591:  Protein of u  48.8      32 0.00069   21.6   3.1   40   19-61    359-398 (457)
156 PF11411 DNA_ligase_IV:  DNA li  48.7      19  0.0004   14.4   1.5   15   24-38     20-34  (36)
157 TIGR00755 ksgA dimethyladenosi  47.2      43 0.00094   18.7   3.3   25   15-39     96-120 (253)
158 PF03672 UPF0154:  Uncharacteri  46.7      30 0.00064   15.7   2.1   18   22-39     29-46  (64)
159 PHA01632 hypothetical protein   46.3      29 0.00063   15.4   2.3   18   18-35     21-38  (64)
160 PF07230 Peptidase_S80:  Bacter  46.3      20 0.00043   22.7   2.0   36   15-54    280-315 (501)
161 PF13046 DUF3906:  Protein of u  46.2      31 0.00067   15.7   2.2   32   27-60     32-63  (64)
162 PHA02531 20 portal vertex prot  45.4      15 0.00033   23.2   1.4   36   14-53    282-317 (514)
163 PRK00523 hypothetical protein;  44.0      34 0.00074   15.9   2.1   18   22-39     37-54  (72)
164 PF15407 Spo7_2_N:  Sporulation  43.9     7.6 0.00017   17.7   0.0   18   11-28     25-42  (67)
165 smart00650 rADc Ribosomal RNA   41.4      53  0.0012   17.0   3.4   23   14-36     78-100 (169)
166 PF15063 TC1:  Thyroid cancer p  41.0      23  0.0005   16.7   1.3   26   14-39     26-51  (79)
167 PF01823 MACPF:  MAC/Perforin d  40.0      26 0.00056   18.6   1.7   22   18-39     53-77  (212)
168 COG5638 Uncharacterized conser  39.8      98  0.0021   19.6   4.3   37    9-45    142-183 (622)
169 KOG2135|consensus               39.4      18 0.00038   22.9   1.0   35   14-48    373-408 (526)
170 PRK01844 hypothetical protein;  39.2      45 0.00097   15.5   2.1   18   22-39     36-53  (72)
171 PF11004 Kdo_hydroxy:  3-deoxy-  38.0      55  0.0012   19.3   2.8   47   10-57     18-65  (281)
172 smart00457 MACPF membrane-atta  37.9      24 0.00053   18.9   1.4   21   19-39     31-51  (194)
173 PF09176 Mpt_N:  Methylene-tetr  37.4     9.2  0.0002   18.2  -0.3   30    3-33     35-64  (81)
174 PF00220 Hormone_4:  Neurohypop  36.9      16 0.00035   10.0   0.3    6   17-22      2-7   (9)
175 PF14111 DUF4283:  Domain of un  36.1      62  0.0013   16.3   3.5   31   16-46    107-138 (153)
176 smart00596 PRE_C2HC PRE_C2HC d  33.7      56  0.0012   15.1   3.9   26   29-54      3-29  (69)
177 KOG4365|consensus               33.3      28  0.0006   22.0   1.2   45   16-60      6-50  (572)
178 PF10281 Ish1:  Putative stress  32.4      41  0.0009   13.1   1.9   16   24-39      3-18  (38)
179 PF05189 RTC_insert:  RNA 3'-te  32.2      66  0.0014   15.4   4.3   45   15-59     12-61  (103)
180 KOG2591|consensus               30.9 1.5E+02  0.0032   19.6   4.0   35   12-46    174-210 (684)
181 KOG3424|consensus               30.6      86  0.0019   16.2   3.3   36   25-60     35-75  (132)
182 cd00874 RNA_Cyclase_Class_II R  30.2      88  0.0019   18.7   2.9   43   15-59    188-234 (326)
183 cd06257 DnaJ DnaJ domain or J-  29.8      51  0.0011   13.3   2.2   18   19-36      6-23  (55)
184 PF12385 Peptidase_C70:  Papain  29.5      68  0.0015   17.4   2.1   20   24-43     95-114 (166)
185 PF06014 DUF910:  Bacterial pro  28.6      65  0.0014   14.5   1.7   16   27-42      4-19  (62)
186 COG3763 Uncharacterized protei  28.4      74  0.0016   14.8   2.1   18   22-39     36-53  (71)
187 COG4647 AcxC Acetone carboxyla  28.3      69  0.0015   16.8   2.0   31   13-43     89-132 (165)
188 PF00226 DnaJ:  DnaJ domain;  I  28.3      61  0.0013   13.7   2.2   18   19-36      6-23  (64)
189 smart00703 NRF N-terminal doma  28.2      56  0.0012   15.9   1.6   23   15-37     71-94  (110)
190 KOG0008|consensus               27.3   1E+02  0.0022   22.6   3.0   39   20-61    820-859 (1563)
191 KOG0820|consensus               27.0      78  0.0017   18.9   2.2   22   14-35    124-145 (315)
192 PF04008 Adenosine_kin:  Adenos  26.6   1E+02  0.0023   16.5   2.5   27   11-37      6-32  (155)
193 PF13600 DUF4140:  N-terminal d  25.7      26 0.00057   16.8   0.2   35   13-47     23-57  (104)
194 smart00540 LEM in nuclear memb  24.8      70  0.0015   13.3   1.8   15   25-39      6-20  (44)
195 KOG2068|consensus               24.7      48   0.001   19.9   1.2   36   14-49     78-116 (327)
196 smart00195 DSPc Dual specifici  24.5      61  0.0013   16.0   1.4   18   15-32      7-24  (138)
197 PF11249 DUF3047:  Protein of u  23.6 1.4E+02   0.003   16.2   4.3   33   28-60    145-178 (183)
198 PRK04239 hypothetical protein;  23.3 1.1E+02  0.0024   15.4   2.1   19   19-37     80-98  (110)
199 PF15051 FAM198:  FAM198 protei  23.3      88  0.0019   18.8   2.0   30   14-43      3-33  (326)
200 PF01984 dsDNA_bind:  Double-st  23.2      76  0.0016   15.7   1.6   20   19-38     75-94  (107)
201 PF05023 Phytochelatin:  Phytoc  23.1      43 0.00092   18.7   0.7   14   48-61    199-212 (212)
202 TIGR00006 S-adenosyl-methyltra  22.3      76  0.0016   18.8   1.6   19   24-42    140-158 (305)
203 PF00054 Laminin_G_1:  Laminin   22.2      72  0.0016   15.8   1.4   11   12-22     91-101 (131)
204 smart00271 DnaJ DnaJ molecular  22.2      80  0.0017   13.0   2.2   18   19-36      7-24  (60)
205 PF15023 DUF4523:  Protein of u  22.1 1.5E+02  0.0032   16.0   3.5   33   13-45     86-122 (166)
206 PRK00050 16S rRNA m(4)C1402 me  22.1      78  0.0017   18.6   1.7   19   24-42    138-156 (296)
207 PF00533 BRCT:  BRCA1 C Terminu  21.8      89  0.0019   13.4   2.8   29   12-41      7-35  (78)
208 KOG0984|consensus               21.4 1.9E+02   0.004   16.9   3.6   30    6-35     24-53  (282)
209 KOG0718|consensus               20.7   1E+02  0.0023   19.8   2.0   19   19-37     15-33  (546)
210 COG2118 DNA-binding protein [G  20.7 1.4E+02  0.0029   15.3   2.1   20   18-37     82-101 (116)
211 KOG4285|consensus               20.2      98  0.0021   18.7   1.8   29   16-45    200-228 (350)

No 1  
>PLN03134 glycine-rich RNA-binding protein 4; Provisional
Probab=99.69  E-value=1.7e-16  Score=81.35  Aligned_cols=52  Identities=35%  Similarity=0.443  Sum_probs=48.6

Q ss_pred             CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ....++|||+|||+.+++++|+++|++||.|.++.++.|+.+++++|||||+
T Consensus        31 ~~~~~~lfVgnL~~~~te~~L~~~F~~~G~I~~v~i~~d~~tg~~kGfaFV~   82 (144)
T PLN03134         31 RLMSTKLFIGGLSWGTDDASLRDAFAHFGDVVDAKVIVDRETGRSRGFGFVN   82 (144)
T ss_pred             cCCCCEEEEeCCCCCCCHHHHHHHHhcCCCeEEEEEEecCCCCCcceEEEEE
Confidence            3446899999999999999999999999999999999999999999999995


No 2  
>KOG0149|consensus
Probab=99.68  E-value=1.6e-16  Score=85.95  Aligned_cols=50  Identities=32%  Similarity=0.571  Sum_probs=47.7

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .-.+||||+|+|.+..+.|++.|++||+|.+..++.|+.||+++|||||+
T Consensus        11 ~~TKifVggL~w~T~~~~l~~yFeqfGeI~eavvitd~~t~rskGyGfVT   60 (247)
T KOG0149|consen   11 TFTKIFVGGLAWETHKETLRRYFEQFGEIVEAVVITDKNTGRSKGYGFVT   60 (247)
T ss_pred             eEEEEEEcCcccccchHHHHHHHHHhCceEEEEEEeccCCccccceeeEE
Confidence            34789999999999999999999999999999999999999999999996


No 3  
>KOG4207|consensus
Probab=99.66  E-value=3.6e-16  Score=83.68  Aligned_cols=61  Identities=84%  Similarity=1.456  Sum_probs=59.3

Q ss_pred             CCCCCCCCCCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050           1 MSYGRPPPRIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus         1 m~~~~~~~~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      |+|+.+|++.+....|.|-||.+.++.++|+.+|++||.|-++.|+.|+.|++.+|||||.
T Consensus         1 MS~g~~PPdv~gm~SLkVdNLTyRTspd~LrrvFekYG~vgDVyIPrdr~Tr~sRgFaFVr   61 (256)
T KOG4207|consen    1 MSYGRPPPDVEGMTSLKVDNLTYRTSPDDLRRVFEKYGRVGDVYIPRDRYTRQSRGFAFVR   61 (256)
T ss_pred             CCCCCCCCCcccceeEEecceeccCCHHHHHHHHHHhCcccceecccccccccccceeEEE
Confidence            9999999999999999999999999999999999999999999999999999999999983


No 4  
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=99.63  E-value=2.7e-15  Score=84.84  Aligned_cols=51  Identities=29%  Similarity=0.386  Sum_probs=47.5

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..+.+|||+|||+.+++++|+++|++||.|.+++++.|+.|+.++|||||+
T Consensus       267 ~~~~~lfV~NL~~~~~e~~L~~~F~~fG~v~~v~i~~d~~t~~skG~aFV~  317 (352)
T TIGR01661       267 GAGYCIFVYNLSPDTDETVLWQLFGPFGAVQNVKIIRDLTTNQCKGYGFVS  317 (352)
T ss_pred             CCCcEEEEeCCCCCCCHHHHHHHHHhCCCeEEEEEeEcCCCCCccceEEEE
Confidence            334579999999999999999999999999999999999999999999995


No 5  
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=99.63  E-value=1.6e-15  Score=85.78  Aligned_cols=49  Identities=31%  Similarity=0.536  Sum_probs=47.1

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..+|||+|||..+++++|+++|+.||+|.++.+++|+.+|+++|||||+
T Consensus         3 ~~~l~V~nLp~~~~e~~l~~~F~~~G~i~~v~i~~d~~~g~s~g~afV~   51 (352)
T TIGR01661         3 KTNLIVNYLPQTMTQEEIRSLFTSIGEIESCKLVRDKVTGQSLGYGFVN   51 (352)
T ss_pred             CcEEEEeCCCCCCCHHHHHHHHHccCCEEEEEEEEcCCCCccceEEEEE
Confidence            4899999999999999999999999999999999999999999999995


No 6  
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=99.63  E-value=2.7e-15  Score=85.58  Aligned_cols=53  Identities=30%  Similarity=0.326  Sum_probs=49.3

Q ss_pred             CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050           9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus         9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      +....++|||+|||+++++++|+++|+.+|.|.+++++.|+.+++++|||||+
T Consensus       103 ~~~~~~~LfVgnLp~~~te~~L~~lF~~~G~V~~v~i~~d~~tg~srGyaFVe  155 (346)
T TIGR01659       103 TNNSGTNLIVNYLPQDMTDRELYALFRTIGPINTCRIMRDYKTGYSFGYAFVD  155 (346)
T ss_pred             CCCCCcEEEEeCCCCCCCHHHHHHHHHhcCCEEEEEEEecCCCCccCcEEEEE
Confidence            34456899999999999999999999999999999999999999999999995


No 7  
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=99.56  E-value=2e-14  Score=86.13  Aligned_cols=58  Identities=31%  Similarity=0.592  Sum_probs=50.8

Q ss_pred             CCCCCCC-----CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050           3 YGRPPPR-----IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus         3 ~~~~~~~-----~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      |+.||+.     ....++|||+|||+++++++|+++|+++|.|.++++++| .+++++|||||+
T Consensus        43 ~g~Pp~~~~~~~p~~~~~lFVgnLp~~~tEd~L~~~F~~~G~I~~vrl~~D-~sG~sRGfaFV~  105 (578)
T TIGR01648        43 YGGPPPGWSGVQPGRGCEVFVGKIPRDLYEDELVPLFEKAGPIYELRLMMD-FSGQNRGYAFVT  105 (578)
T ss_pred             CCCCCCcccCCCCCCCCEEEeCCCCCCCCHHHHHHHHHhhCCEEEEEEEEC-CCCCccceEEEE
Confidence            5555542     344689999999999999999999999999999999999 789999999995


No 8  
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=99.56  E-value=1.6e-14  Score=82.49  Aligned_cols=52  Identities=37%  Similarity=0.496  Sum_probs=48.2

Q ss_pred             CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .....+|||+|||+.+++++|+++|++||.|..+.++.++.+++.+|||||+
T Consensus       190 ~~~~~~lfV~nLp~~vtee~L~~~F~~fG~V~~v~i~~d~~tg~~kG~aFV~  241 (346)
T TIGR01659       190 SIKDTNLYVTNLPRTITDDQLDTIFGKYGQIVQKNILRDKLTGTPRGVAFVR  241 (346)
T ss_pred             ccccceeEEeCCCCcccHHHHHHHHHhcCCEEEEEEeecCCCCccceEEEEE
Confidence            3445789999999999999999999999999999999999999999999995


No 9  
>KOG0113|consensus
Probab=99.55  E-value=1.8e-14  Score=80.29  Aligned_cols=51  Identities=37%  Similarity=0.660  Sum_probs=48.3

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      +.=+||||+-|+.++++..|+..|+.||.|+.++++.|+.||+++|||||+
T Consensus        99 DPy~TLFv~RLnydT~EskLrreF~~YG~IkrirlV~d~vTgkskGYAFIe  149 (335)
T KOG0113|consen   99 DPYKTLFVARLNYDTSESKLRREFEKYGPIKRIRLVRDKVTGKSKGYAFIE  149 (335)
T ss_pred             CccceeeeeeccccccHHHHHHHHHhcCcceeEEEeeecccCCccceEEEE
Confidence            334899999999999999999999999999999999999999999999995


No 10 
>KOG0122|consensus
Probab=99.54  E-value=2.1e-14  Score=78.25  Aligned_cols=53  Identities=34%  Similarity=0.597  Sum_probs=49.8

Q ss_pred             CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050           9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus         9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      +.++.++|.|.||+.++++++|+++|.+||.+..+.+.+|+.||.++|||||+
T Consensus       185 ~R~D~~tvRvtNLsed~~E~dL~eLf~~fg~i~rvylardK~TG~~kGFAFVt  237 (270)
T KOG0122|consen  185 ERDDEATVRVTNLSEDMREDDLEELFRPFGPITRVYLARDKETGLSKGFAFVT  237 (270)
T ss_pred             cCCccceeEEecCccccChhHHHHHhhccCccceeEEEEccccCcccceEEEE
Confidence            45567899999999999999999999999999999999999999999999984


No 11 
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=99.53  E-value=3.7e-14  Score=85.29  Aligned_cols=51  Identities=29%  Similarity=0.540  Sum_probs=48.3

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ...++|||+||++.+++++|+++|.+||.|.+++++.|+.+++++|||||+
T Consensus       105 ~~~~rLfVGnLp~~~tEe~Lr~lF~~fG~I~sV~I~~D~~TgkskGfAFVe  155 (612)
T TIGR01645       105 AIMCRVYVGSISFELREDTIRRAFDPFGPIKSINMSWDPATGKHKGFAFVE  155 (612)
T ss_pred             cCCCEEEEcCCCCCCCHHHHHHHHHccCCEEEEEEeecCCCCCcCCeEEEE
Confidence            446899999999999999999999999999999999999999999999995


No 12 
>KOG0126|consensus
Probab=99.52  E-value=3e-15  Score=78.86  Aligned_cols=52  Identities=35%  Similarity=0.569  Sum_probs=48.8

Q ss_pred             CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050           9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus         9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      +-.+.+-|||||||+..++.+|.-.|++||.+..+.+++|+.||+++||||+
T Consensus        31 ~YkdsA~Iyiggl~~~LtEgDil~VFSqyGe~vdinLiRDk~TGKSKGFaFL   82 (219)
T KOG0126|consen   31 EYKDSAYIYIGGLPYELTEGDILCVFSQYGEIVDINLIRDKKTGKSKGFAFL   82 (219)
T ss_pred             hcccceEEEECCCcccccCCcEEEEeeccCceEEEEEEecCCCCcccceEEE
Confidence            3456688999999999999999999999999999999999999999999996


No 13 
>KOG0121|consensus
Probab=99.51  E-value=2.9e-14  Score=71.72  Aligned_cols=51  Identities=31%  Similarity=0.675  Sum_probs=48.1

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .+.++|||+||++.++++.|.++|+.+|.|+.+.+-.|+.+..+.|||||+
T Consensus        34 r~S~tvyVgNlSfyttEEqiyELFs~cG~irriiMGLdr~kktpCGFCFVe   84 (153)
T KOG0121|consen   34 RKSCTVYVGNLSFYTTEEQIYELFSKCGDIRRIIMGLDRFKKTPCGFCFVE   84 (153)
T ss_pred             hhcceEEEeeeeeeecHHHHHHHHHhccchheeEeccccCCcCccceEEEE
Confidence            457999999999999999999999999999999999999999999999995


No 14 
>KOG0117|consensus
Probab=99.50  E-value=7.1e-14  Score=81.16  Aligned_cols=51  Identities=25%  Similarity=0.470  Sum_probs=48.9

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..++.||||.||.++.+++|..+|++.|+|-+++|++|+.+|.+||||||+
T Consensus        81 ~~G~EVfvGkIPrD~~EdeLvplfEkiG~I~elRLMmD~~sG~nRGYAFVt  131 (506)
T KOG0117|consen   81 PRGCEVFVGKIPRDVFEDELVPLFEKIGKIYELRLMMDPFSGDNRGYAFVT  131 (506)
T ss_pred             CCCceEEecCCCccccchhhHHHHHhccceeeEEEeecccCCCCcceEEEE
Confidence            457999999999999999999999999999999999999999999999995


No 15 
>PF00076 RRM_1:  RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain);  InterPro: IPR000504 Many eukaryotic proteins containing one or more copies of a putative RNA-binding domain of about 90 amino acids are known to bind single-stranded RNAs [, , ]. The largest group of single strand RNA-binding proteins is the eukaryotic RNA recognition motif (RRM) family that contains an eight amino acid RNP-1 consensus sequence [, ]. RRM proteins have a variety of RNA binding preferences and functions, and include heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing (SR, U2AF, Sxl), protein components of small nuclear ribonucleoproteins (U1 and U2 snRNPs), and proteins that regulate RNA stability and translation (PABP, La, Hu) [, , ]. The RRM in heterodimeric splicing factor U2 snRNP auxiliary factor (U2AF) appears to have two RRM-like domains with specialised features for protein recognition []. The motif also appears in a few single stranded DNA binding proteins. The typical RRM consists of four anti-parallel beta-strands and two alpha-helices arranged in a beta-alpha-beta-beta-alpha-beta fold with side chains that stack with RNA bases. Specificity of RNA binding is determined by multiple contacts with surrounding amino acids. A third helix is present during RNA binding in some cases []. The RRM is reviewed in a number of publications [, , ].; GO: 0003676 nucleic acid binding; PDB: 2RNE_A 2DGO_A 2DO4_A 1YTY_B 2VOO_B 2VOP_A 2VON_B 1ZH5_B 2VOD_A 1S79_A ....
Probab=99.50  E-value=1.2e-13  Score=62.36  Aligned_cols=45  Identities=36%  Similarity=0.612  Sum_probs=42.0

Q ss_pred             EEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          16 LKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        16 l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      |||+|||+.+++++|+++|++||.+..+.+..+ .++..+|+|||+
T Consensus         1 l~v~nlp~~~t~~~l~~~f~~~g~i~~~~~~~~-~~~~~~~~a~V~   45 (70)
T PF00076_consen    1 LYVGNLPPDVTEEELRDFFSQFGKIESIKVMRN-SSGKSKGYAFVE   45 (70)
T ss_dssp             EEEESETTTSSHHHHHHHHHTTSTEEEEEEEEE-TTSSEEEEEEEE
T ss_pred             cEEcCCCCcCCHHHHHHHHHHhhhccccccccc-ccccccceEEEE
Confidence            799999999999999999999999999999887 678899999985


No 16 
>KOG0144|consensus
Probab=99.47  E-value=1.1e-13  Score=80.28  Aligned_cols=51  Identities=33%  Similarity=0.562  Sum_probs=47.9

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .+.-++||+.+|..++|.+|+++|++||.|.+|.+++|+.|+..+|||||+
T Consensus        32 ~~~vKlfVgqIprt~sE~dlr~lFe~yg~V~einl~kDk~t~~s~gcCFv~   82 (510)
T KOG0144|consen   32 GSAVKLFVGQIPRTASEKDLRELFEKYGNVYEINLIKDKSTGQSKGCCFVK   82 (510)
T ss_pred             chhhhheeccCCccccHHHHHHHHHHhCceeEEEeecccccCcccceEEEE
Confidence            344689999999999999999999999999999999999999999999985


No 17 
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=99.47  E-value=3.1e-13  Score=78.73  Aligned_cols=50  Identities=30%  Similarity=0.471  Sum_probs=47.4

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..++|||+|||..+++++|+++|+.+|.|..+.++.++.+++++|||||+
T Consensus        88 ~~~~l~V~nlp~~~~~~~l~~~F~~~G~v~~v~i~~d~~~~~skg~afVe  137 (457)
T TIGR01622        88 DDRTVFVLQLALKARERDLYEFFSKVGKVRDVQCIKDRNSRRSKGVAYVE  137 (457)
T ss_pred             CCcEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEeecCCCCCcceEEEEE
Confidence            35799999999999999999999999999999999999999999999995


No 18 
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=99.46  E-value=4.3e-13  Score=78.93  Aligned_cols=50  Identities=22%  Similarity=0.389  Sum_probs=47.2

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ...+|||+|||..+++++|+++|+.||.|..+.++.++.+|.++|||||+
T Consensus       294 ~~~~l~v~nlp~~~~~~~l~~~f~~~G~i~~~~~~~~~~~g~~~g~afv~  343 (509)
T TIGR01642       294 SKDRIYIGNLPLYLGEDQIKELLESFGDLKAFNLIKDIATGLSKGYAFCE  343 (509)
T ss_pred             CCCEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEEecCCCCCcCeEEEEE
Confidence            45799999999999999999999999999999999999999999999985


No 19 
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=99.45  E-value=2.8e-13  Score=81.61  Aligned_cols=50  Identities=22%  Similarity=0.423  Sum_probs=47.1

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..++|||+||++.+++++|+++|+.||.|.++.++.++.+++++|||||+
T Consensus       203 ~~~rLfVgnLp~~vteedLk~lFs~FG~I~svrl~~D~~tgksKGfGFVe  252 (612)
T TIGR01645       203 KFNRIYVASVHPDLSETDIKSVFEAFGEIVKCQLARAPTGRGHKGYGFIE  252 (612)
T ss_pred             ccceEEeecCCCCCCHHHHHHHHhhcCCeeEEEEEecCCCCCcCCeEEEE
Confidence            34799999999999999999999999999999999999999999999985


No 20 
>KOG0125|consensus
Probab=99.44  E-value=2.3e-13  Score=76.77  Aligned_cols=51  Identities=35%  Similarity=0.548  Sum_probs=45.0

Q ss_pred             CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050           9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus         9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..+..++|+|+|+|+...+.||+.+|.+||.|.++.|+.+. .| +||||||+
T Consensus        92 s~~~pkRLhVSNIPFrFRdpDL~aMF~kfG~VldVEIIfNE-RG-SKGFGFVT  142 (376)
T KOG0125|consen   92 SKDTPKRLHVSNIPFRFRDPDLRAMFEKFGKVLDVEIIFNE-RG-SKGFGFVT  142 (376)
T ss_pred             CCCCCceeEeecCCccccCccHHHHHHhhCceeeEEEEecc-CC-CCccceEE
Confidence            34567899999999999999999999999999999999873 34 79999996


No 21 
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=99.44  E-value=4.6e-13  Score=79.80  Aligned_cols=48  Identities=33%  Similarity=0.471  Sum_probs=45.8

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .+|||+|||.++++++|+++|+++|.|.++++.+|..|++++|||||+
T Consensus         1 ~sl~VgnLp~~vte~~L~~~F~~~G~v~~v~v~~d~~t~~s~G~afV~   48 (562)
T TIGR01628         1 ASLYVGDLDPDVTEAKLYDLFKPFGPVLSVRVCRDSVTRRSLGYGYVN   48 (562)
T ss_pred             CeEEEeCCCCCCCHHHHHHHHHhcCCEEEEEEEecCCCCCcceEEEEE
Confidence            379999999999999999999999999999999999999999999985


No 22 
>PF14259 RRM_6:  RNA recognition motif (a.k.a. RRM, RBD, or RNP domain); PDB: 2DNN_A 1WI6_A 2EVZ_A 2ADC_A 1QM9_A 2KG1_A 2HGN_A 1U2F_A 2G4B_A 2HZC_A ....
Probab=99.43  E-value=8e-13  Score=60.09  Aligned_cols=45  Identities=40%  Similarity=0.654  Sum_probs=40.4

Q ss_pred             EEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          16 LKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        16 l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      |||+|||+.+++++|+++|+.+|.|..+.+..++. +..+|+|||+
T Consensus         1 v~i~nlp~~~~~~~l~~~f~~~g~v~~v~~~~~~~-~~~~~~a~v~   45 (70)
T PF14259_consen    1 VYISNLPPSTTEEDLRNFFSRFGPVEKVRLIKNKD-GQSRGFAFVE   45 (70)
T ss_dssp             EEEESSTTT--HHHHHHHCTTSSBEEEEEEEESTT-SSEEEEEEEE
T ss_pred             CEEeCCCCCCCHHHHHHHHHhcCCcceEEEEeeec-cccCCEEEEE
Confidence            79999999999999999999999999999999876 8899999985


No 23 
>PLN03213 repressor of silencing 3; Provisional
Probab=99.43  E-value=5.2e-13  Score=78.97  Aligned_cols=46  Identities=33%  Similarity=0.497  Sum_probs=42.0

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ...+||||||++.+++++|+.+|..||.|..+.++  +.+|  ||||||+
T Consensus         9 ~gMRIYVGNLSydVTEDDLravFSeFGsVkdVEIp--RETG--RGFAFVE   54 (759)
T PLN03213          9 GGVRLHVGGLGESVGRDDLLKIFSPMGTVDAVEFV--RTKG--RSFAYID   54 (759)
T ss_pred             cceEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEe--cccC--CceEEEE
Confidence            34789999999999999999999999999999998  4567  9999995


No 24 
>KOG0131|consensus
Probab=99.42  E-value=2.8e-13  Score=71.34  Aligned_cols=54  Identities=28%  Similarity=0.473  Sum_probs=51.1

Q ss_pred             CCCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050           8 PRIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus         8 ~~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .+.+++.+|||+||+..+++..|.++|.+.|+|..+++++|+.+..++||||++
T Consensus         4 ~~rnqd~tiyvgnld~kvs~~~l~EL~iqagpVv~i~iPkDrv~~~~qGygF~E   57 (203)
T KOG0131|consen    4 IERNQDATLYVGNLDEKVSEELLYELFIQAGPVVNLHIPKDRVTQKHQGYGFAE   57 (203)
T ss_pred             cccCCCceEEEecCCHHHHHHHHHHHHHhcCceeeeecchhhhcccccceeEEE
Confidence            467788999999999999999999999999999999999999999999999986


No 25 
>PLN03120 nucleic acid binding protein; Provisional
Probab=99.42  E-value=7.7e-13  Score=72.97  Aligned_cols=46  Identities=20%  Similarity=0.418  Sum_probs=42.0

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .++|||+|||+.+++++|+++|+.||.|.++.++.++.   .+|||||+
T Consensus         4 ~rtVfVgNLs~~tTE~dLrefFS~~G~I~~V~I~~d~~---~~GfAFVt   49 (260)
T PLN03120          4 VRTVKVSNVSLKATERDIKEFFSFSGDIEYVEMQSENE---RSQIAYVT   49 (260)
T ss_pred             CCEEEEeCCCCCCCHHHHHHHHHhcCCeEEEEEeecCC---CCCEEEEE
Confidence            47999999999999999999999999999999998853   46899985


No 26 
>PLN03121 nucleic acid binding protein; Provisional
Probab=99.39  E-value=2e-12  Score=70.68  Aligned_cols=47  Identities=28%  Similarity=0.413  Sum_probs=42.2

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .+.+|||+||++.+++++|+++|+.||.|.++.++++..   .+|+|||+
T Consensus         4 ~g~TV~V~NLS~~tTE~dLrefFS~~G~I~~V~I~~D~e---t~gfAfVt   50 (243)
T PLN03121          4 GGYTAEVTNLSPKATEKDVYDFFSHCGAIEHVEIIRSGE---YACTAYVT   50 (243)
T ss_pred             CceEEEEecCCCCCCHHHHHHHHHhcCCeEEEEEecCCC---cceEEEEE
Confidence            458999999999999999999999999999999998843   45799985


No 27 
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=99.38  E-value=2.1e-12  Score=75.33  Aligned_cols=49  Identities=37%  Similarity=0.718  Sum_probs=46.7

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..+|||+|||..+++++|+++|+.||.|..+.++.++.+|+++|||||+
T Consensus       186 ~~~l~v~nl~~~~te~~l~~~f~~~G~i~~v~~~~d~~~g~~~g~afV~  234 (457)
T TIGR01622       186 FLKLYVGNLHFNITEQELRQIFEPFGDIEDVQLHRDPETGRSKGFGFIQ  234 (457)
T ss_pred             CCEEEEcCCCCCCCHHHHHHHHHhcCCeEEEEEEEcCCCCccceEEEEE
Confidence            5899999999999999999999999999999999999999999999985


No 28 
>KOG0127|consensus
Probab=99.38  E-value=1.8e-12  Score=77.06  Aligned_cols=51  Identities=35%  Similarity=0.501  Sum_probs=48.1

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..+.+|||+|||++++++.|.++|++||.|....++.++.|+.++|.|||.
T Consensus       290 ~~~~tVFvRNL~fD~tEEel~~~fskFG~v~ya~iV~~k~T~~skGtAFv~  340 (678)
T KOG0127|consen  290 TEGKTVFVRNLPFDTTEEELKEHFSKFGEVKYAIIVKDKDTGHSKGTAFVK  340 (678)
T ss_pred             cccceEEEecCCccccHHHHHHHHHhhccceeEEEEeccCCCCcccceEEE
Confidence            345899999999999999999999999999999999999999999999984


No 29 
>KOG0145|consensus
Probab=99.37  E-value=2.5e-13  Score=75.12  Aligned_cols=60  Identities=32%  Similarity=0.438  Sum_probs=56.6

Q ss_pred             CCCCCCCCCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050           2 SYGRPPPRIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus         2 ~~~~~~~~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ||.+|+.+.-++.+|||++||..++..+|+.+|++||.|...+++.|..||.++|.+||.
T Consensus       116 SyARPSs~~Ik~aNLYvSGlPktMtqkelE~iFs~fGrIItSRiL~dqvtg~srGVgFiR  175 (360)
T KOG0145|consen  116 SYARPSSDSIKDANLYVSGLPKTMTQKELEQIFSPFGRIITSRILVDQVTGLSRGVGFIR  175 (360)
T ss_pred             EeccCChhhhcccceEEecCCccchHHHHHHHHHHhhhhhhhhhhhhcccceecceeEEE
Confidence            788999888899999999999999999999999999999999999999999999999984


No 30 
>COG0724 RNA-binding proteins (RRM domain) [General function prediction only]
Probab=99.35  E-value=3.8e-12  Score=68.48  Aligned_cols=49  Identities=47%  Similarity=0.708  Sum_probs=46.8

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..+|||+|||..+++++|+++|..+|.+..+.+..++.++..+|+|||+
T Consensus       115 ~~~l~v~nL~~~~~~~~l~~~F~~~g~~~~~~~~~d~~~~~~~g~~~v~  163 (306)
T COG0724         115 NNTLFVGNLPYDVTEEDLRELFKKFGPVKRVRLVRDRETGKSRGFAFVE  163 (306)
T ss_pred             CceEEEeCCCCCCCHHHHHHHHHhcCceeEEEeeeccccCccCceEEEE
Confidence            4899999999999999999999999999999999998899999999985


No 31 
>KOG0130|consensus
Probab=99.35  E-value=2.5e-12  Score=65.38  Aligned_cols=55  Identities=27%  Similarity=0.379  Sum_probs=50.9

Q ss_pred             CCCCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050           7 PPRIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus         7 ~~~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      |+..-++..|||.+++..++++++.+.|..||+|+.+++-.|+.||..+|||+|+
T Consensus        66 PqrSVEGwIi~VtgvHeEatEedi~d~F~dyGeiKNihLNLDRRtGy~KGYaLvE  120 (170)
T KOG0130|consen   66 PQRSVEGWIIFVTGVHEEATEEDIHDKFADYGEIKNIHLNLDRRTGYVKGYALVE  120 (170)
T ss_pred             CccceeeEEEEEeccCcchhHHHHHHHHhhcccccceeeccccccccccceeeee
Confidence            4455667999999999999999999999999999999999999999999999985


No 32 
>KOG0107|consensus
Probab=99.32  E-value=5.5e-12  Score=66.22  Aligned_cols=46  Identities=35%  Similarity=0.673  Sum_probs=41.4

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      +..++|||+||+..+++.+|+..|..||.+.++++..+     +.||||||
T Consensus         8 ~~~~kVYVGnL~~~a~k~eLE~~F~~yG~lrsvWvArn-----PPGfAFVE   53 (195)
T KOG0107|consen    8 NGNTKVYVGNLGSRATKRELERAFSKYGPLRSVWVARN-----PPGFAFVE   53 (195)
T ss_pred             CCCceEEeccCCCCcchHHHHHHHHhcCcceeEEEeec-----CCCceEEe
Confidence            34699999999999999999999999999999988765     57899996


No 33 
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=99.32  E-value=8.2e-12  Score=74.54  Aligned_cols=50  Identities=38%  Similarity=0.540  Sum_probs=46.2

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ....+|||+||+..+++++|+++|+.||.|.++.++.+ .+|.++|||||+
T Consensus       283 ~~~~~l~V~nl~~~~~~~~L~~~F~~~G~i~~~~i~~d-~~g~~~g~gfV~  332 (562)
T TIGR01628       283 AQGVNLYVKNLDDTVTDEKLRELFSECGEITSAKVMLD-EKGVSRGFGFVC  332 (562)
T ss_pred             cCCCEEEEeCCCCccCHHHHHHHHHhcCCeEEEEEEEC-CCCCcCCeEEEE
Confidence            34578999999999999999999999999999999998 689999999985


No 34 
>KOG0148|consensus
Probab=99.29  E-value=6.6e-12  Score=69.62  Aligned_cols=51  Identities=31%  Similarity=0.465  Sum_probs=47.6

Q ss_pred             CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050          10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      .....-+||+-|...++.++|++.|.+||+|.++++++|..|++++||+||
T Consensus        59 ~~~hfhvfvgdls~eI~~e~lr~aF~pFGevS~akvirD~~T~KsKGYgFV  109 (321)
T KOG0148|consen   59 SNQHFHVFVGDLSPEIDNEKLREAFAPFGEVSDAKVIRDMNTGKSKGYGFV  109 (321)
T ss_pred             cccceeEEehhcchhcchHHHHHHhccccccccceEeecccCCcccceeEE
Confidence            344567999999999999999999999999999999999999999999998


No 35 
>KOG4205|consensus
Probab=99.28  E-value=8.6e-12  Score=70.49  Aligned_cols=50  Identities=40%  Similarity=0.581  Sum_probs=48.2

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      +.+++||++|+|.++++.|++.|.++|.+.++.+++|+.++..+||+||+
T Consensus         5 ~~~KlfiGgisw~ttee~Lr~yf~~~Gev~d~~vm~d~~t~rsrgFgfv~   54 (311)
T KOG4205|consen    5 ESGKLFIGGLSWETTEESLREYFSQFGEVTDCVVMRDPSTGRSRGFGFVT   54 (311)
T ss_pred             CCcceeecCcCccccHHHHHHHhcccCceeeEEEeccCCCCCccccccee
Confidence            67999999999999999999999999999999999999999999999985


No 36 
>KOG0111|consensus
Probab=99.27  E-value=2.2e-12  Score=70.06  Aligned_cols=52  Identities=33%  Similarity=0.461  Sum_probs=48.9

Q ss_pred             CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..+.++||||+|...+++.-|...|.+||.|..+.++.|-+++++||||||+
T Consensus         7 a~~KrtlYVGGladeVtekvLhaAFIPFGDI~dIqiPlDyesqkHRgFgFVe   58 (298)
T KOG0111|consen    7 ANQKRTLYVGGLADEVTEKVLHAAFIPFGDIKDIQIPLDYESQKHRGFGFVE   58 (298)
T ss_pred             cccceeEEeccchHHHHHHHHHhccccccchhhcccccchhcccccceeEEE
Confidence            3456899999999999999999999999999999999999999999999996


No 37 
>KOG0124|consensus
Probab=99.25  E-value=3.9e-12  Score=73.18  Aligned_cols=49  Identities=31%  Similarity=0.592  Sum_probs=47.2

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      -|++|||.+.+...++.|+..|.+||+|+++.+..|+.|++++|||||+
T Consensus       113 McRvYVGSIsfEl~EDtiR~AF~PFGPIKSInMSWDp~T~kHKgFAFVE  161 (544)
T KOG0124|consen  113 MCRVYVGSISFELREDTIRRAFDPFGPIKSINMSWDPATGKHKGFAFVE  161 (544)
T ss_pred             hHheeeeeeEEEechHHHHhhccCCCCcceeecccccccccccceEEEE
Confidence            3789999999999999999999999999999999999999999999996


No 38 
>KOG0108|consensus
Probab=99.24  E-value=1.8e-11  Score=71.67  Aligned_cols=48  Identities=29%  Similarity=0.482  Sum_probs=46.8

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      +.+||||+|++++++.|..+|+..|.|.+++++.|++||+.+||||++
T Consensus        19 ~~v~vgnip~~~se~~l~~~~~~~g~v~s~~~v~D~~tG~~~G~~f~~   66 (435)
T KOG0108|consen   19 SSVFVGNIPYEGSEEQLLSIFSGVGPVLSFRLVYDRETGKPKGFGFCE   66 (435)
T ss_pred             cceEecCCCCcccHHHHHHHHhccCccceeeecccccCCCcCceeeEe
Confidence            899999999999999999999999999999999999999999999985


No 39 
>smart00362 RRM_2 RNA recognition motif.
Probab=99.23  E-value=5.9e-11  Score=53.00  Aligned_cols=45  Identities=40%  Similarity=0.727  Sum_probs=40.6

Q ss_pred             EEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          15 SLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        15 ~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      +|+|+|||..+++++|+++|.++|.+..+.+..++  +.++|+|||+
T Consensus         1 ~v~i~~l~~~~~~~~l~~~~~~~g~v~~~~~~~~~--~~~~~~~~v~   45 (72)
T smart00362        1 TLFVGNLPPDVTEEDLKELFSKFGPIESVKIPKDT--GKSKGFAFVE   45 (72)
T ss_pred             CEEEcCCCCcCCHHHHHHHHHhcCCEEEEEEecCC--CCCCceEEEE
Confidence            58999999999999999999999999999888775  6788999984


No 40 
>KOG0144|consensus
Probab=99.21  E-value=5.1e-12  Score=73.49  Aligned_cols=48  Identities=40%  Similarity=0.596  Sum_probs=45.1

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..+||||.|+..++|.+++++|++||.|++|.+++|. -+.+||||||+
T Consensus       124 e~KLFvg~lsK~~te~evr~iFs~fG~Ied~~ilrd~-~~~sRGcaFV~  171 (510)
T KOG0144|consen  124 ERKLFVGMLSKQCTENEVREIFSRFGHIEDCYILRDP-DGLSRGCAFVK  171 (510)
T ss_pred             chhhhhhhccccccHHHHHHHHHhhCccchhhheecc-cccccceeEEE
Confidence            5899999999999999999999999999999999995 48899999985


No 41 
>KOG0145|consensus
Probab=99.20  E-value=6.9e-11  Score=65.64  Aligned_cols=53  Identities=28%  Similarity=0.359  Sum_probs=49.0

Q ss_pred             CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050           9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus         9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ....+.-|||-||.+++++.-|.++|.+||.|..+.+++|..|.+.+||+||+
T Consensus       274 ~~~~g~ciFvYNLspd~de~~LWQlFgpFGAv~nVKvirD~ttnkCKGfgFVt  326 (360)
T KOG0145|consen  274 GPGGGWCIFVYNLSPDADESILWQLFGPFGAVTNVKVIRDFTTNKCKGFGFVT  326 (360)
T ss_pred             CCCCeeEEEEEecCCCchHhHHHHHhCcccceeeEEEEecCCcccccceeEEE
Confidence            34456899999999999999999999999999999999999999999999985


No 42 
>smart00360 RRM RNA recognition motif.
Probab=99.18  E-value=9.6e-11  Score=52.09  Aligned_cols=44  Identities=39%  Similarity=0.684  Sum_probs=40.4

Q ss_pred             EecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          18 VDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        18 v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      |+|||..+++++|+++|.++|.+..+.+..++.++.++|+|||+
T Consensus         1 i~~l~~~~~~~~l~~~f~~~g~v~~~~i~~~~~~~~~~~~a~v~   44 (71)
T smart00360        1 VGNLPPDVTEEELRELFSKFGKIESVRLVRDKDTGKSKGFAFVE   44 (71)
T ss_pred             CCCCCcccCHHHHHHHHHhhCCEeEEEEEeCCCCCCCCceEEEE
Confidence            57899999999999999999999999998887788999999985


No 43 
>KOG0114|consensus
Probab=99.17  E-value=2.5e-10  Score=55.87  Aligned_cols=48  Identities=31%  Similarity=0.520  Sum_probs=41.7

Q ss_pred             CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050          10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      .+..+.|||+|||+.++.+++.++|.+||.|.++++-..+   ..+|-|||
T Consensus        15 pevnriLyirNLp~~ITseemydlFGkyg~IrQIRiG~~k---~TrGTAFV   62 (124)
T KOG0114|consen   15 PEVNRILYIRNLPFKITSEEMYDLFGKYGTIRQIRIGNTK---ETRGTAFV   62 (124)
T ss_pred             hhhheeEEEecCCccccHHHHHHHhhcccceEEEEecCcc---CcCceEEE
Confidence            3456789999999999999999999999999999987654   35788987


No 44 
>KOG0148|consensus
Probab=99.14  E-value=1.6e-10  Score=64.30  Aligned_cols=46  Identities=30%  Similarity=0.514  Sum_probs=41.1

Q ss_pred             CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..+.++|||||++..++++.|++.|++||.|.++++..+      +||+||+
T Consensus       161 sp~NtsVY~G~I~~~lte~~mr~~Fs~fG~I~EVRvFk~------qGYaFVr  206 (321)
T KOG0148|consen  161 SPDNTSVYVGNIASGLTEDLMRQTFSPFGPIQEVRVFKD------QGYAFVR  206 (321)
T ss_pred             CCCCceEEeCCcCccccHHHHHHhcccCCcceEEEEecc------cceEEEE
Confidence            345699999999999999999999999999999998766      6799984


No 45 
>KOG0147|consensus
Probab=99.13  E-value=2.6e-11  Score=71.84  Aligned_cols=46  Identities=37%  Similarity=0.665  Sum_probs=44.7

Q ss_pred             EEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          16 LKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        16 l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      +|||||++++++++|+.+|++||.|..+++.+|.+||.++||+|++
T Consensus       281 l~vgnLHfNite~~lr~ifepfg~Ie~v~l~~d~~tG~skgfGfi~  326 (549)
T KOG0147|consen  281 LYVGNLHFNITEDMLRGIFEPFGKIENVQLTKDSETGRSKGFGFIT  326 (549)
T ss_pred             hhhcccccCchHHHHhhhccCcccceeeeeccccccccccCcceEE
Confidence            8999999999999999999999999999999998899999999985


No 46 
>KOG0127|consensus
Probab=99.10  E-value=1.8e-10  Score=68.82  Aligned_cols=49  Identities=27%  Similarity=0.475  Sum_probs=46.8

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      +.||||++||+.++.++|.++|+..|+|..+.++.++.++.+|||+||+
T Consensus         5 g~TlfV~~lp~~~~~~qL~e~FS~vGPik~~~vVt~~gs~~~RGfgfVt   53 (678)
T KOG0127|consen    5 GATLFVSRLPFSSTGEQLEEFFSYVGPIKHAVVVTNKGSSEKRGFGFVT   53 (678)
T ss_pred             CceEEEecCCCccchhHHHHhhhcccCcceeEEecCCCcccccCcccee
Confidence            4899999999999999999999999999999999999999999999984


No 47 
>KOG0124|consensus
Probab=99.10  E-value=2.9e-10  Score=65.67  Aligned_cols=50  Identities=22%  Similarity=0.423  Sum_probs=46.9

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .-.+|||..++++.+++||+..|+-||+|.+|.+.+++.++.++||+|++
T Consensus       209 ~fnRiYVaSvHpDLSe~DiKSVFEAFG~I~~C~LAr~pt~~~HkGyGfiE  258 (544)
T KOG0124|consen  209 KFNRIYVASVHPDLSETDIKSVFEAFGEIVKCQLARAPTGRGHKGYGFIE  258 (544)
T ss_pred             hhheEEeeecCCCccHHHHHHHHHhhcceeeEEeeccCCCCCccceeeEE
Confidence            34689999999999999999999999999999999999888999999986


No 48 
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=99.09  E-value=3.4e-10  Score=66.97  Aligned_cols=43  Identities=21%  Similarity=0.218  Sum_probs=38.7

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .++|||+|||+.+++++|+++|++||.|.++.++.+      +|+|||+
T Consensus         2 s~vv~V~nLp~~~te~~L~~~f~~fG~V~~v~i~~~------k~~afVe   44 (481)
T TIGR01649         2 SPVVHVRNLPQDVVEADLVEALIPFGPVSYVMMLPG------KRQALVE   44 (481)
T ss_pred             ccEEEEcCCCCCCCHHHHHHHHHhcCCeeEEEEECC------CCEEEEE
Confidence            378999999999999999999999999999988753      5799985


No 49 
>KOG0105|consensus
Probab=99.07  E-value=4.2e-10  Score=59.95  Aligned_cols=48  Identities=21%  Similarity=0.347  Sum_probs=40.6

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ...++|||+|||.++.+.+|+++|.+||.|..+.|...+   ...+||||+
T Consensus         4 r~~~~iyvGNLP~diRekeieDlFyKyg~i~~ieLK~r~---g~ppfafVe   51 (241)
T KOG0105|consen    4 RNSRRIYVGNLPGDIREKEIEDLFYKYGRIREIELKNRP---GPPPFAFVE   51 (241)
T ss_pred             cccceEEecCCCcchhhccHHHHHhhhcceEEEEeccCC---CCCCeeEEE
Confidence            345899999999999999999999999999999875432   246799985


No 50 
>cd00590 RRM RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices.  RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight.  The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Probab=99.07  E-value=1.1e-09  Score=48.96  Aligned_cols=46  Identities=46%  Similarity=0.703  Sum_probs=40.6

Q ss_pred             EEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          15 SLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        15 ~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      +|+|+|||..+++++++++|+.+|.+..+.+..++.+ ..+|+|||+
T Consensus         1 ~i~i~~l~~~~~~~~i~~~~~~~g~i~~~~~~~~~~~-~~~~~~~v~   46 (74)
T cd00590           1 TLFVGNLPPDVTEEDLRELFSKFGKVESVRIVRDKDT-KSKGFAFVE   46 (74)
T ss_pred             CEEEeCCCCccCHHHHHHHHHhcCCEEEEEEeeCCCC-CcceEEEEE
Confidence            4899999999999999999999999999998887654 678899874


No 51 
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=99.05  E-value=1.1e-09  Score=64.85  Aligned_cols=45  Identities=20%  Similarity=0.244  Sum_probs=40.2

Q ss_pred             CCcEEEEecCCC-CCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          12 GMVSLKVDNLTY-RTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        12 ~~~~l~v~~l~~-~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .+++|||+||++ .+++++|+++|+.||.|.++.++++     .+|+|||+
T Consensus       274 ~~~~l~v~nL~~~~vt~~~L~~lF~~yG~V~~vki~~~-----~~g~afV~  319 (481)
T TIGR01649       274 PGSVLMVSGLHQEKVNCDRLFNLFCVYGNVERVKFMKN-----KKETALIE  319 (481)
T ss_pred             CCCEEEEeCCCCCCCCHHHHHHHHHhcCCeEEEEEEeC-----CCCEEEEE
Confidence            457999999998 6999999999999999999999876     26899985


No 52 
>KOG0109|consensus
Probab=98.97  E-value=7e-10  Score=62.25  Aligned_cols=35  Identities=34%  Similarity=0.522  Sum_probs=33.1

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD   48 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~   48 (61)
                      .+|||||||..+++.+|+.+|++||+|.+|+++++
T Consensus         3 ~KLFIGNLp~~~~~~elr~lFe~ygkVlECDIvKN   37 (346)
T KOG0109|consen    3 VKLFIGNLPREATEQELRSLFEQYGKVLECDIVKN   37 (346)
T ss_pred             cchhccCCCcccchHHHHHHHHhhCceEeeeeecc
Confidence            57999999999999999999999999999999876


No 53 
>KOG0146|consensus
Probab=98.96  E-value=1.7e-09  Score=60.42  Aligned_cols=53  Identities=23%  Similarity=0.231  Sum_probs=49.6

Q ss_pred             CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050           9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus         9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      +..++|+|||-.||.+..+.+|..+|.+||.|.+.++..|+.|..+++|+||.
T Consensus       281 eGPeGCNlFIYHLPQEFgDaEliQmF~PFGhivSaKVFvDRATNQSKCFGFVS  333 (371)
T KOG0146|consen  281 EGPEGCNLFIYHLPQEFGDAELIQMFLPFGHIVSAKVFVDRATNQSKCFGFVS  333 (371)
T ss_pred             cCCCcceEEEEeCchhhccHHHHHHhccccceeeeeeeehhccccccceeeEe
Confidence            45678999999999999999999999999999999999999999999999983


No 54 
>KOG0415|consensus
Probab=98.88  E-value=2.7e-09  Score=61.45  Aligned_cols=48  Identities=35%  Similarity=0.436  Sum_probs=46.6

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..|||..|++.++.++|.-+|+.||.|.+|.+++|..||.+-.||||+
T Consensus       240 NVLFVCKLNPVTtDeDLeiIFSrFG~i~sceVIRD~ktgdsLqyaFiE  287 (479)
T KOG0415|consen  240 NVLFVCKLNPVTTDEDLEIIFSRFGKIVSCEVIRDRKTGDSLQYAFIE  287 (479)
T ss_pred             ceEEEEecCCcccccchhhHHhhcccceeeeEEecccccchhheeeee
Confidence            789999999999999999999999999999999999999999999986


No 55 
>KOG4205|consensus
Probab=98.88  E-value=4.2e-09  Score=59.77  Aligned_cols=49  Identities=37%  Similarity=0.571  Sum_probs=46.5

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..++||++||..+++.++++.|.++|.|..+.++.|..+.+++||+||+
T Consensus        97 tkkiFvGG~~~~~~e~~~r~yfe~~g~v~~~~~~~d~~~~~~rgFgfv~  145 (311)
T KOG4205|consen   97 TKKIFVGGLPPDTTEEDFKDYFEQFGKVADVVIMYDKTTSRPRGFGFVT  145 (311)
T ss_pred             eeEEEecCcCCCCchHHHhhhhhccceeEeeEEeecccccccccceeeE
Confidence            5699999999999999999999999999999999999999999999984


No 56 
>KOG0146|consensus
Probab=98.85  E-value=3.1e-09  Score=59.39  Aligned_cols=49  Identities=31%  Similarity=0.459  Sum_probs=45.0

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ++++||||-|...-.|++++.+|..||.+.+|.+.+.+. |.++|+|||+
T Consensus        18 ~drklfvgml~kqq~e~dvrrlf~pfG~~~e~tvlrg~d-g~sKGCAFVK   66 (371)
T KOG0146|consen   18 DDRKLFVGMLNKQQSEDDVRRLFQPFGNIEECTVLRGPD-GNSKGCAFVK   66 (371)
T ss_pred             cchhhhhhhhcccccHHHHHHHhcccCCcceeEEecCCC-CCCCCceEEE
Confidence            568999999999999999999999999999999988864 8899999985


No 57 
>KOG0131|consensus
Probab=98.82  E-value=4.7e-09  Score=55.65  Aligned_cols=51  Identities=27%  Similarity=0.301  Sum_probs=45.3

Q ss_pred             CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEE-EEccCCCCCCceeEEEE
Q psy7050          10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDI-YIPRDRFTRESRGFAFV   60 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~-~~~~~~~t~~~~g~~fv   60 (61)
                      .+.+.++||+||.+.+++..|.+.|+.||.+.+. .+++++.||.++||||+
T Consensus        93 l~vganlfvgNLd~~vDe~~L~dtFsafG~l~~~P~i~rd~~tg~~~~~g~i  144 (203)
T KOG0131|consen   93 LDVGANLFVGNLDPEVDEKLLYDTFSAFGVLISPPKIMRDPDTGNPKGFGFI  144 (203)
T ss_pred             ccccccccccccCcchhHHHHHHHHHhccccccCCcccccccCCCCCCCeEE
Confidence            4456899999999999999999999999987664 68889999999999987


No 58 
>KOG4208|consensus
Probab=98.79  E-value=2.5e-08  Score=53.66  Aligned_cols=53  Identities=25%  Similarity=0.348  Sum_probs=46.3

Q ss_pred             CCCCCcEEEEecCCCCCCHHHHHHHhhhC-CCeEEEEEccCCCCCCceeEEEEC
Q psy7050           9 RIEGMVSLKVDNLTYRTTPDDLRRVFERC-GEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus         9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~-G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      +.....-+|+..+|....+..+...|.++ |.+...++.+++.||.++|||||+
T Consensus        45 ~~~~~g~~~~~~~p~g~~e~~~~~~~~q~~g~v~r~rlsRnkrTGNSKgYAFVE   98 (214)
T KOG4208|consen   45 EQEIEGVVYVDHIPHGFFETEILNYFRQFGGTVTRFRLSRNKRTGNSKGYAFVE   98 (214)
T ss_pred             ccCCccceeecccccchhHHHHhhhhhhcCCeeEEEEeecccccCCcCceEEEE
Confidence            44445678999999999999999999887 788888888999999999999996


No 59 
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=98.78  E-value=3.6e-08  Score=58.40  Aligned_cols=46  Identities=17%  Similarity=0.291  Sum_probs=34.7

Q ss_pred             CCCCcEEEEecCCCCCCHHHHHHHhhhC------------CCeEEEEEccCCCCCCceeEEEEC
Q psy7050          10 IEGMVSLKVDNLTYRTTPDDLRRVFERC------------GEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~------------G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ....++|||+|||+.+++++|.++|..+            +.|..+.+      .+.+|||||+
T Consensus       172 ~~~~r~lyVgnLp~~~t~~~l~~~F~~~~~~~~~~~~~~~~~v~~~~~------~~~kg~afVe  229 (509)
T TIGR01642       172 TRQARRLYVGGIPPEFVEEAVVDFFNDLMIATGYHKAEDGKHVSSVNI------NKEKNFAFLE  229 (509)
T ss_pred             CccccEEEEeCCCCCCCHHHHHHHHHHHHHhcCCCCCCCCCceEEEEE------CCCCCEEEEE
Confidence            3446899999999999999999999874            23333333      3457899995


No 60 
>KOG0110|consensus
Probab=98.78  E-value=6.2e-09  Score=63.55  Aligned_cols=49  Identities=24%  Similarity=0.535  Sum_probs=45.0

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..+|.|+|+|+.++-.+++.+|..||.+.+++++.-...+.++|||||+
T Consensus       613 ~tKIlVRNipFeAt~rEVr~LF~aFGqlksvRlPKK~~k~a~rGF~Fv~  661 (725)
T KOG0110|consen  613 GTKILVRNIPFEATKREVRKLFTAFGQLKSVRLPKKIGKGAHRGFGFVD  661 (725)
T ss_pred             cceeeeeccchHHHHHHHHHHHhcccceeeeccchhhcchhhccceeee
Confidence            5789999999999999999999999999999999876667889999985


No 61 
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=98.75  E-value=4.1e-08  Score=59.55  Aligned_cols=42  Identities=31%  Similarity=0.413  Sum_probs=36.6

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhC--CCeEEEEEccCCCCCCceeEEEEC
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERC--GEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~--G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..++|||+||++.+++++|+++|+.|  |.|.++.++        ++||||+
T Consensus       232 ~~k~LfVgNL~~~~tee~L~~~F~~f~~G~I~rV~~~--------rgfAFVe  275 (578)
T TIGR01648       232 KVKILYVRNLMTTTTEEIIEKSFSEFKPGKVERVKKI--------RDYAFVH  275 (578)
T ss_pred             cccEEEEeCCCCCCCHHHHHHHHHhcCCCceEEEEee--------cCeEEEE
Confidence            34789999999999999999999999  999998654        4589985


No 62 
>KOG0153|consensus
Probab=98.72  E-value=3.1e-08  Score=56.69  Aligned_cols=48  Identities=29%  Similarity=0.454  Sum_probs=40.0

Q ss_pred             CCCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050           8 PRIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus         8 ~~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ++...-.+|||++|...+++.+|+++|.+||+|+++.+.-.      +++|||+
T Consensus       223 PeD~~I~tLyIg~l~d~v~e~dIrdhFyqyGeirsi~~~~~------~~CAFv~  270 (377)
T KOG0153|consen  223 PEDTSIKTLYIGGLNDEVLEQDIRDHFYQYGEIRSIRILPR------KGCAFVT  270 (377)
T ss_pred             CcccceeEEEecccccchhHHHHHHHHhhcCCeeeEEeecc------cccceee
Confidence            34555689999999999999999999999999999987643      4478874


No 63 
>KOG0116|consensus
Probab=98.67  E-value=4.5e-08  Score=57.48  Aligned_cols=49  Identities=27%  Similarity=0.355  Sum_probs=40.0

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..+|||+|||++++.+.|++.|..||.|+...+..-...++..+||||+
T Consensus       288 ~~~i~V~nlP~da~~~~l~~~Fk~FG~Ik~~~I~vr~~~~~~~~fgFV~  336 (419)
T KOG0116|consen  288 GLGIFVKNLPPDATPAELEEVFKQFGPIKEGGIQVRSPGGKNPCFGFVE  336 (419)
T ss_pred             ccceEeecCCCCCCHHHHHHHHhhcccccccceEEeccCCCcCceEEEE
Confidence            4569999999999999999999999999998775543334445899984


No 64 
>PF04059 RRM_2:  RNA recognition motif 2;  InterPro: IPR007201 This RNA recognition motif 2 is found in Meiosis protein mei2. It is found C-terminal to the RNA-binding region RNP-1 (IPR000504 from INTERPRO).
Probab=98.67  E-value=2.1e-07  Score=45.14  Aligned_cols=47  Identities=23%  Similarity=0.441  Sum_probs=41.7

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhh--CCCeEEEEEccCCCCCCceeEEEE
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFER--CGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~--~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      +||.++|+|...+.++|.+++..  .|...-+.++.|..++.+.|||||
T Consensus         2 TTvMirNIPn~~t~~~L~~~l~~~~~g~yDF~YLPiDf~~~~N~GYAFV   50 (97)
T PF04059_consen    2 TTVMIRNIPNKYTQEMLIQILDEHFKGKYDFFYLPIDFKNKCNLGYAFV   50 (97)
T ss_pred             eeEEEecCCCCCCHHHHHHHHHHhccCcceEEEeeeeccCCCceEEEEE
Confidence            68999999999999999888865  367777889999999999999998


No 65 
>KOG4212|consensus
Probab=98.61  E-value=1.3e-07  Score=55.95  Aligned_cols=48  Identities=27%  Similarity=0.340  Sum_probs=43.5

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhh-CCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFER-CGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~-~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .+.+||.|+|+++.|.+|++++.. -|+|..+.+..| .+|+.+|+|.||
T Consensus        44 ~R~vfItNIpyd~rWqdLKdLvrekvGev~yveLl~D-~~GK~rGcavVE   92 (608)
T KOG4212|consen   44 DRSVFITNIPYDYRWQDLKDLVREKVGEVEYVELLFD-ESGKARGCAVVE   92 (608)
T ss_pred             cceEEEecCcchhhhHhHHHHHHHhcCceEeeeeecc-cCCCcCCceEEE
Confidence            356999999999999999999975 799999999999 579999999986


No 66 
>KOG0117|consensus
Probab=98.57  E-value=1e-07  Score=56.20  Aligned_cols=40  Identities=40%  Similarity=0.650  Sum_probs=36.1

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      -+.|||+||+.++|++.|+++|..||.|..+.-++|        ||||
T Consensus       259 VKvLYVRNL~~~tTeE~lk~~F~~~G~veRVkk~rD--------YaFV  298 (506)
T KOG0117|consen  259 VKVLYVRNLMESTTEETLKKLFNEFGKVERVKKPRD--------YAFV  298 (506)
T ss_pred             eeeeeeeccchhhhHHHHHHHHHhccceEEeecccc--------eeEE
Confidence            467999999999999999999999999999877655        8887


No 67 
>KOG0123|consensus
Probab=98.56  E-value=1.6e-07  Score=54.52  Aligned_cols=45  Identities=22%  Similarity=0.359  Sum_probs=40.8

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..+||.||++.++.++|.++|+.||.|.+|.+..+.. | .+|| ||+
T Consensus        77 ~~~~i~nl~~~~~~~~~~d~f~~~g~ilS~kv~~~~~-g-~kg~-FV~  121 (369)
T KOG0123|consen   77 SLVFIKNLDESIDNKSLYDTFSEFGNILSCKVATDEN-G-SKGY-FVQ  121 (369)
T ss_pred             ceeeecCCCcccCcHHHHHHHHhhcCeeEEEEEEcCC-C-ceee-EEE
Confidence            4499999999999999999999999999999999854 5 8999 984


No 68 
>KOG4661|consensus
Probab=98.54  E-value=1.8e-07  Score=56.93  Aligned_cols=49  Identities=24%  Similarity=0.363  Sum_probs=44.8

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      +++|+|.+|+..+...+|+.+|++||.|.-..++.+..+.-.++|+||+
T Consensus       405 gRNlWVSGLSstTRAtDLKnlFSKyGKVvGAKVVTNaRsPGaRCYGfVT  453 (940)
T KOG4661|consen  405 GRNLWVSGLSSTTRATDLKNLFSKYGKVVGAKVVTNARSPGARCYGFVT  453 (940)
T ss_pred             ccceeeeccccchhhhHHHHHHHHhcceeceeeeecCCCCCcceeEEEE
Confidence            5789999999999999999999999999999999887777789999996


No 69 
>KOG4206|consensus
Probab=98.53  E-value=5.2e-07  Score=49.12  Aligned_cols=47  Identities=34%  Similarity=0.473  Sum_probs=39.2

Q ss_pred             CCCcEEEEecCCCCCCHHHHHH----HhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRR----VFERCGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~----~f~~~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      +...||||.||+..+..++|+.    +|++||.|.++...   .+.+.+|-|||
T Consensus         7 ~pn~TlYInnLnekI~~~elkrsL~~LFsqfG~ildI~a~---kt~KmRGQA~V   57 (221)
T KOG4206|consen    7 NPNGTLYINNLNEKIKKDELKRSLYLLFSQFGKILDISAF---KTPKMRGQAFV   57 (221)
T ss_pred             CCCceEeehhccccccHHHHHHHHHHHHHhhCCeEEEEec---CCCCccCceEE
Confidence            3335999999999999888776    99999999988765   35778999987


No 70 
>KOG4209|consensus
Probab=98.52  E-value=1.2e-07  Score=52.04  Aligned_cols=51  Identities=29%  Similarity=0.621  Sum_probs=46.8

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      -+...+|++|+.+.++...+..+|+.||.+..+.++.|+.+++++||+||+
T Consensus        99 ~d~~sv~v~nvd~~~t~~~~e~hf~~Cg~i~~~ti~~d~~~~~~k~~~yve  149 (231)
T KOG4209|consen   99 VDAPSVWVGNVDFLVTLTKIELHFESCGGINRVTVPKDKFRGHPKGFAYVE  149 (231)
T ss_pred             cCCceEEEeccccccccchhhheeeccCCccceeeeccccCCCcceeEEEe
Confidence            345789999999999998899999999999999999999999999999985


No 71 
>smart00361 RRM_1 RNA recognition motif.
Probab=98.50  E-value=3.2e-07  Score=41.80  Aligned_cols=35  Identities=34%  Similarity=0.566  Sum_probs=28.7

Q ss_pred             HHHHHHHhh----hCCCeEEEE-EccCCCC--CCceeEEEEC
Q psy7050          27 PDDLRRVFE----RCGEVGDIY-IPRDRFT--RESRGFAFVR   61 (61)
Q Consensus        27 ~~~l~~~f~----~~G~i~~~~-~~~~~~t--~~~~g~~fv~   61 (61)
                      +++|++.|+    +||.+.++. +..++.+  +.++|||||+
T Consensus         2 ~~~l~~~~~~~~~~fG~v~~v~~v~~~~~~~~~~~rG~~fV~   43 (70)
T smart00361        2 DEDFEREFSEEEEYFGEVGKINKIYIDNVGYENHKRGNVYIT   43 (70)
T ss_pred             chhHHHHHHHHHHhcCCeeEEEEEEeCCCCCCCCCcEEEEEE
Confidence            467788887    999999985 7777666  8899999985


No 72 
>KOG0109|consensus
Probab=98.47  E-value=1.9e-07  Score=52.70  Aligned_cols=38  Identities=32%  Similarity=0.445  Sum_probs=34.8

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD   48 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~   48 (61)
                      ....+++|+|+.+.++.+++++.|++||.+.++++++|
T Consensus        76 k~stkl~vgNis~tctn~ElRa~fe~ygpviecdivkd  113 (346)
T KOG0109|consen   76 KASTKLHVGNISPTCTNQELRAKFEKYGPVIECDIVKD  113 (346)
T ss_pred             CCccccccCCCCccccCHHHhhhhcccCCceeeeeecc
Confidence            34579999999999999999999999999999998866


No 73 
>KOG0132|consensus
Probab=98.47  E-value=2.7e-07  Score=57.33  Aligned_cols=43  Identities=33%  Similarity=0.531  Sum_probs=38.0

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .+|||||+|+..+++.+|..+|+.||.|.++.+.-      ++|+|||+
T Consensus       421 SrTLwvG~i~k~v~e~dL~~~feefGeiqSi~li~------~R~cAfI~  463 (894)
T KOG0132|consen  421 SRTLWVGGIPKNVTEQDLANLFEEFGEIQSIILIP------PRGCAFIK  463 (894)
T ss_pred             eeeeeeccccchhhHHHHHHHHHhcccceeEeecc------CCceeEEE
Confidence            47999999999999999999999999999987653      57789974


No 74 
>KOG0123|consensus
Probab=98.32  E-value=1.7e-06  Score=50.40  Aligned_cols=51  Identities=29%  Similarity=0.431  Sum_probs=45.5

Q ss_pred             CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .-...+|||.|++..++.+.|+++|+.+|.|.++.++.+ ..+.++||+||+
T Consensus       267 ~~~~~nl~vknld~~~~~e~L~~~f~~~GeI~s~kv~~~-~~g~skG~gfV~  317 (369)
T KOG0123|consen  267 SLQGANLYVKNLDETLSDEKLRKIFSSFGEITSAKVMVD-ENGKSKGFGFVE  317 (369)
T ss_pred             cccccccccccCccccchhHHHHHHhcccceeeEEEEec-cCCCccceEEEE
Confidence            344578999999999999999999999999999999888 568999999984


No 75 
>KOG4454|consensus
Probab=98.23  E-value=7.8e-07  Score=48.71  Aligned_cols=50  Identities=28%  Similarity=0.349  Sum_probs=42.9

Q ss_pred             CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .+..++|||+|+...++++-|.++|-+.|+|.++.|+.++. ++.+ ||||.
T Consensus         6 ae~drtl~v~n~~~~v~eelL~ElfiqaGPV~kv~ip~~~d-~~~k-Fa~v~   55 (267)
T KOG4454|consen    6 AEMDRTLLVQNMYSGVSEELLSELFIQAGPVYKVGIPSGQD-QEQK-FAYVF   55 (267)
T ss_pred             cchhhHHHHHhhhhhhhHHHHHHHhhccCceEEEeCCCCcc-CCCc-eeeee
Confidence            45568999999999999999999999999999999988865 4445 88874


No 76 
>KOG4849|consensus
Probab=98.22  E-value=4.8e-07  Score=52.40  Aligned_cols=47  Identities=28%  Similarity=0.457  Sum_probs=40.2

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCC--CeEEEEEccCCCCCCceeEEEE
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCG--EVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G--~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      ..+|||||-|++++++|.+.+...|  .+.++.+..++.+|+++|||+|
T Consensus        81 ~~~YvGNL~W~TTD~DL~~A~~S~G~~~~~dmKFFENR~NGQSKG~AL~  129 (498)
T KOG4849|consen   81 YCCYVGNLLWYTTDADLLKALQSTGLAQFADMKFFENRTNGQSKGYALL  129 (498)
T ss_pred             EEEEecceeEEeccHHHHHHHHhhhHHHHhhhhhhhcccCCcccceEEE
Confidence            4689999999999999998887765  4566778888999999999986


No 77 
>KOG0533|consensus
Probab=98.20  E-value=5.9e-06  Score=45.81  Aligned_cols=49  Identities=31%  Similarity=0.438  Sum_probs=41.4

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      ....+|+|.|||..+.+++|+++|..+|.+..+.+.+++ .|.+.|.|-|
T Consensus        81 ~~~~~v~v~NL~~~V~~~Dl~eLF~~~~~~~r~~vhy~~-~G~s~Gta~v  129 (243)
T KOG0533|consen   81 TRSTKVNVSNLPYGVIDADLKELFAEFGELKRVAVHYDR-AGRSLGTADV  129 (243)
T ss_pred             CCcceeeeecCCcCcchHHHHHHHHHhccceEEeeccCC-CCCCCcccee
Confidence            334789999999999999999999999988888888885 5777776644


No 78 
>KOG4212|consensus
Probab=98.19  E-value=2e-06  Score=51.08  Aligned_cols=45  Identities=24%  Similarity=0.253  Sum_probs=38.7

Q ss_pred             CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCcee
Q psy7050           9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRG   56 (61)
Q Consensus         9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g   56 (61)
                      ...+.|+|+|+|||+++||+.|++-|..+|.+....++.   .|+.+|
T Consensus       532 aarKa~qIiirNlP~dfTWqmlrDKfre~G~v~yadime---~GkskG  576 (608)
T KOG4212|consen  532 AARKACQIIIRNLPFDFTWQMLRDKFREIGHVLYADIME---NGKSKG  576 (608)
T ss_pred             ccccccEEEEecCCccccHHHHHHHHHhccceehhhhhc---cCCccc
Confidence            356679999999999999999999999999999988853   366776


No 79 
>KOG0226|consensus
Probab=98.15  E-value=2.9e-06  Score=47.30  Aligned_cols=50  Identities=26%  Similarity=0.395  Sum_probs=44.1

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      +.+.+||+|.|..+++++.|...|.+|-.-....+++|+.||+++||+||
T Consensus       188 ~~DfRIfcgdlgNevnd~vl~raf~Kfpsf~~akviRdkRTgKSkgygfV  237 (290)
T KOG0226|consen  188 EDDFRIFCGDLGNEVNDDVLARAFKKFPSFQKAKVIRDKRTGKSKGYGFV  237 (290)
T ss_pred             cccceeecccccccccHHHHHHHHHhccchhhccccccccccccccceee
Confidence            34589999999999999999999999866556678899999999999998


No 80 
>KOG4211|consensus
Probab=98.08  E-value=1.7e-05  Score=47.47  Aligned_cols=50  Identities=22%  Similarity=0.485  Sum_probs=40.6

Q ss_pred             CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050           9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus         9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      +.+...-|.+++|||.+|+++|.++|+.++ |..+.+.  +.+|+..|=|||+
T Consensus         6 e~~~~~~vr~rGLPwsat~~ei~~Ff~~~~-I~~~~~~--r~~Gr~sGeA~Ve   55 (510)
T KOG4211|consen    6 EGSTAFEVRLRGLPWSATEKEILDFFSNCG-IENLEIP--RRNGRPSGEAYVE   55 (510)
T ss_pred             CCCcceEEEecCCCccccHHHHHHHHhcCc-eeEEEEe--ccCCCcCcceEEE
Confidence            445567888999999999999999999985 6665444  4579999999985


No 81 
>KOG0120|consensus
Probab=98.07  E-value=6.1e-06  Score=49.61  Aligned_cols=53  Identities=23%  Similarity=0.370  Sum_probs=48.2

Q ss_pred             CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050           9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus         9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ......+++|++||...++..+.++...||.+....++.+..+|.++||||.+
T Consensus       285 ~~~~~~ki~v~~lp~~l~~~q~~Ell~~fg~lk~f~lv~d~~~g~skg~af~e  337 (500)
T KOG0120|consen  285 VPDSPNKIFVGGLPLYLTEDQVKELLDSFGPLKAFRLVKDSATGNSKGFAFCE  337 (500)
T ss_pred             cccccchhhhccCcCccCHHHHHHHHHhcccchhheeecccccccccceeeee
Confidence            34556899999999999999999999999999999999999999999999964


No 82 
>KOG1548|consensus
Probab=97.99  E-value=3.8e-05  Score=44.44  Aligned_cols=49  Identities=24%  Similarity=0.360  Sum_probs=36.3

Q ss_pred             CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEE--------EEEccCCCCCCceeEEE
Q psy7050          10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGD--------IYIPRDRFTRESRGFAF   59 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~--------~~~~~~~~t~~~~g~~f   59 (61)
                      ......|||.|||.++|-+.+.++|++||-|..        |.+-++. .|+.+|=|.
T Consensus       131 ~~~Nt~VYVsgLP~DiT~dE~~~~~sKcGiI~~d~~t~epk~KlYrd~-~G~lKGDaL  187 (382)
T KOG1548|consen  131 PKVNTSVYVSGLPLDITVDEFAEVMSKCGIIMRDPQTGEPKVKLYRDN-QGKLKGDAL  187 (382)
T ss_pred             cccCceEEecCCCCcccHHHHHHHHHhcceEeccCCCCCeeEEEEecC-CCCccCceE
Confidence            445677999999999999999999999997643        4455553 366666443


No 83 
>KOG0110|consensus
Probab=97.97  E-value=1.4e-05  Score=49.52  Aligned_cols=48  Identities=38%  Similarity=0.532  Sum_probs=39.3

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCC---CceeEEEEC
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTR---ESRGFAFVR   61 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~---~~~g~~fv~   61 (61)
                      ++||+.||++.++.+++..+|...|.|.++.+...+...   -+.|||||+
T Consensus       516 t~lfvkNlnf~Tt~e~l~~~F~k~G~VlS~~I~kkkd~~~k~lSmGfgFVE  566 (725)
T KOG0110|consen  516 TKLFVKNLNFDTTLEDLEDLFSKQGTVLSIEISKKKDPANKYLSMGFGFVE  566 (725)
T ss_pred             hhhhhhcCCcccchhHHHHHHHhcCeEEEEEEeccccccccccccceeEEE
Confidence            459999999999999999999999999998776554321   234999996


No 84 
>KOG4210|consensus
Probab=97.91  E-value=8.6e-06  Score=46.09  Aligned_cols=50  Identities=32%  Similarity=0.573  Sum_probs=45.0

Q ss_pred             CCcEEE-EecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          12 GMVSLK-VDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        12 ~~~~l~-v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ...++| ++++++.+++++|+.+|..+|.+..++++.++.++..+|||+++
T Consensus       183 ~s~~~~~~~~~~f~~~~d~~~~~~~~~~~i~~~r~~~~~~s~~~kg~a~~~  233 (285)
T KOG4210|consen  183 PSDTIFFVGELDFSLTRDDLKEHFVSSGEITSVRLPTDEESGDSKGFAYVD  233 (285)
T ss_pred             ccccceeecccccccchHHHhhhccCcCcceeeccCCCCCccchhhhhhhh
Confidence            345666 99999999999999999999999999999999999999999863


No 85 
>KOG0151|consensus
Probab=97.85  E-value=4e-05  Score=47.86  Aligned_cols=39  Identities=28%  Similarity=0.317  Sum_probs=34.1

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCC
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDR   49 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~   49 (61)
                      ...+++|++||++.++++.|...|..||+|.++.++..+
T Consensus       172 P~TTNlyv~Nlnpsv~E~~ll~tfGrfgPlasvKimwpR  210 (877)
T KOG0151|consen  172 PQTTNLYVGNLNPSVDENFLLRTFGRFGPLASVKIMWPR  210 (877)
T ss_pred             CcccceeeecCCccccHHHHHHHhcccCcccceeeeccc
Confidence            445789999999999999999999999999988776543


No 86 
>KOG0147|consensus
Probab=97.81  E-value=3.5e-06  Score=50.74  Aligned_cols=51  Identities=33%  Similarity=0.513  Sum_probs=46.7

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ...+++|+-.++...+.-+|.++|+..|+|..+.++.|+.++..+|.+||+
T Consensus       177 Rd~Rtvf~~qla~r~~pRdL~efFs~~gkVrdVriI~Dr~s~rskgi~Yve  227 (549)
T KOG0147|consen  177 RDQRTVFCMQLARRNPPRDLEEFFSIVGKVRDVRIIGDRNSRRSKGIAYVE  227 (549)
T ss_pred             HhHHHHHHHHHhhcCCchhHHHHHHhhcCcceeEeeccccchhhcceeEEE
Confidence            345788998999999999999999999999999999999999999999985


No 87 
>KOG0106|consensus
Probab=97.79  E-value=2.7e-05  Score=42.56  Aligned_cols=32  Identities=25%  Similarity=0.508  Sum_probs=29.2

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEE
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYI   45 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~   45 (61)
                      ..+||++||+.+.+.++..+|..||.+..+.+
T Consensus         2 ~rv~vg~~~~~~~~~d~E~~f~~yg~~~d~~m   33 (216)
T KOG0106|consen    2 PRVYIGRLPYRARERDVERFFKGYGKIPDADM   33 (216)
T ss_pred             CceeecccCCccchhHHHHHHhhcccccccee
Confidence            46899999999999999999999999888754


No 88 
>KOG4660|consensus
Probab=97.78  E-value=4.9e-05  Score=46.03  Aligned_cols=45  Identities=36%  Similarity=0.473  Sum_probs=36.9

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      ...+|+|-|||..+++++|+++|+.||+|+.++     .|...+|..||+
T Consensus        74 ~~~~L~v~nl~~~Vsn~~L~~~f~~yGeir~ir-----~t~~~~~~~~v~  118 (549)
T KOG4660|consen   74 NQGTLVVFNLPRSVSNDTLLRIFGAYGEIREIR-----ETPNKRGIVFVE  118 (549)
T ss_pred             ccceEEEEecCCcCCHHHHHHHHHhhcchhhhh-----cccccCceEEEE
Confidence            347999999999999999999999999999854     344456777764


No 89 
>PF13893 RRM_5:  RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); PDB: 2PE8_A 2PEH_A 2D9O_A 1A9N_D 2DIT_A 3S6E_A 2LQ5_A 1SJQ_A 2AD9_A 1X4D_A ....
Probab=97.69  E-value=7.7e-05  Score=32.28  Aligned_cols=27  Identities=48%  Similarity=0.843  Sum_probs=20.9

Q ss_pred             HHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          30 LRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        30 l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      |.++|++||.|.++.+..+.     +++|||+
T Consensus         1 L~~~f~~fG~V~~i~~~~~~-----~~~a~V~   27 (56)
T PF13893_consen    1 LYKLFSKFGEVKKIKIFKKK-----RGFAFVE   27 (56)
T ss_dssp             HHHHHTTTS-EEEEEEETTS-----TTEEEEE
T ss_pred             ChHHhCCcccEEEEEEEeCC-----CCEEEEE
Confidence            67899999999999886553     5788885


No 90 
>KOG0129|consensus
Probab=97.66  E-value=0.00026  Score=42.80  Aligned_cols=52  Identities=23%  Similarity=0.214  Sum_probs=43.5

Q ss_pred             CCCCCcEEEEecCCCCCCHHHHHHHhhh-CCCeEEEEEccCCCCCCceeEEEE
Q psy7050           9 RIEGMVSLKVDNLTYRTTPDDLRRVFER-CGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus         9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~-~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      ..+..+|||||+||.-++..+|..+|.. ||.|..+.|=.|++=+.++|-|-|
T Consensus       366 ~lDprrTVFVGgvprpl~A~eLA~imd~lyGgV~yaGIDtD~k~KYPkGaGRV  418 (520)
T KOG0129|consen  366 PIDPRRTVFVGGLPRPLTAEELAMIMEDLFGGVLYVGIDTDPKLKYPKGAGRV  418 (520)
T ss_pred             ccCccceEEecCCCCcchHHHHHHHHHHhcCceEEEEeccCcccCCCCCccee
Confidence            4566689999999999999999999995 999999988778666667776655


No 91 
>KOG0129|consensus
Probab=97.31  E-value=0.00059  Score=41.32  Aligned_cols=33  Identities=21%  Similarity=0.347  Sum_probs=29.4

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEc
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIP   46 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~   46 (61)
                      .++||||+||++++|+.|...|..||.+ .+.|+
T Consensus       259 S~KVFvGGlp~dise~~i~~~F~~FGs~-~VdWP  291 (520)
T KOG0129|consen  259 SRKVFVGGLPWDITEAQINASFGQFGSV-KVDWP  291 (520)
T ss_pred             ccceeecCCCccccHHHHHhhcccccce-EeecC
Confidence            5799999999999999999999999985 46666


No 92 
>KOG4211|consensus
Probab=97.27  E-value=0.0012  Score=39.88  Aligned_cols=49  Identities=27%  Similarity=0.419  Sum_probs=38.5

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeEE-EEEccCCCCCCceeEEEEC
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGD-IYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~-~~~~~~~~t~~~~g~~fv~   61 (61)
                      ..-.|.+++||+.|++++|.++|+..--+.. +.++.++ .++..|-|||+
T Consensus       102 ~d~vVRLRGLPfscte~dI~~FFaGL~Iv~~gi~l~~d~-rgR~tGEAfVq  151 (510)
T KOG4211|consen  102 NDGVVRLRGLPFSCTEEDIVEFFAGLEIVPDGILLPMDQ-RGRPTGEAFVQ  151 (510)
T ss_pred             CCceEEecCCCccCcHHHHHHHhcCCcccccceeeeccC-CCCcccceEEE
Confidence            4568999999999999999999987644444 5567775 46688888884


No 93 
>PF08777 RRM_3:  RNA binding motif;  InterPro: IPR014886 This domain is found in protein La which functions as an RNA chaperone during RNA polymerase III transcription, and can also stimulate translation initiation. It contains a five stranded beta sheet which forms an atypical RNA recognition motif []. ; PDB: 1OWX_A.
Probab=97.22  E-value=0.00049  Score=33.81  Aligned_cols=36  Identities=28%  Similarity=0.412  Sum_probs=22.4

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD   48 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~   48 (61)
                      ++.|++.+++..++.++|++.|+.+|.|..+.+...
T Consensus         1 G~il~~~g~~~~~~re~iK~~f~~~g~V~yVD~~~G   36 (105)
T PF08777_consen    1 GCILKFSGLGEPTSREDIKEAFSQFGEVAYVDFSRG   36 (105)
T ss_dssp             --EEEEEE--SS--HHHHHHHT-SS--EEEEE--TT
T ss_pred             CeEEEEecCCCCcCHHHHHHHHHhcCCcceEEecCC
Confidence            367889999999999999999999999998887653


No 94 
>KOG1995|consensus
Probab=97.21  E-value=0.00076  Score=39.24  Aligned_cols=52  Identities=23%  Similarity=0.210  Sum_probs=43.1

Q ss_pred             CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeE--------EEEEccCCCCCCceeEEEE
Q psy7050           9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVG--------DIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus         9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~--------~~~~~~~~~t~~~~g~~fv   60 (61)
                      ......+|||.+|+..++++++.++|.++|.|.        .+++-+|++|++.+|-|.|
T Consensus        62 ~~s~~~ti~v~g~~d~~~~~~~~~~f~qcg~ikrnK~t~kPki~~y~dkeT~~~KGeatv  121 (351)
T KOG1995|consen   62 DKSDNETIFVWGCPDSVCENDNADFFLQCGVIKRNKRTGKPKIKIYTDKETGAPKGEATV  121 (351)
T ss_pred             cccccccceeeccCccchHHHHHHHHhhcceeccCCCCCCcchhccccccccCcCCceee
Confidence            445567999999999999999999999999763        3567788888988887765


No 95 
>KOG1190|consensus
Probab=97.12  E-value=0.0007  Score=40.30  Aligned_cols=37  Identities=16%  Similarity=0.268  Sum_probs=32.6

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD   48 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~   48 (61)
                      ..+.++++|||++++|.++..++.+||.|..+.+.+.
T Consensus        27 pSkV~HlRnlp~e~tE~elI~Lg~pFG~vtn~~~lkG   63 (492)
T KOG1190|consen   27 PSKVVHLRNLPWEVTEEELISLGLPFGKVTNLLMLKG   63 (492)
T ss_pred             CcceeEeccCCccccHHHHHHhcccccceeeeeeecc
Confidence            4578999999999999999999999999998876543


No 96 
>KOG1457|consensus
Probab=97.11  E-value=0.0026  Score=35.45  Aligned_cols=51  Identities=16%  Similarity=0.205  Sum_probs=34.9

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCC-CceeEEEEC
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTR-ESRGFAFVR   61 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~-~~~g~~fv~   61 (61)
                      ..-+||||.+||.++-..+|..+|..|-.-+.+.+-.....+ -.+-+||++
T Consensus        32 ~~VRTLFVSGLP~DvKpREiynLFR~f~GYEgslLK~Tsk~~~~~~pvaFat   83 (284)
T KOG1457|consen   32 GAVRTLFVSGLPNDVKPREIYNLFRRFHGYEGSLLKYTSKGDQVCKPVAFAT   83 (284)
T ss_pred             cccceeeeccCCcccCHHHHHHHhccCCCccceeeeeccCCCccccceEEEE
Confidence            346899999999999999999999987554555443322222 233566653


No 97 
>KOG3152|consensus
Probab=96.96  E-value=0.002  Score=36.26  Aligned_cols=39  Identities=23%  Similarity=0.406  Sum_probs=33.9

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCC
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRF   50 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~   50 (61)
                      ..-.||++++|+...-..|+++|+.||.|-.+.+.....
T Consensus        73 k~GVvylS~IPp~m~~~rlReil~~yGeVGRvylqpE~~  111 (278)
T KOG3152|consen   73 KTGVVYLSNIPPYMDPVRLREILSQYGEVGRVYLQPEDD  111 (278)
T ss_pred             cceEEEeccCCCccCHHHHHHHHHhccccceEEecchhh
Confidence            446899999999999999999999999999998765443


No 98 
>KOG1457|consensus
Probab=96.88  E-value=0.002  Score=35.83  Aligned_cols=33  Identities=21%  Similarity=0.289  Sum_probs=26.8

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEE
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDI   43 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~   43 (61)
                      ....+|||.||..+++++.|+.+|+.|..-..+
T Consensus       208 ~acstlfianl~~~~~ed~l~~~~~~~~gf~~l  240 (284)
T KOG1457|consen  208 RACSTLFIANLGPNCTEDELKQLLSRYPGFHIL  240 (284)
T ss_pred             hhhhhHhhhccCCCCCHHHHHHHHHhCCCceEE
Confidence            344689999999999999999999998543333


No 99 
>PF08675 RNA_bind:  RNA binding domain;  InterPro: IPR014789 This domain corresponds to the RNA binding domain of Poly(A)-specific ribonuclease (PARN). ; GO: 0003723 RNA binding, 0004535 poly(A)-specific ribonuclease activity, 0046872 metal ion binding, 0006402 mRNA catabolic process, 0005634 nucleus, 0005737 cytoplasm; PDB: 3CTR_A 2ROK_A 3D45_B 1WHV_A.
Probab=96.82  E-value=0.01  Score=28.27  Aligned_cols=33  Identities=12%  Similarity=0.231  Sum_probs=22.1

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD   48 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~   48 (61)
                      ...++. .|..+-..+|.++|++||.|. +.|+.|
T Consensus        10 HVFhlt-FPkeWK~~DI~qlFspfG~I~-VsWi~d   42 (87)
T PF08675_consen   10 HVFHLT-FPKEWKTSDIYQLFSPFGQIY-VSWIND   42 (87)
T ss_dssp             CEEEEE---TT--HHHHHHHCCCCCCEE-EEEECT
T ss_pred             eEEEEe-CchHhhhhhHHHHhccCCcEE-EEEEcC
Confidence            445555 999999999999999999874 555544


No 100
>KOG0105|consensus
Probab=96.77  E-value=0.0061  Score=33.16  Aligned_cols=40  Identities=20%  Similarity=0.196  Sum_probs=34.6

Q ss_pred             CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050           9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD   48 (61)
Q Consensus         9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~   48 (61)
                      .......|.|.+||+..++++|+++....|.++...+.+|
T Consensus       111 srrSe~RVvVsGLp~SgSWQDLKDHmReaGdvCfadv~rD  150 (241)
T KOG0105|consen  111 SRRSEYRVVVSGLPPSGSWQDLKDHMREAGDVCFADVQRD  150 (241)
T ss_pred             ccccceeEEEecCCCCCchHHHHHHHHhhCCeeeeeeecc
Confidence            3444578999999999999999999999999988887766


No 101
>KOG4206|consensus
Probab=96.58  E-value=0.012  Score=32.54  Aligned_cols=46  Identities=24%  Similarity=0.312  Sum_probs=37.5

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .....+++.|+|..++.+.+..+|.+|..-..++++...     .+.|||+
T Consensus       144 ppn~ilf~~niP~es~~e~l~~lf~qf~g~keir~i~~~-----~~iAfve  189 (221)
T KOG4206|consen  144 PPNNILFLTNIPSESESEMLSDLFEQFPGFKEIRLIPPR-----SGIAFVE  189 (221)
T ss_pred             CCceEEEEecCCcchhHHHHHHHHhhCcccceeEeccCC-----CceeEEe
Confidence            445789999999999999999999999888888876542     4577774


No 102
>KOG0115|consensus
Probab=96.30  E-value=0.0048  Score=34.78  Aligned_cols=35  Identities=29%  Similarity=0.324  Sum_probs=30.7

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD   48 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~   48 (61)
                      +.|||.||+..++.+.+...|+.||.+....+..|
T Consensus        32 a~l~V~nl~~~~sndll~~~f~~fg~~e~av~~vD   66 (275)
T KOG0115|consen   32 AELYVVNLMQGASNDLLEQAFRRFGPIERAVAKVD   66 (275)
T ss_pred             ceEEEEecchhhhhHHHHHhhhhcCccchheeeec
Confidence            78999999999999999999999999877655544


No 103
>KOG0128|consensus
Probab=96.29  E-value=0.0032  Score=40.29  Aligned_cols=46  Identities=20%  Similarity=0.388  Sum_probs=41.2

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      ..++|+|.|+..+.+.++.++...|.+.+..++..+ .|+++|.|+|
T Consensus       737 ~~v~i~g~pf~gt~e~~k~l~~~~gn~~~~~~vt~r-~gkpkg~a~v  782 (881)
T KOG0128|consen  737 ISVAISGPPFQGTKEELKSLASKTGNVTSLRLVTVR-AGKPKGKARV  782 (881)
T ss_pred             hhhheeCCCCCCchHHHHhhccccCCccccchhhhh-ccccccceec
Confidence            578999999999999999999999999999887775 5889998886


No 104
>KOG0128|consensus
Probab=95.80  E-value=0.00015  Score=46.00  Aligned_cols=50  Identities=24%  Similarity=0.410  Sum_probs=41.5

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      +.+++|++||+..+.+.+|...|..+|.+..+.+.-....++.+|+||++
T Consensus       666 ~~~~~fvsnl~~~~~~~dl~~~~~~~~~~e~vqi~~h~n~~~~rG~~Y~~  715 (881)
T KOG0128|consen  666 DLIKIFVSNLSPKMSEEDLSERFSPSGTIEVVQIVIHKNEKRFRGKAYVE  715 (881)
T ss_pred             HHHHHHHhhcchhhcCchhhhhcCccchhhhHHHHHHhhccccccceeeE
Confidence            34689999999999999999999999887777665455678899999873


No 105
>KOG1855|consensus
Probab=95.36  E-value=0.038  Score=33.40  Aligned_cols=37  Identities=19%  Similarity=0.371  Sum_probs=33.6

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD   48 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~   48 (61)
                      +.++|.+-|||.+-.-+.|.++|..+|.|..+++...
T Consensus       230 ~srtivaenLP~Dh~~enl~kiFg~~G~IksIRIckP  266 (484)
T KOG1855|consen  230 PSRTIVAENLPLDHSYENLSKIFGTVGSIKSIRICKP  266 (484)
T ss_pred             ccceEEEecCCcchHHHHHHHHhhcccceeeeeecCC
Confidence            5689999999999998999999999999999998755


No 106
>KOG1365|consensus
Probab=95.31  E-value=0.024  Score=33.94  Aligned_cols=48  Identities=15%  Similarity=0.241  Sum_probs=37.9

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCC-eEE--EEEccCCCCCCceeEEEEC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGE-VGD--IYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~-i~~--~~~~~~~~t~~~~g~~fv~   61 (61)
                      ...|..++||+.++-++|..+|..|.. +..  ++++.+ ..|+..|-|||+
T Consensus       280 kdcvRLRGLPy~AtvEdIL~FlgdFa~~i~f~gVHmv~N-~qGrPSGeAFIq  330 (508)
T KOG1365|consen  280 KDCVRLRGLPYEATVEDILDFLGDFATDIRFQGVHMVLN-GQGRPSGEAFIQ  330 (508)
T ss_pred             CCeeEecCCChhhhHHHHHHHHHHHhhhcccceeEEEEc-CCCCcChhhhhh
Confidence            456889999999999999999988753 333  667766 468888888875


No 107
>KOG2314|consensus
Probab=95.14  E-value=0.14  Score=32.17  Aligned_cols=50  Identities=24%  Similarity=0.400  Sum_probs=38.4

Q ss_pred             CCCcEEEEecCCCCCCH------HHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          11 EGMVSLKVDNLTYRTTP------DDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~------~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .-...|.|.|+|-.-..      .-|..+|+++|++....++.+.++| .+||.|++
T Consensus        56 g~D~vVvv~g~PvV~~~rl~klk~vl~kvfsk~gk~vn~~~P~~e~gg-tkG~lf~E  111 (698)
T KOG2314|consen   56 GFDSVVVVDGAPVVGPARLEKLKKVLTKVFSKAGKIVNMYYPIDEEGG-TKGYLFVE  111 (698)
T ss_pred             CcceEEEECCCcccChhHHHHHHHHHHHHHHhhccccceeeccCccCC-eeeEEEEE
Confidence            33467888888865432      2456889999999999999887766 89999975


No 108
>PF14605 Nup35_RRM_2:  Nup53/35/40-type RNA recognition motif
Probab=94.59  E-value=0.1  Score=22.53  Aligned_cols=32  Identities=25%  Similarity=0.506  Sum_probs=24.2

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEc
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIP   46 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~   46 (61)
                      +.|-|.+.+.... +.+..+|..+|+|....+.
T Consensus         2 ~wI~V~Gf~~~~~-~~vl~~F~~fGeI~~~~~~   33 (53)
T PF14605_consen    2 TWISVSGFPPDLA-EEVLEHFASFGEIVDIYVP   33 (53)
T ss_pred             cEEEEEeECchHH-HHHHHHHHhcCCEEEEEcC
Confidence            4577888887655 4556689999999998775


No 109
>KOG1365|consensus
Probab=93.67  E-value=0.32  Score=29.44  Aligned_cols=46  Identities=22%  Similarity=0.319  Sum_probs=33.0

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhC----CCeEEEEEccCCCCCCceeEEEE
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERC----GEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~----G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      -.|.+++||+++++.++.++|.+-    |....+.++..+ .|+..|=|||
T Consensus       162 vivRmRGLPfdat~~dVv~FF~~~cpv~~g~egvLFV~rp-dgrpTGdAFv  211 (508)
T KOG1365|consen  162 VIVRMRGLPFDATALDVVEFFGPPCPVTGGTEGVLFVTRP-DGRPTGDAFV  211 (508)
T ss_pred             eEEEecCCCCCcchHHHHHhcCCCCcccCCccceEEEECC-CCCcccceEE
Confidence            567889999999999999999631    234455555443 4777777776


No 110
>KOG4676|consensus
Probab=93.07  E-value=0.12  Score=31.09  Aligned_cols=35  Identities=17%  Similarity=0.396  Sum_probs=30.9

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEcc
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPR   47 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~   47 (61)
                      ...|.|.||.+.++.++++.+|...|.|..+.|.-
T Consensus         7 ~~vIqvanispsat~dqm~tlFg~lGkI~elrlyp   41 (479)
T KOG4676|consen    7 LGVIQVANISPSATKDQMQTLFGNLGKIPELRLYP   41 (479)
T ss_pred             CceeeecccCchhhHHHHHHHHhhccccccccccC
Confidence            35899999999999999999999999998887643


No 111
>KOG0112|consensus
Probab=92.81  E-value=0.043  Score=35.80  Aligned_cols=37  Identities=24%  Similarity=0.427  Sum_probs=33.0

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD   48 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~   48 (61)
                      ...+||++||+..+++.+++..|..+|.+.++.+-..
T Consensus       371 atrTLf~Gnl~~kl~eseiR~af~e~gkve~VDiKtP  407 (975)
T KOG0112|consen  371 ATRTLFLGNLDSKLTESEIRPAFDESGKVEEVDIKTP  407 (975)
T ss_pred             hhhhhhhcCcccchhhhhhhhhhhhhccccccccccC
Confidence            3579999999999999999999999999999877544


No 112
>KOG0106|consensus
Probab=92.30  E-value=0.13  Score=28.51  Aligned_cols=33  Identities=33%  Similarity=0.437  Sum_probs=28.2

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEE
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDI   43 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~   43 (61)
                      ...+.+.+.+++..+.+.+|.++|.++|.+...
T Consensus        97 ~s~~r~~~~~~~~r~~~qdl~d~~~~~g~~~~~  129 (216)
T KOG0106|consen   97 RTHFRLIVRNLSLRVSWQDLKDHFRPAGEVTYV  129 (216)
T ss_pred             cccceeeeccchhhhhHHHHhhhhcccCCCchh
Confidence            345788999999999999999999999988443


No 113
>KOG1190|consensus
Probab=92.18  E-value=0.47  Score=28.90  Aligned_cols=38  Identities=34%  Similarity=0.386  Sum_probs=32.8

Q ss_pred             CcEEEEecCCCC-CCHHHHHHHhhhCCCeEEEEEccCCC
Q psy7050          13 MVSLKVDNLTYR-TTPDDLRRVFERCGEVGDIYIPRDRF   50 (61)
Q Consensus        13 ~~~l~v~~l~~~-~~~~~l~~~f~~~G~i~~~~~~~~~~   50 (61)
                      ...|.|.||... ++.+.|..+|..||.|.++.+..++.
T Consensus       297 n~vllvsnln~~~VT~d~LftlFgvYGdVqRVkil~nkk  335 (492)
T KOG1190|consen  297 NVVLLVSNLNEEAVTPDVLFTLFGVYGDVQRVKILYNKK  335 (492)
T ss_pred             ceEEEEecCchhccchhHHHHHHhhhcceEEEEeeecCC
Confidence            477889999776 68899999999999999999988753


No 114
>KOG1456|consensus
Probab=90.99  E-value=1.2  Score=27.00  Aligned_cols=37  Identities=14%  Similarity=0.111  Sum_probs=31.7

Q ss_pred             CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEc
Q psy7050          10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIP   46 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~   46 (61)
                      .+..-.+.|++|...+.+.+|-+.++.||.|..+..+
T Consensus        28 ~~~spvvhvr~l~~~v~eadl~eal~~fG~i~yvt~~   64 (494)
T KOG1456|consen   28 PNPSPVVHVRGLHQGVVEADLVEALSNFGPIAYVTCM   64 (494)
T ss_pred             CCCCceEEEeccccccchhHHHHHHhcCCceEEEEec
Confidence            3445689999999999999999999999999887544


No 115
>KOG4307|consensus
Probab=90.88  E-value=0.35  Score=31.34  Aligned_cols=51  Identities=27%  Similarity=0.046  Sum_probs=38.8

Q ss_pred             CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEE-EEEccCCCCCCceeEEEEC
Q psy7050          10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGD-IYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~-~~~~~~~~t~~~~g~~fv~   61 (61)
                      ...+..|||-.||..+++.++-++|...-.|++ +.|..- .+++.++-|||+
T Consensus       431 ~~ag~~lyv~~lP~~t~~~~~v~~f~~~~~Ved~I~lt~~-P~~~~~~~afv~  482 (944)
T KOG4307|consen  431 GGAGGALYVFQLPVMTPIVPPVNKFMGAAAVEDFIELTRL-PTDLLRPAAFVA  482 (944)
T ss_pred             CCccceEEeccCCccccccchhhhhhhhhhhhheeEeccC-Ccccccchhhhe
Confidence            345678999999999999999999988766666 555444 356677777763


No 116
>KOG1456|consensus
Probab=90.77  E-value=1.7  Score=26.44  Aligned_cols=41  Identities=17%  Similarity=0.104  Sum_probs=33.5

Q ss_pred             CCCCCcEEEEecCCCCC-CHHHHHHHhhhCCCeEEEEEccCC
Q psy7050           9 RIEGMVSLKVDNLTYRT-TPDDLRRVFERCGEVGDIYIPRDR   49 (61)
Q Consensus         9 ~~~~~~~l~v~~l~~~~-~~~~l~~~f~~~G~i~~~~~~~~~   49 (61)
                      ....++.+.|-+|.... +-+.|..+|-.||.|..+.+++.+
T Consensus       283 g~~~g~VmMVyGLdh~k~N~drlFNl~ClYGNV~rvkFmkTk  324 (494)
T KOG1456|consen  283 GGAPGCVMMVYGLDHGKMNCDRLFNLFCLYGNVERVKFMKTK  324 (494)
T ss_pred             CCCCCcEEEEEeccccccchhhhhhhhhhcCceeeEEEeecc
Confidence            34556899999998875 557789999999999999998764


No 117
>KOG0112|consensus
Probab=90.55  E-value=0.86  Score=30.24  Aligned_cols=36  Identities=22%  Similarity=0.229  Sum_probs=31.4

Q ss_pred             CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEE
Q psy7050          10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYI   45 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~   45 (61)
                      ....+.+++++|..++.-..|...|..||.+..+.+
T Consensus       452 st~ttr~~sgglg~w~p~~~l~r~fd~fGpir~Idy  487 (975)
T KOG0112|consen  452 STPTTRLQSGGLGPWSPVSRLNREFDRFGPIRIIDY  487 (975)
T ss_pred             cccceeeccCCCCCCChHHHHHHHhhccCcceeeec
Confidence            344578999999999999999999999999988764


No 118
>KOG4307|consensus
Probab=89.95  E-value=1  Score=29.37  Aligned_cols=46  Identities=15%  Similarity=0.127  Sum_probs=31.8

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEE
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFA   58 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~   58 (61)
                      .+.|-+.|+|++++-++|-++|..|-.+......+-.+-|...|-|
T Consensus       867 p~V~~~~n~Pf~v~l~dI~~FF~dY~~~p~sI~~r~nd~G~pTGe~  912 (944)
T KOG4307|consen  867 PRVLSCNNFPFDVTLEDIVEFFNDYEPDPNSIRIRRNDDGVPTGEC  912 (944)
T ss_pred             CeEEEecCCCccccHHHHHHHhcccccCCCceeEeecCCCCcccce
Confidence            3478899999999999999999998654332223223446555544


No 119
>PF03467 Smg4_UPF3:  Smg-4/UPF3 family;  InterPro: IPR005120 Nonsense-mediated mRNA decay (NMD) is a surveillance mechanism by which eukaryotic cells detect and degrade transcripts containing premature termination codons. Three 'up-frameshift' proteins, UPF1, UPF2 and UPF3, are essential for this process in organisms ranging from yeast, human to plants []. Exon junction complexes (EJCs) are deposited ~24 nucleotides upstream of exon-exon junctions after splicing. Translation causes displacement of the EJCs, however, premature translation termination upstream of one or more EJCs triggers the recruitment of UPF1, UPF2 and UPF3 and activates the NMD pathway [, ].  This family contains UPF3. The crystal structure of the complex between human UPF2 and UPF3b, which are, respectively, a MIF4G (middle portion of eIF4G) domain and an RNP domain (ribonucleoprotein-type RNA-binding domain) has been determined to 1.95A. The protein-protein interface is mediated by highly conserved charged residues in UPF2 and UPF3b and involves the beta-sheet surface of the UPF3b ribonucleoprotein (RNP) domain, which is generally used by these domains to bind nucleic acids. In UPF3b the RNP domain does not bind RNA, whereas the UPF2 construct and the complex do. It is clear that some RNP domains have evolved for specific protein-protein interactions rather than as nucleic acid binding modules [].; PDB: 1UW4_A 2L08_A.
Probab=89.58  E-value=0.74  Score=24.64  Aligned_cols=30  Identities=7%  Similarity=0.057  Sum_probs=19.4

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhh-CCCe
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFER-CGEV   40 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~-~G~i   40 (61)
                      ....+|-|++||+..+++.+.+.+.. ++.-
T Consensus         5 ~~~~KvVIR~LPP~LteeeF~~~i~~~l~~~   35 (176)
T PF03467_consen    5 KEGTKVVIRRLPPNLTEEEFWEQISPWLPDE   35 (176)
T ss_dssp             ----EEEEEEE-TTS-HHHHCCCCSS--SSE
T ss_pred             ccCceEEEeCCCCCCCHHHHHHHhhhhcccc
Confidence            34579999999999999999887766 5543


No 120
>KOG2193|consensus
Probab=89.49  E-value=0.33  Score=29.73  Aligned_cols=24  Identities=38%  Similarity=0.556  Sum_probs=21.3

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhC
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERC   37 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~   37 (61)
                      .++|++||.+..+..+++.+|...
T Consensus         2 nklyignL~p~~~psdl~svfg~a   25 (584)
T KOG2193|consen    2 NKLYIGNLSPQVTPSDLESVFGDA   25 (584)
T ss_pred             CcccccccCCCCChHHHHHHhccc
Confidence            478999999999999999998754


No 121
>PF09707 Cas_Cas2CT1978:  CRISPR-associated protein (Cas_Cas2CT1978);  InterPro: IPR010152 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny.  This entry represents a minor branch of the Cas2 family of CRISPR-associated protein which are found in IPR003799 from INTERPRO. Cas2 is one of four protein families (Cas1 to Cas4) that are associated with CRISPR elements and always occur near a repeat cluster, usually in the order cas3-cas4-cas1-cas2. The function of Cas2 (and Cas1) is unknown. Cas3 proteins appear to be helicases while Cas4 proteins resemble RecB-type exonucleases, suggesting that these genes are involved in DNA metabolism or gene expression []. 
Probab=87.18  E-value=1.5  Score=20.94  Aligned_cols=45  Identities=24%  Similarity=0.327  Sum_probs=28.9

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      ..-+||++++..+.+.-...+.+..+....+-+..+.   ...||+|-
T Consensus        25 ~~GVyVg~~s~rVRe~lW~~v~~~~~~G~a~m~~~~~---neqG~~~~   69 (86)
T PF09707_consen   25 RPGVYVGNVSARVRERLWERVTEWIGDGSAVMVWSDN---NEQGFDFR   69 (86)
T ss_pred             CCCcEEcCCCHHHHHHHHHHHHhhCCCccEEEEEccC---CCCCEEEE
Confidence            3569999999988887777776655544444333332   15677763


No 122
>KOG4660|consensus
Probab=85.73  E-value=0.92  Score=28.42  Aligned_cols=47  Identities=26%  Similarity=0.395  Sum_probs=30.0

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhh-CCCeEEEEEccCCCCCCceeEEEE
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFER-CGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~-~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      +++.|.|++...|...|.+.-.+ .|.-..+.++.|-.+..+.|||||
T Consensus       389 tt~~iknipNK~T~~ml~~~d~~~~gtYDFlYLPiDF~nkcNvGYAFI  436 (549)
T KOG4660|consen  389 TTLMIKNIPNKYTSKMLLAADEKNKGTYDFLYLPIDFKNKCNVGYAFI  436 (549)
T ss_pred             hhhHhhccCchhhHHhhhhhhccccCccceEEeccccccccccceeEE
Confidence            44555555555555544444322 455566778888888889999997


No 123
>COG5175 MOT2 Transcriptional repressor [Transcription]
Probab=85.62  E-value=1.1  Score=26.84  Aligned_cols=35  Identities=14%  Similarity=0.297  Sum_probs=26.8

Q ss_pred             CcEEEEecCCCCCCHHH----H--HHHhhhCCCeEEEEEcc
Q psy7050          13 MVSLKVDNLTYRTTPDD----L--RRVFERCGEVGDIYIPR   47 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~----l--~~~f~~~G~i~~~~~~~   47 (61)
                      ..-+||-+|++.+..+.    |  .+.|.+||.|..+.+-+
T Consensus       114 KNLvYVigi~pkva~Ee~~~vLk~~eyFGQyGkI~KIvvNk  154 (480)
T COG5175         114 KNLVYVIGIPPKVADEEVAPVLKRHEYFGQYGKIKKIVVNK  154 (480)
T ss_pred             cceeEEecCCCCCCcccccccccchhhhhhccceeEEEecc
Confidence            35689999999886654    2  47899999999986643


No 124
>KOG2253|consensus
Probab=82.42  E-value=1.2  Score=28.49  Aligned_cols=35  Identities=14%  Similarity=0.226  Sum_probs=29.8

Q ss_pred             CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEE
Q psy7050          10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIY   44 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~   44 (61)
                      .....++||+|+...+....+..++..+|.|.++.
T Consensus        37 ~~~~~~vfv~~~~~~~s~~~~~~il~~~g~v~s~k   71 (668)
T KOG2253|consen   37 LPPRDTVFVGNISYLVSQEFWKSILAKSGFVPSWK   71 (668)
T ss_pred             CCCCceeEecchhhhhhHHHHHHHHhhCCcchhhh
Confidence            34557999999999999999999999999877653


No 125
>KOG2416|consensus
Probab=80.85  E-value=1  Score=28.77  Aligned_cols=37  Identities=32%  Similarity=0.461  Sum_probs=28.7

Q ss_pred             CCCCCCCcEEEEecCCCCCCHHHHHHHhhh-CCCeEEE
Q psy7050           7 PPRIEGMVSLKVDNLTYRTTPDDLRRVFER-CGEVGDI   43 (61)
Q Consensus         7 ~~~~~~~~~l~v~~l~~~~~~~~l~~~f~~-~G~i~~~   43 (61)
                      |+.......|||.||-.-.|.-.|++++.. +|.|...
T Consensus       438 PsR~~~SnvlhI~nLvRPFTlgQLkelL~rtgg~Vee~  475 (718)
T KOG2416|consen  438 PSRKEPSNVLHIDNLVRPFTLGQLKELLGRTGGNVEEF  475 (718)
T ss_pred             CCCCCccceEeeecccccchHHHHHHHHhhccCchHHH
Confidence            344455688999999999999999999985 5555544


No 126
>KOG4008|consensus
Probab=80.61  E-value=3  Score=23.73  Aligned_cols=32  Identities=22%  Similarity=0.275  Sum_probs=27.3

Q ss_pred             CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCe
Q psy7050           9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEV   40 (61)
Q Consensus         9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i   40 (61)
                      ...+...+|+-|+|..++++.|.+..+.+|-+
T Consensus        36 ~~~eKd~lfl~Nvp~~~tee~lkr~vsqlg~v   67 (261)
T KOG4008|consen   36 NSNEKDCLFLVNVPLLSTEEHLKRFVSQLGHV   67 (261)
T ss_pred             ccccccceeeecccccccHHHHHHHHHHhhhh
Confidence            34556889999999999999999999998854


No 127
>PRK11558 putative ssRNA endonuclease; Provisional
Probab=80.30  E-value=4.1  Score=19.97  Aligned_cols=45  Identities=16%  Similarity=0.140  Sum_probs=26.4

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      ..-+|||+++..+.+.-...+-+.++.-..+-+..+.   .-.||+|-
T Consensus        27 ~~GVyVg~~S~rVRd~lW~~v~~~~~~G~avmv~~~~---~eqG~~~~   71 (97)
T PRK11558         27 RAGVYVGDVSRRIREMIWQQVTQLAEEGNVVMAWATN---TESGFEFQ   71 (97)
T ss_pred             CCCcEEcCCCHHHHHHHHHHHHHhCCCCcEEEEEcCC---CCCCcEEE
Confidence            3569999988888776666665555443333222221   23377764


No 128
>KOG4410|consensus
Probab=79.33  E-value=11  Score=22.38  Aligned_cols=26  Identities=19%  Similarity=0.281  Sum_probs=22.4

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCCC
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCGE   39 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G~   39 (61)
                      .-|+++||+.++.-.+|+..+.+.+.
T Consensus       331 ~di~~~nl~rd~rv~dlk~~lr~~~~  356 (396)
T KOG4410|consen  331 TDIKLTNLSRDIRVKDLKSELRKREC  356 (396)
T ss_pred             cceeeccCccccchHHHHHHHHhcCC
Confidence            56999999999999999988877653


No 129
>PF07292 NID:  Nmi/IFP 35 domain (NID);  InterPro: IPR009909 This entry represents a domain of approximately 90 residues that is tandemly repeated within interferon-induced 35 kDa protein (IFP 35) and the homologous N-myc-interactor (Nmi). This domain mediates Nmi-Nmi protein interactions and subcellular localisation [].
Probab=78.37  E-value=6  Score=18.99  Aligned_cols=22  Identities=14%  Similarity=0.191  Sum_probs=19.1

Q ss_pred             CcEEEEecCCCCCCHHHHHHHh
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVF   34 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f   34 (61)
                      .++|.+.|+|....++.|++.+
T Consensus        52 ~rtVlvsgip~~l~ee~l~D~L   73 (88)
T PF07292_consen   52 KRTVLVSGIPDVLDEEELRDKL   73 (88)
T ss_pred             CCEEEEeCCCCCCChhhheeeE
Confidence            3789999999999999998764


No 130
>COG0030 KsgA Dimethyladenosine transferase (rRNA methylation) [Translation, ribosomal structure and biogenesis]
Probab=78.07  E-value=2.6  Score=24.12  Aligned_cols=35  Identities=20%  Similarity=0.159  Sum_probs=25.6

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD   48 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~   48 (61)
                      ....|+|||+.++..-+..++...-.+....++.+
T Consensus        96 ~~~vVaNlPY~Isspii~kll~~~~~~~~~v~M~Q  130 (259)
T COG0030          96 PYKVVANLPYNISSPILFKLLEEKFIIQDMVLMVQ  130 (259)
T ss_pred             CCEEEEcCCCcccHHHHHHHHhccCccceEEEEeH
Confidence            45678999999999999988876544444555443


No 131
>KOG4454|consensus
Probab=76.79  E-value=0.52  Score=26.55  Aligned_cols=49  Identities=35%  Similarity=0.538  Sum_probs=36.4

Q ss_pred             CCCcEEEEec----CCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050          11 EGMVSLKVDN----LTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        11 ~~~~~l~v~~----l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      +...+++.|+    |....+.+.+.+.|+..|.+..+++..+.+ +.++-++|+
T Consensus        78 e~q~~~r~G~shapld~r~~~ei~~~v~s~a~p~~~~R~~~~~d-~rnrn~~~~  130 (267)
T KOG4454|consen   78 EEQRTLRCGNSHAPLDERVTEEILYEVFSQAGPIEGVRIPTDND-GRNRNFGFV  130 (267)
T ss_pred             hhhcccccCCCcchhhhhcchhhheeeecccCCCCCcccccccc-CCccCccch
Confidence            3346777777    777778888888999999988888877755 566666654


No 132
>PF05172 Nup35_RRM:  Nup53/35/40-type RNA recognition motif;  InterPro: IPR007846 The MPPN (Mitotic PhosphoProtein N end) family is uncharacterised however it probably plays a role in the cell cycle because the family includes mitotic phosphoproteins O13026 from SWISSPROT []. This family also includes Q05166 from SWISSPROT a suppressor of thermosensitive mutations in the DNA polymerase delta gene, Pol III []. The conserved central region appears to be distantly related to the RNA-binding region RNP-1 (RNA recognition motif, IPR000504 from INTERPRO), suggesting an RNA binding function for this protein.; PDB: 1WWH_C 3P3D_A.
Probab=74.96  E-value=4.8  Score=19.72  Aligned_cols=30  Identities=13%  Similarity=0.225  Sum_probs=19.8

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCCCeEEEE
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIY   44 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~   44 (61)
                      .-|.|=+.|+. ....+.+.|++||.|.+..
T Consensus         7 ~wVtVFGfp~~-~~~~Vl~~F~~~G~Ile~~   36 (100)
T PF05172_consen    7 TWVTVFGFPPS-ASNQVLRHFSSFGTILEHF   36 (100)
T ss_dssp             CEEEEE---GG-GHHHHHHHHHCCS-EECEE
T ss_pred             eEEEEEccCHH-HHHHHHHHHHhcceEEEee
Confidence            55777788887 5567788999999987764


No 133
>PF11608 Limkain-b1:  Limkain b1;  InterPro: IPR024582 This entry represents a conserved domain found in limkain b1, which is a novel human autoantigen, localised to a subset of ABCD3 and PXF marked peroxisomes. Limkain b1 may be a relatively common target of human autoantibodies reactive to cytoplasmic vesicle-like structures [].; PDB: 2DIU_A.
Probab=74.91  E-value=6.1  Score=19.07  Aligned_cols=30  Identities=33%  Similarity=0.576  Sum_probs=17.3

Q ss_pred             cEEEEecCCCCCCHHH----HHHHhhhCC-CeEEE
Q psy7050          14 VSLKVDNLTYRTTPDD----LRRVFERCG-EVGDI   43 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~----l~~~f~~~G-~i~~~   43 (61)
                      ..|+|.|||.+.....    |+.++.-|| .+..+
T Consensus         3 s~L~V~NLP~~~d~~~I~~RL~qLsdNCGGkVl~v   37 (90)
T PF11608_consen    3 SLLYVSNLPTNKDPSSIKNRLRQLSDNCGGKVLSV   37 (90)
T ss_dssp             EEEEEES--TTS-HHHHHHHHHHHHHTTT--EEE-
T ss_pred             cEEEEecCCCCCCHHHHHHHHHHHhhccCCEEEEE
Confidence            4689999999988655    556666665 45443


No 134
>KOG4210|consensus
Probab=74.51  E-value=1.7  Score=25.05  Aligned_cols=49  Identities=14%  Similarity=-0.010  Sum_probs=35.9

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      ..+++|++++.+.+.+.+...++..+|......+.........+|++++
T Consensus        87 ~~~~~f~g~~s~~~e~~~~~~~~~~~g~~~~~~~S~~~~~~~sk~~~s~  135 (285)
T KOG4210|consen   87 SSSTFFVGELSENIEESEDDNFSSEAGLRVDARSSSLEDSLSSKGGLSV  135 (285)
T ss_pred             ccccccccccccchhhccccccchhhcCcccchhhhhccccccccceee
Confidence            4688999999999988877888888887666655554455566666654


No 135
>PF10567 Nab6_mRNP_bdg:  RNA-recognition motif;  InterPro: IPR018885  This conserved domain is found in fungal proteins and appears to be involved in RNA-processing. It binds to poly-adenylated RNA, interacts genetically with mRNA 3'-end processing factors, co-purifies with the nuclear cap-binding protein Cbp20p, and is found in complexes containing other translation factors, such as EIF4G as in P39935 from SWISSPROT and P39936 from SWISSPROT. 
Probab=73.37  E-value=3.2  Score=24.34  Aligned_cols=37  Identities=19%  Similarity=0.327  Sum_probs=31.5

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDR   49 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~   49 (61)
                      .+.|...|+...++-..+...|.+||+|+++.++.+.
T Consensus        15 TRSLLfeNv~~sidLh~Fl~~fv~~~pIESiYL~~~~   51 (309)
T PF10567_consen   15 TRSLLFENVNNSIDLHSFLTKFVKFGPIESIYLIKSN   51 (309)
T ss_pred             eHHHHHhhccccccHHHHHHHhhccCceeEEEEecCC
Confidence            4568888999888888888889999999999988764


No 136
>COG5584 Predicted small secreted protein [Function unknown]
Probab=69.49  E-value=6  Score=19.47  Aligned_cols=29  Identities=7%  Similarity=0.200  Sum_probs=20.5

Q ss_pred             CCCCCCHHHHHHHhhhCCCeEEEEEccCC
Q psy7050          21 LTYRTTPDDLRRVFERCGEVGDIYIPRDR   49 (61)
Q Consensus        21 l~~~~~~~~l~~~f~~~G~i~~~~~~~~~   49 (61)
                      ++....-+-+++.|+++|.|.-.++...+
T Consensus        30 is~e~alk~vk~afk~~mnI~GSwI~~~p   58 (103)
T COG5584          30 ISRENALKVVKEAFKQFMNIKGSWIVYEP   58 (103)
T ss_pred             cChhHHHHHHHHHhcccCCcceeEEEEec
Confidence            33444445678999999999888776655


No 137
>KOG2891|consensus
Probab=67.57  E-value=9.8  Score=22.47  Aligned_cols=35  Identities=26%  Similarity=0.555  Sum_probs=26.7

Q ss_pred             CCcEEEEecCCCC------------CCHHHHHHHhhhCCCeEEEEEc
Q psy7050          12 GMVSLKVDNLTYR------------TTPDDLRRVFERCGEVGDIYIP   46 (61)
Q Consensus        12 ~~~~l~v~~l~~~------------~~~~~l~~~f~~~G~i~~~~~~   46 (61)
                      .+.+||+.++|-.            .+++.|+..|..||.|..+.++
T Consensus       148 rpdti~la~ip~kwf~lkedg~~dlpse~rlr~a~eafg~ir~vdip  194 (445)
T KOG2891|consen  148 RPDTIHLAGIPCKWFALKEDGSEDLPSEDRLRKAFEAFGEIRNVDIP  194 (445)
T ss_pred             CCCceeecCCcceeeeecccccccCChHHHHHHHHHHhccceecCCc
Confidence            3467888877643            3467799999999999988765


No 138
>PTZ00338 dimethyladenosine transferase-like protein; Provisional
Probab=66.08  E-value=7  Score=22.63  Aligned_cols=34  Identities=9%  Similarity=0.055  Sum_probs=24.2

Q ss_pred             EEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050          15 SLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD   48 (61)
Q Consensus        15 ~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~   48 (61)
                      .+.|+|+|+.++...+..++.....+....++.+
T Consensus       103 d~VvaNlPY~Istpil~~ll~~~~~~~~~vlm~Q  136 (294)
T PTZ00338        103 DVCVANVPYQISSPLVFKLLAHRPLFRCAVLMFQ  136 (294)
T ss_pred             CEEEecCCcccCcHHHHHHHhcCCCCceeeeeeh
Confidence            4678999999999888888865444455555444


No 139
>TIGR01873 cas_CT1978 CRISPR-associated endoribonuclease Cas2, E. coli subfamily. CRISPR is a term for Clustered, Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR-Associated) proteins. This model represents a minor branch of the Cas2 family of CRISPR-associated endonuclease, whereas most Cas2 proteins are modeled instead by TIGR01573. This form of Cas2 is characteristic for the Ecoli subtype of CRISPR/Cas locus.
Probab=65.60  E-value=14  Score=17.78  Aligned_cols=24  Identities=4%  Similarity=-0.032  Sum_probs=16.4

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhh
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFER   36 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~   36 (61)
                      ..-+||++++..+.+.-...+-+.
T Consensus        25 ~~GVyVg~~s~rVRe~lW~~v~~~   48 (87)
T TIGR01873        25 RAGVYVGGVSASVRERIWDYLAQH   48 (87)
T ss_pred             CCCcEEcCCCHHHHHHHHHHHHHh
Confidence            356999998888776555544444


No 140
>PRK15464 cold shock-like protein CspH; Provisional
Probab=64.33  E-value=4  Score=18.65  Aligned_cols=8  Identities=25%  Similarity=0.476  Sum_probs=6.3

Q ss_pred             ceeEEEEC
Q psy7050          54 SRGFAFVR   61 (61)
Q Consensus        54 ~~g~~fv~   61 (61)
                      .+||+||+
T Consensus        15 ~KGfGFI~   22 (70)
T PRK15464         15 KSGKGFII   22 (70)
T ss_pred             CCCeEEEc
Confidence            47899984


No 141
>PF14893 PNMA:  PNMA
Probab=64.09  E-value=8.7  Score=22.84  Aligned_cols=26  Identities=8%  Similarity=0.164  Sum_probs=21.3

Q ss_pred             CCCCcEEEEecCCCCCCHHHHHHHhh
Q psy7050          10 IEGMVSLKVDNLTYRTTPDDLRRVFE   35 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~~~~~l~~~f~   35 (61)
                      .+..+.+.|.++|.+++++++++.+.
T Consensus        15 ~~~~r~lLv~giP~dc~~~ei~e~l~   40 (331)
T PF14893_consen   15 VDPQRALLVLGIPEDCEEAEIEEALQ   40 (331)
T ss_pred             cChhhhheeecCCCCCCHHHHHHHHH
Confidence            44557899999999999998887664


No 142
>PRK00274 ksgA 16S ribosomal RNA methyltransferase KsgA/Dim1 family protein; Reviewed
Probab=63.69  E-value=9.6  Score=21.61  Aligned_cols=34  Identities=24%  Similarity=0.246  Sum_probs=23.1

Q ss_pred             EEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050          15 SLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD   48 (61)
Q Consensus        15 ~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~   48 (61)
                      -+.|+|+|+.++..-|..++.....+....++.+
T Consensus       107 ~~vv~NlPY~iss~ii~~~l~~~~~~~~~~l~~Q  140 (272)
T PRK00274        107 LKVVANLPYNITTPLLFHLLEERDPIRDMVVMVQ  140 (272)
T ss_pred             ceEEEeCCccchHHHHHHHHhcCCCCCeeEEEeH
Confidence            4678999999998888888754222344444443


No 143
>PF10309 DUF2414:  Protein of unknown function (DUF2414);  InterPro: IPR019416  This entry contains proteins that have no known function. 
Probab=63.23  E-value=13  Score=16.64  Aligned_cols=35  Identities=14%  Similarity=0.140  Sum_probs=25.3

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhC---CCeEEEEEccC
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERC---GEVGDIYIPRD   48 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~---G~i~~~~~~~~   48 (61)
                      ..+|+|+++.. .+.++++..|..|   .....+.|+-|
T Consensus         5 peavhirGvd~-lsT~dI~~y~~~y~~~~~~~~IEWIdD   42 (62)
T PF10309_consen    5 PEAVHIRGVDE-LSTDDIKAYFSEYFDEEGPFRIEWIDD   42 (62)
T ss_pred             eceEEEEcCCC-CCHHHHHHHHHHhcccCCCceEEEecC
Confidence            46899999865 5677888888877   13456777665


No 144
>PF15513 DUF4651:  Domain of unknown function (DUF4651)
Probab=62.63  E-value=14  Score=16.65  Aligned_cols=17  Identities=29%  Similarity=0.788  Sum_probs=14.0

Q ss_pred             HHHHHHhhhCCCeEEEE
Q psy7050          28 DDLRRVFERCGEVGDIY   44 (61)
Q Consensus        28 ~~l~~~f~~~G~i~~~~   44 (61)
                      ++|++.|+..|.|.-+.
T Consensus         9 ~~iR~~fs~lG~I~vLY   25 (62)
T PF15513_consen    9 AEIRQFFSQLGEIAVLY   25 (62)
T ss_pred             HHHHHHHHhcCcEEEEE
Confidence            57899999999986554


No 145
>PRK15463 cold shock-like protein CspF; Provisional
Probab=61.65  E-value=4.5  Score=18.40  Aligned_cols=8  Identities=13%  Similarity=0.360  Sum_probs=6.3

Q ss_pred             ceeEEEEC
Q psy7050          54 SRGFAFVR   61 (61)
Q Consensus        54 ~~g~~fv~   61 (61)
                      .+||+||+
T Consensus        15 ~kGfGFI~   22 (70)
T PRK15463         15 KSGKGLIT   22 (70)
T ss_pred             CCceEEEe
Confidence            48899984


No 146
>COG0481 LepA Membrane GTPase LepA [Cell envelope biogenesis, outer membrane]
Probab=59.53  E-value=15  Score=23.52  Aligned_cols=45  Identities=20%  Similarity=0.324  Sum_probs=35.0

Q ss_pred             cEEEEecCCCCCCH-HHHHHHhhhCCCeEEEEEccCCCCCCceeEEE
Q psy7050          14 VSLKVDNLTYRTTP-DDLRRVFERCGEVGDIYIPRDRFTRESRGFAF   59 (61)
Q Consensus        14 ~~l~v~~l~~~~~~-~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~f   59 (61)
                      -.+|.|-.|-+.++ ++|++.+++. .+.+..+...++|...-||||
T Consensus       298 P~Vf~GlyPid~~dye~LrdAleKL-~LNDasl~~E~EtS~ALGfGf  343 (603)
T COG0481         298 PMVFAGLYPVDSDDYEDLRDALEKL-QLNDASLTYEPETSQALGFGF  343 (603)
T ss_pred             ceEEEeecccChhHHHHHHHHHHhc-ccccceeeeccccchhccCce
Confidence            46899988888776 7899998876 456666777778887778877


No 147
>KOG2318|consensus
Probab=59.27  E-value=38  Score=22.09  Aligned_cols=36  Identities=25%  Similarity=0.318  Sum_probs=27.2

Q ss_pred             CCCCcEEEEecCCCCC-CHHHHHHHhhhC----CCeEEEEE
Q psy7050          10 IEGMVSLKVDNLTYRT-TPDDLRRVFERC----GEVGDIYI   45 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~-~~~~l~~~f~~~----G~i~~~~~   45 (61)
                      .+..++|-|-|+.|.. ...+|.-+|..|    |.|.++.|
T Consensus       171 ~~~T~RLAVvNMDWd~v~AkDL~v~~nSFlP~gGsilSV~I  211 (650)
T KOG2318|consen  171 GEETKRLAVVNMDWDRVKAKDLFVLFNSFLPKGGSILSVKI  211 (650)
T ss_pred             ccccceeeEeccccccccHHHHHHHHHhhcCCCCceeEEEe
Confidence            3556889999999984 668888888754    57777765


No 148
>PRK09507 cspE cold shock protein CspE; Reviewed
Probab=59.07  E-value=4.4  Score=18.32  Aligned_cols=8  Identities=50%  Similarity=1.202  Sum_probs=6.2

Q ss_pred             ceeEEEEC
Q psy7050          54 SRGFAFVR   61 (61)
Q Consensus        54 ~~g~~fv~   61 (61)
                      .+||+||+
T Consensus        14 ~kGyGFI~   21 (69)
T PRK09507         14 SKGFGFIT   21 (69)
T ss_pred             CCCcEEEe
Confidence            47899984


No 149
>PRK10943 cold shock-like protein CspC; Provisional
Probab=58.18  E-value=4.6  Score=18.25  Aligned_cols=8  Identities=50%  Similarity=1.202  Sum_probs=6.2

Q ss_pred             ceeEEEEC
Q psy7050          54 SRGFAFVR   61 (61)
Q Consensus        54 ~~g~~fv~   61 (61)
                      .+||+||+
T Consensus        14 ~kGfGFI~   21 (69)
T PRK10943         14 SKGFGFIT   21 (69)
T ss_pred             CCCcEEEe
Confidence            47899984


No 150
>PRK10354 RNA chaperone/anti-terminator; Provisional
Probab=55.41  E-value=5.7  Score=17.97  Aligned_cols=8  Identities=38%  Similarity=1.132  Sum_probs=6.1

Q ss_pred             ceeEEEEC
Q psy7050          54 SRGFAFVR   61 (61)
Q Consensus        54 ~~g~~fv~   61 (61)
                      .+||+||+
T Consensus        15 ~kGfGFI~   22 (70)
T PRK10354         15 DKGFGFIT   22 (70)
T ss_pred             CCCcEEEe
Confidence            47899984


No 151
>PRK09890 cold shock protein CspG; Provisional
Probab=55.26  E-value=5.6  Score=18.04  Aligned_cols=8  Identities=38%  Similarity=1.132  Sum_probs=6.1

Q ss_pred             ceeEEEEC
Q psy7050          54 SRGFAFVR   61 (61)
Q Consensus        54 ~~g~~fv~   61 (61)
                      .+||+||+
T Consensus        15 ~kGfGFI~   22 (70)
T PRK09890         15 DKGFGFIT   22 (70)
T ss_pred             CCCcEEEe
Confidence            47899984


No 152
>cd00027 BRCT Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
Probab=53.32  E-value=18  Score=14.95  Aligned_cols=26  Identities=15%  Similarity=0.326  Sum_probs=20.4

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCCC
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCGE   39 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G~   39 (61)
                      ..+++.+.........|.++...+|.
T Consensus         2 ~~~~i~g~~~~~~~~~l~~~i~~~Gg   27 (72)
T cd00027           2 LTFVITGDLPSEERDELKELIEKLGG   27 (72)
T ss_pred             CEEEEEecCCCcCHHHHHHHHHHcCC
Confidence            46777777667788899999988875


No 153
>PF00398 RrnaAD:  Ribosomal RNA adenine dimethylase;  InterPro: IPR001737 This family of proteins include rRNA adenine dimethylases (e.g. KsgA) and the Erythromycin resistance methylases (Erm).  The bacterial enzyme KsgA catalyses the transfer of a total of four methyl groups from S-adenosyl-l-methionine (S-AdoMet) to two adjacent adenosine bases in 16S rRNA. This enzyme and the resulting modified adenosine bases appear to be conserved in all species of eubacteria, eukaryotes, and archaea, and in eukaryotic organelles. Bacterial resistance to the aminoglycoside antibiotic kasugamycin involves inactivation of KsgA and resulting loss of the dimethylations, with modest consequences to the overall fitness of the organism. In contrast, the yeast ortholog, Dim1, is essential. In Saccharomyces cerevisiae (Baker's yeast), and presumably in other eukaryotes, the enzyme performs a vital role in pre-rRNA processing in addition to its methylating activity. The best conserved region in these enzymes is located in the N-terminal section and corresponds to a region that is probably involved in S-adenosyl methionine (SAM) binding domain. The crystal structure of KsgA from Escherichia coli has been solved to a resolution of 2.1A. It bears a strong similarity to the crystal structure of ErmC' from Bacillus stearothermophilus and a lesser similarity to the yeast mitochondrial transcription factor, sc-mtTFB []. The Erm family of RNA methyltransferases, which methylate a single adenosine base in 23S rRNA confer resistance to the MLS-B group of antibiotics. Despite their sequence similarity, the two enzyme families have strikingly different levels of regulation that remain to be elucidated. Other orthologs, of this family include the yeast and Homo sapiens (Human) mitochondrial transcription factors (MTF1 and h-mtTFB respectively), which are nuclear encoded []. Human-mtTFB is able to stimulate transcription in vitro independently of its S-adenosylmethionine binding and rRNA methyltransferase activity [].; GO: 0000179 rRNA (adenine-N6,N6-)-dimethyltransferase activity, 0008649 rRNA methyltransferase activity, 0000154 rRNA modification; PDB: 3FTF_A 3R9X_B 3FTE_A 3FTC_A 3FTD_A 3GRY_A 3FYC_A 3GRU_A 3FYD_A 3GRV_A ....
Probab=52.94  E-value=25  Score=19.85  Aligned_cols=31  Identities=35%  Similarity=0.494  Sum_probs=23.1

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhh--hCCCeEEE
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFE--RCGEVGDI   43 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~--~~G~i~~~   43 (61)
                      ..-+.|+|||...+..-+..++.  .+|...-+
T Consensus        97 ~~~~vv~NlPy~is~~il~~ll~~~~~g~~~~~  129 (262)
T PF00398_consen   97 QPLLVVGNLPYNISSPILRKLLELYRFGRVRMV  129 (262)
T ss_dssp             SEEEEEEEETGTGHHHHHHHHHHHGGGCEEEEE
T ss_pred             CceEEEEEecccchHHHHHHHhhcccccccceE
Confidence            35688999999999888888876  45544433


No 154
>PF03468 XS:  XS domain;  InterPro: IPR005380 The XS (rice gene X and SGS3) domain is found in a family of plant proteins including gene X Q9SBW2 from SWISSPROT and SGS3 Q9LDX1 from SWISSPROT. SGS3 is thought to be involved in post-transcriptional gene silencing (PTGS). This domain contains a conserved aspartate residue that may be functionally important.  The XS domain containing proteins contain coiled-coils, which suggests that they will oligomerise. Most coiled-coil proteins form either a dimeric or a trimeric structure. It is possible that different members of the XS domain family could oligomerise via their coiled-coils forming a variety of complexes [].; PDB: 4E8U_C.
Probab=50.43  E-value=33  Score=17.21  Aligned_cols=33  Identities=15%  Similarity=0.225  Sum_probs=18.7

Q ss_pred             CHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          26 TPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        26 ~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      +.+.|.+.|..|..++ +..+.+..  -+.|++.|+
T Consensus        30 ~~~~l~~~l~~f~p~k-v~~l~~~~--gh~g~aiv~   62 (116)
T PF03468_consen   30 SNEELLDKLAEFNPLK-VKPLYGKQ--GHTGFAIVE   62 (116)
T ss_dssp             -SHHHHHHHHH---SE-EEEEEETT--EEEEEEEEE
T ss_pred             CHHHHHHHHHhcCCce-eEECcCCC--CCcEEEEEE
Confidence            4578889998898765 43444432  367888763


No 155
>PF12157 DUF3591:  Protein of unknown function (DUF3591);  InterPro: IPR022591  This functionally uncharacterised domain is found centrally in the eukaryotic transcription initiation factor TFIID subunit 1. 
Probab=48.85  E-value=32  Score=21.57  Aligned_cols=40  Identities=20%  Similarity=0.283  Sum_probs=24.6

Q ss_pred             ecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          19 DNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        19 ~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      .-.||.+|...+...=.+  ....+.=.-|+ ||...||.||.
T Consensus       359 ~laPWn~T~nFi~A~qgk--a~L~l~G~gDP-TG~GeGfSfir  398 (457)
T PF12157_consen  359 QLAPWNTTKNFINATQGK--AMLQLHGEGDP-TGRGEGFSFIR  398 (457)
T ss_pred             hCCChHhHHHHHHHhccC--cEEEEecCCCC-CCCCCceEEEE
Confidence            345777787777765221  13344334564 68889999974


No 156
>PF11411 DNA_ligase_IV:  DNA ligase IV;  InterPro: IPR021536  DNA ligase IV along with Xrcc4 functions in DNA non-homologous end joining. This process is required to mend double-strand breaks. Upon ligase binding to an Xrcc4 dimer, the helical tails unwind leading to a flat interaction surface []. ; GO: 0003910 DNA ligase (ATP) activity; PDB: 3II6_Y 2E2W_A 1IK9_C.
Probab=48.65  E-value=19  Score=14.45  Aligned_cols=15  Identities=40%  Similarity=0.627  Sum_probs=9.1

Q ss_pred             CCCHHHHHHHhhhCC
Q psy7050          24 RTTPDDLRRVFERCG   38 (61)
Q Consensus        24 ~~~~~~l~~~f~~~G   38 (61)
                      +++++.|++.|.+..
T Consensus        20 Dtd~~~Lk~vF~~i~   34 (36)
T PF11411_consen   20 DTDEDQLKEVFNRIK   34 (36)
T ss_dssp             ---HHHHHHHHHCS-
T ss_pred             cCCHHHHHHHHHHhc
Confidence            567888999987653


No 157
>TIGR00755 ksgA dimethyladenosine transferase. Alternate name: S-adenosylmethionine--6-N',N'-adenosyl (rRNA) dimethyltransferase
Probab=47.25  E-value=43  Score=18.69  Aligned_cols=25  Identities=20%  Similarity=0.200  Sum_probs=20.1

Q ss_pred             EEEEecCCCCCCHHHHHHHhhhCCC
Q psy7050          15 SLKVDNLTYRTTPDDLRRVFERCGE   39 (61)
Q Consensus        15 ~l~v~~l~~~~~~~~l~~~f~~~G~   39 (61)
                      -+.|+|+|+.++...+..++...|.
T Consensus        96 ~~vvsNlPy~i~~~il~~ll~~~~~  120 (253)
T TIGR00755        96 LKVVSNLPYNISSPLIFKLLEKPKF  120 (253)
T ss_pred             ceEEEcCChhhHHHHHHHHhccCCC
Confidence            3778999999999999988864443


No 158
>PF03672 UPF0154:  Uncharacterised protein family (UPF0154);  InterPro: IPR005359 The proteins in this entry are functionally uncharacterised.
Probab=46.75  E-value=30  Score=15.70  Aligned_cols=18  Identities=11%  Similarity=0.170  Sum_probs=14.8

Q ss_pred             CCCCCHHHHHHHhhhCCC
Q psy7050          22 TYRTTPDDLRRVFERCGE   39 (61)
Q Consensus        22 ~~~~~~~~l~~~f~~~G~   39 (61)
                      ++.++++.++.++.+.|.
T Consensus        29 NPpine~mir~M~~QMG~   46 (64)
T PF03672_consen   29 NPPINEKMIRAMMMQMGR   46 (64)
T ss_pred             CCCCCHHHHHHHHHHhCC
Confidence            456789999999998874


No 159
>PHA01632 hypothetical protein
Probab=46.34  E-value=29  Score=15.37  Aligned_cols=18  Identities=22%  Similarity=0.615  Sum_probs=14.3

Q ss_pred             EecCCCCCCHHHHHHHhh
Q psy7050          18 VDNLTYRTTPDDLRRVFE   35 (61)
Q Consensus        18 v~~l~~~~~~~~l~~~f~   35 (61)
                      |..+|..-++++|+..+.
T Consensus        21 ieqvp~kpteeelrkvlp   38 (64)
T PHA01632         21 IEQVPQKPTEEELRKVLP   38 (64)
T ss_pred             hhhcCCCCCHHHHHHHHH
Confidence            357899999999987664


No 160
>PF07230 Peptidase_S80:  Bacteriophage T4-like capsid assembly protein (Gp20);  InterPro: IPR010823 This family of proteins is essential for capsid assembly in the T4-like bacteriophages []. Gp20 forms a unique 12 subunit portal vertex through which DNA enters during packaging and exits during infection. The Gp20 vertex acts as an initiator for the assembly of the major capsid protein and the scaffolding proteins into a prolate icosahedron of precise dimensions. The regulation of portal protein gene expression is an important regulator of prohead assembly in bacteriophage T4 [].
Probab=46.33  E-value=20  Score=22.68  Aligned_cols=36  Identities=22%  Similarity=0.251  Sum_probs=26.3

Q ss_pred             EEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCc
Q psy7050          15 SLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRES   54 (61)
Q Consensus        15 ~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~   54 (61)
                      .|=|||||..-.++-|+++..+|    .-+++.|..||.-
T Consensus       280 yIDvGnlpk~KAeqyl~~iM~k~----knklvYDa~TGev  315 (501)
T PF07230_consen  280 YIDVGNLPKQKAEQYLRDIMNKY----KNKLVYDASTGEV  315 (501)
T ss_pred             EEECCCCChHhHHHHHHHHHHHh----cceeeecCCCCee
Confidence            44569999999999999998887    2345556666643


No 161
>PF13046 DUF3906:  Protein of unknown function (DUF3906)
Probab=46.16  E-value=31  Score=15.66  Aligned_cols=32  Identities=22%  Similarity=0.328  Sum_probs=20.3

Q ss_pred             HHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050          27 PDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        27 ~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      +..|...|-+--.+.++.+...+.-  .+|-|||
T Consensus        32 e~eler~fl~~P~v~e~~l~EKKri--~~G~gyV   63 (64)
T PF13046_consen   32 EVELERHFLPLPEVKEVALYEKKRI--RKGAGYV   63 (64)
T ss_pred             HHHhhhhccCCCCceEEEEEEEEee--eCCceeE
Confidence            4567777777778888877755433  3445554


No 162
>PHA02531 20 portal vertex protein; Provisional
Probab=45.43  E-value=15  Score=23.23  Aligned_cols=36  Identities=25%  Similarity=0.266  Sum_probs=25.6

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCC
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRE   53 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~   53 (61)
                      .-|=|||||..-.++-|+++..+|-    -+++.|..||.
T Consensus       282 FYiDVGNlPk~KAeqYlr~vm~~yk----NklvYDa~TGe  317 (514)
T PHA02531        282 FYIDVGNLPKRKAEEYLNNVMQRYK----NRVVYDANTGK  317 (514)
T ss_pred             EEEEcCCCChhhHHHHHHHHHHHhh----hhEEEeCCCCe
Confidence            3455699999999999999988772    23444555554


No 163
>PRK00523 hypothetical protein; Provisional
Probab=44.00  E-value=34  Score=15.90  Aligned_cols=18  Identities=17%  Similarity=0.303  Sum_probs=14.8

Q ss_pred             CCCCCHHHHHHHhhhCCC
Q psy7050          22 TYRTTPDDLRRVFERCGE   39 (61)
Q Consensus        22 ~~~~~~~~l~~~f~~~G~   39 (61)
                      ++-++|+.++.++.+.|.
T Consensus        37 NPpine~mir~M~~QMGq   54 (72)
T PRK00523         37 NPPITENMIRAMYMQMGR   54 (72)
T ss_pred             CcCCCHHHHHHHHHHhCC
Confidence            456789999999998885


No 164
>PF15407 Spo7_2_N:  Sporulation protein family 7
Probab=43.94  E-value=7.6  Score=17.67  Aligned_cols=18  Identities=6%  Similarity=-0.191  Sum_probs=13.9

Q ss_pred             CCCcEEEEecCCCCCCHH
Q psy7050          11 EGMVSLKVDNLTYRTTPD   28 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~   28 (61)
                      .-.+.+|+|++|..+-++
T Consensus        25 ~tSr~vflG~IP~~W~~~   42 (67)
T PF15407_consen   25 LTSRRVFLGPIPEIWLQD   42 (67)
T ss_pred             HcCceEEECCCChHHHHc
Confidence            345889999999987554


No 165
>smart00650 rADc Ribosomal RNA adenine dimethylases.
Probab=41.40  E-value=53  Score=17.01  Aligned_cols=23  Identities=26%  Similarity=0.288  Sum_probs=18.5

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhh
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFER   36 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~   36 (61)
                      .-+.++|+|+..+.+.+..++..
T Consensus        78 ~d~vi~n~Py~~~~~~i~~~l~~  100 (169)
T smart00650       78 PYKVVGNLPYNISTPILFKLLEE  100 (169)
T ss_pred             CCEEEECCCcccHHHHHHHHHhc
Confidence            34668999999988888888764


No 166
>PF15063 TC1:  Thyroid cancer protein 1
Probab=41.02  E-value=23  Score=16.68  Aligned_cols=26  Identities=23%  Similarity=0.423  Sum_probs=20.8

Q ss_pred             cEEEEecCCCCCCHHHHHHHhhhCCC
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFERCGE   39 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~~~G~   39 (61)
                      ++--+.|+=.+++...|+.+|...|.
T Consensus        26 RKkasaNIFe~vn~~qlqrLF~~sGD   51 (79)
T PF15063_consen   26 RKKASANIFENVNLDQLQRLFQKSGD   51 (79)
T ss_pred             hhhhhhhhhhccCHHHHHHHHHHccc
Confidence            34456677788899999999998885


No 167
>PF01823 MACPF:  MAC/Perforin domain;  InterPro: IPR020864 The membrane attack complex/perforin (MACPF) domain is conserved in bacteria, fungi, mammals and plants. It was originally identified and named as being common to five complement components (C6, C7, C8-alpha, C8-beta, and C9) and perforin. These molecules perform critical functions in innate and adaptive immunity. The MAC family proteins and perforin are known to participate in lytic pore formation. In response to pathogen infection, a sequential and highly specific interaction between the constituent elements occurs to form transmembrane channels which are known as the membrane-attack complex (MAC).Only a few other MACPF proteins have been characterised and several are thought to form pores for invasion or protection [, , ]. Examples are proteins from malarial parasites [], the cytolytic toxins from sea anemones [], and proteins that provide plant immunity [, ]. Functionally uncharacterised MACPF proteins are also evident in pathogenic bacteria such as Chlamydia spp [] and Photorhabdus luminescens (Xenorhabdus luminescens) []. The MACPF domain is commonly found to be associated with other N- and C-terminal domains, such as TSP1 (see PDOC50092 from PROSITEDOC), LDLRA (see PDOC00929 from PROSITEDOC), EGF-like (see PDOC00021 from PROSITEDOC),Sushi/CCP/SCR (see PDOC50923 from PROSITEDOC), FIMAC or C2 (see PDOC00380 from PROSITEDOC). They probably control or target MACPF function [, ]. The MACPF domain oligomerizes, undergoes conformational change, and is required for lytic activity. The MACPF domain consists of a central kinked four-stranded antiparallel beta sheet surrounded by alpha helices and beta strands, forming two structural segments. Overall, the MACPF domain has a thin L-shaped appearance. MACPF domains exhibit limited sequence similarity but contain a signature [YW]-G-[TS]-H-[FY]-x(6)-G-G motif [, , ]. Some proteins known to contain a MACPF domain are listed below:  Vertebrate complement proteins C6 to C9. Complement factors C6 to C9 assemble to form a scaffold, the membrane attack complex (MAC), that permits C9 polymerisation into pores that lyse Gram-negative pathogens [, ]. Vertebrate perforin. It is delivered by natural killer cells and cytotoxic T lymphocytes and forms oligomeric pores (12 to 18 monomers) in the plasma membrane of either virus-infected or transformed cells.  Arabidopsis thaliana (Mouse-ear cress) constitutively activated cell death 1 (CAD1) protein. It is likely to act as a mediator that recognises plant signals for pathogen infection [].  Arabidopsis thaliana (Mouse-ear cress) necrotic spotted lesions 1 (NSL1) protein []. Venomous sea anemone Phyllodiscus semoni (Night anemone) toxins PsTX-60A and PsTX-60B []. Venomous sea anemone Actineria villosa (Okinawan sea anemone) toxin AvTX-60A []. Plasmodium sporozoite microneme protein essential for cell traversal 2 (SPECT2). It is essential for the membrane-wounding activity of the sporozoite and is involved in its traversal of the sinusoidal cell layer prior to hepatocyte-infection []. P. luminescens Plu-MACPF. Although nonlytic, it was shown to bind to cell membranes []. Chlamydial putative uncharacterised protein CT153 []. ; PDB: 2QP2_A 3OJY_B 3NSJ_A 4E0S_B 3T5O_A 4A5W_B 2QQH_A 2RD7_A.
Probab=40.05  E-value=26  Score=18.60  Aligned_cols=22  Identities=23%  Similarity=0.505  Sum_probs=14.1

Q ss_pred             EecCCCCCCHHH---HHHHhhhCCC
Q psy7050          18 VDNLTYRTTPDD---LRRVFERCGE   39 (61)
Q Consensus        18 v~~l~~~~~~~~---l~~~f~~~G~   39 (61)
                      +.+||...+...   +.++|..||.
T Consensus        53 l~~L~~~~~~~~~~~y~~f~~~yGT   77 (212)
T PF01823_consen   53 LNALPAEYNSDNTDEYYRFFDKYGT   77 (212)
T ss_dssp             HHTSHSS--HHHHHHHHHHHHHH-S
T ss_pred             HHhhCcccCccchHHHHHHHHHhCc
Confidence            345777777666   7789999996


No 168
>COG5638 Uncharacterized conserved protein [Function unknown]
Probab=39.85  E-value=98  Score=19.61  Aligned_cols=37  Identities=22%  Similarity=0.371  Sum_probs=25.9

Q ss_pred             CCCCCcEEEEecCCCC-CCHHHHHHHhhhC----CCeEEEEE
Q psy7050           9 RIEGMVSLKVDNLTYR-TTPDDLRRVFERC----GEVGDIYI   45 (61)
Q Consensus         9 ~~~~~~~l~v~~l~~~-~~~~~l~~~f~~~----G~i~~~~~   45 (61)
                      +.+...+|-|-|+.|+ +...+|..+|+.|    |.+..+.+
T Consensus       142 ~G~~tkrLAvVnmDWd~v~a~DLf~~fsSf~P~ggkl~kV~i  183 (622)
T COG5638         142 EGNPTKRLAVVNMDWDRVDAKDLFKIFSSFLPYGGKLSKVKI  183 (622)
T ss_pred             CCCcccceeEeecccccchHHHHHHHHHhhCCCCCccceeEe
Confidence            3455678999999997 4567888777753    55665554


No 169
>KOG2135|consensus
Probab=39.42  E-value=18  Score=22.88  Aligned_cols=35  Identities=23%  Similarity=0.388  Sum_probs=25.0

Q ss_pred             cEEEEecCCCCC-CHHHHHHHhhhCCCeEEEEEccC
Q psy7050          14 VSLKVDNLTYRT-TPDDLRRVFERCGEVGDIYIPRD   48 (61)
Q Consensus        14 ~~l~v~~l~~~~-~~~~l~~~f~~~G~i~~~~~~~~   48 (61)
                      +.+-+.-.++.. +.++|..+|.+||.|..|.+-..
T Consensus       373 s~l~lek~~~glnt~a~ln~hfA~fG~i~n~qv~~~  408 (526)
T KOG2135|consen  373 SPLALEKSPFGLNTIADLNPHFAQFGEIENIQVDYS  408 (526)
T ss_pred             chhhhhccCCCCchHhhhhhhhhhcCccccccccCc
Confidence            444455556665 46899999999999988865433


No 170
>PRK01844 hypothetical protein; Provisional
Probab=39.17  E-value=45  Score=15.51  Aligned_cols=18  Identities=11%  Similarity=0.167  Sum_probs=14.6

Q ss_pred             CCCCCHHHHHHHhhhCCC
Q psy7050          22 TYRTTPDDLRRVFERCGE   39 (61)
Q Consensus        22 ~~~~~~~~l~~~f~~~G~   39 (61)
                      ++-++|+.|+.++.+-|.
T Consensus        36 NPpine~mir~Mm~QMGq   53 (72)
T PRK01844         36 NPPINEQMLKMMMMQMGQ   53 (72)
T ss_pred             CCCCCHHHHHHHHHHhCC
Confidence            456789999999998874


No 171
>PF11004 Kdo_hydroxy:  3-deoxy-D-manno-oct-2-ulosonic acid (Kdo) hydroxylase;  InterPro: IPR021266  This bacterial family of proteins has no known function. 
Probab=38.00  E-value=55  Score=19.27  Aligned_cols=47  Identities=17%  Similarity=0.158  Sum_probs=28.7

Q ss_pred             CCCCcEEEEecCCCCCCHHHHHHHhhh-CCCeEEEEEccCCCCCCceeE
Q psy7050          10 IEGMVSLKVDNLTYRTTPDDLRRVFER-CGEVGDIYIPRDRFTRESRGF   57 (61)
Q Consensus        10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~-~G~i~~~~~~~~~~t~~~~g~   57 (61)
                      .+.+..||.-+|++.....+. .++.+ .-....-.+..++.++..+|.
T Consensus        18 LE~G~VL~fP~l~F~l~~~E~-~fL~p~~~d~k~KNIs~~p~~~~l~G~   65 (281)
T PF11004_consen   18 LEQGKVLYFPQLRFPLSPEER-AFLDPAIVDPKRKNISYRPNTGSLRGA   65 (281)
T ss_pred             hhCCCEEEeCCCCCCCChhHH-hhcChhhhCCCCCceeecCCCCceecc
Confidence            467899999999998887665 33332 211122234446666666664


No 172
>smart00457 MACPF membrane-attack complex / perforin.
Probab=37.90  E-value=24  Score=18.90  Aligned_cols=21  Identities=14%  Similarity=0.300  Sum_probs=17.5

Q ss_pred             ecCCCCCCHHHHHHHhhhCCC
Q psy7050          19 DNLTYRTTPDDLRRVFERCGE   39 (61)
Q Consensus        19 ~~l~~~~~~~~l~~~f~~~G~   39 (61)
                      .+||..........++..||.
T Consensus        31 ~~Lp~~~~~~~~~~fi~~yGT   51 (194)
T smart00457       31 RDLPDQYNRGAYARFIDKYGT   51 (194)
T ss_pred             HhCccccCHHHHHHHHHHhCC
Confidence            467888888888999999995


No 173
>PF09176 Mpt_N:  Methylene-tetrahydromethanopterin dehydrogenase, N-terminal;  InterPro: IPR015259 Prokaryotic methylene-tetrahydromethanopterin dehydrogenase catalyses the dehydrogenation of methylene-tetrahydromethanopterin during growth on one-carbon compounds such as methanol. It can also catalyse the reversible dehydrogenation of methylene-tetrahydrofolate, though at much lower efficiency []. The pterin domain of this protein is composed of two alpha-beta segments found at the N- and C-terminal ends of the polypeptide respectivly. This entry represents the N-terminal segment of the pterin domain, with a core comprising three alpha/beta/alpha layers in which each sheet contains four strands. ; PDB: 1LUA_A 1LU9_C.
Probab=37.38  E-value=9.2  Score=18.15  Aligned_cols=30  Identities=13%  Similarity=0.123  Sum_probs=15.8

Q ss_pred             CCCCCCCCCCCcEEEEecCCCCCCHHHHHHH
Q psy7050           3 YGRPPPRIEGMVSLKVDNLTYRTTPDDLRRV   33 (61)
Q Consensus         3 ~~~~~~~~~~~~~l~v~~l~~~~~~~~l~~~   33 (61)
                      |+|+|. .-+.+-||||+-+-...++.|+..
T Consensus        35 FsR~P~-~~~~TaiFIGG~d~~~a~dml~~a   64 (81)
T PF09176_consen   35 FSRGPK-DLKRTAIFIGGRDVALAEDMLEAA   64 (81)
T ss_dssp             SSS-GG-GGGGEEEEEE-S-HHHHHHHHHHH
T ss_pred             EccCCc-CCceeEEEECCccHHHHHHHHHHH
Confidence            455553 334467999986655555555443


No 174
>PF00220 Hormone_4:  Neurohypophysial hormones, N-terminal Domain;  InterPro: IPR022423 Oxytocin (or ocytocin) and vasopressin [] are small (nine amino acid residues), structurally and functionally related neurohypophysial peptide hormones. Oxytocin causes contraction of the smooth muscle of the uterus and of the mammary gland while vasopressin has a direct antidiuretic action on the kidney and also causes vasoconstriction of the peripheral vessels. Like the majority of active peptides, both hormones are synthesized as larger protein precursors that are enzymatically converted to their mature forms. Peptides belonging to this family are also found in birds, fish, reptiles and amphibians (mesotocin, isotocin, valitocin, glumitocin, aspargtocin, vasotocin, seritocin, asvatocin, phasvatocin), in worms (annetocin), octopi (cephalotocin), locust (locupressin or neuropeptide F1/F2) and in molluscs (conopressins G and S) [].  The pattern developed to detect this category of peptides spans their entire sequence and includes four invariant amino acid residues.  .; GO: 0005185 neurohypophyseal hormone activity, 0005576 extracellular region
Probab=36.86  E-value=16  Score=10.04  Aligned_cols=6  Identities=17%  Similarity=0.174  Sum_probs=2.8

Q ss_pred             EEecCC
Q psy7050          17 KVDNLT   22 (61)
Q Consensus        17 ~v~~l~   22 (61)
                      ||.|.|
T Consensus         2 ~i~nCP    7 (9)
T PF00220_consen    2 YIRNCP    7 (9)
T ss_pred             ccccCC
Confidence            445544


No 175
>PF14111 DUF4283:  Domain of unknown function (DUF4283)
Probab=36.15  E-value=62  Score=16.26  Aligned_cols=31  Identities=13%  Similarity=0.191  Sum_probs=24.5

Q ss_pred             EEEecCCCCC-CHHHHHHHhhhCCCeEEEEEc
Q psy7050          16 LKVDNLTYRT-TPDDLRRVFERCGEVGDIYIP   46 (61)
Q Consensus        16 l~v~~l~~~~-~~~~l~~~f~~~G~i~~~~~~   46 (61)
                      |.+.|||... +++-++.+-+..|.+..++..
T Consensus       107 Vri~glP~~~~~~~~~~~i~~~iG~~i~vD~~  138 (153)
T PF14111_consen  107 VRIYGLPLHLWSEEILKAIGSKIGEPIEVDEN  138 (153)
T ss_pred             hhhccCCHHHhhhHHHHHHHHhcCCeEEEEcC
Confidence            5667999884 567788888899999888654


No 176
>smart00596 PRE_C2HC PRE_C2HC domain.
Probab=33.74  E-value=56  Score=15.05  Aligned_cols=26  Identities=19%  Similarity=0.134  Sum_probs=15.5

Q ss_pred             HHHHHhhhCC-CeEEEEEccCCCCCCc
Q psy7050          29 DLRRVFERCG-EVGDIYIPRDRFTRES   54 (61)
Q Consensus        29 ~l~~~f~~~G-~i~~~~~~~~~~t~~~   54 (61)
                      .+.+.|...| ++..++-+..+.++.+
T Consensus         3 ~I~~~L~~~G~~v~~i~~m~~~~~r~P   29 (69)
T smart00596        3 QIEEALKDIGFPVLFIHNMLNRDTKNP   29 (69)
T ss_pred             HHHHHHHHcCCceeEEEcccccCCCCc
Confidence            4566666666 4666666666554433


No 177
>KOG4365|consensus
Probab=33.27  E-value=28  Score=22.00  Aligned_cols=45  Identities=7%  Similarity=-0.187  Sum_probs=30.7

Q ss_pred             EEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050          16 LKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        16 l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      .++..++-..++.++.-+|..+|-|..+.+-..-..+...-.+|+
T Consensus         6 ~~l~d~~~~~~~~~~~~~~~d~~~i~~~d~~~~~~~~~~~v~~f~   50 (572)
T KOG4365|consen    6 KSLKDSVASNNKDQNSMKHEDPSIISMEDGSPYVNGSLGEVTPFQ   50 (572)
T ss_pred             hhHhhcccccccchhhhhccCCcceeeccCCccccCCcceeeeee
Confidence            445567777788889889999998887765544344444555554


No 178
>PF10281 Ish1:  Putative stress-responsive nuclear envelope protein;  InterPro: IPR018803  This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues []. 
Probab=32.38  E-value=41  Score=13.11  Aligned_cols=16  Identities=19%  Similarity=0.248  Sum_probs=12.9

Q ss_pred             CCCHHHHHHHhhhCCC
Q psy7050          24 RTTPDDLRRVFERCGE   39 (61)
Q Consensus        24 ~~~~~~l~~~f~~~G~   39 (61)
                      .++.++|++-+..+|-
T Consensus         3 tWs~~~L~~wL~~~gi   18 (38)
T PF10281_consen    3 TWSDSDLKSWLKSHGI   18 (38)
T ss_pred             CCCHHHHHHHHHHcCC
Confidence            4678899999988874


No 179
>PF05189 RTC_insert:  RNA 3'-terminal phosphate cyclase (RTC), insert domain;  InterPro: IPR013796 RNA cyclases are a family of RNA-modifying enzymes that are conserved in eukaryotes, bacteria and archaea. RNA 3'-terminal phosphate cyclase (6.5.1.4 from EC) [, ] catalyses the conversion of 3'-phosphate to a 2',3'-cyclic phosphodiester at the end of RNA.  ATP + RNA 3'-terminal-phosphate = AMP + diphosphate + RNA terminal-2',3'-cyclic-phosphate  These enzymes might be responsible for production of the cyclic phosphate RNA ends that are known to be required by many RNA ligases in both prokaryotes and eukaryotes. RNA cyclase is a protein of from 36 to 42 kDa. The best conserved region is a glycine-rich stretch of residues located in the central part of the sequence and which is reminiscent of various ATP, GTP or AMP glycine-rich loops. The crystal structure of RNA 3'-terminal phosphate cyclase shows that each molecule consists of two domains. The larger domain contains three repeats of a folding unit comprising two parallel alpha helices and a four-stranded beta sheet; this fold was previously identified in translation initiation factor 3 (IF3). The large domain is similar to one of the two domains of 5-enolpyruvylshikimate-3-phosphate synthase and UDP-N-acetylglucosamine enolpyruvyl transferase. The smaller domain uses a similar secondary structure element with different topology, observed in many other proteins such as thioredoxin []. Although the active site of this enzyme could not be unambiguously assigned, it can be mapped to a region surrounding His309, an adenylate acceptor, in which a number of amino acids are highly conserved in the enzyme from different sources [].  This entry contains the insert-domain of approximately 100 amino acids.; PDB: 3TV1_B 3KGD_D 1QMI_D 1QMH_B 3TUX_A 3TUT_A 3TW3_A 3PQV_A.
Probab=32.22  E-value=66  Score=15.41  Aligned_cols=45  Identities=13%  Similarity=0.012  Sum_probs=24.3

Q ss_pred             EEEEecCCCCCCHHHH---HHHhhhCCCeEEEEE--ccCCCCCCceeEEE
Q psy7050          15 SLKVDNLTYRTTPDDL---RRVFERCGEVGDIYI--PRDRFTRESRGFAF   59 (61)
Q Consensus        15 ~l~v~~l~~~~~~~~l---~~~f~~~G~i~~~~~--~~~~~t~~~~g~~f   59 (61)
                      ..|+.+||..+.+..+   +..+..+..-..+..  ......+...|++.
T Consensus        12 ~a~~a~lp~~va~R~~~~a~~~L~~~~~~v~i~~d~~~~~~~~~~~G~gi   61 (103)
T PF05189_consen   12 IAFVAGLPSSVAERMANAARKRLNWYGPDVEIETDYRESDDSAFGPGSGI   61 (103)
T ss_dssp             EEEEESS-CHHHHHHHHHHHHHHCTTCSEEEEEEEEE-CCCCGCSSEEEE
T ss_pred             EEEEcCCCHHHHHHHHHHHHHHhhhhCCCeEEEEecccCccCCCCCceEE
Confidence            4578899998877544   455555554334332  12234455666654


No 180
>KOG2591|consensus
Probab=30.87  E-value=1.5e+02  Score=19.61  Aligned_cols=35  Identities=11%  Similarity=0.175  Sum_probs=27.5

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhh--CCCeEEEEEc
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFER--CGEVGDIYIP   46 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~--~G~i~~~~~~   46 (61)
                      +.|.+.++-||..+..+.++.+|+.  |-.+.+|.+.
T Consensus       174 kRcIvilREIpettp~e~Vk~lf~~encPk~iscefa  210 (684)
T KOG2591|consen  174 KRCIVILREIPETTPIEVVKALFKGENCPKVISCEFA  210 (684)
T ss_pred             ceeEEEEeecCCCChHHHHHHHhccCCCCCceeeeee
Confidence            3477888999999999999999975  5566666543


No 181
>KOG3424|consensus
Probab=30.62  E-value=86  Score=16.19  Aligned_cols=36  Identities=17%  Similarity=0.235  Sum_probs=20.1

Q ss_pred             CCHHHHHHHhhh-CCCeEEE----EEccCCCCCCceeEEEE
Q psy7050          25 TTPDDLRRVFER-CGEVGDI----YIPRDRFTRESRGFAFV   60 (61)
Q Consensus        25 ~~~~~l~~~f~~-~G~i~~~----~~~~~~~t~~~~g~~fv   60 (61)
                      ++.++|++-++. |-.-.+.    .+-..-.+|++.|||.|
T Consensus        35 vsK~EirEKla~mYkt~~d~V~vfgfrt~~GggkstgfalI   75 (132)
T KOG3424|consen   35 VSKTEIREKLAKMYKTTPDAVFVFGFRTHFGGGKSTGFALI   75 (132)
T ss_pred             CCHHHHHHHHHHHhcCCcceEEEEEeeeccCCcccceeeee
Confidence            466777776654 4322222    23333356778888875


No 182
>cd00874 RNA_Cyclase_Class_II RNA 3' phosphate cyclase domain (class II). These proteins function as RNA cyclase to catalyze the ATP-dependent conversion of 3'-phosphate to a 2'.3'-cyclic phosphodiester at the end of RNA molecule. A conserved catalytic histidine residue is found in all members of this subfamily.
Probab=30.23  E-value=88  Score=18.66  Aligned_cols=43  Identities=14%  Similarity=0.121  Sum_probs=24.2

Q ss_pred             EEEEecCCCCCCHHH---HHHHhhh-CCCeEEEEEccCCCCCCceeEEE
Q psy7050          15 SLKVDNLTYRTTPDD---LRRVFER-CGEVGDIYIPRDRFTRESRGFAF   59 (61)
Q Consensus        15 ~l~v~~l~~~~~~~~---l~~~f~~-~G~i~~~~~~~~~~t~~~~g~~f   59 (61)
                      ..++.++|..+.+..   .++.+.+ ++  .++.+..+...+.+.|++.
T Consensus       188 ~~~~~~l~~~va~r~~~~a~~~L~~~~~--~dv~i~~~~~~~~s~G~~i  234 (326)
T cd00874         188 ISHAANLPPHVAERQAEAAAALLRKALG--LQIEIEPEDQSALGPGSGI  234 (326)
T ss_pred             EEEEccCCHHHHHHHHHHHHHHHhhccC--CCeEEEEEecCCCCCCEEE
Confidence            455678888776644   3455555 43  2334443434466777765


No 183
>cd06257 DnaJ DnaJ domain or J-domain.  DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification.
Probab=29.76  E-value=51  Score=13.34  Aligned_cols=18  Identities=0%  Similarity=0.338  Sum_probs=13.5

Q ss_pred             ecCCCCCCHHHHHHHhhh
Q psy7050          19 DNLTYRTTPDDLRRVFER   36 (61)
Q Consensus        19 ~~l~~~~~~~~l~~~f~~   36 (61)
                      -+++..++.++|+..|..
T Consensus         6 Lgl~~~~~~~~ik~~y~~   23 (55)
T cd06257           6 LGVPPDASDEEIKKAYRK   23 (55)
T ss_pred             cCCCCCCCHHHHHHHHHH
Confidence            367788888888877754


No 184
>PF12385 Peptidase_C70:  Papain-like cysteine protease AvrRpt2;  InterPro: IPR022118  This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 []. 
Probab=29.46  E-value=68  Score=17.38  Aligned_cols=20  Identities=15%  Similarity=0.193  Sum_probs=16.5

Q ss_pred             CCCHHHHHHHhhhCCCeEEE
Q psy7050          24 RTTPDDLRRVFERCGEVGDI   43 (61)
Q Consensus        24 ~~~~~~l~~~f~~~G~i~~~   43 (61)
                      ..+.+.+..++..||+|.-.
T Consensus        95 ~~t~e~~~~LL~~yGPLwv~  114 (166)
T PF12385_consen   95 SYTAEGLANLLREYGPLWVA  114 (166)
T ss_pred             ccCHHHHHHHHHHcCCeEEE
Confidence            56789999999999987543


No 185
>PF06014 DUF910:  Bacterial protein of unknown function (DUF910);  InterPro: IPR009256 This family consists of several short bacterial proteins of unknown function.; PDB: 2NN4_A.
Probab=28.60  E-value=65  Score=14.51  Aligned_cols=16  Identities=13%  Similarity=0.231  Sum_probs=10.1

Q ss_pred             HHHHHHHhhhCCCeEE
Q psy7050          27 PDDLRRVFERCGEVGD   42 (61)
Q Consensus        27 ~~~l~~~f~~~G~i~~   42 (61)
                      -.|+++++.+||.+..
T Consensus         4 lyDVqQLLK~fG~~IY   19 (62)
T PF06014_consen    4 LYDVQQLLKKFGIIIY   19 (62)
T ss_dssp             HHHHHHHHHTTS----
T ss_pred             HHHHHHHHHHCCEEEE
Confidence            4678899999997644


No 186
>COG3763 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=28.39  E-value=74  Score=14.75  Aligned_cols=18  Identities=11%  Similarity=0.252  Sum_probs=14.5

Q ss_pred             CCCCCHHHHHHHhhhCCC
Q psy7050          22 TYRTTPDDLRRVFERCGE   39 (61)
Q Consensus        22 ~~~~~~~~l~~~f~~~G~   39 (61)
                      ++.++++.++.++.+-|.
T Consensus        36 NPpine~~iR~M~~qmGq   53 (71)
T COG3763          36 NPPINEEMIRMMMAQMGQ   53 (71)
T ss_pred             CCCCCHHHHHHHHHHhCC
Confidence            456789999999988874


No 187
>COG4647 AcxC Acetone carboxylase, gamma subunit [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=28.33  E-value=69  Score=16.78  Aligned_cols=31  Identities=13%  Similarity=0.208  Sum_probs=17.3

Q ss_pred             CcEEEEec------------CCCCCCHHHHHHHhhh-CCCeEEE
Q psy7050          13 MVSLKVDN------------LTYRTTPDDLRRVFER-CGEVGDI   43 (61)
Q Consensus        13 ~~~l~v~~------------l~~~~~~~~l~~~f~~-~G~i~~~   43 (61)
                      ..+|+|++            .-++..+..+++++-+ +|.+..+
T Consensus        89 ~a~i~vrdtee~lreiyp~s~ipdp~wme~reficpecg~l~ev  132 (165)
T COG4647          89 HANIYVRDTEEKLREIYPKSDIPDPQWMEIREFICPECGILHEV  132 (165)
T ss_pred             eeEEEEcchHHHHHHhCcccCCCCchHHHHHHhhCccccceeee
Confidence            45777775            2333445566666654 6655444


No 188
>PF00226 DnaJ:  DnaJ domain;  InterPro: IPR001623 The prokaryotic heat shock protein DnaJ interacts with the chaperone hsp70-like DnaK protein []. Structurally, the DnaJ protein consists of an N-terminal conserved domain (called 'J' domain) of about 70 amino acids, a glycine-rich region ('G' domain') of about 30 residues, a central domain containing four repeats of a CXXCXGXG motif ('CRR' domain) and a C-terminal region of 120 to 170 residues. Such a structure is shown in the following schematic representation:  +------------+-+-------+-----+-----------+--------------------------------+ | N-terminal | | Gly-R | | CXXCXGXG | C-terminal | +------------+-+-------+-----+-----------+--------------------------------+   It is thought that the 'J' domain of DnaJ mediates the interaction with the dnaK protein and consists of four helices, the second of which has a charged surface that includes at least one pair of basic residues that are essential for interaction with the ATPase domain of Hsp70. The J- and CRR-domains are found in many prokaryotic and eukaryotic proteins [], either together or separately. In yeast, J-domains have been classified into 3 groups; the class III proteins are functionally distinct and do not appear to act as molecular chaperones []. ; GO: 0031072 heat shock protein binding; PDB: 2GUZ_C 2L6L_A 1HDJ_A 2EJ7_A 1FPO_C 2CUG_A 2QSA_A 2OCH_A 3BVO_B 3APQ_A ....
Probab=28.26  E-value=61  Score=13.73  Aligned_cols=18  Identities=11%  Similarity=0.416  Sum_probs=13.7

Q ss_pred             ecCCCCCCHHHHHHHhhh
Q psy7050          19 DNLTYRTTPDDLRRVFER   36 (61)
Q Consensus        19 ~~l~~~~~~~~l~~~f~~   36 (61)
                      -+|+.+++.++|+..|..
T Consensus         6 Lgl~~~~~~~eik~~y~~   23 (64)
T PF00226_consen    6 LGLPPDASDEEIKKAYRR   23 (64)
T ss_dssp             CTSTTTSSHHHHHHHHHH
T ss_pred             CCCCCCCCHHHHHHHHHh
Confidence            367888888888877754


No 189
>smart00703 NRF N-terminal domain in C. elegans NRF-6 (Nose Resistant to Fluoxetine-4) and NDG-4 (resistant to nordihydroguaiaretic acid-4). Also present in several other worm and fly proteins.
Probab=28.23  E-value=56  Score=15.92  Aligned_cols=23  Identities=17%  Similarity=0.333  Sum_probs=17.7

Q ss_pred             EEEEe-cCCCCCCHHHHHHHhhhC
Q psy7050          15 SLKVD-NLTYRTTPDDLRRVFERC   37 (61)
Q Consensus        15 ~l~v~-~l~~~~~~~~l~~~f~~~   37 (61)
                      .+.+| .+|..++.+++..++..+
T Consensus        71 ~~~~gvC~P~sCs~~dv~~l~~~~   94 (110)
T smart00703       71 VLRTGVCLPSSCSAADLKTLLNQL   94 (110)
T ss_pred             ceEEEEEeCCCCCHHHHHHHHHHH
Confidence            34444 579999999999988765


No 190
>KOG0008|consensus
Probab=27.27  E-value=1e+02  Score=22.56  Aligned_cols=39  Identities=21%  Similarity=0.221  Sum_probs=23.2

Q ss_pred             cCCCCCCHHHHH-HHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050          20 NLTYRTTPDDLR-RVFERCGEVGDIYIPRDRFTRESRGFAFVR   61 (61)
Q Consensus        20 ~l~~~~~~~~l~-~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~   61 (61)
                      -.||.++...|. ..=..  ....+.=+-|+ ||...||.||.
T Consensus       820 ~aPWNtTrnFInqa~~gk--~lLeltGvgDP-TGcGeGFSyvR  859 (1563)
T KOG0008|consen  820 LAPWNTTRNFINQATQGK--CLLELTGVGDP-TGCGEGFSYVR  859 (1563)
T ss_pred             cCcchhhHHHHHHHhccc--eeEEeecCCCC-CcccCcceeEe
Confidence            357777877774 22111  13333335564 78899999984


No 191
>KOG0820|consensus
Probab=26.98  E-value=78  Score=18.91  Aligned_cols=22  Identities=18%  Similarity=0.205  Sum_probs=17.9

Q ss_pred             cEEEEecCCCCCCHHHHHHHhh
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFE   35 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~   35 (61)
                      .-+.|.|+|+.++..-+..++.
T Consensus       124 fd~cVsNlPyqISSp~vfKLL~  145 (315)
T KOG0820|consen  124 FDGCVSNLPYQISSPLVFKLLL  145 (315)
T ss_pred             cceeeccCCccccCHHHHHhcC
Confidence            4578999999999988877653


No 192
>PF04008 Adenosine_kin:  Adenosine specific kinase;  InterPro: IPR007153 The structure of a member of this family from the hyperthermophilic archaeon Pyrobaculum aerophilum contains a modified histidine residue which is interpreted as stable phosphorylation. In vitro binding studies confirmed that adenosine and AMP but not ADP or ATP bind to the protein [].; PDB: 1VGG_A 1RLH_A 1WVQ_A 2GL0_F 2JB7_B 2EKM_C 2D16_D.
Probab=26.63  E-value=1e+02  Score=16.47  Aligned_cols=27  Identities=11%  Similarity=0.121  Sum_probs=21.4

Q ss_pred             CCCcEEEEecCCCCCCHHHHHHHhhhC
Q psy7050          11 EGMVSLKVDNLTYRTTPDDLRRVFERC   37 (61)
Q Consensus        11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~   37 (61)
                      +++.++.+|.-++--|-+||.+.+...
T Consensus         6 Pe~~N~IlGqsHFIKTvEDl~Ealv~s   32 (155)
T PF04008_consen    6 PEGCNVILGQSHFIKTVEDLYEALVTS   32 (155)
T ss_dssp             -TTEEEEEEEESSTTHHHHHHHHHHCC
T ss_pred             CCCceEEEeccchhhhHHHHHHHHhhc
Confidence            456899999999977889998887654


No 193
>PF13600 DUF4140:  N-terminal domain of unknown function (DUF4140)
Probab=25.68  E-value=26  Score=16.76  Aligned_cols=35  Identities=20%  Similarity=0.147  Sum_probs=22.0

Q ss_pred             CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEcc
Q psy7050          13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPR   47 (61)
Q Consensus        13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~   47 (61)
                      ...|.+.+||..+..+.|+--...-..|..+.+..
T Consensus        23 ~~~i~~~~Lp~~~d~~Sl~V~~~g~~~i~~v~~~~   57 (104)
T PF13600_consen   23 ENEIIFEGLPPSLDPDSLRVSGEGGVTILSVRFRR   57 (104)
T ss_pred             ceEEEEeCCCcccCCCcEEEEecCCEEEEEEEEEE
Confidence            36899999999998887753211112455555443


No 194
>smart00540 LEM in nuclear membrane-associated proteins. LEM, domain in nuclear membrane-associated proteins, including lamino-associated polypeptide 2 and emerin.
Probab=24.83  E-value=70  Score=13.28  Aligned_cols=15  Identities=20%  Similarity=0.359  Sum_probs=12.0

Q ss_pred             CCHHHHHHHhhhCCC
Q psy7050          25 TTPDDLRRVFERCGE   39 (61)
Q Consensus        25 ~~~~~l~~~f~~~G~   39 (61)
                      .++++|++.+..+|-
T Consensus         6 LSd~eL~~~L~~~G~   20 (44)
T smart00540        6 LSDAELRAELKQYGL   20 (44)
T ss_pred             cCHHHHHHHHHHcCC
Confidence            577889999888873


No 195
>KOG2068|consensus
Probab=24.75  E-value=48  Score=19.94  Aligned_cols=36  Identities=17%  Similarity=0.244  Sum_probs=25.6

Q ss_pred             cEEEEecCCCCCCHHHHH---HHhhhCCCeEEEEEccCC
Q psy7050          14 VSLKVDNLTYRTTPDDLR---RVFERCGEVGDIYIPRDR   49 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~---~~f~~~G~i~~~~~~~~~   49 (61)
                      .-+|+-+|+.....+.+.   +.|.+||.+.++..-.++
T Consensus        78 nlvyvvgl~~~~ade~~l~~~eyfgqygki~ki~~~~~~  116 (327)
T KOG2068|consen   78 NLVYVVGLPLDLADESVLERTEYFGQYGKINKIVKNKDP  116 (327)
T ss_pred             hhhhhhCCCccccchhhhhCcccccccccceEEeecCCc
Confidence            457888888887655443   567788899888776654


No 196
>smart00195 DSPc Dual specificity phosphatase, catalytic domain.
Probab=24.49  E-value=61  Score=15.99  Aligned_cols=18  Identities=11%  Similarity=0.104  Sum_probs=12.7

Q ss_pred             EEEEecCCCCCCHHHHHH
Q psy7050          15 SLKVDNLTYRTTPDDLRR   32 (61)
Q Consensus        15 ~l~v~~l~~~~~~~~l~~   32 (61)
                      .||+|++|.....+.|++
T Consensus         7 ~l~~G~~~~~~~~~~l~~   24 (138)
T smart00195        7 HLYLGSYSSALNLALLKK   24 (138)
T ss_pred             CeEECChhHcCCHHHHHH
Confidence            599999987765554443


No 197
>PF11249 DUF3047:  Protein of unknown function (DUF3047);  InterPro: IPR021409  This bacterial family of proteins has no known function. 
Probab=23.59  E-value=1.4e+02  Score=16.25  Aligned_cols=33  Identities=21%  Similarity=0.222  Sum_probs=24.4

Q ss_pred             HHHHHHhhh-CCCeEEEEEccCCCCCCceeEEEE
Q psy7050          28 DDLRRVFER-CGEVGDIYIPRDRFTRESRGFAFV   60 (61)
Q Consensus        28 ~~l~~~f~~-~G~i~~~~~~~~~~t~~~~g~~fv   60 (61)
                      +|.+++|.. .+.|..+-++.|..+...++-|++
T Consensus       145 ~Dy~~~FG~~p~~i~~vai~tDsDnT~~~a~A~y  178 (183)
T PF11249_consen  145 ADYRRAFGEEPPRIVGVAIMTDSDNTGGSARAYY  178 (183)
T ss_pred             HHHHHHhCCCCCceeEEEEEEEcCCCCCEEEEEE
Confidence            677777754 678888999988777666666653


No 198
>PRK04239 hypothetical protein; Provisional
Probab=23.31  E-value=1.1e+02  Score=15.41  Aligned_cols=19  Identities=11%  Similarity=0.289  Sum_probs=16.1

Q ss_pred             ecCCCCCCHHHHHHHhhhC
Q psy7050          19 DNLTYRTTPDDLRRVFERC   37 (61)
Q Consensus        19 ~~l~~~~~~~~l~~~f~~~   37 (61)
                      |.|...++++.|..++...
T Consensus        80 G~i~~ki~e~~L~~lL~~v   98 (110)
T PRK04239         80 GRIQGPIDDEQLKEILEQL   98 (110)
T ss_pred             CCCCCCcCHHHHHHHHHHH
Confidence            5678889999999999874


No 199
>PF15051 FAM198:  FAM198 protein
Probab=23.29  E-value=88  Score=18.83  Aligned_cols=30  Identities=20%  Similarity=0.216  Sum_probs=21.0

Q ss_pred             cEEEEecCCCCCCHHHHHHHhh-hCCCeEEE
Q psy7050          14 VSLKVDNLTYRTTPDDLRRVFE-RCGEVGDI   43 (61)
Q Consensus        14 ~~l~v~~l~~~~~~~~l~~~f~-~~G~i~~~   43 (61)
                      -.||-...|++.+++|++.+-- .-|+|..+
T Consensus         3 iRiyse~aPpWfS~~Di~~MrlLAd~~V~~~   33 (326)
T PF15051_consen    3 IRIYSESAPPWFSEDDIRKMRLLADGEVVSK   33 (326)
T ss_pred             ccccCCCCCCcCCHHHHHHHHHhcCCceeee
Confidence            4577788999999999987632 34555443


No 200
>PF01984 dsDNA_bind:  Double-stranded DNA-binding domain;  InterPro: IPR002836 This protein family is found in archaea and eukaryota. The human TFAR19 (TF-1 cell apoptosis-related protein 19) encodes a protein which shares significant homology to the corresponding proteins of species ranging from yeast to mice. TFAR19 exhibits a ubiquitous expression pattern and its expression is up-regulated in the tumour cells undergoing apoptosis. TFAR19 may play a general role in the apoptotic process []. Also included in this family is a DNA-binding protein from the archaea, Methanobacterium thermoautotrophicum.; GO: 0003677 DNA binding; PDB: 1EIJ_A 2K6B_A 2CRU_A 1YYB_A 2JXN_A 2FH0_A.
Probab=23.22  E-value=76  Score=15.75  Aligned_cols=20  Identities=15%  Similarity=0.371  Sum_probs=14.8

Q ss_pred             ecCCCCCCHHHHHHHhhhCC
Q psy7050          19 DNLTYRTTPDDLRRVFERCG   38 (61)
Q Consensus        19 ~~l~~~~~~~~l~~~f~~~G   38 (61)
                      |.|...++++.|.+++....
T Consensus        75 G~l~~kI~d~~L~~iL~~i~   94 (107)
T PF01984_consen   75 GQLRGKIDDEQLKEILEQIS   94 (107)
T ss_dssp             TSSSS-B-HHHHHHHHHHHC
T ss_pred             CCCCCCcCHHHHHHHHHHHh
Confidence            56788899999999998754


No 201
>PF05023 Phytochelatin:  Phytochelatin synthase;  InterPro: IPR007719 This entry represents plant phytochelatin synthases (also known as glutathione gamma-glutamylcysteinyltransferase; 2.3.2.15 from EC), which is involved in the synthesis of phytochelatins (PC) and homophytochelatins (hPC), the heavy-metal-binding peptides of plants. This enzyme is required for detoxification of heavy metals such as cadmium and arsenate. The N-terminal region of phytochelatin synthase contains the active site, as well as four highly conserved cysteine residues that appear to play an important role in heavy-metal-induced phytochelatin catalysis. The C-terminal region is rich in cysteines, and may act as a metal sensor, whereby the Cys residues bind cadmium ions to bring them into closer proximity and transferring them to the activation site in the N-terminal catalytic domain []. The C-terminal region displays homology to the functional domains of metallothionein and metallochaperone.; GO: 0016756 glutathione gamma-glutamylcysteinyltransferase activity, 0046872 metal ion binding, 0010038 response to metal ion, 0046938 phytochelatin biosynthetic process; PDB: 2BTW_A 2BU3_B.
Probab=23.15  E-value=43  Score=18.72  Aligned_cols=14  Identities=29%  Similarity=0.555  Sum_probs=9.3

Q ss_pred             CCCCCCceeEEEEC
Q psy7050          48 DRFTRESRGFAFVR   61 (61)
Q Consensus        48 ~~~t~~~~g~~fv~   61 (61)
                      |+.++++|||-.|+
T Consensus       199 D~~s~~~RG~~~is  212 (212)
T PF05023_consen  199 DPDSGKSRGYLLIS  212 (212)
T ss_dssp             ETTTTEE-EEEEEE
T ss_pred             CCCCCCcceEEEeC
Confidence            56778888887663


No 202
>TIGR00006 S-adenosyl-methyltransferase MraW. Genetics paper in 1972 links mra cluster to peptidoglycan biosynthesis in E. coli. Seems to be common in proteobacteria.wn.
Probab=22.28  E-value=76  Score=18.81  Aligned_cols=19  Identities=26%  Similarity=0.468  Sum_probs=15.7

Q ss_pred             CCCHHHHHHHhhhCCCeEE
Q psy7050          24 RTTPDDLRRVFERCGEVGD   42 (61)
Q Consensus        24 ~~~~~~l~~~f~~~G~i~~   42 (61)
                      .+++++|.++|..||+...
T Consensus       140 ~~~e~~L~~i~~~yGEe~~  158 (305)
T TIGR00006       140 TYSEEDLERILKKYGEEKF  158 (305)
T ss_pred             hCCHHHHHHHHHHhcCcch
Confidence            4678999999999997654


No 203
>PF00054 Laminin_G_1:  Laminin G domain;  InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin [].  Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=22.19  E-value=72  Score=15.82  Aligned_cols=11  Identities=27%  Similarity=0.024  Sum_probs=8.9

Q ss_pred             CCcEEEEecCC
Q psy7050          12 GMVSLKVDNLT   22 (61)
Q Consensus        12 ~~~~l~v~~l~   22 (61)
                      ....||||++|
T Consensus        91 ~~~~lyvGG~p  101 (131)
T PF00054_consen   91 VDGPLYVGGLP  101 (131)
T ss_dssp             ECSEEEESSSS
T ss_pred             cccCEEEccCC
Confidence            34569999999


No 204
>smart00271 DnaJ DnaJ molecular chaperone homology domain.
Probab=22.18  E-value=80  Score=13.02  Aligned_cols=18  Identities=6%  Similarity=0.370  Sum_probs=13.3

Q ss_pred             ecCCCCCCHHHHHHHhhh
Q psy7050          19 DNLTYRTTPDDLRRVFER   36 (61)
Q Consensus        19 ~~l~~~~~~~~l~~~f~~   36 (61)
                      -+++..++.++|+..|..
T Consensus         7 Lgl~~~~~~~~ik~ay~~   24 (60)
T smart00271        7 LGVPRDASLDEIKKAYRK   24 (60)
T ss_pred             cCCCCCCCHHHHHHHHHH
Confidence            367777888888877754


No 205
>PF15023 DUF4523:  Protein of unknown function (DUF4523)
Probab=22.06  E-value=1.5e+02  Score=16.00  Aligned_cols=33  Identities=12%  Similarity=0.172  Sum_probs=21.6

Q ss_pred             CcEEEEecCCCCCC----HHHHHHHhhhCCCeEEEEE
Q psy7050          13 MVSLKVDNLTYRTT----PDDLRRVFERCGEVGDIYI   45 (61)
Q Consensus        13 ~~~l~v~~l~~~~~----~~~l~~~f~~~G~i~~~~~   45 (61)
                      -.+|-|+=|...+.    -+.+...++.||+|.++.+
T Consensus        86 MsTIVVRWlkknm~~~edl~sV~~~Ls~fGpI~SVT~  122 (166)
T PF15023_consen   86 MSTIVVRWLKKNMQPTEDLKSVIQRLSVFGPIQSVTL  122 (166)
T ss_pred             ceeEEeehhhhcCChHHHHHHHHHHHHhcCCcceeee
Confidence            36788876655553    2345556678999988753


No 206
>PRK00050 16S rRNA m(4)C1402 methyltranserfase; Provisional
Probab=22.05  E-value=78  Score=18.62  Aligned_cols=19  Identities=26%  Similarity=0.464  Sum_probs=15.5

Q ss_pred             CCCHHHHHHHhhhCCCeEE
Q psy7050          24 RTTPDDLRRVFERCGEVGD   42 (61)
Q Consensus        24 ~~~~~~l~~~f~~~G~i~~   42 (61)
                      .+++++|..+|..||....
T Consensus       138 ~~~~~~L~~i~~~yGee~~  156 (296)
T PRK00050        138 TYSEEELARIFKEYGEERF  156 (296)
T ss_pred             hCCHHHHHHHHHHhcCcch
Confidence            4578899999999998654


No 207
>PF00533 BRCT:  BRCA1 C Terminus (BRCT) domain;  InterPro: IPR001357 The BRCT domain (after the C_terminal domain of a breast cancer susceptibility protein) is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage [], for example as found in the breast cancer DNA-repair protein BRCA1. The domain is an approximately 100 amino acid tandem repeat, which appears to act as a phospho-protein binding domain []. A chitin biosynthesis protein from yeast also seems to belong to this group. ; GO: 0005622 intracellular; PDB: 3L46_A 3AL3_A 3AL2_A 1WF6_A 3II6_X 2NTE_B 3FA2_A 2R1Z_A 2COK_A 2K7F_A ....
Probab=21.84  E-value=89  Score=13.38  Aligned_cols=29  Identities=7%  Similarity=0.156  Sum_probs=19.6

Q ss_pred             CCcEEEEecCCCCCCHHHHHHHhhhCCCeE
Q psy7050          12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVG   41 (61)
Q Consensus        12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~   41 (61)
                      +++++++.+ -.....+.|..+...+|...
T Consensus         7 ~g~~f~i~~-~~~~~~~~l~~~i~~~GG~v   35 (78)
T PF00533_consen    7 EGCTFCISG-FDSDEREELEQLIKKHGGTV   35 (78)
T ss_dssp             TTEEEEESS-TSSSHHHHHHHHHHHTTEEE
T ss_pred             CCEEEEEcc-CCCCCHHHHHHHHHHcCCEE
Confidence            467888833 33445677888888888543


No 208
>KOG0984|consensus
Probab=21.42  E-value=1.9e+02  Score=16.92  Aligned_cols=30  Identities=20%  Similarity=0.449  Sum_probs=22.8

Q ss_pred             CCCCCCCCcEEEEecCCCCCCHHHHHHHhh
Q psy7050           6 PPPRIEGMVSLKVDNLTYRTTPDDLRRVFE   35 (61)
Q Consensus         6 ~~~~~~~~~~l~v~~l~~~~~~~~l~~~f~   35 (61)
                      +|...+..+.++|++-.+++.+++|...-+
T Consensus        24 pp~~Ld~~a~i~I~~~~~eV~ad~L~~i~e   53 (282)
T KOG0984|consen   24 PPRNLDSRALIYIGDRNFEVPADDLVGIEE   53 (282)
T ss_pred             CCcccCCCceEEEecCccccchhhhhhhhh
Confidence            334456678899999889999999876653


No 209
>KOG0718|consensus
Probab=20.72  E-value=1e+02  Score=19.80  Aligned_cols=19  Identities=21%  Similarity=0.462  Sum_probs=16.0

Q ss_pred             ecCCCCCCHHHHHHHhhhC
Q psy7050          19 DNLTYRTTPDDLRRVFERC   37 (61)
Q Consensus        19 ~~l~~~~~~~~l~~~f~~~   37 (61)
                      -|||.++++++|++.+.+.
T Consensus        15 LNlpkdAt~eeI~~AYrr~   33 (546)
T KOG0718|consen   15 LNLPKDATDEEIKKAYRRL   33 (546)
T ss_pred             hCCCcccCHHHHHHHHHHH
Confidence            5999999999999887653


No 210
>COG2118 DNA-binding protein [General function prediction only]
Probab=20.69  E-value=1.4e+02  Score=15.29  Aligned_cols=20  Identities=20%  Similarity=0.499  Sum_probs=16.6

Q ss_pred             EecCCCCCCHHHHHHHhhhC
Q psy7050          18 VDNLTYRTTPDDLRRVFERC   37 (61)
Q Consensus        18 v~~l~~~~~~~~l~~~f~~~   37 (61)
                      .|.+...+++++|.+++...
T Consensus        82 tGri~~~I~e~~lk~IL~~i  101 (116)
T COG2118          82 TGRITHKIDEEELKEILERI  101 (116)
T ss_pred             cCCCCCCCCHHHHHHHHHHH
Confidence            36788899999999998764


No 211
>KOG4285|consensus
Probab=20.17  E-value=98  Score=18.69  Aligned_cols=29  Identities=24%  Similarity=0.322  Sum_probs=18.9

Q ss_pred             EEEecCCCCCCHHHHHHHhhhCCCeEEEEE
Q psy7050          16 LKVDNLTYRTTPDDLRRVFERCGEVGDIYI   45 (61)
Q Consensus        16 l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~   45 (61)
                      |-|-++++... .-+..+|++||.|.+...
T Consensus       200 VTVfGFppg~~-s~vL~~F~~cG~Vvkhv~  228 (350)
T KOG4285|consen  200 VTVFGFPPGQV-SIVLNLFSRCGEVVKHVT  228 (350)
T ss_pred             EEEeccCccch-hHHHHHHHhhCeeeeeec
Confidence            44445555533 446678999999887643


Done!