Query         018993
Match_columns 348
No_of_seqs    152 out of 199
Neff          4.4 
Searched_HMMs 46136
Date          Fri Mar 29 05:48:16 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/018993.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/018993hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2953 mRNA-binding protein E 100.0 3.8E-43 8.1E-48  348.5  15.0  312   11-343    86-413 (432)
  2 cd02642 R3H_encore_like R3H do  99.8 2.9E-20 6.3E-25  142.0   7.2   63   34-101     1-63  (63)
  3 PF12752 SUZ:  SUZ domain;  Int  99.5 6.5E-14 1.4E-18  106.2   4.6   50  127-176     2-59  (59)
  4 PF01424 R3H:  R3H domain;  Int  99.3 7.4E-12 1.6E-16   94.3   6.5   63   34-101     1-63  (63)
  5 cd02325 R3H R3H domain. The na  99.2 5.4E-11 1.2E-15   85.7   6.1   59   38-100     1-59  (59)
  6 smart00393 R3H Putative single  99.1 7.6E-11 1.6E-15   93.2   5.1   74   23-101     5-79  (79)
  7 cd02641 R3H_Smubp-2_like R3H d  98.9 1.9E-09 4.2E-14   82.1   6.5   58   39-100     2-60  (60)
  8 cd06006 R3H_unknown_2 R3H doma  98.9 2.9E-09 6.4E-14   81.3   6.6   57   40-100     3-59  (59)
  9 cd02646 R3H_G-patch R3H domain  98.9   3E-09 6.5E-14   80.2   5.9   58   38-100     1-58  (58)
 10 cd02640 R3H_NRF R3H domain of   98.6 1.7E-07 3.6E-12   71.8   6.3   57   40-100     3-60  (60)
 11 cd06007 R3H_DEXH_helicase R3H   98.5 2.1E-07 4.6E-12   71.0   6.0   57   39-100     2-59  (59)
 12 cd02636 R3H_sperm-antigen R3H   98.5 2.4E-07 5.3E-12   71.3   6.0   58   40-100     3-60  (61)
 13 cd02643 R3H_NF-X1 R3H domain o  98.5 2.4E-07 5.3E-12   73.2   5.8   63   33-99      6-73  (74)
 14 cd02644 R3H_jag R3H domain fou  97.2  0.0012 2.7E-08   51.4   7.0   61   35-100     5-66  (67)
 15 cd02639 R3H_RRM R3H domain of   96.9 0.00049 1.1E-08   52.9   1.6   59   39-100     2-60  (60)
 16 cd02645 R3H_AAA R3H domain of   96.4    0.01 2.2E-07   45.7   5.9   53   42-99      7-59  (60)
 17 cd02638 R3H_unknown_1 R3H doma  95.7   0.028   6E-07   43.8   5.6   56   40-99      3-60  (62)
 18 cd02637 R3H_PARN R3H domain of  89.5    0.49 1.1E-05   36.9   3.8   36   41-78      4-39  (65)
 19 COG1847 Jag Predicted RNA-bind  75.7      10 0.00022   36.2   7.0   60   36-100   147-207 (208)
 20 KOG1952 Transcription factor N  68.1       4 8.7E-05   45.7   2.9   80   33-116   817-903 (950)
 21 PF12206 DUF3599:  Domain of un  43.2     4.4 9.6E-05   35.3  -1.5   19   65-84      2-20  (117)
 22 KOG2953 mRNA-binding protein E  32.5       7 0.00015   40.8  -2.3   54   30-85     23-76  (432)
 23 cd01611 GABARAP Ubiquitin doma  30.5      45 0.00098   28.5   2.6   18  153-170     3-20  (112)
 24 PF06262 DUF1025:  Possibl zinc  24.4      39 0.00085   28.3   1.2   17   67-83     74-90  (97)
 25 PTZ00380 microtubule-associate  22.8      71  0.0015   28.1   2.5   19  152-170     5-23  (121)
 26 PF09851 SHOCT:  Short C-termin  22.2      63  0.0014   21.5   1.6   12  161-172    19-30  (31)
 27 PF05572 Peptidase_M43:  Pregna  21.6      53  0.0012   29.2   1.5   22   64-85     67-88  (154)
 28 KOG3248 Transcription factor T  20.8 1.5E+02  0.0032   30.7   4.5   65  267-331    54-127 (421)
 29 KOG3379 Diadenosine polyphosph  20.1 1.2E+02  0.0027   27.5   3.4   22  150-171   129-150 (150)

No 1  
>KOG2953 consensus mRNA-binding protein Encore [RNA processing and modification]
Probab=100.00  E-value=3.8e-43  Score=348.54  Aligned_cols=312  Identities=41%  Similarity=0.536  Sum_probs=272.6

Q ss_pred             hhhhccCCCCchHHHHHHhcCchhHHHHHHHHHHHHHHhcCCCccceecCCCCChHHHHHHHHhhhhhccceeeecc---
Q 018993           11 AAIVNERESMVDPFLVEALQNPRHRLTILRMELDIQRFLQNPDQQHFEFQHFPTSYLRLAAHRVSQHYGLVTMVQEN---   87 (348)
Q Consensus        11 ~~~~~~~~~~vd~~L~eAL~npkDRl~lLrlE~di~~FI~d~~~~~lel~pmpnSY~RLLvHRvA~yygL~h~v~d~---   87 (348)
                      .+.+-+.+..+|.|++|||+|||+|++|+++|.+|.+|+++...++++|+++++||+|++.||||++|+|.+.+.+.   
T Consensus        86 ~~~~~p~e~~~dy~~ve~~qnpR~~~~lsR~El~~~~~~Q~~~~qqt~~q~~~ts~~~~~~~rvaq~y~l~T~~~p~~~~  165 (432)
T KOG2953|consen   86 QANKIPNEQAVDYFLVEALQNPRHRLTLSRKELDIQCQFQGPVQQQTEFQNYPTSYLRLAAHRVAQHYGLATTGEPSYIS  165 (432)
T ss_pred             ccccCchhhcccHHHHhhhhcchhhhhhhcccchhhhhhcCcccccccCCCccccchhhhhccccccccccccccccccc
Confidence            34466788899999999999999999999999999999999999999999999999999999999999999988766   


Q ss_pred             cccCCccEEEEEecCCCCCCcccccccccCCcCcCchhhhhhhhhccCCCCCCCCC-CCCCCCCCCCCCCHHHHHHHHHH
Q 018993           88 GIEGLGNRILVRKTAESKYPAVRLSEIPAKQSEESDKLEKIKIAIRRRPNAGCVNG-ANETGTKRSPVRSVEERKEEYDR  166 (348)
Q Consensus        88 ~~Dgs~~~Ivv~KT~~triP~vrLsdl~~~~~~~s~~~~~~K~~ImkR~~k~s~~~-~~~~~~k~~~~kS~EEREeeY~r  166 (348)
                      +.|+...+|+++|+.+++.|.++|++++.+.+.+++..+..|+.|.-|+.+++... .++.+......+|+||||++|.+
T Consensus       166 ~~~p~eqR~l~~k~~~s~~P~~~~~~~P~ssp~~~~~~~~~~~~~sp~p~~g~G~~~~~p~~~~~~~~~S~~~~kq~yd~  245 (432)
T KOG2953|consen  166 GIDPYEQRILVTKTGESRFPGVSLSEIPVSSPSSNGWSEQRKGDISPRPTSGGGVSLSSPSNPQVTLLRSVEERKQEYDK  245 (432)
T ss_pred             ccCchhccccccccccccCCchhhccccccCccccccccccccccCCCCCCCCcccccCCcCCCccccccchhhhhhhhh
Confidence            57888899999999999999999999999766667889999999999988765443 22333344578999999999999


Q ss_pred             HHHhhcCCCCCCCCccccccccCC---CCC--CCCCCchhhhhcc-cccccccccccCCCCCCCceeeecccccccCCCc
Q 018993          167 ARARIFSGPSSPNSEDTLTQVSTD---MKN--IGFNRDEREIVRN-SITDAEKIISIRDGAGLSRVAIFRDREKDRTDPD  240 (348)
Q Consensus       167 AReRIF~~~~s~d~~~~~~~~~~~---~~~--~~~~r~e~~~~~~-~~~~~ek~~~~r~~~~~~rvAi~RdrekDr~DPD  240 (348)
                      ||+|||+.....+++|++...++.   ..+  ++.+|.+.+.+.| .++.-+++...|+.|+.+||||+|||||||+|||
T Consensus       246 ~r~r~g~~~~~~~s~Dss~q~~p~~~~~~~g~~~~~~~~~p~~~N~~Pv~~~~~g~~~~~gps~~v~~nr~rr~~ry~p~  325 (432)
T KOG2953|consen  246 ARGRIGSKPVTNDSKDSSSQQPPQNYQSGNGDPRLSRLEQPVSYNSPPLMHGPNGITRESGPSPRVAGNRDRRPDRYDPD  325 (432)
T ss_pred             hhccccCccccccCcccccccCCccccCCCCccccccCCcccccCCCcccccCCCcccCCCCCcccccccccchhhcCcc
Confidence            999999999999999999887765   444  5689999999888 7888889999999999999999999999999999


Q ss_pred             hhhh-----ccCCCCCCCCCCCCCCCCcccCCCccccCCCCCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCcc
Q 018993          241 YDRS-----YERSLPTNQGFSLPPFNMQKVQLPFMQYDTGFPQFSQIPRTQASLSFRPPSSPVMSPYCAVGPNQTSVEAA  315 (348)
Q Consensus       241 ydR~-----y~~~~~~~~~f~~~~~~~q~~~~p~~~y~~~f~q~~~~~~~~~~~~~~~~~~~~m~p~~~~~~~~~~~~~~  315 (348)
                      |||+     |-+.+|||+.|+..++   .+.+|+  +...|+...+.+ +        ++++.|+||..       -+++
T Consensus       326 ~dr~~~~~~yv~~~Pp~q~~~~~~~---ql~~~~--~~i~~~~~pq~~-~--------~~n~~~s~~s~-------~a~~  384 (432)
T KOG2953|consen  326 YDRSCGFVRYVTMLPPGQTFMQYQK---QLHTPY--HKIPFPNDPQGN-G--------GDNPARSEASH-------LAAK  384 (432)
T ss_pred             cccCCCCcceeccCCCccccccccc---ccCCcc--cccccCCCCcCC-C--------CCCcccccccc-------cccc
Confidence            9999     2299999999999987   356677  888888854433 1        66799999933       4899


Q ss_pred             cccCC-CccccccCchhHhhhhhhhhhhh
Q 018993          316 YMQWP-SAAMMYAHSYEQFRQAAFQVFFG  343 (348)
Q Consensus       316 y~~~p-~~~m~y~h~~~~~~~~~~~~~~~  343 (348)
                      |++|| .|.|+|+|....+++..+++.|-
T Consensus       385 yt~~p~~p~~~~a~n~~~~~~~~~ras~~  413 (432)
T KOG2953|consen  385 YTVLPAFPSMSYASNANEKKNGNSRASFK  413 (432)
T ss_pred             eeeccccccchhccchhhhhcCceeeecc
Confidence            99999 99999999999999999999874


No 2  
>cd02642 R3H_encore_like R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=99.82  E-value=2.9e-20  Score=142.05  Aligned_cols=63  Identities=41%  Similarity=0.711  Sum_probs=58.5

Q ss_pred             hHHHHHHHHHHHHHHhcCCCccceecCCCCChHHHHHHHHhhhhhccceeeecccccCCccEEEEEec
Q 018993           34 HRLTILRMELDIQRFLQNPDQQHFEFQHFPTSYLRLAAHRVSQHYGLVTMVQENGIEGLGNRILVRKT  101 (348)
Q Consensus        34 DRl~lLrlE~di~~FI~d~~~~~lel~pmpnSY~RLLvHRvA~yygL~h~v~d~~~Dgs~~~Ivv~KT  101 (348)
                      ||++||+||++|++||+++..+.++|+|| |||+|||+|+||+||||.|++++.+    +.+|+|.||
T Consensus         1 dr~~~l~~E~~i~~Fi~~~~~~~~~f~pm-~sy~RllvH~la~~~gL~s~s~~~~----~r~vvv~kt   63 (63)
T cd02642           1 DRLFVLKLEKDLLAFIKDSTRQSLELPPM-NSYYRLLAHRVAQYYGLDHNVDNSG----GKCVIVNKT   63 (63)
T ss_pred             CchHHHHHHHHHHHHHhCCCCCeeEcCCC-CcHHHHHHHHHHHHhCCeeEeecCC----ceEEEEEeC
Confidence            79999999999999999997788999999 9999999999999999999997643    578999987


No 3  
>PF12752 SUZ:  SUZ domain;  InterPro: IPR024771 The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched in positively charged amino acids. It was first characterised in the Caenorhabditis elegans protein SZY-20 where it has been shown to bind RNA and allow their localization to the centrosome [].
Probab=99.45  E-value=6.5e-14  Score=106.18  Aligned_cols=50  Identities=42%  Similarity=0.677  Sum_probs=37.5

Q ss_pred             hhhhhhccCCCCCCCCC--------CCCCCCCCCCCCCHHHHHHHHHHHHHhhcCCCC
Q 018993          127 KIKIAIRRRPNAGCVNG--------ANETGTKRSPVRSVEERKEEYDRARARIFSGPS  176 (348)
Q Consensus       127 ~~K~~ImkR~~k~s~~~--------~~~~~~k~~~~kS~EEREeeY~rAReRIF~~~~  176 (348)
                      .++++||||+++++...        .+..+.+....||+||||++|++||+|||++++
T Consensus         2 ~p~~~IlkRp~~~~~~~~~~~~~~~~~~~~~~~~~~kSlEERE~eY~~AR~RIFg~~~   59 (59)
T PF12752_consen    2 KPKRKILKRPSKGSSSSDSGSSGSSPNSSSRKKRPSKSLEEREAEYAEARARIFGSSE   59 (59)
T ss_pred             CCCCeEecCCCCCCCcccccccccCCCcccccccccCCHHHHHHHHHHHHHHHhCCCC
Confidence            46788999986554322        112234568899999999999999999999864


No 4  
>PF01424 R3H:  R3H domain;  InterPro: IPR001374 The R3H motif: a domain that binds single-stranded nucleic acids. The most prominent feature of the R3H motif is the presence of an invariant arginine residue and a highly conserved histidine residue that are separated by three residues. The motif also displays a conserved pattern of hydrophobic residues, prolines and glycines. The R3H motif is present in proteins from a diverse range of organisms that includes Eubacteria, green plants, fungi and various groups of metazoans. Intriguingly, it has not yet been identified in Archaea and Escherichia coli. The sequences that contain the R3H domain, many of which are hypothetical proteins predicted from genome sequencing projects, can be grouped into eight families on the basis of similarities outside the R3H region. Three of the families contain ATPase domains either upstream (families II and VII) or downstream of the R3H domain (family VIII). The N-terminal part of members of family VII contains an SF1 helicase domain5. The C-terminal part of family VIII contains an SF2 DEAH helicase domain5. The ATPase domain in the members of family II is similar to the stage-III sporulation protein AA (S3AA_BACSU), the proteasome ATPase, bacterial transcription-termination factor r and the mitochondrial F1-ATPase b subunit (the F5 helicase family5). Family VI contains Cys-rich repeats6, as well as a ring-type zinc finger upstream of the R3H domain. JAG bacterial proteins (family I) contain a KH domain N-terminal to the R3H domain. The functions of other domains in R3H proteins support the notion that the R3H domain might be involved in interactions with single-stranded nucleic acids [].; GO: 0003676 nucleic acid binding; PDB: 1WHR_A 1MSZ_A 1UG8_A 3GKU_B 2CPM_A.
Probab=99.28  E-value=7.4e-12  Score=94.34  Aligned_cols=63  Identities=25%  Similarity=0.471  Sum_probs=52.7

Q ss_pred             hHHHHHHHHHHHHHHhcCCCccceecCCCCChHHHHHHHHhhhhhccceeeecccccCCccEEEEEec
Q 018993           34 HRLTILRMELDIQRFLQNPDQQHFEFQHFPTSYLRLAAHRVSQHYGLVTMVQENGIEGLGNRILVRKT  101 (348)
Q Consensus        34 DRl~lLrlE~di~~FI~d~~~~~lel~pmpnSY~RLLvHRvA~yygL~h~v~d~~~Dgs~~~Ivv~KT  101 (348)
                      .|..|+++++++++|+.++.. .++|+|| |+|+|++||.+|++|||.|.+.+.+   ...+|+|+||
T Consensus         1 r~~~l~~~~~~~~~~~~~~~~-~~~f~pm-~~~~R~~iH~~a~~~gL~s~S~g~~---~~R~vvv~k~   63 (63)
T PF01424_consen    1 RREELEKIEEKLIEFFLSSGE-SLEFPPM-NSFERKLIHELAEYYGLKSKSEGEG---PNRRVVVSKT   63 (63)
T ss_dssp             HHHHHHHHHHHHHHHHHHCSS-EEEEEC---SHHHHHHHHHHHHCTEEEEEESSS---SSSEEEEEES
T ss_pred             ChHHHHHHHHHHHHHHHcCCC-EEEECCC-CHHHHHHHHHHHHHCCCEEEEecCC---CCeEEEEEeC
Confidence            367899999999999976665 7999999 9999999999999999999997643   4467999886


No 5  
>cd02325 R3H R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. R3H domains are found in proteins together with ATPase domains, SF1 helicase domains, SF2 DEAH helicase domains, Cys-rich repeats, ring-type zinc fingers, and KH domains. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=99.17  E-value=5.4e-11  Score=85.72  Aligned_cols=59  Identities=22%  Similarity=0.412  Sum_probs=50.4

Q ss_pred             HHHHHHHHHHHhcCCCccceecCCCCChHHHHHHHHhhhhhccceeeecccccCCccEEEEEe
Q 018993           38 ILRMELDIQRFLQNPDQQHFEFQHFPTSYLRLAAHRVSQHYGLVTMVQENGIEGLGNRILVRK  100 (348)
Q Consensus        38 lLrlE~di~~FI~d~~~~~lel~pmpnSY~RLLvHRvA~yygL~h~v~d~~~Dgs~~~Ivv~K  100 (348)
                      ++++|+.|++|+.+.....++|+|| |+|+|.++|++|++|||.+.+.+.+   ...+|+|.+
T Consensus         1 ~~~~~~~l~~f~~~~~~~~~~~~p~-~~~~R~~vH~la~~~~L~s~s~g~~---~~r~v~i~~   59 (59)
T cd02325           1 REEREEELEAFAKDAAGKSLELPPM-NSYERKLIHDLAEYYGLKSESEGEG---PNRRVVITK   59 (59)
T ss_pred             ChHHHHHHHHHHHhhcCCeEEcCCC-CHHHHHHHHHHHHHCCCEEEEecCC---CCcEEEEeC
Confidence            4689999999999996678999999 9999999999999999999997643   345677653


No 6  
>smart00393 R3H Putative single-stranded nucleic acids-binding domain.
Probab=99.12  E-value=7.6e-11  Score=93.19  Aligned_cols=74  Identities=30%  Similarity=0.597  Sum_probs=62.6

Q ss_pred             HHHHHHhc-CchhHHHHHHHHHHHHHHhcCCCccceecCCCCChHHHHHHHHhhhhhccceeeecccccCCccEEEEEec
Q 018993           23 PFLVEALQ-NPRHRLTILRMELDIQRFLQNPDQQHFEFQHFPTSYLRLAAHRVSQHYGLVTMVQENGIEGLGNRILVRKT  101 (348)
Q Consensus        23 ~~L~eAL~-npkDRl~lLrlE~di~~FI~d~~~~~lel~pmpnSY~RLLvHRvA~yygL~h~v~d~~~Dgs~~~Ivv~KT  101 (348)
                      +++++.+. +++.+..|++++.++.+|+..... .++|+|| |+|+|.++|++|+.|||.|.+.+.|   ...+|+|.|+
T Consensus         5 ~~~~d~~~~~~~~~~~l~~~~~~~~~~v~~~~~-~~~~~pm-~~~~R~~iH~~a~~~~l~s~S~g~g---~~R~vvv~~~   79 (79)
T smart00393        5 PVTLDALSYRPRRREELIELELEIARFVKSTKE-SVELPPM-NSYERKIVHELAEKYGLESESFGEG---PKRRVVISKK   79 (79)
T ss_pred             eEEEECCccCHHHHHHHHHHHHHHHHHHhccCC-eEEcCCC-CHHHHHHHHHHHHHcCCEEEEEcCC---CCcEEEEEeC
Confidence            34455664 789999999999999999987765 6999999 9999999999999999999997754   3367888764


No 7  
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins.  Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=98.95  E-value=1.9e-09  Score=82.15  Aligned_cols=58  Identities=24%  Similarity=0.411  Sum_probs=49.7

Q ss_pred             HHHHHHHHHHhcCCCccceecCC-CCChHHHHHHHHhhhhhccceeeecccccCCccEEEEEe
Q 018993           39 LRMELDIQRFLQNPDQQHFEFQH-FPTSYLRLAAHRVSQHYGLVTMVQENGIEGLGNRILVRK  100 (348)
Q Consensus        39 LrlE~di~~FI~d~~~~~lel~p-mpnSY~RLLvHRvA~yygL~h~v~d~~~Dgs~~~Ivv~K  100 (348)
                      .++|+.|.+||+++....++||| | |+++|.+||.||+.|||.|...+.|   ....|+|.|
T Consensus         2 ~~~~~~i~~F~~~~~~~~l~F~p~l-s~~eR~~vH~lA~~~gL~s~S~G~g---~~R~v~v~k   60 (60)
T cd02641           2 KHLKAMVKAFMKDPKATELEFPPTL-SSHDRLLVHELAEELGLRHESTGEG---SDRVITVSK   60 (60)
T ss_pred             hhHHHHHHHHHcCCCcCcEECCCCC-CHHHHHHHHHHHHHcCCceEeeCCC---CceEEEeeC
Confidence            46899999999999877899999 9 9999999999999999999987643   334566654


No 8  
>cd06006 R3H_unknown_2 R3H domain of a group of fungal proteins with unknown function. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA  or ssRNA in a sequence-specific manner.
Probab=98.92  E-value=2.9e-09  Score=81.30  Aligned_cols=57  Identities=21%  Similarity=0.463  Sum_probs=50.4

Q ss_pred             HHHHHHHHHhcCCCccceecCCCCChHHHHHHHHhhhhhccceeeecccccCCccEEEEEe
Q 018993           40 RMELDIQRFLQNPDQQHFEFQHFPTSYLRLAAHRVSQHYGLVTMVQENGIEGLGNRILVRK  100 (348)
Q Consensus        40 rlE~di~~FI~d~~~~~lel~pmpnSY~RLLvHRvA~yygL~h~v~d~~~Dgs~~~Ivv~K  100 (348)
                      ++|+.|.+||+|.....+.|+|| |+++|-+||-||++|||.++..|.+++   .+|+|.|
T Consensus         3 ~~E~~l~~fv~d~~~~~~~f~pM-~~~~R~~vHdla~~~gl~SeS~d~Ep~---R~V~v~k   59 (59)
T cd06006           3 QIESTLRKFINDKSKRSLRFPPM-RSPQRAFIHELAKDYGLYSESQDPEPK---RSVFVKK   59 (59)
T ss_pred             hHHHHHHHHHhCCCCCceeCCCC-CHHHHHHHHHHHHHcCCeeEecCCCCC---cEEEEeC
Confidence            79999999999987778999999 999999999999999999999876553   4577764


No 9  
>cd02646 R3H_G-patch R3H domain of a group of fungal and plant proteins with unknown function, who also contain a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the R3H domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=98.90  E-value=3e-09  Score=80.16  Aligned_cols=58  Identities=21%  Similarity=0.329  Sum_probs=49.5

Q ss_pred             HHHHHHHHHHHhcCCCccceecCCCCChHHHHHHHHhhhhhccceeeecccccCCccEEEEEe
Q 018993           38 ILRMELDIQRFLQNPDQQHFEFQHFPTSYLRLAAHRVSQHYGLVTMVQENGIEGLGNRILVRK  100 (348)
Q Consensus        38 lLrlE~di~~FI~d~~~~~lel~pmpnSY~RLLvHRvA~yygL~h~v~d~~~Dgs~~~Ivv~K  100 (348)
                      |-++|++|.+|+.++. ..+.|||| ++++|.+||+||+.|||.+...+.|   ....|+|+|
T Consensus         1 ~~~i~~~i~~F~~~~~-~~~~fppm-~~~~R~~vH~lA~~~~L~S~S~G~g---~~R~v~v~k   58 (58)
T cd02646           1 IEDIKDEIEAFLLDSR-DSLSFPPM-DKHGRKTIHKLANCYNLKSKSRGKG---KKRFVTVTK   58 (58)
T ss_pred             ChHHHHHHHHHHhCCC-ceEecCCC-CHHHHHHHHHHHHHcCCcccccccC---CceEEEEEC
Confidence            3478999999999885 57999999 9999999999999999999987643   445688875


No 10 
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=98.56  E-value=1.7e-07  Score=71.81  Aligned_cols=57  Identities=23%  Similarity=0.374  Sum_probs=47.5

Q ss_pred             HHHHHHHHHhcCCCccceecCC-CCChHHHHHHHHhhhhhccceeeecccccCCccEEEEEe
Q 018993           40 RMELDIQRFLQNPDQQHFEFQH-FPTSYLRLAAHRVSQHYGLVTMVQENGIEGLGNRILVRK  100 (348)
Q Consensus        40 rlE~di~~FI~d~~~~~lel~p-mpnSY~RLLvHRvA~yygL~h~v~d~~~Dgs~~~Ivv~K  100 (348)
                      .+++.|.+|+.+.....+.||| | ++++|.+||.||.-+||.|.....   |....|+|+|
T Consensus         3 ~~~~~i~~F~~s~~~~~l~f~p~l-t~~eR~~vH~~a~~~gL~s~S~G~---g~~R~v~v~k   60 (60)
T cd02640           3 DYRQIIQNYAHSDDIRDMVFSPEF-SKEERALIHQIAQKYGLKSRSYGS---GNDRYLVISK   60 (60)
T ss_pred             hHHHHHHHHHcCCccceEEcCCCC-CHHHHHHHHHHHHHcCCceeeEeC---CCCeEEEEeC
Confidence            5789999999998777899999 9 999999999999999999998653   2334566654


No 11 
>cd06007 R3H_DEXH_helicase R3H domain of a group of proteins which also contain a DEXH-box helicase domain, and may function as ATP-dependent DNA or RNA helicases. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=98.52  E-value=2.1e-07  Score=71.04  Aligned_cols=57  Identities=23%  Similarity=0.429  Sum_probs=47.1

Q ss_pred             HHHHHHHHHHhcCCCccceecCC-CCChHHHHHHHHhhhhhccceeeecccccCCccEEEEEe
Q 018993           39 LRMELDIQRFLQNPDQQHFEFQH-FPTSYLRLAAHRVSQHYGLVTMVQENGIEGLGNRILVRK  100 (348)
Q Consensus        39 LrlE~di~~FI~d~~~~~lel~p-mpnSY~RLLvHRvA~yygL~h~v~d~~~Dgs~~~Ivv~K  100 (348)
                      +.+++.|.+|+++. ...++||| | |+++|.+||+||..+||.|.....   |....|+|.|
T Consensus         2 i~i~~~i~~F~~~~-~~~l~Fpp~l-s~~eR~~vH~~a~~~gL~s~S~G~---g~~R~v~v~K   59 (59)
T cd06007           2 IAINKALEDFRASD-NEEYEFPSSL-TNHERAVIHRLCRKLGLKSKSKGK---GSNRRLSVYK   59 (59)
T ss_pred             ccHHHHHHHHHcCc-ccEEEcCCCC-CHHHHHHHHHHHHHcCCCceeecC---CCCeEEEEeC
Confidence            45789999999988 67899999 8 999999999999999999997543   3334566654


No 12 
>cd02636 R3H_sperm-antigen R3H domain of a group of metazoan proteins that is related to the sperm-associated antigen 7. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=98.51  E-value=2.4e-07  Score=71.33  Aligned_cols=58  Identities=19%  Similarity=0.359  Sum_probs=49.0

Q ss_pred             HHHHHHHHHhcCCCccceecCCCCChHHHHHHHHhhhhhccceeeecccccCCccEEEEEe
Q 018993           40 RMELDIQRFLQNPDQQHFEFQHFPTSYLRLAAHRVSQHYGLVTMVQENGIEGLGNRILVRK  100 (348)
Q Consensus        40 rlE~di~~FI~d~~~~~lel~pmpnSY~RLLvHRvA~yygL~h~v~d~~~Dgs~~~Ivv~K  100 (348)
                      ++|+.+.+||+|...+..+|+|| |+|+|-+||.+|+..||.+.....  ++...+|+|.|
T Consensus         3 ~~e~~~~~f~~d~~~~~~~l~pM-~~~eRkivHDv~~~~Gl~S~S~Ge--ee~~R~VVv~~   60 (61)
T cd02636           3 SMEKEVSKFIKDSVRTREKFQPM-DKVERSIVHDVAEVAGLTSFSFGE--DEVDRYVMIFK   60 (61)
T ss_pred             hHHHHHHHHhhcccccccccCCC-CHHHHHHHHHHHHhcCceeEecCC--CCCceEEEEec
Confidence            68999999999988788899999 999999999999999999988643  22335677764


No 13 
>cd02643 R3H_NF-X1 R3H domain of the X1 box binding protein (NF-X1) and related proteins. Human NF-X1 is a transcription factor that regulates the expression of class II major histocompatibility complex (MHC) genes. The Drosophila homolog shuttle craft (STC) has been shown to be a DNA- or RNA-binding protein required for proper axon guidance in the central nervous system and, the yeast homolog FAP1 encodes a dosage suppressor of rapamycin toxicity. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=98.49  E-value=2.4e-07  Score=73.24  Aligned_cols=63  Identities=11%  Similarity=0.305  Sum_probs=52.1

Q ss_pred             hhHHHHHHHHHHHHHHhcCCC-----ccceecCCCCChHHHHHHHHhhhhhccceeeecccccCCccEEEEE
Q 018993           33 RHRLTILRMELDIQRFLQNPD-----QQHFEFQHFPTSYLRLAAHRVSQHYGLVTMVQENGIEGLGNRILVR   99 (348)
Q Consensus        33 kDRl~lLrlE~di~~FI~d~~-----~~~lel~pmpnSY~RLLvHRvA~yygL~h~v~d~~~Dgs~~~Ivv~   99 (348)
                      ++--|+.++|+.|..|+.+..     ...+.|+|| |+|+|-+||-+|++|||.+...+.++.   .+|+|+
T Consensus         6 ~~~~~~~~vE~~l~~la~~~~~~~~~~~~~~l~PM-~~~eR~iIH~la~~~~l~S~S~G~ep~---R~VvI~   73 (74)
T cd02643           6 KDPKFVKDVEKDLIELVESVNKGKQTSRSHSFPPM-NREKRRIVHELAEHFGIESVSYDQEPK---RNVVAT   73 (74)
T ss_pred             HCHHHHHHHHHHHHHHHHHHHhccccCCeeECCCC-CHHHHHHHHHHHhhCCCEEEecCCCCC---ceEEEe
Confidence            445699999999999999643     246899999 999999999999999999999875543   467775


No 14 
>cd02644 R3H_jag R3H domain found in proteins homologous to Bacillus subtilus Jag, which is associated with SpoIIIJ. SpoIIIJ is necessary for the third stage of sporulation. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=97.24  E-value=0.0012  Score=51.41  Aligned_cols=61  Identities=13%  Similarity=0.211  Sum_probs=49.1

Q ss_pred             HHHHHHHHHHHHHHhcCCCccceecCCCCChHHHHHHHHhhhhhc-cceeeecccccCCccEEEEEe
Q 018993           35 RLTILRMELDIQRFLQNPDQQHFEFQHFPTSYLRLAAHRVSQHYG-LVTMVQENGIEGLGNRILVRK  100 (348)
Q Consensus        35 Rl~lLrlE~di~~FI~d~~~~~lel~pmpnSY~RLLvHRvA~yyg-L~h~v~d~~~Dgs~~~Ivv~K  100 (348)
                      .-.|.+|-+.+.+.+..... .+.|+|| |+|+|-+||.++.-|. |.+...+.|   ...+|+|.+
T Consensus         5 ~~~L~~~A~~~a~~v~~tg~-~~~l~PM-~~~eRrivH~~~~~~~~l~T~S~G~~---~~R~vvI~~   66 (67)
T cd02644           5 EETLIRLAERAAEKVRRTGK-PVKLEPM-NAYERRIIHDALANDEDVETESEGEG---PYRRVVISP   66 (67)
T ss_pred             HHHHHHHHHHHHHHHHHHCC-eeEeCCC-CHHHHHHHHHHHHhCCCceEEeecCC---CCeEEEEEe
Confidence            34677788888888887775 5999999 9999999999999877 999987643   346788764


No 15 
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=96.87  E-value=0.00049  Score=52.87  Aligned_cols=59  Identities=15%  Similarity=0.228  Sum_probs=45.0

Q ss_pred             HHHHHHHHHHhcCCCccceecCCCCChHHHHHHHHhhhhhccceeeecccccCCccEEEEEe
Q 018993           39 LRMELDIQRFLQNPDQQHFEFQHFPTSYLRLAAHRVSQHYGLVTMVQENGIEGLGNRILVRK  100 (348)
Q Consensus        39 LrlE~di~~FI~d~~~~~lel~pmpnSY~RLLvHRvA~yygL~h~v~d~~~Dgs~~~Ivv~K  100 (348)
                      |.+-.+|+-|..+.....+.|||--+.-+|.++|.||..+||.|.....   |....|+|+|
T Consensus         2 l~~YsqlllFkdd~~~~eL~Fp~~ls~~eRriih~la~~lGL~~~s~G~---g~~R~v~v~k   60 (60)
T cd02639           2 LEIYSQLLLFKDDRMRDELAFPSSLSPAERRIVHLLASRLGLNHVSDGT---GERRQVQITK   60 (60)
T ss_pred             ccceeeEEEEecCCCceEEEcCCCCCHHHHHHHHHHHHHcCCceEEeCC---CceEEEeecC
Confidence            3444566778888888889999833999999999999999999998653   2334565543


No 16 
>cd02645 R3H_AAA R3H domain of a group of proteins with unknown function, who also contain a AAA-ATPase (AAA) domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA or ssRNA in a sequence-specific manner.
Probab=96.40  E-value=0.01  Score=45.65  Aligned_cols=53  Identities=19%  Similarity=0.317  Sum_probs=38.5

Q ss_pred             HHHHHHHhcCCCccceecCCCCChHHHHHHHHhhhhhccceeeecccccCCccEEEEE
Q 018993           42 ELDIQRFLQNPDQQHFEFQHFPTSYLRLAAHRVSQHYGLVTMVQENGIEGLGNRILVR   99 (348)
Q Consensus        42 E~di~~FI~d~~~~~lel~pmpnSY~RLLvHRvA~yygL~h~v~d~~~Dgs~~~Ivv~   99 (348)
                      +..+..-+. +.....+|.|| |+|-|-++|.+.+.|||.+.....++   ..+|+|.
T Consensus         7 ~~aa~~V~~-~~~~~veL~Pm-~~~eRri~H~~v~~~~l~s~S~G~ep---~RrvvI~   59 (60)
T cd02645           7 RLAIEQVVI-PKGEPVELLPR-SAYIRRLQHDLVERYQLRSESFGSEP---NRRLRIL   59 (60)
T ss_pred             HHHHHHHHh-cCCceEEcCCC-CHHHHHHHHHHHHHCCCeEEEecCCC---CcEEEEe
Confidence            334444443 44235899999 99999999999999999999975433   3467764


No 17 
>cd02638 R3H_unknown_1 R3H domain of a group of eukaryotic proteins with unknown function. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=95.73  E-value=0.028  Score=43.78  Aligned_cols=56  Identities=21%  Similarity=0.318  Sum_probs=40.2

Q ss_pred             HHHHHHHHHhcCCCc-cceecCCCCChHHHHHHHHhhh-hhccceeeecccccCCccEEEEE
Q 018993           40 RMELDIQRFLQNPDQ-QHFEFQHFPTSYLRLAAHRVSQ-HYGLVTMVQENGIEGLGNRILVR   99 (348)
Q Consensus        40 rlE~di~~FI~d~~~-~~lel~pmpnSY~RLLvHRvA~-yygL~h~v~d~~~Dgs~~~Ivv~   99 (348)
                      ...++|+-|++.... ..+.|+|| |+|.|-++|...+ +-++.+..+..   +...+|+|.
T Consensus         3 ~~~~~~~~f~~~~~~~r~v~LePM-~~~ERkIIH~~Lq~~~~v~T~S~G~---ep~RrVVI~   60 (62)
T cd02638           3 RVSEELEIFLLSFQRYRVLLFPPL-NSRRRYLIHQTVENRFLLSTFSVGE---GWARRTVVC   60 (62)
T ss_pred             hhHHHHHHHHHhcccCCeEecCCC-ChHHHHHHHHHHhcCCCceEEEccC---CCCcEEEEe
Confidence            356777788887633 45889999 9999999998654 66777777543   344567663


No 18 
>cd02637 R3H_PARN R3H domain of Poly(A)-specific ribonuclease (PARN). PARN is a poly(A)-specific 3' exonuclease from the RNase D family that, in Xenopus, deadenylates a specific class of maternal mRNAs which results in their translational repression. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=89.54  E-value=0.49  Score=36.92  Aligned_cols=36  Identities=14%  Similarity=0.255  Sum_probs=29.9

Q ss_pred             HHHHHHHHhcCCCccceecCCCCChHHHHHHHHhhhhh
Q 018993           41 MELDIQRFLQNPDQQHFEFQHFPTSYLRLAAHRVSQHY   78 (348)
Q Consensus        41 lE~di~~FI~d~~~~~lel~pmpnSY~RLLvHRvA~yy   78 (348)
                      +...|.+|+++. ...+++++| |+|+|-|++....+.
T Consensus         4 v~~~i~~fl~s~-~~~l~le~c-ngf~RkLiyq~l~~~   39 (65)
T cd02637           4 VIERIEAFLESE-EDDLELEPC-NGFQRKLIYQTLEQK   39 (65)
T ss_pred             HHHHHHHHHhcC-ccccccccc-ccHHHHHHHHHHHHH
Confidence            446678899887 557999999 999999999887765


No 19 
>COG1847 Jag Predicted RNA-binding protein [General function prediction only]
Probab=75.72  E-value=10  Score=36.16  Aligned_cols=60  Identities=17%  Similarity=0.276  Sum_probs=41.7

Q ss_pred             HHHHHHHHHHHHHhcCCCccceecCCCCChHHHHHHHHh-hhhhccceeeecccccCCccEEEEEe
Q 018993           36 LTILRMELDIQRFLQNPDQQHFEFQHFPTSYLRLAAHRV-SQHYGLVTMVQENGIEGLGNRILVRK  100 (348)
Q Consensus        36 l~lLrlE~di~~FI~d~~~~~lel~pmpnSY~RLLvHRv-A~yygL~h~v~d~~~Dgs~~~Ivv~K  100 (348)
                      -.|.+|-+.+-.=+....+ ..+|.|| ++|.|-+||.. .++=|+.+..+..   +...+|||.+
T Consensus       147 e~L~~LA~~~A~rV~~tg~-~v~L~pM-~~~ERkIVH~~l~~~~~V~T~SeG~---ep~R~vVV~~  207 (208)
T COG1847         147 ETLIKLAERAAERVLETGR-SVELEPM-PPFERKIVHTALSANPGVETYSEGE---EPNRRVVVRP  207 (208)
T ss_pred             HHHHHHHHHHHHHHHhhCC-eeecCCC-CHHHHHHHHHHHHhcCCcceeecCC---CCceEEEEec
Confidence            3555665555555554443 5899999 99999999984 6677888887643   3335677753


No 20 
>KOG1952 consensus Transcription factor NF-X1, contains NFX-type Zn2+-binding and R3H domains [Transcription]
Probab=68.12  E-value=4  Score=45.72  Aligned_cols=80  Identities=13%  Similarity=0.283  Sum_probs=57.8

Q ss_pred             hhHHHHHHHHHHHHHHhcCCCc------cceecCCCCChHHHHHHHHhhhhhccceeeecccccCCccEEEEEecC-CCC
Q 018993           33 RHRLTILRMELDIQRFLQNPDQ------QHFEFQHFPTSYLRLAAHRVSQHYGLVTMVQENGIEGLGNRILVRKTA-ESK  105 (348)
Q Consensus        33 kDRl~lLrlE~di~~FI~d~~~------~~lel~pmpnSY~RLLvHRvA~yygL~h~v~d~~~Dgs~~~Ivv~KT~-~tr  105 (348)
                      ++-.|+.-+|++++.|+.....      -+..||+| +-..|-+||-+|+.|+|.....+..+.   ..++++.+. .+.
T Consensus       817 ~~~~f~~sv~~e~~~lv~~~~~~~~~~~k~~~~p~m-s~~~rr~vh~~~e~~~l~~~sa~~~pk---r~~v~t~ir~~s~  892 (950)
T KOG1952|consen  817 KDLKFVKSVEKELEFLVELVKRGKNYSKKSHSFPPM-SRDKRRLVHELAEVFGLESVSADSEPK---RNVVVTAIRGKSV  892 (950)
T ss_pred             hchhhhccchhhhHHHHHHHhhcccccccccccCch-hHHHHHHHHhhhhccCCcccccCCCcc---cceeeEeeccccc
Confidence            4556888888888888765433      24569999 999999999999999999887664433   347777774 555


Q ss_pred             CCccccccccc
Q 018993          106 YPAVRLSEIPA  116 (348)
Q Consensus       106 iP~vrLsdl~~  116 (348)
                      +|.+.+++++.
T Consensus       893 ~~~~~~~~~~~  903 (950)
T KOG1952|consen  893 FPATTITGVLN  903 (950)
T ss_pred             CchhhHHHHHH
Confidence            56655665543


No 21 
>PF12206 DUF3599:  Domain of unknown function (DUF3599);  InterPro: IPR024556 This family of bacterial proteins includes phage-like element PBSX protein xkdH from Bacillus subtilis. The function of the family is unknown.; PDB: 3F3B_A.
Probab=43.23  E-value=4.4  Score=35.29  Aligned_cols=19  Identities=37%  Similarity=0.525  Sum_probs=6.3

Q ss_pred             hHHHHHHHHhhhhhccceee
Q 018993           65 SYLRLAAHRVSQHYGLVTMV   84 (348)
Q Consensus        65 SY~RLLvHRvA~yygL~h~v   84 (348)
                      ||++||+||| +.|||+...
T Consensus         2 Syq~mL~hrC-DIYHl~~~e   20 (117)
T PF12206_consen    2 SYQRMLTHRC-DIYHLEQKE   20 (117)
T ss_dssp             -------EEE-EEE--EEE-
T ss_pred             CHHHhhhccc-cccchhhhc
Confidence            8999999986 678885543


No 22 
>KOG2953 consensus mRNA-binding protein Encore [RNA processing and modification]
Probab=32.55  E-value=7  Score=40.76  Aligned_cols=54  Identities=13%  Similarity=-0.020  Sum_probs=38.0

Q ss_pred             cCchhHHHHHHHHHHHHHHhcCCCccceecCCCCChHHHHHHHHhhhhhccceeee
Q 018993           30 QNPRHRLTILRMELDIQRFLQNPDQQHFEFQHFPTSYLRLAAHRVSQHYGLVTMVQ   85 (348)
Q Consensus        30 ~npkDRl~lLrlE~di~~FI~d~~~~~lel~pmpnSY~RLLvHRvA~yygL~h~v~   85 (348)
                      .++-++..||..+.-|...++....+.-++.- ||||+|++-|+| .+|++++.+.
T Consensus        23 ~s~~~~~~~~~~~~~m~~~~~~~s~q~~~~~~-~ss~~~~~~~~c-v~f~~~~~q~   76 (432)
T KOG2953|consen   23 VSFINSNQLLFQLRPMQPYYQLLSHQIAPGHY-PSSVLQYRPDSC-VLFKGENNQK   76 (432)
T ss_pred             ccCCCcchhhhcccccCchhhcchhccCCccC-ccchhhccccce-eeeccccCcc
Confidence            56677777777777777777777665444444 488888888887 7888877663


No 23 
>cd01611 GABARAP Ubiquitin domain of GABA-receptor-associated protein. GABARAP  (GABA-receptor-associated protein) belongs ot a large family of proteins that mediate intracellular membrane trafficking and/or fusion.  GABARAP binds not only to GABA, type A but also to tubulin, gephrin, and ULK1.  Orthologues of GABARAP include Gate-16 (golgi-associated ATPase enhancer), LC3 (microtubule-associated protein light chain 3), and ATG8 (autophagy protein 8).  ATG8 is a ubiquitin-like protein that is conjugated to the membrane phospholipid, phosphatidylethanolamine as part of a ubiquitin-like conjugation system essential for autophagosome-formation.
Probab=30.53  E-value=45  Score=28.53  Aligned_cols=18  Identities=39%  Similarity=0.560  Sum_probs=16.0

Q ss_pred             CCCCHHHHHHHHHHHHHh
Q 018993          153 PVRSVEERKEEYDRARAR  170 (348)
Q Consensus       153 ~~kS~EEREeeY~rAReR  170 (348)
                      ...|+|||.+++++.|++
T Consensus         3 ~~~s~e~R~~e~~~ir~k   20 (112)
T cd01611           3 ERHPFEKRKAEVERIRAK   20 (112)
T ss_pred             cccCHHHHHHHHHHHHHH
Confidence            467999999999999986


No 24 
>PF06262 DUF1025:  Possibl zinc metallo-peptidase;  InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=24.44  E-value=39  Score=28.30  Aligned_cols=17  Identities=18%  Similarity=0.479  Sum_probs=12.1

Q ss_pred             HHHHHHHhhhhhcccee
Q 018993           67 LRLAAHRVSQHYGLVTM   83 (348)
Q Consensus        67 ~RLLvHRvA~yygL~h~   83 (348)
                      +.-++|.||+|||+...
T Consensus        74 ~~tlvhEiah~fG~~~e   90 (97)
T PF06262_consen   74 RDTLVHEIAHHFGISDE   90 (97)
T ss_dssp             HHHHHHHHHHHTT--HH
T ss_pred             HHHHHHHHHHHcCCCHH
Confidence            45679999999999653


No 25 
>PTZ00380 microtubule-associated protein (MAP); Provisional
Probab=22.80  E-value=71  Score=28.05  Aligned_cols=19  Identities=32%  Similarity=0.478  Sum_probs=16.4

Q ss_pred             CCCCCHHHHHHHHHHHHHh
Q 018993          152 SPVRSVEERKEEYDRARAR  170 (348)
Q Consensus       152 ~~~kS~EEREeeY~rAReR  170 (348)
                      +...|+|+|.+|+++.|++
T Consensus         5 K~~~s~e~R~~e~~~Ir~k   23 (121)
T PTZ00380          5 HSSNPVEARRAECARLQAK   23 (121)
T ss_pred             hhcCCHHHHHHHHHHHHHH
Confidence            3467999999999999985


No 26 
>PF09851 SHOCT:  Short C-terminal domain;  InterPro: IPR018649  This family of hypothetical prokaryotic proteins has no known function. 
Probab=22.23  E-value=63  Score=21.47  Aligned_cols=12  Identities=42%  Similarity=0.960  Sum_probs=9.9

Q ss_pred             HHHHHHHHHhhc
Q 018993          161 KEEYDRARARIF  172 (348)
Q Consensus       161 EeeY~rAReRIF  172 (348)
                      ++||+++|++|-
T Consensus        19 eeEy~~~k~~ll   30 (31)
T PF09851_consen   19 EEEYEQKKARLL   30 (31)
T ss_pred             HHHHHHHHHHHh
Confidence            578999999884


No 27 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=21.57  E-value=53  Score=29.25  Aligned_cols=22  Identities=18%  Similarity=0.286  Sum_probs=15.7

Q ss_pred             ChHHHHHHHHhhhhhccceeee
Q 018993           64 TSYLRLAAHRVSQHYGLVTMVQ   85 (348)
Q Consensus        64 nSY~RLLvHRvA~yygL~h~v~   85 (348)
                      .+..|.|+|-|..|+||.|...
T Consensus        67 ~~~g~TltHEvGH~LGL~HtF~   88 (154)
T PF05572_consen   67 YNFGKTLTHEVGHWLGLYHTFG   88 (154)
T ss_dssp             S-SSHHHHHHHHHHTT---TT-
T ss_pred             cccccchhhhhhhhhccccccc
Confidence            6678999999999999999874


No 28 
>KOG3248 consensus Transcription factor TCF-4 [Transcription]
Probab=20.77  E-value=1.5e+02  Score=30.70  Aligned_cols=65  Identities=23%  Similarity=0.475  Sum_probs=40.4

Q ss_pred             CCcccc-CCCCCCCCCCCCC-----CCCcCCCCCCCCCCCCCCCCCCCCCCCCcccccCC---CccccccCchh
Q 018993          267 LPFMQY-DTGFPQFSQIPRT-----QASLSFRPPSSPVMSPYCAVGPNQTSVEAAYMQWP---SAAMMYAHSYE  331 (348)
Q Consensus       267 ~p~~~y-~~~f~q~~~~~~~-----~~~~~~~~~~~~~m~p~~~~~~~~~~~~~~y~~~p---~~~m~y~h~~~  331 (348)
                      +|.+.| |.-|+--.-|.-.     +-+=+|++|..|+++||-+.-.|+.-...--|-||   .|+-.|-|+|-
T Consensus        54 ~pli~ys~ehF~p~~pps~~p~dis~k~g~~r~~~~pd~~p~y~ls~gavgqip~~l~wp~y~~pt~~~~~p~p  127 (421)
T KOG3248|consen   54 TPLITYSNEHFSPGSPPSPLPADISPKQGIPRPPHPPDLSPFYPLSPGAVGQIPHPLGWPVYPIPTFGFRHPYP  127 (421)
T ss_pred             CchhhhhhhhCCCCCCCCCCcccccccCCCCCCCCCccccccccCCccccccCCCccCCccccCCCCCCCCCCc
Confidence            566666 4456543222222     23447888888999999887766665555566675   55556666665


No 29 
>KOG3379 consensus Diadenosine polyphosphate hydrolase and related proteins of the histidine triad (HIT) family [Nucleotide transport and metabolism; General function prediction only]
Probab=20.10  E-value=1.2e+02  Score=27.53  Aligned_cols=22  Identities=36%  Similarity=0.386  Sum_probs=19.1

Q ss_pred             CCCCCCCHHHHHHHHHHHHHhh
Q 018993          150 KRSPVRSVEERKEEYDRARARI  171 (348)
Q Consensus       150 k~~~~kS~EEREeeY~rAReRI  171 (348)
                      .+++.+|+||+++|=+.-|+++
T Consensus       129 ~~r~~Rs~eEM~eEA~~lr~~~  150 (150)
T KOG3379|consen  129 EDRKPRSLEEMAEEAQRLREYF  150 (150)
T ss_pred             ccCCcchHHHHHHHHHHHHhhC
Confidence            5688999999999999888764


Done!