Query         015516
Match_columns 405
No_of_seqs    79 out of 81
Neff          2.9 
Searched_HMMs 46136
Date          Fri Mar 29 07:02:43 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/015516.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/015516hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF12937 F-box-like:  F-box-lik  98.9 1.8E-09 3.9E-14   77.0   4.2   44   94-137     2-45  (47)
  2 PF00646 F-box:  F-box domain;   98.7 9.8E-09 2.1E-13   72.5   2.2   47   91-137     1-47  (48)
  3 smart00256 FBOX A Receptor for  98.5 1.4E-07   3E-12   63.5   3.9   40   96-135     1-40  (41)
  4 KOG2997 F-box protein FBX9 [Ge  95.4   0.013 2.7E-07   59.8   3.2   52   92-143   106-162 (366)
  5 KOG2120 SCF ubiquitin ligase,   95.2   0.019   4E-07   58.9   3.9   50   87-136    92-141 (419)
  6 PF06881 Elongin_A:  RNA polyme  94.8   0.036 7.8E-07   46.7   3.9   60   93-154     4-66  (109)
  7 PF13013 F-box-like_2:  F-box-l  93.0    0.12 2.7E-06   44.8   4.0   44   91-136    20-63  (109)
  8 PLN03215 ascorbic acid mannose  92.8   0.098 2.1E-06   53.6   3.5   39   91-129     2-41  (373)
  9 KOG3926 F-box proteins [Amino   84.1     1.4 2.9E-05   44.8   4.3   50   89-138   198-248 (332)
 10 KOG0281 Beta-TrCP (transducin   84.1     1.3 2.8E-05   46.5   4.3   68   93-160    75-149 (499)
 11 KOG0274 Cdc4 and related F-box  82.9    0.64 1.4E-05   49.5   1.6   54   91-144   106-159 (537)
 12 KOG1708 Mitochondrial/chloropl  80.9       2 4.4E-05   41.9   4.1   82  268-386    47-129 (236)
 13 PF04970 LRAT:  Lecithin retino  74.0     1.8 3.8E-05   36.7   1.4   16  296-311     3-18  (125)
 14 smart00743 Agenet Tudor-like d  73.6     3.8 8.2E-05   30.6   3.0   23  299-325     2-24  (61)
 15 PF00018 SH3_1:  SH3 domain;  I  73.3     2.8 6.1E-05   30.0   2.1   24  299-326    15-38  (48)
 16 PF14250 AbrB-like:  AbrB-like   73.0     2.3   5E-05   35.2   1.7   21  292-312    45-65  (71)
 17 PF07653 SH3_2:  Variant SH3 do  70.7       3 6.5E-05   30.7   1.8   24  299-325    17-40  (55)
 18 KOG4408 Putative Mg2+ and Co2+  66.2     1.4 3.1E-05   45.6  -1.0   52   90-141     5-56  (386)
 19 PF00855 PWWP:  PWWP domain;  I  63.3      10 0.00022   29.5   3.6   52  301-367     2-53  (86)
 20 COG1064 AdhP Zn-dependent alco  59.7     5.8 0.00013   40.5   2.0   47  283-340    57-110 (339)
 21 cd05829 Sortase_E Sortase E (S  56.7     5.4 0.00012   35.2   1.1   23  295-317    68-90  (144)
 22 PF12969 DUF3857:  Domain of Un  51.4      13 0.00028   32.0   2.5   23  295-317    85-107 (177)
 23 PF07886 BA14K:  BA14K-like pro  49.9     6.1 0.00013   27.9   0.3   23  242-267     6-28  (31)
 24 PF05641 Agenet:  Agenet domain  44.7      34 0.00073   26.7   3.7   44  300-356     1-44  (68)
 25 PF09372 PRANC:  PRANC domain;   41.9      24 0.00052   28.9   2.6   27   90-116    69-95  (97)
 26 PF03489 SapB_2:  Saposin-like   41.3      27 0.00059   23.7   2.4   23   98-120    13-35  (35)
 27 cd05835 Dnmt3b_related The PWW  40.9      42 0.00091   27.5   3.9   40  301-350     2-41  (87)
 28 PF00970 FAD_binding_6:  Oxidor  40.4     9.4  0.0002   30.2   0.0   18  292-309    74-91  (99)
 29 TIGR03784 marine_sortase sorta  39.3      14 0.00029   34.3   0.9   22  295-316   104-125 (174)
 30 PF14604 SH3_9:  Variant SH3 do  39.3      25 0.00055   25.8   2.2   23  299-325    14-36  (49)
 31 smart00326 SH3 Src homology 3   39.1      34 0.00074   23.3   2.7   23  299-325    20-42  (58)
 32 smart00306 HintN Hint (Hedgeho  38.2      16 0.00035   28.3   1.0   25  285-309    68-99  (100)
 33 cd04459 Rho_CSD Rho_CSD: Rho p  37.7      35 0.00075   27.6   2.9   36  279-314    15-55  (68)
 34 PF07497 Rho_RNA_bind:  Rho ter  36.9      33 0.00072   28.6   2.7   38  278-315    16-58  (78)
 35 COG1018 Hmp Flavodoxin reducta  33.3      25 0.00055   34.3   1.7   28  292-320    79-107 (266)
 36 PF11270 DUF3070:  Protein of u  32.4      30 0.00065   24.9   1.5   24  282-305     2-26  (32)
 37 cd05162 PWWP The PWWP domain,   32.2      68  0.0015   25.6   3.8   29  300-333     1-29  (87)
 38 cd00174 SH3 Src homology 3 dom  30.7      57  0.0012   22.1   2.7   23  299-325    17-39  (54)
 39 TIGR03635 S17_bact 30S ribosom  27.5      48   0.001   27.0   2.1   13  297-309    46-58  (71)
 40 cd03470 Rieske_cytochrome_bc1   27.0      43 0.00092   29.4   1.9   19  297-315     6-24  (126)
 41 COG1545 Predicted nucleic-acid  26.6 1.3E+02  0.0028   27.0   4.9   96  219-322    35-132 (140)
 42 PF11604 CusF_Ec:  Copper bindi  24.0      59  0.0013   25.8   2.0   30  286-315    29-58  (70)
 43 PF04225 OapA:  Opacity-associa  23.5      41  0.0009   27.7   1.1   16  295-310    38-53  (85)
 44 cd04453 S1_RNase_E S1_RNase_E:  23.0      40 0.00086   27.6   0.9   15  298-312    55-69  (88)
 45 PF13550 Phage-tail_3:  Putativ  22.8      56  0.0012   27.7   1.8   14  298-311   138-151 (164)
 46 TIGR02988 YaaA_near_RecF S4 do  21.9      54  0.0012   24.6   1.4   10  299-308    49-58  (59)
 47 cd06166 Sortase_D_5 Sortase D   21.8      36 0.00078   29.1   0.4   18  294-311    59-76  (126)
 48 cd00081 Hint Hedgehog/Intein d  21.7      45 0.00098   27.4   1.0   25  285-309    68-99  (136)
 49 PRK06944 sulfur carrier protei  21.5      56  0.0012   24.5   1.4    9  300-308    50-58  (65)
 50 PF13621 Cupin_8:  Cupin-like d  21.4      49  0.0011   29.5   1.2   21  299-327   212-232 (251)
 51 PF03658 Ub-RnfH:  RnfH family   21.4      47   0.001   28.0   1.1   10  299-308    61-70  (84)
 52 PRK05863 sulfur carrier protei  20.9      58  0.0013   25.2   1.4    9  300-308    50-58  (65)
 53 KOG0162 Myosin class I heavy c  20.9 1.2E+02  0.0025   35.3   4.2   27  299-329  1069-1095(1106)
 54 PRK05886 yajC preprotein trans  20.6      69  0.0015   28.2   1.9   48  296-366    35-82  (109)
 55 PF06945 DUF1289:  Protein of u  20.5      60  0.0013   24.6   1.4   16  386-401    21-36  (51)

No 1  
>PF12937 F-box-like:  F-box-like; PDB: 1P22_A 2OVP_B 2OVR_B 2OVQ_B 1FS1_A 1FS2_C 1FQV_I 1LDK_E 2AST_B 2ASS_B.
Probab=98.89  E-value=1.8e-09  Score=77.03  Aligned_cols=44  Identities=27%  Similarity=0.506  Sum_probs=40.0

Q ss_pred             CCCcHHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHHhh
Q 015516           94 LDLPELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEKHM  137 (405)
Q Consensus        94 LDLPeelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWeklV  137 (405)
                      .+||+|++..|+++|+|.||++++.||+.+|+.+.++.||.++.
T Consensus         2 ~~LP~Eil~~If~~L~~~dl~~~~~vcr~w~~~~~~~~lW~~~~   45 (47)
T PF12937_consen    2 SSLPDEILLEIFSYLDPRDLLRLSLVCRRWRRIANDNSLWRRLC   45 (47)
T ss_dssp             CCS-HHHHHHHHTTS-HHHHHHHTTSSHHHHHHHTCCCHHHHHC
T ss_pred             hHhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHCChhhhhhhc
Confidence            58999999999999999999999999999999999999999864


No 2  
>PF00646 F-box:  F-box domain;  InterPro: IPR001810 The F-box domain was first described as a sequence motif found in cyclin-F that interacts with the protein SKP1 [, ]. This relatively conserved structural motif is present in numerous proteins and serves as a link between a target protein and a ubiquitin-conjugating enzyme. The SCF complex (e.g., Skp1-Cullin-F-box) plays a similar role as an E3 ligase in the ubiquitin protein degradation pathway [, ]. Different F-box proteins as a part of SCF complex recruit particular substrates for ubiquitination through specific protein-protein interaction domains.  Many mammalian F-box domains contain leucine-rich or WD-40 repeats (IPR001680 from INTERPRO). However, several F-box proteins either have other previously described domains such as Sec7 domain found in FBS protein or do not contain defined protein-protein interaction domains or motifs.; GO: 0005515 protein binding; PDB: 2E32_A 2E31_A 3V7D_B 1NEX_B 3MKS_D 3L2O_B.
Probab=98.67  E-value=9.8e-09  Score=72.47  Aligned_cols=47  Identities=32%  Similarity=0.526  Sum_probs=40.9

Q ss_pred             CCCCCCcHHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHHhh
Q 015516           91 MSVLDLPELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEKHM  137 (405)
Q Consensus        91 ~sLLDLPeelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWeklV  137 (405)
                      +++.+||++++..|+++|++.|+++++.||+.+|+.+.++.+|.+..
T Consensus         1 ~~~~~LP~~il~~Il~~l~~~~~~~l~~vsk~~~~~~~~~~~~~~~~   47 (48)
T PF00646_consen    1 FPLSDLPDEILQEILSYLDPKDLLRLSLVSKRWRSLVDSPRLWKKII   47 (48)
T ss_dssp             -HHHHS-HHHHHHHHHTS-HHHHHHHCTT-HHHHHHHTTHHHHHHHH
T ss_pred             CCHHHCCHHHHHHHHHHCcHHHHHHHHHHhhHHHHHHcCCCccHHHh
Confidence            36789999999999999999999999999999999999999998864


No 3  
>smart00256 FBOX A Receptor for Ubiquitination Targets.
Probab=98.49  E-value=1.4e-07  Score=63.54  Aligned_cols=40  Identities=40%  Similarity=0.620  Sum_probs=38.7

Q ss_pred             CcHHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHH
Q 015516           96 LPELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEK  135 (405)
Q Consensus        96 LPeelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWek  135 (405)
                      ||++++..|+++|++.|++++++||+.+|..+.++.+|.+
T Consensus         1 lP~~ll~~I~~~l~~~d~~~~~~vc~~~~~~~~~~~~~~~   40 (41)
T smart00256        1 LPDEILEEILSKLPPKDLLRLRKVSRRWRSLIDSHDFWFK   40 (41)
T ss_pred             CCHHHHHHHHHcCCHHHHHHHHHHHHHHHHHhcChhhhhc
Confidence            7999999999999999999999999999999999999975


No 4  
>KOG2997 consensus F-box protein FBX9 [General function prediction only]
Probab=95.36  E-value=0.013  Score=59.77  Aligned_cols=52  Identities=25%  Similarity=0.406  Sum_probs=44.4

Q ss_pred             CCCCCcHHHHHHHHccCC-----hhhHHHHHhhcHHHHHhccChhHHHHhhhhhhcc
Q 015516           92 SVLDLPELVLECILEKLP-----PSGLCNMAAVCSSLRDRCMSDHLWEKHMKQKWGR  143 (405)
Q Consensus        92 sLLDLPeelLd~ILs~LS-----P~DLcrmAaVCrsLRerc~SD~LWeklV~~kwp~  143 (405)
                      .+..||+|+|=.|+...=     -.+|-++|+||+.|+..|+.|.||...--.-|++
T Consensus       106 ~~~~LPdEvLm~I~~~vv~~~~d~rsL~~~s~vCr~F~~~~R~~~lwR~aC~KvW~~  162 (366)
T KOG2997|consen  106 SISVLPDEVLMRIFRWVVSSLLDLRSLEQLSLVCRGFYKCARDPELWRLACLKVWQR  162 (366)
T ss_pred             hhhhCCHHHHHHHHHHHHhhhcchhhHHHhHhhHHHHHHHHcChHHHHHHHHHHHHH
Confidence            368999999999887654     5999999999999999999999999876655553


No 5  
>KOG2120 consensus SCF ubiquitin ligase, Skp2 component [Posttranslational modification, protein turnover, chaperones]
Probab=95.21  E-value=0.019  Score=58.87  Aligned_cols=50  Identities=22%  Similarity=0.361  Sum_probs=45.5

Q ss_pred             CCCCCCCCCCcHHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHHh
Q 015516           87 GGAEMSVLDLPELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEKH  136 (405)
Q Consensus        87 ~~~~~sLLDLPeelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWekl  136 (405)
                      --.+.+.-.||+|++.+|.+.|.-.||.+|+.||+.+..-+....+|..+
T Consensus        92 ~npgv~~~slpDEill~IFs~L~kk~LL~~~~VC~Rfyr~~~de~lW~~l  141 (419)
T KOG2120|consen   92 NNPGVSWDSLPDEILLGIFSCLCKKELLKVSGVCKRFYRLASDESLWQTL  141 (419)
T ss_pred             cCCCCCcccCCHHHHHHHHHhccHHHHHHHHHHHHHHhhccccccceeee
Confidence            34556788999999999999999999999999999999999999999764


No 6  
>PF06881 Elongin_A:  RNA polymerase II transcription factor SIII (Elongin) subunit A;  InterPro: IPR010684 This family represents a conserved region within RNA polymerase II transcription factor SIII (Elongin) subunit A. In mammals, the Elongin complex activates elongation by RNA polymerase II by suppressing transient pausing of the polymerase at many sites within transcription units. Elongin is a heterotrimer composed of A, B, and C subunits of 110, 18, and 15 kilodaltons, respectively. Subunit A has been shown to function as the transcriptionally active component of Elongin [].; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus, 0016021 integral to membrane
Probab=94.80  E-value=0.036  Score=46.73  Aligned_cols=60  Identities=30%  Similarity=0.496  Sum_probs=50.9

Q ss_pred             CCCCcHHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHHhhhhhhccc---cCccccchhh
Q 015516           93 VLDLPELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEKHMKQKWGRI---VGPAAYREWQ  154 (405)
Q Consensus        93 LLDLPeelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWeklV~~kwp~v---Lgpaa~~eWq  154 (405)
                      +.++|-.+|.-||...+|..|.++-.-|..|...  +|.||.+|+++..|.-   -.|....+|.
T Consensus         4 vG~~py~ll~piL~~~~~~QL~~iE~~np~l~~~--tdeLW~~~i~rdFp~~~~~~~~~~~~~Wr   66 (109)
T PF06881_consen    4 VGDVPYHLLRPILEKCSPEQLRRIEDNNPHLIED--TDELWKKLIKRDFPEESKRQKPKEPESWR   66 (109)
T ss_pred             cCCCCHHHHHHHHccCCHHHHHHHHHhCCCcchh--hHHHHHHHHHhHCcChhhcccccccchHH
Confidence            6789999999999999999999999999888776  7999999999999952   2444445773


No 7  
>PF13013 F-box-like_2:  F-box-like domain
Probab=93.02  E-value=0.12  Score=44.77  Aligned_cols=44  Identities=27%  Similarity=0.493  Sum_probs=37.7

Q ss_pred             CCCCCCcHHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHHh
Q 015516           91 MSVLDLPELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEKH  136 (405)
Q Consensus        91 ~sLLDLPeelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWekl  136 (405)
                      ..++|||+|++..|+.+-.+.++..+..+|+.+|.+.  +++|..+
T Consensus        20 ltl~DLP~ELl~~I~~~C~~~~l~~l~~~~~~~r~~r--~~~~~~L   63 (109)
T PF13013_consen   20 LTLLDLPWELLQLIFDYCNDPILLALSRTCRAYRSWR--DHIWYLL   63 (109)
T ss_pred             cchhhChHHHHHHHHhhcCcHHHHHHHHHHHHHHHHH--HHHHHHh
Confidence            4699999999999999999999999999999998765  3455544


No 8  
>PLN03215 ascorbic acid mannose pathway regulator 1; Provisional
Probab=92.76  E-value=0.098  Score=53.60  Aligned_cols=39  Identities=26%  Similarity=0.366  Sum_probs=35.6

Q ss_pred             CCCCCCcHHHHHHHHccC-ChhhHHHHHhhcHHHHHhccC
Q 015516           91 MSVLDLPELVLECILEKL-PPSGLCNMAAVCSSLRDRCMS  129 (405)
Q Consensus        91 ~sLLDLPeelLd~ILs~L-SP~DLcrmAaVCrsLRerc~S  129 (405)
                      ++.-+||+++|+.|..+| +..|+.++++||++-|.+...
T Consensus         2 ~~Ws~Lp~dll~~i~~~l~~~~d~~~~~~vC~sWr~a~~~   41 (373)
T PLN03215          2 ADWSTLPEELLHMIAGRLFSNVELKRFRSICRSWRSSVSG   41 (373)
T ss_pred             CChhhCCHHHHHHHHhhCCcHHHHHHHHhhhhhHHHhccc
Confidence            356799999999999999 889999999999999998775


No 9  
>KOG3926 consensus F-box proteins [Amino acid transport and metabolism]
Probab=84.11  E-value=1.4  Score=44.84  Aligned_cols=50  Identities=24%  Similarity=0.357  Sum_probs=45.6

Q ss_pred             CCCCCCCCcHHHHHHHHccCCh-hhHHHHHhhcHHHHHhccChhHHHHhhh
Q 015516           89 AEMSVLDLPELVLECILEKLPP-SGLCNMAAVCSSLRDRCMSDHLWEKHMK  138 (405)
Q Consensus        89 ~~~sLLDLPeelLd~ILs~LSP-~DLcrmAaVCrsLRerc~SD~LWeklV~  138 (405)
                      .+.-|.|||++++-.||-+||- .||..+|+|=.+|.+-...+.+|.++-+
T Consensus       198 ~~ltl~dLP~e~vl~Il~rlsDh~dL~s~aqa~etl~~l~~e~~iWkkLcq  248 (332)
T KOG3926|consen  198 AGLTLHDLPLECVLNILLRLSDHRDLESLAQAWETLAKLSEERRIWKKLCQ  248 (332)
T ss_pred             CCCCcccchHHHHHHHHHHccCcchHHHHHHhhHHHHHHHHHHHHHHHHHH
Confidence            4577999999999999999984 8999999999999999999999998754


No 10 
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=84.08  E-value=1.3  Score=46.46  Aligned_cols=68  Identities=25%  Similarity=0.447  Sum_probs=58.2

Q ss_pred             CCCCc----HHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHHhhhhhhccc-c--Cccccchhhhhhhcc
Q 015516           93 VLDLP----ELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEKHMKQKWGRI-V--GPAAYREWQWHLASR  160 (405)
Q Consensus        93 LLDLP----eelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWeklV~~kwp~v-L--gpaa~~eWq~h~~~~  160 (405)
                      ...||    +++-+.||+||...|||+--.||+.-+......-+|.+++.+++... +  |-+-++.|..++-+.
T Consensus        75 i~~lP~~gl~hi~e~ilsyld~~sLc~celv~k~W~r~l~dg~~WKkLie~~vr~dslWrgl~e~rqw~~~lf~~  149 (499)
T KOG0281|consen   75 ITALPEQGLDHIAENILSYLDALSLCACELVCKEWKRVLSDGMLWKKLIERMVRTDSLWRGLSERRQWDQYLFKN  149 (499)
T ss_pred             HHhcccccHHHHHHHHHHhcchhhhhHHHHHHHHHHHHhccchHHHHHHHHhcchHHHHhhhhhccCcchhhccC
Confidence            34688    99999999999999999999999999999999999999999998733 2  777777887766543


No 11 
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=82.91  E-value=0.64  Score=49.46  Aligned_cols=54  Identities=22%  Similarity=0.283  Sum_probs=49.0

Q ss_pred             CCCCCCcHHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHHhhhhhhccc
Q 015516           91 MSVLDLPELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEKHMKQKWGRI  144 (405)
Q Consensus        91 ~sLLDLPeelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWeklV~~kwp~v  144 (405)
                      --+..||.|+.-.||.+|++.+|+++++||+..+..+..|..|-+...+.-+..
T Consensus       106 dfi~~lp~el~~~il~~Ld~~~l~~~~~v~~~w~~~~~~~~~~~~~~~~~~~~~  159 (537)
T KOG0274|consen  106 DFLSLLPSELSLHILSFLDGRDLLAVRQVCRNWNKLLDDDKVWWRMCRELIGRL  159 (537)
T ss_pred             chhhcccchhcccccccCCHHHhhhhhhhcchhhhhhhccchhhhhhhhhcccC
Confidence            358899999999999999999999999999999999999999998877766653


No 12 
>KOG1708 consensus Mitochondrial/chloroplast ribosomal protein L24 [Translation, ribosomal structure and biogenesis]
Probab=80.94  E-value=2  Score=41.88  Aligned_cols=82  Identities=32%  Similarity=0.487  Sum_probs=56.3

Q ss_pred             eeeecCCCccccccCCCCCC-CcccccccccCCCCCCCeeEEEeeccCCCCccceeeeeccccccCCCCCceecccCCeE
Q 015516          268 IAIECGVPWERLRAAPVDTS-PHDLHISDCLNDLHPGDHIEIQWRRNKEFPYGWWYGVVGHLESCDGNENFCHCHNSETL  346 (405)
Q Consensus       268 ~~ie~~i~WdRlRapPvdT~-~~dlh~Sdcl~dLkPGDHiEIQWRrnkefPYgWWYgvVGHlesC~gn~~~C~Ch~sdtv  346 (405)
                      .+...+-+|+|.|.+|++-. -||.|       +.+||.|||-               ||---.=.|-.-.-.-|+|++|
T Consensus        47 ~~~qk~~~w~rrr~~~~e~i~d~dw~-------ff~GDtVeVl---------------vGkDkGkqG~Vtqv~r~~s~Vv  104 (236)
T KOG1708|consen   47 LVLQKNKPWERRRCVPVEPIIDEDWH-------FFFGDTVEVL---------------VGKDKGKQGEVTQVIRHRSWVV  104 (236)
T ss_pred             hHHhhcCccccccCCCCCCcccccee-------EecCCEEEEE---------------ecccCCccceEEEEeecCceEE
Confidence            34456789999999999877 47887       7899999994               4443333344334444555555


Q ss_pred             EEEEeecCCCCcceeeecccccccccccccCccccceecc
Q 015516          347 ILEFNQYTPGSRWRKTTINRKDHREEGNEADGFYGGIRKV  386 (405)
Q Consensus       347 vlEF~qY~~gSrWR~t~inRk~hrEeG~e~dGfyGGIRKL  386 (405)
                      |=..              |=| .+-.|.|.+||=|-|+|.
T Consensus       105 V~gl--------------n~k-~r~~gsekeg~pgtivk~  129 (236)
T KOG1708|consen  105 VKGL--------------NTK-YRHMGSEKEGEPGTIVKS  129 (236)
T ss_pred             Eccc--------------chh-hhhhcccccCCCceEEee
Confidence            5322              222 267899999999999986


No 13 
>PF04970 LRAT:  Lecithin retinol acyltransferase;  InterPro: IPR007053 This entry represents a conserved sequence region found in proteins from viruses, bacteria and eukaryotes. It contains a well-conserved NCEHF motif, though its function in these proteins is unknown.; PDB: 2KYT_A 4DOT_A 4FA0_A.
Probab=73.98  E-value=1.8  Score=36.67  Aligned_cols=16  Identities=44%  Similarity=0.680  Sum_probs=8.6

Q ss_pred             ccCCCCCCCeeEEEee
Q 015516          296 CLNDLHPGDHIEIQWR  311 (405)
Q Consensus       296 cl~dLkPGDHiEIQWR  311 (405)
                      +...|+||||||++-.
T Consensus         3 ~~~~~~~GD~I~~~r~   18 (125)
T PF04970_consen    3 DKKRLKPGDHIEVPRG   18 (125)
T ss_dssp             ---S--TT-EEEEEET
T ss_pred             cccCCCCCCEEEEecC
Confidence            4578999999999754


No 14 
>smart00743 Agenet Tudor-like domain present in plant sequences. Domain in plant sequences with possible chromatin-associated functions.
Probab=73.64  E-value=3.8  Score=30.60  Aligned_cols=23  Identities=35%  Similarity=0.807  Sum_probs=19.9

Q ss_pred             CCCCCCeeEEEeeccCCCCccceeeee
Q 015516          299 DLHPGDHIEIQWRRNKEFPYGWWYGVV  325 (405)
Q Consensus       299 dLkPGDHiEIQWRrnkefPYgWWYgvV  325 (405)
                      +++.||.||++|+-    --+||-|+|
T Consensus         2 ~~~~G~~Ve~~~~~----~~~W~~a~V   24 (61)
T smart00743        2 DFKKGDRVEVFSKE----EDSWWEAVV   24 (61)
T ss_pred             CcCCCCEEEEEECC----CCEEEEEEE
Confidence            57999999999975    468999977


No 15 
>PF00018 SH3_1:  SH3 domain;  InterPro: IPR001452 SH3 (src Homology-3) domains are small protein modules containing approximately 50 amino acid residues [, ]. They are found in a great variety of intracellular or membrane-associated proteins [, , ] for example, in a variety of proteins with enzymatic activity, in adaptor proteins that lack catalytic sequences and in cytoskeletal proteins, such as fodrin and yeast actin binding protein ABP-1. The SH3 domain has a characteristic fold which consists of five or six beta-strands arranged as two tightly packed anti-parallel beta sheets. The linker regions may contain short helices []. The surface of the SH3-domain bears a flat, hydrophobic ligand-binding pocket which consists of three shallow grooves defined by conservative aromatic residues in which the ligand adopts an extended left-handed helical arrangement. The ligand binds with low affinity but this may be enhanced by multiple interactions. The region bound by the SH3 domain is in all cases proline-rich and contains PXXP as a core-conserved binding motif. The function of the SH3 domain is not well understood but they may mediate many diverse processes such as increasing local concentration of proteins, altering their subcellular location and mediating the assembly of large multiprotein complexes []. The crystal structure of the SH3 domain of the cytoskeletal protein spectrin, and the solution structures of SH3 domains of phospholipase C (PLC-y) and phosphatidylinositol 3-kinase p85 alpha-subunit, have been determined [, , ]. In spite of relatively limited sequence similarity, their overall structures are similar. The domains belong to the alpha+beta structural class, with 5 to 8 beta-strands forming 2 tightly-packed, anti-parallel beta-sheets arranged in a barrel-like structure, and intervening loops sometimes forming helices. Conserved aliphatic and aromatic residues form a hydrophobic core (A11, L23, A29, V34, W42, L52 and V59 in PLC-y []) and a hydrophobic pocket on the molecular surface (L12, F13, W53 and P55 in PLC-y). The conserved core is believed to stabilise the fold, while the pocket is thought to serve as a binding site for target proteins. Conserved carboxylic amino acids located in the loops, on the periphery of the pocket (D14 and E22), may be involved in protein-protein interactions via proline-rich regions. The N- and C-termini are packed in close proximity, indicating that they are independent structural modules.; GO: 0005515 protein binding; PDB: 1UHF_A 1W1F_A 1WA7_A 1SEM_A 1KFZ_A 2SEM_B 1K76_A 3SEM_B 1X2Q_A 2J06_B ....
Probab=73.28  E-value=2.8  Score=30.03  Aligned_cols=24  Identities=29%  Similarity=0.642  Sum_probs=18.9

Q ss_pred             CCCCCCeeEEEeeccCCCCccceeeeec
Q 015516          299 DLHPGDHIEIQWRRNKEFPYGWWYGVVG  326 (405)
Q Consensus       299 dLkPGDHiEIQWRrnkefPYgWWYgvVG  326 (405)
                      .++.||.|+|-=+.+.    |||+|-..
T Consensus        15 s~~~Gd~i~v~~~~~~----~Ww~~~~~   38 (48)
T PF00018_consen   15 SFKKGDIIEVLEKSDD----GWWKVRNE   38 (48)
T ss_dssp             EB-TTEEEEEEEESSS----SEEEEEET
T ss_pred             eEECCCEEEEEEecCC----CEEEEEEC
Confidence            3799999999876666    99999763


No 16 
>PF14250 AbrB-like:  AbrB-like transcriptional regulator
Probab=72.98  E-value=2.3  Score=35.21  Aligned_cols=21  Identities=33%  Similarity=0.439  Sum_probs=17.0

Q ss_pred             ccccccCCCCCCCeeEEEeec
Q 015516          292 HISDCLNDLHPGDHIEIQWRR  312 (405)
Q Consensus       292 h~Sdcl~dLkPGDHiEIQWRr  312 (405)
                      |+..=.-+|+|||.+||+=.|
T Consensus        45 ~AYT~~m~L~PGdEFeI~Lgr   65 (71)
T PF14250_consen   45 SAYTKQMGLKPGDEFEIKLGR   65 (71)
T ss_pred             HHHHHHhCCCCCCEEEEEeCc
Confidence            456667799999999999655


No 17 
>PF07653 SH3_2:  Variant SH3 domain;  InterPro: IPR011511 SH3 (src Homology-3) domains are small protein modules containing approximately 50 amino acid residues [, ]. They are found in a great variety of intracellular or membrane-associated proteins [, , ] for example, in a variety of proteins with enzymatic activity, in adaptor proteins that lack catalytic sequences and in cytoskeletal proteins, such as fodrin and yeast actin binding protein ABP-1. The SH3 domain has a characteristic fold which consists of five or six beta-strands arranged as two tightly packed anti-parallel beta sheets. The linker regions may contain short helices []. The surface of the SH3-domain bears a flat, hydrophobic ligand-binding pocket which consists of three shallow grooves defined by conservative aromatic residues in which the ligand adopts an extended left-handed helical arrangement. The ligand binds with low affinity but this may be enhanced by multiple interactions. The region bound by the SH3 domain is in all cases proline-rich and contains PXXP as a core-conserved binding motif. The function of the SH3 domain is not well understood but they may mediate many diverse processes such as increasing local concentration of proteins, altering their subcellular location and mediating the assembly of large multiprotein complexes []. This entry represents a variant of the SH3 domain.; PDB: 1I1J_B 1K0X_A 1HJD_A 2KEA_A 1KJW_A 1JXM_A 1JXO_B 2EBP_A 2DL3_A 2EYX_A ....
Probab=70.72  E-value=3  Score=30.72  Aligned_cols=24  Identities=38%  Similarity=0.819  Sum_probs=17.3

Q ss_pred             CCCCCCeeEEEeeccCCCCccceeeee
Q 015516          299 DLHPGDHIEIQWRRNKEFPYGWWYGVV  325 (405)
Q Consensus       299 dLkPGDHiEIQWRrnkefPYgWWYgvV  325 (405)
                      .|+.||.|+|.  +.++-+ |||+|..
T Consensus        17 s~~~Gd~i~v~--~~~~~~-~ww~~~~   40 (55)
T PF07653_consen   17 SFKKGDVIEVL--GEKDDD-GWWLGEN   40 (55)
T ss_dssp             EB-TTEEEEEE--EEECST-SEEEEEE
T ss_pred             EEecCCEEEEE--EeecCC-CEEEEEE
Confidence            47899999998  233333 9999977


No 18 
>KOG4408 consensus Putative Mg2+ and Co2+ transporter CorD [Inorganic ion transport and metabolism]
Probab=66.17  E-value=1.4  Score=45.65  Aligned_cols=52  Identities=25%  Similarity=0.241  Sum_probs=46.9

Q ss_pred             CCCCCCCcHHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHHhhhhhh
Q 015516           90 EMSVLDLPELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEKHMKQKW  141 (405)
Q Consensus        90 ~~sLLDLPeelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWeklV~~kw  141 (405)
                      .+.+-+||..+++.||+++-+.+|.+.|.|.+.|.+-++-+.+|.++-+..|
T Consensus         5 ~~~le~~~~~~l~~vls~~~~~~~~~~a~vs~rLk~~~s~~~lw~r~c~k~l   56 (386)
T KOG4408|consen    5 PLGLEWLPRDPLHLVLSFLLYRDLINCAYVSRRLKELGSHLPLWNRPCKKYL   56 (386)
T ss_pred             ccchhhcccccceeeecccchhhhhcceeechHHhhhhhccccccccccccc
Confidence            3678889999999999999999999999999999999999999998874433


No 19 
>PF00855 PWWP:  PWWP domain;  InterPro: IPR000313 Upon characterisation of WHSC1, a gene mapping to the Wolf-Hirschhornsyndrome critical region and at its C terminus similar to the Drosophila melanogaster ASH1/trithorax group proteins, a novel protein domain designated PWWP domain was identified []. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. It is present in proteins of nuclear origin and plays a role in cell growth and differentiation. Due to its position, the composition of amino acids close to the PWWP motif and the pattern of other domains present it has been suggested that the domain is involved in protein-protein interactions [].; PDB: 3LYI_B 2L89_A 2NLU_A 1RI0_A 1KHC_A 3QKJ_C 2DAQ_A 1N27_A 3PFS_B 3QJ6_A ....
Probab=63.32  E-value=10  Score=29.54  Aligned_cols=52  Identities=21%  Similarity=0.551  Sum_probs=34.6

Q ss_pred             CCCCeeEEEeeccCCCCccceeeeeccccccCCCCCceecccCCeEEEEEeecCCCCcceeeecccc
Q 015516          301 HPGDHIEIQWRRNKEFPYGWWYGVVGHLESCDGNENFCHCHNSETLILEFNQYTPGSRWRKTTINRK  367 (405)
Q Consensus       301 kPGDHiEIQWRrnkefPYgWWYgvVGHlesC~gn~~~C~Ch~sdtvvlEF~qY~~gSrWR~t~inRk  367 (405)
                      +|||.|   |=+-+.||  ||=|+|=+.+.+.++     -...+.+.|.|    .|+. ..+-|+.+
T Consensus         2 ~~GdlV---WaK~~g~p--wWPa~V~~~~~~~~~-----~~~~~~~~V~F----fg~~-~~~wv~~~   53 (86)
T PF00855_consen    2 RPGDLV---WAKLKGYP--WWPARVCDPDEKSKK-----KRKDGHVLVRF----FGDN-DYAWVKPS   53 (86)
T ss_dssp             STTEEE---EEEETTSE--EEEEEEEECCHCTSC-----SSSSTEEEEEE----TTTT-EEEEEEGG
T ss_pred             CCCCEE---EEEeCCCC--CCceEEeeccccccc-----CCCCCEEEEEe----cCCC-CEEEECHH
Confidence            678766   66778887  999999888766553     34566777777    2344 44445443


No 20 
>COG1064 AdhP Zn-dependent alcohol dehydrogenases [General function prediction only]
Probab=59.69  E-value=5.8  Score=40.55  Aligned_cols=47  Identities=30%  Similarity=0.543  Sum_probs=36.3

Q ss_pred             CCCCCCcccc------cccccCCCCCCCeeEEEeeccCCCCccceeeeeccccccC-CCCCceec
Q 015516          283 PVDTSPHDLH------ISDCLNDLHPGDHIEIQWRRNKEFPYGWWYGVVGHLESCD-GNENFCHC  340 (405)
Q Consensus       283 PvdT~~~dlh------~Sdcl~dLkPGDHiEIQWRrnkefPYgWWYgvVGHlesC~-gn~~~C~C  340 (405)
                      ++-+|-|..=      -++. ..+|+||.+=|          ||++.-.||=|.|. |+||+|.=
T Consensus        57 ~P~ipGHEivG~V~~vG~~V-~~~k~GDrVgV----------~~~~~~Cg~C~~C~~G~E~~C~~  110 (339)
T COG1064          57 LPLIPGHEIVGTVVEVGEGV-TGLKVGDRVGV----------GWLVISCGECEYCRSGNENLCPN  110 (339)
T ss_pred             CCccCCcceEEEEEEecCCC-ccCCCCCEEEe----------cCccCCCCCCccccCcccccCCC
Confidence            4556666654      1333 37999999998          78999999999997 89999764


No 21 
>cd05829 Sortase_E Sortase E (SrtE) is a membrane transpeptidase found in gram-positive bacteria that cleaves surface proteins at a cell sorting motif and catalyzes a transpeptidation reaction in which the surface protein substrate is covalently linked to peptidoglycan for display on the bacterial surface. Sortases are grouped into different classes and subfamilies based on sequence, membrane topology, genomic positioning, and cleavage site preference. The function of Sortase E is unknown. In two different sortase families, the N-terminus either functions as both a signal peptide for secretion and a stop-transfer signal for membrane anchoring, or it contains a signal peptide only and the C-terminus serves as a membrane anchor. Most gram-positive bacteria contain more than one sortase and it is thought that the different sortases anchor different surface protein classes. The sortase domain is a modified beta-barrel flanked by two (SrtA) or three (SrtB) short alpha-helices.
Probab=56.74  E-value=5.4  Score=35.16  Aligned_cols=23  Identities=26%  Similarity=0.367  Sum_probs=17.5

Q ss_pred             cccCCCCCCCeeEEEeeccCCCC
Q 015516          295 DCLNDLHPGDHIEIQWRRNKEFP  317 (405)
Q Consensus       295 dcl~dLkPGDHiEIQWRrnkefP  317 (405)
                      .-|++|++||.|.|+....+.+=
T Consensus        68 ~~L~~l~~GD~I~v~~~~g~~~~   90 (144)
T cd05829          68 FRLGDLRKGDKVEVTRADGQTAT   90 (144)
T ss_pred             cchhcCCCCCEEEEEECCCCEEE
Confidence            34679999999999996554443


No 22 
>PF12969 DUF3857:  Domain of Unknown Function with PDB structure (DUF3857);  InterPro: IPR024618 This domain is based on the first domain of the PDB structure 3KD4 (residues 1-228). It is structurally similar to domains in other hydrolases, eg. M1 family aminopeptidase, despite lack of any significant sequence similarity. The domain is N-terminal to a transglutaminase domain, which is found in many proteins known to have transglutaminase activity. The function of this domain is unknown. ; PDB: 3KD4_A.
Probab=51.41  E-value=13  Score=31.99  Aligned_cols=23  Identities=26%  Similarity=0.388  Sum_probs=14.0

Q ss_pred             cccCCCCCCCeeEEEeeccCCCC
Q 015516          295 DCLNDLHPGDHIEIQWRRNKEFP  317 (405)
Q Consensus       295 dcl~dLkPGDHiEIQWRrnkefP  317 (405)
                      --+.+|+|||-||++++....-|
T Consensus        85 ~~~p~v~~GdiIe~~y~~~~~~p  107 (177)
T PF12969_consen   85 FAFPDVRVGDIIEYSYTIKSKNP  107 (177)
T ss_dssp             EE--S--TT-EEEEEEEEEE-TT
T ss_pred             EEcCCCCCCcEEEEEEEEEecCC
Confidence            56889999999999999865544


No 23 
>PF07886 BA14K:  BA14K-like protein;  InterPro: IPR012413 The sequences found in this family are similar to the BA14K proteins expressed by Brucella abortus (Q44701 from SWISSPROT) and by Brucella suis (Q8FVU0 from SWISSPROT). BA14K was found to be strongly immunoreactive; it induces both humoral and cellular responses in hosts throughout the infective process []. 
Probab=49.95  E-value=6.1  Score=27.91  Aligned_cols=23  Identities=35%  Similarity=0.562  Sum_probs=18.3

Q ss_pred             ecceeeeeecCCcceeecCCCCCCCc
Q 015516          242 CYDAELSYDQSTDTFQARYPPHGRRP  267 (405)
Q Consensus       242 CYDA~lsYd~~TDTF~ARYpphGrr~  267 (405)
                      |.+--=|||++|+||++.   .|+|.
T Consensus         6 C~~rYRSy~p~~~Ty~~~---~G~r~   28 (31)
T PF07886_consen    6 CARRYRSYDPRDNTYQPY---DGPRR   28 (31)
T ss_pred             HHHHhcCCCCCCCcEeCC---CCccc
Confidence            566667999999999987   67664


No 24 
>PF05641 Agenet:  Agenet domain;  InterPro: IPR008395 This domain is related to the TUDOR domain IPR008191 from INTERPRO []. The function of the agenet domain is unknown. This signature matches one of the two Agenet domains in the FMR proteins [].; GO: 0003723 RNA binding; PDB: 2BKD_N 3O8V_A 3KUF_A 3H8Z_A.
Probab=44.75  E-value=34  Score=26.68  Aligned_cols=44  Identities=18%  Similarity=0.517  Sum_probs=27.5

Q ss_pred             CCCCCeeEEEeeccCCCCccceeeeeccccccCCCCCceecccCCeEEEEEeecCCC
Q 015516          300 LHPGDHIEIQWRRNKEFPYGWWYGVVGHLESCDGNENFCHCHNSETLILEFNQYTPG  356 (405)
Q Consensus       300 LkPGDHiEIQWRrnkefPYgWWYgvVGHlesC~gn~~~C~Ch~sdtvvlEF~qY~~g  356 (405)
                      .++||.|||- .....|.=.|+-|+|   .+..++.         ++.+||.-+...
T Consensus         1 F~~G~~VEV~-s~e~g~~gaWf~a~V---~~~~~~~---------~~~V~Y~~~~~~   44 (68)
T PF05641_consen    1 FKKGDEVEVS-SDEDGFRGAWFPATV---LKENGDD---------KYLVEYDDLPDE   44 (68)
T ss_dssp             --TT-EEEEE-E-SBTT--EEEEEEE---EEEETT----------EEEEEETT-SS-
T ss_pred             CCCCCEEEEE-EcCCCCCcEEEEEEE---EEeCCCc---------EEEEEECCcccc
Confidence            4799999996 455677777888999   6776664         788888766643


No 25 
>PF09372 PRANC:  PRANC domain;  InterPro: IPR018272 This presumed domain is found at the C terminus of a variety of Pox virus proteins. The PRANC (Pox proteins Repeats of ANkyrin, C-terminal) domain is also found on its own in some proteins []. The function of this domain is unknown, but it appears to be related to the F-box domain and may play a similar role. 
Probab=41.90  E-value=24  Score=28.90  Aligned_cols=27  Identities=26%  Similarity=0.089  Sum_probs=24.8

Q ss_pred             CCCCCCCcHHHHHHHHccCChhhHHHH
Q 015516           90 EMSVLDLPELVLECILEKLPPSGLCNM  116 (405)
Q Consensus        90 ~~sLLDLPeelLd~ILs~LSP~DLcrm  116 (405)
                      ...+-.||.|+-..||++|+-.||..+
T Consensus        69 ~~~w~~LP~EIk~~Il~~L~~~dL~~l   95 (97)
T PF09372_consen   69 NNYWNILPIEIKYKILEYLSNKDLKKL   95 (97)
T ss_pred             CCchhhCCHHHHHHHHHcCCHHHHHHH
Confidence            467999999999999999999999876


No 26 
>PF03489 SapB_2:  Saposin-like type B, region 2;  InterPro: IPR008138 Saposins are small lysosomal proteins that serve as activators of various lysosomal lipid-degrading enzymes []. They probably act by isolating the lipid substrate from the membrane surroundings, thus making it more accessible to the soluble degradative enzymes. All mammalian saposins are synthesized as a single precursor molecule (prosaposin) which contains four Saposin-B domains, yielding the active saposins after proteolytic cleavage, and two Saposin-A domains that are removed in the activation reaction. The Saposin-B domains also occur in other proteins, many of them active in the lysis of membranes [, ].; PDB: 3BQQ_A 2RB3_B 2R0R_A 3BQP_A 2R1Q_A 1NKL_A 1L9L_A 1QDM_C 3RFI_A 4DDJ_A ....
Probab=41.31  E-value=27  Score=23.66  Aligned_cols=23  Identities=17%  Similarity=0.626  Sum_probs=20.3

Q ss_pred             HHHHHHHHccCChhhHHHHHhhc
Q 015516           98 ELVLECILEKLPPSGLCNMAAVC  120 (405)
Q Consensus        98 eelLd~ILs~LSP~DLcrmAaVC  120 (405)
                      +.+++.+.+.++|.++|.+-..|
T Consensus        13 ~~ii~~l~~~~~p~~iC~~i~~C   35 (35)
T PF03489_consen   13 PQIIQLLEKQLDPQQICTKIGLC   35 (35)
T ss_dssp             HHHHHHHHTTSTHHHHHHHTTSS
T ss_pred             HHHHHHHHhcCChHHHHHHcCCC
Confidence            47899999999999999987776


No 27 
>cd05835 Dnmt3b_related The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis.  In tumorigenesis, DNA methylation by Dnmt3b is known to play a role in the inactivation of tumor suppressor genes.  In addition, a point mutation in the PWWP domain of Dnmt3b has been identified in patients with ICF syndrome (immunodeficiency, centromeric instability, and facial anomalies), a rare autosomal recessive disorder characterized by hypomethylation of classical satellite DNA. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=40.94  E-value=42  Score=27.55  Aligned_cols=40  Identities=25%  Similarity=0.506  Sum_probs=25.3

Q ss_pred             CCCCeeEEEeeccCCCCccceeeeeccccccCCCCCceecccCCeEEEEE
Q 015516          301 HPGDHIEIQWRRNKEFPYGWWYGVVGHLESCDGNENFCHCHNSETLILEF  350 (405)
Q Consensus       301 kPGDHiEIQWRrnkefPYgWWYgvVGHlesC~gn~~~C~Ch~sdtvvlEF  350 (405)
                      ++||.|   |=+-|.||  ||=|.|=+..     ....+.+...++.+.|
T Consensus         2 ~vGDlV---WaK~kg~p--wWP~~V~~~~-----~~~~~~~~~~~~~V~f   41 (87)
T cd05835           2 NVGDLV---WGKIKGFP--WWPGRVVSIT-----VTSKRPPVVGMRWVTW   41 (87)
T ss_pred             CCCCEE---EEecCCCC--CCCeEEechh-----hcccccCCCCeEEEEE
Confidence            688877   77778887  9999994432     1111134455666666


No 28 
>PF00970 FAD_binding_6:  Oxidoreductase FAD-binding domain;  InterPro: IPR008333 These sequences contain an oxidoreductase FAD-binding domain.  To date, the 3D-structures of the flavoprotein domain of Zea mays (Maize) nitrate reductase [] and of pig NADH:cytochrome b5 reductase [] have been solved. The overall fold is similar to that of ferredoxin:NADP+ reductase []: the FAD-binding domain (N-terminal) has the topology of an anti-parallel beta-barrel, while the NAD(P)-binding domain (C-terminal) has the topology of a classical pyridine dinucleotide-binding fold (i.e. a central parallel beta-sheet flanked by 2 helices on each side).; PDB: 1JB9_A 3LVB_A 3LO8_A 1FRN_A 1FND_A 1BX1_A 1FNC_A 1FNB_A 1BX0_A 1FRQ_A ....
Probab=40.43  E-value=9.4  Score=30.17  Aligned_cols=18  Identities=44%  Similarity=0.875  Sum_probs=15.3

Q ss_pred             ccccccCCCCCCCeeEEE
Q 015516          292 HISDCLNDLHPGDHIEIQ  309 (405)
Q Consensus       292 h~Sdcl~dLkPGDHiEIQ  309 (405)
                      .+|..|++|++||.|+|-
T Consensus        74 ~~S~~L~~l~~Gd~v~i~   91 (99)
T PF00970_consen   74 RVSRYLHQLKPGDEVEIR   91 (99)
T ss_dssp             HHHHHHHTSCTTSEEEEE
T ss_pred             HHHHHHHhCCCCCEEEEE
Confidence            357889999999999984


No 29 
>TIGR03784 marine_sortase sortase, marine proteobacterial type. Members of this protein family are sortase enzymes, cysteine transpeptidases involved in protein sorting activities. Members of this family tend to be found in proteobacteria, rather than in Gram-positive bacteria where sortases attach proteins to the Gram-positive cell wall or participate in pilin cross-linking. Many species with this sortase appear to contain a signal target sequence, a protein with a Vault protein inter-alpha-trypsin domain (pfam08487) and a von Willebrand factor type A domain (pfam00092), encoded by an adjacent gene. These sortases are designated subfamily 6 according to Comfort and Clubb (2004).
Probab=39.26  E-value=14  Score=34.25  Aligned_cols=22  Identities=32%  Similarity=0.426  Sum_probs=17.9

Q ss_pred             cccCCCCCCCeeEEEeeccCCC
Q 015516          295 DCLNDLHPGDHIEIQWRRNKEF  316 (405)
Q Consensus       295 dcl~dLkPGDHiEIQWRrnkef  316 (405)
                      .-|++|++||.|+||-..++.+
T Consensus       104 ~~L~~L~~GD~I~v~~~~g~~~  125 (174)
T TIGR03784       104 AFLQELRPGDVIRLQTPDGQWQ  125 (174)
T ss_pred             CChhhCCCCCEEEEEECCCeEE
Confidence            3489999999999998766543


No 30 
>PF14604 SH3_9:  Variant SH3 domain; PDB: 2CRE_A 2E5K_A 2CT3_A 2DE0_X 2D8H_A 2DA9_A 2X3X_E 2X3W_D 2KRN_A 2ED0_A ....
Probab=39.25  E-value=25  Score=25.75  Aligned_cols=23  Identities=35%  Similarity=0.790  Sum_probs=17.1

Q ss_pred             CCCCCCeeEEEeeccCCCCccceeeee
Q 015516          299 DLHPGDHIEIQWRRNKEFPYGWWYGVV  325 (405)
Q Consensus       299 dLkPGDHiEIQWRrnkefPYgWWYgvV  325 (405)
                      .|+.||.|+|--+    -.=|||+|..
T Consensus        14 s~~~Gd~i~v~~~----~~~~W~~g~~   36 (49)
T PF14604_consen   14 SFKKGDVITVLEK----SDDGWWYGRN   36 (49)
T ss_dssp             EB-TTEEEEEEEE----SSTSEEEEEE
T ss_pred             eEcCCCEEEEEEe----CCCCEEEEEE
Confidence            4799999999844    3469999963


No 31 
>smart00326 SH3 Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Probab=39.12  E-value=34  Score=23.33  Aligned_cols=23  Identities=30%  Similarity=0.739  Sum_probs=18.4

Q ss_pred             CCCCCCeeEEEeeccCCCCccceeeee
Q 015516          299 DLHPGDHIEIQWRRNKEFPYGWWYGVV  325 (405)
Q Consensus       299 dLkPGDHiEIQWRrnkefPYgWWYgvV  325 (405)
                      .|++||.|+|-=+.    .-|||+|..
T Consensus        20 ~~~~Gd~v~v~~~~----~~~w~~~~~   42 (58)
T smart00326       20 SFKKGDIITVLEKS----DDGWWKGRL   42 (58)
T ss_pred             CCCCCCEEEEEEcC----CCCeEEEEe
Confidence            58999999998554    469999864


No 32 
>smart00306 HintN Hint (Hedgehog/Intein) domain N-terminal region. Hedgehog/Intein domain, N-terminal region. Domain has been split to accommodate large insertions of endonucleases.
Probab=38.20  E-value=16  Score=28.31  Aligned_cols=25  Identities=32%  Similarity=0.465  Sum_probs=20.1

Q ss_pred             CCCCcccccccc-------cCCCCCCCeeEEE
Q 015516          285 DTSPHDLHISDC-------LNDLHPGDHIEIQ  309 (405)
Q Consensus       285 dT~~~dlh~Sdc-------l~dLkPGDHiEIQ  309 (405)
                      -|+-|-+.+.+-       ..+|++||+|-+|
T Consensus        68 ~T~~H~~~~~~~~~~~w~~a~~l~~gd~v~~~   99 (100)
T smart00306       68 LTPDHLLLVRDGGKLVWVFASELKPGDYVLVP   99 (100)
T ss_pred             ECCCCEEEEecCCcEEEEEHHHCCCCCEEEec
Confidence            477788887766       7899999999875


No 33 
>cd04459 Rho_CSD Rho_CSD: Rho protein cold-shock domain (CSD). Rho protein is a transcription termination factor in most bacteria. In bacteria, there are two distinct mechanisms for mRNA transcription termination. In intrinsic termination, RNA polymerase and nascent mRNA are released from DNA template by an mRNA stem loop structure, which resembles the transcription termination mechanism used by eukaryotic pol III. The second mechanism is mediated by Rho factor. Rho factor terminates transcription by using energy from ATP hydrolysis to forcibly dissociate the transcripts from RNA polymerase. Rho protein contains an N-terminal S1-like domain, which binds single-stranded RNA. Rho has a C-terminal ATPase domain which hydrolyzes ATP to provide energy to strip RNA polymerase and mRNA from the DNA template. Rho functions as a homohexamer.
Probab=37.73  E-value=35  Score=27.63  Aligned_cols=36  Identities=31%  Similarity=0.368  Sum_probs=27.8

Q ss_pred             cccCCC--CCCCccccccccc---CCCCCCCeeEEEeeccC
Q 015516          279 LRAAPV--DTSPHDLHISDCL---NDLHPGDHIEIQWRRNK  314 (405)
Q Consensus       279 lRapPv--dT~~~dlh~Sdcl---~dLkPGDHiEIQWRrnk  314 (405)
                      ||.+-.  ...+-|.++|--+   ..||+||.|+=+=|..+
T Consensus        15 LR~~~~~y~~~~~DvyVs~~~Irr~~LR~GD~V~G~vr~p~   55 (68)
T cd04459          15 LRSSGYNYLPGPDDIYVSPSQIRRFNLRTGDTVVGQIRPPK   55 (68)
T ss_pred             EecCCcCCCCCCCCEEECHHHHHHhCCCCCCEEEEEEeCCC
Confidence            555533  5678888888876   68999999999988644


No 34 
>PF07497 Rho_RNA_bind:  Rho termination factor, RNA-binding domain;  InterPro: IPR011113 The Rho termination factor disengages newly transcribed RNA from its DNA template at certain, specific transcripts. It is thought that two copies of Rho bind to RNA and that Rho functions as a hexamer of protomers [].; GO: 0003723 RNA binding, 0006353 transcription termination, DNA-dependent; PDB: 1A8V_B 1PVO_A 1PV4_D 3ICE_A 1XPU_C 1XPO_D 1XPR_F 2A8V_B 2HT1_B 1A63_A ....
Probab=36.88  E-value=33  Score=28.56  Aligned_cols=38  Identities=34%  Similarity=0.363  Sum_probs=24.2

Q ss_pred             ccccC--CCCCCCccccccccc---CCCCCCCeeEEEeeccCC
Q 015516          278 RLRAA--PVDTSPHDLHISDCL---NDLHPGDHIEIQWRRNKE  315 (405)
Q Consensus       278 RlRap--PvdT~~~dlh~Sdcl---~dLkPGDHiEIQWRrnke  315 (405)
                      -||.+  --.+.+.|.|+|-.+   ..||.||.|+=+=|..|+
T Consensus        16 FLR~~~~~y~~~~~DvYVs~~qIrrf~LR~GD~V~G~vr~p~~   58 (78)
T PF07497_consen   16 FLRSPDNNYLPSPDDVYVSPSQIRRFGLRTGDLVEGQVRPPRE   58 (78)
T ss_dssp             EEE-GGGTTS-STTSEEE-CCCCCCTT--TTEEEEEEEE--ST
T ss_pred             EeECCCcCCCCCCCCEEECHHHHHHcCCCCCCEEEEEEeCCCC
Confidence            36666  556678899998876   589999999988887654


No 35 
>COG1018 Hmp Flavodoxin reductases (ferredoxin-NADPH reductases) family 1 [Energy production and conversion]
Probab=33.29  E-value=25  Score=34.27  Aligned_cols=28  Identities=25%  Similarity=0.275  Sum_probs=23.4

Q ss_pred             ccccccC-CCCCCCeeEEEeeccCCCCccc
Q 015516          292 HISDCLN-DLHPGDHIEIQWRRNKEFPYGW  320 (405)
Q Consensus       292 h~Sdcl~-dLkPGDHiEIQWRrnkefPYgW  320 (405)
                      -.|..|+ +|||||.|+| =.-.-+|++.-
T Consensus        79 ~~S~~Lh~~lk~Gd~l~v-~~P~G~F~l~~  107 (266)
T COG1018          79 GGSNWLHDHLKVGDTLEV-SAPAGDFVLDD  107 (266)
T ss_pred             cccHHHHhcCCCCCEEEE-ecCCCCccCCC
Confidence            3688898 9999999999 66777888765


No 36 
>PF11270 DUF3070:  Protein of unknown function (DUF3070);  InterPro: IPR021423  This eukaryotic family of proteins has no known function. 
Probab=32.44  E-value=30  Score=24.95  Aligned_cols=24  Identities=33%  Similarity=0.553  Sum_probs=18.5

Q ss_pred             CCCCCC-CcccccccccCCCCCCCe
Q 015516          282 APVDTS-PHDLHISDCLNDLHPGDH  305 (405)
Q Consensus       282 pPvdT~-~~dlh~Sdcl~dLkPGDH  305 (405)
                      |||+.| +-|-|..--++++.||||
T Consensus         2 aPa~~Pp~Eeaheap~~s~~~~~d~   26 (32)
T PF11270_consen    2 APAEEPPAEEAHEAPHDSDLYPSDH   26 (32)
T ss_pred             CcccCCChhhhccCcccccCCCCCc
Confidence            444444 678888888899999998


No 37 
>cd05162 PWWP The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids.  The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation.  Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.  The function of the PWWP domain is still not known precisely; however, based on the fact that other regions of PWWP-domain proteins are responsible for nuclear localization and DNA-binding, is likely that the PWWP domain acts as a site for protein-protein binding interactions, influencing chromatin remodeling and thereby regulating transcriptional processes.  Some PWWP-domain proteins have been linked to cancer or other diseases; some are known to function as growth factors.
Probab=32.24  E-value=68  Score=25.58  Aligned_cols=29  Identities=31%  Similarity=0.845  Sum_probs=21.4

Q ss_pred             CCCCCeeEEEeeccCCCCccceeeeeccccccCC
Q 015516          300 LHPGDHIEIQWRRNKEFPYGWWYGVVGHLESCDG  333 (405)
Q Consensus       300 LkPGDHiEIQWRrnkefPYgWWYgvVGHlesC~g  333 (405)
                      ++|||.|   |=+-+.||  ||=|+|-+......
T Consensus         1 f~~GdlV---waK~~g~p--wWPa~V~~~~~~~~   29 (87)
T cd05162           1 FRPGDLV---WAKMKGYP--WWPALVVDPPKDSK   29 (87)
T ss_pred             CCCCCEE---EEeCCCCC--CCCEEEccccccch
Confidence            4789877   55566666  99999987776643


No 38 
>cd00174 SH3 Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Probab=30.74  E-value=57  Score=22.06  Aligned_cols=23  Identities=35%  Similarity=0.826  Sum_probs=17.7

Q ss_pred             CCCCCCeeEEEeeccCCCCccceeeee
Q 015516          299 DLHPGDHIEIQWRRNKEFPYGWWYGVV  325 (405)
Q Consensus       299 dLkPGDHiEIQWRrnkefPYgWWYgvV  325 (405)
                      .|++||-|.|-=+.+    =|||+|-.
T Consensus        17 ~~~~Gd~v~v~~~~~----~~w~~~~~   39 (54)
T cd00174          17 SFKKGDIIEVLEKSD----DGWWEGRL   39 (54)
T ss_pred             CCCCCCEEEEEEcCC----CCeEEEEE
Confidence            589999999986633    28998864


No 39 
>TIGR03635 S17_bact 30S ribosomal protein S17. This model describes the bacterial ribosomal small subunit protein S17, while excluding cytosolic eukaryotic homologs and archaeal homologs. The model finds many, but not, chloroplast and mitochondrial counterparts to bacterial S17.
Probab=27.45  E-value=48  Score=26.97  Aligned_cols=13  Identities=31%  Similarity=0.404  Sum_probs=11.3

Q ss_pred             cCCCCCCCeeEEE
Q 015516          297 LNDLHPGDHIEIQ  309 (405)
Q Consensus       297 l~dLkPGDHiEIQ  309 (405)
                      -|+.+.||-|+|+
T Consensus        46 ~~~~k~GD~V~I~   58 (71)
T TIGR03635        46 NNECKVGDVVRII   58 (71)
T ss_pred             CCCCCCCCEEEEE
Confidence            3589999999997


No 40 
>cd03470 Rieske_cytochrome_bc1 Iron-sulfur protein (ISP) component of the bc(1) complex family, Rieske domain; The Rieske domain is a [2Fe-2S] cluster binding domain involved in electron transfer. The bc(1) complex is a multisubunit enzyme found in many different organisms including uni- and multi-cellular eukaryotes, plants (in their mitochondria) and bacteria. The cytochrome bc(1) and b6f complexes are central components of the respiratory and photosynthetic electron transport chains, respectively, which carry out similar core electron and proton transfer steps. The bc(1) and b6f complexes share a common core structure of three catalytic subunits: cyt b, the Rieske ISP, and either a cyt c1 in the bc(1) complex or cyt f in the b6f complex, which are arranged in an integral membrane-bound dimeric complex. While the core of the b6f complex is similar to that of the bc(1) complex, the domain arrangement outside the core and the complement of prosthetic groups are strikingly different.
Probab=26.96  E-value=43  Score=29.40  Aligned_cols=19  Identities=26%  Similarity=0.678  Sum_probs=16.0

Q ss_pred             cCCCCCCCeeEEEeeccCC
Q 015516          297 LNDLHPGDHIEIQWRRNKE  315 (405)
Q Consensus       297 l~dLkPGDHiEIQWRrnke  315 (405)
                      +.+|.||.-..+.||+..-
T Consensus         6 l~~l~~G~~~~v~w~Gkpv   24 (126)
T cd03470           6 LSKIEEGQLITVEWRGKPV   24 (126)
T ss_pred             hhhCCCCCEEEEEECCeEE
Confidence            6789999999999997643


No 41 
>COG1545 Predicted nucleic-acid-binding protein containing a Zn-ribbon [General function prediction only]
Probab=26.64  E-value=1.3e+02  Score=26.96  Aligned_cols=96  Identities=21%  Similarity=0.199  Sum_probs=51.4

Q ss_pred             cCcccccceeeeccCCeeeeeeeecceeeeeecCCcceeecCCCC-CCCceeeecCCCccccccCCCCCCCccccccc-c
Q 015516          219 TGRFWFPAQVYNRENGHVGFMLSCYDAELSYDQSTDTFQARYPPH-GRRPIAIECGVPWERLRAAPVDTSPHDLHISD-C  296 (405)
Q Consensus       219 sG~FWFPAQVYnrenghvGfmLSCYDA~lsYd~~TDTF~ARYpph-Grr~~~ie~~i~WdRlRapPvdT~~~dlh~Sd-c  296 (405)
                      ||+--||.|.|....|--+ -|  -..+||=.-.=.||+.=|.+- |-+.   +....=.-++..-  ...--.|+.| -
T Consensus        35 CG~v~~PPr~~Cp~C~~~~-~~--E~vels~~G~V~t~Tv~~~~~~~~~~---~~P~viaiV~l~~--~~~i~~~i~~~~  106 (140)
T COG1545          35 CGRVYFPPRAYCPKCGSET-EL--EWVELSGEGKVETYTVVYVKPPGFSL---EEPYVIAIVELEE--GGRILGQLVDVD  106 (140)
T ss_pred             CCeEEcCCcccCCCCCCCC-ce--EEEEeCCCeEEEEEEEEeeCCCCccc---CCCEEEEEEEeCC--CCceEEEEEecC
Confidence            6899999999987655443 11  223333333333454443222 2110   0111111111111  1134456666 4


Q ss_pred             cCCCCCCCeeEEEeeccCCCCcccee
Q 015516          297 LNDLHPGDHIEIQWRRNKEFPYGWWY  322 (405)
Q Consensus       297 l~dLkPGDHiEIQWRrnkefPYgWWY  322 (405)
                      -.+++.|+-||..||+..+-.+..+.
T Consensus       107 p~~v~iGm~V~~v~~~~~~~~~~~~~  132 (140)
T COG1545         107 PDDVEIGMKVEAVFRKREEDGGRGYI  132 (140)
T ss_pred             cccccCCCEEEEEEEEccccCCceEE
Confidence            67999999999999998887766543


No 42 
>PF11604 CusF_Ec:  Copper binding periplasmic protein CusF;  InterPro: IPR021647  CusF is a periplasmic protein involved in copper and silver resistance in Escherichia coil. CusF forms a five-stranded beta-barrel OB fold. Cu(I) binds to H36, M47 and M49 which are conserved residues in the protein []. ; PDB: 2L55_A 2VB3_X 1ZEQ_X 2QCP_X 3E6Z_X 2VB2_X.
Probab=24.01  E-value=59  Score=25.79  Aligned_cols=30  Identities=23%  Similarity=0.400  Sum_probs=19.1

Q ss_pred             CCCcccccccccCCCCCCCeeEEEeeccCC
Q 015516          286 TSPHDLHISDCLNDLHPGDHIEIQWRRNKE  315 (405)
Q Consensus       286 T~~~dlh~Sdcl~dLkPGDHiEIQWRrnke  315 (405)
                      |-...+--+--+.+|+|||.|+++-.+..+
T Consensus        29 TM~F~v~~~~~l~~l~~Gd~V~F~~~~~~~   58 (70)
T PF11604_consen   29 TMDFPVADPVDLAGLKPGDKVRFTFERTDD   58 (70)
T ss_dssp             EEEEE--TTSEESS-STT-EEEEEEEEETT
T ss_pred             EEEEEcCChhhhhcCCCCCEEEEEEEECCC
Confidence            334444445578999999999999887654


No 43 
>PF04225 OapA:  Opacity-associated protein A LysM-like domain;  InterPro: IPR007340 This entry includes the Haemophilus influenzae opacity-associated protein. This protein is required for efficient nasopharyngeal mucosal colonization, and its expression is associated with a distinctive transparent colony phenotype. OapA is thought to be a secreted protein, and its expression exhibits high-frequency phase variation [].; PDB: 2GU1_A.
Probab=23.49  E-value=41  Score=27.70  Aligned_cols=16  Identities=38%  Similarity=0.634  Sum_probs=9.2

Q ss_pred             cccCCCCCCCeeEEEe
Q 015516          295 DCLNDLHPGDHIEIQW  310 (405)
Q Consensus       295 dcl~dLkPGDHiEIQW  310 (405)
                      .-|..|||||.|+++=
T Consensus        38 k~L~~L~pGq~l~f~~   53 (85)
T PF04225_consen   38 KPLTRLKPGQTLEFQL   53 (85)
T ss_dssp             --GGG--TT-EEEEEE
T ss_pred             chHhhCCCCCEEEEEE
Confidence            4788999999998763


No 44 
>cd04453 S1_RNase_E S1_RNase_E: RNase E and RNase G, S1-like RNA-binding domain. RNase E is an essential endoribonuclease in the processing and degradation of RNA. In addition to its role in mRNA degradation, RNase E has also been implicated in the processing of rRNA, and the maturation of tRNA, 10Sa RNA and the M1 precursor of RNase P. RNase E associates with PNPase (3' to 5' exonuclease), Rhl B (DEAD-box RNA helicase) and enolase (glycolytic enzyme)  to form the RNA degradosome. RNase E tends to cut mRNA within single-stranded regions that are rich in A/U nucleotides. The N-terminal region of RNase E contains the catalytic site. Within the conserved N-terminal domain of RNAse E and RNase G, there is an S1-like subdomain, which is an ancient single-stranded RNA-binding domain. S1 domain is an RNA-binding module originally identified in the ribosomal protein S1. The S1 domain is required for RNA cleavage by RNase E. RNase G is paralogous to RNase E with an N-terminal catalytic domain th
Probab=23.05  E-value=40  Score=27.61  Aligned_cols=15  Identities=27%  Similarity=0.310  Sum_probs=12.7

Q ss_pred             CCCCCCCeeEEEeec
Q 015516          298 NDLHPGDHIEIQWRR  312 (405)
Q Consensus       298 ~dLkPGDHiEIQWRr  312 (405)
                      +.|+.||.|++|=-+
T Consensus        55 ~~~~~Gd~v~VqV~~   69 (88)
T cd04453          55 KLLKEGQEILVQVVK   69 (88)
T ss_pred             HcCCCCCEEEEEEEE
Confidence            459999999999765


No 45 
>PF13550 Phage-tail_3:  Putative phage tail protein
Probab=22.77  E-value=56  Score=27.71  Aligned_cols=14  Identities=36%  Similarity=0.465  Sum_probs=12.4

Q ss_pred             CCCCCCCeeEEEee
Q 015516          298 NDLHPGDHIEIQWR  311 (405)
Q Consensus       298 ~dLkPGDHiEIQWR  311 (405)
                      -.|.|||.|+|++-
T Consensus       138 ~~l~pGDvi~l~~~  151 (164)
T PF13550_consen  138 LALEPGDVIALSDD  151 (164)
T ss_pred             ccCCCCCEEEEEeC
Confidence            45999999999986


No 46 
>TIGR02988 YaaA_near_RecF S4 domain protein YaaA. This small protein has a single S4 domain (pfam01479), as do bacterial ribosomal protein S4, some pseudouridine synthases, tyrosyl-tRNA synthetases. The S4 domain may bind RNA. Members of this protein family are found almost exclusively in the Firmicutes, and almost invariably just a few nucleotides upstream of the gene for the DNA replication and repair protein RecF. The few members of this family that are not near recF are found instead near dnaA and/or dnaN, the usual neighbors of recF, near the origin of replication. The conserved location suggests a possible role in replication in the Firmicutes lineage.
Probab=21.87  E-value=54  Score=24.60  Aligned_cols=10  Identities=70%  Similarity=1.026  Sum_probs=7.0

Q ss_pred             CCCCCCeeEE
Q 015516          299 DLHPGDHIEI  308 (405)
Q Consensus       299 dLkPGDHiEI  308 (405)
                      -|++||+|+|
T Consensus        49 ~l~~Gd~v~i   58 (59)
T TIGR02988        49 KLYPGDVIEI   58 (59)
T ss_pred             CCCCCCEEEe
Confidence            4677777776


No 47 
>cd06166 Sortase_D_5 Sortase D (SrtD) is a membrane transpeptidase found in gram-positive bacteria that anchors surface proteins to peptidoglycans of the bacterial cell wall envelope. This involves a transpeptidation reaction in which the surface protein substrate is cleaved at the cell wall sorting signal and covalently linked to peptidoglycan for display on the bacterial surface. Sortases are grouped into different classes and subfamilies based on sequence, membrane topology, genomic positioning, and cleavage site preference. Class D sortases are further classified into subfamilies 4 and 5. This group contains a subset of Class D sortases belonging to subfamily-5, represented by Clostridium perfringens CPE2315. Subfamily-5 sortases recognize a nonstandard sorting signal (LAXTG) and have replaced Sortase A in some gram-postive bacteria. They may play a housekeeping role in the cell.
Probab=21.75  E-value=36  Score=29.13  Aligned_cols=18  Identities=22%  Similarity=0.499  Sum_probs=15.2

Q ss_pred             ccccCCCCCCCeeEEEee
Q 015516          294 SDCLNDLHPGDHIEIQWR  311 (405)
Q Consensus       294 Sdcl~dLkPGDHiEIQWR  311 (405)
                      -.-|++|++||.|.|+..
T Consensus        59 F~~L~~l~~Gd~v~v~~~   76 (126)
T cd06166          59 FNRLDEVEKGDEIKVTTK   76 (126)
T ss_pred             cCChHHCCCCCEEEEEEC
Confidence            345889999999999885


No 48 
>cd00081 Hint Hedgehog/Intein domain, found in Hedgehog proteins as well as proteins which contain inteins and undergo protein splicing (e.g. DnaB, RIR1-2, GyrA and Pol). In protein splicing an intervening polypeptide sequence - the intein - is excised from a protein, and the flanking polypeptide sequences - the exteins - are joined by a peptide bond. In addition to the autocatalytic splicing domain, many inteins contain an inserted endonuclease domain, which plays a role in spreading inteins. Hedgehog proteins are a major class of intercellular signaling molecules, which control inductive interactions during animal development. The mature signaling forms of hedgehog proteins are the N-terminal fragments, which are covalently linked to cholesterol at their C-termini. This modification is the result of an autoprocessing step catalyzed by the C-terminal fragments, which are aligned here.
Probab=21.69  E-value=45  Score=27.37  Aligned_cols=25  Identities=32%  Similarity=0.448  Sum_probs=22.3

Q ss_pred             CCCCcccccccc-------cCCCCCCCeeEEE
Q 015516          285 DTSPHDLHISDC-------LNDLHPGDHIEIQ  309 (405)
Q Consensus       285 dT~~~dlh~Sdc-------l~dLkPGDHiEIQ  309 (405)
                      -|+-|-+.+.+.       ..+|++||.|-+|
T Consensus        68 ~T~~H~~~~~~~~~~~~~~a~~l~~gd~l~~~   99 (136)
T cd00081          68 LTPDHLLFVLEDGELKWVFASDLKPGDYVLVP   99 (136)
T ss_pred             EcCCCEEEEEeCCeEEEEEHHHCCCCCEEEEc
Confidence            588899998885       7999999999999


No 49 
>PRK06944 sulfur carrier protein ThiS; Provisional
Probab=21.54  E-value=56  Score=24.52  Aligned_cols=9  Identities=33%  Similarity=0.877  Sum_probs=8.2

Q ss_pred             CCCCCeeEE
Q 015516          300 LHPGDHIEI  308 (405)
Q Consensus       300 LkPGDHiEI  308 (405)
                      |+.||.|||
T Consensus        50 L~~gD~vei   58 (65)
T PRK06944         50 LAAGDRLDL   58 (65)
T ss_pred             CCCCCEEEE
Confidence            888999998


No 50 
>PF13621 Cupin_8:  Cupin-like domain; PDB: 3AL6_C 3AL5_C 2XUM_A 2Y0I_A 1MZE_A 3KCY_A 1MZF_A 1YCI_A 2ILM_A 1H2L_A ....
Probab=21.43  E-value=49  Score=29.54  Aligned_cols=21  Identities=48%  Similarity=0.914  Sum_probs=15.3

Q ss_pred             CCCCCCeeEEEeeccCCCCccceeeeecc
Q 015516          299 DLHPGDHIEIQWRRNKEFPYGWWYGVVGH  327 (405)
Q Consensus       299 dLkPGDHiEIQWRrnkefPYgWWYgvVGH  327 (405)
                      .|.|||-+=|        |=|||+.|.-.
T Consensus       212 ~l~pGD~Lfi--------P~gWwH~V~~~  232 (251)
T PF13621_consen  212 VLEPGDVLFI--------PPGWWHQVENL  232 (251)
T ss_dssp             EEETT-EEEE---------TT-EEEEEES
T ss_pred             EECCCeEEEE--------CCCCeEEEEEc
Confidence            6899999976        99999999743


No 51 
>PF03658 Ub-RnfH:  RnfH family Ubiquitin;  InterPro: IPR005346 This is a small family of proteins of unknown function.; PDB: 2HJ1_B.
Probab=21.43  E-value=47  Score=28.05  Aligned_cols=10  Identities=50%  Similarity=0.787  Sum_probs=6.1

Q ss_pred             CCCCCCeeEE
Q 015516          299 DLHPGDHIEI  308 (405)
Q Consensus       299 dLkPGDHiEI  308 (405)
                      -|+.||.|||
T Consensus        61 ~L~~GDRVEI   70 (84)
T PF03658_consen   61 VLRDGDRVEI   70 (84)
T ss_dssp             B--TT-EEEE
T ss_pred             cCCCCCEEEE
Confidence            4889999998


No 52 
>PRK05863 sulfur carrier protein ThiS; Provisional
Probab=20.92  E-value=58  Score=25.16  Aligned_cols=9  Identities=33%  Similarity=0.752  Sum_probs=8.4

Q ss_pred             CCCCCeeEE
Q 015516          300 LHPGDHIEI  308 (405)
Q Consensus       300 LkPGDHiEI  308 (405)
                      |+.||.|||
T Consensus        50 L~~gD~ieI   58 (65)
T PRK05863         50 LRDGARLEV   58 (65)
T ss_pred             cCCCCEEEE
Confidence            899999998


No 53 
>KOG0162 consensus Myosin class I heavy chain [Cytoskeleton]
Probab=20.90  E-value=1.2e+02  Score=35.30  Aligned_cols=27  Identities=44%  Similarity=0.887  Sum_probs=22.1

Q ss_pred             CCCCCCeeEEEeeccCCCCccceeeeecccc
Q 015516          299 DLHPGDHIEIQWRRNKEFPYGWWYGVVGHLE  329 (405)
Q Consensus       299 dLkPGDHiEIQWRrnkefPYgWWYgvVGHle  329 (405)
                      -++-||.|||-    ++=|=|||-|-+|-.|
T Consensus      1069 s~~~~diIei~----~edpSGWw~gk~~~ke 1095 (1106)
T KOG0162|consen 1069 SFKKGDIIEIM----REDPSGWWLGKLNGKE 1095 (1106)
T ss_pred             cccCCCEEEEe----ccCCCcchhhccCCcc
Confidence            36899999995    6779999999876544


No 54 
>PRK05886 yajC preprotein translocase subunit YajC; Validated
Probab=20.57  E-value=69  Score=28.21  Aligned_cols=48  Identities=21%  Similarity=0.387  Sum_probs=29.9

Q ss_pred             ccCCCCCCCeeEEEeeccCCCCccceeeeeccccccCCCCCceecccCCeEEEEEeecCCCCcceeeeccc
Q 015516          296 CLNDLHPGDHIEIQWRRNKEFPYGWWYGVVGHLESCDGNENFCHCHNSETLILEFNQYTPGSRWRKTTINR  366 (405)
Q Consensus       296 cl~dLkPGDHiEIQWRrnkefPYgWWYgvVGHlesC~gn~~~C~Ch~sdtvvlEF~qY~~gSrWR~t~inR  366 (405)
                      =+++|+|||.|---            -|++|--.+-+.          |+|+||... ..--+|.|..|.+
T Consensus        35 m~~~Lk~GD~VvT~------------gGi~G~V~~I~d----------~~v~leia~-gv~i~~~r~AI~~   82 (109)
T PRK05886         35 LHESLQPGDRVHTT------------SGLQATIVGITD----------DTVDLEIAP-GVVTTWMKLAVRD   82 (109)
T ss_pred             HHHhcCCCCEEEEC------------CCeEEEEEEEeC----------CEEEEEECC-CeEEEEEhhheee
Confidence            35789999998642            355555565543          489999752 1223455666664


No 55 
>PF06945 DUF1289:  Protein of unknown function (DUF1289);  InterPro: IPR010710 This family consists of a number of hypothetical bacterial proteins. The aligned region spans around 56 residues and contains 4 highly conserved cysteine residues towards the N terminus. The function of this family is unknown.
Probab=20.54  E-value=60  Score=24.56  Aligned_cols=16  Identities=25%  Similarity=0.372  Sum_probs=12.3

Q ss_pred             cCCHHHHHHHhhcCCc
Q 015516          386 VNNEEEIKMWKRLWPV  401 (405)
Q Consensus       386 L~~~eEI~~Wk~lwp~  401 (405)
                      .+|.|||..|+.+=|.
T Consensus        21 ~RT~dEI~~W~~~s~~   36 (51)
T PF06945_consen   21 GRTLDEIRDWKSMSDD   36 (51)
T ss_pred             CCcHHHHHHHhhCCHH
Confidence            4678999999987443


Done!