Query         psy8451
Match_columns 110
No_of_seqs    142 out of 861
Neff          8.5 
Searched_HMMs 46136
Date          Fri Aug 16 19:12:41 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy8451.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8451hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG3780|consensus              100.0 4.3E-28 9.4E-33  181.5  11.6  109    2-110   101-209 (427)
  2 PF00339 Arrestin_N:  Arrestin   99.8 1.9E-21 4.1E-26  126.5   4.0   61    2-62     89-149 (149)
  3 KOG3865|consensus               99.4   1E-12 2.2E-17   94.7   6.8  100    5-110   106-216 (402)
  4 PF13002 LDB19:  Arrestin_N ter  99.2 9.8E-11 2.1E-15   79.5   8.6   58    2-59     46-115 (191)
  5 PF03643 Vps26:  Vacuolar prote  98.6 9.7E-07 2.1E-11   63.6  11.4   92    3-107    93-184 (275)
  6 PF08737 Rgp1:  Rgp1;  InterPro  98.0 3.2E-05 6.9E-10   58.7   7.5   57    1-57    113-173 (415)
  7 KOG2717|consensus               97.0 0.00094   2E-08   47.2   3.8   61    3-63     84-151 (313)
  8 PF07070 Spo0M:  SpoOM protein;  97.0  0.0027   6E-08   44.4   6.1   52    2-57     81-133 (218)
  9 PF02752 Arrestin_C:  Arrestin   96.2   0.025 5.4E-07   35.4   6.1   53    5-57     78-133 (136)
 10 KOG3063|consensus               95.8   0.041 8.8E-07   39.2   6.0   64    8-75    104-167 (301)
 11 COG4326 Spo0M Sporulation cont  92.0    0.39 8.5E-06   33.6   4.6   51    2-56    101-152 (270)
 12 cd00917 PG-PI_TP The phosphati  87.7     2.2 4.8E-05   26.9   5.3   32    3-42     79-110 (122)
 13 PF04425 Bul1_N:  Bul1 N termin  84.6     3.4 7.4E-05   32.0   5.7   17    2-18    255-272 (438)
 14 PF02221 E1_DerP2_DerF2:  ML do  83.6     5.7 0.00012   24.8   5.7   32    3-42     88-120 (134)
 15 cd00258 GM2-AP GM2 activator p  78.8      10 0.00022   25.5   5.6   37    3-43    109-147 (162)
 16 cd08544 Reeler Reeler, the N-t  78.0     2.6 5.6E-05   26.9   2.7   22   89-110    19-40  (135)
 17 KOG4469|consensus               77.2     8.9 0.00019   27.4   5.3   44    2-45    104-148 (391)
 18 smart00737 ML Domain involved   75.4      12 0.00027   22.9   5.2   31    3-41     74-105 (118)
 19 PF08694 UFC1:  Ubiquitin-fold   68.2     6.3 0.00014   26.0   2.7   19    6-24     73-91  (161)
 20 PF02014 Reeler:  Reeler domain  65.6     5.7 0.00012   25.2   2.1   21   90-110    20-40  (132)
 21 cd02859 AMPKbeta_GBD_like AMP-  63.6     3.9 8.5E-05   23.7   1.0   13    2-14     42-54  (79)
 22 PF07703 A2M_N_2:  Alpha-2-macr  62.3     7.5 0.00016   24.3   2.2   23   88-110    94-116 (136)
 23 PF07610 DUF1573:  Protein of u  59.1      12 0.00026   19.2   2.3   21   90-110    23-43  (45)
 24 PF08737 Rgp1:  Rgp1;  InterPro  55.2      33 0.00071   26.4   4.9   32    9-40    381-412 (415)
 25 cd00912 ML The ML (MD-2-relate  53.0      17 0.00037   22.7   2.7   31    3-41     82-114 (127)
 26 PF08845 SymE_toxin:  Toxin Sym  48.6     6.5 0.00014   21.7   0.2   17   94-110    32-48  (57)
 27 PF04729 ASF1_hist_chap:  ASF1   48.0      24 0.00053   23.5   2.9   39    3-41     65-119 (154)
 28 PF03643 Vps26:  Vacuolar prote  46.4      68  0.0015   23.3   5.2   32    9-40    237-271 (275)
 29 cd02861 E_set_proteins_like E   45.8      15 0.00032   21.2   1.5   11    3-13     44-54  (82)
 30 TIGR03000 plancto_dom_1 Planct  44.0      66  0.0014   18.8   4.0   26   30-56     41-66  (75)
 31 PF01835 A2M_N:  MG2 domain;  I  43.7      67  0.0015   18.8   5.6   24    9-40     63-86  (99)
 32 PF07495 Y_Y_Y:  Y_Y_Y domain;   42.8      15 0.00033   19.8   1.2   11    3-13     36-46  (66)
 33 KOG3063|consensus               41.8      94   0.002   22.5   5.1   52    9-60    244-298 (301)
 34 PF06688 DUF1187:  Protein of u  41.3      43 0.00093   18.7   2.7   14   33-46      1-14  (61)
 35 PF09608 Alph_Pro_TM:  Putative  41.0 1.1E+02  0.0024   21.7   5.5   42    8-57    155-196 (236)
 36 PF08750 CNP1:  CNP1-like famil  40.6      50  0.0011   21.6   3.4   15   26-40     47-61  (139)
 37 COG3126 Uncharacterized protei  40.5      57  0.0012   21.8   3.7   35    4-44     87-121 (158)
 38 KOG3780|consensus               40.2 1.2E+02  0.0027   22.7   6.1   50    6-56    263-316 (427)
 39 PF08875 DUF1833:  Domain of un  39.2      64  0.0014   21.0   3.9   10    9-18     53-62  (150)
 40 PF14347 DUF4399:  Domain of un  37.8      25 0.00054   21.0   1.6   14    1-14     57-70  (87)
 41 PF08111 Pea-VEAacid:  Pea-VEAa  34.7      15 0.00033   14.3   0.2    8    2-9       2-9   (15)
 42 PF10342 GPI-anchored:  Ser-Thr  34.0      95  0.0021   17.7   3.8   20   12-38     58-77  (93)
 43 PF05688 DUF824:  Salmonella re  33.8      24 0.00052   18.7   0.9   11  100-110     9-19  (47)
 44 KOG3262|consensus               32.4      96  0.0021   21.5   3.9   22   87-108   102-127 (215)
 45 PRK09750 hypothetical protein;  31.4      59  0.0013   18.2   2.2   13   32-44      3-15  (64)
 46 KOG1584|consensus               30.6      34 0.00074   25.3   1.6   25   12-36    107-132 (297)
 47 PF01345 DUF11:  Domain of unkn  30.5      82  0.0018   17.6   3.0   26   85-110    22-47  (76)
 48 KOG0420|consensus               30.5      51  0.0011   22.5   2.3   17    4-20     69-88  (184)
 49 TIGR02567 YscW type III secret  28.5   1E+02  0.0023   19.7   3.4   25   30-55     81-105 (124)
 50 COG2373 Large extracellular al  27.5      47   0.001   30.2   2.2   22   89-110   751-772 (1621)
 51 PRK02939 lipoprotein; Reviewed  27.3 2.4E+02  0.0051   20.2   7.1   26   18-43     47-72  (236)
 52 TIGR02186 alph_Pro_TM conserve  26.6 2.6E+02  0.0055   20.4   5.5   40   10-57    182-221 (261)
 53 KOG1366|consensus               26.4 1.1E+02  0.0025   27.6   4.2   28   83-110   109-136 (1436)
 54 PF12857 TOBE_3:  TOBE-like dom  26.4      92   0.002   16.7   2.6   24   85-109    29-56  (58)
 55 PF05132 RNA_pol_Rpc4:  RNA pol  25.1      54  0.0012   20.8   1.7   14   12-25      6-19  (131)
 56 TIGR03769 P_ac_wall_RPT actino  24.9      94   0.002   15.7   2.2   12    4-15     11-22  (41)
 57 PF12761 End3:  Actin cytoskele  24.3      15 0.00032   25.4  -1.1   24   13-36     18-43  (195)
 58 TIGR01451 B_ant_repeat conserv  23.4      54  0.0012   17.4   1.2   13   96-108     4-16  (53)
 59 PF13584 BatD:  Oxygen toleranc  23.3   1E+02  0.0023   23.7   3.2   26   85-110   124-149 (484)
 60 PF13584 BatD:  Oxygen toleranc  22.3      89  0.0019   24.1   2.7   24   87-110   271-294 (484)
 61 PF02303 Phage_DNA_bind:  Helix  22.2      57  0.0012   19.5   1.2   15    6-20     38-52  (86)
 62 PF14734 DUF4469:  Domain of un  21.6 1.7E+02  0.0038   17.9   3.4   22   12-41     66-87  (102)
 63 PF05078 DUF679:  Protein of un  21.3      66  0.0014   21.8   1.5   16   22-37     58-73  (170)
 64 smart00736 CADG Dystroglycan-t  21.1 1.8E+02  0.0039   17.1   3.4   25   11-38     14-39  (97)
 65 KOG3265|consensus               21.0 1.5E+02  0.0033   21.2   3.3   37    3-39     65-117 (250)

No 1  
>KOG3780|consensus
Probab=99.95  E-value=4.3e-28  Score=181.51  Aligned_cols=109  Identities=42%  Similarity=0.827  Sum_probs=104.2

Q ss_pred             eecCeeeEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCCCceEEEEEEEEccccCCCCCCCCCceeEEEeeeEEE
Q psy8451           2 EIPQGSHVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKFDKTTEAVFSVVTPYDLNTDMRAKEPINQNVSKNFRF   81 (110)
Q Consensus         2 ~l~~G~h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~~~~~~~~~p~~~~p~~~~~~~~~~~   81 (110)
                      .|++|.|+|||+|+||.+||+||+|.+|+|||.|+|.++|+|+.+....+.|+|++.+|+|..|....|.....++.++|
T Consensus       101 ~l~~G~~~~pF~~~LP~~~P~Sfeg~~G~irY~vk~~idr~~~~~~~~~~~~~V~~~~~ln~~p~~~~~~~~~~~k~~~~  180 (427)
T KOG3780|consen  101 VLPPGNYEFPFSFTLPLNLPPSFEGKFGHVRYFVKAEIDRPWKLNKKNRKPFTVIETVDLNSSPSLLEPIISKASKKLGC  180 (427)
T ss_pred             ecCCCceEEeEeccCCCCCCCceeeCCceEEEEEEEEEecCCCCCccceeeEEEecccccccCccccCcchhhhhheeeE
Confidence            48899999999999999999999999999999999999999999999999999999999998888899999999999999


Q ss_pred             EEccCccEEEEEEeCCeeeeCCCeEEEEC
Q psy8451          82 LWRQSGPLTLSVSAPFSGFVPGQTIPCMI  110 (110)
Q Consensus        82 ~~~~~g~v~~~~~l~k~~y~pGe~i~i~v  110 (110)
                      ||+.+|.+.+++++++.+|++||.|++.+
T Consensus       181 ~~~~~g~v~~~~~ip~~~~~~ge~i~~~~  209 (427)
T KOG3780|consen  181 VCFSSGPVSLELTIPKTGYVPGETIPVTL  209 (427)
T ss_pred             EEecCCcEEEEEEcccccCcCCccEEEEE
Confidence            99999999999999999999999998763


No 2  
>PF00339 Arrestin_N:  Arrestin (or S-antigen), N-terminal domain;  InterPro: IPR011021 G protein-coupled receptors are a large family of signalling molecules that respond to a wide variety of extracellular stimuli. The receptors relay the information encoded by the ligand through the activation of heterotrimeric G proteins and intracellular effector molecules. To ensure the appropriate regulation of the signalling cascade, it is vital to properly inactivate the receptor. This inactivation is achieved, in part, by the binding of a soluble protein, arrestin, which uncouples the receptor from the downstream G protein after the receptors are phosphorylated by G protein-coupled receptor kinases. In addition to the inactivation of G protein-coupled receptors, arrestins have also been implicated in the endocytosis of receptors and cross talk with other signalling pathways. Arrestin (retinal S-antigen) is a major protein of the retinal rod outer segments. It interacts with photo-activated phosphorylated rhodopsin, inhibiting or 'arresting' its ability to interact with transducin []. The protein binds calcium, and shows similarity in its C terminus to alpha-transducin and other purine nucleotide-binding proteins. In mammals, arrestin is associated with autoimmune uveitis. Arrestins comprise a family of closely-related proteins that includes beta-arrestin-1 and -2, which regulate the function of beta-adrenergic receptors by binding to their phosphorylated forms, impairing their capacity to activate G(S) proteins; Cone photoreceptors C-arrestin (arrestin-X) [], which could bind to phosphorylated red/green opsins; and Drosophila phosrestins I and II, which undergo light-induced phosphorylation, and probably play a role in photoreceptor transduction [, , ].  The crystal structure of bovine retinal arrestin comprises two domains of antiparallel beta-sheets connected through a hinge region and one short alpha-helix on the back of the amino-terminal fold []. The binding region for phosphorylated light-activated rhodopsin is located at the N-terminal domain, as indicated by the docking of the photoreceptor to the three-dimensional structure of arrestin.  The N-terminal domain consists of an immunoglobulin-like beta-sandwich structure. This entry represents proteins with immunoglobulin-like domains that are similar to those found in arrestin.; PDB: 1SUJ_A 3UGX_A 1CF1_B 1AYR_A 3UGU_A 3P2D_B 1ZSH_A 2WTR_B 3GC3_A 1G4R_A ....
Probab=99.84  E-value=1.9e-21  Score=126.52  Aligned_cols=61  Identities=49%  Similarity=0.931  Sum_probs=47.2

Q ss_pred             eecCeeeEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCCCceEEEEEEEEccccCC
Q psy8451           2 EIPQGSHVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKFDKTTEAVFSVVTPYDLN   62 (110)
Q Consensus         2 ~l~~G~h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~~~~~~~   62 (110)
                      .|++|+|+|||+|+||.+||+||++.+|+|+|.|+|.+++|+..+.+++++|+|++.+|+|
T Consensus        89 ~l~~G~~~fpF~f~LP~~lP~S~~~~~g~I~Y~l~a~l~~~~~~~~~~~~~~~v~~~id~n  149 (149)
T PF00339_consen   89 ILPPGEYEFPFEFQLPSNLPSSFEGSHGSIRYKLKATLDRPGKKDHKAKREFTVVEPIDLN  149 (149)
T ss_dssp             ---C-TTEEEEEE---TTS--SEEEE-SEEEEEEEEEESSTTSE--CGGEEEEEEEEEEB-
T ss_pred             cccCCCEEEEEEEECCCCCCceEeccCcCEEEEEEEEEECCCCCCcEEEEEEEEECcccCC
Confidence            4789999999999999999999999999999999999999999999999999999999986


No 3  
>KOG3865|consensus
Probab=99.39  E-value=1e-12  Score=94.69  Aligned_cols=100  Identities=27%  Similarity=0.428  Sum_probs=72.0

Q ss_pred             CeeeEEEeEEECCCCCCCcee--------cCCeEEEEEEEEEEeCCC-CCCc-eEEEEEEEEccccCCCCCCCCCce-eE
Q psy8451           5 QGSHVYPFSYTLPPTIPSSFD--------GEHGRVRYKVTARLDRPW-KFDK-TTEAVFSVVTPYDLNTDMRAKEPI-NQ   73 (110)
Q Consensus         5 ~G~h~fpF~f~LP~~lPsSf~--------~~~g~I~Y~i~a~i~~~~-~~~~-~~~~~~~V~~~~~~~~~p~~~~p~-~~   73 (110)
                      -|.+.|||.|++|.++|+|+.        |+-+.|.|.|+|.+.... ...+ +.+-.+. ++.+.  ..|.-..|+ ..
T Consensus       106 LG~nAyPF~f~~pp~~P~SVtLQp~p~D~gKpcGVdyevkaF~~~s~edk~hKr~sVrL~-IRKvq--yAP~~~GpqP~~  182 (402)
T KOG3865|consen  106 LGSNAYPFTFEFPPNLPCSVTLQPGPEDTGKPCGVDYEVKAFVADSEEDKIHKRNSVRLV-IRKVQ--YAPLEPGPQPSA  182 (402)
T ss_pred             hCCCCCceEEeCCCCCCceEEeccCCccCCCcccceEEEEEEecCCcccccccccceeee-eeeee--ecCCCCCCCchh
Confidence            389999999999999999995        566899999999996442 1122 2222333 33332  234333343 23


Q ss_pred             EEeeeEEEEEccCccEEEEEEeCCeeeeCCCeEEEEC
Q psy8451          74 NVSKNFRFLWRQSGPLTLSVSAPFSGFVPGQTIPCMI  110 (110)
Q Consensus        74 ~~~~~~~~~~~~~g~v~~~~~l~k~~y~pGe~i~i~v  110 (110)
                      +..+.|   .++.|+++++++|||.-|..||.|.++|
T Consensus       183 ~v~k~F---lmS~~~lhLevsLDkEiYyHGE~isvnV  216 (402)
T KOG3865|consen  183 EVSKQF---LMSDGPLHLEVSLDKEIYYHGEPISVNV  216 (402)
T ss_pred             HhhHhh---ccCCCceEEEEEecchheecCCceeEEE
Confidence            445655   6899999999999999999999999986


No 4  
>PF13002 LDB19:  Arrestin_N terminal like;  InterPro: IPR024391 This entry represents a predicted Ig-like beta sandwich domain found towards the N terminus of protein LDB19 []. It is also found in other sequences and is related to the arrestin N-terminal fold [].
Probab=99.22  E-value=9.8e-11  Score=79.50  Aligned_cols=58  Identities=28%  Similarity=0.502  Sum_probs=46.5

Q ss_pred             eecCeeeEEEeEEECCCCCCCcee---cCCeEEEEEEEEEEeC--CC------CC-CceEEEEEEEEccc
Q psy8451           2 EIPQGSHVYPFSYTLPPTIPSSFD---GEHGRVRYKVTARLDR--PW------KF-DKTTEAVFSVVTPY   59 (110)
Q Consensus         2 ~l~~G~h~fpF~f~LP~~lPsSf~---~~~g~I~Y~i~a~i~~--~~------~~-~~~~~~~~~V~~~~   59 (110)
                      .|..|.|.|||++-||.+||+|.+   +..++|.|.++|++..  +.      +. .++.++.+.|.+.+
T Consensus        46 ~l~~G~h~fPFS~LiPG~LPaS~~lgs~~l~~I~Yel~A~a~~~~~~~~~~~~~~~~~~~~~pl~V~Rsi  115 (191)
T PF13002_consen   46 TLTKGSHAFPFSYLIPGHLPASMDLGSTPLVSIKYELKAEATYKDPRRGSSSSKPRVLKLKRPLPVKRSI  115 (191)
T ss_pred             ccCCCcccCCeeEECCCCCccccccCCCCcEEEEEEEEEEEEEccCccccCCCcceeEEEeeeEEEEEec
Confidence            478999999999999999999999   7889999999999976  21      11 13455667776654


No 5  
>PF03643 Vps26:  Vacuolar protein sorting-associated protein 26 ;  InterPro: IPR005377  The movement of lipid and protein components between intracellular organelles requires the regulated interactions of many molecules. Vacuolar protein sorting-associated protein (Vps)5 is a yeast protein that is a subunit of a large multimeric complex, termed the retromer complex, involved in retrograde transport of proteins from endosomes to the trans-Golgi network. Sorting nexin (SNX) 1 and SNX2 are its mammalian orthologs []. To carry out its biological functions, Vps5 forms the retromer complex with at least four other proteins: Vps17, Vps26, Vps29, and Vps35 []. This family of Vps26-proteins also contains Down syndrome critical region 3/A.; GO: 0007034 vacuolar transport, 0030904 retromer complex; PDB: 3LHA_A 3LH9_A 2R51_A 3LH8_B 2FAU_A.
Probab=98.63  E-value=9.7e-07  Score=63.57  Aligned_cols=92  Identities=21%  Similarity=0.357  Sum_probs=54.5

Q ss_pred             ecCeeeEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCCCceEEEEEEEEccccCCCCCCCCCceeEEEeeeEEEE
Q psy8451           3 IPQGSHVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKFDKTTEAVFSVVTPYDLNTDMRAKEPINQNVSKNFRFL   82 (110)
Q Consensus         3 l~~G~h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~~~~~~~~~p~~~~p~~~~~~~~~~~~   82 (110)
                      |+.|.- |||+|.+-..--.||.|.+-+|+|.|+|.+.|++ .+...+.+|.|...   ...|+...|...+.       
T Consensus        93 l~~~~t-~pFeF~~~~k~yETY~G~~v~i~Y~lrv~v~R~~-~~i~k~~ef~V~~~---~~~p~~~~~ik~ev-------  160 (275)
T PF03643_consen   93 LPEGKT-FPFEFPLVEKPYETYHGVNVNIRYFLRVTVKRSY-KDISKEQEFWVQNF---SITPESNQPIKMEV-------  160 (275)
T ss_dssp             E-S-EE-EEEEE-SB---S--EE-SSEEEEEEEEEEE--SS-S-EEEEEEEEEE-E---B--------EEEEE-------
T ss_pred             cCCCcE-EeeEeCCCCCCCccEeeeEEEEEEEEEEEEEccC-CCcceEEEEEEEec---cCCCCCCCCccccc-------
Confidence            566666 9999988544478999999999999999999999 78899999999843   22255556655543       


Q ss_pred             EccCccEEEEEEeCCeeeeCCCeEE
Q psy8451          83 WRQSGPLTLSVSAPFSGFVPGQTIP  107 (110)
Q Consensus        83 ~~~~g~v~~~~~l~k~~y~pGe~i~  107 (110)
                       --.+.++++..++++.|.-.|.|.
T Consensus       161 -gie~~lhief~~~k~~~~l~d~i~  184 (275)
T PF03643_consen  161 -GIEDCLHIEFEYDKSKYHLKDVIT  184 (275)
T ss_dssp             -CETTTEEEEEEES-SEEETT-EEE
T ss_pred             -CCCccEEEEEEEcccceECCCCEE
Confidence             124566888888888888777653


No 6  
>PF08737 Rgp1:  Rgp1;  InterPro: IPR014848 Rgp1 forms heterodimer with Ric1 (IPR009771 from INTERPRO) which associates with Golgi membranes and functions as a guanyl-nucleotide exchange factor []. 
Probab=97.99  E-value=3.2e-05  Score=58.73  Aligned_cols=57  Identities=23%  Similarity=0.469  Sum_probs=44.2

Q ss_pred             CeecCeee-EEEeEEECCCCCCCceecCCeEEEEEEEEEEeCC---CCCCceEEEEEEEEc
Q psy8451           1 MEIPQGSH-VYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRP---WKFDKTTEAVFSVVT   57 (110)
Q Consensus         1 ~~l~~G~h-~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~---~~~~~~~~~~~~V~~   57 (110)
                      +.|.||+- +|-|++.||.+||+||.|+.-+|.|.+.+.+.+.   ........-+|.|..
T Consensus       113 l~L~pge~k~f~~~~~lP~~lPPsy~g~~i~~~Y~l~vg~~~~~~~~~~~~~~~~Pi~v~~  173 (415)
T PF08737_consen  113 LRLAPGESKSFHFSFTLPKDLPPSYRGKAIKISYSLVVGTQRGSSVNSKVKSIRVPIRVLP  173 (415)
T ss_pred             eEECCCCcEEEEEEEeCCCCCCCCCcCcEEEEEEEEEEEEEEcCCCCcceEEEEeCEEEee
Confidence            46788764 7999999999999999999999999999999982   222333444566654


No 7  
>KOG2717|consensus
Probab=97.02  E-value=0.00094  Score=47.24  Aligned_cols=61  Identities=26%  Similarity=0.458  Sum_probs=48.6

Q ss_pred             ecCeeeEEEeEEECC-----CCCCCceecCCeEEEEEEEEEEeCCC-CCCceEEEEEEEEc-cccCCC
Q psy8451           3 IPQGSHVYPFSYTLP-----PTIPSSFDGEHGRVRYKVTARLDRPW-KFDKTTEAVFSVVT-PYDLNT   63 (110)
Q Consensus         3 l~~G~h~fpF~f~LP-----~~lPsSf~~~~g~I~Y~i~a~i~~~~-~~~~~~~~~~~V~~-~~~~~~   63 (110)
                      +++|.-++||++.|-     .-|-.+|.|.+-+|.|.+++.+.|.. ..+.....+|.|.. ++|+..
T Consensus        84 ~p~G~tEipFelpL~~kge~~~lYETyHGvfiNiqY~LtcdikR~~L~K~ltkt~eFiv~s~pv~l~e  151 (313)
T KOG2717|consen   84 IPPGTTEIPFELPLREKGEGEKLYETYHGVFINIQYLLTCDIKRGYLHKPLTKTMEFIVESGPVDLPE  151 (313)
T ss_pred             CCCCceeeeeeeeeccCCCccEeeeeecceEEEEEEEEEEecccchhcCchhhhheeeeccCCccccc
Confidence            789999999999984     45677888888899999999999984 55667778888854 555543


No 8  
>PF07070 Spo0M:  SpoOM protein;  InterPro: IPR009776 This family consists of several bacterial SpoOM proteins which are thought to control sporulation in Bacillus subtilis.Spo0M exerts certain negative effects on sporulation and its gene expression is controlled by sigmaH [].
Probab=97.01  E-value=0.0027  Score=44.41  Aligned_cols=52  Identities=27%  Similarity=0.328  Sum_probs=41.8

Q ss_pred             eecCe-eeEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCCCceEEEEEEEEc
Q psy8451           2 EIPQG-SHVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKFDKTTEAVFSVVT   57 (110)
Q Consensus         2 ~l~~G-~h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~~   57 (110)
                      +++|| ++++||+|+||-++|-|.    +..+|.++..+|.++..|..-.-++.|..
T Consensus        81 ~I~~ge~~~iPF~~~lP~etPiT~----~~~~v~l~T~LdI~~avD~~D~D~i~V~P  133 (218)
T PF07070_consen   81 TIEPGEEKEIPFSFPLPWETPITE----GGMRVWLRTGLDIAGAVDPGDLDPIEVEP  133 (218)
T ss_pred             EECCCCEEEEeEEEECCCCCCccC----CCcEEEEEEEEEeCCCCCCCCceeEEEeC
Confidence            45565 469999999999999887    78999999999988766666666677754


No 9  
>PF02752 Arrestin_C:  Arrestin (or S-antigen), C-terminal domain;  InterPro: IPR011022 G protein-coupled receptors are a large family of signalling molecules that respond to a wide variety of extracellular stimuli. The receptors relay the information encoded by the ligand through the activation of heterotrimeric G proteins and intracellular effector molecules. To ensure the appropriate regulation of the signalling cascade, it is vital to properly inactivate the receptor. This inactivation is achieved, in part, by the binding of a soluble protein, arrestin, which uncouples the receptor from the downstream G protein after the receptors are phosphorylated by G protein-coupled receptor kinases. In addition to the inactivation of G protein-coupled receptors, arrestins have also been implicated in the endocytosis of receptors and cross talk with other signalling pathways. Arrestin (retinal S-antigen) is a major protein of the retinal rod outer segments. It interacts with photo-activated phosphorylated rhodopsin, inhibiting or 'arresting' its ability to interact with transducin []. The protein binds calcium, and shows similarity in its C terminus to alpha-transducin and other purine nucleotide-binding proteins. In mammals, arrestin is associated with autoimmune uveitis. Arrestins comprise a family of closely-related proteins that includes beta-arrestin-1 and -2, which regulate the function of beta-adrenergic receptors by binding to their phosphorylated forms, impairing their capacity to activate G(S) proteins; Cone photoreceptors C-arrestin (arrestin-X) [], which could bind to phosphorylated red/green opsins; and Drosophila phosrestins I and II, which undergo light-induced phosphorylation, and probably play a role in photoreceptor transduction [, , ].  The crystal structure of bovine retinal arrestin comprises two domains of antiparallel beta-sheets connected through a hinge region and one short alpha-helix on the back of the amino-terminal fold []. The binding region for phosphorylated light-activated rhodopsin is located at the N-terminal domain, as indicated by the docking of the photoreceptor to the three-dimensional structure of arrestin.  The C-terminal domain consists of an immunoglobulin-like beta-sandwich structure. This entry represents proteins with immunoglobulin-like domains that are similar to those found in arrestin.; PDB: 1SUJ_A 3UGX_A 1CF1_B 1AYR_A 3UGU_A 3P2D_B 1ZSH_A 2WTR_B 3GC3_A 1G4R_A ....
Probab=96.21  E-value=0.025  Score=35.43  Aligned_cols=53  Identities=19%  Similarity=0.321  Sum_probs=32.1

Q ss_pred             CeeeEEEeEEECCCCCCCce--ecCCeEEEEEEEEEEeCC-CCCCceEEEEEEEEc
Q psy8451           5 QGSHVYPFSYTLPPTIPSSF--DGEHGRVRYKVTARLDRP-WKFDKTTEAVFSVVT   57 (110)
Q Consensus         5 ~G~h~fpF~f~LP~~lPsSf--~~~~g~I~Y~i~a~i~~~-~~~~~~~~~~~~V~~   57 (110)
                      .+.+.+..++.||.++++|.  .+..-+|.|.|++.+.-. ...+...+-+++|..
T Consensus        78 ~~~~~~~~~l~lP~~~~~s~~~~~~~i~v~Y~l~v~~~~~~~~~~~~~~~PI~I~~  133 (136)
T PF02752_consen   78 SGSFEFNIQLQLPSNLPPSTSTNSRLIQVEYQLEVTVKLSGCTSDLRLELPITIGS  133 (136)
T ss_dssp             TEEEEEEEEE-----B-----CGGGSEEEEEEEEEEEEEETTSEEEEEEEEEEEEB
T ss_pred             CceEEEEEEEcCCCccCcccccCCcEEEEEEEEEEEEEECCceeEEEEEccEEEEe
Confidence            34566669999998888887  678889999999999976 344556666666643


No 10 
>KOG3063|consensus
Probab=95.80  E-value=0.041  Score=39.17  Aligned_cols=64  Identities=22%  Similarity=0.311  Sum_probs=46.0

Q ss_pred             eEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCCCceEEEEEEEEccccCCCCCCCCCceeEEE
Q psy8451           8 HVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKFDKTTEAVFSVVTPYDLNTDMRAKEPINQNV   75 (110)
Q Consensus         8 h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~~~~~~~~~p~~~~p~~~~~   75 (110)
                      .+|||.|.==..=-.|+.|++-+.||.++|++.|... ++...+.|.|..-   ...|+...|..++.
T Consensus       104 ~~fpFeF~~vekpyEsY~G~NV~lrY~lkvTv~Rr~~-di~ke~d~~V~~~---~~~P~~nn~IkmeV  167 (301)
T KOG3063|consen  104 QSFPFEFPHVEKPYESYIGKNVRLRYFLKVTVSRRLT-DIVKEKDLVVHNL---STYPEINNSIKMEV  167 (301)
T ss_pred             ccCCccccccccchhhhcCcceEEEEEEEEEEEechh-hhhhhhheeeEec---ccCCCCCCceeEee
Confidence            4678877644444478889999999999999999865 6677777877654   23366666666543


No 11 
>COG4326 Spo0M Sporulation control protein [General function prediction only]
Probab=92.01  E-value=0.39  Score=33.64  Aligned_cols=51  Identities=27%  Similarity=0.492  Sum_probs=34.5

Q ss_pred             eecCe-eeEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCCCceEEEEEEEE
Q psy8451           2 EIPQG-SHVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKFDKTTEAVFSVV   56 (110)
Q Consensus         2 ~l~~G-~h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~   56 (110)
                      +++|| +|.|||++.||.+.|=|+    |++.-.+.--+|.....|..-+..++|.
T Consensus       101 TIqpgEe~~fpf~l~lP~~tPvT~----G~~~V~v~TgLDI~~aidp~D~D~l~Vr  152 (270)
T COG4326         101 TIQPGEERNFPFELSLPWNTPVTI----GDAKVWVETGLDIALAIDPTDKDILTVR  152 (270)
T ss_pred             EecCCceEeccEEEecCCCCceee----cceeEEEEeccchhccCCCcccceEEEe
Confidence            34555 689999999999999776    6666666666666544343334445554


No 12 
>cd00917 PG-PI_TP The phosphatidylinositol/phosphatidylglycerol transfer protein (PG/PI-TP) has been shown to bind phosphatidylglycerol and phosphatidylinositol, but the biological significance of this is still obscure. These proteins belong to the ML domain family.
Probab=87.74  E-value=2.2  Score=26.87  Aligned_cols=32  Identities=31%  Similarity=0.429  Sum_probs=23.9

Q ss_pred             ecCeeeEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCC
Q psy8451           3 IPQGSHVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRP   42 (110)
Q Consensus         3 l~~G~h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~   42 (110)
                      +++|.+.+..+..||...|+        ++|.+++.+...
T Consensus        79 i~~G~~~~~~~~~ip~~~P~--------g~y~v~~~l~d~  110 (122)
T cd00917          79 IEPGDKFLTKLVDLPGEIPP--------GKYTVSARAYTK  110 (122)
T ss_pred             cCCCcEEEEEEeeCCCCCCC--------ceEEEEEEEECC
Confidence            56788888888888866665        468888887654


No 13 
>PF04425 Bul1_N:  Bul1 N terminus;  InterPro: IPR007519 This domain is the N terminus of Saccharomyces cerevisiae (Baker's yeast) Bul1. Bul1 binds the ubiquitin ligase Rsp5, via an N-terminal PPSY motif (157-160 in P48524 from SWISSPROT) []. The complex containing Bul1 and Rsp5 is involved in intracellular trafficking of the general amino acid permease Gap1 [], degradation of Rog1 in cooperation with Bul2 and GSK-3 [], and mitochondrial inheritance []. Bul1 may contain HEAT repeats. The C terminus is IPR007520 from INTERPRO.
Probab=84.61  E-value=3.4  Score=32.00  Aligned_cols=17  Identities=18%  Similarity=0.391  Sum_probs=13.8

Q ss_pred             eecCe-eeEEEeEEECCC
Q psy8451           2 EIPQG-SHVYPFSYTLPP   18 (110)
Q Consensus         2 ~l~~G-~h~fpF~f~LP~   18 (110)
                      .|.|| .|..+|.|+||.
T Consensus       255 ~l~p~~~Yk~fF~FkiP~  272 (438)
T PF04425_consen  255 ILEPGVKYKKFFTFKIPE  272 (438)
T ss_pred             eecCCCeEeceeEEeCCc
Confidence            45554 699999999996


No 14 
>PF02221 E1_DerP2_DerF2:  ML domain;  InterPro: IPR003172  The MD-2-related lipid-recognition (ML) domain is implicated in lipid recognition, particularly in the recognition of pathogen related products. It has an immunoglobulin-like beta-sandwich fold similar to that of E-set Ig domains. This domain is present in the following proteins:  Epididymal secretory protein E1 (also known as Niemann-Pick C2 protein), which is known to bind cholesterol. Niemann-Pick disease type C2 is a fatal hereditary disease characterised by accumulation of low-density lipoprotein-derived cholesterol in lysosomes [].  House-dust mite allergen proteins such as Der f 2 from Dermatophagoides farinae and Der p 2 from Dermatophagoides pteronyssinus [].  ; PDB: 2AG9_B 1G13_B 2AG2_B 2AG4_A 1TJJ_C 1PU5_C 1PUB_A 2AF9_A 3T6Q_D 3M7O_B ....
Probab=83.57  E-value=5.7  Score=24.83  Aligned_cols=32  Identities=22%  Similarity=0.572  Sum_probs=20.6

Q ss_pred             ecCeee-EEEeEEECCCCCCCceecCCeEEEEEEEEEEeCC
Q psy8451           3 IPQGSH-VYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRP   42 (110)
Q Consensus         3 l~~G~h-~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~   42 (110)
                      +.+|++ .|.+++.+|..+|.-        .|.+++.+...
T Consensus        88 i~~G~~~~~~~~~~i~~~~p~~--------~~~i~~~l~d~  120 (134)
T PF02221_consen   88 IKAGEYYTYTYTIPIPKIYPPG--------KYTIQWKLTDQ  120 (134)
T ss_dssp             BTTTEEEEEEEEEEESTTSSSE--------EEEEEEEEEET
T ss_pred             cCCCcEEEEEEEEEcccceeeE--------EEEEEEEEEeC
Confidence            456766 777777777666643        66666666544


No 15 
>cd00258 GM2-AP GM2 activator protein (GM2-AP) is a non-enzymatic lysosomal protein that acts as cofactor in the sequential degradation of gangliosides. GM2A is an essential cofactor for beta-hexosaminidase A (Hex A) in the enzymatic hydrolysis of GM2 ganglioside to GM3. Mutation of the gene results in the AB variant of Tay-Sachs disease. GM2-AP and similar proteins belong to the ML domain family.
Probab=78.78  E-value=10  Score=25.52  Aligned_cols=37  Identities=30%  Similarity=0.503  Sum_probs=26.1

Q ss_pred             ecCeeeEEEe-EEECC-CCCCCceecCCeEEEEEEEEEEeCCC
Q psy8451           3 IPQGSHVYPF-SYTLP-PTIPSSFDGEHGRVRYKVTARLDRPW   43 (110)
Q Consensus         3 l~~G~h~fpF-~f~LP-~~lPsSf~~~~g~I~Y~i~a~i~~~~   43 (110)
                      +++|+|+.|= +|.|| .+||+...  .|  .|++++.++...
T Consensus       109 ~~~G~y~lp~s~f~lP~~~LPs~l~--~G--~Y~i~~~l~~~g  147 (162)
T cd00258         109 FKEGVYSLPDSTFTLPNVDLPSWLT--NG--NYRITGILMADG  147 (162)
T ss_pred             CCCcceEccceeeecccccCCCccC--CC--cEEEEEEECCCC
Confidence            5789999964 55999 68897664  23  378888886433


No 16 
>cd08544 Reeler Reeler, the N-terminal domain of reelin, F-spondin, and a variety of other proteins. This domain is found at the N-terminus of F-spondin, a protein attached to the extracellular matrix, which plays roles in neuronal development and vascular remodelling. The F-spondin reeler domain has been reported to bind heparin. The reeler domain is also found at the N-terminus of reelin, an extracellular glycoprotein involved in the development of the brain cortex, and in a variety of other eukaryotic proteins with different domain architectures, including the animal ferric-chelate reductase 1 or stromal cell-derived receptor 2, a member of the cytochrome B561 family, which reduces ferric iron before its transport from the endosome to the cytoplasm. Also included is the insect putative defense protein 1, which is expressed upon bacterial infection and appears to contain a single reeler domain.
Probab=78.04  E-value=2.6  Score=26.85  Aligned_cols=22  Identities=23%  Similarity=0.440  Sum_probs=17.9

Q ss_pred             EEEEEEeCCeeeeCCCeEEEEC
Q psy8451          89 LTLSVSAPFSGFVPGQTIPCMI  110 (110)
Q Consensus        89 v~~~~~l~k~~y~pGe~i~i~v  110 (110)
                      .-+.+.++...|.|||.+.|+|
T Consensus        19 ~py~i~~~~~~y~pG~~~~Vtl   40 (135)
T cd08544          19 SPYSITISGNSYVPGETYTVTL   40 (135)
T ss_pred             CCEEEEeCCCEECCCCEEEEEE
Confidence            3577788888999999998875


No 17 
>KOG4469|consensus
Probab=77.24  E-value=8.9  Score=27.43  Aligned_cols=44  Identities=30%  Similarity=0.423  Sum_probs=35.2

Q ss_pred             eecCee-eEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCC
Q psy8451           2 EIPQGS-HVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKF   45 (110)
Q Consensus         2 ~l~~G~-h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~   45 (110)
                      .|.+|+ .+|.++-.||-.=|+||.|..-+-.|++..-+.|-+.+
T Consensus       104 rldpgesksysysevlpiegppsfrgqsvkyvykltigcqrvnsp  148 (391)
T KOG4469|consen  104 RLDPGESKSYSYSEVLPIEGPPSFRGQSVKYVYKLTIGCQRVNSP  148 (391)
T ss_pred             ecCCCccccccceeeeeccCCCccCCceeEEEEEEEeeeEecCCc
Confidence            456665 47889999999999999998888888888888876654


No 18 
>smart00737 ML Domain involved in innate immunity and lipid metabolism. ML (MD-2-related lipid-recognition) is a novel domain identified in MD-1, MD-2, GM2A, Npc2 and multiple proteins of unknown function in plants, animals and fungi. These single-domain proteins were predicted to form a beta-rich fold containing multiple strands, and to mediate diverse biological functions through interacting with specific lipids.
Probab=75.40  E-value=12  Score=22.95  Aligned_cols=31  Identities=32%  Similarity=0.485  Sum_probs=15.9

Q ss_pred             ecCeee-EEEeEEECCCCCCCceecCCeEEEEEEEEEEeC
Q psy8451           3 IPQGSH-VYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDR   41 (110)
Q Consensus         3 l~~G~h-~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~   41 (110)
                      +.+|++ .|..++.+|..+|        .+.|.+++++..
T Consensus        74 l~~G~~~~~~~~~~v~~~~P--------~~~~~v~~~l~d  105 (118)
T smart00737       74 IEKGETVNYTNSLTVPGIFP--------PGKYTVKWELTD  105 (118)
T ss_pred             CCCCeeEEEEEeeEccccCC--------CeEEEEEEEEEc
Confidence            345554 2555555555444        235666666643


No 19 
>PF08694 UFC1:  Ubiquitin-fold modifier-conjugating enzyme 1;  InterPro: IPR014806 Ubiquitin-like (UBL) post-translational modifiers are covalently linked to most, if not all, target protein(s) through an enzymatic cascade analogous to ubiquitylation, consisting of E1 (activating), E2 (conjugating), and E3 (ligating) enzymes. Ubiquitin-fold modifier 1 (Ufm1) a ubiquitin-like protein is activated by a novel E1-like enzyme, Uba5, by forming a high-energy thioester bond. Activated Ufm1 is then transferred to its cognate E2-like enzyme, Ufc1, in a similar thioester linkage. This family represents the E2-like enzyme [].; PDB: 2Z6P_A 2K07_A 2Z6O_A 3EVX_D 3KPA_A.
Probab=68.19  E-value=6.3  Score=25.99  Aligned_cols=19  Identities=16%  Similarity=0.387  Sum_probs=13.3

Q ss_pred             eeeEEEeEEECCCCCCCce
Q psy8451           6 GSHVYPFSYTLPPTIPSSF   24 (110)
Q Consensus         6 G~h~fpF~f~LP~~lPsSf   24 (110)
                      -.|.|.|+|.+|...|.+=
T Consensus        73 ~kYEF~~eFdIP~tYP~t~   91 (161)
T PF08694_consen   73 LKYEFDLEFDIPVTYPTTA   91 (161)
T ss_dssp             EEEEEEEEEE--TTTTTS-
T ss_pred             eeEEEeeecCCCccCCCCC
Confidence            3689999999998888653


No 20 
>PF02014 Reeler:  Reeler domain Schematic picture including Reeler domain;  InterPro: IPR002861 Extracellular matrix (ECM) proteins play an important role in early cortical development, specifically in the formation of neural connections and in controlling the cyto-architecture of the central nervous system. The product of the reeler gene in mouse is reelin,a large extracellular protein secreted by pioneer neurons that coordinates cell positioning during neurodevelopment []. F-spondin and mindin are a family of matrix-attached adhesion molecules that share structural similarities and overlapping domains of expression. Both F-spondin and mindin promote adhesion and outgrowth of hippocampal embryonic neurons and bind to a putative receptor(s) expressed on both hippocampal and sensory neurons []. This domain of unknown function is found at the N terminus of reelin and F-spondin.; PDB: 2ZOT_B 2ZOU_B 3COO_A.
Probab=65.60  E-value=5.7  Score=25.20  Aligned_cols=21  Identities=33%  Similarity=0.494  Sum_probs=14.9

Q ss_pred             EEEEEeCCeeeeCCCeEEEEC
Q psy8451          90 TLSVSAPFSGFVPGQTIPCMI  110 (110)
Q Consensus        90 ~~~~~l~k~~y~pGe~i~i~v  110 (110)
                      -+.+.++...|.||+.+.|+|
T Consensus        20 py~i~~~~~~y~pg~~~~Vtl   40 (132)
T PF02014_consen   20 PYSISVSPSSYEPGQTYTVTL   40 (132)
T ss_dssp             SEEEEET-SSB-TTBEEEEEE
T ss_pred             CEEEEeCCCeEcCCCEEEEEE
Confidence            456666799999999998875


No 21 
>cd02859 AMPKbeta_GBD_like AMP-activated protein kinase (AMPK) beta subunit glycogen binding  domain (GBD). AMPK is a metabolic stress sensing protein that senses AMP/ATP and has recently been found to act as a glycogen sensor as well. The protein functions as a alpha-beta-gamma heterotrimer. This domain is the glycogen binding domain of the beta subunit.
Probab=63.58  E-value=3.9  Score=23.72  Aligned_cols=13  Identities=31%  Similarity=0.639  Sum_probs=9.1

Q ss_pred             eecCeeeEEEeEE
Q psy8451           2 EIPQGSHVYPFSY   14 (110)
Q Consensus         2 ~l~~G~h~fpF~f   14 (110)
                      .|++|.|.|.|-.
T Consensus        42 ~L~~g~y~YkF~V   54 (79)
T cd02859          42 RLPPGKYQYKFIV   54 (79)
T ss_pred             EcCCCCEEEEEEE
Confidence            5777887777743


No 22 
>PF07703 A2M_N_2:  Alpha-2-macroglobulin family N-terminal region;  InterPro: IPR011625 This is a domain of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; PDB: 2QKI_D 3L3O_D 3NMS_A 2ICF_A 2A73_A 2ICE_D 2HR0_A 2A74_A 2XWJ_G 3OHX_A ....
Probab=62.25  E-value=7.5  Score=24.32  Aligned_cols=23  Identities=22%  Similarity=0.447  Sum_probs=19.3

Q ss_pred             cEEEEEEeCCeeeeCCCeEEEEC
Q psy8451          88 PLTLSVSAPFSGFVPGQTIPCMI  110 (110)
Q Consensus        88 ~v~~~~~l~k~~y~pGe~i~i~v  110 (110)
                      ..+++++.++..|.|||++.++|
T Consensus        94 ~~~v~l~~~~~~~~Pg~~~~~~i  116 (136)
T PF07703_consen   94 ELKVELTASPDEYKPGEEVTLRI  116 (136)
T ss_dssp             SSSEEEEESSSSBTTTSEEEEEE
T ss_pred             cceEEEEEecceeCCCCEEEEEE
Confidence            45777888899999999998875


No 23 
>PF07610 DUF1573:  Protein of unknown function (DUF1573);  InterPro: IPR011467 These hypothetical proteins from bacteria, such as Rhodopirellula baltica, Bacteroides thetaiotaomicron and Porphyromonas gingivalis, share a region of conserved sequence towards their N termini.
Probab=59.10  E-value=12  Score=19.21  Aligned_cols=21  Identities=10%  Similarity=0.120  Sum_probs=17.4

Q ss_pred             EEEEEeCCeeeeCCCeEEEEC
Q psy8451          90 TLSVSAPFSGFVPGQTIPCMI  110 (110)
Q Consensus        90 ~~~~~l~k~~y~pGe~i~i~v  110 (110)
                      ...+..++....|||+..|.|
T Consensus        23 Ct~~~~~~~~i~PGes~~i~v   43 (45)
T PF07610_consen   23 CTTAEYSKKPIAPGESGKIKV   43 (45)
T ss_pred             CEEeeCCcceECCCCEEEEEE
Confidence            357788999999999988864


No 24 
>PF08737 Rgp1:  Rgp1;  InterPro: IPR014848 Rgp1 forms heterodimer with Ric1 (IPR009771 from INTERPRO) which associates with Golgi membranes and functions as a guanyl-nucleotide exchange factor []. 
Probab=55.17  E-value=33  Score=26.40  Aligned_cols=32  Identities=13%  Similarity=0.288  Sum_probs=28.6

Q ss_pred             EEEeEEECCCCCCCceecCCeEEEEEEEEEEe
Q psy8451           9 VYPFSYTLPPTIPSSFDGEHGRVRYKVTARLD   40 (110)
Q Consensus         9 ~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~   40 (110)
                      +-+|.+.+|.+.+++|.-....++|.|+....
T Consensus       381 ~~~f~l~IP~~~tp~F~T~~v~lkW~LrfeFv  412 (415)
T PF08737_consen  381 RTSFSLPIPLSATPQFQTSGVSLKWRLRFEFV  412 (415)
T ss_pred             eEEEEeeCCCCCCCceEeCCEEEEEEEEEEEE
Confidence            67899999999999999988899999988764


No 25 
>cd00912 ML The ML (MD-2-related lipid-recognition) domain is present in MD-1, MD-2, GM2 activator protein, Niemann-Pick type C2 (Npc2) protein, phosphatidylinositol/phosphatidylglycerol transfer protein (PG/PI-TP), mite allergen Der p 2  and several proteins of unknown function in plants, animals and fungi. These single-domain proteins form two anti-parallel beta-pleated sheets stabilized by three disulfide bonds and with an accessible central hydrophobic cavity, and are predicted to mediate diverse biological functions through interaction with specific lipids.
Probab=53.00  E-value=17  Score=22.75  Aligned_cols=31  Identities=26%  Similarity=0.472  Sum_probs=21.8

Q ss_pred             ecCe-eeEEEeEEECCC-CCCCceecCCeEEEEEEEEEEeC
Q psy8451           3 IPQG-SHVYPFSYTLPP-TIPSSFDGEHGRVRYKVTARLDR   41 (110)
Q Consensus         3 l~~G-~h~fpF~f~LP~-~lPsSf~~~~g~I~Y~i~a~i~~   41 (110)
                      |.+| .|.|..++.||. ..|        .+.|.++..+..
T Consensus        82 l~~G~~~~~~~~~~v~~~~~P--------~~~~~v~~~l~~  114 (127)
T cd00912          82 LRKGQQYSYAKTVNVPEFTIP--------TIEYQVVLEDVT  114 (127)
T ss_pred             cCCCCEEEEEEEEecCcccCC--------CeeEEEEEEEEc
Confidence            5788 899999998886 555        345666666554


No 26 
>PF08845 SymE_toxin:  Toxin SymE, type I toxin-antitoxin system;  InterPro: IPR014944 This entry represents a SOS-induced gene whose product shows homology to the antitoxin MazE (SymE), the coding region contains a cis-encoded antisense RNA. The small antisense RNA and the gene have the all the hallmarks of a toxin-antitoxin module. The synthesis of the SymE is tightly repressed at multiple levels; at the transcriptional level by the LexA repressor, at the level of mRNA stability and translation by the SymR RNA and at the level of protein stability by the Lon protease. SymE co-purifies with ribosomes and overproduction of the protein leads to cell growth inhibition, decreased protein synthesis and increased RNA degradation. These properties are shared with several RNA endonuclease toxins of toxin-antitoxin modules. It seems probable that the SymE protein represents an evolutionary derivative of a toxin containing the AbrB fold, whose representatives are typically antitoxins. The SymE promoted cleavage of RNA cleavage may be important for the recycling of RNAs damaged under SOS-inducing conditions []. ; GO: 0003723 RNA binding, 0016788 hydrolase activity, acting on ester bonds, 0016070 RNA metabolic process, 0005737 cytoplasm
Probab=48.60  E-value=6.5  Score=21.67  Aligned_cols=17  Identities=18%  Similarity=0.290  Sum_probs=13.7

Q ss_pred             EeCCeeeeCCCeEEEEC
Q psy8451          94 SAPFSGFVPGQTIPCMI  110 (110)
Q Consensus        94 ~l~k~~y~pGe~i~i~v  110 (110)
                      +|...||.+|+.+.|+|
T Consensus        32 WL~~aGF~~G~~v~V~v   48 (57)
T PF08845_consen   32 WLEEAGFTIGDPVKVRV   48 (57)
T ss_pred             hhHHhCCCCCCEEEEEE
Confidence            67788888888888764


No 27 
>PF04729 ASF1_hist_chap:  ASF1 like histone chaperone;  InterPro: IPR006818 This family includes the yeast and human ASF1 protein. These proteins have histone chaperone activity []. ASF1 participates in both the replication-dependent and replication-independent pathways. The structure three-dimensional has been determined as a compact immunoglobulin-like beta sandwich fold topped by three helical linkers [].; GO: 0006333 chromatin assembly or disassembly, 0005634 nucleus; PDB: 2YGV_C 1WG3_A 2HUE_A 2IDC_A 4EO5_A 1ROC_A 1TEY_A 2IO5_A 3AAD_D 2IIJ_A ....
Probab=47.98  E-value=24  Score=23.48  Aligned_cols=39  Identities=33%  Similarity=0.579  Sum_probs=24.7

Q ss_pred             ecCeeeEEEeEEECC--CCCCCc-------------eec-CCeEEEEEEEEEEeC
Q psy8451           3 IPQGSHVYPFSYTLP--PTIPSS-------------FDG-EHGRVRYKVTARLDR   41 (110)
Q Consensus         3 l~~G~h~fpF~f~LP--~~lPsS-------------f~~-~~g~I~Y~i~a~i~~   41 (110)
                      ++.|.|+|-|+-.-|  ..+|.+             +++ ++-+|-|.+......
T Consensus        65 i~~G~~kF~~~adpPd~~~Ip~~dllGvTvillt~sY~~~EFiRVGYYVnney~d  119 (154)
T PF04729_consen   65 IPVGVNKFVFEADPPDPSKIPPEDLLGVTVILLTCSYRGQEFIRVGYYVNNEYPD  119 (154)
T ss_dssp             -ESEEEEEEEEE----GGGSSCCHHSEEEEEEEEEEETTEEEEEEEEEEEEEESS
T ss_pred             CcceeEEEEEEccCCChhhCChhhccccEEEEEEEEECCeEEEEEeeEEEeeeCC
Confidence            578999999999998  577742             222 345777888777654


No 28 
>PF03643 Vps26:  Vacuolar protein sorting-associated protein 26 ;  InterPro: IPR005377  The movement of lipid and protein components between intracellular organelles requires the regulated interactions of many molecules. Vacuolar protein sorting-associated protein (Vps)5 is a yeast protein that is a subunit of a large multimeric complex, termed the retromer complex, involved in retrograde transport of proteins from endosomes to the trans-Golgi network. Sorting nexin (SNX) 1 and SNX2 are its mammalian orthologs []. To carry out its biological functions, Vps5 forms the retromer complex with at least four other proteins: Vps17, Vps26, Vps29, and Vps35 []. This family of Vps26-proteins also contains Down syndrome critical region 3/A.; GO: 0007034 vacuolar transport, 0030904 retromer complex; PDB: 3LHA_A 3LH9_A 2R51_A 3LH8_B 2FAU_A.
Probab=46.38  E-value=68  Score=23.32  Aligned_cols=32  Identities=19%  Similarity=0.287  Sum_probs=22.5

Q ss_pred             EEEeEEECC-CCCCCcee--cCCeEEEEEEEEEEe
Q psy8451           9 VYPFSYTLP-PTIPSSFD--GEHGRVRYKVTARLD   40 (110)
Q Consensus         9 ~fpF~f~LP-~~lPsSf~--~~~g~I~Y~i~a~i~   40 (110)
                      ..||.+.|| -++=+|+.  .+.-+|+|.+.+.+.
T Consensus       237 ~IPirl~l~~l~l~Pt~~~~~~~FsV~y~lnlvli  271 (275)
T PF03643_consen  237 SIPIRLFLPRLFLCPTYKNVNNKFSVEYELNLVLI  271 (275)
T ss_dssp             EEEEEEECCCT-----EEEECTTEEEEEEEEEEEE
T ss_pred             eeeEEEEcCCcccCCcchhcCCcEEEEEEEEEEEE
Confidence            579999999 57777777  456899999998874


No 29 
>cd02861 E_set_proteins_like E or "early" set-like proteins.  These alpha amylase-like sugar utilizing enzymes which may be related to the immunoglobulin and/or fibronectin type III superfamilies are associated with different types of catalytic domains at  either the N-terminal or C-terminal end.  Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=45.85  E-value=15  Score=21.20  Aligned_cols=11  Identities=27%  Similarity=0.522  Sum_probs=5.3

Q ss_pred             ecCeeeEEEeE
Q psy8451           3 IPQGSHVYPFS   13 (110)
Q Consensus         3 l~~G~h~fpF~   13 (110)
                      |++|.|.|.|.
T Consensus        44 l~~G~y~Ykf~   54 (82)
T cd02861          44 LRPGRYEYKFV   54 (82)
T ss_pred             CCCCcEEEEEE
Confidence            44555544443


No 30 
>TIGR03000 plancto_dom_1 Planctomycetes uncharacterized domain TIGR03000. Domains described by this model are found, so far, only in the Planctomycetes (Pirellula sp. strain 1 and Gemmata obscuriglobus), in up to six proteins per genome, and may be duplicated within a protein. The function is unknown.
Probab=43.97  E-value=66  Score=18.80  Aligned_cols=26  Identities=31%  Similarity=0.181  Sum_probs=17.2

Q ss_pred             EEEEEEEEEEeCCCCCCceEEEEEEEE
Q psy8451          30 RVRYKVTARLDRPWKFDKTTEAVFSVV   56 (110)
Q Consensus        30 ~I~Y~i~a~i~~~~~~~~~~~~~~~V~   56 (110)
                      .=+|.++|.+++.. .....++.+.+.
T Consensus        41 ~y~Y~v~a~~~~dG-~~~t~~~~V~vr   66 (75)
T TIGR03000        41 EYEYTVTAEYDRDG-RILTRTRTVVVR   66 (75)
T ss_pred             EEEEEEEEEEecCC-cEEEEEEEEEEc
Confidence            34599999999877 344455555554


No 31 
>PF01835 A2M_N:  MG2 domain;  InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=43.67  E-value=67  Score=18.81  Aligned_cols=24  Identities=29%  Similarity=0.690  Sum_probs=11.0

Q ss_pred             EEEeEEECCCCCCCceecCCeEEEEEEEEEEe
Q psy8451           9 VYPFSYTLPPTIPSSFDGEHGRVRYKVTARLD   40 (110)
Q Consensus         9 ~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~   40 (110)
                      .|.++|+||.+.+.      |  .|.|+|..+
T Consensus        63 ~~~~~~~lp~~~~~------G--~y~i~~~~~   86 (99)
T PF01835_consen   63 IFSGSFQLPDDAPL------G--TYTIRVKTD   86 (99)
T ss_dssp             EEEEEEE--SS---------E--EEEEEEEET
T ss_pred             EEEEEEECCCCCCC------E--eEEEEEEEc
Confidence            34556777766542      2  366666665


No 32 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=42.82  E-value=15  Score=19.83  Aligned_cols=11  Identities=18%  Similarity=0.428  Sum_probs=4.2

Q ss_pred             ecCeeeEEEeE
Q psy8451           3 IPQGSHVYPFS   13 (110)
Q Consensus         3 l~~G~h~fpF~   13 (110)
                      |++|+|+|--.
T Consensus        36 L~~G~Y~l~V~   46 (66)
T PF07495_consen   36 LPPGKYTLEVR   46 (66)
T ss_dssp             --SEEEEEEEE
T ss_pred             CCCEEEEEEEE
Confidence            45555554433


No 33 
>KOG3063|consensus
Probab=41.76  E-value=94  Score=22.54  Aligned_cols=52  Identities=12%  Similarity=0.075  Sum_probs=37.4

Q ss_pred             EEEeEEECC-CCCCCcee--cCCeEEEEEEEEEEeCCCCCCceEEEEEEEEcccc
Q psy8451           9 VYPFSYTLP-PTIPSSFD--GEHGRVRYKVTARLDRPWKFDKTTEAVFSVVTPYD   60 (110)
Q Consensus         9 ~fpF~f~LP-~~lPsSf~--~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~~~~~   60 (110)
                      +.|-...|- -+|-+|+.  .+.-+|+|.+...+..-........++++.-+.-|
T Consensus       244 sIPiRlFLagYdlTPtmrdinkkFsVkyyLnLVlvDeedRRYFKQqEItLwR~~d  298 (301)
T KOG3063|consen  244 SIPIRLFLAGYDLTPTMRDINKKFSVKYYLNLVLVDEEDRRYFKQQEITLWRKAD  298 (301)
T ss_pred             eeeeEEEecccCCCcchhhhcceeeeeeEEEEEEEchhhhhhhhheeEEEEEecc
Confidence            567777776 79999998  57789999999998654444444566777765533


No 34 
>PF06688 DUF1187:  Protein of unknown function (DUF1187);  InterPro: IPR009572 This family consists of several short, hypothetical bacterial proteins of around 62 residues in length. Members of this family are found in Escherichia coli and Salmonella typhi. The function of this family is unknown.
Probab=41.29  E-value=43  Score=18.69  Aligned_cols=14  Identities=36%  Similarity=0.524  Sum_probs=11.6

Q ss_pred             EEEEEEEeCCCCCC
Q psy8451          33 YKVTARLDRPWKFD   46 (110)
Q Consensus        33 Y~i~a~i~~~~~~~   46 (110)
                      |+|+|+|..|....
T Consensus         1 YkItAtI~KpG~~P   14 (61)
T PF06688_consen    1 YKITATIIKPGNTP   14 (61)
T ss_pred             CceEEEEEcCCCCC
Confidence            89999999986543


No 35 
>PF09608 Alph_Pro_TM:  Putative transmembrane protein (Alph_Pro_TM);  InterPro: IPR019088  This entry consists of predicted transmembrane proteins of about 270 amino acids. They are found predominantly, though not exclusively, in alphaproteobacteria, generally only once in each genome. 
Probab=40.98  E-value=1.1e+02  Score=21.72  Aligned_cols=42  Identities=21%  Similarity=0.324  Sum_probs=31.9

Q ss_pred             eEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCCCceEEEEEEEEc
Q psy8451           8 HVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKFDKTTEAVFSVVT   57 (110)
Q Consensus         8 h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~~   57 (110)
                      --|.=++.||.++|.-        .|++++.+-+..+.-...+..|.|.+
T Consensus       155 ~lFra~i~LPanvp~G--------~Y~v~v~l~rdG~vv~~~~~~l~V~K  196 (236)
T PF09608_consen  155 TLFRARIPLPANVPPG--------DYTVRVYLFRDGQVVASQETPLRVRK  196 (236)
T ss_pred             CeEEEEeEcCCCCCcc--------eEEEEEEEEECCEEEEEEeeEEEEEE
Confidence            4577788999999954        48888988888776666677777754


No 36 
>PF08750 CNP1:  CNP1-like family;  InterPro: IPR014861 This group of proteins are likely to be lipoproteins. CNP1 (cryptic neisserial protein) has been expressed in Escherichia coli and shown to be localised periplasmicly []. 
Probab=40.61  E-value=50  Score=21.57  Aligned_cols=15  Identities=40%  Similarity=0.687  Sum_probs=12.1

Q ss_pred             cCCeEEEEEEEEEEe
Q psy8451          26 GEHGRVRYKVTARLD   40 (110)
Q Consensus        26 ~~~g~I~Y~i~a~i~   40 (110)
                      |.+|.|||.+.++=.
T Consensus        47 g~DgvVRY~lv~~S~   61 (139)
T PF08750_consen   47 GPDGVVRYTLVVRSP   61 (139)
T ss_pred             CCCCcEEEEEEEcCC
Confidence            678999999887654


No 37 
>COG3126 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=40.49  E-value=57  Score=21.79  Aligned_cols=35  Identities=34%  Similarity=0.617  Sum_probs=23.3

Q ss_pred             cCeeeEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCC
Q psy8451           4 PQGSHVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWK   44 (110)
Q Consensus         4 ~~G~h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~   44 (110)
                      +.|  .-||.|.||-| |+.++..   -||.+.|.|...++
T Consensus        87 ~~G--q~P~~F~L~fd-p~~i~p~---~ryalsArI~~~gk  121 (158)
T COG3126          87 TEG--QVPFPFVLPFD-PSDIQPN---HRYALSARITVNGK  121 (158)
T ss_pred             ccC--ccceeEEeccC-hhhCCCC---cEEEEEEEEEECCE
Confidence            445  67888888732 2333222   68999999987765


No 38 
>KOG3780|consensus
Probab=40.23  E-value=1.2e+02  Score=22.72  Aligned_cols=50  Identities=18%  Similarity=0.215  Sum_probs=35.3

Q ss_pred             eeeEEEeEEECCCCCCCceec--CCeEEEEEEEEEEeCCC--CCCceEEEEEEEE
Q psy8451           6 GSHVYPFSYTLPPTIPSSFDG--EHGRVRYKVTARLDRPW--KFDKTTEAVFSVV   56 (110)
Q Consensus         6 G~h~fpF~f~LP~~lPsSf~~--~~g~I~Y~i~a~i~~~~--~~~~~~~~~~~V~   56 (110)
                      ..+.|-.+|+||..+| |+..  ..-++.|.+++.+.-..  ..+....-.+.|-
T Consensus       263 ~~~~~~~~~~iP~~~P-s~~~~~~~i~v~y~l~v~~~~~~~~~~~~~l~~pi~ig  316 (427)
T KOG3780|consen  263 SEDKFEKELRIPPVPP-SILPDTPIIRVEYELKVTLKTSSLRHSELALELPIIIG  316 (427)
T ss_pred             CccccceEEEcCCCCC-ccCCCCceEEEEEEEEEEEecCcccccceeeeeceEEe
Confidence            5677788888998888 6663  55699999999998663  3344444455554


No 39 
>PF08875 DUF1833:  Domain of unknown function (DUF1833);  InterPro: IPR014974 This entry is represented by Bacteriophage D3, Orf22. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=39.24  E-value=64  Score=21.02  Aligned_cols=10  Identities=50%  Similarity=0.913  Sum_probs=8.2

Q ss_pred             EEEeEEECCC
Q psy8451           9 VYPFSYTLPP   18 (110)
Q Consensus         9 ~fpF~f~LP~   18 (110)
                      -+||.+.||.
T Consensus        53 a~~f~i~lP~   62 (150)
T PF08875_consen   53 ACPFDITLPD   62 (150)
T ss_pred             EEEEEEEcCc
Confidence            4799999994


No 40 
>PF14347 DUF4399:  Domain of unknown function (DUF4399)
Probab=37.77  E-value=25  Score=21.02  Aligned_cols=14  Identities=29%  Similarity=0.745  Sum_probs=9.3

Q ss_pred             CeecCeeeEEEeEE
Q psy8451           1 MEIPQGSHVYPFSY   14 (110)
Q Consensus         1 ~~l~~G~h~fpF~f   14 (110)
                      |.|+||+|+.--.|
T Consensus        57 I~L~PG~htLtl~~   70 (87)
T PF14347_consen   57 IELPPGKHTLTLQL   70 (87)
T ss_pred             EEeCCCCEEEEEEe
Confidence            46788888765544


No 41 
>PF08111 Pea-VEAacid:  Pea-VEAacid family;  InterPro: IPR012593 This family consists of the PEA-VEAacid neuropeptides family. These neuropeptides are isolated from the abdominal perisympathetic organs of the American cockroach. These peptides are found together with Pea-YLS-amide and Pea-SKNacid, giving a unique neuropeptide pattern in abdominal perisympathetic organs. The functions of these neuropeptides are unknown [].; GO: 0005184 neuropeptide hormone activity, 0007218 neuropeptide signaling pathway, 0005576 extracellular region
Probab=34.69  E-value=15  Score=14.33  Aligned_cols=8  Identities=50%  Similarity=0.866  Sum_probs=4.8

Q ss_pred             eecCeeeE
Q psy8451           2 EIPQGSHV    9 (110)
Q Consensus         2 ~l~~G~h~    9 (110)
                      +|.||.|.
T Consensus         2 tltpgshv    9 (15)
T PF08111_consen    2 TLTPGSHV    9 (15)
T ss_pred             cccCccch
Confidence            45666664


No 42 
>PF10342 GPI-anchored:  Ser-Thr-rich glycosyl-phosphatidyl-inositol-anchored membrane family;  InterPro: IPR018466 This entry represents glycoproteins involved in cell wall (1-->6)-beta-glucan assembly. In yeast a null mutation leads to severe growth defects, aberrant multi-budded morphology, and mating defects [, ]. The entry includes DRMIP and Hesp-379, which are involved in both fruiting body formation and in host attack respectively. Hesp-379 is a haustorially expressed secreted protein; the haustorium being the small sucker that penetrates host tissue []. 
Probab=33.95  E-value=95  Score=17.74  Aligned_cols=20  Identities=25%  Similarity=0.605  Sum_probs=10.5

Q ss_pred             eEEECCCCCCCceecCCeEEEEEEEEE
Q psy8451          12 FSYTLPPTIPSSFDGEHGRVRYKVTAR   38 (110)
Q Consensus        12 F~f~LP~~lPsSf~~~~g~I~Y~i~a~   38 (110)
                      |.+.+|.+++++=       .|.|++.
T Consensus        58 ~~~~~p~~l~~~~-------~Y~i~~~   77 (93)
T PF10342_consen   58 YTWTIPSDLPSGG-------DYFIQIV   77 (93)
T ss_pred             EEEEcCCCCCCCC-------cEEEEEE
Confidence            4445555555322       3666666


No 43 
>PF05688 DUF824:  Salmonella repeat of unknown function (DUF824);  InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=33.78  E-value=24  Score=18.66  Aligned_cols=11  Identities=36%  Similarity=0.534  Sum_probs=7.9

Q ss_pred             eeCCCeEEEEC
Q psy8451         100 FVPGQTIPCMI  110 (110)
Q Consensus       100 y~pGe~i~i~v  110 (110)
                      ..-||+|+++|
T Consensus         9 aK~Ge~I~ltV   19 (47)
T PF05688_consen    9 AKVGETIPLTV   19 (47)
T ss_pred             eecCCeEEEEE
Confidence            34588888875


No 44 
>KOG3262|consensus
Probab=32.38  E-value=96  Score=21.53  Aligned_cols=22  Identities=23%  Similarity=0.392  Sum_probs=14.5

Q ss_pred             ccEEEEEEeCC----eeeeCCCeEEE
Q psy8451          87 GPLTLSVSAPF----SGFVPGQTIPC  108 (110)
Q Consensus        87 g~v~~~~~l~k----~~y~pGe~i~i  108 (110)
                      ..+.++|+++.    +.|.+||++.|
T Consensus       102 ~d~~fsIK~~dgv~assfk~g~k~fi  127 (215)
T KOG3262|consen  102 NDVHFSIKPSDGVQASSFKPGDKLFI  127 (215)
T ss_pred             cccEEEEecCCCceeecccCCCeEEe
Confidence            44566666655    66778887765


No 45 
>PRK09750 hypothetical protein; Provisional
Probab=31.43  E-value=59  Score=18.16  Aligned_cols=13  Identities=31%  Similarity=0.506  Sum_probs=11.3

Q ss_pred             EEEEEEEEeCCCC
Q psy8451          32 RYKVTARLDRPWK   44 (110)
Q Consensus        32 ~Y~i~a~i~~~~~   44 (110)
                      .|.|.|+|..|..
T Consensus         3 kY~I~Ati~KpGg   15 (64)
T PRK09750          3 MYKITATIEKEGG   15 (64)
T ss_pred             eeEEEEEEECCCC
Confidence            6999999999864


No 46 
>KOG1584|consensus
Probab=30.55  E-value=34  Score=25.27  Aligned_cols=25  Identities=28%  Similarity=0.511  Sum_probs=20.3

Q ss_pred             eEEECC-CCCCCceecCCeEEEEEEE
Q psy8451          12 FSYTLP-PTIPSSFDGEHGRVRYKVT   36 (110)
Q Consensus        12 F~f~LP-~~lPsSf~~~~g~I~Y~i~   36 (110)
                      |+=.|| ..||.|+..+.++|-|..+
T Consensus       107 ~kTHlP~~lLp~s~~~~~cKvVYv~R  132 (297)
T KOG1584|consen  107 FKTHLPFQLLPESLKESKCKVVYVCR  132 (297)
T ss_pred             eeccCChhhcchhhhcCCCcEEEEec
Confidence            444578 7899999999999999654


No 47 
>PF01345 DUF11:  Domain of unknown function DUF11;  InterPro: IPR001434 This group of sequences is represented by a conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis (Streptococcus faecalis), and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydia trachomatis outer membrane proteins.  In C. trachomatis, three cysteine-rich proteins (also believed to be lipoproteins), MOMP, OMP6 and OMP3, make up the extracellular matrix of the outer membrane []. They are involved in the essential structural integrity of both the elementary body (EB) and recticulate body (RB) phase. They are thought to be involved in porin formation and, as these bacteria lack the peptidoglycan layer common to most Gram-negative microbes, such proteins are highly important in the pathogenicity of the organism.; GO: 0005727 extrachromosomal circular DNA
Probab=30.53  E-value=82  Score=17.56  Aligned_cols=26  Identities=23%  Similarity=0.271  Sum_probs=19.6

Q ss_pred             cCccEEEEEEeCCeeeeCCCeEEEEC
Q psy8451          85 QSGPLTLSVSAPFSGFVPGQTIPCMI  110 (110)
Q Consensus        85 ~~g~v~~~~~l~k~~y~pGe~i~i~v  110 (110)
                      ....+.+.-+.++....+||.+..+|
T Consensus        22 ~~~~~~~~k~~~~~~~~~Gd~v~yti   47 (76)
T PF01345_consen   22 AIPDLSITKTVNPSTANPGDTVTYTI   47 (76)
T ss_pred             CCCCEEEEEecCCCcccCCCEEEEEE
Confidence            34567777788888889999887654


No 48 
>KOG0420|consensus
Probab=30.46  E-value=51  Score=22.48  Aligned_cols=17  Identities=18%  Similarity=0.528  Sum_probs=12.4

Q ss_pred             cCeeeE---EEeEEECCCCC
Q psy8451           4 PQGSHV---YPFSYTLPPTI   20 (110)
Q Consensus         4 ~~G~h~---fpF~f~LP~~l   20 (110)
                      ..|-|.   |-|+|++|..-
T Consensus        69 dEGyY~gGkf~F~~~v~~~Y   88 (184)
T KOG0420|consen   69 DEGYYQGGKFRFKFKVPNAY   88 (184)
T ss_pred             CcceecCceEEEEEECCCCC
Confidence            457787   88888888543


No 49 
>TIGR02567 YscW type III secretion system chaperone YscW. This family of proteins is found within type III secretion operons. The protein has been characterized as a chaperone for the outer membrane pore component YscC (TIGR02516). YscW is a lipoprotein which is itself localized to the outer membrane and, it is believed, facilitates the oligomerization and localization of YscC.
Probab=28.45  E-value=1e+02  Score=19.72  Aligned_cols=25  Identities=16%  Similarity=0.046  Sum_probs=14.2

Q ss_pred             EEEEEEEEEEeCCCCCCceEEEEEEE
Q psy8451          30 RVRYKVTARLDRPWKFDKTTEAVFSV   55 (110)
Q Consensus        30 ~I~Y~i~a~i~~~~~~~~~~~~~~~V   55 (110)
                      ...| |.|.+.-..+.-..+++.+.|
T Consensus        81 g~~Y-v~A~L~~~g~~~~q~~~~~~v  105 (124)
T TIGR02567        81 GELY-LRGKLSWQGQRAVQAEAQQIF  105 (124)
T ss_pred             CceE-EEEEEEECCEEEEEEeeeeEE
Confidence            3456 888877665544444444444


No 50 
>COG2373 Large extracellular alpha-helical protein [General function prediction only]
Probab=27.52  E-value=47  Score=30.21  Aligned_cols=22  Identities=23%  Similarity=0.327  Sum_probs=19.4

Q ss_pred             EEEEEEeCCeeeeCCCeEEEEC
Q psy8451          89 LTLSVSAPFSGFVPGQTIPCMI  110 (110)
Q Consensus        89 v~~~~~l~k~~y~pGe~i~i~v  110 (110)
                      =.+++.++|..|.+||++.+.|
T Consensus       751 d~l~i~ldK~~Y~~Gdtakl~I  772 (1621)
T COG2373         751 DVLKLKLDKPEYQPGDTAKLLI  772 (1621)
T ss_pred             cceEEecCccccCCCceEEEEe
Confidence            3488999999999999999875


No 51 
>PRK02939 lipoprotein; Reviewed
Probab=27.34  E-value=2.4e+02  Score=20.24  Aligned_cols=26  Identities=19%  Similarity=0.290  Sum_probs=21.1

Q ss_pred             CCCCCceecCCeEEEEEEEEEEeCCC
Q psy8451          18 PTIPSSFDGEHGRVRYKVTARLDRPW   43 (110)
Q Consensus        18 ~~lPsSf~~~~g~I~Y~i~a~i~~~~   43 (110)
                      +++-++....+|+|.|.+.+++++..
T Consensus        47 K~~tqt~~ne~g~v~~~v~~~~~~eG   72 (236)
T PRK02939         47 KDFTQTLMDEQGEVTKRVSGTLSEEG   72 (236)
T ss_pred             eeeEEEEEcCCCcEEEEEEEEEcCCC
Confidence            35556667789999999999998765


No 52 
>TIGR02186 alph_Pro_TM conserved hypothetical protein. This family consists of predicted transmembrane proteins of about 270 amino acids. Members are found, so far, only among the Alphaproteobacteria and only once in each genome.
Probab=26.58  E-value=2.6e+02  Score=20.36  Aligned_cols=40  Identities=28%  Similarity=0.364  Sum_probs=30.6

Q ss_pred             EEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCCCceEEEEEEEEc
Q psy8451          10 YPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKFDKTTEAVFSVVT   57 (110)
Q Consensus        10 fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~~   57 (110)
                      |.=++.||.|+|.-        .|.+++.+-+..+.-...+..|.|.+
T Consensus       182 Fra~i~LPAnvp~G--------~Y~v~v~L~r~G~vv~~~~t~l~V~K  221 (261)
T TIGR02186       182 FRATLRLPANVPNG--------THEVRAYLFRGGVFIARTELALEIVK  221 (261)
T ss_pred             EEEeeecCCCCCCc--------eEEEEEEEEeCCEEEEEEEeEEEEEE
Confidence            66778899999953        48899999888776666677777754


No 53 
>KOG1366|consensus
Probab=26.42  E-value=1.1e+02  Score=27.57  Aligned_cols=28  Identities=14%  Similarity=0.150  Sum_probs=24.4

Q ss_pred             EccCccEEEEEEeCCeeeeCCCeEEEEC
Q psy8451          83 WRQSGPLTLSVSAPFSGFVPGQTIPCMI  110 (110)
Q Consensus        83 ~~~~g~v~~~~~l~k~~y~pGe~i~i~v  110 (110)
                      .+.+-...+-++.||.-|.||+++...|
T Consensus       109 ~~~~~~~~vfIqTDkpiY~pgqtV~~Rv  136 (1436)
T KOG1366|consen  109 LFSQRRGSVFIQTDKPIYKPGQTVRYRV  136 (1436)
T ss_pred             eeecccceEEEecCCCccCCCCEEEEEE
Confidence            4667788999999999999999998764


No 54 
>PF12857 TOBE_3:  TOBE-like domain;  InterPro: IPR024765 The TOBE (transport-associated OB) domain [] always occurs as a dimer and it is found in ABC transporters immediately after the ATPase domain. This entry represents a TOBE-like domain, found in the C terminus of ATPase subunit CysA. CysA is part of the CysATWP ABC transporter complex, involved in sulphate/thiosulphate import [, ]. 
Probab=26.39  E-value=92  Score=16.67  Aligned_cols=24  Identities=21%  Similarity=0.401  Sum_probs=17.2

Q ss_pred             cCccEEEEEEeCCee----eeCCCeEEEE
Q psy8451          85 QSGPLTLSVSAPFSG----FVPGQTIPCM  109 (110)
Q Consensus        85 ~~g~v~~~~~l~k~~----y~pGe~i~i~  109 (110)
                      ++|. .+++.|++..    ..+||++.+.
T Consensus        29 ~~~~-~iEvel~~~~~~l~l~~G~~V~l~   56 (58)
T PF12857_consen   29 DDGE-PIEVELPRERRQLGLQPGDRVYLR   56 (58)
T ss_pred             CCCC-EEEEEeCHhHHhcCCCCCCEEEEE
Confidence            4555 7777777776    7889988764


No 55 
>PF05132 RNA_pol_Rpc4:  RNA polymerase III RPC4;  InterPro: IPR007811 This family comprises a specific subunit for Pol III, the tRNA specific polymerase.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006383 transcription from RNA polymerase III promoter, 0005666 DNA-directed RNA polymerase III complex
Probab=25.15  E-value=54  Score=20.84  Aligned_cols=14  Identities=36%  Similarity=0.790  Sum_probs=11.4

Q ss_pred             eEEECCCCCCCcee
Q psy8451          12 FSYTLPPTIPSSFD   25 (110)
Q Consensus        12 F~f~LP~~lPsSf~   25 (110)
                      |=||||..||.--.
T Consensus         6 ~lfQlP~~LP~~~~   19 (131)
T PF05132_consen    6 FLFQLPPVLPPFKP   19 (131)
T ss_pred             EEEECcCCCCCCCC
Confidence            66999999997654


No 56 
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=24.90  E-value=94  Score=15.71  Aligned_cols=12  Identities=17%  Similarity=0.163  Sum_probs=8.6

Q ss_pred             cCeeeEEEeEEE
Q psy8451           4 PQGSHVYPFSYT   15 (110)
Q Consensus         4 ~~G~h~fpF~f~   15 (110)
                      .||.|+..|+..
T Consensus        11 ~PG~Y~l~~~a~   22 (41)
T TIGR03769        11 KPGTYTLTVQAT   22 (41)
T ss_pred             CCeEEEEEEEEE
Confidence            467777777765


No 57 
>PF12761 End3:  Actin cytoskeleton-regulatory complex protein END3
Probab=24.28  E-value=15  Score=25.45  Aligned_cols=24  Identities=25%  Similarity=0.721  Sum_probs=19.8

Q ss_pred             EEECCCCCCCceec--CCeEEEEEEE
Q psy8451          13 SYTLPPTIPSSFDG--EHGRVRYKVT   36 (110)
Q Consensus        13 ~f~LP~~lPsSf~~--~~g~I~Y~i~   36 (110)
                      -+.||..+|+|...  ....|.|-|.
T Consensus        18 G~rIPr~vPasLrasf~k~~i~Ydl~   43 (195)
T PF12761_consen   18 GYRIPREVPASLRASFEKEQIDYDLN   43 (195)
T ss_pred             CCcCCccCCHHHHHHHhcCCcCcccc
Confidence            36899999999986  4578999887


No 58 
>TIGR01451 B_ant_repeat conserved repeat domain. This model represents the conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis, and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydial outer membrane proteins.
Probab=23.38  E-value=54  Score=17.36  Aligned_cols=13  Identities=23%  Similarity=0.077  Sum_probs=6.0

Q ss_pred             CCeeeeCCCeEEE
Q psy8451          96 PFSGFVPGQTIPC  108 (110)
Q Consensus        96 ~k~~y~pGe~i~i  108 (110)
                      ++....+||.|..
T Consensus         4 d~~~~~~Gd~v~Y   16 (53)
T TIGR01451         4 DKTVATIGDTITY   16 (53)
T ss_pred             CccccCCCCEEEE
Confidence            3444445555443


No 59 
>PF13584 BatD:  Oxygen tolerance
Probab=23.34  E-value=1e+02  Score=23.73  Aligned_cols=26  Identities=8%  Similarity=0.109  Sum_probs=23.1

Q ss_pred             cCccEEEEEEeCCeeeeCCCeEEEEC
Q psy8451          85 QSGPLTLSVSAPFSGFVPGQTIPCMI  110 (110)
Q Consensus        85 ~~g~v~~~~~l~k~~y~pGe~i~i~v  110 (110)
                      .++.+.+++.+++...+.||.+.+++
T Consensus       124 ~~~~~~l~~~v~~~~~Yvge~v~lt~  149 (484)
T PF13584_consen  124 ADDDVFLEAEVSKKSVYVGEPVILTL  149 (484)
T ss_pred             ccccEEEEEEeCCCceecCCcEEEEE
Confidence            46789999999999999999998874


No 60 
>PF13584 BatD:  Oxygen tolerance
Probab=22.34  E-value=89  Score=24.11  Aligned_cols=24  Identities=25%  Similarity=0.253  Sum_probs=21.7

Q ss_pred             ccEEEEEEeCCeeeeCCCeEEEEC
Q psy8451          87 GPLTLSVSAPFSGFVPGQTIPCMI  110 (110)
Q Consensus        87 g~v~~~~~l~k~~y~pGe~i~i~v  110 (110)
                      |.++++.++++..+..||.|.++|
T Consensus       271 g~f~l~~~~~~~~~~~Ge~vt~ti  294 (484)
T PF13584_consen  271 GNFSLSQSWDPTEVKVGEPVTRTI  294 (484)
T ss_pred             eEEEEEEEcCcccccCCCeEEEEE
Confidence            789999999999999999998764


No 61 
>PF02303 Phage_DNA_bind:  Helix-destabilising protein;  InterPro: IPR003512 This entry is represented by the Bacteriophage M13, G5P, DNA-binding protein. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. G5P is the bacteriophage helix-destabilising protein, or single-stranded DNA binding protein, required for DNA synthesis. The protein binds to DNA in a highly cooperative manner without pronounced sequence specificity. In the presence of single-stranded DNA it binds cooperatively to form a helical protein-DNA complex. It prevents the conversion during synthesis of the single-stranded (progeny) viral DNA back into the double-stranded replicative form.; GO: 0003697 single-stranded DNA binding, 0006260 DNA replication; PDB: 1VQI_A 1VQG_A 1AE3_A 1VQA_A 1VQD_A 1VQJ_A 1VQE_A 1YHB_A 1VQF_A 1VQC_A ....
Probab=22.23  E-value=57  Score=19.54  Aligned_cols=15  Identities=20%  Similarity=0.197  Sum_probs=8.9

Q ss_pred             eeeEEEeEEECCCCC
Q psy8451           6 GSHVYPFSYTLPPTI   20 (110)
Q Consensus         6 G~h~fpF~f~LP~~l   20 (110)
                      |.|-.+|.++|-.+-
T Consensus        38 G~fp~~f~v~Le~gq   52 (86)
T PF02303_consen   38 GKFPVEFKVRLEDGQ   52 (86)
T ss_dssp             SSS-EEEEEEE-TTS
T ss_pred             CCccEEEEEEcCCCC
Confidence            567777888776553


No 62 
>PF14734 DUF4469:  Domain of unknown function (DUF4469) with IG-like fold
Probab=21.57  E-value=1.7e+02  Score=17.91  Aligned_cols=22  Identities=23%  Similarity=0.578  Sum_probs=12.4

Q ss_pred             eEEECCCCCCCceecCCeEEEEEEEEEEeC
Q psy8451          12 FSYTLPPTIPSSFDGEHGRVRYKVTARLDR   41 (110)
Q Consensus        12 F~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~   41 (110)
                      ..|.||++|+      .|.  |.|+++=..
T Consensus        66 l~~~lPa~L~------~G~--Y~l~V~Tq~   87 (102)
T PF14734_consen   66 LIFILPADLA------AGE--YTLEVRTQY   87 (102)
T ss_pred             EEEECcCccC------ceE--EEEEEEEEe
Confidence            3455555554      455  777666553


No 63 
>PF05078 DUF679:  Protein of unknown function (DUF679);  InterPro: IPR007770 This family contains uncharacterised plant proteins of unknown function.
Probab=21.32  E-value=66  Score=21.81  Aligned_cols=16  Identities=38%  Similarity=0.740  Sum_probs=13.0

Q ss_pred             CceecCCeEEEEEEEE
Q psy8451          22 SSFDGEHGRVRYKVTA   37 (110)
Q Consensus        22 sSf~~~~g~I~Y~i~a   37 (110)
                      -||.+.+|+++|-+--
T Consensus        58 DS~~~~dGkvyYG~aT   73 (170)
T PF05078_consen   58 DSFRGSDGKVYYGFAT   73 (170)
T ss_pred             ceeECCCCCEEEEEEE
Confidence            4778899999998754


No 64 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=21.09  E-value=1.8e+02  Score=17.09  Aligned_cols=25  Identities=32%  Similarity=0.691  Sum_probs=16.8

Q ss_pred             EeEEECCCCCCCceecCC-eEEEEEEEEE
Q psy8451          11 PFSYTLPPTIPSSFDGEH-GRVRYKVTAR   38 (110)
Q Consensus        11 pF~f~LP~~lPsSf~~~~-g~I~Y~i~a~   38 (110)
                      +|+|++|.+   +|.+.. ..+.|.+...
T Consensus        14 ~f~~~i~~~---tF~d~d~~~lty~~~~~   39 (97)
T smart00736       14 SFSYTIPSS---TFTDADGDTLTYSATLS   39 (97)
T ss_pred             eEEEEcChh---ceECCCCCeEEEEEEeC
Confidence            788888866   555433 4788887643


No 65 
>KOG3265|consensus
Probab=20.99  E-value=1.5e+02  Score=21.25  Aligned_cols=37  Identities=38%  Similarity=0.688  Sum_probs=25.2

Q ss_pred             ecCeeeEEEeEEECC--CCCCC-------------ceec-CCeEEEEEEEEEE
Q psy8451           3 IPQGSHVYPFSYTLP--PTIPS-------------SFDG-EHGRVRYKVTARL   39 (110)
Q Consensus         3 l~~G~h~fpF~f~LP--~~lPs-------------Sf~~-~~g~I~Y~i~a~i   39 (110)
                      +|.|.|.|-|+-.=|  ..+|-             ||.| .+-+|-|.|....
T Consensus        65 VP~G~~~FVf~AD~Pd~~kIP~~d~vGVTviLltC~Y~gQEFIRvGYyVnNeY  117 (250)
T KOG3265|consen   65 VPVGRHKFVFQADAPDPSKIPEDDIVGVTVILLTCSYRGQEFIRVGYYVNNEY  117 (250)
T ss_pred             ccccceEEEEecCCCCcccCcccceeeeEEEEEEEEEcCceeEEEEEEecCCC
Confidence            578999999998887  45552             3344 3457777776554


Done!