Query         psy2082
Match_columns 314
No_of_seqs    230 out of 568
Neff          6.4 
Searched_HMMs 46136
Date          Fri Aug 16 23:50:19 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy2082.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2082hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG3865|consensus              100.0  5E-113  1E-117  789.9  30.1  305    1-306     1-354 (402)
  2 KOG3780|consensus               99.9 5.5E-25 1.2E-29  216.6  30.6  258   18-299     8-317 (427)
  3 PF02752 Arrestin_C:  Arrestin   99.7 3.4E-17 7.4E-22  135.1  14.3  110  193-302     2-136 (136)
  4 PF00339 Arrestin_N:  Arrestin   99.6 1.5E-14 3.3E-19  121.4  14.3  122   18-161     2-133 (149)
  5 PF03643 Vps26:  Vacuolar prote  98.4 0.00018   4E-09   68.0  24.9  209   32-282    37-272 (275)
  6 PF13002 LDB19:  Arrestin_N ter  98.2 4.9E-05 1.1E-09   67.8  13.2  123  105-242    47-181 (191)
  7 PF08737 Rgp1:  Rgp1;  InterPro  97.2   0.077 1.7E-06   53.1  20.8   49  192-240   300-348 (415)
  8 PF00339 Arrestin_N:  Arrestin   95.3   0.025 5.4E-07   46.9   4.4  100  199-300     2-145 (149)
  9 PF07070 Spo0M:  SpoOM protein;  94.9    0.39 8.6E-06   44.0  11.2   91   29-150    23-116 (218)
 10 PF01835 A2M_N:  MG2 domain;  I  88.8       1 2.2E-05   35.2   5.2   27  198-224     2-28  (99)
 11 KOG3063|consensus               88.2      22 0.00049   33.2  18.8  211   46-281    42-278 (301)
 12 KOG2717|consensus               88.0      23  0.0005   33.1  19.1  126  106-240    85-222 (313)
 13 KOG3780|consensus               87.7     9.8 0.00021   37.5  12.6  100  198-299     7-156 (427)
 14 PF01345 DUF11:  Domain of unkn  85.8     3.2 6.9E-05   31.0   6.2   42  192-233    22-63  (76)
 15 PF00207 A2M:  Alpha-2-macroglo  85.0     2.9 6.3E-05   32.5   5.8   41  192-234    52-92  (92)
 16 TIGR01451 B_ant_repeat conserv  83.5     2.5 5.4E-05   29.9   4.4   32  202-233     3-34  (53)
 17 PF03643 Vps26:  Vacuolar prote  82.1      25 0.00054   33.4  11.9   90  206-299    34-145 (275)
 18 PF02752 Arrestin_C:  Arrestin   78.0      33 0.00072   27.3  10.1  106   17-150     7-115 (136)
 19 PF00927 Transglut_C:  Transglu  73.5     9.6 0.00021   30.2   5.6   38  197-235     2-39  (107)
 20 PF04425 Bul1_N:  Bul1 N termin  68.2      32  0.0007   34.9   9.0  118   30-149   160-345 (438)
 21 PF10633 NPCBM_assoc:  NPCBM-as  68.1     5.3 0.00011   30.0   2.8   28  208-235     2-29  (78)
 22 PF07070 Spo0M:  SpoOM protein;  67.5   1E+02  0.0022   28.3  13.4   50  191-240     8-58  (218)
 23 PF05688 DUF824:  Salmonella re  67.3     8.8 0.00019   26.8   3.5   30  208-237    10-39  (47)
 24 PF07705 CARDB:  CARDB;  InterP  59.8      36 0.00079   25.7   6.2   40  196-235     4-43  (101)
 25 PF06030 DUF916:  Bacterial pro  58.5      23  0.0005   29.3   5.1   27  208-235    24-50  (121)
 26 PF09624 DUF2393:  Protein of u  58.2      28 0.00061   29.4   5.8   38  196-234    48-85  (149)
 27 PF07703 A2M_N_2:  Alpha-2-macr  55.7      17 0.00036   29.7   3.9   30  195-224    94-123 (136)
 28 PF09478 CBM49:  Carbohydrate b  55.5      31 0.00067   26.2   5.0   39  197-235     2-41  (80)
 29 PF04744 Monooxygenase_B:  Mono  54.3      14  0.0003   36.6   3.5   42  198-242   249-291 (381)
 30 PF00963 Cohesin:  Cohesin doma  51.9      43 0.00093   27.8   5.8   37  198-235     1-37  (141)
 31 PF05326 SVA:  Seminal vesicle   47.8      37 0.00081   28.5   4.7   61  192-254    30-90  (124)
 32 cd00258 GM2-AP GM2 activator p  47.2      72  0.0016   28.1   6.5   78   52-149    62-143 (162)
 33 smart00809 Alpha_adaptinC2 Ada  45.1      72  0.0016   24.7   5.9   40  192-235     3-42  (104)
 34 TIGR03079 CH4_NH3mon_ox_B meth  45.1      30 0.00065   34.3   4.2   47  194-243   264-311 (399)
 35 COG5554 NifU Nitrogen fixation  44.5      61  0.0013   23.9   4.7   39    6-54      2-40  (69)
 36 PF13199 Glyco_hydro_66:  Glyco  44.2      38 0.00082   35.4   5.1   27  201-227     1-27  (559)
 37 PF02014 Reeler:  Reeler domain  43.1      53  0.0011   27.1   5.0   38  194-235    19-56  (132)
 38 PF07703 A2M_N_2:  Alpha-2-macr  43.1      31 0.00068   28.0   3.6   25  198-222     1-25  (136)
 39 PF06159 DUF974:  Protein of un  41.4      37 0.00081   31.6   4.2   29  207-235    10-38  (249)
 40 PF04426 Bul1_C:  Bul1 C termin  39.4      62  0.0013   30.7   5.4   36  262-298   235-270 (272)
 41 COG4326 Spo0M Sporulation cont  39.3      28  0.0006   32.0   2.8   20  107-126   104-124 (270)
 42 COG2373 Large extracellular al  38.5 1.5E+02  0.0033   35.1   9.2   43  191-233   493-535 (1621)
 43 KOG0452|consensus               37.5      35 0.00077   35.8   3.6   88  144-246   794-881 (892)
 44 PF02883 Alpha_adaptinC2:  Adap  35.0 1.5E+02  0.0032   23.5   6.3   44  189-234     4-47  (115)
 45 PRK12676 bifunctional inositol  34.6      27 0.00058   32.4   2.1   49   31-83     84-132 (263)
 46 KOG2540|consensus               34.6      74  0.0016   29.5   4.8   28  204-231   155-183 (269)
 47 cd08548 Type_I_cohesin_like Ty  33.8      91   0.002   26.1   5.0   39  199-237     2-40  (135)
 48 PF13595 DUF4138:  Domain of un  33.3      58  0.0013   30.4   4.1   36  198-234   130-167 (246)
 49 cd08544 Reeler Reeler, the N-t  32.6      89  0.0019   25.7   4.8   37  198-236    21-57  (135)
 50 COG4326 Spo0M Sporulation cont  32.5 1.2E+02  0.0026   27.9   5.8   43  192-234    29-72  (270)
 51 cd01641 Bacterial_IMPase_like_  30.3      37 0.00081   31.0   2.3   46   33-82     77-122 (248)
 52 PF06988 NifT:  NifT/FixU prote  30.3      50  0.0011   24.6   2.5   21    7-27      3-23  (64)
 53 cd01639 IMPase IMPase, inosito  30.3      34 0.00074   31.1   2.0   47   32-82     79-125 (244)
 54 PF11355 DUF3157:  Protein of u  30.0 1.3E+02  0.0028   27.4   5.5   36  198-234   100-135 (199)
 55 PRK10757 inositol monophosphat  29.3      37 0.00081   31.5   2.1   47   32-82     82-128 (267)
 56 TIGR02934 nifT_nitrog probable  28.9      57  0.0012   24.5   2.6   18   10-27      6-23  (67)
 57 PF07919 Gryzun:  Gryzun, putat  28.7 1.7E+02  0.0037   29.8   7.0   39  191-229   168-208 (554)
 58 TIGR01331 bisphos_cysQ 3'(2'),  27.9      35 0.00075   31.3   1.6   47   32-82     80-126 (249)
 59 PF14796 AP3B1_C:  Clathrin-ada  27.9 1.6E+02  0.0034   25.4   5.5   44  191-234    64-108 (145)
 60 PLN02911 inositol-phosphate ph  27.2      40 0.00087   32.0   2.0   48   31-82    112-159 (296)
 61 cd01515 Arch_FBPase_1 Archaeal  27.2      40 0.00087   31.1   1.9   48   31-82     79-128 (257)
 62 cd01517 PAP_phosphatase PAP-ph  26.1      55  0.0012   30.4   2.7   47   31-82     75-128 (274)
 63 cd08546 cohesin_like Cohesin d  25.9 2.1E+02  0.0046   22.8   5.9   35  198-235     3-37  (135)
 64 TIGR02067 his_9_proposed histi  24.9      44 0.00095   30.6   1.7   47   32-82     78-124 (251)
 65 PLN02553 inositol-phosphate ph  24.4      52  0.0011   30.5   2.1   47   32-82     89-135 (270)
 66 PF12389 Peptidase_M73:  Camely  24.4 1.6E+02  0.0034   26.8   5.1   30  205-234    59-88  (199)
 67 COG2309 AmpS Leucyl aminopepti  23.9 1.5E+02  0.0033   29.6   5.4   54   10-64    193-267 (385)
 68 cd01643 Bacterial_IMPase_like_  23.7      52  0.0011   30.1   2.0   47   32-82     75-121 (242)
 69 PRK03298 hypothetical protein;  23.6 5.9E+02   0.013   23.6   8.7   46  138-183     7-55  (224)
 70 PRK05089 cytochrome C oxidase   22.5 1.2E+02  0.0027   27.2   4.0   25  207-231    90-114 (188)
 71 TIGR03780 Bac_Flav_CT_N Bacter  21.9 1.2E+02  0.0027   29.0   4.1   37  198-235   169-207 (285)
 72 KOG2625|consensus               21.7      53  0.0012   30.6   1.6   28  207-234    11-38  (348)
 73 PRK10931 adenosine-3'(2'),5'-b  21.3      52  0.0011   30.1   1.4   47   32-82     81-127 (246)
 74 PF04442 CtaG_Cox11:  Cytochrom  20.7   1E+02  0.0022   26.8   3.0   25  207-231    63-87  (152)
 75 cd08908 START_STARD12-like C-t  20.1 2.5E+02  0.0054   25.4   5.6   36    4-39     84-127 (204)
 76 cd08759 Type_III_cohesin_like   20.1 2.2E+02  0.0048   24.9   5.0   35  198-234     3-37  (160)

No 1  
>KOG3865|consensus
Probab=100.00  E-value=5.2e-113  Score=789.91  Aligned_cols=305  Identities=54%  Similarity=0.930  Sum_probs=294.7

Q ss_pred             CCccceEEEeecCCCcEEEEecceeeEecCCCceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEE
Q psy2082           1 MVANFKVFKKCSPNGKLTLYMGKRDFVDYVTGVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYL   80 (314)
Q Consensus         1 ~~~~~~vfkk~s~n~~~t~yl~~rd~~d~~~~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~   80 (314)
                      |.+++|||||+|||||||+||||||||||++.||||||||+|||||+|+||||+||+|+|||||||+|||||+||||||+
T Consensus         1 ~~~~~rVFKK~SpNgkiT~YLgkRDFvDhvd~vdPvDGvVlvDpeYlK~RKvfv~L~caFRYGREDldVlGLtFrKdL~~   80 (402)
T KOG3865|consen    1 MKVGTRVFKKASPNGKITVYLGKRDFVDHVDQVDPVDGVVLVDPEYLKDRKVFVQLTCAFRYGREDLDVLGLTFRKDLYL   80 (402)
T ss_pred             CCcceeEeEecCCCCcEEEEecccccccccccccccceeEEEChHHhccceEEEEEEeeeecccccceeeeeEEEeeeEE
Confidence            67899999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             EEEEEecCCCCCCCCcchhhhhhhccCCCeeEeEEeeCCCCCCCeEEecCCCCCCCCCeEEEEEEEEEEccCCccCcccc
Q psy2082          81 ASEQIYPRSEKQHTSLTKMQDCLLKKLGPKAYPFTFNITPAAPPSVTLQPGPDETGEPCGVTYYVKLFVGENETDRSHRR  160 (314)
Q Consensus        81 ~~~qv~p~~~~~~~~lt~lq~~l~~k~G~h~fPF~f~LP~~lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~~~~~dr~~K~  160 (314)
                      ++.|+||+.+ ...|||+|||||++|+|.|+|||.|++|+++|||++|||+++|.|++|||+|+||||++++.+||+|||
T Consensus        81 ~~~Qv~Pp~~-~~~plT~lQErLlkKLG~nAyPF~f~~pp~~P~SVtLQp~p~D~gKpcGVdyevkaF~~~s~edk~hKr  159 (402)
T KOG3865|consen   81 ATVQVYPPPE-DSRPLTRLQERLLKKLGSNAYPFTFEFPPNLPCSVTLQPGPEDTGKPCGVDYEVKAFVADSEEDKIHKR  159 (402)
T ss_pred             EEEEeeCCCc-CCCcccHHHHHHHHHhCCCCCceEEeCCCCCCceEEeccCCccCCCcccceEEEEEEecCCcccccccc
Confidence            9999999843 356799999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             ceEEEeeeeccCCCCCCCCCCeEEEeEEEEeeCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEEEeEEE
Q psy2082         161 STVALGIRKIQYAPSKNGRQPCTVVRKDFMLSPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDV  240 (314)
Q Consensus       161 ~~v~l~irkl~~~p~~~~~~p~~e~~k~f~~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~  240 (314)
                      |+|+|+|||+||||..++++|+++++|.|+||+|+|+|+|+|||++|||||+|.+||+|+|||+|+||+||+.+.|++++
T Consensus       160 ~sVrL~IRKvqyAP~~~GpqP~~~v~k~FlmS~~~lhLevsLDkEiYyHGE~isvnV~V~NNsnKtVKkIK~~V~Q~adi  239 (402)
T KOG3865|consen  160 NSVRLVIRKVQYAPLEPGPQPSAEVSKQFLMSDGPLHLEVSLDKEIYYHGEPISVNVHVTNNSNKTVKKIKISVRQVADI  239 (402)
T ss_pred             cceeeeeeeeeecCCCCCCCchhHhhHhhccCCCceEEEEEecchheecCCceeEEEEEecCCcceeeeeEEEeEeeceE
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             EEEecCeEEEEEEEEeec------------------------------------------------cCCCCCCCceeEEE
Q psy2082         241 VLFQNGQFRHSVDCVETH------------------------------------------------LASPDPKDAFGIVV  272 (314)
Q Consensus       241 ~~~~~~~~~~~Va~~E~~------------------------------------------------L~~~~~~~~~~I~V  272 (314)
                      ++|++++|+|+||.+|+.                                                +.+++.++.+||.|
T Consensus       240 ~Lfs~aqy~~~VA~~E~~eGc~v~Pgstl~Kvf~l~PllanN~dkrGlALDG~lKhEDtnLASSTii~~~~~re~lGI~V  319 (402)
T KOG3865|consen  240 CLFSTAQYKKPVAMEETDEGCPVAPGSTLSKVFTLTPLLANNKDKRGLALDGKLKHEDTNLASSTIIREGADREALGILV  319 (402)
T ss_pred             EEEecccccceeeeeecccCCccCCCCeeeeeEEechhhhcCcccccccccccccccccccchhheecCCCCcceeEEEE
Confidence            999999999999999876                                                26778899999999


Q ss_pred             EEEEEEEEEECC-CCceeEEEEeEEEecCCCCCcc
Q psy2082         273 SYMVKVKLYLGA-LGGELSAELPFILMHPKVRLKM  306 (314)
Q Consensus       273 sY~vkV~l~v~~-~~~~v~velPi~I~~p~p~~~~  306 (314)
                      ||+|+|+|.+++ .++++.+||||+||||+|.+++
T Consensus       320 sY~VkVkL~vs~ll~ge~~~ElPF~LmhPkP~~~p  354 (402)
T KOG3865|consen  320 SYKVKVKLVVSRLLGGEVAAELPFTLMHPKPGEEP  354 (402)
T ss_pred             EEEEEEEEEEecccCCceeeecceEEecCCCCCCc
Confidence            999999999994 7899999999999999997753


No 2  
>KOG3780|consensus
Probab=99.95  E-value=5.5e-25  Score=216.56  Aligned_cols=258  Identities=26%  Similarity=0.339  Sum_probs=178.7

Q ss_pred             EEEecceeeEecCCCceeeeeEEEEe-cceeeceeEEEEEEEeeeccccccc---------------ccCcccc-ceEEE
Q psy2082          18 TLYMGKRDFVDYVTGVEPIDGVVVFE-EDYIQDRKVFGQVVCSFRYGREEDE---------------ILGLNFQ-KELYL   80 (314)
Q Consensus        18 t~yl~~rd~~d~~~~vd~v~GvV~vd-~~~lk~rkv~~~~~~~fryG~~d~d---------------v~gl~f~-k~l~~   80 (314)
                      .+.|..++  --...-++|.|-|++. .+.++.|.|.+++.+..+= .|...               --...|. +..|+
T Consensus         8 ~i~~d~~~--~iy~~G~~vsG~v~l~~~~~~~~~~i~l~~~G~~~t-~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~   84 (427)
T KOG3780|consen    8 EIVLDNPE--AIYFPGEPVSGSVVLSTKEPIKVRAIKLQLKGRART-SWSESERGTKLNSKSEGSIKSSTVNYTAKETYL   84 (427)
T ss_pred             EEEeCCCc--cccCCCCeEEEEEEEEeCCccceeEEEEEEEEeEEE-eecccccccccccccccccccceEEeeceEEEe
Confidence            34444444  1224558999999999 8889999999998877752 11110               0111222 23333


Q ss_pred             -EEEEEecCCCCCCCCcchhhhhhhccCCCeeEeEEeeCCCCCCCeEEecCCCCCCCCCeEEEEEEEEEEccCCccCccc
Q psy2082          81 -ASEQIYPRSEKQHTSLTKMQDCLLKKLGPKAYPFTFNITPAAPPSVTLQPGPDETGEPCGVTYYVKLFVGENETDRSHR  159 (314)
Q Consensus        81 -~~~qv~p~~~~~~~~lt~lq~~l~~k~G~h~fPF~f~LP~~lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~~~~~dr~~K  159 (314)
                       ....+|-..+....        .+..+|.|.|||+|+||.+||+||+        |++|+|||.|+|.+     ||+|+
T Consensus        85 ~~~~~~~~~~~g~~~--------~~l~~G~~~~pF~~~LP~~~P~Sfe--------g~~G~irY~vk~~i-----dr~~~  143 (427)
T KOG3780|consen   85 DSKTILWTSSNGSNS--------RVLPPGNYEFPFSFTLPLNLPPSFE--------GKFGHVRYFVKAEI-----DRPWK  143 (427)
T ss_pred             eeeeEEeeccCCCCc--------eecCCCceEEeEeccCCCCCCCcee--------eCCceEEEEEEEEE-----ecCCC
Confidence             34555552221111        0345999999999999999999999        99999999999995     78999


Q ss_pred             cceE-EE---eeee--ccCCCCCCCCCCeE--EEeEEEEeeCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEE
Q psy2082         160 RSTV-AL---GIRK--IQYAPSKNGRQPCT--VVRKDFMLSPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIK  231 (314)
Q Consensus       160 ~~~v-~l---~irk--l~~~p~~~~~~p~~--e~~k~f~~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ik  231 (314)
                      .+.. +.   +++.  ++..|....+.-..  +...+++|..|++.++++|++.+|++||.|.+++.|+|.|++.+++++
T Consensus       144 ~~~~~~~~~~V~~~~~ln~~p~~~~~~~~~~~k~~~~~~~~~g~v~~~~~ip~~~~~~ge~i~~~~~i~n~ss~~~~~~~  223 (427)
T KOG3780|consen  144 LNKKNRKPFTVIETVDLNSSPSLLEPIISKASKKLGCVCFSSGPVSLELTIPKTGYVPGETIPVTLEIENKSSRTIKKVK  223 (427)
T ss_pred             CCccceeeEEEecccccccCccccCcchhhhhheeeEEEecCCcEEEEEEcccccCcCCccEEEEEEEecCCCCcceeeE
Confidence            8842 22   2222  23233333222111  112235689999999999999999999999999999999999999999


Q ss_pred             EEEEEeEEEEEEecC---eE---EEEEEEEeec----------------c---CCCCCCCceeEEEEEEEEEEEEECCC-
Q psy2082         232 AMVQQGVDVVLFQNG---QF---RHSVDCVETH----------------L---ASPDPKDAFGIVVSYMVKVKLYLGAL-  285 (314)
Q Consensus       232 v~L~Q~~~~~~~~~~---~~---~~~Va~~E~~----------------L---~~~~~~~~~~I~VsY~vkV~l~v~~~-  285 (314)
                      ++|.|.+++.++..+   +.   .+.++.....                +   .|.....|.+|.|+|.++|.+.++++ 
T Consensus       224 ~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~iP~~~Ps~~~~~~~i~v~y~l~v~~~~~~~~  303 (427)
T KOG3780|consen  224 AKLIQKISYLAFSYGEHTKTKKSEKTLIKSRGSLEVAPRSEDKFEKELRIPPVPPSILPDTPIIRVEYELKVTLKTSSLR  303 (427)
T ss_pred             EEEEEEEEEEeecCCccccceeeeeEEeeeccccccCCCCccccceEEEcCCCCCccCCCCceEEEEEEEEEEEecCccc
Confidence            999999999988532   11   1223221111                2   33344457899999999999999743 


Q ss_pred             CceeEEEEeEEEec
Q psy2082         286 GGELSAELPFILMH  299 (314)
Q Consensus       286 ~~~v~velPi~I~~  299 (314)
                      ++++.+++|++|++
T Consensus       304 ~~~~~l~~pi~igt  317 (427)
T KOG3780|consen  304 HSELALELPIIIGT  317 (427)
T ss_pred             ccceeeeeceEEec
Confidence            68999999999996


No 3  
>PF02752 Arrestin_C:  Arrestin (or S-antigen), C-terminal domain;  InterPro: IPR011022 G protein-coupled receptors are a large family of signalling molecules that respond to a wide variety of extracellular stimuli. The receptors relay the information encoded by the ligand through the activation of heterotrimeric G proteins and intracellular effector molecules. To ensure the appropriate regulation of the signalling cascade, it is vital to properly inactivate the receptor. This inactivation is achieved, in part, by the binding of a soluble protein, arrestin, which uncouples the receptor from the downstream G protein after the receptors are phosphorylated by G protein-coupled receptor kinases. In addition to the inactivation of G protein-coupled receptors, arrestins have also been implicated in the endocytosis of receptors and cross talk with other signalling pathways. Arrestin (retinal S-antigen) is a major protein of the retinal rod outer segments. It interacts with photo-activated phosphorylated rhodopsin, inhibiting or 'arresting' its ability to interact with transducin []. The protein binds calcium, and shows similarity in its C terminus to alpha-transducin and other purine nucleotide-binding proteins. In mammals, arrestin is associated with autoimmune uveitis. Arrestins comprise a family of closely-related proteins that includes beta-arrestin-1 and -2, which regulate the function of beta-adrenergic receptors by binding to their phosphorylated forms, impairing their capacity to activate G(S) proteins; Cone photoreceptors C-arrestin (arrestin-X) [], which could bind to phosphorylated red/green opsins; and Drosophila phosrestins I and II, which undergo light-induced phosphorylation, and probably play a role in photoreceptor transduction [, , ].  The crystal structure of bovine retinal arrestin comprises two domains of antiparallel beta-sheets connected through a hinge region and one short alpha-helix on the back of the amino-terminal fold []. The binding region for phosphorylated light-activated rhodopsin is located at the N-terminal domain, as indicated by the docking of the photoreceptor to the three-dimensional structure of arrestin.  The C-terminal domain consists of an immunoglobulin-like beta-sandwich structure. This entry represents proteins with immunoglobulin-like domains that are similar to those found in arrestin.; PDB: 1SUJ_A 3UGX_A 1CF1_B 1AYR_A 3UGU_A 3P2D_B 1ZSH_A 2WTR_B 3GC3_A 1G4R_A ....
Probab=99.75  E-value=3.4e-17  Score=135.15  Aligned_cols=110  Identities=23%  Similarity=0.325  Sum_probs=82.5

Q ss_pred             CCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEEEeEEEEEEec----CeEEEEEEEEeec----------
Q psy2082         193 PGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDVVLFQN----GQFRHSVDCVETH----------  258 (314)
Q Consensus       193 ~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~~~~~~----~~~~~~Va~~E~~----------  258 (314)
                      +|++++++++||.+|.|||.|+++++|+|.|++.|++|+++|.|..+|.+...    ....+.+++....          
T Consensus         2 ~g~i~~~~~i~~~~~~~Ge~i~v~v~i~n~s~~~i~~I~v~L~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~   81 (136)
T PF02752_consen    2 SGKISLSISIPRTAYVPGETIPVNVEIDNQSKKKIKKIKVSLVERITYKAKGGKDESKSEKRVVAKSKNCGVDPGSSGSF   81 (136)
T ss_dssp             TEEEEEEEEES-SEEETT--EEEEEEEEE-SSSEEEEEEEEEEEEEEE-SS----S-EEEEEEEEEEECCEB-B-TTEEE
T ss_pred             CCEEEEEEEECCCEECCCCEEEEEEEEEECCCCEEEEEEEEEEEEEEEEEeeccccceEEEEEEEEEecCCccCCCCceE
Confidence            69999999999999999999999999999999999999999999998877521    2233455542211          


Q ss_pred             -------cC----CCCCCCceeEEEEEEEEEEEEECCCCceeEEEEeEEEecCCC
Q psy2082         259 -------LA----SPDPKDAFGIVVSYMVKVKLYLGALGGELSAELPFILMHPKV  302 (314)
Q Consensus       259 -------L~----~~~~~~~~~I~VsY~vkV~l~v~~~~~~v~velPi~I~~p~p  302 (314)
                             |+    +++...+..|.|+|.|+|++.++++..++.+++||+|+++++
T Consensus        82 ~~~~~l~lP~~~~~s~~~~~~~i~v~Y~l~v~~~~~~~~~~~~~~~PI~I~~~p~  136 (136)
T PF02752_consen   82 EFNIQLQLPSNLPPSTSTNSRLIQVEYQLEVTVKLSGCTSDLRLELPITIGSSPC  136 (136)
T ss_dssp             EEEEEE-----B-----CGGGSEEEEEEEEEEEEEETTSEEEEEEEEEEEEB-SS
T ss_pred             EEEEEEcCCCccCcccccCCcEEEEEEEEEEEEEECCceeEEEEEccEEEEecCC
Confidence                   22    333347888999999999999987789999999999998753


No 4  
>PF00339 Arrestin_N:  Arrestin (or S-antigen), N-terminal domain;  InterPro: IPR011021 G protein-coupled receptors are a large family of signalling molecules that respond to a wide variety of extracellular stimuli. The receptors relay the information encoded by the ligand through the activation of heterotrimeric G proteins and intracellular effector molecules. To ensure the appropriate regulation of the signalling cascade, it is vital to properly inactivate the receptor. This inactivation is achieved, in part, by the binding of a soluble protein, arrestin, which uncouples the receptor from the downstream G protein after the receptors are phosphorylated by G protein-coupled receptor kinases. In addition to the inactivation of G protein-coupled receptors, arrestins have also been implicated in the endocytosis of receptors and cross talk with other signalling pathways. Arrestin (retinal S-antigen) is a major protein of the retinal rod outer segments. It interacts with photo-activated phosphorylated rhodopsin, inhibiting or 'arresting' its ability to interact with transducin []. The protein binds calcium, and shows similarity in its C terminus to alpha-transducin and other purine nucleotide-binding proteins. In mammals, arrestin is associated with autoimmune uveitis. Arrestins comprise a family of closely-related proteins that includes beta-arrestin-1 and -2, which regulate the function of beta-adrenergic receptors by binding to their phosphorylated forms, impairing their capacity to activate G(S) proteins; Cone photoreceptors C-arrestin (arrestin-X) [], which could bind to phosphorylated red/green opsins; and Drosophila phosrestins I and II, which undergo light-induced phosphorylation, and probably play a role in photoreceptor transduction [, , ].  The crystal structure of bovine retinal arrestin comprises two domains of antiparallel beta-sheets connected through a hinge region and one short alpha-helix on the back of the amino-terminal fold []. The binding region for phosphorylated light-activated rhodopsin is located at the N-terminal domain, as indicated by the docking of the photoreceptor to the three-dimensional structure of arrestin.  The N-terminal domain consists of an immunoglobulin-like beta-sandwich structure. This entry represents proteins with immunoglobulin-like domains that are similar to those found in arrestin.; PDB: 1SUJ_A 3UGX_A 1CF1_B 1AYR_A 3UGU_A 3P2D_B 1ZSH_A 2WTR_B 3GC3_A 1G4R_A ....
Probab=99.61  E-value=1.5e-14  Score=121.44  Aligned_cols=122  Identities=21%  Similarity=0.334  Sum_probs=77.3

Q ss_pred             EEEecceeeEecCCCceeeeeEEEEe-cceeeceeEEEEEEEeeecccccccccCccccceEEEEEEEEecCCC-CCCCC
Q psy2082          18 TLYMGKRDFVDYVTGVEPIDGVVVFE-EDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLASEQIYPRSE-KQHTS   95 (314)
Q Consensus        18 t~yl~~rd~~d~~~~vd~v~GvV~vd-~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~~qv~p~~~-~~~~~   95 (314)
                      .++|.+.+-+-+-+  |.|.|.|+++ .+.++.+.|.+++.|..++.-.+....+..+.+       +..+... .....
T Consensus         2 ~I~ld~~~~~y~~G--e~I~G~V~l~~~~~~~i~~i~v~l~G~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~   72 (149)
T PF00339_consen    2 EIELDNPKPVYFPG--EVISGKVVLELSKPIKIKSIKVRLKGRAKTKWSESKSSGSTFRK-------QTTPKVQYSEKKE   72 (149)
T ss_dssp             EEEES-SEEEEESS----EEEEEEECTTT-TTTSEEEEEEEEEEEESSSSTTSTTCEEEE-------EEESTSSS-SSSS
T ss_pred             EEEECCCCCEECCC--CEEEEEEEEEECCccceeEEEEEEEEEEEEEecCCCcceeeeee-------EEeccccccccee
Confidence            46666666666666  9999999999 889999999999999998755544433433333       3333332 12234


Q ss_pred             cchhhhhhhc--------cCCCeeEeEEeeCCCCCCCeEEecCCCCCCCCCeEEEEEEEEEEccCCccCccccc
Q psy2082          96 LTKMQDCLLK--------KLGPKAYPFTFNITPAAPPSVTLQPGPDETGEPCGVTYYVKLFVGENETDRSHRRS  161 (314)
Q Consensus        96 lt~lq~~l~~--------k~G~h~fPF~f~LP~~lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~~~~~dr~~K~~  161 (314)
                      +...+..+..        .+|.|.|||+|+||.+||+||+        |.+|+|+|.|+|.+     +++|+.+
T Consensus        73 ~~~~~~~l~~~~~~~~~l~~G~~~fpF~f~LP~~lP~S~~--------~~~g~I~Y~l~a~l-----~~~~~~~  133 (149)
T PF00339_consen   73 YFDHESQLWGSEDGPNILPPGEYEFPFEFQLPSNLPSSFE--------GSHGSIRYKLKATL-----DRPGKKD  133 (149)
T ss_dssp             SSHHHHHHHHH--------C-TTEEEEEE---TTS--SEE--------EE-SEEEEEEEEEE-----SSTTSE-
T ss_pred             eccceeEeeeeccceecccCCCEEEEEEEECCCCCCceEe--------ccCcCEEEEEEEEE-----ECCCCCC
Confidence            4444444433        3899999999999999999999        88889999999997     5566555


No 5  
>PF03643 Vps26:  Vacuolar protein sorting-associated protein 26 ;  InterPro: IPR005377  The movement of lipid and protein components between intracellular organelles requires the regulated interactions of many molecules. Vacuolar protein sorting-associated protein (Vps)5 is a yeast protein that is a subunit of a large multimeric complex, termed the retromer complex, involved in retrograde transport of proteins from endosomes to the trans-Golgi network. Sorting nexin (SNX) 1 and SNX2 are its mammalian orthologs []. To carry out its biological functions, Vps5 forms the retromer complex with at least four other proteins: Vps17, Vps26, Vps29, and Vps35 []. This family of Vps26-proteins also contains Down syndrome critical region 3/A.; GO: 0007034 vacuolar transport, 0030904 retromer complex; PDB: 3LHA_A 3LH9_A 2R51_A 3LH8_B 2FAU_A.
Probab=98.44  E-value=0.00018  Score=67.97  Aligned_cols=209  Identities=17%  Similarity=0.213  Sum_probs=111.6

Q ss_pred             CceeeeeEEEEe-cc--eeeceeEEEEEEEeeecccccccccCccccceEEEEEEEEecCCCCCCCCcchhhhhhhccCC
Q psy2082          32 GVEPIDGVVVFE-ED--YIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLASEQIYPRSEKQHTSLTKMQDCLLKKLG  108 (314)
Q Consensus        32 ~vd~v~GvV~vd-~~--~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~~qv~p~~~~~~~~lt~lq~~l~~k~G  108 (314)
                      .-|.|.|+|.|. +.  .++-..|.+++.+.+.-=.+...      .-+++-.+.++=|+..     |+         .|
T Consensus        37 ~gE~V~G~V~I~~~~gk~~~H~GI~l~lvG~ie~~~~~~k------~~~f~~~~~eL~~~G~-----l~---------~~   96 (275)
T PF03643_consen   37 DGETVSGKVVITSKPGKSLEHQGIKLELVGQIEAFYDSGK------PIEFLSLSIELAPPGK-----LP---------EG   96 (275)
T ss_dssp             TC--EEEEEEEEESSTS-EEES-EEEEEEEEEEEGCCTT-------EEEEEEEEEEEE-SEE-----E----------S-
T ss_pred             CCCEEEEEEEEEECCCCceEEeeEEEEEEEeEeEeccCCC------ceEeEEeeEEEcCCcc-----cC---------CC
Confidence            349999999999 44  57777889988877631111111      1134445666645431     22         56


Q ss_pred             CeeEeEEeeCCCCCCCeEEecCCCCCCCCCeEEEEEEEEEEccCCccCccccc--eEEEeeeeccCCCCCCCCCCeEEEe
Q psy2082         109 PKAYPFTFNITPAAPPSVTLQPGPDETGEPCGVTYYVKLFVGENETDRSHRRS--TVALGIRKIQYAPSKNGRQPCTVVR  186 (314)
Q Consensus       109 ~h~fPF~f~LP~~lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~~~~~dr~~K~~--~v~l~irkl~~~p~~~~~~p~~e~~  186 (314)
                      .. |||.|.+-+.-=.|+.        |.+-.|+|.++|.+.     |.++.-  +..+.+......|.... +-+.++ 
T Consensus        97 ~t-~pFeF~~~~k~yETY~--------G~~v~i~Y~lrv~v~-----R~~~~i~k~~ef~V~~~~~~p~~~~-~ik~ev-  160 (275)
T PF03643_consen   97 KT-FPFEFPLVEKPYETYH--------GVNVNIRYFLRVTVK-----RSYKDISKEQEFWVQNFSITPESNQ-PIKMEV-  160 (275)
T ss_dssp             EE-EEEEE-SB---S--EE---------SSEEEEEEEEEEE-------SSS-EEEEEEEEEE-EB---------EEEEE-
T ss_pred             cE-EeeEeCCCCCCCccEe--------eeEEEEEEEEEEEEE-----ccCCCcceEEEEEEEeccCCCCCCC-Cccccc-
Confidence            66 9999987332224555        999999999999974     444211  23455554444443321 111121 


Q ss_pred             EEEEeeCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEEEeEEEEEEecCeEE--EEEEEEeec------
Q psy2082         187 KDFMLSPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDVVLFQNGQFR--HSVDCVETH------  258 (314)
Q Consensus       187 k~f~~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~~~~~~~~~~--~~Va~~E~~------  258 (314)
                          =-.+-++++..++|.-|.=.+.|.=.+.+.-.. ..|+.|.+.|++..++-. ..+..+  .++...|.-      
T Consensus       161 ----gie~~lhief~~~k~~~~l~d~i~G~i~f~lv~-~kIk~~elqLiR~Et~g~-~~~~~~e~t~i~~~eImDG~p~r  234 (275)
T PF03643_consen  161 ----GIEDCLHIEFEYDKSKYHLKDVITGKIYFLLVR-IKIKSMELQLIRVETCGC-GENYAKESTEIQKIEIMDGAPCR  234 (275)
T ss_dssp             ----CETTTEEEEEEES-SEEETT-EEEEEEEEEEES-S-EEEEEEEEEEEEEECE-CCCEEEEEEEEEEEEEESS---T
T ss_pred             ----CCCccEEEEEEEcccceECCCCEEEEEEEEEEe-ecceEEEEEEEEEEEEec-CCcccccceEEEEEEeecCCccc
Confidence                134678999999999999999987666654333 669999999998876533 222222  245554432      


Q ss_pred             ---------c----CCCCCCC-ceeEEEEEEEEEEEEE
Q psy2082         259 ---------L----ASPDPKD-AFGIVVSYMVKVKLYL  282 (314)
Q Consensus       259 ---------L----~~~~~~~-~~~I~VsY~vkV~l~v  282 (314)
                               |    .-|+..+ +....|+|+|.+-+..
T Consensus       235 ge~IPirl~l~~l~l~Pt~~~~~~~FsV~y~lnlvlid  272 (275)
T PF03643_consen  235 GESIPIRLFLPRLFLCPTYKNVNNKFSVEYELNLVLID  272 (275)
T ss_dssp             T-EEEEEEECCCT-----EEEECTTEEEEEEEEEEEEE
T ss_pred             cceeeEEEEcCCcccCCcchhcCCcEEEEEEEEEEEEc
Confidence                     1    2233333 3358999999987654


No 6  
>PF13002 LDB19:  Arrestin_N terminal like;  InterPro: IPR024391 This entry represents a predicted Ig-like beta sandwich domain found towards the N terminus of protein LDB19 []. It is also found in other sequences and is related to the arrestin N-terminal fold [].
Probab=98.16  E-value=4.9e-05  Score=67.79  Aligned_cols=123  Identities=19%  Similarity=0.296  Sum_probs=78.8

Q ss_pred             ccCCCeeEeEEeeCCCCCCCeEEecCCCCCCCCCeEEEEEEEEEEccCC---ccCccccceEEE----ee-eeccCCCCC
Q psy2082         105 KKLGPKAYPFTFNITPAAPPSVTLQPGPDETGEPCGVTYYVKLFVGENE---TDRSHRRSTVAL----GI-RKIQYAPSK  176 (314)
Q Consensus       105 ~k~G~h~fPF~f~LP~~lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~~~~---~dr~~K~~~v~l----~i-rkl~~~p~~  176 (314)
                      ...|.|+|||++-||-+||+|..|  |.   +..+.|.|+++|......   ...+-|...+.+    .| |-|   +  
T Consensus        47 l~~G~h~fPFS~LiPG~LPaS~~l--gs---~~l~~I~Yel~A~a~~~~~~~~~~~~~~~~~~~~~pl~V~Rsi---~--  116 (191)
T PF13002_consen   47 LTKGSHAFPFSYLIPGHLPASMDL--GS---TPLVSIKYELKAEATYKDPRRGSSSSKPRVLKLKRPLPVKRSI---L--  116 (191)
T ss_pred             cCCCcccCCeeEECCCCCcccccc--CC---CCcEEEEEEEEEEEEEccCccccCCCcceeEEEeeeEEEEEec---C--
Confidence            448999999999999999999986  21   457899999999975410   001122221221    11 111   1  


Q ss_pred             CCCCCeEEEeEEEEeeCCcEEEEEEECCccccCCCeEEEEEEEEccccce----eEEEEEEEEEeEEEEE
Q psy2082         177 NGRQPCTVVRKDFMLSPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKV----VKKIKAMVQQGVDVVL  242 (314)
Q Consensus       177 ~~~~p~~e~~k~f~~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~----Vk~Ikv~L~Q~~~~~~  242 (314)
                      ++ ++..   .--.|-.-.|...|.||. +.+|--+.+|++.++|-++.+    ++++.=+|..++.+.+
T Consensus       117 ~g-pd~~---S~RvFPPT~l~a~a~lP~-VI~P~gtfpvel~LdGv~~~~~rWRlrKltWRiEE~~kv~a  181 (191)
T PF13002_consen  117 PG-PDKN---SLRVFPPTNLTASAVLPN-VIHPKGTFPVELRLDGVVSKDRRWRLRKLTWRIEETIKVRA  181 (191)
T ss_pred             CC-CCcc---cEEecCCCCcEEEEEcCC-eeCCCCcccEEEEEecccCCCCEEEEEeeeEEEeeEEEEEc
Confidence            11 1111   112466677899999996 556777799999999998765    5566666666655544


No 7  
>PF08737 Rgp1:  Rgp1;  InterPro: IPR014848 Rgp1 forms heterodimer with Ric1 (IPR009771 from INTERPRO) which associates with Golgi membranes and functions as a guanyl-nucleotide exchange factor []. 
Probab=97.17  E-value=0.077  Score=53.07  Aligned_cols=49  Identities=20%  Similarity=0.233  Sum_probs=41.9

Q ss_pred             eCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEEEeEEE
Q psy2082         192 SPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDV  240 (314)
Q Consensus       192 s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~  240 (314)
                      .+|..-..++|.|..|.-||+|...++.++.+...+-.+.+.|.-...+
T Consensus       300 ~n~~~va~~~LsK~~yrlGE~I~g~idf~~~~~~~c~~v~~~LEs~E~v  348 (415)
T PF08737_consen  300 RNGQRVARLSLSKPAYRLGEDIVGTIDFNDASTIPCYQVSASLESEETV  348 (415)
T ss_pred             ECCeEEEEEEecCCCcccCCeEEEEEEcCCCCcceeEEEEEEEEEEEEe
Confidence            6788889999999999999999999999999866677788888765543


No 8  
>PF00339 Arrestin_N:  Arrestin (or S-antigen), N-terminal domain;  InterPro: IPR011021 G protein-coupled receptors are a large family of signalling molecules that respond to a wide variety of extracellular stimuli. The receptors relay the information encoded by the ligand through the activation of heterotrimeric G proteins and intracellular effector molecules. To ensure the appropriate regulation of the signalling cascade, it is vital to properly inactivate the receptor. This inactivation is achieved, in part, by the binding of a soluble protein, arrestin, which uncouples the receptor from the downstream G protein after the receptors are phosphorylated by G protein-coupled receptor kinases. In addition to the inactivation of G protein-coupled receptors, arrestins have also been implicated in the endocytosis of receptors and cross talk with other signalling pathways. Arrestin (retinal S-antigen) is a major protein of the retinal rod outer segments. It interacts with photo-activated phosphorylated rhodopsin, inhibiting or 'arresting' its ability to interact with transducin []. The protein binds calcium, and shows similarity in its C terminus to alpha-transducin and other purine nucleotide-binding proteins. In mammals, arrestin is associated with autoimmune uveitis. Arrestins comprise a family of closely-related proteins that includes beta-arrestin-1 and -2, which regulate the function of beta-adrenergic receptors by binding to their phosphorylated forms, impairing their capacity to activate G(S) proteins; Cone photoreceptors C-arrestin (arrestin-X) [], which could bind to phosphorylated red/green opsins; and Drosophila phosrestins I and II, which undergo light-induced phosphorylation, and probably play a role in photoreceptor transduction [, , ].  The crystal structure of bovine retinal arrestin comprises two domains of antiparallel beta-sheets connected through a hinge region and one short alpha-helix on the back of the amino-terminal fold []. The binding region for phosphorylated light-activated rhodopsin is located at the N-terminal domain, as indicated by the docking of the photoreceptor to the three-dimensional structure of arrestin.  The N-terminal domain consists of an immunoglobulin-like beta-sandwich structure. This entry represents proteins with immunoglobulin-like domains that are similar to those found in arrestin.; PDB: 1SUJ_A 3UGX_A 1CF1_B 1AYR_A 3UGU_A 3P2D_B 1ZSH_A 2WTR_B 3GC3_A 1G4R_A ....
Probab=95.28  E-value=0.025  Score=46.93  Aligned_cols=100  Identities=18%  Similarity=0.227  Sum_probs=58.3

Q ss_pred             EEEEC--CccccCCCeEEEEEEEEccccceeEEEEEEEEEeEEEEEEec---CeEEEE-EE-------------------
Q psy2082         199 EVTLD--KQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDVVLFQN---GQFRHS-VD-------------------  253 (314)
Q Consensus       199 ~asld--K~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~~~~~~---~~~~~~-Va-------------------  253 (314)
                      ++.||  +..|.+||.|.=+|.+...+...++.|++.|.=.+.+.....   ...... ..                   
T Consensus         2 ~I~ld~~~~~y~~Ge~I~G~V~l~~~~~~~i~~i~v~l~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~   81 (149)
T PF00339_consen    2 EIELDNPKPVYFPGEVISGKVVLELSKPIKIKSIKVRLKGRAKTKWSESKSSGSTFRKQTTPKVQYSEKKEYFDHESQLW   81 (149)
T ss_dssp             EEEES-SEEEEESS--EEEEEEECTTT-TTTSEEEEEEEEEEEESSSSTTSTTCEEEEEEESTSSS-SSSSSSHHHHHHH
T ss_pred             EEEECCCCCEECCCCEEEEEEEEEECCccceeEEEEEEEEEEEEEecCCCcceeeeeeEEecccccccceeeccceeEee
Confidence            44555  999999999999999988888999999999987765543211   111100 00                   


Q ss_pred             -------EEeec---------c---CCCCCCCceeEEEEEEEEEEEEECCCCceeEEEEeEEEecC
Q psy2082         254 -------CVETH---------L---ASPDPKDAFGIVVSYMVKVKLYLGALGGELSAELPFILMHP  300 (314)
Q Consensus       254 -------~~E~~---------L---~~~~~~~~~~I~VsY~vkV~l~v~~~~~~v~velPi~I~~p  300 (314)
                             .++.+         |   .|++..... -.|.|.|++.+..++ ..+...+-+|++..+
T Consensus        82 ~~~~~~~~l~~G~~~fpF~f~LP~~lP~S~~~~~-g~I~Y~l~a~l~~~~-~~~~~~~~~~~v~~~  145 (149)
T PF00339_consen   82 GSEDGPNILPPGEYEFPFEFQLPSNLPSSFEGSH-GSIRYKLKATLDRPG-KKDHKAKREFTVVEP  145 (149)
T ss_dssp             HH--------C-TTEEEEEE---TTS--SEEEE--SEEEEEEEEEESSTT-SE--CGGEEEEEEEE
T ss_pred             eeccceecccCCCEEEEEEEECCCCCCceEeccC-cCEEEEEEEEEECCC-CCCcEEEEEEEEECc
Confidence                   00000         2   233332222 389999999886543 567778888887654


No 9  
>PF07070 Spo0M:  SpoOM protein;  InterPro: IPR009776 This family consists of several bacterial SpoOM proteins which are thought to control sporulation in Bacillus subtilis.Spo0M exerts certain negative effects on sporulation and its gene expression is controlled by sigmaH [].
Probab=94.86  E-value=0.39  Score=44.03  Aligned_cols=91  Identities=19%  Similarity=0.308  Sum_probs=59.7

Q ss_pred             cCCCceeeeeEEEEe--cceeeceeEEEEEEEeeecccccccccCccccceEEEEEEEEecCCCCCCCCcchhhhhhhcc
Q psy2082          29 YVTGVEPIDGVVVFE--EDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLASEQIYPRSEKQHTSLTKMQDCLLKK  106 (314)
Q Consensus        29 ~~~~vd~v~GvV~vd--~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~~qv~p~~~~~~~~lt~lq~~l~~k  106 (314)
                      ++..-+.|.|.|.|.  ...-+=.+|++.|.+.+....+|.+     ++.+.-+.+.++   .+           ....+
T Consensus        23 ~~~pGe~v~G~V~i~GG~v~Q~I~~I~l~L~t~~~~e~~d~~-----~~~~~~~~~~~v---~~-----------~f~I~   83 (218)
T PF07070_consen   23 SVRPGETVRGEVHIKGGSVDQEIDRIYLELVTRYEVESDDKE-----YTQEVELARVRV---SG-----------PFTIE   83 (218)
T ss_pred             CccCCCEEEEEEEEEeCCcceEEeEEEEEEEEEEEEecCCCe-----EEEEEEEEEEEe---CC-----------CEEEC
Confidence            344558999999998  3444567888888887765544443     333444444444   21           11122


Q ss_pred             CC-CeeEeEEeeCCCCCCCeEEecCCCCCCCCCeEEEEEEEEEEc
Q psy2082         107 LG-PKAYPFTFNITPAAPPSVTLQPGPDETGEPCGVTYYVKLFVG  150 (314)
Q Consensus       107 ~G-~h~fPF~f~LP~~lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~  150 (314)
                      +| .+.+||+|+||.++|-|.            ++.+|.++..++
T Consensus        84 ~ge~~~iPF~~~lP~etPiT~------------~~~~v~l~T~Ld  116 (218)
T PF07070_consen   84 PGEEKEIPFSFPLPWETPITE------------GGMRVWLRTGLD  116 (218)
T ss_pred             CCCEEEEeEEEECCCCCCccC------------CCcEEEEEEEEE
Confidence            44 589999999999999886            366777777764


No 10 
>PF01835 A2M_N:  MG2 domain;  InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=88.76  E-value=1  Score=35.20  Aligned_cols=27  Identities=19%  Similarity=0.402  Sum_probs=22.5

Q ss_pred             EEEEECCccccCCCeEEEEEEEEcccc
Q psy2082         198 LEVTLDKQLYAHNEKIAVSICIRNNSN  224 (314)
Q Consensus       198 l~asldK~~Y~pGE~I~v~v~I~N~Ss  224 (314)
                      +-+..||.+|.|||+|.+.+-+.+..+
T Consensus         2 ~~i~TDr~iYrPGetV~~~~~~~~~~~   28 (99)
T PF01835_consen    2 IFIQTDRPIYRPGETVHFRAIVRDLDN   28 (99)
T ss_dssp             EEEEESSSEE-TTSEEEEEEEEEEECT
T ss_pred             EEEECCccCcCCCCEEEEEEEEecccc
Confidence            346789999999999999999888873


No 11 
>KOG3063|consensus
Probab=88.25  E-value=22  Score=33.24  Aligned_cols=211  Identities=18%  Similarity=0.246  Sum_probs=102.8

Q ss_pred             eeeceeEEEEEEEeeecccccccccCccccceEEEEEEEEecCCCCCCCCcchhhhhhhccCCC----eeEeEEeeCCCC
Q psy2082          46 YIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLASEQIYPRSEKQHTSLTKMQDCLLKKLGP----KAYPFTFNITPA  121 (314)
Q Consensus        46 ~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~~qv~p~~~~~~~~lt~lq~~l~~k~G~----h~fPF~f~LP~~  121 (314)
                      +-.|..|-+.+..+.+=|. -+|-.|+.   --|+.+..+|-..+ +....+.+-..|. .||+    .+|||.|.    
T Consensus        42 f~dgEtv~G~V~l~lk~gk-kleH~Gik---iefiGqIe~~~drg-n~~eF~~lv~eLa-~pGel~~~~~fpFeF~----  111 (301)
T KOG3063|consen   42 FYDGETVSGKVNLRLKDGK-KLEHQGIK---IEFIGQIEMYYDRG-NFHEFTSLVRELA-RPGELTQSQSFPFEFP----  111 (301)
T ss_pred             EecCCeeeeEEEEEEcCCc-ccccCceE---EEEEEEEEEEecCC-cHHHHHHHHHhhc-CCcceeecccCCcccc----
Confidence            3344555555555554333 22333322   12333444433332 1223343333333 3664    67888873    


Q ss_pred             CCCeEEecCCCCCCCCCeEEEEEEEEEEccCCccCccccceEEEeeeeccCCCCCCCCCCeEEEeEEEEeeCCcEEEEEE
Q psy2082         122 APPSVTLQPGPDETGEPCGVTYYVKLFVGENETDRSHRRSTVALGIRKIQYAPSKNGRQPCTVVRKDFMLSPGDLELEVT  201 (314)
Q Consensus       122 lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~~~~~dr~~K~~~v~l~irkl~~~p~~~~~~p~~e~~k~f~~s~G~l~l~as  201 (314)
                         +++ .|-..+.|+.-.+||.+||++.+...|-..-   ..+++..+.-.|.... +-..++  +   -..-+|++..
T Consensus       112 ---~ve-kpyEsY~G~NV~lrY~lkvTv~Rr~~di~ke---~d~~V~~~~~~P~~nn-~IkmeV--G---IedCLHIEFE  178 (301)
T KOG3063|consen  112 ---HVE-KPYESYIGKNVRLRYFLKVTVSRRLTDIVKE---KDLVVHNLSTYPEINN-SIKMEV--G---IEDCLHIEFE  178 (301)
T ss_pred             ---ccc-cchhhhcCcceEEEEEEEEEEEechhhhhhh---hheeeEecccCCCCCC-ceeEee--c---hhhceEEEEE
Confidence               222 2333577999999999999987765433222   2244444444454432 112222  1   1223678888


Q ss_pred             ECCccccCCCeEEEEEEEEccccceeEEEEEEEEEeEEEEEEecCeEE--EEEEEEeec--------------------c
Q psy2082         202 LDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDVVLFQNGQFR--HSVDCVETH--------------------L  259 (314)
Q Consensus       202 ldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~~~~~~~~~~--~~Va~~E~~--------------------L  259 (314)
                      -.|+-|.--|.|-=.+-.- .-.-.|+.+.++++.....-. ...++.  .++|+-|.-                    |
T Consensus       179 YnKskYhLkdvIvGkIYFl-LvRikIk~Mel~iikrEstG~-gpn~~~e~eTiakyeIMDGapvrGEsIPiRlFLagYdl  256 (301)
T KOG3063|consen  179 YNKSKYHLKDVIVGKIYFL-LVRIKIKHMELSIIKRESTGT-GPNTYVETETIAKYEIMDGAPVRGESIPIRLFLAGYDL  256 (301)
T ss_pred             ecccccchhheEEeeEEEE-EEEEEeeeeEEEEEEeecccC-CCcceeccceeeeEEeccCCCcCCCeeeeEEEecccCC
Confidence            8888887666653221110 001234555555554432211 111222  366665432                    3


Q ss_pred             CCCCCCCceeEEEEEEEEEEEE
Q psy2082         260 ASPDPKDAFGIVVSYMVKVKLY  281 (314)
Q Consensus       260 ~~~~~~~~~~I~VsY~vkV~l~  281 (314)
                      +|.+..-|.-..|.|.|..-|.
T Consensus       257 TPtmrdinkkFsVkyyLnLVlv  278 (301)
T KOG3063|consen  257 TPTMRDINKKFSVKYYLNLVLV  278 (301)
T ss_pred             CcchhhhcceeeeeeEEEEEEE
Confidence            5555434556888998876554


No 12 
>KOG2717|consensus
Probab=88.04  E-value=23  Score=33.13  Aligned_cols=126  Identities=10%  Similarity=0.160  Sum_probs=70.9

Q ss_pred             cCCCeeEeEEeeCCC-CCCCeEEecCCCCCCCCCeEEEEEEEEEEccCCccCccccceEEEeeeeccCCCCCCC-CCCeE
Q psy2082         106 KLGPKAYPFTFNITP-AAPPSVTLQPGPDETGEPCGVTYYVKLFVGENETDRSHRRSTVALGIRKIQYAPSKNG-RQPCT  183 (314)
Q Consensus       106 k~G~h~fPF~f~LP~-~lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~~~~~dr~~K~~~v~l~irkl~~~p~~~~-~~p~~  183 (314)
                      ++|...+||.|.|-. +=|--+-    ..+.|-+-.|+|.+.+-+.++-.-|+.-+ +.++.+   ++.|-..+ .+|..
T Consensus        85 p~G~tEipFelpL~~kge~~~lY----ETyHGvfiNiqY~LtcdikR~~L~K~ltk-t~eFiv---~s~pv~l~e~~p~i  156 (313)
T KOG2717|consen   85 PPGTTEIPFELPLREKGEGEKLY----ETYHGVFINIQYLLTCDIKRGYLHKPLTK-TMEFIV---ESGPVDLPERPPEI  156 (313)
T ss_pred             CCCceeeeeeeeeccCCCccEee----eeecceEEEEEEEEEEecccchhcCchhh-hheeee---ccCCcccccCCCcc
Confidence            389999999987732 1221121    12559999999999999877644443322 122222   22221100 00000


Q ss_pred             -------EE-eEEEE--eeCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEEEeEEE
Q psy2082         184 -------VV-RKDFM--LSPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDV  240 (314)
Q Consensus       184 -------e~-~k~f~--~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~  240 (314)
                             ++ .+...  .+-+..-+.-.||++-++--+++.=.+.|+- |...|++|.+.|.+..+.
T Consensus       157 V~F~itpdtlq~~~ker~~~p~FlvtG~Ld~t~c~~t~PltGeltVe~-seaaI~Sie~qLvRVEtc  222 (313)
T KOG2717|consen  157 VIFYITPDTLQHPLKERIKTPGFLVTGKLDATQCSLTDPLTGELTVEA-SEAAITSIEIQLVRVETC  222 (313)
T ss_pred             eEEEEChHHhhccchhhccCCceEEEeeecceeeEecCCccceEEEEe-eccceeEEEEEEEEEEEe
Confidence                   00 00000  2234455666777777777777766666664 467899999999987653


No 13 
>KOG3780|consensus
Probab=87.72  E-value=9.8  Score=37.49  Aligned_cols=100  Identities=19%  Similarity=0.294  Sum_probs=68.6

Q ss_pred             EEEEECCc--cccCCCeEEEEEEEEccccceeEEEEEEEEEeEEEEEEec------------------CeEE--EE----
Q psy2082         198 LEVTLDKQ--LYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDVVLFQN------------------GQFR--HS----  251 (314)
Q Consensus       198 l~asldK~--~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~~~~~~------------------~~~~--~~----  251 (314)
                      +++.+|+.  +|.+||+|.=++.+.+.....++.|++.+.=.++......                  ..|+  ..    
T Consensus         7 ~~i~~d~~~~iy~~G~~vsG~v~l~~~~~~~~~~i~l~~~G~~~t~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~   86 (427)
T KOG3780|consen    7 FEIVLDNPEAIYFPGEPVSGSVVLSTKEPIKVRAIKLQLKGRARTSWSESERGTKLNSKSEGSIKSSTVNYTAKETYLDS   86 (427)
T ss_pred             EEEEeCCCccccCCCCeEEEEEEEEeCCccceeEEEEEEEEeEEEeecccccccccccccccccccceEEeeceEEEeee
Confidence            45566666  6999999999999999999999999999987765543210                  0011  00    


Q ss_pred             ---EEEE----ee-----c---------c---CCCCCCCceeEEEEEEEEEEEEECCCCceeEEEEeEEEec
Q psy2082         252 ---VDCV----ET-----H---------L---ASPDPKDAFGIVVSYMVKVKLYLGALGGELSAELPFILMH  299 (314)
Q Consensus       252 ---Va~~----E~-----~---------L---~~~~~~~~~~I~VsY~vkV~l~v~~~~~~v~velPi~I~~  299 (314)
                         +...    ..     +         |   .|++.....| .|.|.|++.+.-.+ ..+-..+-+|++..
T Consensus        87 ~~~~~~~~~g~~~~~l~~G~~~~pF~~~LP~~~P~Sfeg~~G-~irY~vk~~idr~~-~~~~~~~~~~~V~~  156 (427)
T KOG3780|consen   87 KTILWTSSNGSNSRVLPPGNYEFPFSFTLPLNLPPSFEGKFG-HVRYFVKAEIDRPW-KLNKKNRKPFTVIE  156 (427)
T ss_pred             eeEEeeccCCCCceecCCCceEEeEeccCCCCCCCceeeCCc-eEEEEEEEEEecCC-CCCccceeeEEEec
Confidence               0000    00     0         2   3556666678 88999999888765 66778888888665


No 14 
>PF01345 DUF11:  Domain of unknown function DUF11;  InterPro: IPR001434 This group of sequences is represented by a conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis (Streptococcus faecalis), and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydia trachomatis outer membrane proteins.  In C. trachomatis, three cysteine-rich proteins (also believed to be lipoproteins), MOMP, OMP6 and OMP3, make up the extracellular matrix of the outer membrane []. They are involved in the essential structural integrity of both the elementary body (EB) and recticulate body (RB) phase. They are thought to be involved in porin formation and, as these bacteria lack the peptidoglycan layer common to most Gram-negative microbes, such proteins are highly important in the pathogenicity of the organism.; GO: 0005727 extrachromosomal circular DNA
Probab=85.76  E-value=3.2  Score=30.95  Aligned_cols=42  Identities=12%  Similarity=0.202  Sum_probs=36.8

Q ss_pred             eCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEE
Q psy2082         192 SPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAM  233 (314)
Q Consensus       192 s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~  233 (314)
                      ....+.+.-+.++....+||.+...+.+.|..+.....+.+.
T Consensus        22 ~~~~~~~~k~~~~~~~~~Gd~v~ytitvtN~G~~~a~nv~v~   63 (76)
T PF01345_consen   22 AIPDLSITKTVNPSTANPGDTVTYTITVTNTGPAPATNVVVT   63 (76)
T ss_pred             CCCCEEEEEecCCCcccCCCEEEEEEEEEECCCCeeEeEEEE
Confidence            456788888899999999999999999999999998887654


No 15 
>PF00207 A2M:  Alpha-2-macroglobulin family;  InterPro: IPR001599 This entry contains serum complement C3 and C4 precursors and alpha-macrogrobulins.  The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; GO: 0004866 endopeptidase inhibitor activity; PDB: 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 2PN5_A 3FRP_G 3HRZ_B ....
Probab=84.98  E-value=2.9  Score=32.53  Aligned_cols=41  Identities=10%  Similarity=0.223  Sum_probs=29.1

Q ss_pred             eCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEE
Q psy2082         192 SPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMV  234 (314)
Q Consensus       192 s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L  234 (314)
                      ...|+.++..+|+ ....||.+.+.+.|.|+..+.++ ++++|
T Consensus        52 v~~p~~i~~~lP~-~l~~GD~~~i~v~v~N~~~~~~~-v~V~l   92 (92)
T PF00207_consen   52 VFKPFFIQLNLPR-SLRRGDQIQIPVTVFNYTDKDQE-VTVTL   92 (92)
T ss_dssp             EB-SEEEEEE--S-EEETTSEEEEEEEEEE-SSS-EE-EEEEE
T ss_pred             EEeeEEEEcCCCc-EEecCCEEEEEEEEEeCCCCCEE-EEEEC
Confidence            3468889999998 55799999999999999988865 45543


No 16 
>TIGR01451 B_ant_repeat conserved repeat domain. This model represents the conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis, and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydial outer membrane proteins.
Probab=83.52  E-value=2.5  Score=29.87  Aligned_cols=32  Identities=22%  Similarity=0.340  Sum_probs=27.9

Q ss_pred             ECCccccCCCeEEEEEEEEccccceeEEEEEE
Q psy2082         202 LDKQLYAHNEKIAVSICIRNNSNKVVKKIKAM  233 (314)
Q Consensus       202 ldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~  233 (314)
                      .|+....+||.|..++.|.|........+.+.
T Consensus         3 ~d~~~~~~Gd~v~Yti~v~N~g~~~a~~v~v~   34 (53)
T TIGR01451         3 VDKTVATIGDTITYTITVTNNGNVPATNVVVT   34 (53)
T ss_pred             cCccccCCCCEEEEEEEEEECCCCceEeEEEE
Confidence            57888999999999999999999888876654


No 17 
>PF03643 Vps26:  Vacuolar protein sorting-associated protein 26 ;  InterPro: IPR005377  The movement of lipid and protein components between intracellular organelles requires the regulated interactions of many molecules. Vacuolar protein sorting-associated protein (Vps)5 is a yeast protein that is a subunit of a large multimeric complex, termed the retromer complex, involved in retrograde transport of proteins from endosomes to the trans-Golgi network. Sorting nexin (SNX) 1 and SNX2 are its mammalian orthologs []. To carry out its biological functions, Vps5 forms the retromer complex with at least four other proteins: Vps17, Vps26, Vps29, and Vps35 []. This family of Vps26-proteins also contains Down syndrome critical region 3/A.; GO: 0007034 vacuolar transport, 0030904 retromer complex; PDB: 3LHA_A 3LH9_A 2R51_A 3LH8_B 2FAU_A.
Probab=82.05  E-value=25  Score=33.41  Aligned_cols=90  Identities=18%  Similarity=0.287  Sum_probs=46.2

Q ss_pred             cccCCCeEEEEEEEEccccce--eEEEEEEEEEeEEEEEE-ecCe-EE--EEEEEE--------------eeccCCCCCC
Q psy2082         206 LYAHNEKIAVSICIRNNSNKV--VKKIKAMVQQGVDVVLF-QNGQ-FR--HSVDCV--------------ETHLASPDPK  265 (314)
Q Consensus       206 ~Y~pGE~I~v~v~I~N~Ssk~--Vk~Ikv~L~Q~~~~~~~-~~~~-~~--~~Va~~--------------E~~L~~~~~~  265 (314)
                      +|.+||+|.=.|.|+..++|.  =.+|++.|.=.  +.++ ..++ +.  +....+              |..+.+....
T Consensus        34 iY~~gE~V~G~V~I~~~~gk~~~H~GI~l~lvG~--ie~~~~~~k~~~f~~~~~eL~~~G~l~~~~t~pFeF~~~~k~yE  111 (275)
T PF03643_consen   34 IYSDGETVSGKVVITSKPGKSLEHQGIKLELVGQ--IEAFYDSGKPIEFLSLSIELAPPGKLPEGKTFPFEFPLVEKPYE  111 (275)
T ss_dssp             EEETC--EEEEEEEEESSTS-EEES-EEEEEEEE--EEEGCCTT-EEEEEEEEEEEE-SEEE-S-EEEEEEE-SB---S-
T ss_pred             eEcCCCEEEEEEEEEECCCCceEEeeEEEEEEEe--EeEeccCCCceEeEEeeEEEcCCcccCCCcEEeeEeCCCCCCCc
Confidence            789999998888888777555  45677666433  3333 2221 11  111111              1112222233


Q ss_pred             Ccee--EEEEEEEEEEEEECCCCceeEEEEeEEEec
Q psy2082         266 DAFG--IVVSYMVKVKLYLGALGGELSAELPFILMH  299 (314)
Q Consensus       266 ~~~~--I~VsY~vkV~l~v~~~~~~v~velPi~I~~  299 (314)
                      .-.|  +.+.|.|+|.+.-+.  ++++-+.+|.+-.
T Consensus       112 TY~G~~v~i~Y~lrv~v~R~~--~~i~k~~ef~V~~  145 (275)
T PF03643_consen  112 TYHGVNVNIRYFLRVTVKRSY--KDISKEQEFWVQN  145 (275)
T ss_dssp             -EE-SSEEEEEEEEEEE--SS--S-EEEEEEEEEE-
T ss_pred             cEeeeEEEEEEEEEEEEEccC--CCcceEEEEEEEe
Confidence            3444  899999999776654  8999999998764


No 18 
>PF02752 Arrestin_C:  Arrestin (or S-antigen), C-terminal domain;  InterPro: IPR011022 G protein-coupled receptors are a large family of signalling molecules that respond to a wide variety of extracellular stimuli. The receptors relay the information encoded by the ligand through the activation of heterotrimeric G proteins and intracellular effector molecules. To ensure the appropriate regulation of the signalling cascade, it is vital to properly inactivate the receptor. This inactivation is achieved, in part, by the binding of a soluble protein, arrestin, which uncouples the receptor from the downstream G protein after the receptors are phosphorylated by G protein-coupled receptor kinases. In addition to the inactivation of G protein-coupled receptors, arrestins have also been implicated in the endocytosis of receptors and cross talk with other signalling pathways. Arrestin (retinal S-antigen) is a major protein of the retinal rod outer segments. It interacts with photo-activated phosphorylated rhodopsin, inhibiting or 'arresting' its ability to interact with transducin []. The protein binds calcium, and shows similarity in its C terminus to alpha-transducin and other purine nucleotide-binding proteins. In mammals, arrestin is associated with autoimmune uveitis. Arrestins comprise a family of closely-related proteins that includes beta-arrestin-1 and -2, which regulate the function of beta-adrenergic receptors by binding to their phosphorylated forms, impairing their capacity to activate G(S) proteins; Cone photoreceptors C-arrestin (arrestin-X) [], which could bind to phosphorylated red/green opsins; and Drosophila phosrestins I and II, which undergo light-induced phosphorylation, and probably play a role in photoreceptor transduction [, , ].  The crystal structure of bovine retinal arrestin comprises two domains of antiparallel beta-sheets connected through a hinge region and one short alpha-helix on the back of the amino-terminal fold []. The binding region for phosphorylated light-activated rhodopsin is located at the N-terminal domain, as indicated by the docking of the photoreceptor to the three-dimensional structure of arrestin.  The C-terminal domain consists of an immunoglobulin-like beta-sandwich structure. This entry represents proteins with immunoglobulin-like domains that are similar to those found in arrestin.; PDB: 1SUJ_A 3UGX_A 1CF1_B 1AYR_A 3UGU_A 3P2D_B 1ZSH_A 2WTR_B 3GC3_A 1G4R_A ....
Probab=77.99  E-value=33  Score=27.29  Aligned_cols=106  Identities=14%  Similarity=0.173  Sum_probs=45.2

Q ss_pred             EEEEecceeeEecCCCceeeeeEEEEe-cceeeceeEEEEEEEeeecccccccccCccccceEEEEEEEEecCCCCCCCC
Q psy2082          17 LTLYMGKRDFVDYVTGVEPIDGVVVFE-EDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLASEQIYPRSEKQHTS   95 (314)
Q Consensus        17 ~t~yl~~rd~~d~~~~vd~v~GvV~vd-~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~~qv~p~~~~~~~~   95 (314)
                      +++-|.++=|.    .-|.|..-+-|+ ....+=++|.++|.=..+|...+... .-. +..-.+.. +....-      
T Consensus         7 ~~~~i~~~~~~----~Ge~i~v~v~i~n~s~~~i~~I~v~L~~~~~~~~~~~~~-~~~-~~~~~v~~-~~~~~~------   73 (136)
T PF02752_consen    7 LSISIPRTAYV----PGETIPVNVEIDNQSKKKIKKIKVSLVERITYKAKGGKD-ESK-SEKRVVAK-SKNCGV------   73 (136)
T ss_dssp             EEEEES-SEEE----TT--EEEEEEEEE-SSSEEEEEEEEEEEEEEE-SS-----S-E-EEEEEEEE-EECCEB------
T ss_pred             EEEEECCCEEC----CCCEEEEEEEEEECCCCEEEEEEEEEEEEEEEEEeeccc-cce-EEEEEEEE-EecCCc------
Confidence            45556666654    336777777777 44445566777665555553322210 000 00011111 111100      


Q ss_pred             cchhhhhhhccCCCeeEe--EEeeCCCCCCCeEEecCCCCCCCCCeEEEEEEEEEEc
Q psy2082          96 LTKMQDCLLKKLGPKAYP--FTFNITPAAPPSVTLQPGPDETGEPCGVTYYVKLFVG  150 (314)
Q Consensus        96 lt~lq~~l~~k~G~h~fP--F~f~LP~~lPsSf~lq~~~~~~G~~g~V~Y~Vka~i~  150 (314)
                               ...+...|-  ..|+||.++|+|..      ..|..-.|.|.|++.+.
T Consensus        74 ---------~~~~~~~~~~~~~l~lP~~~~~s~~------~~~~~i~v~Y~l~v~~~  115 (136)
T PF02752_consen   74 ---------DPGSSGSFEFNIQLQLPSNLPPSTS------TNSRLIQVEYQLEVTVK  115 (136)
T ss_dssp             ----------B-TTEEEEEEEEE-----B-----------CGGGSEEEEEEEEEEEE
T ss_pred             ---------cCCCCceEEEEEEEcCCCccCcccc------cCCcEEEEEEEEEEEEE
Confidence                     112334444  88999988887773      12889999999999974


No 19 
>PF00927 Transglut_C:  Transglutaminase family, C-terminal ig like domain;  InterPro: IPR008958 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase  Transglutaminases catalyse the post-translational modification of proteins at glutamine residues, with formation of isopeptide bonds. Members of the transglutaminase family usually have three domains: N-terminal (IPR001102 from INTERPRO), middle (IPR013808 from INTERPRO) and C-terminal. The middle domain is usually well conserved, but family members can display major differences in their N- and C-terminal domains, although their overall structure is conserved []. This entry represents the C-terminal domain found in transglutaminases, which consists of an immunoglobulin-like beta-sandwich consisting of seven strands in two sheets with a Greek key topology. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ].; GO: 0003810 protein-glutamine gamma-glutamyltransferase activity, 0018149 peptide cross-linking; PDB: 2XZZ_A 1GGY_B 1FIE_B 1GGU_B 1GGT_B 1F13_A 1QRK_B 1EVU_A 1EX0_B 1L9N_B ....
Probab=73.48  E-value=9.6  Score=30.23  Aligned_cols=38  Identities=16%  Similarity=0.284  Sum_probs=30.4

Q ss_pred             EEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082         197 ELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQ  235 (314)
Q Consensus       197 ~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~  235 (314)
                      ++++.++.... -|+.+.+.+.+.|.|+..++.+++.|.
T Consensus         2 ~~~i~~~~~~~-vG~d~~v~v~~~N~~~~~l~~v~~~l~   39 (107)
T PF00927_consen    2 EIKIKLPGDPV-VGQDFTVSVSFTNPSSEPLRNVSLNLC   39 (107)
T ss_dssp             EEEEEEESEEB-TTSEEEEEEEEEE-SSS-EECEEEEEE
T ss_pred             eEEEEECCCcc-CCCCEEEEEEEEeCCcCccccceeEEE
Confidence            46666766665 799999999999999999999998884


No 20 
>PF04425 Bul1_N:  Bul1 N terminus;  InterPro: IPR007519 This domain is the N terminus of Saccharomyces cerevisiae (Baker's yeast) Bul1. Bul1 binds the ubiquitin ligase Rsp5, via an N-terminal PPSY motif (157-160 in P48524 from SWISSPROT) []. The complex containing Bul1 and Rsp5 is involved in intracellular trafficking of the general amino acid permease Gap1 [], degradation of Rog1 in cooperation with Bul2 and GSK-3 [], and mitochondrial inheritance []. Bul1 may contain HEAT repeats. The C terminus is IPR007520 from INTERPRO.
Probab=68.24  E-value=32  Score=34.88  Aligned_cols=118  Identities=17%  Similarity=0.107  Sum_probs=0.0

Q ss_pred             CCCceeeeeEEEEe---cceeeceeEEEEEEEeeecccc------------------cccccCccccceEEEEEEEEecC
Q psy2082          30 VTGVEPIDGVVVFE---EDYIQDRKVFGQVVCSFRYGRE------------------EDEILGLNFQKELYLASEQIYPR   88 (314)
Q Consensus        30 ~~~vd~v~GvV~vd---~~~lk~rkv~~~~~~~fryG~~------------------d~dv~gl~f~k~l~~~~~qv~p~   88 (314)
                      ...-|-|.|.|+|.   .++++=-=.+|.+.+++--+..                  |+.-.+-+-.=+.-....  |.+
T Consensus       160 y~qGD~I~GyvtI~N~S~~pIpFdMFyV~lEG~~~v~~~~~~~~~~~~~~kkFL~M~D~sASws~~~i~~~~~~~--~~~  237 (438)
T PF04425_consen  160 YTQGDIIHGYVTIENTSSKPIPFDMFYVSLEGTISVVDSKSPSSKKPRTVKKFLRMFDFSASWSYANIDRLVGDN--YCP  237 (438)
T ss_pred             ccCCCEEEEEEEEEECCCCCcccceEEEEEEEEEEEcccccccccccHHHHHHHHhhcceecccccccccccccc--cCC


Q ss_pred             CCCCCCCcchhh---hhhhccCCCeeEeEEeeCCC---------------CCCCeEEecCCC------------------
Q psy2082          89 SEKQHTSLTKMQ---DCLLKKLGPKAYPFTFNITP---------------AAPPSVTLQPGP------------------  132 (314)
Q Consensus        89 ~~~~~~~lt~lq---~~l~~k~G~h~fPF~f~LP~---------------~lPsSf~lq~~~------------------  132 (314)
                      ..-++-.=|-++   +|++.+--.|..+|+|.||.               .|||||-+....                  
T Consensus       238 ~~~Dp~Dgt~lgl~~~r~l~p~~~Yk~fF~FkiP~~LLd~~C~h~~~~H~~lPPs~G~~k~~~~~~~~~i~~n~~lGyg~  317 (438)
T PF04425_consen  238 GEVDPYDGTYLGLPNKRILEPGVKYKKFFTFKIPEQLLDSACKHNLSSHCLLPPSLGIDKNEFNGKYSSIKINKALGYGH  317 (438)
T ss_pred             ccccCCCCeeEeCCCCceecCCCeEeceeEEeCCchhccccccCcccccCcCCCCCCcChhhccchhhhhhcccccCccc


Q ss_pred             -----------CCCCCCeEEEEEEEEEE
Q psy2082         133 -----------DETGEPCGVTYYVKLFV  149 (314)
Q Consensus       133 -----------~~~G~~g~V~Y~Vka~i  149 (314)
                                 +..-..-.|.|.|.|.+
T Consensus       318 l~~~gspil~~D~s~~d~SIsY~V~AR~  345 (438)
T PF04425_consen  318 LGTKGSPILTNDFSFDDTSISYSVEARF  345 (438)
T ss_pred             cccCCCcceecccCCCCceEEEEEEEEE


No 21 
>PF10633 NPCBM_assoc:  NPCBM-associated, NEW3 domain of alpha-galactosidase;  InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=68.08  E-value=5.3  Score=29.99  Aligned_cols=28  Identities=11%  Similarity=0.308  Sum_probs=20.5

Q ss_pred             cCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082         208 AHNEKIAVSICIRNNSNKVVKKIKAMVQ  235 (314)
Q Consensus       208 ~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~  235 (314)
                      .+||++.+.+.|.|..+..+..++++|.
T Consensus         2 ~~G~~~~~~~tv~N~g~~~~~~v~~~l~   29 (78)
T PF10633_consen    2 TPGETVTVTLTVTNTGTAPLTNVSLSLS   29 (78)
T ss_dssp             -TTEEEEEEEEEE--SSS-BSS-EEEEE
T ss_pred             CCCCEEEEEEEEEECCCCceeeEEEEEe
Confidence            4899999999999999888888887774


No 22 
>PF07070 Spo0M:  SpoOM protein;  InterPro: IPR009776 This family consists of several bacterial SpoOM proteins which are thought to control sporulation in Bacillus subtilis.Spo0M exerts certain negative effects on sporulation and its gene expression is controlled by sigmaH [].
Probab=67.51  E-value=1e+02  Score=28.31  Aligned_cols=50  Identities=16%  Similarity=0.286  Sum_probs=43.4

Q ss_pred             eeCCcEEEEEEECCccccCCCeEEEEEEEEccc-cceeEEEEEEEEEeEEE
Q psy2082         191 LSPGDLELEVTLDKQLYAHNEKIAVSICIRNNS-NKVVKKIKAMVQQGVDV  240 (314)
Q Consensus       191 ~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~S-sk~Vk~Ikv~L~Q~~~~  240 (314)
                      +.-|..+++..|++..|.|||+|.=.|.|...+ ...|..|.+.|.-.+..
T Consensus         8 ~GiG~akVDT~L~~~~~~pGe~v~G~V~i~GG~v~Q~I~~I~l~L~t~~~~   58 (218)
T PF07070_consen    8 IGIGGAKVDTVLEKPSVRPGETVRGEVHIKGGSVDQEIDRIYLELVTRYEV   58 (218)
T ss_pred             cCCCCceEEEEECCCCccCCCEEEEEEEEEeCCcceEEeEEEEEEEEEEEE
Confidence            345888999999999999999999999999996 57899999999766543


No 23 
>PF05688 DUF824:  Salmonella repeat of unknown function (DUF824);  InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=67.25  E-value=8.8  Score=26.77  Aligned_cols=30  Identities=13%  Similarity=0.325  Sum_probs=25.6

Q ss_pred             cCCCeEEEEEEEEccccceeEEEEEEEEEe
Q psy2082         208 AHNEKIAVSICIRNNSNKVVKKIKAMVQQG  237 (314)
Q Consensus       208 ~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~  237 (314)
                      .-||+|+++|++.|.....+-..-+.|.+-
T Consensus        10 K~Ge~I~ltVt~kda~G~pv~n~~f~l~r~   39 (47)
T PF05688_consen   10 KVGETIPLTVTVKDANGNPVPNAPFTLTRG   39 (47)
T ss_pred             ecCCeEEEEEEEECCCCCCcCCceEEEEec
Confidence            359999999999999999999888777543


No 24 
>PF07705 CARDB:  CARDB;  InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=59.82  E-value=36  Score=25.66  Aligned_cols=40  Identities=13%  Similarity=0.313  Sum_probs=28.7

Q ss_pred             EEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082         196 LELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQ  235 (314)
Q Consensus       196 l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~  235 (314)
                      |.+.....-....+|+.+.+++.|.|.-......+.+.+.
T Consensus         4 L~v~~~~~~~~~~~g~~~~i~~~V~N~G~~~~~~~~v~~~   43 (101)
T PF07705_consen    4 LTVSITVSPSNVVPGEPVTITVTVKNNGTADAENVTVRLY   43 (101)
T ss_dssp             EEE-EEEC-SEEETTSEEEEEEEEEE-SSS-BEEEEEEEE
T ss_pred             EEEEEeeCCCcccCCCEEEEEEEEEECCCCCCCCEEEEEE
Confidence            3445566667778999999999999998888888777753


No 25 
>PF06030 DUF916:  Bacterial protein of unknown function (DUF916);  InterPro: IPR010317 This family consists of putative cell surface proteins, from Firmicutes, of unknown function. 
Probab=58.52  E-value=23  Score=29.33  Aligned_cols=27  Identities=26%  Similarity=0.390  Sum_probs=22.7

Q ss_pred             cCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082         208 AHNEKIAVSICIRNNSNKVVKKIKAMVQ  235 (314)
Q Consensus       208 ~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~  235 (314)
                      -||+...+.+.|.|+|++.++ +++.+.
T Consensus        24 ~P~q~~~l~v~i~N~s~~~~t-v~v~~~   50 (121)
T PF06030_consen   24 KPGQKQTLEVRITNNSDKEIT-VKVSAN   50 (121)
T ss_pred             CCCCEEEEEEEEEeCCCCCEE-EEEEEe
Confidence            589999999999999998876 566654


No 26 
>PF09624 DUF2393:  Protein of unknown function (DUF2393);  InterPro: IPR013417  The function of this protein is unknown. It is always found as part of a two-gene operon with IPR013416 from INTERPRO, a protein that appears to span the membrane seven times. It has so far been found in the bacteria Anabaena sp. (strain PCC 7120), Agrobacterium tumefaciens, Rhizobium meliloti, and Gloeobacter violaceus.
Probab=58.20  E-value=28  Score=29.39  Aligned_cols=38  Identities=29%  Similarity=0.323  Sum_probs=28.9

Q ss_pred             EEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEE
Q psy2082         196 LELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMV  234 (314)
Q Consensus       196 l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L  234 (314)
                      ..+++.-.+. ..-+|.+.|..+|.|.+++++++.++.+
T Consensus        48 ~~~~~~~~~~-l~~~~~~~v~g~V~N~g~~~i~~c~i~~   85 (149)
T PF09624_consen   48 IELTLTSQKR-LQYSESFYVDGTVTNTGKFTIKKCKITV   85 (149)
T ss_pred             ceEEEeeeee-eeeccEEEEEEEEEECCCCEeeEEEEEE
Confidence            3444444444 4458999999999999999999888766


No 27 
>PF07703 A2M_N_2:  Alpha-2-macroglobulin family N-terminal region;  InterPro: IPR011625 This is a domain of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; PDB: 2QKI_D 3L3O_D 3NMS_A 2ICF_A 2A73_A 2ICE_D 2HR0_A 2A74_A 2XWJ_G 3OHX_A ....
Probab=55.74  E-value=17  Score=29.70  Aligned_cols=30  Identities=27%  Similarity=0.366  Sum_probs=24.6

Q ss_pred             cEEEEEEECCccccCCCeEEEEEEEEcccc
Q psy2082         195 DLELEVTLDKQLYAHNEKIAVSICIRNNSN  224 (314)
Q Consensus       195 ~l~l~asldK~~Y~pGE~I~v~v~I~N~Ss  224 (314)
                      .++++++.+++.|.|||++.++++...+|.
T Consensus        94 ~~~v~l~~~~~~~~Pg~~~~~~i~~~~~s~  123 (136)
T PF07703_consen   94 ELKVELTASPDEYKPGEEVTLRIKAPPNSL  123 (136)
T ss_dssp             SSSEEEEESSSSBTTTSEEEEEEEESTTEE
T ss_pred             cceEEEEEecceeCCCCEEEEEEEeCCCCE
Confidence            456778889999999999999999955543


No 28 
>PF09478 CBM49:  Carbohydrate binding domain CBM49;  InterPro: IPR019028 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see [].  This domain is found at the C-terminal of cellulases and in vitro binding studies have shown it to binds to crystalline cellulose []. ; GO: 0030246 carbohydrate binding, 0005576 extracellular region
Probab=55.47  E-value=31  Score=26.18  Aligned_cols=39  Identities=21%  Similarity=0.309  Sum_probs=28.7

Q ss_pred             EEEEEECCccccCCCe-EEEEEEEEccccceeEEEEEEEE
Q psy2082         197 ELEVTLDKQLYAHNEK-IAVSICIRNNSNKVVKKIKAMVQ  235 (314)
Q Consensus       197 ~l~asldK~~Y~pGE~-I~v~v~I~N~Ssk~Vk~Ikv~L~  235 (314)
                      +++.++..+--..|.+ -...+.|.|+++++|+.+.+.+.
T Consensus         2 ~i~q~~~~sW~~~g~~y~qy~v~I~N~~~~~I~~~~i~~~   41 (80)
T PF09478_consen    2 TITQTLVNSWTENGQTYTQYDVTITNNGSKPIKSLKISID   41 (80)
T ss_pred             EEEEEEEeEEEeCCEEEEEEEEEEEECCCCeEEEEEEEEC
Confidence            3445555555555553 46799999999999999998886


No 29 
>PF04744 Monooxygenase_B:  Monooxygenase subunit B protein;  InterPro: IPR006833 Ammonia monooxygenase and the particulate methane monooxygenase are both integral membrane proteins, occurring in ammonia oxidisers and methanotrophs respectively, which are thought to be evolutionarily related []. These enzymes have a relatively wide substrate specificity and can catalyse the oxidation of a range of substrates including ammonia, methane, halogenated hydrocarbons and aromatic molecules []. These enzymes are composed of 3 subunits - A (IPR003393 from INTERPRO), B (IPR006833 from INTERPRO) and C (IPR006980 from INTERPRO) - and contain various metal centres, including copper. Particulate methane monooxygenase from Methylococcus capsulatus str. Bath is an ABC homotrimer, which contains mononuclear and dinuclear copper metal centres, and a third metal centre containing a metal ion whose identity in vivo is not certain[]. The soluble regions of these enzymes derive primarily from the B subunit. This subunit forms two antiparallel beta-barrel-like structures and contains the mono- and di- nuclear copper metal centres [].; PDB: 3CHX_E 3RFR_A 3RGB_A 1YEW_A.
Probab=54.31  E-value=14  Score=36.59  Aligned_cols=42  Identities=14%  Similarity=0.298  Sum_probs=32.3

Q ss_pred             EEEEECCccc-cCCCeEEEEEEEEccccceeEEEEEEEEEeEEEEE
Q psy2082         198 LEVTLDKQLY-AHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDVVL  242 (314)
Q Consensus       198 l~asldK~~Y-~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~~~  242 (314)
                      +++.+.+..| +||-++.++++|+|++++.|+   +.=.|.+.+++
T Consensus       249 V~~~v~~A~Y~vpgR~l~~~l~VtN~g~~pv~---LgeF~tA~vrF  291 (381)
T PF04744_consen  249 VKVKVTDATYRVPGRTLTMTLTVTNNGDSPVR---LGEFNTANVRF  291 (381)
T ss_dssp             EEEEEEEEEEESSSSEEEEEEEEEEESSS-BE---EEEEESSS-EE
T ss_pred             eEEEEeccEEecCCcEEEEEEEEEcCCCCceE---eeeEEeccEEE
Confidence            7888888888 899999999999999999865   34455666654


No 30 
>PF00963 Cohesin:  Cohesin domain;  InterPro: IPR002102 Cohesin domains interact with a complementary domain, termed the dockerin domain (see IPR002105 from INTERPRO). The cohesin-dockerin interaction is the crucial interaction for complex formation in the cellulosome []. The scaffoldin component of the cellulolytic bacterium Clostridium thermocellum is a non-hydrolytic protein which organises the hydrolytic enzymes in a large complex, called the cellulosome. Scaffoldin comprises a series of functional domains, amongst which is a single cellulose-binding domain and nine cohesin domains which are responsible for integrating the individual enzymatic subunits into the complex.; GO: 0030246 carbohydrate binding, 0000272 polysaccharide catabolic process; PDB: 2BM3_A 3P0D_I 3KCP_A 2B59_A 3L8Q_B 3FNK_C 3GHP_B 2CCL_A 1ANU_A 1OHZ_A ....
Probab=51.86  E-value=43  Score=27.75  Aligned_cols=37  Identities=16%  Similarity=0.294  Sum_probs=29.3

Q ss_pred             EEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082         198 LEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQ  235 (314)
Q Consensus       198 l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~  235 (314)
                      +++++++.--.+||++.|.+.++|.++. |..+.+.|.
T Consensus         1 v~l~~~~~~a~~G~tv~V~V~v~~~~~~-i~~~~~~l~   37 (141)
T PF00963_consen    1 VTLSVDSVSAKPGETVTVPVNVSNVSNS-IAGMQFTLS   37 (141)
T ss_dssp             EEEEESECEE-TTSEEEEEEEEESCTTT-EEEEEEEEE
T ss_pred             CEEEeCCceECCCCEEEEEEEEEcCCCc-EEEEEEEEE
Confidence            4667788777999999999999999866 777776663


No 31 
>PF05326 SVA:  Seminal vesicle autoantigen (SVA);  InterPro: IPR007990 This family consists of seminal vesicle autoantigen and prolactin-inducible (PIP) proteins. Seminal vesicle autoantigen (SVA) is specifically present in the seminal plasma of mice. This 19 kDa secretory glycoprotein suppresses the motility of spermatozoa by interacting with phospholipid. PIP has several known functions. In saliva, this protein plays a role in host defence by binding to microorganisms such as Streptococcus. PIP is an aspartyl proteinase and it acts as a factor capable of suppressing T-cell apoptosis through its interaction with CD4 [].; GO: 0005576 extracellular region; PDB: 3ES6_B.
Probab=47.84  E-value=37  Score=28.50  Aligned_cols=61  Identities=13%  Similarity=0.176  Sum_probs=31.6

Q ss_pred             eCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEEEeEEEEEEecCeEEEEEEE
Q psy2082         192 SPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDVVLFQNGQFRHSVDC  254 (314)
Q Consensus       192 s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~~~~~~~~~~~~Va~  254 (314)
                      +..+|.|+..+++.. .+||.+.+.++|.|+=..-+- ||+.|.-...+.+..+-.|+.+++.
T Consensus        30 ~~k~l~l~l~~~~~~-~~~eev~v~l~V~t~~~eCmv-Vk~yl~sn~~i~~~fny~YTaCLC~   90 (124)
T PF05326_consen   30 SRKPLSLNLQVPQTA-KANEEVTVTLTVTTELRECMV-VKIYLESNPPIDGSFNYKYTACLCD   90 (124)
T ss_dssp             -----EE-----SEE--SS--EEEEEEEEE--SS-EE-EEEEEEESS---SGGG-EEEEEE-S
T ss_pred             cCccEEEEeecCCCC-CCCCEEEEEEEEEcchheeEE-EEEEeccCCCccccccceEEEEeCC
Confidence            457899999999988 899999999999999877765 7888876666655445667766753


No 32 
>cd00258 GM2-AP GM2 activator protein (GM2-AP) is a non-enzymatic lysosomal protein that acts as cofactor in the sequential degradation of gangliosides. GM2A is an essential cofactor for beta-hexosaminidase A (Hex A) in the enzymatic hydrolysis of GM2 ganglioside to GM3. Mutation of the gene results in the AB variant of Tay-Sachs disease. GM2-AP and similar proteins belong to the ML domain family.
Probab=47.19  E-value=72  Score=28.05  Aligned_cols=78  Identities=15%  Similarity=0.188  Sum_probs=45.4

Q ss_pred             EEEEEEEeeecccccccccCccccceEEEEEEEEecCCCCCCCCcchhhh--hhhccCCCeeEeEE-eeCCC-CCCCeEE
Q psy2082          52 VFGQVVCSFRYGREEDEILGLNFQKELYLASEQIYPRSEKQHTSLTKMQD--CLLKKLGPKAYPFT-FNITP-AAPPSVT  127 (314)
Q Consensus        52 v~~~~~~~fryG~~d~dv~gl~f~k~l~~~~~qv~p~~~~~~~~lt~lq~--~l~~k~G~h~fPF~-f~LP~-~lPsSf~  127 (314)
                      +|+.+=|...+|        .....|+.--..++.|+.+.=..++...+-  +=--++|.|+.|=+ |.||+ +||+...
T Consensus        62 ~Wi~iPC~~~~G--------SCty~d~C~~l~~~~~~~~~Cp~~l~~~g~pC~CP~~~G~y~lp~s~f~lP~~~LPs~l~  133 (162)
T cd00258          62 LWMKIPCLDNIG--------SCTYDNACDLLDTLIPPGQQCPEPLRTYGLPCHCPFKEGVYSLPDSTFTLPNVDLPSWLT  133 (162)
T ss_pred             EEEEcccccccC--------cccccchhhchhcccCCCCCCCchhhccCCccCCCCCCcceEccceeeecccccCCCccC
Confidence            355666655443        344555555666777776522223321110  01145899998654 69997 6887764


Q ss_pred             ecCCCCCCCCCeEEEEEEEEEE
Q psy2082         128 LQPGPDETGEPCGVTYYVKLFV  149 (314)
Q Consensus       128 lq~~~~~~G~~g~V~Y~Vka~i  149 (314)
                             .|     .|.+++.+
T Consensus       134 -------~G-----~Y~i~~~l  143 (162)
T cd00258         134 -------NG-----NYRITGIL  143 (162)
T ss_pred             -------CC-----cEEEEEEE
Confidence                   24     39999986


No 33 
>smart00809 Alpha_adaptinC2 Adaptin C-terminal domain. Adaptins are components of the adaptor complexes which link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. Gamma-adaptin is a subunit of the golgi adaptor. Alpha adaptin is a heterotetramer that regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This Ig-fold domain is found in alpha, beta and gamma adaptins and consists of a beta-sandwich containing 7 strands in 2 beta-sheets in a greek-key topology PUBMED:10430869, PUBMED:12176391. The adaptor appendage contains an additional N-terminal strand.
Probab=45.15  E-value=72  Score=24.71  Aligned_cols=40  Identities=10%  Similarity=0.253  Sum_probs=31.9

Q ss_pred             eCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082         192 SPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQ  235 (314)
Q Consensus       192 s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~  235 (314)
                      .+.-|++.+.+.+.    +..+.+.+...|.|...+.++.+.+.
T Consensus         3 ~~~~l~I~~~~~~~----~~~~~i~~~~~N~s~~~it~f~~~~a   42 (104)
T smart00809        3 EKNGLQIGFKFERR----PGLIRITLTFTNKSPSPITNFSFQAA   42 (104)
T ss_pred             cCCCEEEEEEEEcC----CCeEEEEEEEEeCCCCeeeeEEEEEE
Confidence            45567888888765    45689999999999999999887664


No 34 
>TIGR03079 CH4_NH3mon_ox_B methane monooxygenase/ammonia monooxygenase, subunit B. Both ammonia oxidizers such as Nitrosomonas europaea and methanotrophs (obligate methane oxidizers) such as Methylococcus capsulatus each can grow only on their own characteristic substrate. However, both groups have the ability to oxidize both substrates, and so the relevant enzymes must be named here according to their ability to oxidze both. The protein family represented here reflects subunit B of both the particulate methane monooxygenase of methylotrophs and the ammonia monooxygenase of nitrifying bacteria.
Probab=45.06  E-value=30  Score=34.26  Aligned_cols=47  Identities=15%  Similarity=0.247  Sum_probs=33.9

Q ss_pred             CcEEEEEEECCccc-cCCCeEEEEEEEEccccceeEEEEEEEEEeEEEEEE
Q psy2082         194 GDLELEVTLDKQLY-AHNEKIAVSICIRNNSNKVVKKIKAMVQQGVDVVLF  243 (314)
Q Consensus       194 G~l~l~asldK~~Y-~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~~~~~~~  243 (314)
                      ++=.+++.+.+..| +||-.+.++++|+|++++.|+   +.=.+.+.+++-
T Consensus       264 ~~~~V~~kv~~a~Y~VPGR~l~~~~~VTN~g~~~vr---lgEF~TA~vRFl  311 (399)
T TIGR03079       264 APNPVSINVTKANYDVPGRALRVTMEITNNGDQVIS---IGEFTTAGIRFM  311 (399)
T ss_pred             CCCceEEEEeccEEecCCcEEEEEEEEEcCCCCceE---EEeEeecceEee
Confidence            34456666667666 999999999999999998865   344455555543


No 35 
>COG5554 NifU Nitrogen fixation protein [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=44.48  E-value=61  Score=23.87  Aligned_cols=39  Identities=31%  Similarity=0.584  Sum_probs=29.2

Q ss_pred             eEEEeecCCCcEEEEecceeeEecCCCceeeeeEEEEecceeeceeEEE
Q psy2082           6 KVFKKCSPNGKLTLYMGKRDFVDYVTGVEPIDGVVVFEEDYIQDRKVFG   54 (314)
Q Consensus         6 ~vfkk~s~n~~~t~yl~~rd~~d~~~~vd~v~GvV~vd~~~lk~rkv~~   54 (314)
                      ||.=.....| |+.|+.|.|.    .  +||   |.|+.+.+-|-+|..
T Consensus         2 kVmiR~t~ag-LsaYiPKKDL----E--Epi---i~V~nedlwGG~ilL   40 (69)
T COG5554           2 KVMIRVTDAG-LSAYIPKKDL----E--EPI---IVVKNEDLWGGKILL   40 (69)
T ss_pred             eEEEEEcCCc-eEEEcccccc----c--cce---EEEeccccccceEEe
Confidence            3444455667 9999999984    3  565   889999999988874


No 36 
>PF13199 Glyco_hydro_66:  Glycosyl hydrolase family 66; PDB: 3VMO_A 3VMN_A 3VMP_A.
Probab=44.21  E-value=38  Score=35.45  Aligned_cols=27  Identities=26%  Similarity=0.437  Sum_probs=17.4

Q ss_pred             EECCccccCCCeEEEEEEEEcccccee
Q psy2082         201 TLDKQLYAHNEKIAVSICIRNNSNKVV  227 (314)
Q Consensus       201 sldK~~Y~pGE~I~v~v~I~N~Ssk~V  227 (314)
                      +.||..|.|||+|.++++..|.+....
T Consensus         1 ~tDKA~Y~PGe~V~l~~~~~~~~~~~~   27 (559)
T PF13199_consen    1 TTDKARYRPGEKVTLTASLKNTTGSDF   27 (559)
T ss_dssp             EES-SSB-TTS-EEEE-EEE--SSS-E
T ss_pred             CCCcceeCCCCeEEEEEEeccCccccc
Confidence            579999999999999999999976653


No 37 
>PF02014 Reeler:  Reeler domain Schematic picture including Reeler domain;  InterPro: IPR002861 Extracellular matrix (ECM) proteins play an important role in early cortical development, specifically in the formation of neural connections and in controlling the cyto-architecture of the central nervous system. The product of the reeler gene in mouse is reelin,a large extracellular protein secreted by pioneer neurons that coordinates cell positioning during neurodevelopment []. F-spondin and mindin are a family of matrix-attached adhesion molecules that share structural similarities and overlapping domains of expression. Both F-spondin and mindin promote adhesion and outgrowth of hippocampal embryonic neurons and bind to a putative receptor(s) expressed on both hippocampal and sensory neurons []. This domain of unknown function is found at the N terminus of reelin and F-spondin.; PDB: 2ZOT_B 2ZOU_B 3COO_A.
Probab=43.13  E-value=53  Score=27.08  Aligned_cols=38  Identities=5%  Similarity=0.188  Sum_probs=25.2

Q ss_pred             CcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082         194 GDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQ  235 (314)
Q Consensus       194 G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~  235 (314)
                      ++..|.  ++.+.|.|||.+.|++  .+.++...+.+-+...
T Consensus        19 ~py~i~--~~~~~y~pg~~~~Vtl--~~~~~~~F~GFllqAr   56 (132)
T PF02014_consen   19 SPYSIS--VSPSSYEPGQTYTVTL--SSSGSSSFRGFLLQAR   56 (132)
T ss_dssp             TSEEEE--ET-SSB-TTBEEEEEE--EETTTEEBSEEEEEEE
T ss_pred             CCEEEE--eCCCeEcCCCEEEEEE--ECCCCCceeEEEEEEE
Confidence            444444  4489999999999998  6666777777555443


No 38 
>PF07703 A2M_N_2:  Alpha-2-macroglobulin family N-terminal region;  InterPro: IPR011625 This is a domain of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; PDB: 2QKI_D 3L3O_D 3NMS_A 2ICF_A 2A73_A 2ICE_D 2HR0_A 2A74_A 2XWJ_G 3OHX_A ....
Probab=43.05  E-value=31  Score=28.02  Aligned_cols=25  Identities=24%  Similarity=0.319  Sum_probs=20.5

Q ss_pred             EEEEECCccccCCCeEEEEEEEEcc
Q psy2082         198 LEVTLDKQLYAHNEKIAVSICIRNN  222 (314)
Q Consensus       198 l~asldK~~Y~pGE~I~v~v~I~N~  222 (314)
                      |++.+||..|.+||++.+.+.-.-.
T Consensus         1 l~i~~~~~~~~~Ge~~~v~v~~~~~   25 (136)
T PF07703_consen    1 LQISTDKDSYKPGETAKVTVQSPFP   25 (136)
T ss_dssp             EEEEE-SSSB-TTSEEEEEEEEESC
T ss_pred             CEEEcCCCCcCCCCEEEEEEEcCCC
Confidence            6789999999999999999987766


No 39 
>PF06159 DUF974:  Protein of unknown function (DUF974);  InterPro: IPR010378 This is a family of uncharacterised eukaryotic proteins.
Probab=41.43  E-value=37  Score=31.56  Aligned_cols=29  Identities=24%  Similarity=0.401  Sum_probs=26.1

Q ss_pred             ccCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082         207 YAHNEKIAVSICIRNNSNKVVKKIKAMVQ  235 (314)
Q Consensus       207 Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~  235 (314)
                      .|-||+-.-.+++.|.|+..|+.+.+++.
T Consensus        10 iylGEtF~~~l~~~N~s~~~v~~v~ikve   38 (249)
T PF06159_consen   10 IYLGETFSCYLSVNNDSNKPVRNVRIKVE   38 (249)
T ss_pred             EeecCCEEEEEEeecCCCCceEEeEEEEE
Confidence            46799999999999999999999888775


No 40 
>PF04426 Bul1_C:  Bul1 C terminus;  InterPro: IPR022794  This domain contains the C terminus of Saccharomyces cerevisiae Bul1. Bul1 binds the ubiquitin ligase Rsp5, via an N-terminal PPSY motif (157-160 in P48524 from SWISSPROT) []. The complex containing Bul1 and Rsp5 is involved in intracellular trafficking of the general amino acid permease Gap1 [], degradation of Rog1 in cooperation with Bul2 and GSK-3 [], and mitochondrial inheritance []. Bul1 may contain HEAT repeats. This domain is found C-terminal to PF04425 from PFAM. This signature also matches the C terminus of some bacterial Methyl-accepting chemotaxis proteins.
Probab=39.35  E-value=62  Score=30.72  Aligned_cols=36  Identities=19%  Similarity=0.360  Sum_probs=27.2

Q ss_pred             CCCCCceeEEEEEEEEEEEEECCCCceeEEEEeEEEe
Q psy2082         262 PDPKDAFGIVVSYMVKVKLYLGALGGELSAELPFILM  298 (314)
Q Consensus       262 ~~~~~~~~I~VsY~vkV~l~v~~~~~~v~velPi~I~  298 (314)
                      |+...| .+-=.|+|+|.+.++..++.+.+++|+.|.
T Consensus       235 PsFqSC-l~~R~Y~lrv~ik~~~~~~~~~l~VPv~i~  270 (272)
T PF04426_consen  235 PSFQSC-LCSRLYYLRVNIKFKNHGGSVSLKVPVQIQ  270 (272)
T ss_pred             CCchhe-eeeeEEEEEEEEEEcCCCCEEEEEeceEEe
Confidence            455444 455579999999995445679999999885


No 41 
>COG4326 Spo0M Sporulation control protein [General function prediction only]
Probab=39.31  E-value=28  Score=31.99  Aligned_cols=20  Identities=20%  Similarity=0.622  Sum_probs=17.7

Q ss_pred             CC-CeeEeEEeeCCCCCCCeE
Q psy2082         107 LG-PKAYPFTFNITPAAPPSV  126 (314)
Q Consensus       107 ~G-~h~fPF~f~LP~~lPsSf  126 (314)
                      || +|.|||+|.||.+.|-++
T Consensus       104 pgEe~~fpf~l~lP~~tPvT~  124 (270)
T COG4326         104 PGEERNFPFELSLPWNTPVTI  124 (270)
T ss_pred             CCceEeccEEEecCCCCceee
Confidence            45 599999999999999887


No 42 
>COG2373 Large extracellular alpha-helical protein [General function prediction only]
Probab=38.50  E-value=1.5e+02  Score=35.06  Aligned_cols=43  Identities=14%  Similarity=0.303  Sum_probs=38.0

Q ss_pred             eeCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEE
Q psy2082         191 LSPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAM  233 (314)
Q Consensus       191 ~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~  233 (314)
                      |-.-.+.++++++|..+.+|+++.+++...|.+...+.+-++.
T Consensus       493 f~p~r~~i~l~~~k~~~~~g~~v~~~v~~~yL~GaPa~g~~~~  535 (1621)
T COG2373         493 FIPDRFKINLTLDKTEWVPGKDVKIKVDLRYLYGAPAAGLTVQ  535 (1621)
T ss_pred             hCCceEEEecccccccccCCCcEEEEEEEEecCCCcccCceee
Confidence            4556788999999999999999999999999999888877766


No 43 
>KOG0452|consensus
Probab=37.53  E-value=35  Score=35.84  Aligned_cols=88  Identities=22%  Similarity=0.368  Sum_probs=59.3

Q ss_pred             EEEEEEccCCccCccccceEEEeeeeccCCCCCCCCCCeEEEeEEEEeeCCcEEEEEEECCccccCCCeEEEEEEEEccc
Q psy2082         144 YVKLFVGENETDRSHRRSTVALGIRKIQYAPSKNGRQPCTVVRKDFMLSPGDLELEVTLDKQLYAHNEKIAVSICIRNNS  223 (314)
Q Consensus       144 ~Vka~i~~~~~dr~~K~~~v~l~irkl~~~p~~~~~~p~~e~~k~f~~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~S  223 (314)
                      -|||.|+++- +|+|+.|-+-|.|-.++|-|-..     +++    +=-+|.=.-.+.||.++..||+.|.|+.  +|..
T Consensus       794 GvKAViaeS~-ErIHrsnLvGmGIiPl~f~~Ge~-----Adt----LgLtG~E~yti~lP~~~lkPgq~i~v~~--dtGk  861 (892)
T KOG0452|consen  794 GVKAVIAESY-ERIHRSNLVGMGIIPLQFLPGED-----ADT----LGLTGRERYTIHLPENILKPGQDITVTT--DTGK  861 (892)
T ss_pred             hhHHHHHHHH-HHHHhhccccceeeeeeecCCCC-----hhh----cCcccceeEEEECCcccCCCCceEEEEe--cCCc
Confidence            4677776654 79999998889998899877433     121    1124666677889999999999987653  3332


Q ss_pred             cceeEEEEEEEEEeEEEEEEecC
Q psy2082         224 NKVVKKIKAMVQQGVDVVLFQNG  246 (314)
Q Consensus       224 sk~Vk~Ikv~L~Q~~~~~~~~~~  246 (314)
                        . =...+++.--+++..|.++
T Consensus       862 --~-F~~~~rFdteVeltyy~~G  881 (892)
T KOG0452|consen  862 --V-FVCTLRFDTEVELTYYKNG  881 (892)
T ss_pred             --E-EEEEEEecceEEEEEEecC
Confidence              2 2234566666777777665


No 44 
>PF02883 Alpha_adaptinC2:  Adaptin C-terminal domain;  InterPro: IPR008152 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. AP (adaptor protein) complexes are found in coated vesicles and clathrin-coated pits. AP complexes connect cargo proteins and lipids to clathrin at vesicle budding sites, as well as binding accessory proteins that regulate coat assembly and disassembly (such as AP180, epsins and auxilin). There are different AP complexes in mammals. AP1 is responsible for the transport of lysosomal hydrolases between the TGN and endosomes []. AP2 associates with the plasma membrane and is responsible for endocytosis []. AP3 is responsible for protein trafficking to lysosomes and other related organelles []. AP4 is less well characterised. AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). For example, in AP1 these subunits are gamma-1-adaptin, beta-1-adaptin, mu-1 and sigma-1, while in AP2 they are alpha-adaptin, beta-2-adaptin, mu-2 and sigma-2. Each subunit has a specific function. Adaptins recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal ear (appendage) domains. Mu recognises tyrosine-based sorting signals within the cytoplasmic domains of transmembrane cargo proteins []. One function of clathrin and AP2 complex-mediated endocytosis is to regulate the number of GABA(A) receptors available at the cell surface [].  GGAs (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) are a family of monomeric clathrin adaptor proteins that are conserved from yeasts to humans. GGAs regulate clathrin-mediated the transport of proteins (such as mannose 6-phosphate receptors) from the TGN to endosomes and lysosomes through interactions with TGN-sorting receptors, sometimes in conjunction with AP-1 [, ]. GGAs bind cargo, membranes, clathrin and accessory factors. GGA1, GGA2 and GGA3 all contain a domain homologous to the ear domain of gamma-adaptin. GGAs are composed of a single polypeptide with four domains: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The VHS domain is responsible for endocytosis and signal transduction, recognising transmembrane cargo through the ACLL sequence in the cytoplasmic domains of sorting receptors []. The GAT domain (also found in Tom1 proteins) interacts with ARF (ADP-ribosylation factor) to regulate membrane trafficking [], and with ubiquitin for receptor sorting []. The hinge region contains a clathrin box for recognition and binding to clathrin, similar to that found in AP adaptins. The GAE domain is similar to the AP gamma-adaptin ear domain, and is responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis [].  This entry represents a beta-sandwich structural motif found in the appendage (ear) domain of alpha-, beta- and gamma-adaptin from AP clathrin adaptor complexes, and the GAE (gamma-adaptin ear) domain of GGA adaptor proteins. These domains have an immunoglobulin-like beta-sandwich fold containing 7 or 8 strands in 2 beta-sheets in a Greek key topology [, ]. Although these domains share a similar fold, there is little sequence identity between the alpha/beta-adaptins and gamma-adaptin/GAE. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030131 clathrin adaptor complex; PDB: 3MNM_B 3ZY7_B 1GYU_A 1GYW_B 2A7B_A 1GYV_A 2E9G_A 1E42_B 2G30_A 2IV9_B ....
Probab=35.03  E-value=1.5e+02  Score=23.46  Aligned_cols=44  Identities=9%  Similarity=0.194  Sum_probs=35.1

Q ss_pred             EEeeCCcEEEEEEECCccccCCCeEEEEEEEEccccceeEEEEEEE
Q psy2082         189 FMLSPGDLELEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMV  234 (314)
Q Consensus       189 f~~s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L  234 (314)
                      .++.+.-|.+.+...+  -..+..+.+.+...|.|+..+.++.+.+
T Consensus         4 ~~ye~~~l~I~~~~~~--~~~~~~~~i~~~f~N~s~~~it~f~~q~   47 (115)
T PF02883_consen    4 VLYEDNGLQIGFKSEK--SPNPNQGRIKLTFGNKSSQPITNFSFQA   47 (115)
T ss_dssp             EEEEETTEEEEEEEEE--CCETTEEEEEEEEEE-SSS-BEEEEEEE
T ss_pred             EEEeCCCEEEEEEEEe--cCCCCEEEEEEEEEECCCCCcceEEEEE
Confidence            3567778888888887  5668899999999999999999988765


No 45 
>PRK12676 bifunctional inositol-1 monophosphatase/fructose-1,6-bisphosphatase; Reviewed
Probab=34.61  E-value=27  Score=32.37  Aligned_cols=49  Identities=16%  Similarity=0.243  Sum_probs=37.4

Q ss_pred             CCceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEEE
Q psy2082          31 TGVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLASE   83 (314)
Q Consensus        31 ~~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~~   83 (314)
                      =.||||||.-    .+++|.-.|+...+-++-|+-..=|+-..+..++|.+..
T Consensus        84 WvIDPiDGT~----nfv~g~p~~~vsial~~~g~p~~gvV~~P~~~e~~~A~~  132 (263)
T PRK12676         84 VVLDPLDGTY----NAINGIPFYAISIAVFKGGKPVYGYVYNLATGDFYEAIP  132 (263)
T ss_pred             EEEeccCCch----HHhcCCCceEEEEEEEECCeEEEEEEEecCCCCEEEEEC
Confidence            3578888865    899999999977777776766666777788888888653


No 46 
>KOG2540|consensus
Probab=34.55  E-value=74  Score=29.48  Aligned_cols=28  Identities=18%  Similarity=0.267  Sum_probs=24.3

Q ss_pred             Cccc-cCCCeEEEEEEEEccccceeEEEE
Q psy2082         204 KQLY-AHNEKIAVSICIRNNSNKVVKKIK  231 (314)
Q Consensus       204 K~~Y-~pGE~I~v~v~I~N~Ssk~Vk~Ik  231 (314)
                      +++| .|||+...--+..|.|.+.|-+|.
T Consensus       155 rEiyV~PGEtALaFYta~N~sdkpIiGvs  183 (269)
T KOG2540|consen  155 REIYVLPGETALAFYTAENPSDKPIIGVS  183 (269)
T ss_pred             eEEEEcCCcceeeeEeccCCCCCCceeeE
Confidence            3456 799999999999999999998865


No 47 
>cd08548 Type_I_cohesin_like Type I cohesin domain, interaction partner of dockerin. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesion modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. This subfamily represents type I cohesins; their interactions with dockerin mediate assembly of a range of dockerin-borne enzymes to the complex.
Probab=33.79  E-value=91  Score=26.09  Aligned_cols=39  Identities=13%  Similarity=0.176  Sum_probs=32.1

Q ss_pred             EEEECCccccCCCeEEEEEEEEccccceeEEEEEEEEEe
Q psy2082         199 EVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQG  237 (314)
Q Consensus       199 ~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q~  237 (314)
                      ++.+.+--=.+||++.|-|.++|..++.|..+.+.|.--
T Consensus         2 ~v~ig~v~~~~G~tv~VpV~~~~v~~~~i~~~~f~l~yD   40 (135)
T cd08548           2 EVKIGSVSGKPGDTVTVPVTLSNVPSKGIGACDFVLSYD   40 (135)
T ss_pred             eEEeccEEecCCCEEEEEEEEecCCccCEEEEEEEEEeC
Confidence            556666666799999999999999998899888877543


No 48 
>PF13595 DUF4138:  Domain of unknown function (DUF4138)
Probab=33.31  E-value=58  Score=30.37  Aligned_cols=36  Identities=22%  Similarity=0.484  Sum_probs=29.8

Q ss_pred             EEEEECCccccCCCeEEEEEEEEccccce--eEEEEEEE
Q psy2082         198 LEVTLDKQLYAHNEKIAVSICIRNNSNKV--VKKIKAMV  234 (314)
Q Consensus       198 l~asldK~~Y~pGE~I~v~v~I~N~Ssk~--Vk~Ikv~L  234 (314)
                      |++.|. .+|++|+-+-+.++|.|.|+.+  |.-|++.+
T Consensus       130 i~~~L~-~Iy~~~d~lyf~~~i~N~S~i~ydID~irf~i  167 (246)
T PF13595_consen  130 IQFRLN-NIYVHGDYLYFHLSIKNKSNIPYDIDFIRFKI  167 (246)
T ss_pred             eEEEEE-eEEEECCEEEEEEEEEcCCCCCeEEeEEEEEE
Confidence            555554 6999999999999999999865  77777765


No 49 
>cd08544 Reeler Reeler, the N-terminal domain of reelin, F-spondin, and a variety of other proteins. This domain is found at the N-terminus of F-spondin, a protein attached to the extracellular matrix, which plays roles in neuronal development and vascular remodelling. The F-spondin reeler domain has been reported to bind heparin. The reeler domain is also found at the N-terminus of reelin, an extracellular glycoprotein involved in the development of the brain cortex, and in a variety of other eukaryotic proteins with different domain architectures, including the animal ferric-chelate reductase 1 or stromal cell-derived receptor 2, a member of the cytochrome B561 family, which reduces ferric iron before its transport from the endosome to the cytoplasm. Also included is the insect putative defense protein 1, which is expressed upon bacterial infection and appears to contain a single reeler domain.
Probab=32.64  E-value=89  Score=25.71  Aligned_cols=37  Identities=11%  Similarity=0.179  Sum_probs=26.0

Q ss_pred             EEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEEE
Q psy2082         198 LEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQQ  236 (314)
Q Consensus       198 l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~Q  236 (314)
                      ..++++...|.|||.+.|++.-.|.  ..-+.+-+....
T Consensus        21 y~i~~~~~~y~pG~~~~Vtl~~~~~--~~F~GF~lqAr~   57 (135)
T cd08544          21 YSITISGNSYVPGETYTVTLSGSSP--SPFRGFLLQARD   57 (135)
T ss_pred             EEEEeCCCEECCCCEEEEEEECCCC--CceeEEEEEEEc
Confidence            3445666699999999999988776  455555544443


No 50 
>COG4326 Spo0M Sporulation control protein [General function prediction only]
Probab=32.50  E-value=1.2e+02  Score=27.95  Aligned_cols=43  Identities=7%  Similarity=0.218  Sum_probs=35.4

Q ss_pred             eCCcEEEEEEECCccccCCCeEEEEEEEEcccc-ceeEEEEEEE
Q psy2082         192 SPGDLELEVTLDKQLYAHNEKIAVSICIRNNSN-KVVKKIKAMV  234 (314)
Q Consensus       192 s~G~l~l~asldK~~Y~pGE~I~v~v~I~N~Ss-k~Vk~Ikv~L  234 (314)
                      .=|.-++.--|.+++|+|||.+...+.|.-.-. .+|..|.++|
T Consensus        29 GIGaAKVDt~L~~~~~~PG~~v~g~vhv~GG~~AQdI~~I~LkL   72 (270)
T COG4326          29 GIGAAKVDTVLQQEVLYPGQSVKGIVHVYGGATAQDIDNIELKL   72 (270)
T ss_pred             CcchhhhhhhhhhccccCCceEEEEEEEecCchHhhhhhhhhhh
Confidence            346777888899999999999999999987763 5688887777


No 51 
>cd01641 Bacterial_IMPase_like_1 Predominantly bacterial family of Mg++ dependend phosphatases, related to inositol monophosphatases. These enzymes may dephosphorylate fructose-1,6-bisphosphate, inositol monophospate, 3'-phosphoadenosine-5'-phosphate,  or similar substrates.
Probab=30.34  E-value=37  Score=31.03  Aligned_cols=46  Identities=22%  Similarity=0.460  Sum_probs=35.9

Q ss_pred             ceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082          33 VEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS   82 (314)
Q Consensus        33 vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~   82 (314)
                      ||||||.-    .+++|.--|....+-++-|+-..=++...+..++|.+.
T Consensus        77 iDPIDGT~----nf~~g~p~~~vsial~~~g~p~~gvV~~P~~~~~~~A~  122 (248)
T cd01641          77 LDPIDGTK----SFIRGLPVWGTLIALLHDGRPVLGVIDQPALGERWIGA  122 (248)
T ss_pred             EecCcCch----hHhcCCCceEEEEEEEECCEEEEEEEccCccCCEEEEe
Confidence            79999965    88999888887666666676666677778888888864


No 52 
>PF06988 NifT:  NifT/FixU protein;  InterPro: IPR009727 This family consists of several NifT and FixU bacterial proteins. The function of NifT is unknown although it is thought that the protein may be involved in biosynthesis of the FeMo cofactor of nitrogenase although perturbation of nifT expression in Klebsiella pneumoniae has only a limited effect on nitrogen fixation [].; GO: 0009399 nitrogen fixation; PDB: 2JN4_A.
Probab=30.27  E-value=50  Score=24.56  Aligned_cols=21  Identities=29%  Similarity=0.479  Sum_probs=13.6

Q ss_pred             EEEeecCCCcEEEEecceeeE
Q psy2082           7 VFKKCSPNGKLTLYMGKRDFV   27 (314)
Q Consensus         7 vfkk~s~n~~~t~yl~~rd~~   27 (314)
                      |.=.-..+|.|++|+.|+|.=
T Consensus         3 VmiR~~~~G~ls~YVpKKDLE   23 (64)
T PF06988_consen    3 VMIRKNGAGGLSAYVPKKDLE   23 (64)
T ss_dssp             EEEEE-SS--EEEEETTTTEE
T ss_pred             EEEEeCCCcCEEEEEeCCccc
Confidence            334456789999999999953


No 53 
>cd01639 IMPase IMPase, inositol monophosphatase and related domains. A family of Mg++ dependent phosphatases, inhibited by lithium, many of which may act on inositol monophosphate substrate. They dephosphorylate inositol phosphate to generate inositol, which may be recycled into inositol lipids; in eukaryotes IMPase plays a vital role in intracellular signaling. IMPase is one of the proposed targets of Li+ therapy in manic-depressive illness. This family contains some bacterial members of the inositol monophosphatase family classified as SuhB-like. E. coli SuhB has been suggested to participate in posstranscriptional control of gene expression, and its inositol monophosphatase activity doesn't appear to be sufficient for its cellular function. It has been proposed, that SuhB plays a role in the biosynthesis of phosphatidylinositol in mycobacteria.
Probab=30.25  E-value=34  Score=31.09  Aligned_cols=47  Identities=17%  Similarity=0.299  Sum_probs=35.6

Q ss_pred             CceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082          32 GVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS   82 (314)
Q Consensus        32 ~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~   82 (314)
                      .||||||.-    .+++|.--|....+-++-|+-..=++...+..++|...
T Consensus        79 vIDPIDGT~----nf~~g~p~~~vsial~~~g~p~~gvV~~P~~~~~~~a~  125 (244)
T cd01639          79 IIDPLDGTT----NFVHGFPHFAVSIALAVKGEPVVGVVYDPIRNELFTAV  125 (244)
T ss_pred             EEecccChh----HHhcCCCcEEEEEEEEECCEEEEEEEEeCCCCcEEEEE
Confidence            478999964    89999999997777777676556667777777777764


No 54 
>PF11355 DUF3157:  Protein of unknown function (DUF3157);  InterPro: IPR021501  This family of proteins with unknown function appears to be restricted to Gammaproteobacteria. 
Probab=30.05  E-value=1.3e+02  Score=27.36  Aligned_cols=36  Identities=19%  Similarity=0.401  Sum_probs=29.2

Q ss_pred             EEEEECCccccCCCeEEEEEEEEccccceeEEEEEEE
Q psy2082         198 LEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMV  234 (314)
Q Consensus       198 l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L  234 (314)
                      +++.|...-|-.| ..-+...+.|+|+..|-.|.+.+
T Consensus       100 VdV~l~~~~y~~~-~L~l~~~ltnqSsqsVv~Vel~v  135 (199)
T PF11355_consen  100 VDVSLGASQYEDG-QLGLPFSLTNQSSQSVVLVELEV  135 (199)
T ss_pred             eeEEEeccceeCC-eEEEEEEEecCCCceEEEEEEEE
Confidence            6677777666666 99999999999999998777655


No 55 
>PRK10757 inositol monophosphatase; Provisional
Probab=29.26  E-value=37  Score=31.49  Aligned_cols=47  Identities=23%  Similarity=0.419  Sum_probs=35.2

Q ss_pred             CceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082          32 GVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS   82 (314)
Q Consensus        32 ~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~   82 (314)
                      .||||||..    .+++|.--|....+-++-|+-..=|+-..+..++|.+.
T Consensus        82 iIDPIDGT~----nf~~g~p~~~vsial~~~g~pv~GvV~~P~~~~~~~A~  128 (267)
T PRK10757         82 VIDPLDGTT----NFIKRLPHFAVSIAVRIKGRTEVAVVYDPMRNELFTAT  128 (267)
T ss_pred             EEeCccCch----HHHhCCCcEEEEEEEEECCEEEEEEEEcCCCCCEEEEE
Confidence            389999975    89999888887666665566555566677778888764


No 56 
>TIGR02934 nifT_nitrog probable nitrogen fixation protein FixT. This largely uncharacterized protein family is assigned a role in nitrogen fixation by two criteria. First, its gene occurs, generally, among genes essential for expression of active nitrogenase. Second, its phylogenetic profile closely matches that of nitrogen-fixing bacteria. However, mutational studies in Klebsiella pneumoniae failed to demonstrate any phenotype for deletion or overexpression of the protein.
Probab=28.86  E-value=57  Score=24.50  Aligned_cols=18  Identities=28%  Similarity=0.571  Sum_probs=14.7

Q ss_pred             eecCCCcEEEEecceeeE
Q psy2082          10 KCSPNGKLTLYMGKRDFV   27 (314)
Q Consensus        10 k~s~n~~~t~yl~~rd~~   27 (314)
                      ..+.+|.|++|+.|+|.=
T Consensus         6 R~~~~g~l~~YvpKKDLE   23 (67)
T TIGR02934         6 RRNRAGELSAYVPKKDLE   23 (67)
T ss_pred             EeCCCCCEEEEEECCcch
Confidence            456679999999999953


No 57 
>PF07919 Gryzun:  Gryzun, putative trafficking through Golgi;  InterPro: IPR012880 The proteins featured in this family are all hypothetical eukaryotic proteins of unknown function. The region in question is approximately 150 residues long. 
Probab=28.73  E-value=1.7e+02  Score=29.78  Aligned_cols=39  Identities=21%  Similarity=0.272  Sum_probs=32.1

Q ss_pred             eeCCcEEEEEEE--CCccccCCCeEEEEEEEEccccceeEE
Q psy2082         191 LSPGDLELEVTL--DKQLYAHNEKIAVSICIRNNSNKVVKK  229 (314)
Q Consensus       191 ~s~G~l~l~asl--dK~~Y~pGE~I~v~v~I~N~Ssk~Vk~  229 (314)
                      ....|-+|++++  .+.-|+-||.+.+.+.|.|+.......
T Consensus       168 I~p~pp~v~I~~~~~~~~~l~gE~~~i~i~I~n~e~~~~~~  208 (554)
T PF07919_consen  168 ILPRPPKVSIKLPNHKPPALTGEFYPIPITISNNEDEEASG  208 (554)
T ss_pred             EECCCCCeEEEeCCCCCCeEcCCEEEEEEEEEcCCCcccee
Confidence            456677777777  677899999999999999999877654


No 58 
>TIGR01331 bisphos_cysQ 3'(2'),5'-bisphosphate nucleotidase, bacterial. Sulfate is incorporated into 3-phosphoadenylylsulfate, PAPS, for utilization in pathways such as methionine biosynthesis. Transfer of sulfate from PAPS to an acceptor leaves adenosine 3'-5'-bisphosphate, APS. This model describes a form found in bacteria of the enzyme 3'(2'),5'-bisphosphate nucleotidase, which removes the 3'-phosphate from APS to regenerate AMP and help drive the cycle.
Probab=27.88  E-value=35  Score=31.31  Aligned_cols=47  Identities=19%  Similarity=0.253  Sum_probs=37.0

Q ss_pred             CceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082          32 GVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS   82 (314)
Q Consensus        32 ~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~   82 (314)
                      .||||||.-    .+++|.-.|....+-++-|+--+=|+-..+..++|.+.
T Consensus        80 vIDPIDGT~----nF~~G~p~~~vsIal~~~g~pv~gvI~~P~~~~~~~A~  126 (249)
T TIGR01331        80 LVDPLDGTK----EFINRNGDFTVNIALVEHGVPVLGVVYAPATGVTYFAT  126 (249)
T ss_pred             EEcCCcChH----HHhcCCCcEEEEEEEEECCEEEEEEEEecCCCCEEEEE
Confidence            489999965    89999888988777777677666677778888888764


No 59 
>PF14796 AP3B1_C:  Clathrin-adaptor complex-3 beta-1 subunit C-terminal
Probab=27.86  E-value=1.6e+02  Score=25.42  Aligned_cols=44  Identities=18%  Similarity=0.267  Sum_probs=36.6

Q ss_pred             eeCCcEEEEEEECCccc-cCCCeEEEEEEEEccccceeEEEEEEE
Q psy2082         191 LSPGDLELEVTLDKQLY-AHNEKIAVSICIRNNSNKVVKKIKAMV  234 (314)
Q Consensus       191 ~s~G~l~l~asldK~~Y-~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L  234 (314)
                      +..+=|.++-...|+-+ +..--+.|++.++|+|...+++|++.=
T Consensus        64 v~G~GL~v~Y~F~RqP~~~s~~mvsIql~ftN~s~~~i~~I~i~~  108 (145)
T PF14796_consen   64 VNGKGLSVEYRFSRQPSLYSPSMVSIQLTFTNNSDEPIKNIHIGE  108 (145)
T ss_pred             cCCCceeEEEEEccCCcCCCCCcEEEEEEEEecCCCeecceEECC
Confidence            45667888888899877 556788999999999999999988654


No 60 
>PLN02911 inositol-phosphate phosphatase
Probab=27.21  E-value=40  Score=31.98  Aligned_cols=48  Identities=23%  Similarity=0.435  Sum_probs=35.9

Q ss_pred             CCceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082          31 TGVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS   82 (314)
Q Consensus        31 ~~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~   82 (314)
                      =.||||||..    .+++|.-.|....+-++=|+-.+-|+-..+.+++|.+.
T Consensus       112 WiIDPIDGT~----NFv~G~p~favsIal~~~g~pv~GvV~~P~~~e~y~A~  159 (296)
T PLN02911        112 WVLDPIDGTK----SFITGKPLFGTLIALLYKGKPVLGIIDQPVLKERWVGV  159 (296)
T ss_pred             EEEeCCcChH----HHhcCCCceEEEEEEEECCEEEEEEEecCCCCCEEEEE
Confidence            3469999975    89999998887666665565555677777888888865


No 61 
>cd01515 Arch_FBPase_1 Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family (FBPase class IV). These are Mg++ dependent phosphatases. Members in this family may have both fructose-1,6-bisphosphatase and inositol-monophosphatase activity. In hyperthermophilic archaea, inositol monophosphatase is thought to play a role in the biosynthesis of di-myo-inositol-1,1'-phosphate, an osmolyte unique to hyperthermophiles.
Probab=27.19  E-value=40  Score=31.08  Aligned_cols=48  Identities=19%  Similarity=0.261  Sum_probs=36.0

Q ss_pred             CCceeeeeEEEEecceeeceeEEEEEEEeeeccc--ccccccCccccceEEEEE
Q psy2082          31 TGVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGR--EEDEILGLNFQKELYLAS   82 (314)
Q Consensus        31 ~~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~--~d~dv~gl~f~k~l~~~~   82 (314)
                      =.||||||.-    .+++|.-.|+...+-.+.|+  -.+-|+...+..++|.+.
T Consensus        79 WvIDPIDGT~----nfv~g~p~~~isial~~~g~~~p~~gvv~~P~~~~~~~a~  128 (257)
T cd01515          79 VVLDPLDGTY----NAINGIPFYSVSVAVFKIDKSDPYYGYVYNLATGDLYYAI  128 (257)
T ss_pred             EEEeCcCChh----HHhcCCCceEEEEEEEeCCCCCeEEEEEEecCCCCeEEEE
Confidence            3478888854    78999999998878887787  555566667777787754


No 62 
>cd01517 PAP_phosphatase PAP-phosphatase_like domains. PAP-phosphatase is a member of the inositol monophosphatase family, and catalyses the hydrolysis of 3'-phosphoadenosine-5'-phosphate (PAP) to AMP. In Saccharomyces cerevisiae, HAL2 (MET22) is involved in methionine biosynthesis and provides increased salt tolerance when over-expressed. Bacterial members of this domain family may differ in their substrate specificity and dephosphorylate different targets, as the substrate binding site does not appear to be conserved in that sub-set.
Probab=26.10  E-value=55  Score=30.41  Aligned_cols=47  Identities=19%  Similarity=0.328  Sum_probs=32.9

Q ss_pred             CCceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccc-------cceEEEEE
Q psy2082          31 TGVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNF-------QKELYLAS   82 (314)
Q Consensus        31 ~~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f-------~k~l~~~~   82 (314)
                      =.||||||.-    .+++|. -|....+-.+-|+--+-|+-..+       ..++|.+.
T Consensus        75 WiIDPIDGT~----nfv~g~-~~~vsIal~~~g~pv~GvI~~P~~~~~~~~~~~~~~A~  128 (274)
T cd01517          75 WVLDPIDGTK----GFLRGD-QFAVALALIEDGEVVLGVIGCPNLPLDDGGGGDLFSAV  128 (274)
T ss_pred             EEEcCCcCch----hhhcCC-ceEEEEEEEECCEEEEEEEeCCCccccCCCCCcEEEEE
Confidence            3489999965    899999 88866666654554555666666       77777754


No 63 
>cd08546 cohesin_like Cohesin domain, interaction parter of dockerin. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesion modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. Cohesin modules are phylogenetically distributed into three groups:  type I cohesin-dockerin interactions mediate assembly of a range of dockerin-borne enzymes to the complex, while type-II interactions mediate attachment of the cellulosome complex to the bacterial cell wall. Recently discovered type-III cohesins, such as found in the anchoring scaffoldin ScaE, appears to contribute to increased stability of the elaborate cellulosome complex. While the p
Probab=25.93  E-value=2.1e+02  Score=22.82  Aligned_cols=35  Identities=11%  Similarity=0.172  Sum_probs=26.7

Q ss_pred             EEEEECCccccCCCeEEEEEEEEccccceeEEEEEEEE
Q psy2082         198 LEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMVQ  235 (314)
Q Consensus       198 l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L~  235 (314)
                      +.+..+... .+||++.+.+.++|.+  .+....+.|.
T Consensus         3 ~~~~~~~~~-~~G~~~~v~v~~~~~~--~~~~~~~~l~   37 (135)
T cd08546           3 VSLGAPSTV-KVGETVTVTVKVNNVP--NVAAADFTLS   37 (135)
T ss_pred             EEEeccccc-cCCCEEEEEEEEecCC--CeEEEEEEEE
Confidence            444555555 9999999999999998  7777666664


No 64 
>TIGR02067 his_9_proposed histidinol-phosphate phosphatase HisN, inositol monophosphatase family. This subfamily belongs to the inositol monophosphatase family (pfam00459). The members of this family consist of no more than one per species and are found only in species in which histidine is synthesized de novo but no histidinol phosphatase can be found in either of the two described families (TIGR01261, TIGR01856). In at least one species, the member of this family is found near known histidine biosynthesis genes.
Probab=24.86  E-value=44  Score=30.62  Aligned_cols=47  Identities=21%  Similarity=0.322  Sum_probs=33.6

Q ss_pred             CceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082          32 GVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS   82 (314)
Q Consensus        32 ~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~   82 (314)
                      .||||||.-    .+++|.--|+...+-.+-|+-..-|+-..+..++|.+.
T Consensus        78 iiDPIDGT~----nF~~g~p~~~vsial~~~g~p~~gvv~~P~~~~~~~A~  124 (251)
T TIGR02067        78 VLDPIDGTK----SFIRGVPVWGTLIALVEGGMPVLGVIFQPATGERWWAA  124 (251)
T ss_pred             EEecCcChh----HHhcCCCceEEEEEEEECCEEEEEEEEEcCCCCEEEEe
Confidence            379999965    78888888886556565565555566667777777754


No 65 
>PLN02553 inositol-phosphate phosphatase
Probab=24.43  E-value=52  Score=30.48  Aligned_cols=47  Identities=15%  Similarity=0.128  Sum_probs=35.8

Q ss_pred             CceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082          32 GVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS   82 (314)
Q Consensus        32 ~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~   82 (314)
                      .||||||..    .+++|.-.|....+-++-|+-..=|+-..+..++|.+.
T Consensus        89 iIDPIDGT~----NF~~g~p~~avsIal~~~g~pv~GvV~~P~~~e~~~A~  135 (270)
T PLN02553         89 IVDPLDGTT----NFVHGFPFVCVSIGLTIGKVPVVGVVYNPILDELFTAV  135 (270)
T ss_pred             EEecccchh----hHhhcCCceEEEEEEEECCEEEEEEEecCCCCCeEEEE
Confidence            689999965    88899888887666676666556666677788888865


No 66 
>PF12389 Peptidase_M73:  Camelysin metallo-endopeptidase;  InterPro: IPR022121 Camelysin is a novel surface metallopeptidase from Bacillus cereus []. Camelysin prefers cleavage sites in front of aliphatic and hydrophilic amino acid residues (-OH, -SO3H, amido group), and requires zinc for activity [, ].
Probab=24.42  E-value=1.6e+02  Score=26.78  Aligned_cols=30  Identities=7%  Similarity=0.139  Sum_probs=25.6

Q ss_pred             ccccCCCeEEEEEEEEccccceeEEEEEEE
Q psy2082         205 QLYAHNEKIAVSICIRNNSNKVVKKIKAMV  234 (314)
Q Consensus       205 ~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L  234 (314)
                      +-..||+++.-.+.|.|.-+-+++.|.+..
T Consensus        59 ~nlkPGD~v~k~f~l~N~Gtldi~~v~l~~   88 (199)
T PF12389_consen   59 SNLKPGDTVEKEFTLKNSGTLDIKDVLLKT   88 (199)
T ss_pred             ccCCCCCeEEEEEEEEeCCeeeeeeEEEEE
Confidence            467899999999999999999988776654


No 67 
>COG2309 AmpS Leucyl aminopeptidase (aminopeptidase T) [Amino acid transport and metabolism]
Probab=23.94  E-value=1.5e+02  Score=29.59  Aligned_cols=54  Identities=24%  Similarity=0.375  Sum_probs=40.1

Q ss_pred             eecCCC-cEEEEecceeeEecCC-------------------CceeeeeEEEEe-cceeeceeEEEEEEEeeeccc
Q psy2082          10 KCSPNG-KLTLYMGKRDFVDYVT-------------------GVEPIDGVVVFE-EDYIQDRKVFGQVVCSFRYGR   64 (314)
Q Consensus        10 k~s~n~-~~t~yl~~rd~~d~~~-------------------~vd~v~GvV~vd-~~~lk~rkv~~~~~~~fryG~   64 (314)
                      -++|.| .||+.|+.|--...-.                   +=|.|+|+|+.| |-+-+|.-+ =.+..+|+=|+
T Consensus       193 ~~tp~gTDLt~~l~~~~~~~~~g~~~~~~~~NmPtgEVFtaP~~~sv~G~V~~~~Pl~~~G~li-e~i~l~FedG~  267 (385)
T COG2309         193 ITTPGGTDLTFGLAGRHAWNDAGSVNKGGNANMPTGEVFTAPHEDSVNGVVVFDKPLSYQGTLI-EGIRLTFEDGK  267 (385)
T ss_pred             EeCCCCceeEEEcCCceeeccCCccCCCCCCCCCCcceEeccccCccceEEEECcCCCcCCEee-eceEEEEECCE
Confidence            478988 9999999988775544                   346899999999 777677663 34777775444


No 68 
>cd01643 Bacterial_IMPase_like_2 Bacterial family of Mg++ dependent phosphatases, related to inositol monophosphatases. These enzymes may dephosphorylate inositol monophosphate or similar substrates.
Probab=23.74  E-value=52  Score=30.09  Aligned_cols=47  Identities=15%  Similarity=0.337  Sum_probs=34.4

Q ss_pred             CceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082          32 GVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS   82 (314)
Q Consensus        32 ~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~   82 (314)
                      .||||||.-    .+++|.-.|....+-++=|+--.=++-..+.+++|.+.
T Consensus        75 iIDPIDGT~----nF~~g~p~~~vsial~~~g~pv~GvV~~P~~~~~~~A~  121 (242)
T cd01643          75 VIDPIDGTT----NFARGIPIWAISIALLYRGEPVFGVIALPALNQTFVAF  121 (242)
T ss_pred             EEeCccChH----HHhcCCCceEEEEEEEECCEEEEEEEecCCCCCEEEEE
Confidence            389999965    88899988887766665565555566667777887764


No 69 
>PRK03298 hypothetical protein; Provisional
Probab=23.59  E-value=5.9e+02  Score=23.56  Aligned_cols=46  Identities=22%  Similarity=0.266  Sum_probs=27.0

Q ss_pred             CeEEEEEEEEEEccCCccC--ccccceEEEeeee-ccCCCCCCCCCCeE
Q psy2082         138 PCGVTYYVKLFVGENETDR--SHRRSTVALGIRK-IQYAPSKNGRQPCT  183 (314)
Q Consensus       138 ~g~V~Y~Vka~i~~~~~dr--~~K~~~v~l~irk-l~~~p~~~~~~p~~  183 (314)
                      .|.|+|+=+|.-.-+..+|  ..|.|-..++-.+ -.+-|.+|.++|+.
T Consensus         7 ~C~V~Y~GRa~s~L~~~~RLiiiK~DGsvlVH~d~~~~kPlNW~pp~~~   55 (224)
T PRK03298          7 RCSVDYVGRLTAHLPSARRLLLVKADGSVSVHADDRAYKPLNWMSPPCT   55 (224)
T ss_pred             EEEEEEccceeccCCCCceEEEEccCCeEEEecCCCCCCCcccCCCCeE
Confidence            4888888777633233333  3455533344444 56778888877754


No 70 
>PRK05089 cytochrome C oxidase assembly protein; Provisional
Probab=22.54  E-value=1.2e+02  Score=27.21  Aligned_cols=25  Identities=16%  Similarity=-0.057  Sum_probs=22.4

Q ss_pred             ccCCCeEEEEEEEEccccceeEEEE
Q psy2082         207 YAHNEKIAVSICIRNNSNKVVKKIK  231 (314)
Q Consensus       207 Y~pGE~I~v~v~I~N~Ssk~Vk~Ik  231 (314)
                      -.|||...+.-.+.|.|++.|....
T Consensus        90 V~pGE~~~~~y~a~N~sd~~i~g~A  114 (188)
T PRK05089         90 VHPGELNLVFYEAENLSDRPIVGQA  114 (188)
T ss_pred             EcCCCeEEEEEEEECCCCCcEEEEE
Confidence            3689999999999999999998754


No 71 
>TIGR03780 Bac_Flav_CT_N Bacteroides conjugative transposon TraN protein. Members of this family are the TraN protein encoded by transfer region genes of conjugative transposons of Bacteroides. The family is related to conjugative transfer proteins VirB9 and TrbG of Agrobacterium Ti plasmids.
Probab=21.92  E-value=1.2e+02  Score=29.01  Aligned_cols=37  Identities=19%  Similarity=0.387  Sum_probs=29.4

Q ss_pred             EEEEECCccccCCCeEEEEEEEEccccce--eEEEEEEEE
Q psy2082         198 LEVTLDKQLYAHNEKIAVSICIRNNSNKV--VKKIKAMVQ  235 (314)
Q Consensus       198 l~asldK~~Y~pGE~I~v~v~I~N~Ssk~--Vk~Ikv~L~  235 (314)
                      +++.| +++|.||+-+-++.++.|+|+-.  |.-|++++.
T Consensus       169 i~~~l-~~Iy~~~d~lyf~~~l~N~Sni~ydID~irfki~  207 (285)
T TIGR03780       169 IQFLL-NGIYTHNDLLYFHTSLENKTNIPFDIDFIRFKIV  207 (285)
T ss_pred             cEEEE-EeEEEECCEEEEEEEEEcCCCCCcEeEEEEEEEE
Confidence            44444 47999999999999999999854  887777664


No 72 
>KOG2625|consensus
Probab=21.72  E-value=53  Score=30.60  Aligned_cols=28  Identities=29%  Similarity=0.290  Sum_probs=24.4

Q ss_pred             ccCCCeEEEEEEEEccccceeEEEEEEE
Q psy2082         207 YAHNEKIAVSICIRNNSNKVVKKIKAMV  234 (314)
Q Consensus       207 Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L  234 (314)
                      .+-||+...-++|.|.|+++|+.|.++-
T Consensus        11 iflgetfs~yinv~nds~k~v~~i~lk~   38 (348)
T KOG2625|consen   11 IFLGETFSFYINVHNDSEKTVKDILLKA   38 (348)
T ss_pred             eeeccceEEEEEEecchhhhhhhheeee
Confidence            3569999999999999999999887654


No 73 
>PRK10931 adenosine-3'(2'),5'-bisphosphate nucleotidase; Provisional
Probab=21.30  E-value=52  Score=30.08  Aligned_cols=47  Identities=19%  Similarity=0.330  Sum_probs=34.8

Q ss_pred             CceeeeeEEEEecceeeceeEEEEEEEeeecccccccccCccccceEEEEE
Q psy2082          32 GVEPIDGVVVFEEDYIQDRKVFGQVVCSFRYGREEDEILGLNFQKELYLAS   82 (314)
Q Consensus        32 ~vd~v~GvV~vd~~~lk~rkv~~~~~~~fryG~~d~dv~gl~f~k~l~~~~   82 (314)
                      .||||||..    .+++|.--|....+-++-|+-.+=|+-..+..++|.+.
T Consensus        81 iIDPIDGT~----nF~~g~p~~~vsIal~~~g~p~~GvV~~P~~~~~y~A~  127 (246)
T PRK10931         81 LVDPLDGTK----EFIKRNGEFTVNIALIEQGKPVLGVVYAPVMNVMYSAA  127 (246)
T ss_pred             EEecCcChH----HHHcCCCCEEEEEEEEECCEEEEEEEeecCCCCEEEEE
Confidence            489999965    88899888887666666566555566667777887764


No 74 
>PF04442 CtaG_Cox11:  Cytochrome c oxidase assembly protein CtaG/Cox11;  InterPro: IPR007533 Cytochrome c oxidase assembly protein is essential for the assembly of functional cytochrome oxidase protein. In eukaryotes it is an integral protein of the mitochondrial inner membrane. Cox11 is essential for the insertion of Cu(I) ions to form the CuB site. This is essential for the stability of other structures in subunit I, for example haems a and a3, and the magnesium/manganese centre. Cox11 is probably only required in sub-stoichiometric amounts relative to the structural units []. The C-terminal region of the protein is known to form a dimer. Each monomer coordinates one Cu(I) ion via three conserved cysteine residues (111, 208 and 210) in Saccharomyces cerevisiae (P19516 from SWISSPROT). Met 224 is also thought to play a role in copper transfer or stabilising the copper site [].; GO: 0005507 copper ion binding; PDB: 1SO9_A 1SP0_A.
Probab=20.70  E-value=1e+02  Score=26.83  Aligned_cols=25  Identities=20%  Similarity=0.049  Sum_probs=16.7

Q ss_pred             ccCCCeEEEEEEEEccccceeEEEE
Q psy2082         207 YAHNEKIAVSICIRNNSNKVVKKIK  231 (314)
Q Consensus       207 Y~pGE~I~v~v~I~N~Ssk~Vk~Ik  231 (314)
                      -.|||...+.-.+.|.|++.|....
T Consensus        63 V~pGe~~~~~y~a~N~s~~~i~g~A   87 (152)
T PF04442_consen   63 VHPGETALVFYEATNPSDKPITGQA   87 (152)
T ss_dssp             EETT--EEEEEEEEE-SSS-EE---
T ss_pred             eCCCCEEEEEEEEECCCCCcEEEEE
Confidence            3799999999999999999998754


No 75 
>cd08908 START_STARD12-like C-terminal lipid-binding START domain of mammalian STARD12 and related proteins, which also have an N-terminal Rho GTPase-activating protein (RhoGAP) domain. This subgroup includes the steroidogenic acute regulatory protein (StAR)-related lipid transfer (START) domains of STARD12 (also known as DLC-1, Arhgap7, and p122-RhoGAP) and related proteins. It belongs to the START domain family, and in turn to the SRPBCC (START/RHO_alpha_C/PITP/Bet_v1/CoxG/CalC) domain superfamily of proteins that bind hydrophobic ligands. SRPBCC domains have a deep hydrophobic ligand-binding pocket. Proteins belonging to this subgroup also have an N-terminal SAM (sterile alpha motif) domain and a RhoGAP domain, and have a SAM-RhoGAP-START domain organization. The precise function of the START domain in this subgroup is unclear.
Probab=20.11  E-value=2.5e+02  Score=25.37  Aligned_cols=36  Identities=19%  Similarity=0.222  Sum_probs=26.5

Q ss_pred             cceEEEeecCCCcEEEE-------ecceeeEec-CCCceeeeeE
Q psy2082           4 NFKVFKKCSPNGKLTLY-------MGKRDFVDY-VTGVEPIDGV   39 (314)
Q Consensus         4 ~~~vfkk~s~n~~~t~y-------l~~rd~~d~-~~~vd~v~Gv   39 (314)
                      ..+|.++.++|..|.=|       +..||||-+ ...-|.-+|-
T Consensus        84 e~~vIe~ld~~~~I~Yy~~~~PwP~~~RD~V~~Rs~~~~~~~g~  127 (204)
T cd08908          84 DSKVIEILDSQTEIYQYVQNSMAPHPARDYVVLRTWRTNLPKGA  127 (204)
T ss_pred             heEeeEecCCCceEEEEEccCCCCCCCcEEEEEEEEEEeCCCCe
Confidence            45788999999988777       568999988 4333444563


No 76 
>cd08759 Type_III_cohesin_like Cohesin domain, interaction partner of dockerin. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. Two specific calcium-dependent interactions between cohesin and dockerin appear to be essential for cellulosome assembly, type I and type II. This subfamily represents type III cohesins and closely related domains.
Probab=20.06  E-value=2.2e+02  Score=24.93  Aligned_cols=35  Identities=17%  Similarity=0.196  Sum_probs=26.7

Q ss_pred             EEEEECCccccCCCeEEEEEEEEccccceeEEEEEEE
Q psy2082         198 LEVTLDKQLYAHNEKIAVSICIRNNSNKVVKKIKAMV  234 (314)
Q Consensus       198 l~asldK~~Y~pGE~I~v~v~I~N~Ssk~Vk~Ikv~L  234 (314)
                      |.+.-+|.-|-+||+|.|.+...+..  .|....+.|
T Consensus         3 l~l~~~k~~~~aGetvti~vkg~~l~--~VnA~~~~l   37 (160)
T cd08759           3 LSLVPSKTTVKAGETITIDLYGQGLK--NVNALGAHI   37 (160)
T ss_pred             eEEeccccccCCCCEEEEEEEecccc--eeeeeeeEE
Confidence            55667788899999999999999886  555544443


Done!