Query         psy192
Match_columns 898
No_of_seqs    283 out of 847
Neff          7.8 
Searched_HMMs 46136
Date          Fri Aug 16 19:39:51 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy192.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/192hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG0518|consensus              100.0  6E-106  1E-110  925.2  77.4  851    2-898   116-1075(1113)
  2 KOG0518|consensus              100.0 1.2E-98  3E-103  865.1  73.7  809    3-896    17-939 (1113)
  3 smart00557 IG_FLMN Filamin-typ  99.8 1.2E-19 2.7E-24  163.4  12.4   90  804-898     1-90  (93)
  4 smart00557 IG_FLMN Filamin-typ  99.8 2.2E-19 4.9E-24  161.7  13.2   90  208-297     1-92  (93)
  5 PF00630 Filamin:  Filamin/ABP2  99.7 5.5E-17 1.2E-21  148.7  12.7   93  802-895     2-101 (101)
  6 PF00630 Filamin:  Filamin/ABP2  99.7 3.1E-16 6.7E-21  143.7  10.9   87  206-292     2-101 (101)
  7 PRK14081 triple tyrosine motif  99.4 7.8E-07 1.7E-11  104.4  57.1  424   20-472    55-555 (667)
  8 PRK14081 triple tyrosine motif  99.3 3.3E-06 7.2E-11   99.2  59.7  374   71-472    55-459 (667)
  9 TIGR00864 PCC polycystin catio  95.3      21 0.00045   49.7  32.7  293   70-407  1830-2141(2740)
 10 PF13584 BatD:  Oxygen toleranc  94.9     3.6 7.8E-05   48.6  22.7  203   69-277    74-354 (484)
 11 KOG1948|consensus               94.8      13 0.00029   45.3  37.9  124  158-299   256-385 (1165)
 12 TIGR03503 conserved hypothetic  94.4    0.54 1.2E-05   52.6  13.0  118   66-201   169-301 (374)
 13 PF13584 BatD:  Oxygen toleranc  92.6     8.6 0.00019   45.5  20.3  205  167-373    74-355 (484)
 14 TIGR00864 PCC polycystin catio  92.4      65  0.0014   45.2  43.1  263  168-470  1830-2110(2740)
 15 TIGR03503 conserved hypothetic  91.6     2.4 5.2E-05   47.6  12.9  116  258-390   168-300 (374)
 16 KOG1428|consensus               90.5       1 2.2E-05   56.6   9.1   43  349-392  2099-2141(3738)
 17 TIGR00868 hCaCC calcium-activa  90.2      10 0.00022   47.5  17.6  151  222-380   512-684 (863)
 18 PF13115 YtkA:  YtkA-like        88.2     4.1 8.9E-05   35.7   9.3   71   15-86      3-85  (86)
 19 PF09134 Invasin_D3:  Invasin,   87.6     5.9 0.00013   35.7   9.7   87    7-103    11-97  (99)
 20 PF01835 A2M_N:  MG2 domain;  I  87.1       2 4.4E-05   38.7   6.8   64  314-377    10-86  (99)
 21 PF10670 DUF4198:  Domain of un  87.1      35 0.00076   35.1  17.2   60  222-283   148-213 (215)
 22 PF01835 A2M_N:  MG2 domain;  I  86.9     3.4 7.4E-05   37.2   8.3   64  124-187    11-86  (99)
 23 COG2373 Large extracellular al  86.6 1.4E+02   0.003   40.6  30.3   85   47-136   430-521 (1621)
 24 PF09315 DUF1973:  Domain of un  85.8      20 0.00043   36.4  13.8  129  222-354    26-165 (179)
 25 COG1470 Predicted membrane pro  83.1      96  0.0021   35.8  37.9   75  263-344    49-123 (513)
 26 KOG1428|consensus               82.3     3.5 7.6E-05   52.2   7.5   48   59-107  2097-2144(3738)
 27 PF10670 DUF4198:  Domain of un  79.5      13 0.00028   38.4  10.2   67   19-90    134-213 (215)
 28 PF13753 SWM_repeat:  Putative   79.2 1.1E+02  0.0024   33.9  24.9  142  227-376    63-223 (317)
 29 PF06312 Neurexophilin:  Neurex  73.5     9.7 0.00021   39.7   6.9   46  331-376    68-115 (219)
 30 cd02858 Esterase_N_term Estera  73.2      20 0.00043   31.4   8.0   59  226-286     7-65  (85)
 31 PF02369 Big_1:  Bacterial Ig-l  71.8      21 0.00045   32.4   8.0   62  319-380    23-92  (100)
 32 PF06312 Neurexophilin:  Neurex  71.3      11 0.00025   39.2   6.8   45  142-186    70-115 (219)
 33 smart00634 BID_1 Bacterial Ig-  70.1      22 0.00048   31.6   7.7   52   38-90     30-84  (92)
 34 PF02369 Big_1:  Bacterial Ig-l  69.3      26 0.00056   31.8   8.0   32  159-190    61-92  (100)
 35 TIGR00868 hCaCC calcium-activa  69.1 3.2E+02   0.007   34.7  24.6   47  239-285   627-684 (863)
 36 PF13860 FlgD_ig:  FlgD Ig-like  68.2      33 0.00072   29.7   8.2   56  320-375    13-76  (81)
 37 PF07495 Y_Y_Y:  Y_Y_Y domain;   64.6      13 0.00028   30.5   4.7   37  614-653    13-49  (66)
 38 PF11896 DUF3416:  Domain of un  64.1      27 0.00058   35.7   7.8   62  124-185    23-87  (187)
 39 PF13473 Cupredoxin_1:  Cupredo  63.4      13 0.00029   33.8   5.0   57  222-283    40-96  (104)
 40 PF13115 YtkA:  YtkA-like        63.0      71  0.0015   27.7   9.4   27  348-374    59-85  (86)
 41 COG1470 Predicted membrane pro  62.7   3E+02  0.0065   32.0  36.2   73   13-104     7-79  (513)
 42 PF13473 Cupredoxin_1:  Cupredo  60.7      25 0.00054   31.9   6.3   62  817-891    39-100 (104)
 43 PF13860 FlgD_ig:  FlgD Ig-like  60.3      62  0.0013   28.0   8.4   44  129-173    13-57  (81)
 44 cd02858 Esterase_N_term Estera  57.2      93   0.002   27.2   9.1   60  414-475     6-66  (85)
 45 KOG2996|consensus               55.7     7.2 0.00016   45.3   2.1   35  620-654   696-730 (865)
 46 PF13753 SWM_repeat:  Putative   55.3 1.3E+02  0.0028   33.4  12.1  142  131-281    64-223 (317)
 47 TIGR03096 nitroso_cyanin nitro  53.7      97  0.0021   29.9   9.0   32  258-289    97-130 (135)
 48 PF13620 CarboxypepD_reg:  Carb  51.9      64  0.0014   27.4   7.2   45  143-190    16-60  (82)
 49 smart00634 BID_1 Bacterial Ig-  51.9      67  0.0014   28.5   7.4   30  159-188    55-84  (92)
 50 PF07705 CARDB:  CARDB;  InterP  51.0      69  0.0015   28.2   7.5   65  124-188    15-85  (101)
 51 PRK13211 N-acetylglucosamine-b  49.5 2.4E+02  0.0052   33.2  13.3  114   73-201   287-405 (478)
 52 PF09315 DUF1973:  Domain of un  47.9 3.1E+02  0.0068   27.8  14.7   72  125-201    26-99  (179)
 53 KOG2996|consensus               46.1      15 0.00032   42.9   2.6   32  858-889   701-734 (865)
 54 PF04151 PPC:  Bacterial pre-pe  44.9      78  0.0017   26.3   6.4   40  429-468    25-69  (70)
 55 cd08548 Type_I_cohesin_like Ty  44.7 2.9E+02  0.0063   26.5  11.1   91  370-469    16-114 (135)
 56 PF09134 Invasin_D3:  Invasin,   43.5      72  0.0016   28.9   6.1   69  320-390    22-96  (99)
 57 PF00017 SH2:  SH2 domain;  Int  43.0      39 0.00085   28.6   4.3   26  628-653    19-44  (77)
 58 PF13620 CarboxypepD_reg:  Carb  42.2      64  0.0014   27.4   5.7   50   38-92     10-60  (82)
 59 COG2372 CopC Uncharacterized p  41.4      72  0.0016   30.3   6.0   70  319-388    45-124 (127)
 60 PF04151 PPC:  Bacterial pre-pe  40.8 1.6E+02  0.0034   24.4   7.6   53  323-375    15-69  (70)
 61 PF07495 Y_Y_Y:  Y_Y_Y domain;   40.4      32 0.00069   28.1   3.2   19  452-470    30-48  (66)
 62 KOG1693|consensus               40.2 1.3E+02  0.0028   30.8   7.9   78   95-184    21-98  (209)
 63 PF00017 SH2:  SH2 domain;  Int  40.1      39 0.00084   28.6   3.8   27  861-887    19-45  (77)
 64 PF07705 CARDB:  CARDB;  InterP  38.6      75  0.0016   28.0   5.7   67  579-653    16-85  (101)
 65 PF07691 PA14:  PA14 domain;  I  38.4 3.5E+02  0.0075   25.5  12.1   80   18-106    49-136 (145)
 66 PF05751 FixH:  FixH;  InterPro  37.4      84  0.0018   30.3   6.3   50  834-886    85-134 (146)
 67 TIGR03096 nitroso_cyanin nitro  37.4      34 0.00074   32.9   3.3   30  863-892    99-130 (135)
 68 smart00252 SH2 Src homology 2   36.5      40 0.00086   29.1   3.4   29  625-653    17-45  (84)
 69 smart00252 SH2 Src homology 2   36.0      33 0.00072   29.6   2.8   28  860-887    19-46  (84)
 70 PF04234 CopC:  CopC domain;  I  35.9      32 0.00069   31.0   2.8   55  144-200    35-96  (97)
 71 PRK06655 flgD flagellar basal   34.4 1.6E+02  0.0035   31.1   8.0   27  320-346   114-140 (225)
 72 PF09430 DUF2012:  Protein of u  34.0 1.6E+02  0.0035   27.7   7.3   55   48-106    10-64  (123)
 73 PF03422 CBM_6:  Carbohydrate b  33.4   2E+02  0.0044   26.5   8.0   64   25-91     41-111 (125)
 74 PF01483 P_proprotein:  Proprot  31.9   3E+02  0.0064   23.9   8.3   35  227-261     4-38  (87)
 75 PF01483 P_proprotein:  Proprot  31.6   2E+02  0.0043   25.0   7.1   61  130-202     4-70  (87)
 76 PRK06655 flgD flagellar basal   31.5 1.9E+02  0.0041   30.5   8.1   14  143-156   127-140 (225)
 77 COG2373 Large extracellular al  31.5 1.6E+03   0.034   31.1  41.6  117  210-331   395-525 (1621)
 78 KOG1948|consensus               31.3 1.2E+03   0.026   29.6  41.5  122  348-486   256-384 (1165)
 79 PRK10301 hypothetical protein;  30.1 1.5E+02  0.0032   28.1   6.3   68   30-99     45-122 (124)
 80 PRK12633 flgD flagellar basal   30.0 1.8E+02  0.0039   30.8   7.6   45  320-364   117-162 (230)
 81 PF09430 DUF2012:  Protein of u  29.9 2.1E+02  0.0045   27.0   7.3   53  241-297    10-62  (123)
 82 PF05751 FixH:  FixH;  InterPro  29.8   2E+02  0.0042   27.7   7.4   56   31-87     69-131 (146)
 83 PF07703 A2M_N_2:  Alpha-2-macr  29.6 4.7E+02    0.01   24.4  10.2  105  313-422     8-118 (136)
 84 KOG1692|consensus               29.5 1.9E+02  0.0042   29.4   7.1   56  125-183    38-93  (201)
 85 PRK10301 hypothetical protein;  28.8 1.4E+02  0.0031   28.2   6.0   55  145-200    62-123 (124)
 86 PF04234 CopC:  CopC domain;  I  28.5      46   0.001   29.9   2.5   32  720-752    60-95  (97)
 87 PF11896 DUF3416:  Domain of un  28.3 2.3E+02  0.0049   29.0   7.7   60  222-281    24-88  (187)
 88 cd00173 SH2 Src homology 2 dom  27.0      89  0.0019   27.3   4.1   29  625-653    16-44  (94)
 89 cd00173 SH2 Src homology 2 dom  27.0      73  0.0016   27.8   3.6   27  861-887    19-45  (94)
 90 KOG1690|consensus               26.1 1.8E+02  0.0038   29.9   6.2   79    4-89     27-109 (215)
 91 PRK09619 flgD flagellar basal   25.5   2E+02  0.0044   30.1   7.0   45  320-364   111-155 (218)
 92 PRK12812 flgD flagellar basal   25.4 2.2E+02  0.0047   30.7   7.3   45  320-364   129-174 (259)
 93 KOG1692|consensus               25.2 1.6E+02  0.0034   30.0   5.6   55  412-466    39-93  (201)
 94 PF07523 Big_3:  Bacterial Ig-l  23.8      83  0.0018   26.0   3.1   42  142-188    17-58  (67)
 95 PRK09619 flgD flagellar basal   23.8 2.3E+02   0.005   29.7   7.0   32  143-174   124-155 (218)
 96 PF11589 DUF3244:  Domain of un  23.8 2.3E+02   0.005   25.8   6.3   60  324-383    40-101 (106)
 97 PRK12633 flgD flagellar basal   23.4 2.7E+02  0.0058   29.5   7.4   14  143-156   130-143 (230)
 98 PF06016 Reovirus_L2:  Reovirus  23.4 1.3E+02  0.0028   39.2   5.7  105  314-422  1034-1149(1289)
 99 PF05404 TRAP-delta:  Transloco  23.2 3.2E+02  0.0068   27.4   7.3   52  130-185    45-101 (167)
100 PF07523 Big_3:  Bacterial Ig-l  22.5 1.4E+02   0.003   24.7   4.1   41  334-379    19-59  (67)
101 PF00801 PKD:  PKD domain;  Int  22.4 2.9E+02  0.0062   22.7   6.1   53  410-469     8-60  (69)
102 PF11589 DUF3244:  Domain of un  22.2 1.5E+02  0.0033   27.0   4.7   52  142-193    48-101 (106)
103 COG4856 Uncharacterized protei  20.8 5.6E+02   0.012   29.1   9.3   25  646-670    53-79  (403)
104 PF10633 NPCBM_assoc:  NPCBM-as  20.5 5.2E+02   0.011   21.8   7.5   67  817-884     1-75  (78)

No 1  
>KOG0518|consensus
Probab=100.00  E-value=6.4e-106  Score=925.23  Aligned_cols=851  Identities=30%  Similarity=0.498  Sum_probs=675.2

Q ss_pred             CCCCcccCeEEeCCCCeEEEEEEcCCceeEEEEEEECCEEec--------------------------------------
Q psy192            2 PSGNVDKPVIEDNHDGTVSLHYDPREEGLHELALKFNGDHVQ--------------------------------------   43 (898)
Q Consensus         2 p~g~~~~~~i~~~~d~~~~v~f~P~~~G~h~v~v~~~G~~v~--------------------------------------   43 (898)
                      |+|....|.++++.||+|+++++|.+.|+|.+.++|++.+++                                      
T Consensus       116 ~~g~~~~~~v~~~~dg~y~~k~~p~e~~~h~~e~~~~e~~~~~sP~~~~v~~~td~n~~~Alg~~le~~~vg~p~~f~v~  195 (1113)
T KOG0518|consen  116 LIGSLTWTLVQDYGDGIYKTKRTPKEKGEHEVEVLYDEKPVPASPFVVKVNEGTDWNDVQALGPGLESARVGKPNVFEVE  195 (1113)
T ss_pred             ccccceeEeeeccCCcceeeecCchhccchhhhhhhcccccccCCceeccccccCcccceEeccchhhcccCCCceeEEe
Confidence            678999999999999999999999999999999999999874                                      


Q ss_pred             ----CCCCeeEEEecCccceEEEEeCCCCeEEEEEeeCCCeEEEEEEEECCeecCCCCeE---EEEecCCcccceeeeee
Q psy192           44 ----GYGGLSLSIEGPSKAEIQCKDNADGSLNISYRPTEPGYYIINLKFADHHVEGSPFT---AKIVGEGSNRQREKIQR  116 (898)
Q Consensus        44 ----G~g~l~v~v~GP~~~~i~~~d~~dG~~~v~ytP~~~G~~~I~V~~~g~~i~gSPf~---~~v~~~~~~~~~~~i~~  116 (898)
                          |.++|.+.++||+++...|.|+++|+|.++|+|.++|+|.+++.|.+.||+||||.   ++..|+.+  .  .+  
T Consensus       196 ~~Ga~~gnl~~a~~gpse~~~~~~di~~g~c~veyip~~~gd~sv~~~~gg~~v~GsPf~~~t~~~~Dp~k--i--ki--  269 (1113)
T KOG0518|consen  196 TPGAGQGNLEVAVEGPSEALTVIPDIKPGSCSVEYIPPEPGDYSVNTYYGGVPVPGSPFRVATCDKVDPSK--I--KI--  269 (1113)
T ss_pred             cCCccccceeeeecChhhhcccccccCCCcceeeecCCCCCceEEeeeecccccCCCCeeeeeecccChhh--e--ee--
Confidence                33689999999999999999999999999999999999999999999999999995   44556552  2  22  


Q ss_pred             eecccceeeeCcEEEEEEEcCCCcccceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEEEEECCeecCCCce
Q psy192          117 QREAVPVTEVGSTCKLTFKMPGITAFDLSATVTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVSVRYKDIHIPGSPF  196 (898)
Q Consensus       117 ~~~~~~~~~vg~~~~f~v~~~~~~~~~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~V~gSPf  196 (898)
                        .+++..+..++..|++|++.++-..+.+.|-++.+-..+..+.++++++|+|.|+|.+.|.|.|+|.++++.++.|||
T Consensus       270 --~gl~~gr~~~p~~F~vD~S~aG~~pl~v~V~g~r~p~e~v~~v~~~d~t~~V~fvPs~~g~~~isV~~ade~vp~spf  347 (1113)
T KOG0518|consen  270 --SGLPKGRLNKPQYFTVDASKAGLLPLEVAVCGGRVPVEPVMAVALGDGTYNVSFVPSEDGPYTISVRIADEDVPRSPF  347 (1113)
T ss_pred             --cCCCcccccCCeeEEEechhcCccceEEEecCCCCcccceEEecCCCceeEEEEeEcccCCeeEEEEecCCcCCCCcc
Confidence              134444778899999999999999999999887777566788899999999999999999999999999999999999


Q ss_pred             EEEEccCCCCCcceEEEeCCCCCcc--eeeeeeEEEEEeccCCCCceEEEEEcCCcc--eeeEEECCCC-cEEEEEEeCC
Q psy192          197 QFTVGPLRDGGAHRVHAGGPGLERG--EQNQPCEFNVWTREAGAGSLAISVEGPSKA--EIDFKDRKDG-SCYVSYVVAE  271 (898)
Q Consensus       197 ~v~V~~~~~~d~~~v~a~G~GL~~~--~~g~~~~F~V~~~daG~g~l~v~v~gps~~--~i~~~d~~dG-ty~v~y~P~~  271 (898)
                      .+.|.|.  .+++++++.|+||...  .+-.+..|++|.+|||.+.|++.|.||.+.  +..+.++++| .|.++|.|.+
T Consensus       348 ~~kvlp~--~~as~~~a~g~gLt~~~v~as~~~d~~fD~~Dageg~levqV~gp~Gk~~~~~V~d~~~~~~h~vsY~pd~  425 (1113)
T KOG0518|consen  348 LVKVLPT--VDASKVTASGAGLTDIGVNASVEVDFTFDEGDAGEGLLEVQVVGPEGKEKEVVVRDNGRGGIHIVTYVPDC  425 (1113)
T ss_pred             eeeeccc--cchhhhccccCccccccccccccceeeEEccccccceEEEEEECCCCCceeeEEEecCCCceEEEEEcCCC
Confidence            9999974  5689999999999987  456778999999999999999999999764  4567888887 9999999999


Q ss_pred             CceEEEEEEECCeeCCCCCceeEecCCCCCceeEEEeeeCCCceecCCCeEEEEEecCC-ccceEEEEECCCCCeeeeEE
Q psy192          272 PGEYRVGIKFNDQHIPDSPYKLFVSPAMGDAHKLEIAQFPQGVVMADKPTQFLVRKNGA-VGALDAKVISPSGTEDDCFI  350 (898)
Q Consensus       272 ~G~y~i~V~~~g~~I~gSPf~v~V~~~~~~~~~~~~~~~~~~~~~~g~~~~f~V~~~~a-~G~l~v~v~~p~g~~~~~~v  350 (898)
                      +|.|.|.|+|+|++|++|||++...+   |++++.+..-.  .+++|.+..|.||++.+ .|.+.+.+.+|+++..+..+
T Consensus       426 ~G~y~i~v~~~gd~i~gSPf~~ra~~---dask~~~~~~i--~~~vg~~~~~~vdak~ag~g~~~~~~~~pdn~n~~~~v  500 (1113)
T KOG0518|consen  426 PGRYLIVVFYGGDPIPGSPFTARAYP---DASKEVLVPPI--NAQVGKEANFVVDAKAAGAGPVTCTVVTPDNRNVPADV  500 (1113)
T ss_pred             CCceEEEEEECCcccCCCceEEEecc---CccceEecCCC--cccccceeeeEeehhhccCCCceeEEeccccCCCCcee
Confidence            99999999999999999999999876   56776654322  26899999999999877 47789999999999999999


Q ss_pred             EecCCCeEEEEEEecCCeeEEEEEEECCeecCCCCeEEEeccCC------------CCCceEEecC-----------CCc
Q psy192          351 QPIDGDNYSIRFMPRENGIHNIHIKFNGVHIPGSPLRIKVGKGE------------ADPAAVHATG-----------NGL  407 (898)
Q Consensus       351 ~~~~~g~y~v~f~P~~~G~h~i~V~~~g~~I~gSPf~v~V~~~~------------~d~s~v~v~G-----------~Gl  407 (898)
                      +.+++|+|.+.|++.++|+|.|+|+|+|+|+++|||.+.+..+.            .+-.......           -+|
T Consensus       501 ~~~~dg~~~iv~ta~~~~~~~i~v~f~ge~vp~sp~~~~a~~~~~~~~~~r~v~e~h~v~v~~~~~~~~vi~~~v~k~~l  580 (1113)
T KOG0518|consen  501 QKNGDGTFDIVFTAQDPGTYTIDVRFGGEHVPDSPFIVSASAGIGTAVAARWVTEEHSVEVIEQQSKTLVIPFTVQKKEL  580 (1113)
T ss_pred             eeccCcceeeEeccCCCCcceEEEEECCeeCCCCceeEEEecccccccccccccccccccccccccccccccccccccch
Confidence            99999999999999999999999999999999999999886430            0000000000           011


Q ss_pred             ce----eeeCCeEEEEEEcCCCC---CCceEEEEECCCCceeeeEEeCCe-EEEEEecCCCeeEEEEEEECCEecCCCCe
Q psy192          408 AE----IKSGVKTDFIVDTCNAG---AGTLAVTIDGPSKVSMDCTEVEEG-YKVRYTPLVPGDYYVSLKYNGYHIVGSPF  479 (898)
Q Consensus       408 ~~----~~~g~~~~F~Vd~~~aG---~g~l~v~v~gp~~~~~~~~~~~~g-y~v~ytp~~~G~y~i~V~~~g~~I~gSPf  479 (898)
                      ..    ...++..-++||..+.-   .-.+.+.|.||++..+.|.+|+|| |.++|+|+.||.|+|.|+|+++||+||||
T Consensus       581 ~~ev~~~t~k~~~~~iiDn~d~t~~v~~~~t~aiegPskaei~c~dn~Dgt~~vsy~P~vpG~y~i~V~~~~k~vpgSPF  660 (1113)
T KOG0518|consen  581 AGEVNMPTGKQAKPEIIDNKDGTITVRYAPTLAIEGPSKAEIRCYDNKDGTCSVSYTPTVPGVYKIEVKFGDKHVPGSPF  660 (1113)
T ss_pred             hccccCccccccCceeeccCCCceeEEeccccccCCCCccceeEEeCCCCcEEeccCCCCCceEEEEEEECCeeCCCCCc
Confidence            10    11123334556554431   112345578999999999999999 99999999999999999999999999999


Q ss_pred             EEEEec-ccCCCCC--cceeeeEEEeeEEeeeccCCC-CCccccccCCCceEEEe--cCc--eeeeeccCccEEEEEeec
Q psy192          480 KVKCTG-KDLGERG--GQETSSVTVETVQKVAKNKTQ-GPVIPIFKSDASKVTCK--GMG--LKKAYAQKQNMFTIHCQD  551 (898)
Q Consensus       480 ~v~v~~-~~~~~~~--~~~~s~v~v~~~~~~~~~~~~-~~~~p~~~~~~~~v~~~--g~g--l~~~~~~~~~~f~v~~~~  551 (898)
                      +..|.+ .++...-  ...+..+.+.. .+..-.... ....|..  ....+.++  -.|  -..+.+++.+.+.|+.+.
T Consensus       661 ~~~Vt~~~s~r~s~~~Vgs~~ev~in~-~~~~~s~ltaeI~~PsG--n~~~c~~r~l~~g~~~itF~P~e~GeH~I~Vk~  737 (1113)
T KOG0518|consen  661 VALVTGDESIRTSHLSVGSIHEVTINI-TEADLSVLTAEIVDPSG--NPEPCLVRRLPNGHDGITFTPREVGEHKINVKV  737 (1113)
T ss_pred             eEEEecccccccccceecceeeEEeec-cccCCceeEEEEECCCC--CccceeeEecCCCceeEEECCCcCcceEEEEEE
Confidence            999986 2111000  00111222210 000000000 0000111  11122221  112  224668888999998765


Q ss_pred             C-----CCCeEEEEec--cCCcceeEeCCCccceecCCeeEEEEEecCCCCCCCceeeecccCCCCcEEEEeCCccceEE
Q psy192          552 A-----GSPFKLYVDS--IPSGYVTAYGPGLISGVSGEPCLFTISTKGAGAGSPFQFTVGPLRDGGLSMAVEGPSKAEIT  624 (898)
Q Consensus       552 ~-----G~~~~l~v~~--~~~~~v~~~Gpgl~~g~~g~p~~fti~t~~ag~g~p~~~~~~~~~~g~l~v~v~GPs~~~i~  624 (898)
                      +     |+||.+.|..  .+.+++.++|++|..|+.-+|++|+|||+.||.             |+|+++++||+|++++
T Consensus       738 ~G~hVpgsPf~i~V~~~e~dAsk~~v~g~g~~~G~t~ep~~fivDtr~agy-------------GgLsi~~~Gpskvd~~  804 (1113)
T KOG0518|consen  738 AGKHVPGSPFSIKVSESEIDASKVRVSGQGLKEGHTFEPAEFIVDTRKAGY-------------GGLSISVQGPSKVDLN  804 (1113)
T ss_pred             cceECCCCCeEEEeccccccceeEEEecccccccccccchheEeccccCCC-------------CceEEEEeCCcccccc
Confidence            4     5788777753  356789999999999988899999999999997             8999999999999999


Q ss_pred             EEeCCCCeEEEEEEcCCCeeEEEEEEECCeecCCCceEEEEccCcccccee----------eecCceeecccceeeccCc
Q psy192          625 YHDNKDGTVAVSYLPTAPGEYKIAVKFGEKHIKGSPYLAKITGEGRKRNQI----------SVGSCSEVSFPGKVSDSDI  694 (898)
Q Consensus       625 ~~~~~dg~~~v~Y~P~~pG~Y~i~V~~~~~~I~GSPF~v~V~~~~~~~~~~----------~~g~~~~~~~~~~i~~~~~  694 (898)
                      |.|++||+|+|+|+|++||.|.|+|+|+++||+||||+|+++++++.-+.+          .+|+.  .+|..++++...
T Consensus       805 ~~d~~dGt~kV~ytPtepG~Y~I~i~Fad~~I~gSPftVkv~~~~~vvesi~~~~~~~~va~~g~~--~~l~lk~~~~~~  882 (1113)
T KOG0518|consen  805 VEDREDGTCKVSYTPTEPGTYIINIKFADEHIKGSPFTVKVTGESRVVESITRDREAPSVARVGHT--CSLDLKATEASS  882 (1113)
T ss_pred             eeecCCCeEEEEEeCCCCceEEEEEEEcCccCCCCceEEEecCCeeEeeeeeecccccceecccce--eeeeeecCCCCc
Confidence            999999999999999999999999999999999999999999987422222          22322  234445554444


Q ss_pred             ceeEEEEECCCCCeeceEEEeeCCCcEEEEEEEcccccEEEEEEECCEEcCCCCeEEEeccccccCcceeEEeCcccccc
Q psy192          695 RSLNASIQAPSGLEEPCFLKKIPNGNLGISFTPREVGSHLVSVKKMGVHIKNSPFKINVGEREVGDAKKVKVFGQSLTEG  774 (898)
Q Consensus       695 ~~l~a~v~~psg~~~~~~v~~~~~g~y~v~ftP~e~G~h~i~V~~~G~~i~gSPf~i~V~~~e~~da~kv~v~G~gl~~g  774 (898)
                      ...+|.+.+|||...+..+.++.+++|+|+|||+|+|.|+|+|+|.++|+.+|||++-  +-+-+-++|+|+-|+||..+
T Consensus       883 ~d~ta~vt~PSG~~~~aei~~~~~~~y~vrFtP~e~G~~tl~V~y~~~~vegspftvg--s~~e~~ahkvra~g~Gl~R~  960 (1113)
T KOG0518|consen  883 QDITARVTDPSGRVFEAEIVDLGQGTYQVRFTPKEPGNHTLSVKYKDQHVEGSPFTVG--SLGEGGAHKVRAAGPGLERG  960 (1113)
T ss_pred             cceEEEeeCCCCCccccEEEECCCceEEEEecCCCCCceEEEEEecCccCCCCceeee--cccccccceeeccCCceeec
Confidence            5678889999999999999999999999999999999999999999999999997763  33457899999999999999


Q ss_pred             ccCCCCceEEecCCCCCc-eEEecCCCCCCCceeEEecCCccce-ecCceeEEEEEcCCCCCceeEEEEeCCCccccccc
Q psy192          775 KTHEENPFTVDTRDAGSP-LRIKVGKGEADPAAVHATGNGLAEI-KSGVKTDFIVDTCNAGAGTLAVTIDGPSKVSVKKY  852 (898)
Q Consensus       775 ~~~~~~~f~vd~~~aG~p-~~v~v~~~~~~~~~v~a~G~GL~~~-~~g~~~~f~vdt~~aG~g~l~v~i~gp~~~~~~~~  852 (898)
                      .++++  |+|++|+||-. +-+.+...+ =....++.++||... ..|.+..|.|+|+. +.|.+.+.+.+|+....   
T Consensus       961 ~~g~~--F~I~~rea~~~g~~va~egps-c~v~y~v~~pGv~~l~~~~~~~~~~v~tn~-a~g~ida~v~~p~~~~~--- 1033 (1113)
T KOG0518|consen  961 EAGEP--FSIWTREAGAGGLAVAVEGPS-CGVSYTVQTPGVGSLQEQGVPASFAVTTNR-AKGDIDAKVHTPSNAVE--- 1033 (1113)
T ss_pred             cccCc--eeEeehhcCccceEEEeeCCC-ceeEEEecCCCcchhhhcCcceEEEEEecc-ccCceeeEEeCCCCCcc---
Confidence            99998  99999999843 222232111 122345567776532 12247889999865 56788899999986432   


Q ss_pred             cceEEEEeeCCceEEEEEEeCCCeeEEEEEEECCccCCCCCceeeC
Q psy192          853 KDEIFTRHTGRNNFEVKYIVRDRGEYLLIVKWGDDHIPGSPFKVEV  898 (898)
Q Consensus       853 ~~~~~~~~~g~~~y~v~y~~~~~G~y~l~vk~~~~~i~GSPF~v~v  898 (898)
                        ++++  ...++|.++|+|.|+|.|+|.|||+|.|||||||+|+|
T Consensus      1034 --~c~~--~~~~~ya~~f~P~e~G~h~i~Vkfd~~hv~gSPF~i~v 1075 (1113)
T KOG0518|consen 1034 --ECYV--SEIEKYAMRFIPKENGVHTIDVKFDGEHVPGSPFKIRV 1075 (1113)
T ss_pred             --EEEe--eccceEEEEEeecCCCcEEEEEEeCCccCCCCCeEEEE
Confidence              2332  23449999999999999999999999999999999986


No 2  
>KOG0518|consensus
Probab=100.00  E-value=1.2e-98  Score=865.06  Aligned_cols=809  Identities=32%  Similarity=0.519  Sum_probs=649.5

Q ss_pred             CCCcccCeEEeCCCCeEEEEEEcCCceeEEEEEEECCEEec----------------------CCCC-------------
Q psy192            3 SGNVDKPVIEDNHDGTVSLHYDPREEGLHELALKFNGDHVQ----------------------GYGG-------------   47 (898)
Q Consensus         3 ~g~~~~~~i~~~~d~~~~v~f~P~~~G~h~v~v~~~G~~v~----------------------G~g~-------------   47 (898)
                      +|..-+..|++|.|+||.+.|.|+.+|.+.++++|++....                      |.-+             
T Consensus        17 ~g~~ae~~i~~~~~~T~~~t~~~~~aG~~~~slk~~~~dg~~~p~~v~vl~~~~~skv~~~~p~~q~~~~v~~a~~~ft~   96 (1113)
T KOG0518|consen   17 SGIVAEVKIQDNLDGTVTVTYDPQRAGFYILSLKYDGSDGVNLPSLVQVLSAVDTSKVKKKGPGIQGLHNVREALNKFTV   96 (1113)
T ss_pred             cCcceeeEEEecCCceEEEEEccccCcceeEEEEecCccccccceeeEEeeccccceeEEecCCccCcchhhhhhhhhhh
Confidence            45666777999999999999999999999999999998652                      1111             


Q ss_pred             ---eeEEEec---------Cccc--eEEEEeCCCCeEEEEEeeCCCeEEEEEEEECCeecCCCCeEEEEecCCcccceee
Q psy192           48 ---LSLSIEG---------PSKA--EIQCKDNADGSLNISYRPTEPGYYIINLKFADHHVEGSPFTAKIVGEGSNRQREK  113 (898)
Q Consensus        48 ---l~v~v~G---------P~~~--~i~~~d~~dG~~~v~ytP~~~G~~~I~V~~~g~~i~gSPf~~~v~~~~~~~~~~~  113 (898)
                         +.+.+-|         |...  ..-++++.||+|+..|+|.+.|+|.+.++|++.+++-|||.+.+.--. +..+  
T Consensus        97 d~r~~~nigs~hivd~v~~~~g~~~~~~v~~~~dg~y~~k~~p~e~~~h~~e~~~~e~~~~~sP~~~~v~~~t-d~n~--  173 (1113)
T KOG0518|consen   97 DNRKETNIGSAHIVDHVVKLIGSLTWTLVQDYGDGIYKTKRTPKEKGEHEVEVLYDEKPVPASPFVVKVNEGT-DWND--  173 (1113)
T ss_pred             ccceeeccCCcccccccccccccceeEeeeccCCcceeeecCchhccchhhhhhhcccccccCCceecccccc-Cccc--
Confidence               1111112         1111  223467889999999999999999999999999999999998764211 1111  


Q ss_pred             eeeeecccceeeeCcEEEEEEEcCCCcccceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEEEEECCeecCC
Q psy192          114 IQRQREAVPVTEVGSTCKLTFKMPGITAFDLSATVTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVSVRYKDIHIPG  193 (898)
Q Consensus       114 i~~~~~~~~~~~vg~~~~f~v~~~~~~~~~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~V~g  193 (898)
                      ....+..+....+|++..|++..++++.+.+...+.+|+--  ...+.++++|..+++|+|.++|+|++.++|++.||+|
T Consensus       174 ~~Alg~~le~~~vg~p~~f~v~~~Ga~~gnl~~a~~gpse~--~~~~~di~~g~c~veyip~~~gd~sv~~~~gg~~v~G  251 (1113)
T KOG0518|consen  174 VQALGPGLESARVGKPNVFEVETPGAGQGNLEVAVEGPSEA--LTVIPDIKPGSCSVEYIPPEPGDYSVNTYYGGVPVPG  251 (1113)
T ss_pred             ceEeccchhhcccCCCceeEEecCCccccceeeeecChhhh--cccccccCCCcceeeecCCCCCceEEeeeecccccCC
Confidence            11122344556889999999999999999999999988753  3467889999999999999999999999999999999


Q ss_pred             CceEEEEccCCCCCcceEEEeCCCCCcceeeeeeEEEEEeccCCCCceEEEEEcCCc--ceeeEEECCCCcEEEEEEeCC
Q psy192          194 SPFQFTVGPLRDGGAHRVHAGGPGLERGEQNQPCEFNVWTREAGAGSLAISVEGPSK--AEIDFKDRKDGSCYVSYVVAE  271 (898)
Q Consensus       194 SPf~v~V~~~~~~d~~~v~a~G~GL~~~~~g~~~~F~V~~~daG~g~l~v~v~gps~--~~i~~~d~~dGty~v~y~P~~  271 (898)
                      |||++.+..  ..|++++++.  ||..+++++|..|+||+.+||-..|.|.+-++..  ..+...+++|++|+|.|+|..
T Consensus       252 sPf~~~t~~--~~Dp~kiki~--gl~~gr~~~p~~F~vD~S~aG~~pl~v~V~g~r~p~e~v~~v~~~d~t~~V~fvPs~  327 (1113)
T KOG0518|consen  252 SPFRVATCD--KVDPSKIKIS--GLPKGRLNKPQYFTVDASKAGLLPLEVAVCGGRVPVEPVMAVALGDGTYNVSFVPSE  327 (1113)
T ss_pred             CCeeeeeec--ccChhheeec--CCCcccccCCeeEEEechhcCccceEEEecCCCCcccceEEecCCCceeEEEEeEcc
Confidence            999877653  3578898888  6888899999999999999999999999988754  246778899999999999999


Q ss_pred             CceEEEEEEECCeeCCCCCceeEecCCCCCceeEEEeeeCCC--ceecCCCeEEEEEecCC-ccceEEEEECCCCCeeee
Q psy192          272 PGEYRVGIKFNDQHIPDSPYKLFVSPAMGDAHKLEIAQFPQG--VVMADKPTQFLVRKNGA-VGALDAKVISPSGTEDDC  348 (898)
Q Consensus       272 ~G~y~i~V~~~g~~I~gSPf~v~V~~~~~~~~~~~~~~~~~~--~~~~g~~~~f~V~~~~a-~G~l~v~v~~p~g~~~~~  348 (898)
                      .|.|.|+|.++++.++.|||.+.+.+..+ +++....++...  ++.+-.+..|.+|..|+ .|.|+++|++|.|+..++
T Consensus       328 ~g~~~isV~~ade~vp~spf~~kvlp~~~-as~~~a~g~gLt~~~v~as~~~d~~fD~~Dageg~levqV~gp~Gk~~~~  406 (1113)
T KOG0518|consen  328 DGPYTISVRIADEDVPRSPFLVKVLPTVD-ASKVTASGAGLTDIGVNASVEVDFTFDEGDAGEGLLEVQVVGPEGKEKEV  406 (1113)
T ss_pred             cCCeeEEEEecCCcCCCCcceeeeccccc-hhhhccccCccccccccccccceeeEEccccccceEEEEEECCCCCceee
Confidence            99999999999999999999999998753 466666554433  23455678899999988 678999999999999999


Q ss_pred             EEEecCCC-eEEEEEEecCCeeEEEEEEECCeecCCCCeEEEeccCCCCCceEEecCCCcceeeeCCeEEEEEEcCCCCC
Q psy192          349 FIQPIDGD-NYSIRFMPRENGIHNIHIKFNGVHIPGSPLRIKVGKGEADPAAVHATGNGLAEIKSGVKTDFIVDTCNAGA  427 (898)
Q Consensus       349 ~v~~~~~g-~y~v~f~P~~~G~h~i~V~~~g~~I~gSPf~v~V~~~~~d~s~v~v~G~Gl~~~~~g~~~~F~Vd~~~aG~  427 (898)
                      .++++.++ +|.++|.|..+|.|.|.|+|+|++|++|||+++..   .|++|+.+..+.-  .++|++.+|.||++.||.
T Consensus       407 ~V~d~~~~~~h~vsY~pd~~G~y~i~v~~~gd~i~gSPf~~ra~---~dask~~~~~~i~--~~vg~~~~~~vdak~ag~  481 (1113)
T KOG0518|consen  407 VVRDNGRGGIHIVTYVPDCPGRYLIVVFYGGDPIPGSPFTARAY---PDASKEVLVPPIN--AQVGKEANFVVDAKAAGA  481 (1113)
T ss_pred             EEEecCCCceEEEEEcCCCCCceEEEEEECCcccCCCceEEEec---cCccceEecCCCc--ccccceeeeEeehhhccC
Confidence            99997766 99999999999999999999999999999999985   3899988754432  688999999999999999


Q ss_pred             CceEEEEECCCCcee--eeEEeCCe-EEEEEecCCCeeEEEEEEECCEecCCCCeEEEEecccCCCCCcceeeeEEEeeE
Q psy192          428 GTLAVTIDGPSKVSM--DCTEVEEG-YKVRYTPLVPGDYYVSLKYNGYHIVGSPFKVKCTGKDLGERGGQETSSVTVETV  504 (898)
Q Consensus       428 g~l~v~v~gp~~~~~--~~~~~~~g-y~v~ytp~~~G~y~i~V~~~g~~I~gSPf~v~v~~~~~~~~~~~~~s~v~v~~~  504 (898)
                      +.+.+.+..|++..+  ++..+.+| |++.||+..+|+|.|.|+|+|+|+++|||.+.....-.. .   .... .+...
T Consensus       482 g~~~~~~~~pdn~n~~~~v~~~~dg~~~iv~ta~~~~~~~i~v~f~ge~vp~sp~~~~a~~~~~~-~---~~~r-~v~e~  556 (1113)
T KOG0518|consen  482 GPVTCTVVTPDNRNVPADVQKNGDGTFDIVFTAQDPGTYTIDVRFGGEHVPDSPFIVSASAGIGT-A---VAAR-WVTEE  556 (1113)
T ss_pred             CCceeEEeccccCCCCceeeeccCcceeeEeccCCCCcceEEEEECCeeCCCCceeEEEeccccc-c---cccc-ccccc
Confidence            999999999987654  55677888 999999999999999999999999999999987643000 0   0000 00000


Q ss_pred             EeeeccCCCCCccccccCCCceEEEecCceeeeeccCccEEEEEeecCCCCeEEEEeccCCcceeEeCCCccceecCCee
Q psy192          505 QKVAKNKTQGPVIPIFKSDASKVTCKGMGLKKAYAQKQNMFTIHCQDAGSPFKLYVDSIPSGYVTAYGPGLISGVSGEPC  584 (898)
Q Consensus       505 ~~~~~~~~~~~~~p~~~~~~~~v~~~g~gl~~~~~~~~~~f~v~~~~~G~~~~l~v~~~~~~~v~~~Gpgl~~g~~g~p~  584 (898)
                      -.+       ..++          .+.    + .+  .-.|++.-.+      +           +..-++-++....| 
T Consensus       557 h~v-------~v~~----------~~~----~-~~--vi~~~v~k~~------l-----------~~ev~~~t~k~~~~-  594 (1113)
T KOG0518|consen  557 HSV-------EVIE----------QQS----K-TL--VIPFTVQKKE------L-----------AGEVNMPTGKQAKP-  594 (1113)
T ss_pred             ccc-------cccc----------ccc----c-cc--cccccccccc------h-----------hccccCccccccCc-
Confidence            000       0000          000    0 00  0011111000      0           00001111222233 


Q ss_pred             EEEEEecCCCCCCCceeeecccCCCCcEEEEeCCccceEEEEeCCCCeEEEEEEcCCCeeEEEEEEECCeecCCCceEEE
Q psy192          585 LFTISTKGAGAGSPFQFTVGPLRDGGLSMAVEGPSKAEITYHDNKDGTVAVSYLPTAPGEYKIAVKFGEKHIKGSPYLAK  664 (898)
Q Consensus       585 ~fti~t~~ag~g~p~~~~~~~~~~g~l~v~v~GPs~~~i~~~~~~dg~~~v~Y~P~~pG~Y~i~V~~~~~~I~GSPF~v~  664 (898)
                       ++|+.++.-.      .++    =.+.++||||++++|+|+||+||+|.|+|+|+.||+|+|.|+|+|+|||||||+..
T Consensus       595 -~iiDn~d~t~------~v~----~~~t~aiegPskaei~c~dn~Dgt~~vsy~P~vpG~y~i~V~~~~k~vpgSPF~~~  663 (1113)
T KOG0518|consen  595 -EIIDNKDGTI------TVR----YAPTLAIEGPSKAEIRCYDNKDGTCSVSYTPTVPGVYKIEVKFGDKHVPGSPFVAL  663 (1113)
T ss_pred             -eeeccCCCce------eEE----eccccccCCCCccceeEEeCCCCcEEeccCCCCCceEEEEEEECCeeCCCCCceEE
Confidence             3455544321      111    12357899999999999999999999999999999999999999999999999999


Q ss_pred             Ecc-CccccceeeecCceeecccceeeccCcceeEEEEECCCCCeeceEEEeeCCCcEEEEEEEcccccEEEEEEECCEE
Q psy192          665 ITG-EGRKRNQISVGSCSEVSFPGKVSDSDIRSLNASIQAPSGLEEPCFLKKIPNGNLGISFTPREVGSHLVSVKKMGVH  743 (898)
Q Consensus       665 V~~-~~~~~~~~~~g~~~~~~~~~~i~~~~~~~l~a~v~~psg~~~~~~v~~~~~g~y~v~ftP~e~G~h~i~V~~~G~~  743 (898)
                      |++ .+.+.++.++|+..+..+.  +++.+.+.|+|+|.+|+|..+||.++..+||.|.++|+|+|+|+|+|+|+++|+|
T Consensus       664 Vt~~~s~r~s~~~Vgs~~ev~in--~~~~~~s~ltaeI~~PsGn~~~c~~r~l~~g~~~itF~P~e~GeH~I~Vk~~G~h  741 (1113)
T KOG0518|consen  664 VTGDESIRTSHLSVGSIHEVTIN--ITEADLSVLTAEIVDPSGNPEPCLVRRLPNGHDGITFTPREVGEHKINVKVAGKH  741 (1113)
T ss_pred             EecccccccccceecceeeEEee--ccccCCceeEEEEECCCCCccceeeEecCCCceeEEECCCcCcceEEEEEEcceE
Confidence            996 6677788999999888775  7788889999999999999999999999999999999999999999999999999


Q ss_pred             cCCCCeEEEeccccccCcceeEEeCccccccccCCCCceEEecCCCC---------------------------------
Q psy192          744 IKNSPFKINVGEREVGDAKKVKVFGQSLTEGKTHEENPFTVDTRDAG---------------------------------  790 (898)
Q Consensus       744 i~gSPf~i~V~~~e~~da~kv~v~G~gl~~g~~~~~~~f~vd~~~aG---------------------------------  790 (898)
                      |++|||.|.|.+.|. ||+|++++|+||.+|.+.|+++|+||||+||                                 
T Consensus       742 VpgsPf~i~V~~~e~-dAsk~~v~g~g~~~G~t~ep~~fivDtr~agyGgLsi~~~Gpskvd~~~~d~~dGt~kV~ytPt  820 (1113)
T KOG0518|consen  742 VPGSPFSIKVSESEI-DASKVRVSGQGLKEGHTFEPAEFIVDTRKAGYGGLSISVQGPSKVDLNVEDREDGTCKVSYTPT  820 (1113)
T ss_pred             CCCCCeEEEeccccc-cceeEEEecccccccccccchheEeccccCCCCceEEEEeCCcccccceeecCCCeEEEEEeCC
Confidence            999999999999999 9999999999999999999999999999998                                 


Q ss_pred             ------------------CceEEecCCCCCCCc----eeEEecCCccceecCceeEEEEEcCCCCCceeEEEEeCCCccc
Q psy192          791 ------------------SPLRIKVGKGEADPA----AVHATGNGLAEIKSGVKTDFIVDTCNAGAGTLAVTIDGPSKVS  848 (898)
Q Consensus       791 ------------------~p~~v~v~~~~~~~~----~v~a~G~GL~~~~~g~~~~f~vdt~~aG~g~l~v~i~gp~~~~  848 (898)
                                        |||.|++.    +++    .+.-..+++.-+..|+.++|.++.-+++.-..+++++.|++--
T Consensus       821 epG~Y~I~i~Fad~~I~gSPftVkv~----~~~~vvesi~~~~~~~~va~~g~~~~l~lk~~~~~~~d~ta~vt~PSG~~  896 (1113)
T KOG0518|consen  821 EPGTYIINIKFADEHIKGSPFTVKVT----GESRVVESITRDREAPSVARVGHTCSLDLKATEASSQDITARVTDPSGRV  896 (1113)
T ss_pred             CCceEEEEEEEcCccCCCCceEEEec----CCeeEeeeeeecccccceecccceeeeeeecCCCCccceEEEeeCCCCCc
Confidence                              34444442    222    2333556677789999999999999999999999999999743


Q ss_pred             cccccceEEEEeeCCceEEEEEEeCCCeeEEEEEEECCccCCCCCcee
Q psy192          849 VKKYKDEIFTRHTGRNNFEVKYIVRDRGEYLLIVKWGDDHIPGSPFKV  896 (898)
Q Consensus       849 ~~~~~~~~~~~~~g~~~y~v~y~~~~~G~y~l~vk~~~~~i~GSPF~v  896 (898)
                           .+..+.+.++++|.|+|+|+|+|.|+|+|+|.|+||+||||++
T Consensus       897 -----~~aei~~~~~~~y~vrFtP~e~G~~tl~V~y~~~~vegspftv  939 (1113)
T KOG0518|consen  897 -----FEAEIVDLGQGTYQVRFTPKEPGNHTLSVKYKDQHVEGSPFTV  939 (1113)
T ss_pred             -----cccEEEECCCceEEEEecCCCCCceEEEEEecCccCCCCceee
Confidence                 2566778999999999999999999999999999999999987


No 3  
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=99.81  E-value=1.2e-19  Score=163.38  Aligned_cols=90  Identities=39%  Similarity=0.725  Sum_probs=84.0

Q ss_pred             CceeEEecCCccceecCceeEEEEEcCCCCCceeEEEEeCCCccccccccceEEEEeeCCceEEEEEEeCCCeeEEEEEE
Q psy192          804 PAAVHATGNGLAEIKSGVKTDFIVDTCNAGAGTLAVTIDGPSKVSVKKYKDEIFTRHTGRNNFEVKYIVRDRGEYLLIVK  883 (898)
Q Consensus       804 ~~~v~a~G~GL~~~~~g~~~~f~vdt~~aG~g~l~v~i~gp~~~~~~~~~~~~~~~~~g~~~y~v~y~~~~~G~y~l~vk  883 (898)
                      |++|+|+|+||..+.+|++++|+|+|+++|.+.|+|.|.+|++..+     ++.+.+++||+|.|+|+|+++|.|.|+|+
T Consensus         1 p~~~~v~G~Gl~~~~vg~~~~f~v~~~d~G~~~~~v~i~~p~g~~~-----~~~v~d~~dGty~v~y~P~~~G~~~i~V~   75 (93)
T smart00557        1 ASKVKASGPGLEKGVVGEPAEFTIDTRGAGGGELEVEVTGPSGKKV-----PVEVKDNGDGTYTVSYTPTEPGDYTVTVK   75 (93)
T ss_pred             CCeEEEECCCcCceecCCCEEEEEEcCCCCCCcEEEEEECCCCCee-----EeEEEeCCCCEEEEEEEeCCCEeEEEEEE
Confidence            5789999999999999999999999999999999999999997332     56788999999999999999999999999


Q ss_pred             ECCccCCCCCceeeC
Q psy192          884 WGDDHIPGSPFKVEV  898 (898)
Q Consensus       884 ~~~~~i~GSPF~v~v  898 (898)
                      |+|+||+||||++.|
T Consensus        76 ~~g~~I~gSPF~v~V   90 (93)
T smart00557       76 FGGEHIPGSPFTVKV   90 (93)
T ss_pred             ECCEECCCCCEEEEE
Confidence            999999999999975


No 4  
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=99.81  E-value=2.2e-19  Score=161.67  Aligned_cols=90  Identities=48%  Similarity=0.879  Sum_probs=85.3

Q ss_pred             cceEEEeCCCCCcceeeeeeEEEEEeccCCCCceEEEEEcCCc--ceeeEEECCCCcEEEEEEeCCCceEEEEEEECCee
Q psy192          208 AHRVHAGGPGLERGEQNQPCEFNVWTREAGAGSLAISVEGPSK--AEIDFKDRKDGSCYVSYVVAEPGEYRVGIKFNDQH  285 (898)
Q Consensus       208 ~~~v~a~G~GL~~~~~g~~~~F~V~~~daG~g~l~v~v~gps~--~~i~~~d~~dGty~v~y~P~~~G~y~i~V~~~g~~  285 (898)
                      +++|+|+|+||+.+.+|++++|+|+++|+|.+.|+|.|.+|++  .++++.|++||+|.|+|+|..+|.|.|+|+|+|++
T Consensus         1 p~~~~v~G~Gl~~~~vg~~~~f~v~~~d~G~~~~~v~i~~p~g~~~~~~v~d~~dGty~v~y~P~~~G~~~i~V~~~g~~   80 (93)
T smart00557        1 ASKVKASGPGLEKGVVGEPAEFTIDTRGAGGGELEVEVTGPSGKKVPVEVKDNGDGTYTVSYTPTEPGDYTVTVKFGGEH   80 (93)
T ss_pred             CCeEEEECCCcCceecCCCEEEEEEcCCCCCCcEEEEEECCCCCeeEeEEEeCCCCEEEEEEEeCCCEeEEEEEEECCEE
Confidence            4689999999999999999999999999999999999999986  46788999999999999999999999999999999


Q ss_pred             CCCCCceeEecC
Q psy192          286 IPDSPYKLFVSP  297 (898)
Q Consensus       286 I~gSPf~v~V~~  297 (898)
                      |+||||.+.|.+
T Consensus        81 I~gSPF~v~V~~   92 (93)
T smart00557       81 IPGSPFTVKVGP   92 (93)
T ss_pred             CCCCCEEEEEeC
Confidence            999999999865


No 5  
>PF00630 Filamin:  Filamin/ABP280 repeat;  InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=99.72  E-value=5.5e-17  Score=148.73  Aligned_cols=93  Identities=31%  Similarity=0.512  Sum_probs=79.8

Q ss_pred             CCCceeEEecCCccceecCceeEEEEEcCCCCCc-------eeEEEEeCCCccccccccceEEEEeeCCceEEEEEEeCC
Q psy192          802 ADPAAVHATGNGLAEIKSGVKTDFIVDTCNAGAG-------TLAVTIDGPSKVSVKKYKDEIFTRHTGRNNFEVKYIVRD  874 (898)
Q Consensus       802 ~~~~~v~a~G~GL~~~~~g~~~~f~vdt~~aG~g-------~l~v~i~gp~~~~~~~~~~~~~~~~~g~~~y~v~y~~~~  874 (898)
                      +||++|+|+|+||..+.+|++++|+|++++++.+       .|.|+|.+|+...... ..++.+.+++||+|.++|+|.+
T Consensus         2 ~dp~~~~v~G~gl~~~~~g~~~~F~V~~~d~~g~~~~~~~~~~~v~i~~p~~~~~~~-~~~~~v~~~~~G~y~v~y~p~~   80 (101)
T PF00630_consen    2 IDPSKCKVSGPGLEPAVVGEPATFTVDTRDAGGNPVSSGGDEFQVTITSPDGKEEPV-PVPVEVIDNGDGTYTVSYTPTE   80 (101)
T ss_dssp             CSGCGEEEESGGGTEEETTSEEEEEEEETTTTSSBEESTSSEEEEEEESSSSESS---EEEEEEEEESSSEEEEEEEESS
T ss_pred             CCCCEEEEECCccCCeECCCcEEEEEEEccCCCCccccCCceeEEEEeCCCCCcccc-ccceEEEECCCCEEEEEEEeCc
Confidence            6899999999999999999999999999998555       5779999998751100 1246677889999999999999


Q ss_pred             CeeEEEEEEECCccCCCCCce
Q psy192          875 RGEYLLIVKWGDDHIPGSPFK  895 (898)
Q Consensus       875 ~G~y~l~vk~~~~~i~GSPF~  895 (898)
                      +|.|.|+|+|+|+||+||||+
T Consensus        81 ~G~y~i~V~~~g~~I~gSPf~  101 (101)
T PF00630_consen   81 PGKYKISVKINGQPIPGSPFT  101 (101)
T ss_dssp             SEEEEEEEEESSEB-TTSSEE
T ss_pred             cEeEEEEEEECCEECcCCCcC
Confidence            999999999999999999996


No 6  
>PF00630 Filamin:  Filamin/ABP280 repeat;  InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=99.67  E-value=3.1e-16  Score=143.75  Aligned_cols=87  Identities=34%  Similarity=0.685  Sum_probs=78.1

Q ss_pred             CCcceEEEeCCCCCcceeeeeeEEEEEeccCCCC-------ceEEEEEcCCcc------eeeEEECCCCcEEEEEEeCCC
Q psy192          206 GGAHRVHAGGPGLERGEQNQPCEFNVWTREAGAG-------SLAISVEGPSKA------EIDFKDRKDGSCYVSYVVAEP  272 (898)
Q Consensus       206 ~d~~~v~a~G~GL~~~~~g~~~~F~V~~~daG~g-------~l~v~v~gps~~------~i~~~d~~dGty~v~y~P~~~  272 (898)
                      .||++|+|+|+||+.+.+|++++|+|+++|++++       .|.|.|.+|+..      .+++.+++||+|.++|+|+++
T Consensus         2 ~dp~~~~v~G~gl~~~~~g~~~~F~V~~~d~~g~~~~~~~~~~~v~i~~p~~~~~~~~~~~~v~~~~~G~y~v~y~p~~~   81 (101)
T PF00630_consen    2 IDPSKCKVSGPGLEPAVVGEPATFTVDTRDAGGNPVSSGGDEFQVTITSPDGKEEPVPVPVEVIDNGDGTYTVSYTPTEP   81 (101)
T ss_dssp             CSGCGEEEESGGGTEEETTSEEEEEEEETTTTSSBEESTSSEEEEEEESSSSESS--EEEEEEEEESSSEEEEEEEESSS
T ss_pred             CCCCEEEEECCccCCeECCCcEEEEEEEccCCCCccccCCceeEEEEeCCCCCccccccceEEEECCCCEEEEEEEeCcc
Confidence            4789999999999999999999999999999655       567999999664      356789999999999999999


Q ss_pred             ceEEEEEEECCeeCCCCCce
Q psy192          273 GEYRVGIKFNDQHIPDSPYK  292 (898)
Q Consensus       273 G~y~i~V~~~g~~I~gSPf~  292 (898)
                      |.|+|+|+|+|++|+||||+
T Consensus        82 G~y~i~V~~~g~~I~gSPf~  101 (101)
T PF00630_consen   82 GKYKISVKINGQPIPGSPFT  101 (101)
T ss_dssp             EEEEEEEEESSEB-TTSSEE
T ss_pred             EeEEEEEEECCEECcCCCcC
Confidence            99999999999999999996


No 7  
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=99.36  E-value=7.8e-07  Score=104.44  Aligned_cols=424  Identities=13%  Similarity=0.154  Sum_probs=223.1

Q ss_pred             EEEEEcCCceeEEEEEEECCEEec---------------------------------CCCCeeEEEecCccceEE--EEe
Q psy192           20 SLHYDPREEGLHELALKFNGDHVQ---------------------------------GYGGLSLSIEGPSKAEIQ--CKD   64 (898)
Q Consensus        20 ~v~f~P~~~G~h~v~v~~~G~~v~---------------------------------G~g~l~v~v~GP~~~~i~--~~d   64 (898)
                      ++.++|+++|.+.|.|..-..--.                                 |. .|.+.|++-++. +-  ..-
T Consensus        55 ~~~w~p~~~GkY~imVq~K~~~S~~~fD~~~~~~~~v~~~~~~~I~~~~~~~~~l~vGe-~l~~~V~~~~e~-~LYKF~I  132 (667)
T PRK14081         55 EINWIPKEEGEYTIMVQAKKEDSNKPFDYVSKEDYVIGKAEEKLIKNIYLDKDTLNVGE-KIEIKVDSNKEP-LMYRYWI  132 (667)
T ss_pred             eEEEeeCCCccEEEEEEEecCCCCCCcceeEEEEEEEcccchhhheeeEecCccccCCC-EEEEEEEeccCc-EEEEEEE
Confidence            689999999999999988432110                                 11 244555543322 21  112


Q ss_pred             CCCCeEE----------EEEeeCCCeEEEEEEEECCeecCC-------CCeEEEEecCCcccceeeeeeeecccceeeeC
Q psy192           65 NADGSLN----------ISYRPTEPGYYIINLKFADHHVEG-------SPFTAKIVGEGSNRQREKIQRQREAVPVTEVG  127 (898)
Q Consensus        65 ~~dG~~~----------v~ytP~~~G~~~I~V~~~g~~i~g-------SPf~~~v~~~~~~~~~~~i~~~~~~~~~~~vg  127 (898)
                      ++||.+.          ++|+|+.+|.|.|-+..-+.-=..       .=|.  |....  .  ..|............|
T Consensus       133 ~~~~~w~~iqDYst~n~lsyt~~~~G~Y~ll~~~Kd~~S~~~fDD~~~v~y~--Vk~~~--~--v~I~~F~~lns~~i~~  206 (667)
T PRK14081        133 KEDNNWKLIKDYSTENSLSYTANKPGKYELLVECKRIDSTKDFDDFKKVKFK--VKEID--K--VEITDFKCLNKELICD  206 (667)
T ss_pred             cCCCcEEEEEecCCcceEEEEecCCCcEEEEEEEecCCCccccCcceEEEEE--cccCc--c--eEEEeccccCcceecC
Confidence            3455444          699999999999988765422110       0111  11111  0  0122111122335667


Q ss_pred             cEEEEEEEcCCCcccceE--EEEeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEEEEECCeecCC-----CceEEEE
Q psy192          128 STCKLTFKMPGITAFDLS--ATVTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVSVRYKDIHIPG-----SPFQFTV  200 (898)
Q Consensus       128 ~~~~f~v~~~~~~~~~~~--~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~V~g-----SPf~v~V  200 (898)
                      ....|.+.+..-++..+-  -.+.+++|...  .+++=.. .=.++|+|..+|.|+|.+.+.+..=.+     .=..++|
T Consensus       207 ~eI~f~~~a~~~~g~~~LYKF~~i~~~G~~~--~~qdYst-~n~~~y~~~~~G~Y~i~~~VKD~~S~~eyDD~~~l~y~V  283 (667)
T PRK14081        207 EELVFEVESVYEEDRTILYKFVKIDSDGKQT--CIQDYST-KNIVSYKEKKSGDYKLLCLVKDMYSNNEFDDRAVLVYTV  283 (667)
T ss_pred             cEEEEEEEEEeCCCceEEEEEEEECCCCCEE--EecCccc-cceEEEEeCCCccEEEEEEEeccCcccccccceEEEEEE
Confidence            888888777654443332  22445666532  2222111 235789999999999999997643332     1123334


Q ss_pred             ccCCCCCcceEEEeCCCCCc-ceeeeeeEEEEEeccCCCCce--EEEEEcCCcceeeEEECCCCc-EEEEEEeCCCceEE
Q psy192          201 GPLRDGGAHRVHAGGPGLER-GEQNQPCEFNVWTREAGAGSL--AISVEGPSKAEIDFKDRKDGS-CYVSYVVAEPGEYR  276 (898)
Q Consensus       201 ~~~~~~d~~~v~a~G~GL~~-~~~g~~~~F~V~~~daG~g~l--~v~v~gps~~~i~~~d~~dGt-y~v~y~P~~~G~y~  276 (898)
                      .+-.   +-+++-.=.-+.. -.+|++..|.+.+.+  +.+|  ...|.|+..     .+.+=++ -.+.|+|..+|.|+
T Consensus       284 k~~~---~vkI~~~~~d~~s~ql~g~~I~ika~a~G--G~~llYrf~I~G~~~-----e~~~Y~~~n~~~w~P~~~G~Y~  353 (667)
T PRK14081        284 KPYK---DIKIRNFTTDLSSPQLTDTDIELKAVAEG--GKELLYRFIIKGKES-----EDSGYIRNNIYTWKPKIAGKYS  353 (667)
T ss_pred             ecCC---CcEEEEEEEcCCCCeecCceEEEEEEecC--CCceEEEEEECCcEE-----eeccccccceEEEeeCCCceEE
Confidence            3211   1222211111222 358999999998874  4444  466776542     2222233 46899999999999


Q ss_pred             EEEEECCeeCCCC-----CceeEecCCC-CC--ceeEEEeeeCCCceecCCCeEEEEEecCCccceEEEEECCCCCeeee
Q psy192          277 VGIKFNDQHIPDS-----PYKLFVSPAM-GD--AHKLEIAQFPQGVVMADKPTQFLVRKNGAVGALDAKVISPSGTEDDC  348 (898)
Q Consensus       277 i~V~~~g~~I~gS-----Pf~v~V~~~~-~~--~~~~~~~~~~~~~~~~g~~~~f~V~~~~a~G~l~v~v~~p~g~~~~~  348 (898)
                      |.|..-|..-.+.     -..+.|.... .+  ...+.+.  ....+.+|++..|.+.+.+..+-+=--++--+|..+. 
T Consensus       354 I~v~VKDk~S~~eyD~~~~v~f~i~~~~~~p~~I~~vl~d--~~~~~lvG~~i~i~v~a~gg~~~lY~f~ik~ng~~ve-  430 (667)
T PRK14081        354 ITLWVKDISSKGEYEDKSSIDYTIEEKSKEPIKIEDVILD--KGKHILKGEEIKIRVIAEGGTNLRYSFIIKKDGKEEE-  430 (667)
T ss_pred             EEEEEEcCcCcccccceEEEEEEEcccCCCCeEEEEEEEC--CCCceEeCCeEEEEEEecCCCeEEEEEEEEECCEEEE-
Confidence            9999988654311     1112222210 01  1122221  1234678999999988776543221112222454321 


Q ss_pred             EEEecCCCeEEEEEEecCCeeEEEEEEECCeecC----C-CCeEEEeccCCCCCceE-EecCCCcceeeeCCeEEEEEEc
Q psy192          349 FIQPIDGDNYSIRFMPRENGIHNIHIKFNGVHIP----G-SPLRIKVGKGEADPAAV-HATGNGLAEIKSGVKTDFIVDT  422 (898)
Q Consensus       349 ~v~~~~~g~y~v~f~P~~~G~h~i~V~~~g~~I~----g-SPf~v~V~~~~~d~s~v-~v~G~Gl~~~~~g~~~~F~Vd~  422 (898)
                       -.+-. -.-.+.|+|..+|.|+|.|..-+..=.    . .=+.+.|..  .-|.+. .+-=+-.....+|+..+|.+.+
T Consensus       431 -~~~Y~-~~~~~~f~P~~~G~Y~IeV~vKdk~S~~~yD~~k~v~l~V~e--~~P~~i~~il~~~~~~~~vg~~i~~~~~~  506 (667)
T PRK14081        431 -KIDYG-KNNWVNFIPEEKGNYELEVRVKDKYSDKEYDAHTIVYIKVHE--YIPAEIDYILLPVKEYYLVGDDIEIEVII  506 (667)
T ss_pred             -Eeecc-cccEEEEEECCCeeEEEEEEEecccCchhcccceEEEEEEec--cCceeeeeEEecccccEEeCCEEEEEEEE
Confidence             11111 123789999999999999987653111    0 112334431  112110 0001122345789999999988


Q ss_pred             CCCCCCceEEEEECCCCceeeeEEeCCeEEEEEecCCCeeEEEEEEECCE
Q psy192          423 CNAGAGTLAVTIDGPSKVSMDCTEVEEGYKVRYTPLVPGDYYVSLKYNGY  472 (898)
Q Consensus       423 ~~aG~g~l~v~v~gp~~~~~~~~~~~~gy~v~ytp~~~G~y~i~V~~~g~  472 (898)
                      .+...-.+...+. -++.-+..++-...-..+|+|..+|.|+|.|..-+.
T Consensus       507 ~~~k~v~y~y~~~-~NG~~v~~t~Ys~~~~ysf~P~~~GkY~I~V~aKn~  555 (667)
T PRK14081        507 QNTKDVLIKYILK-INGHKVEETDYIKNKKYKFIPKCSGKYTIEVLAKNI  555 (667)
T ss_pred             eCCCeEEEEEEEE-ECCEEEEEeeccccceEEEeecCCceEEEEEEEccc
Confidence            7732211121111 112122222222224578899999999998876553


No 8  
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=99.33  E-value=3.3e-06  Score=99.24  Aligned_cols=374  Identities=14%  Similarity=0.138  Sum_probs=198.1

Q ss_pred             EEEEeeCCCeEEEEEEEECCeecCCCCeEE------EEecCCcccceeeeeeeecccceeeeCcEEEEEEEcCCCcccce
Q psy192           71 NISYRPTEPGYYIINLKFADHHVEGSPFTA------KIVGEGSNRQREKIQRQREAVPVTEVGSTCKLTFKMPGITAFDL  144 (898)
Q Consensus        71 ~v~ytP~~~G~~~I~V~~~g~~i~gSPf~~------~v~~~~~~~~~~~i~~~~~~~~~~~vg~~~~f~v~~~~~~~~~~  144 (898)
                      .++|+|+++|.|.|-|..-...-.. ||--      .|......    -|..-........+|+.+.|.++...-   ++
T Consensus        55 ~~~w~p~~~GkY~imVq~K~~~S~~-~fD~~~~~~~~v~~~~~~----~I~~~~~~~~~l~vGe~l~~~V~~~~e---~~  126 (667)
T PRK14081         55 EINWIPKEEGEYTIMVQAKKEDSNK-PFDYVSKEDYVIGKAEEK----LIKNIYLDKDTLNVGEKIEIKVDSNKE---PL  126 (667)
T ss_pred             eEEEeeCCCccEEEEEEEecCCCCC-CcceeEEEEEEEcccchh----hheeeEecCccccCCCEEEEEEEeccC---cE
Confidence            3799999999999999987655433 4431      12221110    111111112334678888888775432   12


Q ss_pred             EEE-EeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEEEEECCeecCC-----CceEEEEccCCCCCcceEEEeCC-C
Q psy192          145 SAT-VTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVSVRYKDIHIPG-----SPFQFTVGPLRDGGAHRVHAGGP-G  217 (898)
Q Consensus       145 ~~~-V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~V~g-----SPf~v~V~~~~~~d~~~v~a~G~-G  217 (898)
                      -=+ ....+|+..  .++|=.. .=.++|+|..+|.|+|.+..-+.-=..     .=..++|.+.     .++.+.+- -
T Consensus       127 LYKF~I~~~~~w~--~iqDYst-~n~lsyt~~~~G~Y~ll~~~Kd~~S~~~fDD~~~v~y~Vk~~-----~~v~I~~F~~  198 (667)
T PRK14081        127 MYRYWIKEDNNWK--LIKDYST-ENSLSYTANKPGKYELLVECKRIDSTKDFDDFKKVKFKVKEI-----DKVEITDFKC  198 (667)
T ss_pred             EEEEEEcCCCcEE--EEEecCC-cceEEEEecCCCcEEEEEEEecCCCccccCcceEEEEEcccC-----cceEEEeccc
Confidence            111 133344422  2333111 236789999999999988775321110     0034555542     23444432 2


Q ss_pred             CCcc-eeeeeeEEEEEeccCCCCceE--EEEEcCCcceeeEEECCCCcEEEEEEeCCCceEEEEEEECCeeCCCC-----
Q psy192          218 LERG-EQNQPCEFNVWTREAGAGSLA--ISVEGPSKAEIDFKDRKDGSCYVSYVVAEPGEYRVGIKFNDQHIPDS-----  289 (898)
Q Consensus       218 L~~~-~~g~~~~F~V~~~daG~g~l~--v~v~gps~~~i~~~d~~dGty~v~y~P~~~G~y~i~V~~~g~~I~gS-----  289 (898)
                      |... .+|+...|.+++...++..+-  ..+.+..+ .+.|...=.-+-.++|+|..+|.|.|.+...|..-...     
T Consensus       199 lns~~i~~~eI~f~~~a~~~~g~~~LYKF~~i~~~G-~~~~~qdYst~n~~~y~~~~~G~Y~i~~~VKD~~S~~eyDD~~  277 (667)
T PRK14081        199 LNKELICDEELVFEVESVYEEDRTILYKFVKIDSDG-KQTCIQDYSTKNIVSYKEKKSGDYKLLCLVKDMYSNNEFDDRA  277 (667)
T ss_pred             cCcceecCcEEEEEEEEEeCCCceEEEEEEEECCCC-CEEEecCccccceEEEEeCCCccEEEEEEEeccCcccccccce
Confidence            4443 589999999998866555543  33433333 22232111122368999999999999998877532111     


Q ss_pred             --CceeEecCCCCCceeEEEeeeCCCceecCCCeEEEEEecCCccceEEEEECCCCCeeeeEEEecCCCeEEEEEEecCC
Q psy192          290 --PYKLFVSPAMGDAHKLEIAQFPQGVVMADKPTQFLVRKNGAVGALDAKVISPSGTEDDCFIQPIDGDNYSIRFMPREN  367 (898)
Q Consensus       290 --Pf~v~V~~~~~~~~~~~~~~~~~~~~~~g~~~~f~V~~~~a~G~l~v~v~~p~g~~~~~~v~~~~~g~y~v~f~P~~~  367 (898)
                        =|+|....+. ....+.+  ...+....|++..|...+.+..+-|=--++  .|...  ...+- .-.=.+.|+|.++
T Consensus       278 ~l~y~Vk~~~~v-kI~~~~~--d~~s~ql~g~~I~ika~a~GG~~llYrf~I--~G~~~--e~~~Y-~~~n~~~w~P~~~  349 (667)
T PRK14081        278 VLVYTVKPYKDI-KIRNFTT--DLSSPQLTDTDIELKAVAEGGKELLYRFII--KGKES--EDSGY-IRNNIYTWKPKIA  349 (667)
T ss_pred             EEEEEEecCCCc-EEEEEEE--cCCCCeecCceEEEEEEecCCCceEEEEEE--CCcEE--eeccc-cccceEEEeeCCC
Confidence              1443211110 0111222  122345688988888887764332211122  24332  11111 1123789999999


Q ss_pred             eeEEEEEEECCeecCC-----CCeEEEeccCCCCC---ceEEecCCCcceeeeCCeEEEEEEcCCCCCCceEEEEECCCC
Q psy192          368 GIHNIHIKFNGVHIPG-----SPLRIKVGKGEADP---AAVHATGNGLAEIKSGVKTDFIVDTCNAGAGTLAVTIDGPSK  439 (898)
Q Consensus       368 G~h~i~V~~~g~~I~g-----SPf~v~V~~~~~d~---s~v~v~G~Gl~~~~~g~~~~F~Vd~~~aG~g~l~v~v~gp~~  439 (898)
                      |.|.|.|.+-+..=.+     .=..+++......|   ..+.+..  -....+|++..|.+.+.+...=.....|.. ++
T Consensus       350 G~Y~I~v~VKDk~S~~eyD~~~~v~f~i~~~~~~p~~I~~vl~d~--~~~~lvG~~i~i~v~a~gg~~~lY~f~ik~-ng  426 (667)
T PRK14081        350 GKYSITLWVKDISSKGEYEDKSSIDYTIEEKSKEPIKIEDVILDK--GKHILKGEEIKIRVIAEGGTNLRYSFIIKK-DG  426 (667)
T ss_pred             ceEEEEEEEEcCcCcccccceEEEEEEEcccCCCCeEEEEEEECC--CCceEeCCeEEEEEEecCCCeEEEEEEEEE-CC
Confidence            9999999987653321     11123332211122   2233332  234578999999988765532112222221 23


Q ss_pred             ceeeeEEeCCeEEEEEecCCCeeEEEEEEECCE
Q psy192          440 VSMDCTEVEEGYKVRYTPLVPGDYYVSLKYNGY  472 (898)
Q Consensus       440 ~~~~~~~~~~gy~v~ytp~~~G~y~i~V~~~g~  472 (898)
                      ..++-++-...-.++|+|..+|.|+|+|..-+.
T Consensus       427 ~~ve~~~Y~~~~~~~f~P~~~G~Y~IeV~vKdk  459 (667)
T PRK14081        427 KEEEKIDYGKNNWVNFIPEEKGNYELEVRVKDK  459 (667)
T ss_pred             EEEEEeecccccEEEEEECCCeeEEEEEEEecc
Confidence            233322222236899999999999998876543


No 9  
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=95.26  E-value=21  Score=49.70  Aligned_cols=293  Identities=12%  Similarity=0.093  Sum_probs=145.0

Q ss_pred             EEEEEeeCCCeEEEEEEEECCeecCCC--CeEEEEecCCcccceeeeeeeecccceeeeCcEEEEEEEcCCCcccceEEE
Q psy192           70 LNISYRPTEPGYYIINLKFADHHVEGS--PFTAKIVGEGSNRQREKIQRQREAVPVTEVGSTCKLTFKMPGITAFDLSAT  147 (898)
Q Consensus        70 ~~v~ytP~~~G~~~I~V~~~g~~i~gS--Pf~~~v~~~~~~~~~~~i~~~~~~~~~~~vg~~~~f~v~~~~~~~~~~~~~  147 (898)
                      ..+.||+.++|.|+|++..-+.- ...  ...+.|.++-....   +   ........+|+..+|.+.+..-.  +++..
T Consensus      1830 k~v~~Tf~~aG~ytV~L~AsN~v-s~~~~s~~~~VQe~I~~L~---L---~as~~~~~~n~~v~fsa~l~~GS--~Vtf~ 1900 (2740)
T TIGR00864      1830 KHACMTFPDAGTFAIRLNASNAV-SGKSASREFFAEEPIFGLE---L---KASKKIAAIGEKVEFQILLAAGS--AVNFR 1900 (2740)
T ss_pred             ceeEEecCCCeEEEEEEEEEccc-CcceeeeeEEEEEecceEE---E---ecccccccCCCEEEEEEEecCCC--ceEEE
Confidence            56889999999999999843321 222  23334444321111   0   01223456788888887665322  23322


Q ss_pred             EeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEEEEECCeecCCCceEEE--EccCCCCCcceEEEeCCCCCc-----
Q psy192          148 VTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVSVRYKDIHIPGSPFQFT--VGPLRDGGAHRVHAGGPGLER-----  220 (898)
Q Consensus       148 V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~V~gSPf~v~--V~~~~~~d~~~v~a~G~GL~~-----  220 (898)
                      ..=-+|...   +.+ ...+|.  +.=.++|.|.|+|...+. |......++  |...         +.|.-|..     
T Consensus      1901 w~fGdgs~~---~~t-~~~t~~--HsY~~~G~Y~VtV~A~N~-Vs~~~a~~~V~Vle~---------V~gL~I~~~c~~~ 1964 (2740)
T TIGR00864      1901 LQIGGAAPE---VLQ-PGPRFS--HSFPRVDDHMVNLRAKNE-VSCAQANLHIEVLEA---------VRGLQIPDCCAAG 1964 (2740)
T ss_pred             EEeCCCCce---eec-CCCeEE--eecCCCCcEEEEEEEEec-cceeeeeEEEEEEEe---------ccceEecCCcccc
Confidence            221223321   111 122333  444789999999998764 444444443  3321         12211111     


Q ss_pred             ceeeeeeEEEEEeccCCCCceEEEEEcCCcceeeEEE---CCCCc-----EEEEEEeCCCceEEEEEEECCe-eCCCCCc
Q psy192          221 GEQNQPCEFNVWTREAGAGSLAISVEGPSKAEIDFKD---RKDGS-----CYVSYVVAEPGEYRVGIKFNDQ-HIPDSPY  291 (898)
Q Consensus       221 ~~~g~~~~F~V~~~daG~g~l~v~v~gps~~~i~~~d---~~dGt-----y~v~y~P~~~G~y~i~V~~~g~-~I~gSPf  291 (898)
                      ..+|+...|+..... | ..+...=        .+..   +++..     =++.|+|..+|.|+|+|+-.+. .-..-+-
T Consensus      1965 ~~~g~~~tF~A~v~~-g-~~V~f~W--------~f~~~~~~g~~~~~~~G~~vty~~~~~G~~~I~v~A~N~l~s~n~t~ 2034 (2740)
T TIGR00864      1965 IATGEEKNFTANVQR-G-KPVAFAW--------TFDLHHLHGDSLVIHMGKDVSYTAEAAGLLEIQLGAFNALGAENITL 2034 (2740)
T ss_pred             eecCceEEEEEEEec-C-CceEEEE--------EeeecccCCCcceeecCCceEEccCCCcEEEEEEEEEcccCcccccE
Confidence            245677777665521 1 1111100        0000   11111     1489999999999999987542 2222334


Q ss_pred             eeEecCCCCCceeEEEeeeCCCceecCCCeEEEEEecCCccceEEEEECCCCCeeeeEEEecCCCeEEEEEEecCCeeEE
Q psy192          292 KLFVSPAMGDAHKLEIAQFPQGVVMADKPTQFLVRKNGAVGALDAKVISPSGTEDDCFIQPIDGDNYSIRFMPRENGIHN  371 (898)
Q Consensus       292 ~v~V~~~~~~~~~~~~~~~~~~~~~~g~~~~f~V~~~~a~G~l~v~v~~p~g~~~~~~v~~~~~g~y~v~f~P~~~G~h~  371 (898)
                      .+.|.....   ...+... .....+|+...|.+...-..-++.....=-+|...      ......+++++=.++|.|.
T Consensus      2035 ~i~VQ~~i~---~~~L~~~-p~~~~~n~~v~f~~~~~~~~~~v~~~WDFGDGt~~------~~T~~~~~sHtY~r~G~Y~ 2104 (2740)
T TIGR00864      2035 QLEAQDALM---DAALQAG-PQDCFTNKMAQFEAATSPKPNFMACHWDFGDGSAG------QDTDEPRAEHEYLHPGDYR 2104 (2740)
T ss_pred             EEEEEehee---eEEEeec-CcceecCceEEEEEEEccCCCcEEEEEEecCCCCc------cccCCceeEEEecCCccEE
Confidence            445533221   1122111 12245577777776653221123333221123221      1112457778778889999


Q ss_pred             EEEE-ECCeecCCCCeEEEeccCCCCCceEEecCCCc
Q psy192          372 IHIK-FNGVHIPGSPLRIKVGKGEADPAAVHATGNGL  407 (898)
Q Consensus       372 i~V~-~~g~~I~gSPf~v~V~~~~~d~s~v~v~G~Gl  407 (898)
                      +.|. +|.....-.=.++.|...++.+-.|.+.|+.-
T Consensus      2105 V~V~a~N~VS~~~a~~~i~V~vl~C~~P~V~i~~~~~ 2141 (2740)
T TIGR00864      2105 VQVNASNLVSFFSAHAEINVQVLACEEPEVDVVLALQ 2141 (2740)
T ss_pred             EEEEEEcceeeeeeEEEEEEEEecCCCCCcEeccCce
Confidence            9987 45433322233444444457777777776643


No 10 
>PF13584 BatD:  Oxygen tolerance
Probab=94.87  E-value=3.6  Score=48.64  Aligned_cols=203  Identities=16%  Similarity=0.269  Sum_probs=108.5

Q ss_pred             eEEEEEeeCCCeEEEE---EEEECCeecCCCCeEEEEecCCcccc------eeeee-eeecccceeeeCcEEEEEEEcCC
Q psy192           69 SLNISYRPTEPGYYII---NLKFADHHVEGSPFTAKIVGEGSNRQ------REKIQ-RQREAVPVTEVGSTCKLTFKMPG  138 (898)
Q Consensus        69 ~~~v~ytP~~~G~~~I---~V~~~g~~i~gSPf~~~v~~~~~~~~------~~~i~-~~~~~~~~~~vg~~~~f~v~~~~  138 (898)
                      +|.+.++|.++|.|+|   .|.+||+...=-|..++|.+......      ...+. ..........+|++..+++++.-
T Consensus        74 ~~~~~l~p~~~G~~~IP~~~v~v~Gk~~~S~pi~i~V~~~~~~~~~~~~~~~~~~~l~~~v~~~~~Yvge~v~lt~~ly~  153 (484)
T PF13584_consen   74 TYTYTLQPKKTGTFTIPPFTVEVDGKTYKSQPITIEVSKASQSPSQPPSNADDDVFLEAEVSKKSVYVGEPVILTLRLYT  153 (484)
T ss_pred             EEEEEEEecccceEEEceEEEEECCEEEeecCEEEEEEecccCCccccccccccEEEEEEeCCCceecCCcEEEEEEEEE
Confidence            6778889999999995   67779988888899999987653211      11111 11112334678888877766431


Q ss_pred             Ccc--------------cceEEEEeCCCCCeeeeEEE-EcCCCE-------EEEEEEcCcceeEEE-----EEEECC---
Q psy192          139 ITA--------------FDLSATVTSPGGVTEDAEIN-EVEDGL-------YAVHFVPKELGVHTV-----SVRYKD---  188 (898)
Q Consensus       139 ~~~--------------~~~~~~V~~p~G~~~~~~v~-~~~~g~-------y~v~f~P~~~G~h~I-----~V~~~g---  188 (898)
                      ...              .++.++-.   +...  ... ..-+|.       ++....|.++|.+.|     .+.+..   
T Consensus       154 ~~~~~~~~~~~~~~p~~~~~~~~~~---~~~~--~~~~~~i~G~~y~~~~~~~~~l~P~ksG~l~I~~~~~~~~~~~~~~  228 (484)
T PF13584_consen  154 RNNFRQLGIEELPPPDFEGFWVEQL---GDDR--QYEEERINGRRYRVIELRRYALFPQKSGTLTIPPATFEVTVSDPSG  228 (484)
T ss_pred             ecCchhccccccCCCCCCCcEEEEC---CCCC--ceeEEEECCEEEEEEEEEEEEEEeCCceeEEecCEEEEEEEecccC
Confidence            111              11222212   1111  111 112331       345567999999988     333311   


Q ss_pred             --------------eecCCCceEEEEccCCC-C------Cc-ceEE--EeCCCCCcceeeeeeEEEEEeccCCCCc-eE-
Q psy192          189 --------------IHIPGSPFQFTVGPLRD-G------GA-HRVH--AGGPGLERGEQNQPCEFNVWTREAGAGS-LA-  242 (898)
Q Consensus       189 --------------~~V~gSPf~v~V~~~~~-~------d~-~~v~--a~G~GL~~~~~g~~~~F~V~~~daG~g~-l~-  242 (898)
                                    ..+...|.+++|.+... .      .+ .+.+  ..-+ -....+|++.+.+|...+.|.-. +. 
T Consensus       229 ~~~~fg~~~~~~~~~~~~s~~~~i~V~plP~~~~p~~f~~aVg~f~l~~~~~-~~~~~~Ge~vt~ti~i~g~Gn~~~~~l  307 (484)
T PF13584_consen  229 RRDFFGGNFGRSRPVSISSEPLTITVKPLPAEGAPADFSGAVGNFSLSQSWD-PTEVKVGEPVTRTITISGEGNLPSIQL  307 (484)
T ss_pred             ccCccccccccceeEEecCCCeEEEeccCCcccCCCCcccceeEEEEEEEcC-cccccCCCeEEEEEEEEEEcchhcccC
Confidence                          23566788888876433 1      11 1111  1111 12346888888888877654321 10 


Q ss_pred             EEEEcCCc-------ceeeEEECCCC-----cEEEEEEeCCCceEEE
Q psy192          243 ISVEGPSK-------AEIDFKDRKDG-----SCYVSYVVAEPGEYRV  277 (898)
Q Consensus       243 v~v~gps~-------~~i~~~d~~dG-----ty~v~y~P~~~G~y~i  277 (898)
                      -.+.-|.+       .........+|     +++..|+|..+|.|.|
T Consensus       308 P~l~~~~~~~vy~~~~~~~~~~~~~g~~g~~~~~~~~ip~~~G~~~l  354 (484)
T PF13584_consen  308 PPLNLPKGFRVYPPKPQEQDKPSGGGLTGSRTFKYTLIPKKPGDFTL  354 (484)
T ss_pred             CCCCCCcccEEcCCCccccccccCCcceEEEEEEEEEEeCCCCeEEc
Confidence            00111111       11122223333     3567888999999986


No 11 
>KOG1948|consensus
Probab=94.82  E-value=13  Score=45.30  Aligned_cols=124  Identities=21%  Similarity=0.258  Sum_probs=74.1

Q ss_pred             eEEEEcCCCEEEEEEEcCcceeEEEEEEECCeecCCCceEEEEccCCCCCcceEEEeCCCCCcceeeeeeEEEEEec---
Q psy192          158 AEINEVEDGLYAVHFVPKELGVHTVSVRYKDIHIPGSPFQFTVGPLRDGGAHRVHAGGPGLERGEQNQPCEFNVWTR---  234 (898)
Q Consensus       158 ~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~V~gSPf~v~V~~~~~~d~~~v~a~G~GL~~~~~g~~~~F~V~~~---  234 (898)
                      |.-....+|.|.+.-+|  .|.|.+--.+-|     +|..++|.|.    +-++.+.=+-|+-...-+-+-|+|..|   
T Consensus       256 c~~vsd~~G~fsfksvP--sGkY~l~a~y~g-----e~~~fdvSP~----~l~v~Vehd~lqi~~ef~vtgfSvtGRVl~  324 (1165)
T KOG1948|consen  256 CIGVSDPRGRFSFKSVP--SGKYYLAASYVG-----EPKSFDVSPN----PLKVVVEHDHLQIASEFRVTGFSVTGRVLV  324 (1165)
T ss_pred             EEEEEcCCceEEEEEcC--CCCEEEEEEecC-----CceEEEeCCC----ceeEEEeccceeccceeEEEEEEeeeeEEe
Confidence            33345678888766666  699999888765     4555666542    334444444444333333344555443   


Q ss_pred             ---cCCCCceEEEEEcCCcceeeEEECCCCcEEEEEEeCCCceEEEEEEECCeeCCCCCceeEecCCC
Q psy192          235 ---EAGAGSLAISVEGPSKAEIDFKDRKDGSCYVSYVVAEPGEYRVGIKFNDQHIPDSPYKLFVSPAM  299 (898)
Q Consensus       235 ---daG~g~l~v~v~gps~~~i~~~d~~dGty~v~y~P~~~G~y~i~V~~~g~~I~gSPf~v~V~~~~  299 (898)
                         +.+--...|.|.|-.+.    .-..+|.|+++=. ...|.|+|++.  -+|..-|+.++.|.+..
T Consensus       325 g~~g~~l~gvvvlvngk~~~----kTdaqGyykLen~-~t~gtytI~a~--kehlqFstv~~kv~pnt  385 (1165)
T KOG1948|consen  325 GSKGLPLSGVVVLVNGKSGG----KTDAQGYYKLENL-KTDGTYTITAK--KEHLQFSTVHAKVKPNT  385 (1165)
T ss_pred             CCCCCCccceEEEEcCcccc----eEcccceEEeeee-eccCcEEEEEe--ccceeeeeEEEEecCCc
Confidence               11222334555554332    2347899988763 46899999885  46777788888887654


No 12 
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=94.40  E-value=0.54  Score=52.62  Aligned_cols=118  Identities=17%  Similarity=0.185  Sum_probs=74.0

Q ss_pred             CCCeEEEEE-eeCCCeEEEEEEEECCee----------cCCCCeEEEEecCCcccceeeeeeeecccceeeeCcEEEEEE
Q psy192           66 ADGSLNISY-RPTEPGYYIINLKFADHH----------VEGSPFTAKIVGEGSNRQREKIQRQREAVPVTEVGSTCKLTF  134 (898)
Q Consensus        66 ~dG~~~v~y-tP~~~G~~~I~V~~~g~~----------i~gSPf~~~v~~~~~~~~~~~i~~~~~~~~~~~vg~~~~f~v  134 (898)
                      +||.|+..+ .-..+|.|.+.+..++.-          |.-.|+++++..+.                  ..+.+..+.|
T Consensus       169 ~DGvFT~~l~l~~~~G~Y~~~v~~~n~vF~R~~~q~v~V~p~Pi~~~~~~~~------------------~~~~~h~l~v  230 (374)
T TIGR03503       169 GDGIFTGEFNLDVAPGEYRPTYQSRNPVFLREVEQPVLVYPLPVSYTVIQSE------------------DESGAHQLMV  230 (374)
T ss_pred             CCceEEEEeeccCCCceEEEEEEEcCceEEEEEEEeEEEECCCeeEEEEccC------------------CCCCceEEEE
Confidence            799999999 456799999999987622          23334443333221                  0123444444


Q ss_pred             EcCC--C--cccceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEEEEECCeecCCCceEEEEc
Q psy192          135 KMPG--I--TAFDLSATVTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVSVRYKDIHIPGSPFQFTVG  201 (898)
Q Consensus       135 ~~~~--~--~~~~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~V~gSPf~v~V~  201 (898)
                      ++..  .  +.-.+.+++..|+|......+...+..+....=.+.+.|.|.++.+.-+..+.|-||.+.+.
T Consensus       231 ~~d~~~i~~~s~~~~~~~~~P~g~~~~~~~~~~~~~~~~~l~~~~~~G~Y~i~~~~~~Tt~~GRe~~~~lp  301 (374)
T TIGR03503       231 DADAGHIDPGSLVIHGELVFPNGQIQQFSIELEEPETRVDLPANYEFGKYRVKGTVFGTTITGRELQLTLP  301 (374)
T ss_pred             EcccceeccccEEEEEEEECCCCceEEecccCccCceEEeccCcCCCeEEEEEEEEEEecCCCCEEEEEcC
Confidence            4331  2  23345678899999833222222333333332234489999999999999999999988874


No 13 
>PF13584 BatD:  Oxygen tolerance
Probab=92.63  E-value=8.6  Score=45.47  Aligned_cols=205  Identities=15%  Similarity=0.267  Sum_probs=110.8

Q ss_pred             EEEEEEEcCcceeEEE---EEEECCeecCCCceEEEEccCCCCC-------cce--EEEeCCCCCcceeeeeeEEEEEec
Q psy192          167 LYAVHFVPKELGVHTV---SVRYKDIHIPGSPFQFTVGPLRDGG-------AHR--VHAGGPGLERGEQNQPCEFNVWTR  234 (898)
Q Consensus       167 ~y~v~f~P~~~G~h~I---~V~~~g~~V~gSPf~v~V~~~~~~d-------~~~--v~a~G~GL~~~~~g~~~~F~V~~~  234 (898)
                      +|.+.+.|.+.|.+.|   .|.++|....-.|+.++|.+.....       ...  +++.= .-....+|++..+++...
T Consensus        74 ~~~~~l~p~~~G~~~IP~~~v~v~Gk~~~S~pi~i~V~~~~~~~~~~~~~~~~~~~l~~~v-~~~~~Yvge~v~lt~~ly  152 (484)
T PF13584_consen   74 TYTYTLQPKKTGTFTIPPFTVEVDGKTYKSQPITIEVSKASQSPSQPPSNADDDVFLEAEV-SKKSVYVGEPVILTLRLY  152 (484)
T ss_pred             EEEEEEEecccceEEEceEEEEECCEEEeecCEEEEEEecccCCccccccccccEEEEEEe-CCCceecCCcEEEEEEEE
Confidence            6889999999999975   6788999999999999998653211       111  22111 112346888887777654


Q ss_pred             cCCCCc-eE-EEEEcCC--cc-------eeeEE-ECCCCc-EE------EEEEeCCCceEEE-----EEEECC-------
Q psy192          235 EAGAGS-LA-ISVEGPS--KA-------EIDFK-DRKDGS-CY------VSYVVAEPGEYRV-----GIKFND-------  283 (898)
Q Consensus       235 daG~g~-l~-v~v~gps--~~-------~i~~~-d~~dGt-y~------v~y~P~~~G~y~i-----~V~~~g-------  283 (898)
                      -...-. +. ..+.-|.  +.       ..... ..-+|. |.      ....|..+|++.|     .+.+..       
T Consensus       153 ~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~i~G~~y~~~~~~~~~l~P~ksG~l~I~~~~~~~~~~~~~~~~~~  232 (484)
T PF13584_consen  153 TRNNFRQLGIEELPPPDFEGFWVEQLGDDRQYEEERINGRRYRVIELRRYALFPQKSGTLTIPPATFEVTVSDPSGRRDF  232 (484)
T ss_pred             EecCchhccccccCCCCCCCcEEEECCCCCceeEEEECCEEEEEEEEEEEEEEeCCceeEEecCEEEEEEEecccCccCc
Confidence            332211 11 0111111  00       11111 123453 32      2347999999998     333321       


Q ss_pred             ----------eeCCCCCceeEecCCCCC---------ceeEEEe-eeCCCceecCCCeEEEEEecCCccceE-E---EEE
Q psy192          284 ----------QHIPDSPYKLFVSPAMGD---------AHKLEIA-QFPQGVVMADKPTQFLVRKNGAVGALD-A---KVI  339 (898)
Q Consensus       284 ----------~~I~gSPf~v~V~~~~~~---------~~~~~~~-~~~~~~~~~g~~~~f~V~~~~a~G~l~-v---~v~  339 (898)
                                ..+...|.++.|.+-..+         ..++.+. ......+.+|++.+++|...+. |++. .   .+.
T Consensus       233 fg~~~~~~~~~~~~s~~~~i~V~plP~~~~p~~f~~aVg~f~l~~~~~~~~~~~Ge~vt~ti~i~g~-Gn~~~~~lP~l~  311 (484)
T PF13584_consen  233 FGGNFGRSRPVSISSEPLTITVKPLPAEGAPADFSGAVGNFSLSQSWDPTEVKVGEPVTRTITISGE-GNLPSIQLPPLN  311 (484)
T ss_pred             cccccccceeEEecCCCeEEEeccCCcccCCCCcccceeEEEEEEEcCcccccCCCeEEEEEEEEEE-cchhcccCCCCC
Confidence                      134556777777653221         1333332 2334567899999888876432 4433 1   112


Q ss_pred             CCCCCee-e--e--EEEecCCC-----eEEEEEEecCCeeEEEE
Q psy192          340 SPSGTED-D--C--FIQPIDGD-----NYSIRFMPRENGIHNIH  373 (898)
Q Consensus       340 ~p~g~~~-~--~--~v~~~~~g-----~y~v~f~P~~~G~h~i~  373 (898)
                      .|+|-.+ +  .  ......+|     ++++.|+|.++|.|.|-
T Consensus       312 ~~~~~~vy~~~~~~~~~~~~~g~~g~~~~~~~~ip~~~G~~~lP  355 (484)
T PF13584_consen  312 LPKGFRVYPPKPQEQDKPSGGGLTGSRTFKYTLIPKKPGDFTLP  355 (484)
T ss_pred             CCcccEEcCCCccccccccCCcceEEEEEEEEEEeCCCCeEEcC
Confidence            2233211 1  1  11112222     57788889999998753


No 14 
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=92.39  E-value=65  Score=45.23  Aligned_cols=263  Identities=13%  Similarity=0.211  Sum_probs=134.3

Q ss_pred             EEEEEEcCcceeEEEEEEECCeecCCC--ceEEEEccCCCCCcceEEEeCCCCCcceeeeeeEEEEEeccCCCCceEEE-
Q psy192          168 YAVHFVPKELGVHTVSVRYKDIHIPGS--PFQFTVGPLRDGGAHRVHAGGPGLERGEQNQPCEFNVWTREAGAGSLAIS-  244 (898)
Q Consensus       168 y~v~f~P~~~G~h~I~V~~~g~~V~gS--Pf~v~V~~~~~~d~~~v~a~G~GL~~~~~g~~~~F~V~~~daG~g~l~v~-  244 (898)
                      ..+.|++.++|.|.|++..-+. +...  ...+.|...    .....+. .+......|+.++|.+...- |. ++... 
T Consensus      1830 k~v~~Tf~~aG~ytV~L~AsN~-vs~~~~s~~~~VQe~----I~~L~L~-as~~~~~~n~~v~fsa~l~~-GS-~Vtf~w 1901 (2740)
T TIGR00864      1830 KHACMTFPDAGTFAIRLNASNA-VSGKSASREFFAEEP----IFGLELK-ASKKIAAIGEKVEFQILLAA-GS-AVNFRL 1901 (2740)
T ss_pred             ceeEEecCCCeEEEEEEEEEcc-cCcceeeeeEEEEEe----cceEEEe-cccccccCCCEEEEEEEecC-CC-ceEEEE
Confidence            5788999999999999998653 2332  334444321    1112222 23345668999999997753 33 34333 


Q ss_pred             -EEcCCcceeeEEECCCCcEEEEEEeCCCceEEEEEEECCeeCCCCCceeEe--cCCCCCceeEEEeeeCCCceecCCCe
Q psy192          245 -VEGPSKAEIDFKDRKDGSCYVSYVVAEPGEYRVGIKFNDQHIPDSPYKLFV--SPAMGDAHKLEIAQFPQGVVMADKPT  321 (898)
Q Consensus       245 -v~gps~~~i~~~d~~dGty~v~y~P~~~G~y~i~V~~~g~~I~gSPf~v~V--~~~~~~~~~~~~~~~~~~~~~~g~~~  321 (898)
                       +.+-+ ..+  .+ ..-  ++++.=..+|.|.|.|..-+. +....+.+.|  ..+.   +.+.+..-...++.+|+..
T Consensus      1902 ~fGdgs-~~~--~t-~~~--t~~HsY~~~G~Y~VtV~A~N~-Vs~~~a~~~V~Vle~V---~gL~I~~~c~~~~~~g~~~ 1971 (2740)
T TIGR00864      1902 QIGGAA-PEV--LQ-PGP--RFSHSFPRVDDHMVNLRAKNE-VSCAQANLHIEVLEAV---RGLQIPDCCAAGIATGEEK 1971 (2740)
T ss_pred             EeCCCC-cee--ec-CCC--eEEeecCCCCcEEEEEEEEec-cceeeeeEEEEEEEec---cceEecCCcccceecCceE
Confidence             32211 111  11 112  344555789999999987643 2223333333  2211   1111111001123444444


Q ss_pred             EEEEEecCCccceEEEEECCCCCeee--eEE--Eec-CCC-----eEEEEEEecCCeeEEEEEE-ECCeecCCCCeEEEe
Q psy192          322 QFLVRKNGAVGALDAKVISPSGTEDD--CFI--QPI-DGD-----NYSIRFMPRENGIHNIHIK-FNGVHIPGSPLRIKV  390 (898)
Q Consensus       322 ~f~V~~~~a~G~l~v~v~~p~g~~~~--~~v--~~~-~~g-----~y~v~f~P~~~G~h~i~V~-~~g~~I~gSPf~v~V  390 (898)
                      .|.           |++.  .|..+.  ..+  ... ++.     -=.++|+|.++|.|+|+|+ +|...-..=+..++|
T Consensus      1972 tF~-----------A~v~--~g~~V~f~W~f~~~~~~g~~~~~~~G~~vty~~~~~G~~~I~v~A~N~l~s~n~t~~i~V 2038 (2740)
T TIGR00864      1972 NFT-----------ANVQ--RGKPVAFAWTFDLHHLHGDSLVIHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQLEA 2038 (2740)
T ss_pred             EEE-----------EEEe--cCCceEEEEEeeecccCCCcceeecCCceEEccCCCcEEEEEEEEEcccCcccccEEEEE
Confidence            444           4442  222211  000  000 010     0158999999999999998 554333334556666


Q ss_pred             ccCCCCCceEEecCCCcceeeeCCeEEEEEEcCCCCCCceEEEEE-CCCCceeeeEEeCCeEEEEEecCCCeeEEEEEEE
Q psy192          391 GKGEADPAAVHATGNGLAEIKSGVKTDFIVDTCNAGAGTLAVTID-GPSKVSMDCTEVEEGYKVRYTPLVPGDYYVSLKY  469 (898)
Q Consensus       391 ~~~~~d~s~v~v~G~Gl~~~~~g~~~~F~Vd~~~aG~g~l~v~v~-gp~~~~~~~~~~~~gy~v~ytp~~~G~y~i~V~~  469 (898)
                      -....   ...+. .+.....+|+.+.|.+...- +..++....+ |... ...   .-+..+++++=.++|+|.+.|..
T Consensus      2039 Q~~i~---~~~L~-~~p~~~~~n~~v~f~~~~~~-~~~~v~~~WDFGDGt-~~~---~T~~~~~sHtY~r~G~Y~V~V~a 2109 (2740)
T TIGR00864      2039 QDALM---DAALQ-AGPQDCFTNKMAQFEAATSP-KPNFMACHWDFGDGS-AGQ---DTDEPRAEHEYLHPGDYRVQVNA 2109 (2740)
T ss_pred             Eehee---eEEEe-ecCcceecCceEEEEEEEcc-CCCcEEEEEEecCCC-Ccc---ccCCceeEEEecCCccEEEEEEE
Confidence            43211   11111 11234567888888887632 2333444443 2111 111   11236788888899999999975


Q ss_pred             C
Q psy192          470 N  470 (898)
Q Consensus       470 ~  470 (898)
                      .
T Consensus      2110 ~ 2110 (2740)
T TIGR00864      2110 S 2110 (2740)
T ss_pred             E
Confidence            3


No 15 
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=91.63  E-value=2.4  Score=47.57  Aligned_cols=116  Identities=18%  Similarity=0.251  Sum_probs=72.5

Q ss_pred             CCCCcEEEEEEe-CCCceEEEEEEECCee----------CCCCCceeEecCCCCCceeEEEeeeCCCceecCCCeEEEEE
Q psy192          258 RKDGSCYVSYVV-AEPGEYRVGIKFNDQH----------IPDSPYKLFVSPAMGDAHKLEIAQFPQGVVMADKPTQFLVR  326 (898)
Q Consensus       258 ~~dGty~v~y~P-~~~G~y~i~V~~~g~~----------I~gSPf~v~V~~~~~~~~~~~~~~~~~~~~~~g~~~~f~V~  326 (898)
                      -.||.|+.++.- ..+|.|.+.|..++.-          +.-.|+.+.+..+.                ..+.+..+.|.
T Consensus       168 p~DGvFT~~l~l~~~~G~Y~~~v~~~n~vF~R~~~q~v~V~p~Pi~~~~~~~~----------------~~~~~h~l~v~  231 (374)
T TIGR03503       168 PGDGIFTGEFNLDVAPGEYRPTYQSRNPVFLREVEQPVLVYPLPVSYTVIQSE----------------DESGAHQLMVD  231 (374)
T ss_pred             CCCceEEEEeeccCCCceEEEEEEEcCceEEEEEEEeEEEECCCeeEEEEccC----------------CCCCceEEEEE
Confidence            368999998864 4589999999998742          22334443332211                11334555555


Q ss_pred             ecCC---ccc--eEEEEECCCCCeeeeEEEecC-CCeEEEEEEecCCeeEEEEEEECCeecCCCCeEEEe
Q psy192          327 KNGA---VGA--LDAKVISPSGTEDDCFIQPID-GDNYSIRFMPRENGIHNIHIKFNGVHIPGSPLRIKV  390 (898)
Q Consensus       327 ~~~a---~G~--l~v~v~~p~g~~~~~~v~~~~-~g~y~v~f~P~~~G~h~i~V~~~g~~I~gSPf~v~V  390 (898)
                      ....   .+.  +.+++..|+|. ........+ ..+...-=...+.|.|+++.+.-+..+.|.||.+..
T Consensus       232 ~d~~~i~~~s~~~~~~~~~P~g~-~~~~~~~~~~~~~~~~l~~~~~~G~Y~i~~~~~~Tt~~GRe~~~~l  300 (374)
T TIGR03503       232 ADAGHIDPGSLVIHGELVFPNGQ-IQQFSIELEEPETRVDLPANYEFGKYRVKGTVFGTTITGRELQLTL  300 (374)
T ss_pred             cccceeccccEEEEEEEECCCCc-eEEecccCccCceEEeccCcCCCeEEEEEEEEEEecCCCCEEEEEc
Confidence            4332   243  56778899997 333333322 223222224559999999999999999999998876


No 16 
>KOG1428|consensus
Probab=90.52  E-value=1  Score=56.58  Aligned_cols=43  Identities=23%  Similarity=0.533  Sum_probs=37.6

Q ss_pred             EEEecCCCeEEEEEEecCCeeEEEEEEECCeecCCCCeEEEecc
Q psy192          349 FIQPIDGDNYSIRFMPRENGIHNIHIKFNGVHIPGSPLRIKVGK  392 (898)
Q Consensus       349 ~v~~~~~g~y~v~f~P~~~G~h~i~V~~~g~~I~gSPf~v~V~~  392 (898)
                      -++.+++|+|...|+|...|.|+|+++++|..|.- -++++|..
T Consensus      2099 ~ir~nnd~~y~~nWTP~a~G~Y~l~~tiDG~eId~-~~eVkVkd 2141 (3738)
T KOG1428|consen 2099 LIRVNNDGTYCANWTPGAIGLYTLHVTIDGIEIDA-GLEVKVKD 2141 (3738)
T ss_pred             eEEecCCCceecccCCccceeEEEEEEecceeecc-ceEEEecc
Confidence            45567899999999999999999999999999975 68888863


No 17 
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=90.16  E-value=10  Score=47.51  Aligned_cols=151  Identities=15%  Similarity=0.148  Sum_probs=87.0

Q ss_pred             eeeeeeEEEEEeccCCCCceEEEEEcCCccee-eE-EECCCCcEEEEE-Ee--CCCceEEEEEEECCeeCCCCCceeEec
Q psy192          222 EQNQPCEFNVWTREAGAGSLAISVEGPSKAEI-DF-KDRKDGSCYVSY-VV--AEPGEYRVGIKFNDQHIPDSPYKLFVS  296 (898)
Q Consensus       222 ~~g~~~~F~V~~~daG~g~l~v~v~gps~~~i-~~-~d~~dGty~v~y-~P--~~~G~y~i~V~~~g~~I~gSPf~v~V~  296 (898)
                      .+|..+.|.| +...  ...++.+.+|++... .+ .|.  .+..+.. +|  .++|.|+.+|.-..   ..-+-+++|+
T Consensus       512 tvG~~T~F~v-tw~~--~~p~i~l~dP~G~~~~~~~~d~--~~~~a~l~ipg~a~~G~W~y~l~~~~---~~q~i~vtVt  583 (863)
T TIGR00868       512 TVGKDTFFLI-TWEF--LKPEIFLQDPSGKSTSDFLVDK--LNKMAYLQIPGTAKVGTWTYSLQASA---NPQTLTLTVT  583 (863)
T ss_pred             CCCCCcEEEE-EcCC--CCceEEEECCCCCccccceEee--cceEEEEEcCCCCccceeEEEEeccC---CCceEEEEEE
Confidence            3577777777 4433  257889999987531 11 111  1112221 23  36899998885441   1122345554


Q ss_pred             CC--CCCceeEEEe-eeCCCceecCCCeEEEEEecCC-----ccceEEEEECCCCCeeeeEEEec--------CCCeEEE
Q psy192          297 PA--MGDAHKLEIA-QFPQGVVMADKPTQFLVRKNGA-----VGALDAKVISPSGTEDDCFIQPI--------DGDNYSI  360 (898)
Q Consensus       297 ~~--~~~~~~~~~~-~~~~~~~~~g~~~~f~V~~~~a-----~G~l~v~v~~p~g~~~~~~v~~~--------~~g~y~v  360 (898)
                      .-  ..+..-+++. .+........+|+.++..-+..     ..++.|.|..|+|..+...+.++        +||+|.-
T Consensus       584 sra~s~~~~pi~~~a~~~~~~~~~psp~ivya~V~kG~~pVlgA~Vta~i~~~~g~~~~l~l~DnGaGaD~~knDGIYS~  663 (863)
T TIGR00868       584 SRARSPTLPPVTVTAKMNKDTAKFPSPMIVYAKISQGFLPVLGANVTALIESENGHTVTLELLDNGAGADTVKNDGIYSR  663 (863)
T ss_pred             eccCCCCCCCEEEEEEEccccccCCCceEEEEEEeeCCceeeccEEEEEEECCCCCEEEEEEeecCCCCCccccCceeee
Confidence            21  1112223332 2222223345666555554433     23578888999998776677664        3899999


Q ss_pred             EEEe-cCCeeEEEEEEECCee
Q psy192          361 RFMP-RENGIHNIHIKFNGVH  380 (898)
Q Consensus       361 ~f~P-~~~G~h~i~V~~~g~~  380 (898)
                      .|+. ...|.|.|.|..+|..
T Consensus       664 yft~f~~nGrYsl~V~v~~~~  684 (863)
T TIGR00868       664 YFTAYDGNGRYSLKVRALGGV  684 (863)
T ss_pred             eeecccCCceEEEEEEEeCCC
Confidence            9975 5779999999988754


No 18 
>PF13115 YtkA:  YtkA-like
Probab=88.23  E-value=4.1  Score=35.66  Aligned_cols=71  Identities=14%  Similarity=0.282  Sum_probs=50.6

Q ss_pred             CCCeEEEEEE---cCCceeEEEEEEE--CCEEecCCCCeeEEEecCc-------cceEEEEeCCCCeEEEEEeeCCCeEE
Q psy192           15 HDGTVSLHYD---PREEGLHELALKF--NGDHVQGYGGLSLSIEGPS-------KAEIQCKDNADGSLNISYRPTEPGYY   82 (898)
Q Consensus        15 ~d~~~~v~f~---P~~~G~h~v~v~~--~G~~v~G~g~l~v~v~GP~-------~~~i~~~d~~dG~~~v~ytP~~~G~~   82 (898)
                      ..+.|+|++.   +-..|...+.+..  +|+||.+.- +.+.+.=|.       ...+.....++|.|.+..+-+.+|.|
T Consensus         3 ~~~~~~v~l~~~~~~~~g~~~i~v~~~~~g~pv~~a~-V~~~~~m~~~~g~~~~~~~~~~~~~~~G~Y~~~~~f~m~G~W   81 (86)
T PF13115_consen    3 QVGGYTVELVSPEPPKVGENTITVTVDQGGKPVTDAD-VQFEIWMPDMEGMEPMTSKVELEETGPGVYEAEVTFSMAGTW   81 (86)
T ss_pred             ccccEEEEEecCCCCcCCceEEEEEECCCCCCCCCCE-EEEEEEeCCCCCCCCCceeeeeecCCCCeEEEEeecCCCeeE
Confidence            3467888887   3466777766664  588887663 666654442       23345555689999999999999999


Q ss_pred             EEEE
Q psy192           83 IINL   86 (898)
Q Consensus        83 ~I~V   86 (898)
                      .|.|
T Consensus        82 ~i~v   85 (86)
T PF13115_consen   82 QITV   85 (86)
T ss_pred             EEEE
Confidence            9987


No 19 
>PF09134 Invasin_D3:  Invasin, domain 3;  InterPro: IPR015217 This domain adopts a structure consisting of an immunoglobulin-like beta-sandwich, with seven strands in two beta-sheets, arranged in a Greek-key topology. It forms part of the extracellular region of the protein, which can be expressed as a soluble protein (Inv497) that binds integrins and promotes subsequent uptake by cells when attached to bacteria []. ; PDB: 1CWV_A.
Probab=87.60  E-value=5.9  Score=35.74  Aligned_cols=87  Identities=18%  Similarity=0.301  Sum_probs=50.1

Q ss_pred             ccCeEEeCCCCeEEEEEEcCCceeEEEEEEECCEEecCCCCeeEEEecCccceEEEEeCCCCeEEEEEeeCCCeEEEEEE
Q psy192            7 DKPVIEDNHDGTVSLHYDPREEGLHELALKFNGDHVQGYGGLSLSIEGPSKAEIQCKDNADGSLNISYRPTEPGYYIINL   86 (898)
Q Consensus         7 ~~~~i~~~~d~~~~v~f~P~~~G~h~v~v~~~G~~v~G~g~l~v~v~GP~~~~i~~~d~~dG~~~v~ytP~~~G~~~I~V   86 (898)
                      -|..|.-++..+..+.|+++..         +|+||.|-..|.+.-.|- ...+.-..+..|.|+.+..-+.+|+++|.=
T Consensus        11 ~k~~I~ADG~dta~LTfta~D~---------~g~pvsGl~~l~f~~~g~-~vtls~itE~~GvYtATl~Gt~~G~vtI~P   80 (99)
T PF09134_consen   11 SKPSIIADGQDTATLTFTARDA---------NGNPVSGLSVLAFVQDGV-PVTLSPITENPGVYTATLKGTSAGEVTITP   80 (99)
T ss_dssp             S-SEEETTS---EEEEEE-B-T---------TS-B--S--SEEEEEEES---EE---EE-SS-EEEEEE-SS-EEEEEEE
T ss_pred             CccccccCCcceEEEEEEEEcC---------CCCcccCceeEEEEccCC-CcEEccccccCCEEEEEEEcccCceEEEEE
Confidence            3667777777888999998876         588887775566654432 122221223569999999999999999999


Q ss_pred             EECCeecCCCCeEEEEe
Q psy192           87 KFADHHVEGSPFTAKIV  103 (898)
Q Consensus        87 ~~~g~~i~gSPf~~~v~  103 (898)
                      .++|+++..=|-++..+
T Consensus        81 ~v~G~~l~~~~~k~tl~   97 (99)
T PF09134_consen   81 QVNGQPLATLQKKLTLI   97 (99)
T ss_dssp             EETTEE-GGG-EEEEEE
T ss_pred             eeCCEEccccceeEEEe
Confidence            99999998777776654


No 20 
>PF01835 A2M_N:  MG2 domain;  InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=87.10  E-value=2  Score=38.70  Aligned_cols=64  Identities=17%  Similarity=0.246  Sum_probs=44.5

Q ss_pred             ceecCCCeEEEEEecCCc--------cceEEEEECCCCCeeee-EE-EecCCCeEEEEEE-ecCC--eeEEEEEEEC
Q psy192          314 VVMADKPTQFLVRKNGAV--------GALDAKVISPSGTEDDC-FI-QPIDGDNYSIRFM-PREN--GIHNIHIKFN  377 (898)
Q Consensus       314 ~~~~g~~~~f~V~~~~a~--------G~l~v~v~~p~g~~~~~-~v-~~~~~g~y~v~f~-P~~~--G~h~i~V~~~  377 (898)
                      -.+.|+.+.|.+-.++..        ..+.+.|.+|+|..+.- .. .....|.|..+|. |..+  |.|+|.+..+
T Consensus        10 iYrPGetV~~~~~~~~~~~~~~~~~~~~~~v~i~dp~g~~v~~~~~~~~~~~G~~~~~~~lp~~~~~G~y~i~~~~~   86 (99)
T PF01835_consen   10 IYRPGETVHFRAIVRDLDNDFKPPANSPVTVTIKDPSGNEVFRWSVNTTNENGIFSGSFQLPDDAPLGTYTIRVKTD   86 (99)
T ss_dssp             EE-TTSEEEEEEEEEEECTTCSCESSEEEEEEEEETTSEEEEEEEEEETTCTTEEEEEEE--SS---EEEEEEEEET
T ss_pred             CcCCCCEEEEEEEEeccccccccccCCceEEEEECCCCCEEEEEEeeeeCCCCEEEEEEECCCCCCCEeEEEEEEEc
Confidence            356777777766533222        35889999999987633 33 3468999999995 6664  9999999994


No 21 
>PF10670 DUF4198:  Domain of unknown function (DUF4198)
Probab=87.08  E-value=35  Score=35.14  Aligned_cols=60  Identities=15%  Similarity=0.298  Sum_probs=43.2

Q ss_pred             eeeeeeEEEEEeccCCCCceEEEEEcCCcc------eeeEEECCCCcEEEEEEeCCCceEEEEEEECC
Q psy192          222 EQNQPCEFNVWTREAGAGSLAISVEGPSKA------EIDFKDRKDGSCYVSYVVAEPGEYRVGIKFND  283 (898)
Q Consensus       222 ~~g~~~~F~V~~~daG~g~l~v~v~gps~~------~i~~~d~~dGty~v~y~P~~~G~y~i~V~~~g  283 (898)
                      .+|++..|.|--.+..-.+..|.+..+...      .....-+.+|  .++|++..+|.|.|.+.+.+
T Consensus       148 ~~g~~~~~~vl~~GkPl~~a~V~~~~~~~~~~~~~~~~~~~TD~~G--~~~~~~~~~G~wli~a~~~~  213 (215)
T PF10670_consen  148 KAGDPLPFQVLFDGKPLAGAEVEAFSPGGWYDVEHEAKTLKTDANG--RATFTLPRPGLWLIRASHKD  213 (215)
T ss_pred             cCCCEEEEEEEECCeEcccEEEEEEECCCccccccceEEEEECCCC--EEEEecCCCEEEEEEEEEec
Confidence            467888888887776656666776655432      4445556677  67889999999999998754


No 22 
>PF01835 A2M_N:  MG2 domain;  InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=86.94  E-value=3.4  Score=37.20  Aligned_cols=64  Identities=19%  Similarity=0.329  Sum_probs=43.5

Q ss_pred             eeeCcEEEEEEEcCC-------CcccceEEEEeCCCCCeeee-EE-EEcCCCEEEEEEE-cCcc--eeEEEEEEEC
Q psy192          124 TEVGSTCKLTFKMPG-------ITAFDLSATVTSPGGVTEDA-EI-NEVEDGLYAVHFV-PKEL--GVHTVSVRYK  187 (898)
Q Consensus       124 ~~vg~~~~f~v~~~~-------~~~~~~~~~V~~p~G~~~~~-~v-~~~~~g~y~v~f~-P~~~--G~h~I~V~~~  187 (898)
                      -..|+.+.|.+-+.+       .....+++.|.+|+|+.+.- .. ..+..|.|..+|. |..+  |.|.|.+..+
T Consensus        11 YrPGetV~~~~~~~~~~~~~~~~~~~~~~v~i~dp~g~~v~~~~~~~~~~~G~~~~~~~lp~~~~~G~y~i~~~~~   86 (99)
T PF01835_consen   11 YRPGETVHFRAIVRDLDNDFKPPANSPVTVTIKDPSGNEVFRWSVNTTNENGIFSGSFQLPDDAPLGTYTIRVKTD   86 (99)
T ss_dssp             E-TTSEEEEEEEEEEECTTCSCESSEEEEEEEEETTSEEEEEEEEEETTCTTEEEEEEE--SS---EEEEEEEEET
T ss_pred             cCCCCEEEEEEEEeccccccccccCCceEEEEECCCCCEEEEEEeeeeCCCCEEEEEEECCCCCCCEeEEEEEEEc
Confidence            345666666543221       12356889999999997644 33 3478999999998 6664  9999999994


No 23 
>COG2373 Large extracellular alpha-helical protein [General function prediction only]
Probab=86.57  E-value=1.4e+02  Score=40.60  Aligned_cols=85  Identities=16%  Similarity=0.230  Sum_probs=55.5

Q ss_pred             CeeEEEecCccceEE---EEeCCCCeEEEEE-eeC--CCeEEEEEEEECCe-ecCCCCeEEEEecCCcccceeeeeeeec
Q psy192           47 GLSLSIEGPSKAEIQ---CKDNADGSLNISY-RPT--EPGYYIINLKFADH-HVEGSPFTAKIVGEGSNRQREKIQRQRE  119 (898)
Q Consensus        47 ~l~v~v~GP~~~~i~---~~d~~dG~~~v~y-tP~--~~G~~~I~V~~~g~-~i~gSPf~~~v~~~~~~~~~~~i~~~~~  119 (898)
                      .+.+.+..|.+..+.   ++-+.+|.+.++| +|.  ..|.|++++.+.+. .+-...|.|.-+-+.  +.+-.++..  
T Consensus       430 p~~l~v~~PdG~~~~~~~~~~~~~G~~~~~~~l~~na~tG~w~l~~~~~~~~~~~s~~f~V~df~p~--r~~i~l~~~--  505 (1621)
T COG2373         430 PLKLRVLDPDGSVLRTLTITLDEEGLYELSFPLPENALTGGYTLELYTGGKSAVISMSFRVEDFIPD--RFKINLTLD--  505 (1621)
T ss_pred             CeEEEEECCCCcEEEEEEEeccccCceEEeeeCCCCCCcceEEEEEEeCCccceeeeeEEhhHhCCc--eEEEecccc--
Confidence            477888899986654   4457899999999 554  47999999999987 666678877544333  222222211  


Q ss_pred             ccceeeeCcEEEEEEEc
Q psy192          120 AVPVTEVGSTCKLTFKM  136 (898)
Q Consensus       120 ~~~~~~vg~~~~f~v~~  136 (898)
                       ......|++.+++|+.
T Consensus       506 -k~~~~~g~~v~~~v~~  521 (1621)
T COG2373         506 -KTEWVPGKDVKIKVDL  521 (1621)
T ss_pred             -cccccCCCcEEEEEEE
Confidence             1113456677777664


No 24 
>PF09315 DUF1973:  Domain of unknown function (DUF1973);  InterPro: IPR015394 These functionally uncharacterised domains are found in various eukaryotic calcium-dependent chloride channels. 
Probab=85.82  E-value=20  Score=36.39  Aligned_cols=129  Identities=16%  Similarity=0.162  Sum_probs=69.7

Q ss_pred             eeeeeeEEEEEeccCCCCceEEEEEcCCcceeeE--EECCCCcEEEEEE-eCCCceEEEEEEECCeeCCCCCceeEecCC
Q psy192          222 EQNQPCEFNVWTREAGAGSLAISVEGPSKAEIDF--KDRKDGSCYVSYV-VAEPGEYRVGIKFNDQHIPDSPYKLFVSPA  298 (898)
Q Consensus       222 ~~g~~~~F~V~~~daG~g~l~v~v~gps~~~i~~--~d~~dGty~v~y~-P~~~G~y~i~V~~~g~~I~gSPf~v~V~~~  298 (898)
                      .+|+.+.|.|.-...  ....+.+.+|++....-  .|.......+... ..++|.|.+.|......  --+..+.|...
T Consensus        26 tvG~~T~f~v~w~~~--~~p~i~L~~P~G~~~~~~~~d~~~~~~~i~ipg~ae~G~W~y~i~~~~~~--~q~v~vtVtS~  101 (179)
T PF09315_consen   26 TVGNNTVFTVTWQNS--SPPSITLTDPSGTVYTTFTTDSNSKTARIQIPGTAEVGTWTYSITNTSSS--SQTVTVTVTSR  101 (179)
T ss_pred             CCCCCeEEEEEECCC--CCceEEEECCCCCEEeeeEEcccccEEEEECCCCcccccEEEEEecCCCC--cceEEEEEEee
Confidence            357788898886654  56788999998865532  3333333334331 26789999999543322  13355555432


Q ss_pred             C--CCceeEEEee-eCCCceecCCCeEEEEEecCC-----ccceEEEEECCCCCeeeeEEEecC
Q psy192          299 M--GDAHKLEIAQ-FPQGVVMADKPTQFLVRKNGA-----VGALDAKVISPSGTEDDCFIQPID  354 (898)
Q Consensus       299 ~--~~~~~~~~~~-~~~~~~~~g~~~~f~V~~~~a-----~G~l~v~v~~p~g~~~~~~v~~~~  354 (898)
                      .  ....-+.+.. +........+|+.+..+-+..     ..++.|.|..|+|......+.|++
T Consensus       102 ~~~~~~~Pitv~~~~~~~~~~~~~p~~IyA~V~qg~~PVlgA~V~A~V~~~~g~~~~l~L~DnG  165 (179)
T PF09315_consen  102 ASSPTVPPITVTAWLSQSSVNFPSPVIIYAEVQQGYLPVLGANVTATVESPSGNTVTLELLDNG  165 (179)
T ss_pred             ECCCCCCCEEEEEEECCCCcCCCCcEEEEEEEEECCeeeeCCEEEEEEcCCCCCEEEEEEEcCC
Confidence            1  1111222221 111111223454444433322     235788888999987666666543


No 25 
>COG1470 Predicted membrane protein [Function unknown]
Probab=83.12  E-value=96  Score=35.84  Aligned_cols=75  Identities=9%  Similarity=0.108  Sum_probs=39.6

Q ss_pred             EEEEEEeCCCceEEEEEEECCeeCCCCCceeEecCCCCCceeEEEeeeCCCceecCCCeEEEEEecCCccceEEEEECCC
Q psy192          263 CYVSYVVAEPGEYRVGIKFNDQHIPDSPYKLFVSPAMGDAHKLEIAQFPQGVVMADKPTQFLVRKNGAVGALDAKVISPS  342 (898)
Q Consensus       263 y~v~y~P~~~G~y~i~V~~~g~~I~gSPf~v~V~~~~~~~~~~~~~~~~~~~~~~g~~~~f~V~~~~a~G~l~v~v~~p~  342 (898)
                      +...++|..+|.|.+....+|.+-    |++.|..  ..+.++.+.-.. ....+|....|.+...+..+.+...++.|.
T Consensus        49 l~~~v~~~~~g~~~v~f~i~~~~~----~~v~v~~--~~~l~it~p~~~-~~~e~g~~v~~~~~l~~~~~~i~i~lk~pk  121 (513)
T COG1470          49 LQFKVLPGKAGSYSVKFSIEGVPY----WTVNVYT--SEPLQITLPIEA-QEAEAGLKVRVPELLEAGDSTISIDLKCPK  121 (513)
T ss_pred             EEEEEecCCCCcEEEEEEECCcee----EEEEEEe--cccEEEeccchh-hhhccccEEEEeehhhCCcceEEEeeecCc
Confidence            566777888999988875554332    5555543  223444432111 112345555555544443444555555566


Q ss_pred             CC
Q psy192          343 GT  344 (898)
Q Consensus       343 g~  344 (898)
                      |.
T Consensus       122 ~w  123 (513)
T COG1470         122 GW  123 (513)
T ss_pred             ch
Confidence            54


No 26 
>KOG1428|consensus
Probab=82.26  E-value=3.5  Score=52.20  Aligned_cols=48  Identities=15%  Similarity=0.360  Sum_probs=42.5

Q ss_pred             eEEEEeCCCCeEEEEEeeCCCeEEEEEEEECCeecCCCCeEEEEecCCc
Q psy192           59 EIQCKDNADGSLNISYRPTEPGYYIINLKFADHHVEGSPFTAKIVGEGS  107 (898)
Q Consensus        59 ~i~~~d~~dG~~~v~ytP~~~G~~~I~V~~~g~~i~gSPf~~~v~~~~~  107 (898)
                      .+.++.|.||+|...|||..+|.|+|+++.+|.+|. --+.++|.++..
T Consensus      2097 ~m~ir~nnd~~y~~nWTP~a~G~Y~l~~tiDG~eId-~~~eVkVkd~Pk 2144 (3738)
T KOG1428|consen 2097 NMLIRVNNDGTYCANWTPGAIGLYTLHVTIDGIEID-AGLEVKVKDPPK 2144 (3738)
T ss_pred             cceEEecCCCceecccCCccceeEEEEEEecceeec-cceEEEeccCCC
Confidence            456778999999999999999999999999999997 468899988763


No 27 
>PF10670 DUF4198:  Domain of unknown function (DUF4198)
Probab=79.54  E-value=13  Score=38.37  Aligned_cols=67  Identities=19%  Similarity=0.364  Sum_probs=49.2

Q ss_pred             EEEEEEcC-------CceeEEEEEEECCEEecCCCCeeEEEecCccc------eEEEEeCCCCeEEEEEeeCCCeEEEEE
Q psy192           19 VSLHYDPR-------EEGLHELALKFNGDHVQGYGGLSLSIEGPSKA------EIQCKDNADGSLNISYRPTEPGYYIIN   85 (898)
Q Consensus        19 ~~v~f~P~-------~~G~h~v~v~~~G~~v~G~g~l~v~v~GP~~~------~i~~~d~~dG~~~v~ytP~~~G~~~I~   85 (898)
                      +.++++|.       .-.+-.+.|.|+|+|++|   ..|.+..+...      ....+-+.+|.  ++|++..+|.|.|.
T Consensus       134 ~~lEIvPl~~P~~l~~g~~~~~~vl~~GkPl~~---a~V~~~~~~~~~~~~~~~~~~~TD~~G~--~~~~~~~~G~wli~  208 (215)
T PF10670_consen  134 LPLEIVPLTNPYKLKAGDPLPFQVLFDGKPLAG---AEVEAFSPGGWYDVEHEAKTLKTDANGR--ATFTLPRPGLWLIR  208 (215)
T ss_pred             CcEEEEECcCcccccCCCEEEEEEEECCeEccc---EEEEEEECCCccccccceEEEEECCCCE--EEEecCCCEEEEEE
Confidence            45778884       345678999999999985   56666555433      55666667885  56788899999999


Q ss_pred             EEECC
Q psy192           86 LKFAD   90 (898)
Q Consensus        86 V~~~g   90 (898)
                      +.+.+
T Consensus       209 a~~~~  213 (215)
T PF10670_consen  209 ASHKD  213 (215)
T ss_pred             EEEec
Confidence            98754


No 28 
>PF13753 SWM_repeat:  Putative flagellar system-associated repeat
Probab=79.21  E-value=1.1e+02  Score=33.92  Aligned_cols=142  Identities=13%  Similarity=0.246  Sum_probs=71.9

Q ss_pred             eEEEEEeccCCCCc-eEEEEEcCCcceeeEEECCCCcEEEEEEeC---CCceEEEEEE-EC---CeeCCCCCceeEecCC
Q psy192          227 CEFNVWTREAGAGS-LAISVEGPSKAEIDFKDRKDGSCYVSYVVA---EPGEYRVGIK-FN---DQHIPDSPYKLFVSPA  298 (898)
Q Consensus       227 ~~F~V~~~daG~g~-l~v~v~gps~~~i~~~d~~dGty~v~y~P~---~~G~y~i~V~-~~---g~~I~gSPf~v~V~~~  298 (898)
                      ..|.....++..+. +.+.+.   +....+....+|.|.+.+.|.   ..|.|.|.+. .-   |..-......+.|...
T Consensus        63 ~t~s~tvs~~~~g~~v~v~~~---g~~~t~~~~~~G~ws~t~~~~~~l~~g~~ti~v~~~tD~aGN~~t~~s~~~~vDt~  139 (317)
T PF13753_consen   63 VTFSGTVSGAEPGSTVTVTIN---GTTGTLTADADGNWSVTVTPSDDLPDGDYTITVTTVTDAAGNTSTAASQTFTVDTT  139 (317)
T ss_pred             eEEEEEecCCCCCCEEEEEEC---CEEEEEEEecCCcEEEeeccccccccCcceeEEEEEEccCCccccccccccccccc
Confidence            35655555555554 555552   223344445789999999884   7899999998 42   2221112333333221


Q ss_pred             CCCceeEEEeeeCCCceecCCCeEEEEEecCCccc------eEEEEECCCCCeeeeEEEecCCCeEEEEEEe-----cCC
Q psy192          299 MGDAHKLEIAQFPQGVVMADKPTQFLVRKNGAVGA------LDAKVISPSGTEDDCFIQPIDGDNYSIRFMP-----REN  367 (898)
Q Consensus       299 ~~~~~~~~~~~~~~~~~~~g~~~~f~V~~~~a~G~------l~v~v~~p~g~~~~~~v~~~~~g~y~v~f~P-----~~~  367 (898)
                      ...+..+.+..+....+..+....+.+..++.-+.      +.+.+   +|...  ......+|.|.+.|.|     ..-
T Consensus       140 ~~~~p~vti~~~~~~~~~~~~~~~~t~t~sg~v~~~~~~d~v~vt~---~G~~~--~~~~~~~g~~t~~~~~~~~~~~~d  214 (317)
T PF13753_consen  140 APTAPTVTITGISDDNIINGAESTVTVTFSGTVTGFDAGDTVTVTI---NGTTY--TTTVGADGTWTVTVTPSDLAGLAD  214 (317)
T ss_pred             cccccccceecccCCceeeccceeecccccccceeeeeceeEEEee---ccccc--ceeecCCCcccccccccccccccC
Confidence            11123344443333333333333333333322111      23323   34322  2233467899999988     345


Q ss_pred             eeEEEEEEE
Q psy192          368 GIHNIHIKF  376 (898)
Q Consensus       368 G~h~i~V~~  376 (898)
                      |.|.+.+..
T Consensus       215 ~~~~v~v~~  223 (317)
T PF13753_consen  215 GTYTVTVTV  223 (317)
T ss_pred             ceEEEEEEe
Confidence            677777654


No 29 
>PF06312 Neurexophilin:  Neurexophilin
Probab=73.51  E-value=9.7  Score=39.68  Aligned_cols=46  Identities=15%  Similarity=0.240  Sum_probs=39.3

Q ss_pred             c-cceEEEEECCCCC-eeeeEEEecCCCeEEEEEEecCCeeEEEEEEE
Q psy192          331 V-GALDAKVISPSGT-EDDCFIQPIDGDNYSIRFMPRENGIHNIHIKF  376 (898)
Q Consensus       331 ~-G~l~v~v~~p~g~-~~~~~v~~~~~g~y~v~f~P~~~G~h~i~V~~  376 (898)
                      + |++.|+|.+++-+ ...+.|.|..||+|.++|.-.=+|.-+|+|.+
T Consensus        68 G~GDfrAri~s~~l~ag~~G~V~Dh~NGTYtv~F~L~W~G~v~vsV~L  115 (219)
T PF06312_consen   68 GWGDFRARIFSPKLKAGAAGKVTDHGNGTYTVSFPLLWPGQVSVSVSL  115 (219)
T ss_pred             CCCCcEEEEecCCccccceEEEEECCCCeEEEEEEeecCceEEEEEEE
Confidence            5 7889999887643 35689999999999999999999999998875


No 30 
>cd02858 Esterase_N_term Esterase N-terminal domain. Esterases catalyze the hydrolysis of organic esters to release an alcohol or thiol and acid. The term can be applied to enzymes that hydrolyze carboxylate, phosphate and sulphate esters, but is more often restricted to the first class of substrate. The N-terminus of esterase may be related to the immunoglobulin and/or fibronectin type III superfamilies.  These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions.  Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=73.23  E-value=20  Score=31.42  Aligned_cols=59  Identities=12%  Similarity=0.196  Sum_probs=41.5

Q ss_pred             eeEEEEEeccCCCCceEEEEEcCCcceeeEEECCCCcEEEEEEeCCCceEEEEEEECCeeC
Q psy192          226 PCEFNVWTREAGAGSLAISVEGPSKAEIDFKDRKDGSCYVSYVVAEPGEYRVGIKFNDQHI  286 (898)
Q Consensus       226 ~~~F~V~~~daG~g~l~v~v~gps~~~i~~~d~~dGty~v~y~P~~~G~y~i~V~~~g~~I  286 (898)
                      .++|.+.+-.|  ..+.|.+.........+...++|.++++..+..+|.|.-.+..+|..+
T Consensus         7 ~v~F~vwAP~A--~~V~L~~~~~~~~~~~m~~~~~G~W~~~v~~l~~g~Y~Y~~~vdg~~~   65 (85)
T cd02858           7 TVTFRLFAPKA--NEVQVRGSWGGAGSHPMTKDEAGVWSVTTGPLAPGIYTYSFLVDGVRV   65 (85)
T ss_pred             cEEEEEECCCC--CEEEEEeecCCCccEeCeECCCeEEEEEECCCCCcEEEEEEEECCeEe
Confidence            46888888766  445555544323345566678999999875577888999999998555


No 31 
>PF02369 Big_1:  Bacterial Ig-like domain (group 1);  InterPro: IPR003344 Proteins that contain this domain are found in a variety of bacterial and phage surface proteins such as intimins. Intimin is a bacterial cell-adhesion molecule that mediates the intimate bacterial host-cell interaction. It contains three domains; two immunoglobulin-like domains and a C-type lectin-like module implying that carbohydrate recognition may be important in intimin-mediated cell adhesion [].; PDB: 1CWV_A 4E9L_A 1F02_I 1F00_I.
Probab=71.79  E-value=21  Score=32.40  Aligned_cols=62  Identities=11%  Similarity=0.106  Sum_probs=35.8

Q ss_pred             CCeEEEEEecCCccc----eEEEE--ECCCCCeeee--EEEecCCCeEEEEEEecCCeeEEEEEEECCee
Q psy192          319 KPTQFLVRKNGAVGA----LDAKV--ISPSGTEDDC--FIQPIDGDNYSIRFMPRENGIHNIHIKFNGVH  380 (898)
Q Consensus       319 ~~~~f~V~~~~a~G~----l~v~v--~~p~g~~~~~--~v~~~~~g~y~v~f~P~~~G~h~i~V~~~g~~  380 (898)
                      +...+++...|+.|+    ..+.+  ....+...+.  ...-..+|.+.++|+...+|.|.|...+++..
T Consensus        23 ~~~tltatV~D~~gnpv~g~~V~f~~~~~~~~l~~~~~~~~Td~~G~a~~tltst~aG~~~VtA~~~~~~   92 (100)
T PF02369_consen   23 DTNTLTATVTDANGNPVPGQPVTFSSSSSGGTLSPTNTSATTDSNGIATVTLTSTKAGTYTVTATVDGGS   92 (100)
T ss_dssp             S-EEEEEEEEETTSEB-TS-EEEE--EESSSEES-CEE-EEE-TTSEEEEEEE-SS-EEEEEEEEETTEE
T ss_pred             CcEEEEEEEEcCCCCCCCCCEEEEEEcCCCcEEecCccccEECCCEEEEEEEEecCceEEEEEEEECCcc
Confidence            344455554555443    34444  2223332222  34557899999999999999999999998753


No 32 
>PF06312 Neurexophilin:  Neurexophilin
Probab=71.34  E-value=11  Score=39.16  Aligned_cols=45  Identities=27%  Similarity=0.292  Sum_probs=37.8

Q ss_pred             cceEEEEeCCCCCe-eeeEEEEcCCCEEEEEEEcCcceeEEEEEEE
Q psy192          142 FDLSATVTSPGGVT-EDAEINEVEDGLYAVHFVPKELGVHTVSVRY  186 (898)
Q Consensus       142 ~~~~~~V~~p~G~~-~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~  186 (898)
                      ||+.|++.++.-+. ...+|+|.+||+|++.|+-.=+|.-.|+|.+
T Consensus        70 GDfrAri~s~~l~ag~~G~V~Dh~NGTYtv~F~L~W~G~v~vsV~L  115 (219)
T PF06312_consen   70 GDFRARIFSPKLKAGAAGKVTDHGNGTYTVSFPLLWPGQVSVSVSL  115 (219)
T ss_pred             CCcEEEEecCCccccceEEEEECCCCeEEEEEEeecCceEEEEEEE
Confidence            77788888776432 4679999999999999999999998888876


No 33 
>smart00634 BID_1 Bacterial Ig-like domain (group 1).
Probab=70.07  E-value=22  Score=31.59  Aligned_cols=52  Identities=17%  Similarity=0.360  Sum_probs=39.5

Q ss_pred             CCEEecCCCCeeEEEecCccce---EEEEeCCCCeEEEEEeeCCCeEEEEEEEECC
Q psy192           38 NGDHVQGYGGLSLSIEGPSKAE---IQCKDNADGSLNISYRPTEPGYYIINLKFAD   90 (898)
Q Consensus        38 ~G~~v~G~g~l~v~v~GP~~~~---i~~~d~~dG~~~v~ytP~~~G~~~I~V~~~g   90 (898)
                      +|.|+++. .+.++++|+....   -....+.+|.+.+.++.+.+|.++|.+..++
T Consensus        30 ~Gnpv~~~-~V~f~~~~~~~~~~~~~~~~Td~~G~a~~~l~~~~~G~~~vta~~~~   84 (92)
T smart00634       30 NGNPVAGQ-EVTFTTPSGGALTLSKGTATTDANGIATVTLTSTTAGVYTVTASLEN   84 (92)
T ss_pred             CCCCcCCC-EEEEEECCCceeeccCCeeeeCCCCEEEEEEECCCCcEEEEEEEECC
Confidence            58889875 4777777764211   2234567999999999999999999998776


No 34 
>PF02369 Big_1:  Bacterial Ig-like domain (group 1);  InterPro: IPR003344 Proteins that contain this domain are found in a variety of bacterial and phage surface proteins such as intimins. Intimin is a bacterial cell-adhesion molecule that mediates the intimate bacterial host-cell interaction. It contains three domains; two immunoglobulin-like domains and a C-type lectin-like module implying that carbohydrate recognition may be important in intimin-mediated cell adhesion [].; PDB: 1CWV_A 4E9L_A 1F02_I 1F00_I.
Probab=69.32  E-value=26  Score=31.80  Aligned_cols=32  Identities=16%  Similarity=0.202  Sum_probs=25.5

Q ss_pred             EEEEcCCCEEEEEEEcCcceeEEEEEEECCee
Q psy192          159 EINEVEDGLYAVHFVPKELGVHTVSVRYKDIH  190 (898)
Q Consensus       159 ~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~  190 (898)
                      ....+.+|.+.++|+...+|.+.|.+.+++..
T Consensus        61 ~~~Td~~G~a~~tltst~aG~~~VtA~~~~~~   92 (100)
T PF02369_consen   61 SATTDSNGIATVTLTSTKAGTYTVTATVDGGS   92 (100)
T ss_dssp             -EEE-TTSEEEEEEE-SS-EEEEEEEEETTEE
T ss_pred             ccEECCCEEEEEEEEecCceEEEEEEEECCcc
Confidence            46778999999999999999999999999754


No 35 
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=69.13  E-value=3.2e+02  Score=34.70  Aligned_cols=47  Identities=13%  Similarity=0.263  Sum_probs=32.9

Q ss_pred             CceEEEEEcCCcc--eeeEEEC--------CCCcEEEEEEe-CCCceEEEEEEECCee
Q psy192          239 GSLAISVEGPSKA--EIDFKDR--------KDGSCYVSYVV-AEPGEYRVGIKFNDQH  285 (898)
Q Consensus       239 g~l~v~v~gps~~--~i~~~d~--------~dGty~v~y~P-~~~G~y~i~V~~~g~~  285 (898)
                      ..+.+.|+.|++.  .+...||        .||.|+-.|+. ...|.|+|.|+.+|..
T Consensus       627 A~Vta~i~~~~g~~~~l~l~DnGaGaD~~knDGIYS~yft~f~~nGrYsl~V~v~~~~  684 (863)
T TIGR00868       627 ANVTALIESENGHTVTLELLDNGAGADTVKNDGIYSRYFTAYDGNGRYSLKVRALGGV  684 (863)
T ss_pred             cEEEEEEECCCCCEEEEEEeecCCCCCccccCceeeeeeecccCCceEEEEEEEeCCC
Confidence            3455666666543  3455564        48999988875 4569999999998765


No 36 
>PF13860 FlgD_ig:  FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=68.22  E-value=33  Score=29.73  Aligned_cols=56  Identities=7%  Similarity=0.092  Sum_probs=30.2

Q ss_pred             CeEEEEEecCCccceEEEEECCCCCeee-eEEEecCCCeEEEEEE-------ecCCeeEEEEEE
Q psy192          320 PTQFLVRKNGAVGALDAKVISPSGTEDD-CFIQPIDGDNYSIRFM-------PRENGIHNIHIK  375 (898)
Q Consensus       320 ~~~f~V~~~~a~G~l~v~v~~p~g~~~~-~~v~~~~~g~y~v~f~-------P~~~G~h~i~V~  375 (898)
                      +..|.+.......++.+.|.+.+|+.+. ..+.....|.+.+.|-       +-..|.|++.|.
T Consensus        13 ~~~~~~~l~~~a~~v~v~I~d~~G~~V~t~~~~~~~~G~~~~~WdG~d~~G~~~~~G~Y~~~v~   76 (81)
T PF13860_consen   13 KGSIEYTLPEDADNVTVTIYDSNGQVVRTISLGSQSAGEHSFTWDGKDDDGNPVPDGTYTFRVT   76 (81)
T ss_dssp             EEEEEEEECSSCEEEEEEEEETTS-EEEEEEEEECSSEEEEEEE-SB-TTS-B--SEEEEEEEE
T ss_pred             EEEEEEeCCCcccEEEEEEEcCCCCEEEEEEcCCcCCceEEEEECCCCCCcCCCCCCCEEEEEE
Confidence            4455555544445678888888887652 2333344556666665       445555555554


No 37 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=64.59  E-value=13  Score=30.49  Aligned_cols=37  Identities=27%  Similarity=0.481  Sum_probs=23.1

Q ss_pred             EEeCCccceEEEEeCCCCeEEEEEEcCCCeeEEEEEEECC
Q psy192          614 AVEGPSKAEITYHDNKDGTVAVSYLPTAPGEYKIAVKFGE  653 (898)
Q Consensus       614 ~v~GPs~~~i~~~~~~dg~~~v~Y~P~~pG~Y~i~V~~~~  653 (898)
                      .++|.++.-+...+...   .++|++..||.|.+.|+.-+
T Consensus        13 ~l~g~d~~W~~~~~~~~---~~~~~~L~~G~Y~l~V~a~~   49 (66)
T PF07495_consen   13 RLEGFDDEWITLGSYSN---SISYTNLPPGKYTLEVRAKD   49 (66)
T ss_dssp             EEETTESSEEEESSTS----EEEEES--SEEEEEEEEEEE
T ss_pred             EEECCCCeEEECCCCcE---EEEEEeCCCEEEEEEEEEEC
Confidence            34555543333333333   99999999999999998744


No 38 
>PF11896 DUF3416:  Domain of unknown function (DUF3416);  InterPro: IPR021828  This presumed domain is functionally uncharacterised. This domain is found in bacteria and archaea. This domain is about 190 amino acids in length. This domain is found associated with PF00128 from PFAM. ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 3ZT7_A 3ZST_B 3ZT6_A 3ZSS_D 3ZT5_B.
Probab=64.15  E-value=27  Score=35.66  Aligned_cols=62  Identities=16%  Similarity=0.255  Sum_probs=36.6

Q ss_pred             eeeCcEEEEEEEcCCCcccceEEEE--eCCCCCe-eeeEEEEcCCCEEEEEEEcCcceeEEEEEE
Q psy192          124 TEVGSTCKLTFKMPGITAFDLSATV--TSPGGVT-EDAEINEVEDGLYAVHFVPKELGVHTVSVR  185 (898)
Q Consensus       124 ~~vg~~~~f~v~~~~~~~~~~~~~V--~~p~G~~-~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~  185 (898)
                      ..+|+...++-++=.-|...+.|.+  ..++++. ....+...+++.|..+|+|.+.|.|+..|.
T Consensus        23 rvvGe~v~V~Adif~DGHD~l~A~l~~r~~~~~~w~~vpM~~~gnDrW~a~f~~~~~G~~~f~Ve   87 (187)
T PF11896_consen   23 RVVGEPVPVSADIFRDGHDALAAELLWRHPGEREWQEVPMTPLGNDRWEASFTPDRPGRYEFRVE   87 (187)
T ss_dssp             EETT-EEEEEEEE--SSSS-EEEEEEEE-TTS-B----B-EESTS-EEEEEEE--SSEEEEEEEE
T ss_pred             eecCCeEEEEEEEEecCCCcEEEEEEEECCCCCcceeeccccCCCCEEEEEEECCCceeEEEEEE
Confidence            3568888877766544555555544  6777765 455677889999999999999999877664


No 39 
>PF13473 Cupredoxin_1:  Cupredoxin-like domain; PDB: 1IBZ_D 1IC0_E 1IBY_D.
Probab=63.36  E-value=13  Score=33.81  Aligned_cols=57  Identities=14%  Similarity=0.219  Sum_probs=32.0

Q ss_pred             eeeeeeEEEEEeccCCCCceEEEEEcCCcceeeEEECCCCcEEEEEEeCCCceEEEEEEECC
Q psy192          222 EQNQPCEFNVWTREAGAGSLAISVEGPSKAEIDFKDRKDGSCYVSYVVAEPGEYRVGIKFND  283 (898)
Q Consensus       222 ~~g~~~~F~V~~~daG~g~l~v~v~gps~~~i~~~d~~dGty~v~y~P~~~G~y~i~V~~~g  283 (898)
                      ..|++.++++.-.|...  -++.+.+-   .....-....+.+++|+|.++|+|.+...++.
T Consensus        40 ~~G~~v~l~~~N~~~~~--h~~~i~~~---~~~~~l~~g~~~~~~f~~~~~G~y~~~C~~~~   96 (104)
T PF13473_consen   40 KAGQPVTLTFTNNDSRP--HEFVIPDL---GISKVLPPGETATVTFTPLKPGEYEFYCTMHP   96 (104)
T ss_dssp             ETTCEEEEEEEE-SSS---EEEEEGGG---TEEEEE-TT-EEEEEEEE-S-EEEEEB-SSS-
T ss_pred             cCCCeEEEEEEECCCCc--EEEEECCC---ceEEEECCCCEEEEEEcCCCCEEEEEEcCCCC
Confidence            45777777776665443  44455441   22222345677899999999999999876554


No 40 
>PF13115 YtkA:  YtkA-like
Probab=62.96  E-value=71  Score=27.71  Aligned_cols=27  Identities=11%  Similarity=0.118  Sum_probs=22.9

Q ss_pred             eEEEecCCCeEEEEEEecCCeeEEEEE
Q psy192          348 CFIQPIDGDNYSIRFMPRENGIHNIHI  374 (898)
Q Consensus       348 ~~v~~~~~g~y~v~f~P~~~G~h~i~V  374 (898)
                      ..+...++|.|.+......+|.|.|.|
T Consensus        59 ~~~~~~~~G~Y~~~~~f~m~G~W~i~v   85 (86)
T PF13115_consen   59 VELEETGPGVYEAEVTFSMAGTWQITV   85 (86)
T ss_pred             eeeecCCCCeEEEEeecCCCeeEEEEE
Confidence            444556899999999999999999987


No 41 
>COG1470 Predicted membrane protein [Function unknown]
Probab=62.65  E-value=3e+02  Score=32.02  Aligned_cols=73  Identities=12%  Similarity=0.219  Sum_probs=42.2

Q ss_pred             eCCCCeEEEEEEcCCceeEEEEEEECCEEecCCCCeeEEEecCccceEEEEeCCCCeEEEEEeeCCCeEEEEEEEECCee
Q psy192           13 DNHDGTVSLHYDPREEGLHELALKFNGDHVQGYGGLSLSIEGPSKAEIQCKDNADGSLNISYRPTEPGYYIINLKFADHH   92 (898)
Q Consensus        13 ~~~d~~~~v~f~P~~~G~h~v~v~~~G~~v~G~g~l~v~v~GP~~~~i~~~d~~dG~~~v~ytP~~~G~~~I~V~~~g~~   92 (898)
                      .|....-.+.+--.-.+...+...|+|..|.|-               .......-.+.+.++|..+|.|.+....+|.+
T Consensus         7 n~~~~~~~~~ls~~~~~~~d~~f~y~g~~i~~~---------------~lr~~e~~~l~~~v~~~~~g~~~v~f~i~~~~   71 (513)
T COG1470           7 NNGTILVTVLLSLMSLKGADVKFLYQGIEIKGL---------------KLRPKESVELQFKVLPGKAGSYSVKFSIEGVP   71 (513)
T ss_pred             cCCcEEEEEEEEehhcCCccEEEEEeeeEeeee---------------EcCCCcceEEEEEEecCCCCcEEEEEEECCce
Confidence            333333344444444477777778888777421               11111233567777888999998887765533


Q ss_pred             cCCCCeEEEEec
Q psy192           93 VEGSPFTAKIVG  104 (898)
Q Consensus        93 i~gSPf~~~v~~  104 (898)
                      -    |++.|+.
T Consensus        72 ~----~~v~v~~   79 (513)
T COG1470          72 Y----WTVNVYT   79 (513)
T ss_pred             e----EEEEEEe
Confidence            2    5666665


No 42 
>PF13473 Cupredoxin_1:  Cupredoxin-like domain; PDB: 1IBZ_D 1IC0_E 1IBY_D.
Probab=60.66  E-value=25  Score=31.95  Aligned_cols=62  Identities=15%  Similarity=0.193  Sum_probs=34.8

Q ss_pred             eecCceeEEEEEcCCCCCceeEEEEeCCCccccccccceEEEEeeCCceEEEEEEeCCCeeEEEEEEECCccCCC
Q psy192          817 IKSGVKTDFIVDTCNAGAGTLAVTIDGPSKVSVKKYKDEIFTRHTGRNNFEVKYIVRDRGEYLLIVKWGDDHIPG  891 (898)
Q Consensus       817 ~~~g~~~~f~vdt~~aG~g~l~v~i~gp~~~~~~~~~~~~~~~~~g~~~y~v~y~~~~~G~y~l~vk~~~~~i~G  891 (898)
                      +..|++..+++.  |.|...-.+.+.+-..        +  ..-....+.+++++|.++|+|.+.+.++.. ..|
T Consensus        39 v~~G~~v~l~~~--N~~~~~h~~~i~~~~~--------~--~~l~~g~~~~~~f~~~~~G~y~~~C~~~~~-m~G  100 (104)
T PF13473_consen   39 VKAGQPVTLTFT--NNDSRPHEFVIPDLGI--------S--KVLPPGETATVTFTPLKPGEYEFYCTMHPN-MKG  100 (104)
T ss_dssp             EETTCEEEEEEE--E-SSS-EEEEEGGGTE--------E--EEE-TT-EEEEEEEE-S-EEEEEB-SSS-T-TB-
T ss_pred             EcCCCeEEEEEE--ECCCCcEEEEECCCce--------E--EEECCCCEEEEEEcCCCCEEEEEEcCCCCc-cee
Confidence            455777777765  3444556666665321        1  112234588999999999999999997775 444


No 43 
>PF13860 FlgD_ig:  FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=60.27  E-value=62  Score=28.00  Aligned_cols=44  Identities=11%  Similarity=0.222  Sum_probs=22.8

Q ss_pred             EEEEEEEcCCCcccceEEEEeCCCCCeeee-EEEEcCCCEEEEEEE
Q psy192          129 TCKLTFKMPGITAFDLSATVTSPGGVTEDA-EINEVEDGLYAVHFV  173 (898)
Q Consensus       129 ~~~f~v~~~~~~~~~~~~~V~~p~G~~~~~-~v~~~~~g~y~v~f~  173 (898)
                      ...|.+.++.. ..++++.|.+.+|+.+.. .+.....|.+.+.|-
T Consensus        13 ~~~~~~~l~~~-a~~v~v~I~d~~G~~V~t~~~~~~~~G~~~~~Wd   57 (81)
T PF13860_consen   13 KGSIEYTLPED-ADNVTVTIYDSNGQVVRTISLGSQSAGEHSFTWD   57 (81)
T ss_dssp             EEEEEEEECSS-CEEEEEEEEETTS-EEEEEEEEECSSEEEEEEE-
T ss_pred             EEEEEEeCCCc-ccEEEEEEEcCCCCEEEEEEcCCcCCceEEEEEC
Confidence            44555555532 235778888888887532 233334445555544


No 44 
>cd02858 Esterase_N_term Esterase N-terminal domain. Esterases catalyze the hydrolysis of organic esters to release an alcohol or thiol and acid. The term can be applied to enzymes that hydrolyze carboxylate, phosphate and sulphate esters, but is more often restricted to the first class of substrate. The N-terminus of esterase may be related to the immunoglobulin and/or fibronectin type III superfamilies.  These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions.  Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=57.21  E-value=93  Score=27.16  Aligned_cols=60  Identities=25%  Similarity=0.279  Sum_probs=40.3

Q ss_pred             CeEEEEEEcCCCCCCceEEEEECCCCceeeeEEeCCe-EEEEEecCCCeeEEEEEEECCEecC
Q psy192          414 VKTDFIVDTCNAGAGTLAVTIDGPSKVSMDCTEVEEG-YKVRYTPLVPGDYYVSLKYNGYHIV  475 (898)
Q Consensus       414 ~~~~F~Vd~~~aG~g~l~v~v~gp~~~~~~~~~~~~g-y~v~ytp~~~G~y~i~V~~~g~~I~  475 (898)
                      ...+|.+.+-.|  ..+.+.+.............++| +.++-.+..+|.|.-.+..+|..++
T Consensus         6 ~~v~F~vwAP~A--~~V~L~~~~~~~~~~~m~~~~~G~W~~~v~~l~~g~Y~Y~~~vdg~~~~   66 (85)
T cd02858           6 RTVTFRLFAPKA--NEVQVRGSWGGAGSHPMTKDEAGVWSVTTGPLAPGIYTYSFLVDGVRVI   66 (85)
T ss_pred             CcEEEEEECCCC--CEEEEEeecCCCccEeCeECCCeEEEEEECCCCCcEEEEEEEECCeEec
Confidence            356888888766  45555555432223444556788 7888766788989988888886553


No 45 
>KOG2996|consensus
Probab=55.74  E-value=7.2  Score=45.28  Aligned_cols=35  Identities=29%  Similarity=0.526  Sum_probs=30.8

Q ss_pred             cceEEEEeCCCCeEEEEEEcCCCeeEEEEEEECCe
Q psy192          620 KAEITYHDNKDGTVAVSYLPTAPGEYKIAVKFGEK  654 (898)
Q Consensus       620 ~~~i~~~~~~dg~~~v~Y~P~~pG~Y~i~V~~~~~  654 (898)
                      +|+-.++...+|+|.|.|.+.+.++|.|+||||++
T Consensus       696 qaes~Lk~~~ngT~LVR~r~kea~e~AISikynne  730 (865)
T KOG2996|consen  696 QAESTLKNRPNGTYLVRYRTKEAKEFAISIKYNNE  730 (865)
T ss_pred             hhhhHhhcCCCceEEEEecccchhheeEEEEeccc
Confidence            34566777889999999999999999999999985


No 46 
>PF13753 SWM_repeat:  Putative flagellar system-associated repeat
Probab=55.33  E-value=1.3e+02  Score=33.37  Aligned_cols=142  Identities=18%  Similarity=0.209  Sum_probs=70.1

Q ss_pred             EEEEEcCCCcccc-eEEEEeCCCCCeeeeEEEEcCCCEEEEEEEcC---cceeEEEEEE-EC---CeecCCCceEEEEcc
Q psy192          131 KLTFKMPGITAFD-LSATVTSPGGVTEDAEINEVEDGLYAVHFVPK---ELGVHTVSVR-YK---DIHIPGSPFQFTVGP  202 (898)
Q Consensus       131 ~f~v~~~~~~~~~-~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~---~~G~h~I~V~-~~---g~~V~gSPf~v~V~~  202 (898)
                      .|.....++..++ +.+.+   +|.....  ....+|.|.+.+.|.   ..|.|.|.+. .-   |..-......+.|..
T Consensus        64 t~s~tvs~~~~g~~v~v~~---~g~~~t~--~~~~~G~ws~t~~~~~~l~~g~~ti~v~~~tD~aGN~~t~~s~~~~vDt  138 (317)
T PF13753_consen   64 TFSGTVSGAEPGSTVTVTI---NGTTGTL--TADADGNWSVTVTPSDDLPDGDYTITVTTVTDAAGNTSTAASQTFTVDT  138 (317)
T ss_pred             EEEEEecCCCCCCEEEEEE---CCEEEEE--EEecCCcEEEeeccccccccCcceeEEEEEEccCCcccccccccccccc
Confidence            4555555554444 44444   4544333  344788999999884   7899999998 42   222111234444432


Q ss_pred             CCCCCcceEEEeCCCCCcceeeeeeEEEEEecc----CCC-CceEEEEEcCCcceeeEEECCCCcEEEEEEe-----CCC
Q psy192          203 LRDGGAHRVHAGGPGLERGEQNQPCEFNVWTRE----AGA-GSLAISVEGPSKAEIDFKDRKDGSCYVSYVV-----AEP  272 (898)
Q Consensus       203 ~~~~d~~~v~a~G~GL~~~~~g~~~~F~V~~~d----aG~-g~l~v~v~gps~~~i~~~d~~dGty~v~y~P-----~~~  272 (898)
                      .. ..+..+.+.+..-.....++...+++...+    ... ..+.+.+.|...   ......||.|++.|.|     ..-
T Consensus       139 ~~-~~~p~vti~~~~~~~~~~~~~~~~t~t~sg~v~~~~~~d~v~vt~~G~~~---~~~~~~~g~~t~~~~~~~~~~~~d  214 (317)
T PF13753_consen  139 TA-PTAPTVTITGISDDNIINGAESTVTVTFSGTVTGFDAGDTVTVTINGTTY---TTTVGADGTWTVTVTPSDLAGLAD  214 (317)
T ss_pred             cc-ccccccceecccCCceeeccceeecccccccceeeeeceeEEEeeccccc---ceeecCCCcccccccccccccccC
Confidence            10 012334444211001111222223332222    122 225566655432   2334578899999988     344


Q ss_pred             ceEEEEEEE
Q psy192          273 GEYRVGIKF  281 (898)
Q Consensus       273 G~y~i~V~~  281 (898)
                      |.|.|.+..
T Consensus       215 ~~~~v~v~~  223 (317)
T PF13753_consen  215 GTYTVTVTV  223 (317)
T ss_pred             ceEEEEEEe
Confidence            666666654


No 47 
>TIGR03096 nitroso_cyanin nitrosocyanin. Nitrosocyanin, as described from the obligate chemolithoautotroph Nitrosomonas europaea, is a red copper protein of unknown function with sequence similarity to a number of blue copper redox proteins.
Probab=53.66  E-value=97  Score=29.88  Aligned_cols=32  Identities=16%  Similarity=0.434  Sum_probs=26.2

Q ss_pred             CCCCcEEEEEEeCCCceEEEEEEEC--CeeCCCC
Q psy192          258 RKDGSCYVSYVVAEPGEYRVGIKFN--DQHIPDS  289 (898)
Q Consensus       258 ~~dGty~v~y~P~~~G~y~i~V~~~--g~~I~gS  289 (898)
                      ..+.+.+++|++.+||.|..+..+.  ..|+|||
T Consensus        97 ~pGet~TitF~adKpG~Y~y~C~~HP~~~H~~~~  130 (135)
T TIGR03096        97 KAGETKTISFKADKAGAFTIWCQLHPKNIHLPGS  130 (135)
T ss_pred             CCCCeEEEEEECCCCEEEEEeCCCCChhhcCCCc
Confidence            4466789999999999999888775  5677876


No 48 
>PF13620 CarboxypepD_reg:  Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=51.93  E-value=64  Score=27.44  Aligned_cols=45  Identities=22%  Similarity=0.256  Sum_probs=29.6

Q ss_pred             ceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEEEEECCee
Q psy192          143 DLSATVTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVSVRYKDIH  190 (898)
Q Consensus       143 ~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g~~  190 (898)
                      ..++.+.+.++.. ........+|.|.+.-.|.  |.|.|.+...|..
T Consensus        16 ~a~V~l~~~~~~~-~~~~~Td~~G~f~~~~l~~--g~Y~l~v~~~g~~   60 (82)
T PF13620_consen   16 GATVTLTDQDGGT-VYTTTTDSDGRFSFEGLPP--GTYTLRVSAPGYQ   60 (82)
T ss_dssp             T-EEEET--TTTE-CCEEE--TTSEEEEEEE-S--EEEEEEEEBTTEE
T ss_pred             CEEEEEEEeeCCC-EEEEEECCCceEEEEccCC--EeEEEEEEECCcc
Confidence            3456666555553 4567778999999997777  9999999888854


No 49 
>smart00634 BID_1 Bacterial Ig-like domain (group 1).
Probab=51.88  E-value=67  Score=28.47  Aligned_cols=30  Identities=20%  Similarity=0.328  Sum_probs=25.9

Q ss_pred             EEEEcCCCEEEEEEEcCcceeEEEEEEECC
Q psy192          159 EINEVEDGLYAVHFVPKELGVHTVSVRYKD  188 (898)
Q Consensus       159 ~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g  188 (898)
                      ....+.+|...+.++...+|.+.|.++.+|
T Consensus        55 ~~~Td~~G~a~~~l~~~~~G~~~vta~~~~   84 (92)
T smart00634       55 TATTDANGIATVTLTSTTAGVYTVTASLEN   84 (92)
T ss_pred             eeeeCCCCEEEEEEECCCCcEEEEEEEECC
Confidence            345667999999999999999999999876


No 50 
>PF07705 CARDB:  CARDB;  InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=50.97  E-value=69  Score=28.18  Aligned_cols=65  Identities=12%  Similarity=0.168  Sum_probs=34.3

Q ss_pred             eeeCcEEEEEEEcCCCcc---cceEEEEeCCCCCeeeeEEEEcC---CCEEEEEEEcCcceeEEEEEEECC
Q psy192          124 TEVGSTCKLTFKMPGITA---FDLSATVTSPGGVTEDAEINEVE---DGLYAVHFVPKELGVHTVSVRYKD  188 (898)
Q Consensus       124 ~~vg~~~~f~v~~~~~~~---~~~~~~V~~p~G~~~~~~v~~~~---~g~y~v~f~P~~~G~h~I~V~~~g  188 (898)
                      ...|+..+|.+.+...|.   ..+.+.+..-+.......|....   ..++.+.|.+..+|.|.|.+.++.
T Consensus        15 ~~~g~~~~i~~~V~N~G~~~~~~~~v~~~~~~~~~~~~~i~~L~~g~~~~v~~~~~~~~~G~~~i~~~iD~   85 (101)
T PF07705_consen   15 VVPGEPVTITVTVKNNGTADAENVTVRLYLDGNSVSTVTIPSLAPGESETVTFTWTPPSPGSYTIRVVIDP   85 (101)
T ss_dssp             EETTSEEEEEEEEEE-SSS-BEEEEEEEEETTEEEEEEEESEB-TTEEEEEEEEEE-SS-CEEEEEEEEST
T ss_pred             ccCCCEEEEEEEEEECCCCCCCCEEEEEEECCceeccEEECCcCCCcEEEEEEEEEeCCCCeEEEEEEEee
Confidence            445666666666654433   23444443222111122332333   336788888889999999999874


No 51 
>PRK13211 N-acetylglucosamine-binding protein A; Reviewed
Probab=49.53  E-value=2.4e+02  Score=33.23  Aligned_cols=114  Identities=14%  Similarity=0.228  Sum_probs=65.2

Q ss_pred             EEeeCCCeEEEEEEEECCeecCCCCeEEEEecCCcccceeeeeeeecccceeeeCcEEEEEEEcCCCcccceEEEEeCCC
Q psy192           73 SYRPTEPGYYIINLKFADHHVEGSPFTAKIVGEGSNRQREKIQRQREAVPVTEVGSTCKLTFKMPGITAFDLSATVTSPG  152 (898)
Q Consensus        73 ~ytP~~~G~~~I~V~~~g~~i~gSPf~~~v~~~~~~~~~~~i~~~~~~~~~~~vg~~~~f~v~~~~~~~~~~~~~V~~p~  152 (898)
                      -|+=...|..+|+|.|+-+.. +--+.+.|++-.  ..            -...+..+++.+++...+..+++++|.+.+
T Consensus       287 iy~~~~s~~~~vei~~~~~~p-~p~~~lsVsGL~--~e------------Y~I~dG~~~i~ftv~a~g~~~vta~V~d~~  351 (478)
T PRK13211        287 IYLKAGSGLERVEIGYDIEAP-APDYELEVSGLA--KE------------YKIGDGAATLDFTVTATGDMNVEATVYNHD  351 (478)
T ss_pred             eEEeCCCCceEEEEEEEecCC-CCCCceEEecCc--ce------------eEEcCCcEEEEEEEEeccceEEEEEEEcCC
Confidence            345566666666666654332 113444444322  11            011123444444544455678999999999


Q ss_pred             CCeeeeEEEEcCC--CEEEEEEEcCcceeEEEEEEE---CCeecCCCceEEEEc
Q psy192          153 GVTEDAEINEVED--GLYAVHFVPKELGVHTVSVRY---KDIHIPGSPFQFTVG  201 (898)
Q Consensus       153 G~~~~~~v~~~~~--g~y~v~f~P~~~G~h~I~V~~---~g~~V~gSPf~v~V~  201 (898)
                      |+.+-..-....+  ..+++...-.++|.|.|.|+.   +|+.+.-.-+.|.|.
T Consensus       352 g~~~~~~~~~v~d~s~~vtL~Ls~~~AG~y~Lvv~~t~~dG~~~~q~~~~~~v~  405 (478)
T PRK13211        352 GEALGSKSQTVNDGSQSVSLDLSKLKAGHHMLVVKAKPKDGELIKQQTLDFMLE  405 (478)
T ss_pred             CCeeeeeeEEecCCceeEEEecccCCCceEEEEEEEEeCCCceeeeeeEEEEEE
Confidence            9865332222333  356666667789999999986   456664555777773


No 52 
>PF09315 DUF1973:  Domain of unknown function (DUF1973);  InterPro: IPR015394 These functionally uncharacterised domains are found in various eukaryotic calcium-dependent chloride channels. 
Probab=47.91  E-value=3.1e+02  Score=27.76  Aligned_cols=72  Identities=21%  Similarity=0.253  Sum_probs=44.2

Q ss_pred             eeCcEEEEEEEcCCCcccceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEc--CcceeEEEEEEECCeecCCCceEEEEc
Q psy192          125 EVGSTCKLTFKMPGITAFDLSATVTSPGGVTEDAEINEVEDGLYAVHFVP--KELGVHTVSVRYKDIHIPGSPFQFTVG  201 (898)
Q Consensus       125 ~vg~~~~f~v~~~~~~~~~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P--~~~G~h~I~V~~~g~~V~gSPf~v~V~  201 (898)
                      .+|....|.|.....  ....+.+++|+|+....-..|.......+.. |  -+.|..+++|.....  .-.+..++|.
T Consensus        26 tvG~~T~f~v~w~~~--~~p~i~L~~P~G~~~~~~~~d~~~~~~~i~i-pg~ae~G~W~y~i~~~~~--~~q~v~vtVt   99 (179)
T PF09315_consen   26 TVGNNTVFTVTWQNS--SPPSITLTDPSGTVYTTFTTDSNSKTARIQI-PGTAEVGTWTYSITNTSS--SSQTVTVTVT   99 (179)
T ss_pred             CCCCCeEEEEEECCC--CCceEEEECCCCCEEeeeEEcccccEEEEEC-CCCcccccEEEEEecCCC--CcceEEEEEE
Confidence            457788888877665  3467789999999765534444444444443 4  467888888753321  1233555554


No 53 
>KOG2996|consensus
Probab=46.14  E-value=15  Score=42.91  Aligned_cols=32  Identities=19%  Similarity=0.560  Sum_probs=28.0

Q ss_pred             EEeeCCceEEEEEEeCCCeeEEEEEEECCc--cC
Q psy192          858 TRHTGRNNFEVKYIVRDRGEYLLIVKWGDD--HI  889 (898)
Q Consensus       858 ~~~~g~~~y~v~y~~~~~G~y~l~vk~~~~--~i  889 (898)
                      .+...+|||-|+|.+++.++|.|+|||+++  ||
T Consensus       701 Lk~~~ngT~LVR~r~kea~e~AISikynnevKHi  734 (865)
T KOG2996|consen  701 LKNRPNGTYLVRYRTKEAKEFAISIKYNNEVKHI  734 (865)
T ss_pred             hhcCCCceEEEEecccchhheeEEEEeccccceE
Confidence            445578999999999999999999999985  55


No 54 
>PF04151 PPC:  Bacterial pre-peptidase C-terminal domain;  InterPro: IPR007280 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  This domain is normally found at the C terminus of secreted archaeal and bacterial peptidases, the majority of which belong to MEROPS peptidase families M4 (vibriolysin, IPR001570 from INTERPRO), M9A amd M9B (microbial collangenase, IPR002169 from INTERPRO), M28 (aminopeptidase Ap1, IPR007484 from INTERPRO) and S8 (subtilisin family peptidases, IPR000209 from INTERPRO).; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 4DY5_B 4DXZ_A 4DY3_B 3JQW_A 3JQX_C 1NQJ_B 1NQD_A 2O8O_A 1WMF_A 1WME_A ....
Probab=44.95  E-value=78  Score=26.34  Aligned_cols=40  Identities=15%  Similarity=0.336  Sum_probs=26.9

Q ss_pred             ceEEEEECCCCceeee-EEeC--Ce--EEEEEecCCCeeEEEEEE
Q psy192          429 TLAVTIDGPSKVSMDC-TEVE--EG--YKVRYTPLVPGDYYVSLK  468 (898)
Q Consensus       429 ~l~v~v~gp~~~~~~~-~~~~--~g--y~v~ytp~~~G~y~i~V~  468 (898)
                      ++.+.+..+++..+.- .+..  .+  -.++|++..+|.|.|.|.
T Consensus        25 d~dl~l~~~~g~~~~~~d~~~~~~~~~~~i~~~~~~~GtYyi~V~   69 (70)
T PF04151_consen   25 DADLYLYDSNGNSLASYDDSSQSGGNDESITFTAPAAGTYYIRVY   69 (70)
T ss_dssp             SEEEEEEETTSSSCEECCCCTCETTSEEEEEEEESSSEEEEEEEE
T ss_pred             CeEEEEEcCCCCchhhheecCCCCCCccEEEEEcCCCEEEEEEEE
Confidence            6677777776433322 1222  23  578999999999999885


No 55 
>cd08548 Type_I_cohesin_like Type I cohesin domain, interaction partner of dockerin. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesion modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. This subfamily represents type I cohesins; their interactions with dockerin mediate assembly of a range of dockerin-borne enzymes to the complex.
Probab=44.73  E-value=2.9e+02  Score=26.48  Aligned_cols=91  Identities=13%  Similarity=0.271  Sum_probs=49.8

Q ss_pred             EEEEEEECCee---cCCCCeEEEeccCCCCCceEEecCCCcceeeeCCeEEEEEEcCCCCCCceEEEEECCCCceeeeEE
Q psy192          370 HNIHIKFNGVH---IPGSPLRIKVGKGEADPAAVHATGNGLAEIKSGVKTDFIVDTCNAGAGTLAVTIDGPSKVSMDCTE  446 (898)
Q Consensus       370 h~i~V~~~g~~---I~gSPf~v~V~~~~~d~s~v~v~G~Gl~~~~~g~~~~F~Vd~~~aG~g~l~v~v~gp~~~~~~~~~  446 (898)
                      .++.|++.+.+   |.+..|.+.     .|++.+.+..-.-.....+-...|.....+  .|.+.+.+..+..-  .-.-
T Consensus        16 v~VpV~~~~v~~~~i~~~~f~l~-----yD~s~Lev~~v~~G~i~~~~~~~~~~n~~~--~g~i~~~~~~~~~~--~~~i   86 (135)
T cd08548          16 VTVPVTLSNVPSKGIGACDFVLS-----YDPSVLEVVSVEAGDIVKNPSANFSSNVSS--EGTIKLLFSDSTGG--SYLI   86 (135)
T ss_pred             EEEEEEEecCCccCEEEEEEEEE-----eCCceeEEEecccCceecCCccceEeeccC--CCeEEEEEECCCCC--ceEE
Confidence            44555555443   555555544     366665554221111111124566665544  57788777766421  1122


Q ss_pred             eCCe--EEEEEecCC---CeeEEEEEEE
Q psy192          447 VEEG--YKVRYTPLV---PGDYYVSLKY  469 (898)
Q Consensus       447 ~~~g--y~v~ytp~~---~G~y~i~V~~  469 (898)
                      .++|  ++++|+...   .|.|.|.+.-
T Consensus        87 ~~~G~~atitfkv~~~a~~~~~~i~~~~  114 (135)
T cd08548          87 TTDGVFATITFKVKSNAAGGTYAITLSS  114 (135)
T ss_pred             ecCCEEEEEEEEEcCCCCCCcEEEEEec
Confidence            3677  689999887   7889887764


No 56 
>PF09134 Invasin_D3:  Invasin, domain 3;  InterPro: IPR015217 This domain adopts a structure consisting of an immunoglobulin-like beta-sandwich, with seven strands in two beta-sheets, arranged in a Greek-key topology. It forms part of the extracellular region of the protein, which can be expressed as a soluble protein (Inv497) that binds integrins and promotes subsequent uptake by cells when attached to bacteria []. ; PDB: 1CWV_A.
Probab=43.46  E-value=72  Score=28.94  Aligned_cols=69  Identities=14%  Similarity=0.198  Sum_probs=36.5

Q ss_pred             CeEEEEEecCCccc----eE-EEEECCCCCeee-eEEEecCCCeEEEEEEecCCeeEEEEEEECCeecCCCCeEEEe
Q psy192          320 PTQFLVRKNGAVGA----LD-AKVISPSGTEDD-CFIQPIDGDNYSIRFMPRENGIHNIHIKFNGVHIPGSPLRIKV  390 (898)
Q Consensus       320 ~~~f~V~~~~a~G~----l~-v~v~~p~g~~~~-~~v~~~~~g~y~v~f~P~~~G~h~i~V~~~g~~I~gSPf~v~V  390 (898)
                      .+.+++.++|+.|+    |. ++.. .+|.... ..+.+ ..|.|+.+..=..+|..+|.=.++|+++..-|-+++.
T Consensus        22 ta~LTfta~D~~g~pvsGl~~l~f~-~~g~~vtls~itE-~~GvYtATl~Gt~~G~vtI~P~v~G~~l~~~~~k~tl   96 (99)
T PF09134_consen   22 TATLTFTARDANGNPVSGLSVLAFV-QDGVPVTLSPITE-NPGVYTATLKGTSAGEVTITPQVNGQPLATLQKKLTL   96 (99)
T ss_dssp             -EEEEEE-B-TTS-B--S--SEEEE-EEES--EE---EE--SS-EEEEEE-SS-EEEEEEEEETTEE-GGG-EEEEE
T ss_pred             eEEEEEEEEcCCCCcccCceeEEEE-ccCCCcEEccccc-cCCEEEEEEEcccCceEEEEEeeCCEEccccceeEEE
Confidence            45566666776553    22 2221 1232211 12333 4689999999999999999999999999876666654


No 57 
>PF00017 SH2:  SH2 domain;  InterPro: IPR000980 The Src homology 2 (SH2) domain is a protein domain of about 100 amino-acid residues first identified as a conserved sequence region between the oncoproteins Src and Fps []. Similar sequences were later found in many other intracellular signal-transducing proteins []. SH2 domains function as regulatory modules of intracellular signalling cascades by interacting with high affinity to phosphotyrosine-containing target peptides in a sequence-specific, SH2 domains recognise between 3-6 residues C-terminal to the phosphorylated tyrosine in a fashion that differs from one SH2 domain to another, and strictly phosphorylation-dependent manner [, , , ]. They are found in a wide variety of protein contexts e.g., in association with catalytic domains of phospholipase Cy (PLCy) and the non-receptor protein tyrosine kinases; within structural proteins such as fodrin and tensin; and in a group of small adaptor molecules, i.e Crk and Nck. The domains are frequently found as repeats in a single protein sequence and will then often bind both mono- and di-phosphorylated substrates.  The structure of the SH2 domain belongs to the alpha+beta class, its overall shape forming a compact flattened hemisphere. The core structural elements comprise a central hydrophobic anti-parallel beta-sheet, flanked by 2 short alpha-helices. The loop between strands 2 and 3 provides many of the binding interactions with the phosphate group of its phosphopeptide ligand, and is hence designated the phosphate binding loop, the phosphorylated ligand binds perpendicular to the beta-sheet and typically interacts with the phosphate binding loop and a hydrophobic binding pocket that interacts with a pY+3 side chain. The N- and C-termini of the domain are close together in space and on the opposite face from the phosphopeptide binding surface and it has been speculated that this has facilitated their integration into surface-exposed regions of host proteins [].; GO: 0005515 protein binding; PDB: 1M27_A 1KA6_A 1D4W_B 1D4T_A 1D1Z_B 1KA7_A 1UUR_A 1UUS_A 1BLJ_A 1BLK_A ....
Probab=42.96  E-value=39  Score=28.56  Aligned_cols=26  Identities=35%  Similarity=0.644  Sum_probs=24.9

Q ss_pred             CCCCeEEEEEEcCCCeeEEEEEEECC
Q psy192          628 NKDGTVAVSYLPTAPGEYKIAVKFGE  653 (898)
Q Consensus       628 ~~dg~~~v~Y~P~~pG~Y~i~V~~~~  653 (898)
                      ..+|+|.|....+.+|.|.|+|++++
T Consensus        19 ~~~G~FLvR~s~~~~~~~~Lsv~~~~   44 (77)
T PF00017_consen   19 KPDGTFLVRPSSSKPGKYVLSVRFDG   44 (77)
T ss_dssp             SSTTEEEEEEESSSTTSEEEEEEETT
T ss_pred             CCCCeEEEEecccccccccccccccc
Confidence            56999999999999999999999999


No 58 
>PF13620 CarboxypepD_reg:  Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=42.23  E-value=64  Score=27.41  Aligned_cols=50  Identities=16%  Similarity=0.361  Sum_probs=31.5

Q ss_pred             CCEEecCCCCeeEEEecCcc-ceEEEEeCCCCeEEEEEeeCCCeEEEEEEEECCee
Q psy192           38 NGDHVQGYGGLSLSIEGPSK-AEIQCKDNADGSLNISYRPTEPGYYIINLKFADHH   92 (898)
Q Consensus        38 ~G~~v~G~g~l~v~v~GP~~-~~i~~~d~~dG~~~v~ytP~~~G~~~I~V~~~g~~   92 (898)
                      +|.|++   +..|.+..... ......-+.+|.|.+.-+|.  |.|.|.+...|..
T Consensus        10 ~g~pv~---~a~V~l~~~~~~~~~~~~Td~~G~f~~~~l~~--g~Y~l~v~~~g~~   60 (82)
T PF13620_consen   10 TGQPVP---GATVTLTDQDGGTVYTTTTDSDGRFSFEGLPP--GTYTLRVSAPGYQ   60 (82)
T ss_dssp             TSCBHT---T-EEEET--TTTECCEEE--TTSEEEEEEE-S--EEEEEEEEBTTEE
T ss_pred             CCCCcC---CEEEEEEEeeCCCEEEEEECCCceEEEEccCC--EeEEEEEEECCcc
Confidence            577775   44666664332 23445667999999998876  9999999876643


No 59 
>COG2372 CopC Uncharacterized protein, homolog of Cu resistance protein CopC [General function prediction only]
Probab=41.39  E-value=72  Score=30.26  Aligned_cols=70  Identities=17%  Similarity=0.305  Sum_probs=40.6

Q ss_pred             CCeEEEEEecCC--ccceEEEEECCCCCeeee-EEEec-CCCe-EEEEE-EecCCeeEEEE---EEECCeecCCC-CeEE
Q psy192          319 KPTQFLVRKNGA--VGALDAKVISPSGTEDDC-FIQPI-DGDN-YSIRF-MPRENGIHNIH---IKFNGVHIPGS-PLRI  388 (898)
Q Consensus       319 ~~~~f~V~~~~a--~G~l~v~v~~p~g~~~~~-~v~~~-~~g~-y~v~f-~P~~~G~h~i~---V~~~g~~I~gS-Pf~v  388 (898)
                      -|..|.+.-++.  .+-..+++.+|+|..+.. ..... .+++ +.|.- .|-++|.|.+.   |.-+|..+.|| +|.+
T Consensus        45 aP~~i~L~Fse~ve~~fs~~~l~~~d~~~v~t~~~~~~~~~~~~l~v~l~~~L~aG~Y~v~WrvvS~DGH~v~G~~sFsV  124 (127)
T COG2372          45 APAAITLEFSEGVEPGFSGAKLTGPDGEEVATAGTKLDEQNHTQLEVPLPQPLKAGVYTVDWRVVSSDGHVVKGSISFSV  124 (127)
T ss_pred             CceeEEEecCCccCCCcceeEEECCCCCccccCcccccccCCcEEEecCcccCCCCcEEEEEEEEecCCcEeccEEEEEe
Confidence            466677766655  233568888888865422 11111 1222 55544 45677888777   45678888886 4444


No 60 
>PF04151 PPC:  Bacterial pre-peptidase C-terminal domain;  InterPro: IPR007280 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  This domain is normally found at the C terminus of secreted archaeal and bacterial peptidases, the majority of which belong to MEROPS peptidase families M4 (vibriolysin, IPR001570 from INTERPRO), M9A amd M9B (microbial collangenase, IPR002169 from INTERPRO), M28 (aminopeptidase Ap1, IPR007484 from INTERPRO) and S8 (subtilisin family peptidases, IPR000209 from INTERPRO).; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 4DY5_B 4DXZ_A 4DY3_B 3JQW_A 3JQX_C 1NQJ_B 1NQD_A 2O8O_A 1WMF_A 1WME_A ....
Probab=40.75  E-value=1.6e+02  Score=24.44  Aligned_cols=53  Identities=19%  Similarity=0.308  Sum_probs=32.4

Q ss_pred             EEEEecCCccceEEEEECCCCCeeeeEE-Eec-CCCeEEEEEEecCCeeEEEEEE
Q psy192          323 FLVRKNGAVGALDAKVISPSGTEDDCFI-QPI-DGDNYSIRFMPRENGIHNIHIK  375 (898)
Q Consensus       323 f~V~~~~a~G~l~v~v~~p~g~~~~~~v-~~~-~~g~y~v~f~P~~~G~h~i~V~  375 (898)
                      +.|+.....+++...+.+++|....... ... .+..=.++|+..++|+|.|.|.
T Consensus        15 l~i~l~~~~~d~dl~l~~~~g~~~~~~d~~~~~~~~~~~i~~~~~~~GtYyi~V~   69 (70)
T PF04151_consen   15 LTIDLSGGSGDADLYLYDSNGNSLASYDDSSQSGGNDESITFTAPAAGTYYIRVY   69 (70)
T ss_dssp             EEEEECETTSSEEEEEEETTSSSCEECCCCTCETTSEEEEEEEESSSEEEEEEEE
T ss_pred             EEEEEcCCCCCeEEEEEcCCCCchhhheecCCCCCCccEEEEEcCCCEEEEEEEE
Confidence            4444444445677778888864321100 111 1234678999999999999885


No 61 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=40.43  E-value=32  Score=28.12  Aligned_cols=19  Identities=32%  Similarity=0.707  Sum_probs=15.4

Q ss_pred             EEEEecCCCeeEEEEEEEC
Q psy192          452 KVRYTPLVPGDYYVSLKYN  470 (898)
Q Consensus       452 ~v~ytp~~~G~y~i~V~~~  470 (898)
                      .++|++..||+|+|.|+..
T Consensus        30 ~~~~~~L~~G~Y~l~V~a~   48 (66)
T PF07495_consen   30 SISYTNLPPGKYTLEVRAK   48 (66)
T ss_dssp             EEEEES--SEEEEEEEEEE
T ss_pred             EEEEEeCCCEEEEEEEEEE
Confidence            9999999999999988863


No 62 
>KOG1693|consensus
Probab=40.21  E-value=1.3e+02  Score=30.81  Aligned_cols=78  Identities=17%  Similarity=0.187  Sum_probs=47.6

Q ss_pred             CCCeEEEEecCCcccceeeeeeeecccceeeeCcEEEEEEEcCCCcccceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEc
Q psy192           95 GSPFTAKIVGEGSNRQREKIQRQREAVPVTEVGSTCKLTFKMPGITAFDLSATVTSPGGVTEDAEINEVEDGLYAVHFVP  174 (898)
Q Consensus        95 gSPf~~~v~~~~~~~~~~~i~~~~~~~~~~~vg~~~~f~v~~~~~~~~~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P  174 (898)
                      =||+++..-+...+...+.+         ..-++.+.|.+.....|..|+...+.+|+|+++..+-...-   =++.|+-
T Consensus        21 a~elTfeLp~~aKqC~Y~d~---------~~~~~~~~~~fqV~tGG~fDVD~~I~aPdgkvI~~~~kk~~---~~~~f~a   88 (209)
T KOG1693|consen   21 ASELTFELPDNAKQCFYEDL---------KKDDDTTSFEFQVQTGGHFDVDYDIEAPDGKVIYSEKKKRY---DSFLFKA   88 (209)
T ss_pred             cccEEEEcCCcchhheeeec---------ccCCceEEEEEEEEeCCceeeEEEEECCCCCEEeecccccc---ccEEEEE
Confidence            35777665554433222222         12234466666655557788999999999997543333222   2467888


Q ss_pred             CcceeEEEEE
Q psy192          175 KELGVHTVSV  184 (898)
Q Consensus       175 ~~~G~h~I~V  184 (898)
                      ...|+|++..
T Consensus        89 e~~G~Y~fCF   98 (209)
T KOG1693|consen   89 EGKGEYTFCF   98 (209)
T ss_pred             ecceEEEEEe
Confidence            8899988764


No 63 
>PF00017 SH2:  SH2 domain;  InterPro: IPR000980 The Src homology 2 (SH2) domain is a protein domain of about 100 amino-acid residues first identified as a conserved sequence region between the oncoproteins Src and Fps []. Similar sequences were later found in many other intracellular signal-transducing proteins []. SH2 domains function as regulatory modules of intracellular signalling cascades by interacting with high affinity to phosphotyrosine-containing target peptides in a sequence-specific, SH2 domains recognise between 3-6 residues C-terminal to the phosphorylated tyrosine in a fashion that differs from one SH2 domain to another, and strictly phosphorylation-dependent manner [, , , ]. They are found in a wide variety of protein contexts e.g., in association with catalytic domains of phospholipase Cy (PLCy) and the non-receptor protein tyrosine kinases; within structural proteins such as fodrin and tensin; and in a group of small adaptor molecules, i.e Crk and Nck. The domains are frequently found as repeats in a single protein sequence and will then often bind both mono- and di-phosphorylated substrates.  The structure of the SH2 domain belongs to the alpha+beta class, its overall shape forming a compact flattened hemisphere. The core structural elements comprise a central hydrophobic anti-parallel beta-sheet, flanked by 2 short alpha-helices. The loop between strands 2 and 3 provides many of the binding interactions with the phosphate group of its phosphopeptide ligand, and is hence designated the phosphate binding loop, the phosphorylated ligand binds perpendicular to the beta-sheet and typically interacts with the phosphate binding loop and a hydrophobic binding pocket that interacts with a pY+3 side chain. The N- and C-termini of the domain are close together in space and on the opposite face from the phosphopeptide binding surface and it has been speculated that this has facilitated their integration into surface-exposed regions of host proteins [].; GO: 0005515 protein binding; PDB: 1M27_A 1KA6_A 1D4W_B 1D4T_A 1D1Z_B 1KA7_A 1UUR_A 1UUS_A 1BLJ_A 1BLK_A ....
Probab=40.09  E-value=39  Score=28.62  Aligned_cols=27  Identities=22%  Similarity=0.407  Sum_probs=24.7

Q ss_pred             eCCceEEEEEEeCCCeeEEEEEEECCc
Q psy192          861 TGRNNFEVKYIVRDRGEYLLIVKWGDD  887 (898)
Q Consensus       861 ~g~~~y~v~y~~~~~G~y~l~vk~~~~  887 (898)
                      ..+|+|-|+.....+|.|.|+|++.++
T Consensus        19 ~~~G~FLvR~s~~~~~~~~Lsv~~~~~   45 (77)
T PF00017_consen   19 KPDGTFLVRPSSSKPGKYVLSVRFDGK   45 (77)
T ss_dssp             SSTTEEEEEEESSSTTSEEEEEEETTE
T ss_pred             CCCCeEEEEeccccccccccccccccc
Confidence            458999999999999999999999983


No 64 
>PF07705 CARDB:  CARDB;  InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=38.60  E-value=75  Score=27.97  Aligned_cols=67  Identities=30%  Similarity=0.539  Sum_probs=35.7

Q ss_pred             ecCCeeEEEEEecCCCCCCCceeeecccCCCCcEEEEeCCccc--eE-EEEeCCCCeEEEEEEcCCCeeEEEEEEECC
Q psy192          579 VSGEPCLFTISTKGAGAGSPFQFTVGPLRDGGLSMAVEGPSKA--EI-TYHDNKDGTVAVSYLPTAPGEYKIAVKFGE  653 (898)
Q Consensus       579 ~~g~p~~fti~t~~ag~g~p~~~~~~~~~~g~l~v~v~GPs~~--~i-~~~~~~dg~~~v~Y~P~~pG~Y~i~V~~~~  653 (898)
                      ..|++..|++.-++.|....-.+.+        .+.+.|....  .+ .+...+..++.+.|.+..+|.|.|.+..+.
T Consensus        16 ~~g~~~~i~~~V~N~G~~~~~~~~v--------~~~~~~~~~~~~~i~~L~~g~~~~v~~~~~~~~~G~~~i~~~iD~   85 (101)
T PF07705_consen   16 VPGEPVTITVTVKNNGTADAENVTV--------RLYLDGNSVSTVTIPSLAPGESETVTFTWTPPSPGSYTIRVVIDP   85 (101)
T ss_dssp             ETTSEEEEEEEEEE-SSS-BEEEEE--------EEEETTEEEEEEEESEB-TTEEEEEEEEEE-SS-CEEEEEEEEST
T ss_pred             cCCCEEEEEEEEEECCCCCCCCEEE--------EEEECCceeccEEECCcCCCcEEEEEEEEEeCCCCeEEEEEEEee
Confidence            4557778888777777643211111        1222222211  22 233344566778888889999999988754


No 65 
>PF07691 PA14:  PA14 domain;  InterPro: IPR011658 The PA14 domain forms an insert in bacterial beta-glucosidases, other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins and bacterial toxins, including anthrax protective antigen (PA). The domain also occurs in a Dictyostelium pre-spore cell-inducing factor Psi and in fibrocystin, the mammalian protein whose mutation leads to polycystic kidney and hepatic disease. The crystal structure of PA shows that this domain (named PA14 after its location in the PA20 pro-peptide) has a beta-barrel structure. The PA14 domain sequence suggests a binding function, rather than a catalytic role. The PA14 domain distribution is compatible with carbohydrate binding [].; PDB: 2XVG_A 2XVK_A 2XVL_A 2XJU_A 2XJT_A 2XJQ_A 2XJS_A 2XJV_A 2XJP_A 2XJR_A ....
Probab=38.44  E-value=3.5e+02  Score=25.53  Aligned_cols=80  Identities=19%  Similarity=0.200  Sum_probs=51.3

Q ss_pred             eEEEEEEcCCceeEEEEEEECCEEecCCCCeeEEEecCccceEEEEeC--------CCCeEEEEEeeCCCeEEEEEEEEC
Q psy192           18 TVSLHYDPREEGLHELALKFNGDHVQGYGGLSLSIEGPSKAEIQCKDN--------ADGSLNISYRPTEPGYYIINLKFA   89 (898)
Q Consensus        18 ~~~v~f~P~~~G~h~v~v~~~G~~v~G~g~l~v~v~GP~~~~i~~~d~--------~dG~~~v~ytP~~~G~~~I~V~~~   89 (898)
                      .|+-.|+|.+.|.|++.+.-+|.       ..+.|.|-.  -++...+        ..........-..-+.|.|.|.|-
T Consensus        49 ~~~G~~~~~~~G~y~f~~~~~d~-------~~l~idg~~--vid~~~~~~~~~~~~~~~~~~~~v~l~~g~~y~i~i~y~  119 (145)
T PF07691_consen   49 RWTGYFKPPETGTYTFSLTSDDG-------ARLWIDGKL--VIDNWGNQGGGFFNSGPSSTSGTVTLEAGGKYPIRIEYF  119 (145)
T ss_dssp             EEEEEEEESSSEEEEEEEEESSE-------EEEEETTEE--EEECSCTTTSTTTTTSBCCEEEEEEE-TT-EEEEEEEEE
T ss_pred             EEEEEEecccCceEEEEEEeccc-------EEEEECCEE--EEcCCccccccccccccceEEEEEEeeCCeeEEEEEEEE
Confidence            46778999999999999997664       445555411  1111111        123455556666778999999998


Q ss_pred             CeecCCCCeEEEEecCC
Q psy192           90 DHHVEGSPFTAKIVGEG  106 (898)
Q Consensus        90 g~~i~gSPf~~~v~~~~  106 (898)
                      ...-....|.+....+.
T Consensus       120 ~~~~~~~~~~l~w~~p~  136 (145)
T PF07691_consen  120 NRGGDASLIQLSWTPPG  136 (145)
T ss_dssp             ECSCSSCCEEEEEE-TT
T ss_pred             ECCCCCEEEEEEEECCC
Confidence            87777777777777665


No 66 
>PF05751 FixH:  FixH;  InterPro: IPR008620 This family consists of several Rhizobium FixH like proteins. It has been suggested that the four proteins FixG, FixH, FixI, and FixS may participate in a membrane-bound complex coupling the FixI cation pump with a redox process catalysed by FixG [].
Probab=37.45  E-value=84  Score=30.29  Aligned_cols=50  Identities=10%  Similarity=0.309  Sum_probs=39.2

Q ss_pred             CceeEEEEeCCCccccccccceEEEEeeCCceEEEEEEeCCCeeEEEEEEECC
Q psy192          834 AGTLAVTIDGPSKVSVKKYKDEIFTRHTGRNNFEVKYIVRDRGEYLLIVKWGD  886 (898)
Q Consensus       834 ~g~l~v~i~gp~~~~~~~~~~~~~~~~~g~~~y~v~y~~~~~G~y~l~vk~~~  886 (898)
                      ...+++.+.-|+....+   ..+.....++|.|........+|.|.|.|+|..
T Consensus        85 ~~~~~l~l~rp~~~~~D---~~~~l~~~~~g~y~~~~~~~~~G~W~l~l~~~~  134 (146)
T PF05751_consen   85 GAKLTLSLYRPTDAKLD---FTLTLTESAPGVYRAPVPLLKKGRWYLRLDWEP  134 (146)
T ss_pred             CceEEEEEECCCCccCC---eeEEeeECCCceEEEEcCCCCCccEEEEEEEec
Confidence            46788888888764332   246667788999999999889999999997743


No 67 
>TIGR03096 nitroso_cyanin nitrosocyanin. Nitrosocyanin, as described from the obligate chemolithoautotroph Nitrosomonas europaea, is a red copper protein of unknown function with sequence similarity to a number of blue copper redox proteins.
Probab=37.44  E-value=34  Score=32.92  Aligned_cols=30  Identities=17%  Similarity=0.383  Sum_probs=26.6

Q ss_pred             CceEEEEEEeCCCeeEEEEEEECC--ccCCCC
Q psy192          863 RNNFEVKYIVRDRGEYLLIVKWGD--DHIPGS  892 (898)
Q Consensus       863 ~~~y~v~y~~~~~G~y~l~vk~~~--~~i~GS  892 (898)
                      +.+.++++++.+||+|...|.|+-  .|+|||
T Consensus        99 Get~TitF~adKpG~Y~y~C~~HP~~~H~~~~  130 (135)
T TIGR03096        99 GETKTISFKADKAGAFTIWCQLHPKNIHLPGS  130 (135)
T ss_pred             CCeEEEEEECCCCEEEEEeCCCCChhhcCCCc
Confidence            458899999999999999999965  599997


No 68 
>smart00252 SH2 Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
Probab=36.50  E-value=40  Score=29.08  Aligned_cols=29  Identities=24%  Similarity=0.499  Sum_probs=25.1

Q ss_pred             EEeCCCCeEEEEEEcCCCeeEEEEEEECC
Q psy192          625 YHDNKDGTVAVSYLPTAPGEYKIAVKFGE  653 (898)
Q Consensus       625 ~~~~~dg~~~v~Y~P~~pG~Y~i~V~~~~  653 (898)
                      +...++|+|.|....+.+|.|.|++++++
T Consensus        17 L~~~~~G~FLvR~s~~~~~~~~Lsv~~~~   45 (84)
T smart00252       17 LKNEGDGDFLVRDSESEPGDYVLSVRVKG   45 (84)
T ss_pred             HhcCCCcEEEEEcCCCCCCCEEEEEEECC
Confidence            34456999999999999999999999973


No 69 
>smart00252 SH2 Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
Probab=36.00  E-value=33  Score=29.58  Aligned_cols=28  Identities=25%  Similarity=0.471  Sum_probs=24.0

Q ss_pred             eeCCceEEEEEEeCCCeeEEEEEEECCc
Q psy192          860 HTGRNNFEVKYIVRDRGEYLLIVKWGDD  887 (898)
Q Consensus       860 ~~g~~~y~v~y~~~~~G~y~l~vk~~~~  887 (898)
                      ..++|+|-|+...+.+|.|.|++++.++
T Consensus        19 ~~~~G~FLvR~s~~~~~~~~Lsv~~~~~   46 (84)
T smart00252       19 NEGDGDFLVRDSESEPGDYVLSVRVKGK   46 (84)
T ss_pred             cCCCcEEEEEcCCCCCCCEEEEEEECCE
Confidence            3458999999999999999999999853


No 70 
>PF04234 CopC:  CopC domain;  InterPro: IPR007348 CopC is a bacterial blue copper protein that binds 1 atom of copper per protein molecule. Along with CopA, CopC mediates copper resistance by sequestration of copper in the periplasm [].; GO: 0005507 copper ion binding, 0046688 response to copper ion, 0042597 periplasmic space; PDB: 1IX2_B 1LYQ_A 2C9P_C 2C9R_A 2C9Q_A 1M42_A 1OT4_A 1NM4_A.
Probab=35.88  E-value=32  Score=30.97  Aligned_cols=55  Identities=36%  Similarity=0.467  Sum_probs=30.8

Q ss_pred             eEEEEeCCCCCeeee---EEEEcCCCEEEEEEEc-CcceeEEEEEEE---CCeecCCCceEEEE
Q psy192          144 LSATVTSPGGVTEDA---EINEVEDGLYAVHFVP-KELGVHTVSVRY---KDIHIPGSPFQFTV  200 (898)
Q Consensus       144 ~~~~V~~p~G~~~~~---~v~~~~~g~y~v~f~P-~~~G~h~I~V~~---~g~~V~gSPf~v~V  200 (898)
                      -++.|++++|+.+..   .+.. ....+.+..-+ ..+|.|.|.-+.   +|.++.|+ |.|.|
T Consensus        35 s~v~v~~~~g~~v~~~~~~~~~-~~~~~~~~l~~~l~~G~YtV~wrvvs~DGH~~~G~-~~F~V   96 (97)
T PF04234_consen   35 SSVTVTDPDGKRVDLGEPTVDG-DGKTLTVPLPPPLPPGTYTVSWRVVSADGHPVSGS-FSFTV   96 (97)
T ss_dssp             -EEEEEEEEETTSCTCEEEEEE-STTEEEEEESS---SEEEEEEEEEEETTSCEEEEE-EEEEE
T ss_pred             cEEEEEcCCCceeecCcceecC-CceEEEEECCCCCCCceEEEEEEEEecCCCCcCCE-EEEEE
Confidence            346677777765432   2222 34566777666 567777776555   67777776 44443


No 71 
>PRK06655 flgD flagellar basal body rod modification protein; Reviewed
Probab=34.43  E-value=1.6e+02  Score=31.06  Aligned_cols=27  Identities=19%  Similarity=0.209  Sum_probs=15.7

Q ss_pred             CeEEEEEecCCccceEEEEECCCCCee
Q psy192          320 PTQFLVRKNGAVGALDAKVISPSGTED  346 (898)
Q Consensus       320 ~~~f~V~~~~a~G~l~v~v~~p~g~~~  346 (898)
                      ...|.++..++...+.+.|.+.+|+.+
T Consensus       114 ~~~~~~~l~~~a~~vti~I~D~~G~~V  140 (225)
T PRK06655        114 TTPFGVELPSAADNVTVTITDSAGQVV  140 (225)
T ss_pred             ceEEEEEcCCCCcEEEEEEEcCCCCEE
Confidence            455555544443456677777777654


No 72 
>PF09430 DUF2012:  Protein of unknown function (DUF2012);  InterPro: IPR019008  This domain is found in different proteins, including uncharacterised protein family UPF0480 and nodal modulators. A nodal modulator has been identified as part of a protein complex that participates in the nodal signaling pathway during vertebrate development []. 
Probab=33.98  E-value=1.6e+02  Score=27.73  Aligned_cols=55  Identities=22%  Similarity=0.454  Sum_probs=41.2

Q ss_pred             eeEEEecCccceEEEEeCCCCeEEEEEeeCCCeEEEEEEEECCeecCCCCeEEEEecCC
Q psy192           48 LSLSIEGPSKAEIQCKDNADGSLNISYRPTEPGYYIINLKFADHHVEGSPFTAKIVGEG  106 (898)
Q Consensus        48 l~v~v~GP~~~~i~~~d~~dG~~~v~ytP~~~G~~~I~V~~~g~~i~gSPf~~~v~~~~  106 (898)
                      ..|++.|-....+...-+.||+|.+.=+|  +|.|.++|..-+..-+  |+++.|.+.+
T Consensus        10 t~V~L~~g~~~~~~~~v~~dG~F~f~~Vp--~GsY~L~V~s~~~~F~--~~RVdV~~~~   64 (123)
T PF09430_consen   10 TRVTLNGGQYRPISAFVRSDGSFVFHNVP--PGSYLLEVHSPDYVFP--PYRVDVSSSG   64 (123)
T ss_pred             EEEEEeCCCccceEEEecCCCEEEeCCCC--CceEEEEEECCCcccc--CEEEEEecCC
Confidence            44566554444556666789999988775  5999999998877777  7999998555


No 73 
>PF03422 CBM_6:  Carbohydrate binding module (family 6);  InterPro: IPR005084 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see [].  This entry represents CBM6 from CAZY which was previously known as cellulose-binding domain family VI (CBD VI). CBM6 bind to amorphous cellulose, xylan, mixed beta-(1,3)(1,4)glucan and beta-1,3-glucan[, , ]. CBM6 adopts a classic lectin-like beta-jelly roll fold, predominantly consisting of five antiparallel beta-strands on one face and four antiparallel beta-strands on the other face. It contains two potential ligand binding sites, named respectively cleft A and B. These clefts include aromatic residues which are probably involved in the substrate binding. The cleft B is located on the concave surface of one beta-sheet, and the cleft A on one edge of the protein between the loop that connects the inner and outer beta-sheets of the jellyroll fold []. The multiple binding clefts confer the extensive range of specificities displayed by the domain [, , ].; GO: 0030246 carbohydrate binding; PDB: 1UY1_A 1UY3_A 1UY4_A 1UY2_A 1UYY_A 1UXZ_B 1UYZ_A 1UY0_B 1UYX_A 1UZ0_A ....
Probab=33.43  E-value=2e+02  Score=26.55  Aligned_cols=64  Identities=22%  Similarity=0.357  Sum_probs=39.3

Q ss_pred             cCCceeEEEEEEECCEEecCCCCeeEEEecCcc---ceEEEEeCC-CCe---EEEEEeeCCCeEEEEEEEECCe
Q psy192           25 PREEGLHELALKFNGDHVQGYGGLSLSIEGPSK---AEIQCKDNA-DGS---LNISYRPTEPGYYIINLKFADH   91 (898)
Q Consensus        25 P~~~G~h~v~v~~~G~~v~G~g~l~v~v~GP~~---~~i~~~d~~-dG~---~~v~ytP~~~G~~~I~V~~~g~   91 (898)
                      =.+.|.|.+.+++....  +.+.++|.+.|+..   ..+.+...+ -.+   ..+.. ....|.|+|.+.+.+.
T Consensus        41 ~~~~g~y~~~~~~a~~~--~~~~~~l~id~~~g~~~~~~~~~~tg~w~~~~~~~~~v-~l~~G~h~i~l~~~~~  111 (125)
T PF03422_consen   41 VPEAGTYTLTIRYANGG--GGGTIELRIDGPDGTLIGTVSLPPTGGWDTWQTVSVSV-KLPAGKHTIYLVFNGG  111 (125)
T ss_dssp             ESSSEEEEEEEEEEESS--SSEEEEEEETTTTSEEEEEEEEE-ESSTTEEEEEEEEE-EEESEEEEEEEEESSS
T ss_pred             eCCCceEEEEEEEECCC--CCcEEEEEECCCCCcEEEEEEEcCCCCccccEEEEEEE-eeCCCeeEEEEEEECC
Confidence            45889999998884331  11568899988443   445553222 122   22333 3444999999999764


No 74 
>PF01483 P_proprotein:  Proprotein convertase P-domain;  InterPro: IPR002884 This domain, termed the P domain is approximately 150 amino acids in length and C-terminal to a serine endopeptidase domain which belong to MEROPS peptidase family S8 (clan SB), subfamily S8B (kexin). The domain is primarily associated with the calcium-dependent serine endopeptidases, kex2/subtilisin proprotein convertases (PCs), which have been identified in all eukaryotes [] and in the gammaproteobacteria, Nostoc (cyanobacteria) and in Streptomyces avermitilis. The P domain appears necessary for folding and maintaining the endopeptidase catalytic domain and to regulate its calcium and acidic pH dependence. In addition, contained within the middle of the P domain in most PC family members is the cognate integrin binding RGD sequence [], which may be required for intracellular compartmentalization and maintenance of enzyme stability within the ER. The integrity of the RGD sequence of proprotein convertase PC1 is critical for its zymogen and C-terminal processing and for its cellular trafficking [, ]. The carboxy-terminal tail provides uniqueness to each PC family member being the least conserved region of all convertases [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 1P8J_E 1OT5_B 2ID4_A 1R64_A 3HJR_A.
Probab=31.90  E-value=3e+02  Score=23.91  Aligned_cols=35  Identities=26%  Similarity=0.358  Sum_probs=22.6

Q ss_pred             eEEEEEeccCCCCceEEEEEcCCcceeeEEECCCC
Q psy192          227 CEFNVWTREAGAGSLAISVEGPSKAEIDFKDRKDG  261 (898)
Q Consensus       227 ~~F~V~~~daG~g~l~v~v~gps~~~i~~~d~~dG  261 (898)
                      +.+.++..-.-.++|.+.+.+|++..+.+.++..+
T Consensus         4 v~v~v~i~H~~~gdL~i~L~SP~Gt~~~L~~~~~~   38 (87)
T PF01483_consen    4 VQVSVDITHPYRGDLRITLISPSGTRSTLKDRRGS   38 (87)
T ss_dssp             EEEEEEEEESSGGGEEEEEE-TT--EEEEE-SSTT
T ss_pred             EEEEEEEEcCCcCCEEEEEECCCCCEEEEECCcCC
Confidence            45556665667799999999999988877765433


No 75 
>PF01483 P_proprotein:  Proprotein convertase P-domain;  InterPro: IPR002884 This domain, termed the P domain is approximately 150 amino acids in length and C-terminal to a serine endopeptidase domain which belong to MEROPS peptidase family S8 (clan SB), subfamily S8B (kexin). The domain is primarily associated with the calcium-dependent serine endopeptidases, kex2/subtilisin proprotein convertases (PCs), which have been identified in all eukaryotes [] and in the gammaproteobacteria, Nostoc (cyanobacteria) and in Streptomyces avermitilis. The P domain appears necessary for folding and maintaining the endopeptidase catalytic domain and to regulate its calcium and acidic pH dependence. In addition, contained within the middle of the P domain in most PC family members is the cognate integrin binding RGD sequence [], which may be required for intracellular compartmentalization and maintenance of enzyme stability within the ER. The integrity of the RGD sequence of proprotein convertase PC1 is critical for its zymogen and C-terminal processing and for its cellular trafficking [, ]. The carboxy-terminal tail provides uniqueness to each PC family member being the least conserved region of all convertases [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 1P8J_E 1OT5_B 2ID4_A 1R64_A 3HJR_A.
Probab=31.56  E-value=2e+02  Score=25.04  Aligned_cols=61  Identities=20%  Similarity=0.325  Sum_probs=32.4

Q ss_pred             EEEEEEcCCCcccceEEEEeCCCCCeeeeEEEEcC------CCEEEEEEEcCcceeEEEEEEECCeecCCCceEEEEcc
Q psy192          130 CKLTFKMPGITAFDLSATVTSPGGVTEDAEINEVE------DGLYAVHFVPKELGVHTVSVRYKDIHIPGSPFQFTVGP  202 (898)
Q Consensus       130 ~~f~v~~~~~~~~~~~~~V~~p~G~~~~~~v~~~~------~g~y~v~f~P~~~G~h~I~V~~~g~~V~gSPf~v~V~~  202 (898)
                      +.+.+++.....++|.+.+++|+|...  .+.+..      ++.+...|+         ...|.|+...|. +++.|..
T Consensus         4 v~v~v~i~H~~~gdL~i~L~SP~Gt~~--~L~~~~~~d~~~~~~~~~~f~---------~~~f~Ge~~~G~-W~L~v~D   70 (87)
T PF01483_consen    4 VQVSVDITHPYRGDLRITLISPSGTRS--TLKDRRGSDDSGSGFLNWTFT---------SVAFWGESANGT-WTLRVTD   70 (87)
T ss_dssp             EEEEEEEEESSGGGEEEEEE-TT--EE--EEE-SSTTHHHSTSEEEEEEE---------ESTTTT-B--EE-EEEEEEE
T ss_pred             EEEEEEEEcCCcCCEEEEEECCCCCEE--EEECCcCCCcccccccccEEE---------EEeecCccCCCE-EEEEEEE
Confidence            345555555567999999999999864  344322      345666665         233456665554 5555543


No 76 
>PRK06655 flgD flagellar basal body rod modification protein; Reviewed
Probab=31.55  E-value=1.9e+02  Score=30.48  Aligned_cols=14  Identities=21%  Similarity=0.437  Sum_probs=8.3

Q ss_pred             ceEEEEeCCCCCee
Q psy192          143 DLSATVTSPGGVTE  156 (898)
Q Consensus       143 ~~~~~V~~p~G~~~  156 (898)
                      .+++.|.+.+|+.+
T Consensus       127 ~vti~I~D~~G~~V  140 (225)
T PRK06655        127 NVTVTITDSAGQVV  140 (225)
T ss_pred             EEEEEEEcCCCCEE
Confidence            35666666666654


No 77 
>COG2373 Large extracellular alpha-helical protein [General function prediction only]
Probab=31.48  E-value=1.6e+03  Score=31.09  Aligned_cols=117  Identities=15%  Similarity=0.162  Sum_probs=77.3

Q ss_pred             eEEEeC-CCCCcceeeeeeEEEEEeccCCC------CceEEEEEcCCccee---eEEECCCCcEEEEEEeC---CCceEE
Q psy192          210 RVHAGG-PGLERGEQNQPCEFNVWTREAGA------GSLAISVEGPSKAEI---DFKDRKDGSCYVSYVVA---EPGEYR  276 (898)
Q Consensus       210 ~v~a~G-~GL~~~~~g~~~~F~V~~~daG~------g~l~v~v~gps~~~i---~~~d~~dGty~v~y~P~---~~G~y~  276 (898)
                      ++.++. -||-+  .|+..++.+-+|+...      ..+.+.+..|++..+   .+.-+..|.+.++|.-.   ..|.|+
T Consensus       395 k~y~ftDRglYR--pGE~v~~~~~~R~~~~~~a~~~~p~~l~v~~PdG~~~~~~~~~~~~~G~~~~~~~l~~na~tG~w~  472 (1621)
T COG2373         395 KVYLFTDRGLYR--PGETVHVNALLRDFDGKTALDNQPLKLRVLDPDGSVLRTLTITLDEEGLYELSFPLPENALTGGYT  472 (1621)
T ss_pred             EEEEecCcccCC--CCceeeeeeeehhhcccccccCCCeEEEEECCCCcEEEEEEEeccccCceEEeeeCCCCCCcceEE
Confidence            444443 24543  5899999998887533      457899999998654   45557889999999643   359999


Q ss_pred             EEEEECCe-eCCCCCceeEecCCCCCceeEEEeeeCCCceecCCCeEEEEEecCCc
Q psy192          277 VGIKFNDQ-HIPDSPYKLFVSPAMGDAHKLEIAQFPQGVVMADKPTQFLVRKNGAV  331 (898)
Q Consensus       277 i~V~~~g~-~I~gSPf~v~V~~~~~~~~~~~~~~~~~~~~~~g~~~~f~V~~~~a~  331 (898)
                      |.+.+.|+ .+....|.|.-.-  .+.=++.+... ...+..|++..+.|+.+...
T Consensus       473 l~~~~~~~~~~~s~~f~V~df~--p~r~~i~l~~~-k~~~~~g~~v~~~v~~~yL~  525 (1621)
T COG2373         473 LELYTGGKSAVISMSFRVEDFI--PDRFKINLTLD-KTEWVPGKDVKIKVDLRYLY  525 (1621)
T ss_pred             EEEEeCCccceeeeeEEhhHhC--CceEEEecccc-cccccCCCcEEEEEEEEecC
Confidence            99999987 5666677663221  12122333222 22357888999999876443


No 78 
>KOG1948|consensus
Probab=31.30  E-value=1.2e+03  Score=29.59  Aligned_cols=122  Identities=20%  Similarity=0.254  Sum_probs=64.5

Q ss_pred             eEEEecCCCeEEEEEEecCCeeEEEEEEECCeecCCCCeEEEeccCCCCCceEEecCCCcceeeeCCeEEEEEEcC----
Q psy192          348 CFIQPIDGDNYSIRFMPRENGIHNIHIKFNGVHIPGSPLRIKVGKGEADPAAVHATGNGLAEIKSGVKTDFIVDTC----  423 (898)
Q Consensus       348 ~~v~~~~~g~y~v~f~P~~~G~h~i~V~~~g~~I~gSPf~v~V~~~~~d~s~v~v~G~Gl~~~~~g~~~~F~Vd~~----  423 (898)
                      |.-....+|.|.+.-.|  .|+|.+.-.|-|+     |..++|.+   .+-++.+.-+-|+-...-+-+-|+|..|    
T Consensus       256 c~~vsd~~G~fsfksvP--sGkY~l~a~y~ge-----~~~fdvSP---~~l~v~Vehd~lqi~~ef~vtgfSvtGRVl~g  325 (1165)
T KOG1948|consen  256 CIGVSDPRGRFSFKSVP--SGKYYLAASYVGE-----PKSFDVSP---NPLKVVVEHDHLQIASEFRVTGFSVTGRVLVG  325 (1165)
T ss_pred             EEEEEcCCceEEEEEcC--CCCEEEEEEecCC-----ceEEEeCC---CceeEEEeccceeccceeEEEEEEeeeeEEeC
Confidence            33344568887777666  7999998887654     44555532   2333333322232222222233444332    


Q ss_pred             --CCCCCceEEEEECCCCceeeeEEeCCe-EEEEEecCCCeeEEEEEEECCEecCCCCeEEEEecc
Q psy192          424 --NAGAGTLAVTIDGPSKVSMDCTEVEEG-YKVRYTPLVPGDYYVSLKYNGYHIVGSPFKVKCTGK  486 (898)
Q Consensus       424 --~aG~g~l~v~v~gp~~~~~~~~~~~~g-y~v~ytp~~~G~y~i~V~~~g~~I~gSPf~v~v~~~  486 (898)
                        +-|-....+.+.|..+.    ....+| |+++= -.+.|.|+|+++  -+|..=|+.+++|.+.
T Consensus       326 ~~g~~l~gvvvlvngk~~~----kTdaqGyykLen-~~t~gtytI~a~--kehlqFstv~~kv~pn  384 (1165)
T KOG1948|consen  326 SKGLPLSGVVVLVNGKSGG----KTDAQGYYKLEN-LKTDGTYTITAK--KEHLQFSTVHAKVKPN  384 (1165)
T ss_pred             CCCCCccceEEEEcCcccc----eEcccceEEeee-eeccCcEEEEEe--ccceeeeeEEEEecCC
Confidence              11222333444443322    122456 66654 346789988776  4566667888877654


No 79 
>PRK10301 hypothetical protein; Provisional
Probab=30.12  E-value=1.5e+02  Score=28.13  Aligned_cols=68  Identities=24%  Similarity=0.509  Sum_probs=38.1

Q ss_pred             eEEEEEEECCEEecCCCCeeEEEecCccceEEEE----eCCCC-eEEEEEe-eCCCeEEEEEEE---ECCeecCCC-CeE
Q psy192           30 LHELALKFNGDHVQGYGGLSLSIEGPSKAEIQCK----DNADG-SLNISYR-PTEPGYYIINLK---FADHHVEGS-PFT   99 (898)
Q Consensus        30 ~h~v~v~~~G~~v~G~g~l~v~v~GP~~~~i~~~----d~~dG-~~~v~yt-P~~~G~~~I~V~---~~g~~i~gS-Pf~   99 (898)
                      +-+|++.||...-++.  ..+++.++.+..+...    +..++ ++.+..- +-.+|.|+|+-+   -+|+++.|+ .|+
T Consensus        45 P~~V~L~F~e~v~~~~--s~i~v~~~~g~~v~~~~~~~~~~~~~~~~v~l~~~L~~G~YtV~Wrvvs~DGH~~~G~~~F~  122 (124)
T PRK10301         45 PQALTLNFSEGIEPGF--SGATITGPKQENIKTLPAKRNEQDQKQLIVPLADSLKPGTYTVDWHVVSVDGHKTKGHYTFS  122 (124)
T ss_pred             CCEEEEEcCCCccccc--cEEEEEcCCCCEeccCCccccCCCCcEEEEECCCCCCCccEEEEEEEEecCCCccCCeEEEE
Confidence            4578888875543322  3466667766555422    12222 3444442 357888887644   567777776 443


No 80 
>PRK12633 flgD flagellar basal body rod modification protein; Provisional
Probab=29.99  E-value=1.8e+02  Score=30.80  Aligned_cols=45  Identities=18%  Similarity=0.255  Sum_probs=24.2

Q ss_pred             CeEEEEEecCCccceEEEEECCCCCeee-eEEEecCCCeEEEEEEe
Q psy192          320 PTQFLVRKNGAVGALDAKVISPSGTEDD-CFIQPIDGDNYSIRFMP  364 (898)
Q Consensus       320 ~~~f~V~~~~a~G~l~v~v~~p~g~~~~-~~v~~~~~g~y~v~f~P  364 (898)
                      .+.|.++..++...+.+.|++.+|+.+. ..+-....|.+.+.|-.
T Consensus       117 ~~~~~~~l~~~a~~v~v~I~D~~G~vV~t~~lg~~~aG~~~f~WDG  162 (230)
T PRK12633        117 ATPFGIDLQGDATKVTVKVLDPSGAVVRTMELGDLKTGVHTLQWDG  162 (230)
T ss_pred             ceeEEEecCCcCcEEEEEEEeCCCCEEEEEecCCCCCCceeEEECC
Confidence            3445555444444577778887777643 22323345555555544


No 81 
>PF09430 DUF2012:  Protein of unknown function (DUF2012);  InterPro: IPR019008  This domain is found in different proteins, including uncharacterised protein family UPF0480 and nodal modulators. A nodal modulator has been identified as part of a protein complex that participates in the nodal signaling pathway during vertebrate development []. 
Probab=29.87  E-value=2.1e+02  Score=27.01  Aligned_cols=53  Identities=28%  Similarity=0.408  Sum_probs=37.2

Q ss_pred             eEEEEEcCCcceeeEEECCCCcEEEEEEeCCCceEEEEEEECCeeCCCCCceeEecC
Q psy192          241 LAISVEGPSKAEIDFKDRKDGSCYVSYVVAEPGEYRVGIKFNDQHIPDSPYKLFVSP  297 (898)
Q Consensus       241 l~v~v~gps~~~i~~~d~~dGty~v~y~P~~~G~y~i~V~~~g~~I~gSPf~v~V~~  297 (898)
                      ..|.+.+.....+...-+.||+|.+.=+|  +|.|.|+|..-+-..  +|++|.|.+
T Consensus        10 t~V~L~~g~~~~~~~~v~~dG~F~f~~Vp--~GsY~L~V~s~~~~F--~~~RVdV~~   62 (123)
T PF09430_consen   10 TRVTLNGGQYRPISAFVRSDGSFVFHNVP--PGSYLLEVHSPDYVF--PPYRVDVSS   62 (123)
T ss_pred             EEEEEeCCCccceEEEecCCCEEEeCCCC--CceEEEEEECCCccc--cCEEEEEec
Confidence            34555554443556666889988776665  699999998776665  578888873


No 82 
>PF05751 FixH:  FixH;  InterPro: IPR008620 This family consists of several Rhizobium FixH like proteins. It has been suggested that the four proteins FixG, FixH, FixI, and FixS may participate in a membrane-bound complex coupling the FixI cation pump with a redox process catalysed by FixG [].
Probab=29.77  E-value=2e+02  Score=27.69  Aligned_cols=56  Identities=25%  Similarity=0.448  Sum_probs=41.2

Q ss_pred             EEEEEEE---CCEEecCCCCeeEEEecCccc----eEEEEeCCCCeEEEEEeeCCCeEEEEEEE
Q psy192           31 HELALKF---NGDHVQGYGGLSLSIEGPSKA----EIQCKDNADGSLNISYRPTEPGYYIINLK   87 (898)
Q Consensus        31 h~v~v~~---~G~~v~G~g~l~v~v~GP~~~----~i~~~d~~dG~~~v~ytP~~~G~~~I~V~   87 (898)
                      -.+.+.+   +|.|+.+. .+.+.+.-|+..    .+..+..++|.|........+|.|.|.|.
T Consensus        69 ~~~~i~~~d~~g~~~~~~-~~~l~l~rp~~~~~D~~~~l~~~~~g~y~~~~~~~~~G~W~l~l~  131 (146)
T PF05751_consen   69 NSLTIRLTDPNGAPVSGA-KLTLSLYRPTDAKLDFTLTLTESAPGVYRAPVPLLKKGRWYLRLD  131 (146)
T ss_pred             CeEEEEEEcCCCCcCcCc-eEEEEEECCCCccCCeeEEeeECCCceEEEEcCCCCCccEEEEEE
Confidence            3445555   56665433 577888888754    45666789999999998889999999994


No 83 
>PF07703 A2M_N_2:  Alpha-2-macroglobulin family N-terminal region;  InterPro: IPR011625 This is a domain of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; PDB: 2QKI_D 3L3O_D 3NMS_A 2ICF_A 2A73_A 2ICE_D 2HR0_A 2A74_A 2XWJ_G 3OHX_A ....
Probab=29.59  E-value=4.7e+02  Score=24.44  Aligned_cols=105  Identities=14%  Similarity=0.117  Sum_probs=57.8

Q ss_pred             CceecCCCeEEEEEecCCccceEEEEECCCCCeeeeEEEecCCC--eEEEEEEecCCeeEEEEEEE----CCeecCCCCe
Q psy192          313 GVVMADKPTQFLVRKNGAVGALDAKVISPSGTEDDCFIQPIDGD--NYSIRFMPRENGIHNIHIKF----NGVHIPGSPL  386 (898)
Q Consensus       313 ~~~~~g~~~~f~V~~~~a~G~l~v~v~~p~g~~~~~~v~~~~~g--~y~v~f~P~~~G~h~i~V~~----~g~~I~gSPf  386 (898)
                      ..+.+|+.+.+.|.+....+.+-..+.+ .|+.......+..++  .+.+..++.......+.+.+    +|+.+.+| .
T Consensus         8 ~~~~~Ge~~~v~v~~~~~~~~~~~~v~s-~g~I~~~~~~~~~~~~~~~~~~v~~~~~P~~~v~~~~v~~~~g~~~~~s-~   85 (136)
T PF07703_consen    8 DSYKPGETAKVTVQSPFPNGTFLYLVES-RGKIVSTGSVELKNGSTTFEFPVTPDMAPNFYVLAYYVRPADGEVVADS-V   85 (136)
T ss_dssp             SSB-TTSEEEEEEEEESCESEEEEEEEE-TTEEEEEEEEECTTTSSEEEEEE-GGGTSEEEEEEEEETTCTCEEEEEE-E
T ss_pred             CCcCCCCEEEEEEEcCCCccEEEEEEEE-CCeEEEEEEEEecCCcEEEEEecchhcCCcEEEEEEEEcCCCCeEEEEE-E
Confidence            3578899999999987766767777765 344443433333333  57777778888777777665    45555443 2


Q ss_pred             EEEeccCCCCCceEEecCCCcceeeeCCeEEEEEEc
Q psy192          387 RIKVGKGEADPAAVHATGNGLAEIKSGVKTDFIVDT  422 (898)
Q Consensus       387 ~v~V~~~~~d~s~v~v~G~Gl~~~~~g~~~~F~Vd~  422 (898)
                      .+.|..  .-..++.+.-.- .....|+.+++.|.+
T Consensus        86 ~i~V~~--~~~~~v~l~~~~-~~~~Pg~~~~~~i~~  118 (136)
T PF07703_consen   86 WIEVEP--CFELKVELTASP-DEYKPGEEVTLRIKA  118 (136)
T ss_dssp             EEEBGC--SGSSSEEEEESS-SSBTTTSEEEEEEEE
T ss_pred             EEEecc--cccceEEEEEec-ceeCCCCEEEEEEEe
Confidence            223321  122233332210 233456666666665


No 84 
>KOG1692|consensus
Probab=29.49  E-value=1.9e+02  Score=29.44  Aligned_cols=56  Identities=21%  Similarity=0.338  Sum_probs=37.8

Q ss_pred             eeCcEEEEEEEcCCCcccceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEE
Q psy192          125 EVGSTCKLTFKMPGITAFDLSATVTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVS  183 (898)
Q Consensus       125 ~vg~~~~f~v~~~~~~~~~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~  183 (898)
                      ..|....+++..-+.+-.++..+|.+|+|+...-...+ ..|+|  +|+....|.|+..
T Consensus        38 ~~gd~~~vsF~v~~gg~~~vd~~I~gP~~~~i~~~~~~-ssgk~--tF~a~~~G~Y~fC   93 (201)
T KOG1692|consen   38 EEGDKLSVSFEVIDGGFLGVDVEITGPDGKIIHKGKRE-SSGKY--TFTAPKKGTYTFC   93 (201)
T ss_pred             ccCCEEEEEEEEecCCccceeEEEECCCCchhhhcccc-cCceE--EEEecCCceEEEE
Confidence            35677777776665566778899999999865333333 33555  5667777877765


No 85 
>PRK10301 hypothetical protein; Provisional
Probab=28.76  E-value=1.4e+02  Score=28.23  Aligned_cols=55  Identities=24%  Similarity=0.275  Sum_probs=30.9

Q ss_pred             EEEEeCCCCCeeeeE---EEEcCCCEEEEEEE-cCcceeEEEEEEE---CCeecCCCceEEEE
Q psy192          145 SATVTSPGGVTEDAE---INEVEDGLYAVHFV-PKELGVHTVSVRY---KDIHIPGSPFQFTV  200 (898)
Q Consensus       145 ~~~V~~p~G~~~~~~---v~~~~~g~y~v~f~-P~~~G~h~I~V~~---~g~~V~gSPf~v~V  200 (898)
                      .+++.+++|+.+...   +...+..++.+..- +-.+|.|+|+-+.   +|.++.|+ |.|+|
T Consensus        62 ~i~v~~~~g~~v~~~~~~~~~~~~~~~~v~l~~~L~~G~YtV~Wrvvs~DGH~~~G~-~~F~V  123 (124)
T PRK10301         62 GATITGPKQENIKTLPAKRNEQDQKQLIVPLADSLKPGTYTVDWHVVSVDGHKTKGH-YTFSV  123 (124)
T ss_pred             EEEEEcCCCCEeccCCccccCCCCcEEEEECCCCCCCccEEEEEEEEecCCCccCCe-EEEEE
Confidence            467777877654321   11122223444432 3467888776554   77888887 55554


No 86 
>PF04234 CopC:  CopC domain;  InterPro: IPR007348 CopC is a bacterial blue copper protein that binds 1 atom of copper per protein molecule. Along with CopA, CopC mediates copper resistance by sequestration of copper in the periplasm [].; GO: 0005507 copper ion binding, 0046688 response to copper ion, 0042597 periplasmic space; PDB: 1IX2_B 1LYQ_A 2C9P_C 2C9R_A 2C9Q_A 1M42_A 1OT4_A 1NM4_A.
Probab=28.46  E-value=46  Score=29.94  Aligned_cols=32  Identities=25%  Similarity=0.359  Sum_probs=13.9

Q ss_pred             cEEEEEEE-cccccEEEEE---EECCEEcCCCCeEEE
Q psy192          720 NLGISFTP-REVGSHLVSV---KKMGVHIKNSPFKIN  752 (898)
Q Consensus       720 ~y~v~ftP-~e~G~h~i~V---~~~G~~i~gSPf~i~  752 (898)
                      .+.+...+ ...|.|+|.-   .-||.++.|+ |.|.
T Consensus        60 ~~~~~l~~~l~~G~YtV~wrvvs~DGH~~~G~-~~F~   95 (97)
T PF04234_consen   60 TLTVPLPPPLPPGTYTVSWRVVSADGHPVSGS-FSFT   95 (97)
T ss_dssp             EEEEEESS---SEEEEEEEEEEETTSCEEEEE-EEEE
T ss_pred             EEEEECCCCCCCceEEEEEEEEecCCCCcCCE-EEEE
Confidence            34444433 4444444433   2366666555 4443


No 87 
>PF11896 DUF3416:  Domain of unknown function (DUF3416);  InterPro: IPR021828  This presumed domain is functionally uncharacterised. This domain is found in bacteria and archaea. This domain is about 190 amino acids in length. This domain is found associated with PF00128 from PFAM. ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 3ZT7_A 3ZST_B 3ZT6_A 3ZSS_D 3ZT5_B.
Probab=28.28  E-value=2.3e+02  Score=29.04  Aligned_cols=60  Identities=17%  Similarity=0.292  Sum_probs=36.3

Q ss_pred             eeeeeeEEEEEeccCCCCceEEEE--EcCCc---ceeeEEECCCCcEEEEEEeCCCceEEEEEEE
Q psy192          222 EQNQPCEFNVWTREAGAGSLAISV--EGPSK---AEIDFKDRKDGSCYVSYVVAEPGEYRVGIKF  281 (898)
Q Consensus       222 ~~g~~~~F~V~~~daG~g~l~v~v--~gps~---~~i~~~d~~dGty~v~y~P~~~G~y~i~V~~  281 (898)
                      .+|++..++-+.--.|-..|.+.+  ..+++   .++.+..-++..|..+|+|..+|.|+..|.-
T Consensus        24 vvGe~v~V~Adif~DGHD~l~A~l~~r~~~~~~w~~vpM~~~gnDrW~a~f~~~~~G~~~f~VeA   88 (187)
T PF11896_consen   24 VVGEPVPVSADIFRDGHDALAAELLWRHPGEREWQEVPMTPLGNDRWEASFTPDRPGRYEFRVEA   88 (187)
T ss_dssp             ETT-EEEEEEEE--SSSS-EEEEEEEE-TTS-B----B-EESTS-EEEEEEE--SSEEEEEEEEE
T ss_pred             ecCCeEEEEEEEEecCCCcEEEEEEEECCCCCcceeeccccCCCCEEEEEEECCCceeEEEEEEE
Confidence            567777777776666777776555  45544   3466667788899999999999999888753


No 88 
>cd00173 SH2 Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
Probab=27.05  E-value=89  Score=27.29  Aligned_cols=29  Identities=28%  Similarity=0.525  Sum_probs=26.0

Q ss_pred             EEeCCCCeEEEEEEcCCCeeEEEEEEECC
Q psy192          625 YHDNKDGTVAVSYLPTAPGEYKIAVKFGE  653 (898)
Q Consensus       625 ~~~~~dg~~~v~Y~P~~pG~Y~i~V~~~~  653 (898)
                      +.+..+|+|.|......+|.|.|+|++++
T Consensus        16 L~~~~~G~FLiR~s~~~~~~~~Lsv~~~~   44 (94)
T cd00173          16 LKKKPDGTFLVRDSESSPGDYVLSVRVKG   44 (94)
T ss_pred             HhcCCCceEEEEecCCCCCCEEEEEEECC
Confidence            34467999999999999999999999988


No 89 
>cd00173 SH2 Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
Probab=27.00  E-value=73  Score=27.85  Aligned_cols=27  Identities=22%  Similarity=0.327  Sum_probs=24.4

Q ss_pred             eCCceEEEEEEeCCCeeEEEEEEECCc
Q psy192          861 TGRNNFEVKYIVRDRGEYLLIVKWGDD  887 (898)
Q Consensus       861 ~g~~~y~v~y~~~~~G~y~l~vk~~~~  887 (898)
                      ..+|+|-++.....+|.|.|+|++.++
T Consensus        19 ~~~G~FLiR~s~~~~~~~~Lsv~~~~~   45 (94)
T cd00173          19 KPDGTFLVRDSESSPGDYVLSVRVKGK   45 (94)
T ss_pred             CCCceEEEEecCCCCCCEEEEEEECCE
Confidence            468999999999999999999999873


No 90 
>KOG1690|consensus
Probab=26.09  E-value=1.8e+02  Score=29.85  Aligned_cols=79  Identities=15%  Similarity=0.183  Sum_probs=45.6

Q ss_pred             CCcccCeEEeCCCCeEEEEEEcCCceeEEEEEEECCEEe-cCCCCee--EEEecCccc-eEEEEeCCCCeEEEEEeeCCC
Q psy192            4 GNVDKPVIEDNHDGTVSLHYDPREEGLHELALKFNGDHV-QGYGGLS--LSIEGPSKA-EIQCKDNADGSLNISYRPTEP   79 (898)
Q Consensus         4 g~~~~~~i~~~~d~~~~v~f~P~~~G~h~v~v~~~G~~v-~G~g~l~--v~v~GP~~~-~i~~~d~~dG~~~v~ytP~~~   79 (898)
                      |-..+|-+++..+|+.-       -|.|.+.+.-+-... .++++++  |.|+-|-.. .+-.......+=+++||...+
T Consensus        27 ~~e~KCF~eelpk~tmv-------~G~yk~qlyd~~~~~y~~~p~~gm~VeV~e~fdnnh~Vl~q~~ss~G~ftFta~~~   99 (215)
T KOG1690|consen   27 GTEKKCFIEELPKGTMV-------TGNYKAQLYDDQLKGYGSYPNIGMHVEVKETFDNNHVVLSQQYSSEGDFTFTALTP   99 (215)
T ss_pred             CCcccchhhhCCCCcEE-------EeeeeeeeecchhcccccCCCceEEEEeecCCCCceEEEeecCCCCCceEEEccCC
Confidence            55677888888886632       244444443221111 1334444  555555543 233333344455689999999


Q ss_pred             eEEEEEEEEC
Q psy192           80 GYYIINLKFA   89 (898)
Q Consensus        80 G~~~I~V~~~   89 (898)
                      |+|.|.+.-+
T Consensus       100 GeH~IC~~s~  109 (215)
T KOG1690|consen  100 GEHRICIQSN  109 (215)
T ss_pred             CceEEEEecc
Confidence            9999999854


No 91 
>PRK09619 flgD flagellar basal body rod modification protein; Reviewed
Probab=25.54  E-value=2e+02  Score=30.13  Aligned_cols=45  Identities=11%  Similarity=0.084  Sum_probs=21.9

Q ss_pred             CeEEEEEecCCccceEEEEECCCCCeeeeEEEecCCCeEEEEEEe
Q psy192          320 PTQFLVRKNGAVGALDAKVISPSGTEDDCFIQPIDGDNYSIRFMP  364 (898)
Q Consensus       320 ~~~f~V~~~~a~G~l~v~v~~p~g~~~~~~v~~~~~g~y~v~f~P  364 (898)
                      ...|.++..++...+.+.|.+.+|+.....+-....|.+.+.|-+
T Consensus       111 ~~~~~~~L~~~a~~v~v~I~D~~G~v~t~~l~~~~aG~~~f~WDG  155 (218)
T PRK09619        111 PVAGRLTLKHPAPTLTLHITDILGQEKKIDLGKQPAGPVNFTLDP  155 (218)
T ss_pred             eeEEEEecCCcCcEEEEEEEeCCCCEEEEecCCcCCCceeEEECC
Confidence            334444433333346777777777642222222344555555544


No 92 
>PRK12812 flgD flagellar basal body rod modification protein; Reviewed
Probab=25.38  E-value=2.2e+02  Score=30.72  Aligned_cols=45  Identities=2%  Similarity=-0.046  Sum_probs=23.5

Q ss_pred             CeEEEEEecCCccceEEEEECCCCCeee-eEEEecCCCeEEEEEEe
Q psy192          320 PTQFLVRKNGAVGALDAKVISPSGTEDD-CFIQPIDGDNYSIRFMP  364 (898)
Q Consensus       320 ~~~f~V~~~~a~G~l~v~v~~p~g~~~~-~~v~~~~~g~y~v~f~P  364 (898)
                      ++.|.++......++.+.|.+.+|+.+. ..+.....|.+.+.|-.
T Consensus       129 ~~~~~~~l~~~a~~v~v~I~D~~G~~V~t~~lg~~~aG~~~f~WDG  174 (259)
T PRK12812        129 LIALKLYFPEDSDEGTLEIYDSNNKLVEKIDFKEISQGLFTMEWDG  174 (259)
T ss_pred             eeEEEEecCCcCceEEEEEEeCCCCEEEEEecCCCCCcceeEEECC
Confidence            4555555444444577777777776542 22223344555444433


No 93 
>KOG1692|consensus
Probab=25.17  E-value=1.6e+02  Score=30.04  Aligned_cols=55  Identities=22%  Similarity=0.262  Sum_probs=35.8

Q ss_pred             eCCeEEEEEEcCCCCCCceEEEEECCCCceeeeEEeCCeEEEEEecCCCeeEEEE
Q psy192          412 SGVKTDFIVDTCNAGAGTLAVTIDGPSKVSMDCTEVEEGYKVRYTPLVPGDYYVS  466 (898)
Q Consensus       412 ~g~~~~F~Vd~~~aG~g~l~v~v~gp~~~~~~~~~~~~gy~v~ytp~~~G~y~i~  466 (898)
                      .|...+.+-..-+.|.-.+++.|.||.+-.+.-.+.+.+=..+|++...|.|+..
T Consensus        39 ~gd~~~vsF~v~~gg~~~vd~~I~gP~~~~i~~~~~~ssgk~tF~a~~~G~Y~fC   93 (201)
T KOG1692|consen   39 EGDKLSVSFEVIDGGFLGVDVEITGPDGKIIHKGKRESSGKYTFTAPKKGTYTFC   93 (201)
T ss_pred             cCCEEEEEEEEecCCccceeEEEECCCCchhhhcccccCceEEEEecCCceEEEE
Confidence            3444444444444567788999999987554433333335678899999999873


No 94 
>PF07523 Big_3:  Bacterial Ig-like domain (group 3);  InterPro: IPR011080 This entry represents bacterial domains with an Ig-like fold. These domains are found in a variety of bacterial surface proteins.; PDB: 2L7Y_A 2KPN_A.
Probab=23.77  E-value=83  Score=26.02  Aligned_cols=42  Identities=24%  Similarity=0.202  Sum_probs=25.8

Q ss_pred             cceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEcCcceeEEEEEEECC
Q psy192          142 FDLSATVTSPGGVTEDAEINEVEDGLYAVHFVPKELGVHTVSVRYKD  188 (898)
Q Consensus       142 ~~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~~~G~h~I~V~~~g  188 (898)
                      ..+.+.++..+|+.+     +..+-+.+..|-...+|.|.|.+++.+
T Consensus        17 ~~~~v~at~~dG~~~-----~~~~~~vs~~~d~~~~G~y~Vt~~y~~   58 (67)
T PF07523_consen   17 TGLFVTATYSDGTSL-----PLSDVTVSGTVDTSKAGTYTVTYTYKG   58 (67)
T ss_dssp             HCHEEEEEETTS-ES------GCCSEEES---TTS-CCEEEEEEECT
T ss_pred             cCCEEEEEEcCCCEe-----ceeeeEEEeeeecCCCceEEEEEEECC
Confidence            345677777788762     233344444678899999999999999


No 95 
>PRK09619 flgD flagellar basal body rod modification protein; Reviewed
Probab=23.77  E-value=2.3e+02  Score=29.70  Aligned_cols=32  Identities=16%  Similarity=0.243  Sum_probs=14.9

Q ss_pred             ceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEc
Q psy192          143 DLSATVTSPGGVTEDAEINEVEDGLYAVHFVP  174 (898)
Q Consensus       143 ~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P  174 (898)
                      .+++.|.+.+|+......-....|.+.+.|-+
T Consensus       124 ~v~v~I~D~~G~v~t~~l~~~~aG~~~f~WDG  155 (218)
T PRK09619        124 TLTLHITDILGQEKKIDLGKQPAGPVNFTLDP  155 (218)
T ss_pred             EEEEEEEeCCCCEEEEecCCcCCCceeEEECC
Confidence            35666666666642222222334444444443


No 96 
>PF11589 DUF3244:  Domain of unknown function (DUF3244);  InterPro: IPR021638  This family of proteins with unknown function appear to be restricted to Bacteroidetes. The protein may have an immunoglobulin-like beta-sandwich fold however this cannot be confirmed. ; PDB: 3D33_B 3SD2_A.
Probab=23.75  E-value=2.3e+02  Score=25.84  Aligned_cols=60  Identities=12%  Similarity=0.129  Sum_probs=34.7

Q ss_pred             EEEecCCccceEEEEECCCCCeeeeEEEe-cCCCeEEEEEEecCCeeEEEEEEECCe-ecCC
Q psy192          324 LVRKNGAVGALDAKVISPSGTEDDCFIQP-IDGDNYSIRFMPRENGIHNIHIKFNGV-HIPG  383 (898)
Q Consensus       324 ~V~~~~a~G~l~v~v~~p~g~~~~~~v~~-~~~g~y~v~f~P~~~G~h~i~V~~~g~-~I~g  383 (898)
                      .|.-....+++.++|++..|..+--.... .....+.+...-...|.|.|.+..+.- .+-|
T Consensus        40 ~I~F~~~~~~vtI~I~d~~G~vVy~~~~~~~~~~~~~I~L~~~~~G~Y~l~i~~~~g~~l~G  101 (106)
T PF11589_consen   40 SIEFESPIGDVTITIKDSTGNVVYSETVSNSAGQSITIDLNGLPSGEYTLEITNGNGTYLYG  101 (106)
T ss_dssp             EEEESS--SEEEEEEEETT--EEEEEEESCGGTTEEEEE-TTS-SEEEEEEEEECTC-EEEE
T ss_pred             EEEEcCCCCCEEEEEEeCCCCEEEEEEccCCCCcEEEEEeCCCCCccEEEEEEeCCCCEEEE
Confidence            44444455778888888778765333332 234467777766678999999887754 5443


No 97 
>PRK12633 flgD flagellar basal body rod modification protein; Provisional
Probab=23.39  E-value=2.7e+02  Score=29.50  Aligned_cols=14  Identities=21%  Similarity=0.463  Sum_probs=8.5

Q ss_pred             ceEEEEeCCCCCee
Q psy192          143 DLSATVTSPGGVTE  156 (898)
Q Consensus       143 ~~~~~V~~p~G~~~  156 (898)
                      ++++.|.+.+|+.+
T Consensus       130 ~v~v~I~D~~G~vV  143 (230)
T PRK12633        130 KVTVKVLDPSGAVV  143 (230)
T ss_pred             EEEEEEEeCCCCEE
Confidence            35666666666654


No 98 
>PF06016 Reovirus_L2:  Reovirus core-spike protein lambda-2 (L2);  InterPro: IPR010311 This family consists of several Reovirus core-spike protein lambda-2 (L2) sequences. The reovirus L2 genome segment encodes the core spike protein lambda-2, which mediates enzymatic reactions in 5' capping of the viral plus-strand transcripts [].; GO: 0004482 mRNA (guanine-N7-)-methyltransferase activity, 0004484 mRNA guanylyltransferase activity, 0005524 ATP binding, 0006370 mRNA capping, 0019028 viral capsid; PDB: 1EJ6_A 3IYL_W 3K1Q_A.
Probab=23.35  E-value=1.3e+02  Score=39.24  Aligned_cols=105  Identities=11%  Similarity=0.231  Sum_probs=53.2

Q ss_pred             ceecCCCeEEEEEecCCccceEEE-----EE-CCCCCe----eeeEE-EecCCCeEEEEEEecCCeeEEEEEEECCeecC
Q psy192          314 VVMADKPTQFLVRKNGAVGALDAK-----VI-SPSGTE----DDCFI-QPIDGDNYSIRFMPRENGIHNIHIKFNGVHIP  382 (898)
Q Consensus       314 ~~~~g~~~~f~V~~~~a~G~l~v~-----v~-~p~g~~----~~~~v-~~~~~g~y~v~f~P~~~G~h~i~V~~~g~~I~  382 (898)
                      ...+|+.+++.|..-+....+.+.     +. ...|..    -++.. -+..+..+.++|+|..+|.|.+.+..++..|+
T Consensus      1034 ~~~VGq~~~i~I~~~~~~~s~~~~~~gV~v~~~~~~~~~s~~~~v~~q~~~~~~~~~l~~tp~~aGi~~vv~~~~~~~v~ 1113 (1289)
T PF06016_consen 1034 QLVVGQQCSITIPGPNPQASVSVTHDGVEVFTGTAGNLVSMVGPVALQYDPANNEWTLTWTPSTAGILDVVVDDGPVPVP 1113 (1289)
T ss_dssp             SSCTTT-EEEEEESS-TTEEEEEECTTCEEEECCCTTTCESSSEEEEEEETTTTEEEEEEE-SS-CEEEEEECTTTCCEE
T ss_pred             CccccceEEEEEccCCCCceEEEEECCEEEEEecCCcccccccceeEEEeccCceEEEEEccCCCceEEEEeccCCcccc
Confidence            357899999999865443222222     11 122321    11222 24567899999999999999997777776665


Q ss_pred             CCCeEEEeccCCCCCceEEecCCCcceeeeCCeEEEEEEc
Q psy192          383 GSPLRIKVGKGEADPAAVHATGNGLAEIKSGVKTDFIVDT  422 (898)
Q Consensus       383 gSPf~v~V~~~~~d~s~v~v~G~Gl~~~~~g~~~~F~Vd~  422 (898)
                      -  ..+.|..  .|+.-+.....-+..-.+|..++.++|.
T Consensus      1114 ~--GS~~i~~--Pd~~i~v~~Pa~~D~T~aGtd~~I~~d~ 1149 (1289)
T PF06016_consen 1114 L--GSFTIEP--PDPTITVTWPANLDYTDAGTDVNITCDP 1149 (1289)
T ss_dssp             E--EEEEEB-------EEEE--TT--SSTT-EEEEEEE-T
T ss_pred             c--ceeEeeC--CCCceEEecccceeeeccCceeeEEecc
Confidence            3  2344432  5776666655444444556555555554


No 99 
>PF05404 TRAP-delta:  Translocon-associated protein, delta subunit precursor (TRAP-delta);  InterPro: IPR008855 This family consists of several eukaryotic translocon-associated protein, delta subunit precursors (TRAP-delta or SSR-delta). The exact function of this protein is unknown [].; GO: 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=23.24  E-value=3.2e+02  Score=27.39  Aligned_cols=52  Identities=19%  Similarity=0.267  Sum_probs=0.0

Q ss_pred             EEEEEEcC-CCcccceEEEEeCCCCCeeeeEEEEcCCCEEEEEEEcC----cceeEEEEEE
Q psy192          130 CKLTFKMP-GITAFDLSATVTSPGGVTEDAEINEVEDGLYAVHFVPK----ELGVHTVSVR  185 (898)
Q Consensus       130 ~~f~v~~~-~~~~~~~~~~V~~p~G~~~~~~v~~~~~g~y~v~f~P~----~~G~h~I~V~  185 (898)
                      ++|+++.. +.....|-|+|   +|+..|+ .+..+.++|.|+|+-.    ..|+|.|.+.
T Consensus        45 ~EFsl~C~n~~~~~~LyAeV---~Gk~~PV-ar~~~~nkYQVSW~~e~k~a~sG~y~V~~f  101 (167)
T PF05404_consen   45 AEFSLKCSNGAKNISLYAEV---NGKILPV-ARSGDTNKYQVSWTEEHKKASSGTYEVKFF  101 (167)
T ss_pred             EEEEEEeCCCCcCccEEEEE---CCEEEEE-EEcCCCCceEEEEEechhhccCCceEEEEe


No 100
>PF07523 Big_3:  Bacterial Ig-like domain (group 3);  InterPro: IPR011080 This entry represents bacterial domains with an Ig-like fold. These domains are found in a variety of bacterial surface proteins.; PDB: 2L7Y_A 2KPN_A.
Probab=22.46  E-value=1.4e+02  Score=24.68  Aligned_cols=41  Identities=20%  Similarity=0.255  Sum_probs=23.4

Q ss_pred             eEEEEECCCCCeeeeEEEecCCCeEEEEEEecCCeeEEEEEEECCe
Q psy192          334 LDAKVISPSGTEDDCFIQPIDGDNYSIRFMPRENGIHNIHIKFNGV  379 (898)
Q Consensus       334 l~v~v~~p~g~~~~~~v~~~~~g~y~v~f~P~~~G~h~i~V~~~g~  379 (898)
                      +.+.....+|..     .+.++-+....|....+|.|.|.++|++.
T Consensus        19 ~~v~at~~dG~~-----~~~~~~~vs~~~d~~~~G~y~Vt~~y~~~   59 (67)
T PF07523_consen   19 LFVTATYSDGTS-----LPLSDVTVSGTVDTSKAGTYTVTYTYKGV   59 (67)
T ss_dssp             HEEEEEETTS-E-----S-GCCSEEES---TTS-CCEEEEEEECTE
T ss_pred             CEEEEEEcCCCE-----eceeeeEEEeeeecCCCceEEEEEEECCE
Confidence            445556666755     22233334446789999999999999983


No 101
>PF00801 PKD:  PKD domain;  InterPro: IPR000601 The PKD (Polycystic Kidney Disease) domain was first identified in the Polycystic Kidney Disease protein, polycystin-1 (PDK1 gene), and contains an Ig-like fold consisting of a beta-sandwich of seven strands in two sheets with a Greek key topology, although some members have additional strands []. Polycystin-1 is a large cell-surface glycoprotein involved in adhesive protein-protein and protein-carbohydrate interactions; however it is not clear if the PKD domain mediates any of these interactions.  PKD domains are also found in other proteins, usually in the extracellular parts of proteins involved in interactions with other proteins. For example, domains with a PKD-type fold are found in archaeal surface layer proteins that protect the cell from extreme environments [], and in the human VPS10 domain-containing receptor SorCS2 [].; PDB: 1B4R_A 2KZW_A 2C4X_A 2C26_A 2Y72_B 3JQU_A 3JS7_B 1WGO_A 1L0Q_A.
Probab=22.40  E-value=2.9e+02  Score=22.66  Aligned_cols=53  Identities=23%  Similarity=0.311  Sum_probs=31.1

Q ss_pred             eeeCCeEEEEEEcCCCCCCceEEEEECCCCceeeeEEeCCeEEEEEecCCCeeEEEEEEE
Q psy192          410 IKSGVKTDFIVDTCNAGAGTLAVTIDGPSKVSMDCTEVEEGYKVRYTPLVPGDYYVSLKY  469 (898)
Q Consensus       410 ~~~g~~~~F~Vd~~~aG~g~l~v~v~gp~~~~~~~~~~~~gy~v~ytp~~~G~y~i~V~~  469 (898)
                      ...|+++.|....  .....+....+=.+     ......+-.++|+-..+|.|.|.|+.
T Consensus         8 ~~~g~~v~f~~~~--~~g~~~~y~W~fgd-----~~~~~~~~~~t~ty~~~G~y~V~ltv   60 (69)
T PF00801_consen    8 VPTGQPVTFTASS--SDGSPVTYSWDFGD-----NGTVSTGSSVTHTYSSPGTYTVTLTV   60 (69)
T ss_dssp             EBTTSEEEEEETT--TTSSECEEEEE-SS-----ESEEECSSEEEEEESSSEEEEEEEEE
T ss_pred             EcCCCEEEEEEEc--cCCCCeEEEEEECC-----CCccccCCCEEEEcCCCeEEEEEEEE
Confidence            4567888888777  22233444333111     00112234778888999999998875


No 102
>PF11589 DUF3244:  Domain of unknown function (DUF3244);  InterPro: IPR021638  This family of proteins with unknown function appear to be restricted to Bacteroidetes. The protein may have an immunoglobulin-like beta-sandwich fold however this cannot be confirmed. ; PDB: 3D33_B 3SD2_A.
Probab=22.24  E-value=1.5e+02  Score=27.04  Aligned_cols=52  Identities=13%  Similarity=0.069  Sum_probs=35.7

Q ss_pred             cceEEEEeCCCCCeeeeEEEEcC-CCEEEEEEEcCcceeEEEEEEECCe-ecCC
Q psy192          142 FDLSATVTSPGGVTEDAEINEVE-DGLYAVHFVPKELGVHTVSVRYKDI-HIPG  193 (898)
Q Consensus       142 ~~~~~~V~~p~G~~~~~~v~~~~-~g~y~v~f~P~~~G~h~I~V~~~g~-~V~g  193 (898)
                      ++++++|++-.|..+..+...+. ...+.+.--=...|.|.|.+..+.- .+-|
T Consensus        48 ~~vtI~I~d~~G~vVy~~~~~~~~~~~~~I~L~~~~~G~Y~l~i~~~~g~~l~G  101 (106)
T PF11589_consen   48 GDVTITIKDSTGNVVYSETVSNSAGQSITIDLNGLPSGEYTLEITNGNGTYLYG  101 (106)
T ss_dssp             SEEEEEEEETT--EEEEEEESCGGTTEEEEE-TTS-SEEEEEEEEECTC-EEEE
T ss_pred             CCEEEEEEeCCCCEEEEEEccCCCCcEEEEEeCCCCCccEEEEEEeCCCCEEEE
Confidence            56888999988988776666544 3467777655679999999998865 6655


No 103
>COG4856 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.78  E-value=5.6e+02  Score=29.05  Aligned_cols=25  Identities=24%  Similarity=0.429  Sum_probs=19.1

Q ss_pred             EEEEEECCe--ecCCCceEEEEccCcc
Q psy192          646 KIAVKFGEK--HIKGSPYLAKITGEGR  670 (898)
Q Consensus       646 ~i~V~~~~~--~I~GSPF~v~V~~~~~  670 (898)
                      -+.|+|+..  +|+|-||+++|+=.+.
T Consensus        53 pvdvlYD~~~y~isg~~etV~Vtl~G~   79 (403)
T COG4856          53 PVDVLYDSDKYFISGQPETVTVTLKGP   79 (403)
T ss_pred             eeEEEEccccccccCCceEEEEEEeCC
Confidence            366888764  8999999998875543


No 104
>PF10633 NPCBM_assoc:  NPCBM-associated, NEW3 domain of alpha-galactosidase;  InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=20.46  E-value=5.2e+02  Score=21.84  Aligned_cols=67  Identities=15%  Similarity=0.266  Sum_probs=0.0

Q ss_pred             eecCceeEEEEEcCCCCCce---eEEEEeCCCccccccccceEEEEeeCCceEEEEEEeC-----CCeeEEEEEEE
Q psy192          817 IKSGVKTDFIVDTCNAGAGT---LAVTIDGPSKVSVKKYKDEIFTRHTGRNNFEVKYIVR-----DRGEYLLIVKW  884 (898)
Q Consensus       817 ~~~g~~~~f~vdt~~aG~g~---l~v~i~gp~~~~~~~~~~~~~~~~~g~~~y~v~y~~~-----~~G~y~l~vk~  884 (898)
                      +..|+...|++.-+|.|...   +.+.++.|++-.+..--.++. .-....+-.+.+...     .+|.|.|.++.
T Consensus         1 v~~G~~~~~~~tv~N~g~~~~~~v~~~l~~P~GW~~~~~~~~~~-~l~pG~s~~~~~~V~vp~~a~~G~y~v~~~a   75 (78)
T PF10633_consen    1 VTPGETVTVTLTVTNTGTAPLTNVSLSLSLPEGWTVSASPASVP-SLPPGESVTVTFTVTVPADAAPGTYTVTVTA   75 (78)
T ss_dssp             --TTEEEEEEEEEE--SSS-BSS-EEEEE--TTSE---EEEEE---B-TTSEEEEEEEEEE-TT--SEEEEEEEEE
T ss_pred             CCCCCEEEEEEEEEECCCCceeeEEEEEeCCCCccccCCccccc-cCCCCCEEEEEEEEECCCCCCCceEEEEEEE


Done!