Query         psy15334
Match_columns 451
No_of_seqs    355 out of 1855
Neff          7.7 
Searched_HMMs 46136
Date          Fri Aug 16 18:17:39 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy15334.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15334hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4228|consensus              100.0 3.3E-80 7.2E-85  663.5  20.9  280  150-436   522-809 (1087)
  2 PHA02742 protein tyrosine phos 100.0 1.7E-76 3.6E-81  585.7  30.5  276  148-428     6-299 (303)
  3 PHA02740 protein tyrosine phos 100.0 5.9E-76 1.3E-80  578.4  29.4  269  149-428     7-291 (298)
  4 KOG0791|consensus              100.0 1.7E-76 3.6E-81  571.8  23.2  292  148-442    75-373 (374)
  5 PHA02747 protein tyrosine phos 100.0 1.7E-74 3.7E-79  573.2  30.1  277  148-428     6-302 (312)
  6 PHA02746 protein tyrosine phos 100.0 3.1E-74 6.7E-79  573.6  30.8  276  148-427     6-316 (323)
  7 PHA02738 hypothetical protein; 100.0 4.3E-73 9.4E-78  564.6  28.4  277  146-429     4-298 (320)
  8 KOG0790|consensus              100.0   1E-73 2.2E-78  558.1  13.0  278  166-443   246-540 (600)
  9 KOG0792|consensus              100.0 1.6E-68 3.4E-73  567.0  25.7  266  163-432   867-1137(1144)
 10 smart00194 PTPc Protein tyrosi 100.0 1.1E-67 2.3E-72  513.4  28.9  253  166-422     2-257 (258)
 11 cd00047 PTPc Protein tyrosine  100.0   4E-61 8.7E-66  459.6  27.3  227  192-422     1-230 (231)
 12 KOG0793|consensus              100.0 2.4E-62 5.3E-67  498.0  15.7  259  162-423   730-993 (1004)
 13 PF00102 Y_phosphatase:  Protei 100.0 6.9E-59 1.5E-63  443.3  24.3  229  190-423     1-235 (235)
 14 COG5599 PTP2 Protein tyrosine  100.0 2.3E-59   5E-64  433.3  16.8  238  182-426    45-295 (302)
 15 KOG4228|consensus              100.0 5.3E-59 1.2E-63  500.2  18.3  277  143-426   796-1086(1087)
 16 PRK15375 pathogenicity island  100.0 7.4E-53 1.6E-57  425.3  24.2  215  195-426   303-531 (535)
 17 KOG0789|consensus              100.0 4.5E-51 9.7E-56  422.4  24.3  243  184-431   121-373 (415)
 18 smart00012 PTPc_DSPc Protein t  99.9 1.9E-23   4E-28  173.7  11.9  102  319-422     1-104 (105)
 19 smart00404 PTPc_motif Protein   99.9 1.9E-23   4E-28  173.7  11.9  102  319-422     1-104 (105)
 20 PTZ00242 protein tyrosine phos  99.7 5.2E-16 1.1E-20  140.5  11.7  135  233-421    16-155 (166)
 21 PTZ00393 protein tyrosine phos  99.5 3.8E-13 8.3E-18  126.4  11.1  135  233-421    92-226 (241)
 22 KOG1720|consensus               99.3 2.1E-11 4.5E-16  111.0  11.9   97  321-429   117-213 (225)
 23 KOG2836|consensus               99.2 5.3E-10 1.2E-14   94.6  12.1  131  233-417    17-149 (173)
 24 cd00127 DSPc Dual specificity   99.1 7.6E-10 1.6E-14   96.8  10.2   88  319-418    49-136 (139)
 25 COG2453 CDC14 Predicted protei  98.9 1.2E-08 2.6E-13   93.9  12.5   90  327-425    79-168 (180)
 26 smart00195 DSPc Dual specifici  98.7 1.6E-07 3.4E-12   82.2  11.6   57  354-418    77-133 (138)
 27 PF00782 DSPc:  Dual specificit  98.6   2E-07 4.4E-12   80.9   8.3   70  338-419    60-129 (133)
 28 PRK12361 hypothetical protein;  98.0 3.2E-05   7E-10   83.3  11.5   82  329-419   151-232 (547)
 29 PF05706 CDKN3:  Cyclin-depende  98.0 1.4E-05   3E-10   71.5   5.9   89  252-376    66-154 (168)
 30 PF03162 Y_phosphatase2:  Tyros  97.6 9.3E-05   2E-09   66.9   5.6   72  320-403    59-130 (164)
 31 PF14566 PTPlike_phytase:  Inos  97.3 0.00045 9.8E-09   61.5   5.6   49  328-379   100-148 (149)
 32 KOG2283|consensus               97.0  0.0012 2.6E-08   68.4   6.0   89  326-419    78-169 (434)
 33 KOG1718|consensus               96.5  0.0057 1.2E-07   54.6   5.9   56  339-406    82-137 (198)
 34 PF13350 Y_phosphatase3:  Tyros  96.5   0.004 8.6E-08   56.2   5.1   39  234-273    20-58  (164)
 35 TIGR01244 conserved hypothetic  96.4  0.0082 1.8E-07   52.4   6.3   56  336-405    71-126 (135)
 36 COG5350 Predicted protein tyro  96.4   0.024 5.2E-07   49.8   8.9   82  322-412    61-143 (172)
 37 KOG1716|consensus               96.3   0.013 2.7E-07   57.9   7.8   61  338-410   142-202 (285)
 38 KOG1719|consensus               96.0    0.09 1.9E-06   46.4  10.6   55  355-417   109-163 (183)
 39 PF04273 DUF442:  Putative phos  95.3   0.034 7.5E-07   46.8   5.2   38  336-378    71-108 (110)
 40 COG2365 Protein tyrosine/serin  95.1   0.038 8.3E-07   53.4   5.6   27  355-381   136-162 (249)
 41 PLN02727 NAD kinase             94.7   0.071 1.5E-06   59.5   6.9   42  334-378   323-364 (986)
 42 KOG0196|consensus               93.8    0.14   3E-06   56.2   6.7   21   47-67    501-521 (996)
 43 KOG4471|consensus               93.8   0.086 1.9E-06   55.6   5.1   33  347-379   366-398 (717)
 44 KOG1717|consensus               93.6    0.18 3.9E-06   48.4   6.3   68  339-421   239-306 (343)
 45 PF07353 Uroplakin_II:  Uroplak  90.9    0.29 6.2E-06   43.3   3.8   25   42-66    100-124 (184)
 46 KOG1572|consensus               88.4     1.3 2.8E-05   42.0   6.4   28  351-378   144-171 (249)
 47 PF01102 Glycophorin_A:  Glycop  87.8    0.17 3.6E-06   43.3   0.1   25   87-111    68-94  (122)
 48 PF08693 SKG6:  Transmembrane a  87.3    0.39 8.5E-06   32.5   1.6   21   88-108    17-38  (40)
 49 PF06602 Myotub-related:  Myotu  86.7    0.88 1.9E-05   46.4   4.6   41  339-379   215-255 (353)
 50 KOG2386|consensus               85.4     1.1 2.4E-05   45.8   4.5   88  325-420    91-181 (393)
 51 KOG1089|consensus               82.4     1.9 4.2E-05   46.0   4.9   30  349-378   338-367 (573)
 52 PF01299 Lamp:  Lysosome-associ  80.8     3.8 8.2E-05   40.9   6.2   54   54-111   241-300 (306)
 53 PF00041 fn3:  Fibronectin type  79.4     3.1 6.8E-05   31.8   4.2   24   44-67     56-79  (85)
 54 PF12273 RCR:  Chitin synthesis  77.8    0.63 1.4E-05   40.3  -0.3    7   87-93      2-8   (130)
 55 PF01034 Syndecan:  Syndecan do  73.8     1.1 2.3E-05   33.6   0.1   24   87-110    13-38  (64)
 56 PF15102 TMEM154:  TMEM154 prot  69.1     7.8 0.00017   34.1   4.3    8  166-173   123-130 (146)
 57 PF02439 Adeno_E3_CR2:  Adenovi  67.2     2.5 5.4E-05   28.2   0.7   22   86-107     6-29  (38)
 58 PF04478 Mid2:  Mid2 like cell   63.6     5.5 0.00012   35.3   2.3   17   94-110    62-78  (154)
 59 PF05283 MGC-24:  Multi-glycosy  63.4     4.9 0.00011   37.0   2.1   24   84-107   159-182 (186)
 60 PF12877 DUF3827:  Domain of un  62.6     3.2   7E-05   44.7   0.8   12   82-93    265-276 (684)
 61 PF05808 Podoplanin:  Podoplani  58.3     3.3 7.1E-05   37.0   0.0   28   87-114   133-160 (162)
 62 PF14575 EphA2_TM:  Ephrin type  55.6     5.6 0.00012   31.0   0.9   11   89-99      7-17  (75)
 63 COG3453 Uncharacterized protei  54.9      22 0.00047   30.4   4.3   27  336-366    72-98  (130)
 64 COG1470 Predicted membrane pro  54.6     9.7 0.00021   39.7   2.7   52   54-108   458-509 (513)
 65 KOG4221|consensus               47.4      31 0.00067   40.3   5.3   20   48-67    986-1005(1381)
 66 PF05568 ASFV_J13L:  African sw  46.4       7 0.00015   34.1   0.1   23   88-110    34-56  (189)
 67 smart00060 FN3 Fibronectin typ  44.8      24 0.00053   25.3   3.0   23   44-66     57-79  (83)
 68 PF05454 DAG1:  Dystroglycan (D  43.7     7.7 0.00017   38.3   0.0    8  103-110   168-175 (290)
 69 PF02061 Lambda_CIII:  Lambda P  42.1      34 0.00074   23.2   2.8   24  322-345     2-25  (45)
 70 cd01533 4RHOD_Repeat_2 Member   41.4      44 0.00096   27.2   4.3   15  355-370    66-80  (109)
 71 cd01518 RHOD_YceA Member of th  39.3      31 0.00068   27.6   3.0   18  354-372    60-77  (101)
 72 cd01523 RHOD_Lact_B Member of   38.1      31 0.00066   27.6   2.8   17  354-371    60-76  (100)
 73 TIGR01167 LPXTG_anchor LPXTG-m  37.9      34 0.00075   21.7   2.4    8  100-107    24-31  (34)
 74 PF02009 Rifin_STEVOR:  Rifin/s  37.8     9.4  0.0002   37.9  -0.4    8  103-110   277-284 (299)
 75 PTZ00382 Variant-specific surf  37.7      20 0.00044   29.3   1.6    6  103-108    89-94  (96)
 76 PF07204 Orthoreo_P10:  Orthore  36.7      14 0.00031   29.8   0.5   20   91-110    50-69  (98)
 77 PF05337 CSF-1:  Macrophage col  35.8      12 0.00026   36.3   0.0   23   89-111   232-254 (285)
 78 PF06084 Cytomega_TRL10:  Cytom  34.7      29 0.00062   29.1   2.0   33   75-107    47-80  (150)
 79 COG3889 Predicted solute bindi  33.0      39 0.00085   37.5   3.2   15   95-109   856-871 (872)
 80 PF11770 GAPT:  GRB2-binding ad  32.5      40 0.00086   29.8   2.6   25   83-107     7-33  (158)
 81 PF11353 DUF3153:  Protein of u  32.1      42  0.0009   31.4   3.0   15   50-64    166-180 (209)
 82 cd01519 RHOD_HSP67B2 Member of  31.8      59  0.0013   26.0   3.5   16  354-370    65-80  (106)
 83 TIGR03000 plancto_dom_1 Planct  31.7      65  0.0014   25.1   3.4   28   42-69     27-54  (75)
 84 PF08374 Protocadherin:  Protoc  31.1      14  0.0003   34.6  -0.4   11  235-245   181-191 (221)
 85 cd00063 FN3 Fibronectin type 3  30.8      59  0.0013   23.9   3.2   23   44-66     57-79  (93)
 86 PRK14750 kdpF potassium-transp  30.2      34 0.00073   21.3   1.3   18   86-103     3-20  (29)
 87 PF06365 CD34_antigen:  CD34/Po  30.0      26 0.00057   32.7   1.2   14   99-112   118-131 (202)
 88 cd01448 TST_Repeat_1 Thiosulfa  29.3      89  0.0019   25.8   4.3   33  336-372    63-95  (122)
 89 COG5495 Uncharacterized conser  29.2      56  0.0012   31.3   3.2   18  353-371    89-106 (289)
 90 PLN02160 thiosulfate sulfurtra  28.5      51  0.0011   28.5   2.7   18  354-372    80-97  (136)
 91 PRK11493 sseA 3-mercaptopyruva  25.8      61  0.0013   31.7   3.0   49  322-373   201-249 (281)
 92 TIGR03865 PQQ_CXXCW PQQ-depend  25.4 1.1E+02  0.0024   27.2   4.4   18  354-371   115-132 (162)
 93 PF14610 DUF4448:  Protein of u  25.2      40 0.00087   31.0   1.5   13   90-102   164-176 (189)
 94 PF10727 Rossmann-like:  Rossma  24.7      81  0.0018   27.1   3.2   29  339-367    80-108 (127)
 95 PF03684 UPF0179:  Uncharacteri  24.7      44 0.00096   29.4   1.6   29   35-63     19-51  (142)
 96 KOG1530|consensus               24.6      64  0.0014   28.0   2.5   29  341-370    75-103 (136)
 97 KOG0208|consensus               24.3 1.6E+02  0.0034   34.1   6.0   67  356-422   852-926 (1140)
 98 PRK10287 thiosulfate:cyanide s  24.2 1.2E+02  0.0025   25.0   4.0   17  355-372    60-76  (104)
 99 PF10577 UPF0560:  Uncharacteri  24.2      37  0.0008   38.0   1.2   18   94-111   285-302 (807)
100 cd01520 RHOD_YbbB Member of th  23.2 1.7E+02  0.0036   24.7   4.9   20  354-373    85-104 (128)
101 TIGR03024 arch_pef_cterm PEF-C  23.0      36 0.00078   20.8   0.5    9   99-107    15-23  (26)
102 PTZ00208 65 kDa invariant surf  23.0      58  0.0013   33.5   2.2   11  101-111   406-416 (436)
103 PF12768 Rax2:  Cortical protei  22.9      52  0.0011   32.4   1.9   20   90-109   238-257 (281)
104 PF02035 Coagulin:  Coagulin;    22.1      30 0.00066   29.6   0.1   28  228-259    98-126 (174)
105 TIGR02981 phageshock_pspE phag  21.2 1.5E+02  0.0033   24.1   4.1   18  354-372    57-74  (101)
106 PF13908 Shisa:  Wnt and FGF in  21.0      88  0.0019   28.4   2.9    8   86-93     78-85  (179)
107 TIGR03154 sulfolob_CbsA cytoch  20.5 1.3E+02  0.0029   30.3   4.1   22   87-108   444-465 (465)
108 PF11344 DUF3146:  Protein of u  20.1      71  0.0015   24.9   1.7   15   49-63     65-79  (80)

No 1  
>KOG4228|consensus
Probab=100.00  E-value=3.3e-80  Score=663.50  Aligned_cols=280  Identities=41%  Similarity=0.712  Sum_probs=261.4

Q ss_pred             ceeccch----hhhccCCcc-hHHHhHHhhcCCCCCcccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEee
Q psy15334        150 PIPVTLF----CTMTLDPDW-IAVEFEDLKSVGRGQPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNAN  224 (451)
Q Consensus       150 ~i~~~~f----~~~~~~~~~-~~~Ef~~l~~~~~~~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs  224 (451)
                      ||++++|    ..+..|.+. |++|||.|.+.  ..++..+.+++|+.||||.||++|||+||+|.+++|++++||||||
T Consensus       522 pI~v~dl~~~i~~~~~~~~~~F~~Eyesl~~~--~~~~~~s~~~en~~KNRY~nilayD~sRV~L~~i~Gd~~sDYINAn  599 (1087)
T KOG4228|consen  522 PIPVADLLEHIERRKADDGYGFKQEYESLPEG--QFTWEHSNKKENKQKNRYENILAYDHSRVILPPIEGDPNSDYINAN  599 (1087)
T ss_pred             CccHHHHHHHHHHhhcccccchHHHHhhcccc--CCccccCccccccccccCCcchhhhcceeeecccCCCccccceeee
Confidence            8999999    445555555 99999999876  6678899999999999999999999999999999999999999999


Q ss_pred             eecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEeecce
Q psy15334        225 YIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYGSW  304 (451)
Q Consensus       225 ~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~~~  304 (451)
                      |||||+.+++|||||||+++|+.||||||||+++.+|||+|+++|.++.||++|||  .++..||++.|++.+.....+|
T Consensus       600 yIdGy~e~n~yIaaQgP~~eTv~DFWRMVWEq~S~~IVMvTnl~E~~r~kC~qYWP--~~t~~yGdi~V~~~~~~~~a~y  677 (1087)
T KOG4228|consen  600 YIDGYKEPNAYIAAQGPRPETVGDFWRMVWEQKSAGIVMVTNLEEFSRVKCAQYWP--EGTETYGDIKVTLVQTKPLAEY  677 (1087)
T ss_pred             eeecccccccceeccCCcccchHHHHHHheeccCCcEEEEecccccccccccccCC--CCccccccccccceeeeeeccc
Confidence            99999999999999999999999999999999999999999999999999999999  3467899999999999999999


Q ss_pred             EEEEEEEEeC--CeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHH
Q psy15334        305 VLSELRMCRG--GEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRL  382 (451)
Q Consensus       305 ~~r~l~v~~~--~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l  382 (451)
                      .+|+|.+.+.  ...|.|+||||++|||||+|..+..+|.|++.++....+..||||||||||+||||+|++||.|++++
T Consensus       678 ~iRtf~l~~~g~~~~R~v~qfhFt~Wpd~gvPe~~t~lL~f~rrvk~~~p~~aGPiVVHCSAGvGRTG~fi~iDaml~~~  757 (1087)
T KOG4228|consen  678 GIRTFALKKQGENPKREVRQFHFTAWPDHGVPETPTGLLKFRRRVKTFNPPDAGPIVVHCSAGVGRTGCFIVIDAMLDRL  757 (1087)
T ss_pred             eEEeeeccccCCCCCceeeeeeeccCCCCCCcccchHHHHHHHHhccCCCcCCCCEEEECCCCCCCcceEEEeHHHHHHH
Confidence            9999999854  46899999999999999999999999999999999887788999999999999999999999999999


Q ss_pred             HcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHH-HHhcCCCCchhhh
Q psy15334        383 YNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQV-YLQGGVRHEERQI  436 (451)
Q Consensus       383 ~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~-~l~~~~~~~~~~~  436 (451)
                      +.+   +.+||++.|+.||.||+.||||++||+|||+++++ ++.|.++++...+
T Consensus       758 ~~e---~~vdiy~~v~~lR~QR~~mVQt~eQYiFi~~AllE~~~~G~T~i~~~~~  809 (1087)
T KOG4228|consen  758 ECE---GKVDIYGHVKTLRRQRNNMVQTEEQYIFIHEALLEACLCGDTEVPASTL  809 (1087)
T ss_pred             Hhh---CccceechhHHHHhccccccccHHHHHHHHHHHHHHHhcCCccccHHHH
Confidence            999   99999999999999999999999999999999997 5677777766543


No 2  
>PHA02742 protein tyrosine phosphatase; Provisional
Probab=100.00  E-value=1.7e-76  Score=585.68  Aligned_cols=276  Identities=27%  Similarity=0.504  Sum_probs=250.2

Q ss_pred             Cccee-ccchhhhccCCcc---hHHHhHHhhcCCCCCcccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEe
Q psy15334        148 SHPIP-VTLFCTMTLDPDW---IAVEFEDLKSVGRGQPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNA  223 (451)
Q Consensus       148 ~~~i~-~~~f~~~~~~~~~---~~~Ef~~l~~~~~~~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINA  223 (451)
                      .+.+. +.+|.....+++.   +.+||+.|.......+|..+..++|..||||.||+|||||||+|+..+  +++|||||
T Consensus         6 ~~~~~~~~~~~~~~~~~~~~~~i~~E~~~i~~~~~~~~~~~~~~~~N~~KNRy~~i~~~D~tRV~L~~~~--~~~dYINA   83 (303)
T PHA02742          6 SKKNSFAKNCEQLIEESNLAEILKEEHEHIMQEIVAFSCNESLELKNMKKCRYPDAPCFDRNRVILKIED--GGDDFINA   83 (303)
T ss_pred             ccccCcHHHHHHHHhcCCHHHHHHHHHHHHhcCCCCCchhhccChhhhhcCCCCCCCCCCCCEEEeCCCC--CCCCcEEE
Confidence            34677 7888666666665   899999998766678889999999999999999999999999998633  36899999


Q ss_pred             eeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCC-CCCCceeeeEEEEEeeeEeec
Q psy15334        224 NYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPS-DQHPMRCADIEISLQDVCHYG  302 (451)
Q Consensus       224 s~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~-~~~~~~~g~~~V~~~~~~~~~  302 (451)
                      |||+|+..+++|||||||+++|++|||+||||+++.+|||||+..|.|+.||.+|||. +++...||+++|+++++....
T Consensus        84 syI~g~~~~~~fIatQ~Pl~~T~~dFW~MVw~~~~~vIVmLt~~~E~g~~kc~~YWp~~~~~~~~~g~~~V~~~~~~~~~  163 (303)
T PHA02742         84 SYVDGHNAKGRFICTQAPLEETALDFWQAIFQDQVRVIVMITKIMEDGKEACYPYWMPHERGKATHGEFKIKTKKIKSFR  163 (303)
T ss_pred             eeecCCCCCCeEEEECCCCcccHHHHHHHHhhcCCCeEEEcchhhhCCeeeccccCCCCCCCceEEEEEEEEEEEEEEcC
Confidence            9999999899999999999999999999999999999999999999999999999985 456788999999999998889


Q ss_pred             ceEEEEEEEEe--CCeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcC-----------CCCCCEEEEcCCCCchh
Q psy15334        303 SWVLSELRMCR--GGEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHIS-----------PAQHPIVVHCSAGVGRT  369 (451)
Q Consensus       303 ~~~~r~l~v~~--~~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~-----------~~~~PivVHCs~GvGRt  369 (451)
                      ++++++|.+++  .++.|.|+||||++|||+|+|.++..+++|+..+++...           ...+|||||||+|+|||
T Consensus       164 ~~~~~~l~l~~~~~~~~r~V~h~~y~~Wpd~gvP~~~~~~l~~i~~v~~~~~~~~~~~~~~~~~~~~PIvVHCsaGvGRT  243 (303)
T PHA02742        164 NYAVTNLCLTDTNTGASLDIKHFAYEDWPHGGLPRDPNKFLDFVLAVREADLKADVDIKGENIVKEPPILVHCSAGLDRA  243 (303)
T ss_pred             CEEEEEEEEEECCCCceEEEEEEEECCCCCCCcCCCHHHHHHHHHHHHHHhhhccccccccccCCCCCeEEECCCCCchh
Confidence            99999999986  468999999999999999999999999999999876321           12479999999999999


Q ss_pred             hhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHHHHhcC
Q psy15334        370 GTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQVYLQGG  428 (451)
Q Consensus       370 G~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~~l~~~  428 (451)
                      |+|||+|++++++..+   +.+||+++|+.||+||++||||.+||.|||++|++|++..
T Consensus       244 GtF~aid~~i~~~~~~---~~v~v~~~V~~lR~qR~~~Vqt~~QY~F~y~~l~~y~~~~  299 (303)
T PHA02742        244 GAFCAIDICISKYNER---AIIPLLSIVRDLRKQRHNCLSLPQQYIFCYFIVLIFAKLM  299 (303)
T ss_pred             HHHHHHHHHHHHHHhc---CCCCHHHHHHHHHhhcccccCCHHHHHHHHHHHHHHHHHh
Confidence            9999999999999887   8999999999999999999999999999999999998754


No 3  
>PHA02740 protein tyrosine phosphatase; Provisional
Probab=100.00  E-value=5.9e-76  Score=578.40  Aligned_cols=269  Identities=19%  Similarity=0.404  Sum_probs=241.1

Q ss_pred             cceeccchhhhccCCcc---hHHHhHHhhcCCCCCcccccccccCCCCCC--CCCCCCCCCcceEecCCCCCCCCCcEEe
Q psy15334        149 HPIPVTLFCTMTLDPDW---IAVEFEDLKSVGRGQPTTAAELRCNRPKNR--YTNIFPYDSSRYKLQPVDDEEGSDYVNA  223 (451)
Q Consensus       149 ~~i~~~~f~~~~~~~~~---~~~Ef~~l~~~~~~~~~~~~~~~~N~~KNR--~~~i~pyD~sRV~L~~~~~~~~~dYINA  223 (451)
                      .++++++|......++.   +.+||++|.......++..+.+++|+.|||  |.||+|||||||+|++     ++|||||
T Consensus         7 ~~~~~~~f~~~~~~~~~~~~i~~E~~~i~~~~~~~~~~~~~~~~n~~knr~r~~~i~~~D~sRV~L~~-----~~~YINA   81 (298)
T PHA02740          7 VDINGMDFINFINKPDLLSCIIKEYRAIVPEHEDEANKACAQAENKAKDENLALHITRLLHRRIKLFN-----DEKVLDA   81 (298)
T ss_pred             cccCHHHHHHHHhCCCHHHHHHHHHHHHhccCccchhhhhhchhhhcccccccCCcCCccCceEEecC-----CCCcEEE
Confidence            46788899555455544   999999998776677888899999999996  6899999999999964     4589999


Q ss_pred             eeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceeccccccc-ccCCCC-CCCceeeeEEEEEeeeEee
Q psy15334        224 NYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCY-KYWPSD-QHPMRCADIEISLQDVCHY  301 (451)
Q Consensus       224 s~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~-~YwP~~-~~~~~~g~~~V~~~~~~~~  301 (451)
                      ||||||..+++|||||||+++|++|||+||||+++.+|||||+..|   .||. +|||.+ +....||+|.|++.++...
T Consensus        82 nyVdG~~~~~~fIaTQ~Pl~~T~~dFW~MVwe~~~~iIVMLt~~~e---~kc~~~YWP~~~~~~~~~g~f~V~~~~~~~~  158 (298)
T PHA02740         82 RFVDGYDFEQKFICIINLCEDACDKFLQALSDNKVQIIVLISRHAD---KKCFNQFWSLKEGCVITSDKFQIETLEIIIK  158 (298)
T ss_pred             EEecCCCCCCcEEEecCCchhhHHHHHHHHHhcCCCEEEEcccccc---ccccccCCCCCCCCeEEECCEEEEEEEEEec
Confidence            9999999999999999999999999999999999999999999877   4798 999975 4457899999999998888


Q ss_pred             cceEEEEEEEEe-CCeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhc--------CCCCCCEEEEcCCCCchhhhH
Q psy15334        302 GSWVLSELRMCR-GGEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHI--------SPAQHPIVVHCSAGVGRTGTF  372 (451)
Q Consensus       302 ~~~~~r~l~v~~-~~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~--------~~~~~PivVHCs~GvGRtG~f  372 (451)
                      .++++|.|.+++ .+++|.|+||||++|||+|+|.++..+++|+..|++..        ....+|||||||||+||||+|
T Consensus       159 ~~~~~~~l~l~~~~~~~r~V~Hfqyt~WPd~gvP~~~~~~l~fi~~V~~~~~~~~~~~~~~~~~PIVVHCSaGvGRTGtF  238 (298)
T PHA02740        159 PHFNLTLLSLTDKFGQAQKISHFQYTAWPADGFSHDPDAFIDFFCNIDDLCADLEKHKADGKIAPIIIDCIDGISSSAVF  238 (298)
T ss_pred             CCEEEEEEEEEcCCCCcEEEEEEeecCCCCCCcCCCHHHHHHHHHHHHHHHHHHHHhhccCCCCCEEEECCCCCchhHHH
Confidence            999999999975 46799999999999999999999999999998887531        223589999999999999999


Q ss_pred             HHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHHHHhcC
Q psy15334        373 IALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQVYLQGG  428 (451)
Q Consensus       373 iai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~~l~~~  428 (451)
                      ||+|++++++..+   +.|||+++|++||+||++||||.+||.|||++|++|+...
T Consensus       239 caiDi~l~~~~~~---~~vdi~~~V~~lR~qR~~~Vqt~~QY~F~y~~l~~yl~~~  291 (298)
T PHA02740        239 CVFDICATEFDKT---GMLSIANALKKVRQKKYGCMNCLDDYVFCYHLIAAYLKEK  291 (298)
T ss_pred             HHHHHHHHHHHhc---CcccHHHHHHHHHhhCccccCCHHHHHHHHHHHHHHHHHh
Confidence            9999999999988   9999999999999999999999999999999999998764


No 4  
>KOG0791|consensus
Probab=100.00  E-value=1.7e-76  Score=571.76  Aligned_cols=292  Identities=48%  Similarity=0.855  Sum_probs=275.8

Q ss_pred             Ccceeccch----hhhccCCcc-hHHHhHHhhcCCCCCcccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEE
Q psy15334        148 SHPIPVTLF----CTMTLDPDW-IAVEFEDLKSVGRGQPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVN  222 (451)
Q Consensus       148 ~~~i~~~~f----~~~~~~~~~-~~~Ef~~l~~~~~~~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYIN  222 (451)
                      +.+|++++|    +.+..|++. |++||++|+..+.++++.+|.+++|.+||||.||+|||+|||+|.|+.+++++||||
T Consensus        75 s~~i~len~e~~~~~~~~dsn~~f~eey~~lk~~g~~~~~~aA~l~~n~~knr~~nilpYd~Srvkl~~~~e~~~~DYin  154 (374)
T KOG0791|consen   75 TNPVSLENFELHVKKMSADSNYKFSEEYNELKLVGLDQSKLAALLPLNRPKNRYTNILPYDNSRVKLSPVNEEEGSDYIN  154 (374)
T ss_pred             CCceeeccchhhhhhcccccchhhHHHHhhhhhhcccchhhhhhcCCCCCcCCcCCCCCcccccceeccccccchhhhhh
Confidence            458889998    677889888 999999999999999999999999999999999999999999999999999999999


Q ss_pred             eeeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEeec
Q psy15334        223 ANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYG  302 (451)
Q Consensus       223 As~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~  302 (451)
                      ||||+|+.+++.|||||||+++|..|||+||||+++.+|||||.+.|.++.||++|||.+..++.||+++|+.++++...
T Consensus       155 A~~~Pg~~s~~~fIAtQGPlp~t~~~fW~mvwq~~~~~IVmlt~~~e~~R~kc~~ywP~~~~~~~~gdi~V~~v~e~~~~  234 (374)
T KOG0791|consen  155 ASYIPGYNSPREFIATQGPLPETRDDFWKMVWQQKSHIIVMLTKCNEKGRVKCDEYWPDEEVPVAYGDITVTMVSEESLD  234 (374)
T ss_pred             hhhcCCCCCcceEEEeeCCCCCChhHHHHHHHHhhcceEEEEeecccccchhhhhhcccccccceeccEEEEEechhhcC
Confidence            99999999999999999999999999999999999999999999999999999999999888899999999999999999


Q ss_pred             ceEEEEEEEEeCCeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHH
Q psy15334        303 SWVLSELRMCRGGEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRL  382 (451)
Q Consensus       303 ~~~~r~l~v~~~~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l  382 (451)
                      +|++|+|.+...+++|.++||||+.|||||+|.++..+++|++.+++......+|++|||||||||||||+|+|.+++|+
T Consensus       235 ~w~ir~~~l~~~~~sr~ir~f~y~~wPd~gvp~~~~sl~~f~~~~r~~~~~~~~p~iVhCSAGVgRTGTFiald~LLqq~  314 (374)
T KOG0791|consen  235 EWTIREFRLNYAGESRKIRHFHYTAWPDFGVPSSTESLLQFVRMVRQSLDTSKGPTIVHCSAGVGRTGTFIALDRLLQQI  314 (374)
T ss_pred             CceEEEEeeecccccceeEEEEEeeccccCCCCCchhHHHHHHHHHhhcccCCCceeEEeecccccccchHhHHHHHHHh
Confidence            99999999998899999999999999999999999999999999999988788999999999999999999999999999


Q ss_pred             HcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHHHHhcCCC--Cchhhhhhcccc
Q psy15334        383 YNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQVYLQGGVR--HEERQIQNNEAI  442 (451)
Q Consensus       383 ~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~~l~~~~~--~~~~~~~~~~~~  442 (451)
                      ..+   +.+|++++|.+||.+|+.||||++||+|+|+|+++.|.++..  .....++.|.++
T Consensus       315 ~~~---~~vdi~~iv~~lR~~R~~mVqte~Qyvfl~~c~~~~l~~~~~~~~~s~~~~~n~~~  373 (374)
T KOG0791|consen  315 DSE---ETVDIFGVVLELRSARMLMVQTEDQYVFLHQCVLESLQGKKPQLCISDVIYENVGP  373 (374)
T ss_pred             ccc---ccccHHHHHHHhhhccccccchHHHHHHHHHHHHHHHhCCCcccccchhcccCCCC
Confidence            888   899999999999999999999999999999999999998864  344455555443


No 5  
>PHA02747 protein tyrosine phosphatase; Provisional
Probab=100.00  E-value=1.7e-74  Score=573.24  Aligned_cols=277  Identities=29%  Similarity=0.504  Sum_probs=247.9

Q ss_pred             CcceeccchhhhccCCcc---hHHHhHHhhcCCCCCcccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEee
Q psy15334        148 SHPIPVTLFCTMTLDPDW---IAVEFEDLKSVGRGQPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNAN  224 (451)
Q Consensus       148 ~~~i~~~~f~~~~~~~~~---~~~Ef~~l~~~~~~~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs  224 (451)
                      ...+.+.||.+...+++.   +.+||+.|.....+..+..+..++|..||||.||+||||+||+|.+.++ +.+||||||
T Consensus         6 ~~~~~~~~f~~~~~~~~~~~~i~~E~~~i~~~~~~~~~~~~~~~~N~~kNRy~di~p~D~sRV~L~~~~~-~~~dYINAs   84 (312)
T PHA02747          6 FAECRAIDFLKRRNQLNCFGIIRDEHHQIILKPFDGLIANFEKPENQPKNRYWDIPCWDHNRVILDSGGG-STSDYIHAN   84 (312)
T ss_pred             ccccCHHHHHHHHhCCCHHHHHHHHHHHHhcCCCCChhhhhhCchhhccCCCCCCCCCCCCEeEecCCCC-CCCCcEEee
Confidence            456788999666666665   8999999976655667778889999999999999999999999987443 468999999


Q ss_pred             eecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccce-ecccccccccCCC-CCCCceeeeEEEEEeeeEeec
Q psy15334        225 YIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCI-EKHRDKCYKYWPS-DQHPMRCADIEISLQDVCHYG  302 (451)
Q Consensus       225 ~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~-E~~~~kc~~YwP~-~~~~~~~g~~~V~~~~~~~~~  302 (451)
                      ||+||..+++|||||||+++|++|||+||||++|.+|||||... |.|+.||.+|||. +++.+.+|++.|++.+.....
T Consensus        85 yV~g~~~~~~yIaTQ~Pl~~T~~dFW~MVwe~~~~~IVMLt~~~~e~g~~kc~~YWp~~~~~~~~~g~~~V~~~~~~~~~  164 (312)
T PHA02747         85 WIDGFEDDKKFIATQGPFAETCADFWKAVWQEHCSIIVMLTPTKGTNGEEKCYQYWCLNEDGNIDMEDFRIETLKTSVRA  164 (312)
T ss_pred             eecCCCCCCeEEEeCCCcchhHHHHHHHHHhccCCEEEEcccccccCCcccccccCCCCCCCCeEeeeEEEEEEEEEecC
Confidence            99999999999999999999999999999999999999999999 8999999999986 455788999999999998889


Q ss_pred             ceEEEEEEEEeC--CeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhc---C-------CCCCCEEEEcCCCCchhh
Q psy15334        303 SWVLSELRMCRG--GEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHI---S-------PAQHPIVVHCSAGVGRTG  370 (451)
Q Consensus       303 ~~~~r~l~v~~~--~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~---~-------~~~~PivVHCs~GvGRtG  370 (451)
                      ++++|.|.++..  ++.|.|+||||++|||+|+|.++..+++|+..++...   .       ...+|||||||||+||||
T Consensus       165 ~~~~~~l~l~~~~~~~~r~V~h~~y~~Wpd~~~P~~~~~~l~fi~~v~~~~~~~~~~~~~~~~~~~PIvVHCsaGvGRtG  244 (312)
T PHA02747        165 KYILTLIEITDKILKDSRKISHFQCSEWFEDETPSDHPDFIKFIKIIDINRKKSGKLFNPKDALLCPIVVHCSDGVGKTG  244 (312)
T ss_pred             CeEEEEEEEEECCCCCceEEEEEEECCCCCCCCCCChHHHHHHHHHHHHHHHHhhccccccccCCCCEEEEecCCCcchh
Confidence            999999999863  6899999999999999999999999999987765421   1       123799999999999999


Q ss_pred             hHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHH---HHHHHHHHhcC
Q psy15334        371 TFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFI---YECVQVYLQGG  428 (451)
Q Consensus       371 ~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fi---y~~l~~~l~~~  428 (451)
                      +|||+|++++++..+   +.+||+++|+.||+||++||||.+||.||   |++|.+||...
T Consensus       245 tfcaidi~i~~l~~~---~~v~v~~~V~~lR~qR~~~Vqt~~QY~F~~~~Y~~l~~~~~~~  302 (312)
T PHA02747        245 IFCAVDICLNQLVKR---KAICLAKTAEKIREQRHAGIMNFDDYLFIQPGYEVLHYFLSKI  302 (312)
T ss_pred             HHHHHHHHHHHHHhc---CCCCHHHHHHHHHhccccccCCHHHHHHHHHHHHHHHHHHHHh
Confidence            999999999999888   89999999999999999999999999999   99999998764


No 6  
>PHA02746 protein tyrosine phosphatase; Provisional
Probab=100.00  E-value=3.1e-74  Score=573.55  Aligned_cols=276  Identities=26%  Similarity=0.481  Sum_probs=246.6

Q ss_pred             CcceeccchhhhccCCcc---hHHHhHHhhcCCCCCcccccccccCCCCCCCCCCCCCCCcceEecCCC-----------
Q psy15334        148 SHPIPVTLFCTMTLDPDW---IAVEFEDLKSVGRGQPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVD-----------  213 (451)
Q Consensus       148 ~~~i~~~~f~~~~~~~~~---~~~Ef~~l~~~~~~~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~-----------  213 (451)
                      ...+.+.+|......+++   +.+||.+|.......++..+..++|+.||||.||+|||||||+|.+..           
T Consensus         6 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~i~~~~~~~t~~~~~~~~N~~KNRy~di~p~D~sRV~L~~~~~~~~~~~~~~~   85 (323)
T PHA02746          6 FEIFNAFDFFDKTNHAKFCEFVLLEHAEVMDIPIRGTTNHFLKKENLKKNRFHDIPCWDHSRVVINAHESLKMFDVGDSD   85 (323)
T ss_pred             cccCCHHHHHHHhcCCCHHHHHHHHHHHHhcCCCCcchhhccChhhccccCCCCCCCCcCCEEEecCCCCcccccccccc
Confidence            345778888666666666   889999998877777888899999999999999999999999998743           


Q ss_pred             --------CCCCCCcEEeeeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCC-CC
Q psy15334        214 --------DEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSD-QH  284 (451)
Q Consensus       214 --------~~~~~dYINAs~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~-~~  284 (451)
                              +++++||||||||+||..+++|||||||+++|++|||+||||+++.+|||||+.. .++.||.+|||.+ +.
T Consensus        86 ~~~~~~~~~~~~~dYINAsyV~g~~~~~~fIaTQ~Pl~~T~~dFW~MIwe~~v~vIVMLt~~~-e~~~kc~~YWP~~~~~  164 (323)
T PHA02746         86 GKKIEVTSEDNAENYIHANFVDGFKEANKFICAQGPKEDTSEDFFKLISEHESQVIVSLTDID-DDDEKCFELWTKEEDS  164 (323)
T ss_pred             ccccccccCCCCCCeEEEecccCCCCCCeEEEeCCCChhhHHHHHHHHHhhCCCEEEEecccc-cCCccCCCCCCCCCCC
Confidence                    2345899999999999999999999999999999999999999999999999975 5678999999976 44


Q ss_pred             CceeeeEEEEEeeeEeecceEEEEEEEEeC--CeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhc----------C
Q psy15334        285 PMRCADIEISLQDVCHYGSWVLSELRMCRG--GEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHI----------S  352 (451)
Q Consensus       285 ~~~~g~~~V~~~~~~~~~~~~~r~l~v~~~--~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~----------~  352 (451)
                      .+.||++.|++.+.....+++.+.|.+++.  ++.|.|+||||++|||+|+|.++..+++|++.|++..          .
T Consensus       165 ~~~~g~~~V~~~~~~~~~~~~~~~l~l~~~~~~~~r~V~h~~y~~Wpd~gvP~~~~~~l~~i~~v~~~~~~~~~~~~~~~  244 (323)
T PHA02746        165 ELAFGRFVAKILDIIEELSFTKTRLMITDKISDTSREIHHFWFPDWPDNGIPTGMAEFLELINKVNEEQAELIKQADNDP  244 (323)
T ss_pred             CeEEcCEEEEEEEEEEcCCeEEEEEEEEECCCCCceEEEEEEECCCCCCCcCCCHHHHHHHHHHHHHHHHHHhhhhccCC
Confidence            678999999999888888899999999853  5789999999999999999999999999999887542          1


Q ss_pred             CCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHHHHhc
Q psy15334        353 PAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQVYLQG  427 (451)
Q Consensus       353 ~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~~l~~  427 (451)
                      +..+|||||||+|+||||+|||+|++++++..+   +.+||+++|+.||+||++||||.+||.|||++|.+||..
T Consensus       245 ~~~~PIvVHCsaGvGRTGtfcaid~~l~~l~~~---~~vdv~~~V~~lR~qR~~~Vqt~~QY~F~y~~l~~~l~~  316 (323)
T PHA02746        245 QTLGPIVVHCSAGIGRAGTFCAIDNALEQLEKE---KEVCLGEIVLKIRKQRHSSVFLPEQYAFCYKALKYAIIE  316 (323)
T ss_pred             CCCCCEEEEcCCCCCcchhHHHHHHHHHHHHhc---CCCCHHHHHHHHHhcccccCCCHHHHHHHHHHHHHHHHH
Confidence            124899999999999999999999999999888   899999999999999999999999999999999998754


No 7  
>PHA02738 hypothetical protein; Provisional
Probab=100.00  E-value=4.3e-73  Score=564.63  Aligned_cols=277  Identities=32%  Similarity=0.536  Sum_probs=244.4

Q ss_pred             CCCcceeccchhhhccCCcc---hHHHhHHhhcCCCCCcccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEE
Q psy15334        146 NCSHPIPVTLFCTMTLDPDW---IAVEFEDLKSVGRGQPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVN  222 (451)
Q Consensus       146 ~~~~~i~~~~f~~~~~~~~~---~~~Ef~~l~~~~~~~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYIN  222 (451)
                      ++.+.+.+++|......++.   +.+||+.|.....+.++..+.  .|+.||||.||+|||++||+|++.  .+++||||
T Consensus         4 ~~~~~~~~~~fl~~~~~~~~~~~i~~E~~~i~~~~~~~t~~~~~--~N~~kNR~~~v~~~d~sRV~L~~~--~~~~dYIN   79 (320)
T PHA02738          4 CKFRELKYAEFLALMEKSDCEEVITREHQKVISEKVDGTFNAEK--KNRKLNRYLDAVCFDHSRVILPAE--RNRGDYIN   79 (320)
T ss_pred             cccccCCHHHHHHHHhCCCHHHHHHHHHHHHhcCCCCcchhHHH--HhhhhCCCCCcCCCcCCEEEeCCC--CCCCCceE
Confidence            44556778888544445544   899999997655555555543  499999999999999999999753  34689999


Q ss_pred             eeeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCC-CCCceeeeEEEEEeeeEee
Q psy15334        223 ANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSD-QHPMRCADIEISLQDVCHY  301 (451)
Q Consensus       223 As~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~-~~~~~~g~~~V~~~~~~~~  301 (451)
                      ||||+|+..+++|||||||+++|++|||+||||++|.+|||||+..|.++.+|.+|||.+ +..+.||+|+|++.+....
T Consensus        80 AsyVdg~~~~~kfI~TQ~Pl~~T~~dFW~MVwe~~v~~IVmL~~~~E~~~~kc~~YWp~~~~~~~~~g~f~V~~~~~~~~  159 (320)
T PHA02738         80 ANYVDGFEYKKKFICGQAPTRQTCYDFYRMLWMEHVQIIVMLCKKKENGREKCFPYWSDVEQGSIRFGKFKITTTQVETH  159 (320)
T ss_pred             eEEecCCCCCceeEEecCCChHHHHHHHHHHHHhCCcEEEEeeehhhCCeeeccccCCCCCCCceEeccEEEEEEEEEec
Confidence            999999999999999999999999999999999999999999999999999999999974 5678999999999999989


Q ss_pred             cceEEEEEEEEeC-CeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcC-------------CCCCCEEEEcCCCCc
Q psy15334        302 GSWVLSELRMCRG-GEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHIS-------------PAQHPIVVHCSAGVG  367 (451)
Q Consensus       302 ~~~~~r~l~v~~~-~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~-------------~~~~PivVHCs~GvG  367 (451)
                      .+++.|.|.++.+ ++.|.|+||||++|||+|+|.++..+++|+..|++...             ...+|||||||+|+|
T Consensus       160 ~~~~~~~l~l~~~~~~~r~V~h~~y~~Wpd~gvP~~~~~~l~fi~~V~~~~~~~~~~~~~~~~~~~~~~PIVVHCs~GiG  239 (320)
T PHA02738        160 PHYVKSTLLLTDGTSATQTVTHFNFTAWPDHDVPKNTSEFLNFVLEVRQCQKELAQESLQIGHNRLQPPPIVVHCNAGLG  239 (320)
T ss_pred             CCeEEEEEEEEeCCCCcEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHhhhhhcccCccccCCCCeEEEcCCCCC
Confidence            9999999999864 57899999999999999999999999999998875321             124799999999999


Q ss_pred             hhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHHHHhcCC
Q psy15334        368 RTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQVYLQGGV  429 (451)
Q Consensus       368 RtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~~l~~~~  429 (451)
                      |||+|||+|++++++..+   +.+||+++|+.||+||++||||.+||.|||++|++|+....
T Consensus       240 RtGtFcaidi~i~~~~~~---~~vdv~~~V~~lR~qR~~~vqt~~QY~F~y~~l~~y~~~~~  298 (320)
T PHA02738        240 RTPCYCVVDISISRFDAC---ATVSIPSIVSSIRNQRYYSLFIPFQYFFCYRAVKRYVNLTV  298 (320)
T ss_pred             hhhhhhHHHHHHHHHHhc---CCcCHHHHHHHHHhhhhhccCCHHHHHHHHHHHHHHHHhcC
Confidence            999999999999999988   89999999999999999999999999999999999988654


No 8  
>KOG0790|consensus
Probab=100.00  E-value=1e-73  Score=558.13  Aligned_cols=278  Identities=40%  Similarity=0.654  Sum_probs=244.6

Q ss_pred             hHHHhHHhhcCCCC--CcccccccccCCCCCCCCCCCCCCCcceEecCCC-CCCCCCcEEeeeecC------C--CCCcc
Q psy15334        166 IAVEFEDLKSVGRG--QPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVD-DEEGSDYVNANYIGG------Y--LSPRQ  234 (451)
Q Consensus       166 ~~~Ef~~l~~~~~~--~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~-~~~~~dYINAs~i~g------~--~~~~~  234 (451)
                      |.+||+.|++....  .+...+..|+|+.||||.||+||||+||+|...+ ..+++||||||||--      .  .-++.
T Consensus       246 fwEEFesLqq~~~~~~~sr~EG~r~eN~~KNRYkNIlPfDHtRViL~d~~~n~pgsdYINAnyi~~~~q~~~~~~~~kKs  325 (600)
T KOG0790|consen  246 FWEEFESLQQQEVKNLHSRLEGQRPENKGKNRYKNILPFDHTRVILQDRDPNIPGSDYINANYIMIEFQRLCNNSKPKKS  325 (600)
T ss_pred             hhHHHHHhhhhhhHhhhhhhccccchhhccccccccCcccceeeEeecCCCCCccchhcccchhhhhhhhccCccccccc
Confidence            99999999765432  2344677899999999999999999999998663 468999999999831      1  12357


Q ss_pred             EEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEeecceEEEEEEEE--
Q psy15334        235 YIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYGSWVLSELRMC--  312 (451)
Q Consensus       235 yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~~~~~r~l~v~--  312 (451)
                      ||||||-|.+|+.|||+||||+|+++|||-|...|.|+.||.+|||+++....||.+.|++..+....+|++|+|.++  
T Consensus       326 yIAtQGCL~nTVnDFW~MvwQENsrVIVMtTkE~ERgK~KC~~YWPee~~~e~~G~~~v~~v~E~~t~dY~LR~l~vs~~  405 (600)
T KOG0790|consen  326 YIATQGCLQNTVNDFWRMVWQENSRVIVMTTKEVERGKSKCVKYWPEEGALEEYGVMRVRNVKESDTHDYTLRELKVSKL  405 (600)
T ss_pred             eeehhhHHHHHHHHHHHHHHhccceEEEEehhhhhcccccccccCCcccchhhcCceEEEeccccccccceehheeeccc
Confidence            999999999999999999999999999999999999999999999998888899999999999999999999999998  


Q ss_pred             -eCCeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcC--CCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCC
Q psy15334        313 -RGGEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHIS--PAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAG  389 (451)
Q Consensus       313 -~~~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~--~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~  389 (451)
                       +++..|.|+||||..|||||||.+|..+|+|+..|.....  ...||||||||||+|||||||+||++|+++.+..-.-
T Consensus       406 ~~g~~~R~I~~yh~~tWPDHGvP~dPg~vLnFLe~V~~rq~~l~~AgpIvVHCSAGIGrTGTfiViD~lld~I~~~Gldc  485 (600)
T KOG0790|consen  406 GNGNLEREIWHYHYLTWPDHGVPSDPGGVLNFLEEVNHRQESLMDAGPIVVHCSAGIGRTGTFIVIDMLLDQIREKGLDC  485 (600)
T ss_pred             cCCcchhhhhhhheeecccCCCcCCccHHHHHHHHhhhhhccccccCcEEEEccCCcCCcceEEEhHHHHHHHHhcCCCC
Confidence             5678999999999999999999999999999999976543  3578999999999999999999999999999874445


Q ss_pred             CCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHHHHhcCC-CCchhhhhhccccc
Q psy15334        390 SVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQVYLQGGV-RHEERQIQNNEAIE  443 (451)
Q Consensus       390 ~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~~l~~~~-~~~~~~~~~~~~~~  443 (451)
                      .+||..+|+.+|.||.|||||+.||.|||.||.+||+... ...+.+..+-+++|
T Consensus       486 ~iDi~ktIqmVRsqRSGmVQTEaQYkFiY~Avqhyietl~~r~e~eq~~~~~g~e  540 (600)
T KOG0790|consen  486 DIDIQKTIQMVRSQRSGMVQTEAQYKFIYVAVQHYIETLQKRIEEEQKSKRKGHE  540 (600)
T ss_pred             cccHHHHHHHHHHHhcchhhhHHhHHHHHHHHHHHHHHHHHHHHHHHhhcccccc
Confidence            7999999999999999999999999999999999999754 34444555555544


No 9  
>KOG0792|consensus
Probab=100.00  E-value=1.6e-68  Score=567.01  Aligned_cols=266  Identities=39%  Similarity=0.666  Sum_probs=243.9

Q ss_pred             CcchHHHhHHhhcCCCCCcccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEeeeecCC--CCCccEEEecC
Q psy15334        163 PDWIAVEFEDLKSVGRGQPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGY--LSPRQYIATQG  240 (451)
Q Consensus       163 ~~~~~~Ef~~l~~~~~~~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs~i~g~--~~~~~yI~tQ~  240 (451)
                      ++.+..|||.|....+...|+.|.+|+|..||||.||+|||.+||.|.+..+. +..|||||+|.-.  +..+.||||||
T Consensus       867 ~g~v~~efE~i~rkkp~~~~~~a~lpeN~~KNRyrdvlPYD~tRV~L~~~~~~-nagYINAS~i~ipvgg~e~~YIA~QG  945 (1144)
T KOG0792|consen  867 DGMVFTEFEQIYRKKPGMACSIARLPENAPKNRYRDVLPYDDTRVELQPSKDN-NAGYINASHIEIPVGGIENRYIACQG  945 (1144)
T ss_pred             cCchHHHHHHHhhhCCCceeeeccccccccccccCCcCCCcCceeeeccCCCC-CcccccceeeEeeecceeEEEEEecC
Confidence            34467799998776688899999999999999999999999999999986544 7799999999754  44578999999


Q ss_pred             CCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCC-CCCceeeeEEEEEeeeEeecceEEEEEEEEe--CCee
Q psy15334        241 PLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSD-QHPMRCADIEISLQDVCHYGSWVLSELRMCR--GGEH  317 (451)
Q Consensus       241 Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~-~~~~~~g~~~V~~~~~~~~~~~~~r~l~v~~--~~~~  317 (451)
                      ||+.|+.|||+|||||++.+|||||..+|.|+.||.+|||.. .+.+.||.|.|++..+.....|++|+|++.+  .++.
T Consensus       946 PLp~T~~DFWQMVWEQ~~~lIvMlT~e~EggR~KchqYWPr~~~~t~~ygrf~v~~~~~~~t~~y~tr~m~l~~~~t~ee 1025 (1144)
T KOG0792|consen  946 PLPHTCTDFWQMVWEQGSTLIVMLTTEVEGGRVKCHQYWPRLGHETMEYGRFQVTCVFEQQTTCYVTREMTLKDLQTREE 1025 (1144)
T ss_pred             CCcchHHHHHHHHHhcCceEEEEEeehhhcCeeccccccCCCCccceeccceEEEEEEecccccEEEEeEEEeeccCCce
Confidence            999999999999999999999999999999999999999987 4678999999999999999999999999986  5799


Q ss_pred             eEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHH
Q psy15334        318 RMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIV  397 (451)
Q Consensus       318 r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v  397 (451)
                      |+|+|+||++|||||+|+++..||+|+..++......+.|||||||||+||||++|.+++++..|+.+   ..+||.++|
T Consensus      1026 R~V~hLQYtaWPDHg~P~D~~~FL~FleevrsvR~~t~pPilvHCSAGiGRTGVlIl~e~~l~lle~N---e~vdi~div 1102 (1144)
T KOG0792|consen 1026 RTVWHLQYTAWPDHGVPDDPNDFLDFLEEVRSVRRGTNPPILVHCSAGIGRTGVLILMETALCLLEHN---EPVDILDIV 1102 (1144)
T ss_pred             eeeeeeeecccccCCCCCChHHHHHHHHHHHHHhccCCCCeEEEccCCCCcceehHHHHHHHHHHhcC---CCCCHHHHH
Confidence            99999999999999999999999999999987655445699999999999999999999999999988   899999999


Q ss_pred             HHHHhhccCCCCCHHHHHHHHHHHHHHHhcCCCCc
Q psy15334        398 HGLRRQRVLMVQSLAQYAFIYECVQVYLQGGVRHE  432 (451)
Q Consensus       398 ~~lR~qR~~~Vqt~~QY~fiy~~l~~~l~~~~~~~  432 (451)
                      +.||.||..||||..||.|||++|+.+++.....+
T Consensus      1103 r~mR~QR~~mVQT~~QYkFVyevil~~l~~~~l~~ 1137 (1144)
T KOG0792|consen 1103 RTMRDQRAMMVQTLSQYKFVYEVILRVLKRGRLIP 1137 (1144)
T ss_pred             HHHHHHHhhhccchHHhhHHHHHHHHHHHhccccc
Confidence            99999999999999999999999999998765443


No 10 
>smart00194 PTPc Protein tyrosine phosphatase, catalytic domain.
Probab=100.00  E-value=1.1e-67  Score=513.36  Aligned_cols=253  Identities=49%  Similarity=0.871  Sum_probs=236.1

Q ss_pred             hHHHhHHhhcCCCC-CcccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEeeeecCCCCCccEEEecCCCCC
Q psy15334        166 IAVEFEDLKSVGRG-QPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPLCS  244 (451)
Q Consensus       166 ~~~Ef~~l~~~~~~-~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs~i~g~~~~~~yI~tQ~Pl~~  244 (451)
                      |.+||+.|.+.... .++..+..++|..||||.||+|||++||+|.+.++ +++||||||||+|+..+++||+||+|+++
T Consensus         2 ~~~e~~~i~~~~~~~~~~~~~~~~~n~~kNR~~~v~~~d~~RV~l~~~~~-~~~~YInAs~v~~~~~~~~fI~tQ~P~~~   80 (258)
T smart00194        2 LEEEFEKLDRLKPDDESCTVAARPENRDKNRYKDVLPYDHTRVKLTPPPG-EGSDYINASYIDGPNGPKAYIATQGPLPS   80 (258)
T ss_pred             hHHHHHHhhccCCCCccccccCCccccccCCCCCCCCCCCCEeEccCCCC-CCCCceeeeeEecCCCccceEEeCCCchH
Confidence            67899999876655 67888999999999999999999999999986443 57899999999999999999999999999


Q ss_pred             CHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEeecceEEEEEEEEeCC--eeeEEEE
Q psy15334        245 TVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYGSWVLSELRMCRGG--EHRMISH  322 (451)
Q Consensus       245 T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~~~~~r~l~v~~~~--~~r~V~h  322 (451)
                      |++|||+||||+++.+||||+...|.++.+|.+|||.+.+.+.||.++|++.+.....+++.|+|.++..+  +.|.|+|
T Consensus        81 t~~dFW~mv~~~~~~~IVmL~~~~e~~~~~~~~Ywp~~~~~~~~g~~~V~~~~~~~~~~~~~~~l~v~~~~~~~~~~v~~  160 (258)
T smart00194       81 TVEDFWRMVWEEKVTTIVMLTELEEKGREKCAQYWPEEEEPLTYGDITVTLKSVEKVDDYTIRTLEVTNTGCSETRTVTH  160 (258)
T ss_pred             HHHHHHHHHHhcCCCEEEEeccceeCCEeeccccCCCCCCcceECCEEEEEEEEEecCCEEEEEEEEEECCCCCcEEEEE
Confidence            99999999999999999999999999999999999987767899999999999999999999999998754  8999999


Q ss_pred             EEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHh
Q psy15334        323 YHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRR  402 (451)
Q Consensus       323 ~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~  402 (451)
                      |||++|||+++|.++..+++|++.++.......+||+|||++|+||||+|||++++++++..+   +.+|++++++.||+
T Consensus       161 ~~y~~W~d~~~P~~~~~~~~~i~~v~~~~~~~~~pivVHC~~G~gRsg~f~a~~~~~~~l~~~---~~v~v~~~v~~lR~  237 (258)
T smart00194      161 YHYTNWPDHGVPESPKSILDLVRAVRKSQSTSTGPIVVHCSAGVGRTGTFIAIDILLQQLEAG---KEVDIFEIVKELRS  237 (258)
T ss_pred             EeeCCCCCCCCCCCHHHHHHHHHHHHHhhccCCCCEEEEeCCCCCccchhhHHHHHHHHHHHc---CCCCHHHHHHHHHh
Confidence            999999999999999999999999998765446899999999999999999999999999888   89999999999999


Q ss_pred             hccCCCCCHHHHHHHHHHHH
Q psy15334        403 QRVLMVQSLAQYAFIYECVQ  422 (451)
Q Consensus       403 qR~~~Vqt~~QY~fiy~~l~  422 (451)
                      ||++||++.+||.|||++++
T Consensus       238 ~R~~~v~~~~Qy~f~~~~l~  257 (258)
T smart00194      238 QRPGMVQTEEQYIFLYRAIL  257 (258)
T ss_pred             ccccccCCHHHHHHHHHHHh
Confidence            99999999999999999986


No 11 
>cd00047 PTPc Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
Probab=100.00  E-value=4e-61  Score=459.63  Aligned_cols=227  Identities=51%  Similarity=0.959  Sum_probs=211.4

Q ss_pred             CCCCCCCCCCCCCcceEecCCCCCCCCCcEEeeeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecc
Q psy15334        192 PKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKH  271 (451)
Q Consensus       192 ~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~  271 (451)
                      .||||+|++|||++||+|.... .+++||||||||+|+..+++||+||+|+.+|++|||+||||+++.+||||+...|.+
T Consensus         1 ~knR~~~~~~~d~~rV~l~~~~-~~~~~yinA~~v~~~~~~~~fI~tQ~P~~~t~~~FW~mv~~~~~~~IV~l~~~~e~~   79 (231)
T cd00047           1 KKNRYKDILPYDHTRVKLKPDD-DEGSDYINASYIDGYNPPKAYIATQGPLPNTVEDFWRMVWEQKVPVIVMLTELVEKG   79 (231)
T ss_pred             CCCCCCCCCCCCCCEEEccCCC-CCCCCcEEeeeEecCCCCcceEEcCCCChhhHHHHHHHHHhcCCCEEEEccccccCC
Confidence            4899999999999999997632 256899999999999989999999999999999999999999999999999999999


Q ss_pred             cccccccCCCCCCCceeeeEEEEEeeeEeecceEEEEEEEEeC--CeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHH
Q psy15334        272 RDKCYKYWPSDQHPMRCADIEISLQDVCHYGSWVLSELRMCRG--GEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRT  349 (451)
Q Consensus       272 ~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~~~~~r~l~v~~~--~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~  349 (451)
                      ..+|.+|||.+.+...||.++|++++.....+++.|+|.|+..  +..|.|+||||++|||+++|.+...+++|+..+++
T Consensus        80 ~~~~~~Ywp~~~~~~~~g~~~V~~~~~~~~~~~~~~~l~i~~~~~~~~~~V~~~~~~~W~d~~~p~~~~~~~~~~~~v~~  159 (231)
T cd00047          80 REKCAQYWPEEEGSLTYGDITVTLVSEEKLDDYTVRTLKLSNTGTGETRTVTHFQYTGWPDHGVPESPDSLLDLLRKVRK  159 (231)
T ss_pred             CccCccCCCCCCCCeEecCEEEEEEEEEEcCCEEEEEEEEEECCCCCceEEEEEeECCCCCCCccCChHHHHHHHHHHHH
Confidence            9999999998756789999999999999999999999999875  48999999999999999999999999999999987


Q ss_pred             hcC-CCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHH
Q psy15334        350 HIS-PAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQ  422 (451)
Q Consensus       350 ~~~-~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~  422 (451)
                      ... ...+||+|||++|+||||+|||++++++++..+   +.+|+.++++.||+||++||+|.+||.|||++++
T Consensus       160 ~~~~~~~~pivVHC~~G~gRsg~~~a~~~~~~~~~~~---~~~~~~~~v~~iR~~R~~~v~~~~Qy~f~~~~l~  230 (231)
T cd00047         160 SQQQPGSGPIVVHCSAGVGRTGTFIAIDILLQRLEAE---GVVDIFQTVKELRSQRPGMVQTEEQYIFLYRAIL  230 (231)
T ss_pred             HhccCCCCCeEEECCCCCCccchHHHHHHHHHHHHhc---CCCCHHHHHHHHHhccccccCCHHHHHHHHHHHh
Confidence            643 336799999999999999999999999999887   8999999999999999999999999999999986


No 12 
>KOG0793|consensus
Probab=100.00  E-value=2.4e-62  Score=498.05  Aligned_cols=259  Identities=36%  Similarity=0.607  Sum_probs=235.3

Q ss_pred             CCcchHHHhHHhhcCCC-CCcccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEeeeecCCC-CCccEEEec
Q psy15334        162 DPDWIAVEFEDLKSVGR-GQPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYL-SPRQYIATQ  239 (451)
Q Consensus       162 ~~~~~~~Ef~~l~~~~~-~~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs~i~g~~-~~~~yI~tQ  239 (451)
                      +.+.+++|++.|..... ..++..+..++|..|||...++||||+||.|.-.....++||||||+|-..+ ..-.|||||
T Consensus       730 nk~rlekeWealc~YqaEpn~r~~aq~e~~a~kNR~~a~LpyDhsrV~l~~~~s~s~~dYiNAS~I~DhDPR~paYIAtQ  809 (1004)
T KOG0793|consen  730 NKNRLEKEWEALCAYQAEPNSRFVAQREENAPKNRSLAVLPYDHSRVLLKAENSHSHSDYINASPIMDHDPRNPAYIATQ  809 (1004)
T ss_pred             CchHHHHHHHHHHHhhcCcchhhhhcccccccccCCCCccccccceeeeccccCccccccccccccccCCCCccceeecc
Confidence            34459999999976543 3457788899999999999999999999999987777889999999996532 345799999


Q ss_pred             CCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEee-cceEEEEEEEE--eCCe
Q psy15334        240 GPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHY-GSWVLSELRMC--RGGE  316 (451)
Q Consensus       240 ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~-~~~~~r~l~v~--~~~~  316 (451)
                      ||+++|+.|||+||||+.|.+|||||.+.|+|...|.+|||+++.. .|+-+.|+++++... .||.+|.|-++  .+++
T Consensus       810 gPl~stiA~FWQmvWe~G~~vIV~Lt~l~Engv~qc~rYWPdeGse-lyhiyEV~LVSEHIWceDfLVRSFYLKNlqtse  888 (1004)
T KOG0793|consen  810 GPLPSTIADFWQMVWESGCVVIVMLTPLAENGVRQCYRYWPDEGSE-LYHIYEVNLVSEHIWCEDFLVRSFYLKNLQTSE  888 (1004)
T ss_pred             CCCchHHHHHHHHHHHcCcEEEEEecChhhcchhhhhhcCCCCCcc-eeeeEEeeeehhhhhhhhHHHHHHHHhhccccc
Confidence            9999999999999999999999999999999999999999998764 599999999998774 67889988776  4789


Q ss_pred             eeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHH
Q psy15334        317 HRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRI  396 (451)
Q Consensus       317 ~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~  396 (451)
                      +|+|+||||.+||+.|+|.+..+|++|.++|+++......|||||||+|.|||||+|+||+++.+|.+..  +.+||..+
T Consensus       889 TRTvTQFHfLSWp~egvPasarslLdFRRKVNK~YRGRScpIiVH~sdGaGRTG~YiliDmvl~Rm~kGa--keIDIaAT  966 (1004)
T KOG0793|consen  889 TRTVTQFHFLSWPDEGVPASARSLLDFRRKVNKCYRGRSCPIIVHCSDGAGRTGTYILIDMVLNRMAKGA--KEIDIAAT  966 (1004)
T ss_pred             ceeeeeeeeecccccCCccchHHHHHHHHHhhhhccCCCCceEEEccCCCCccceeeeHHHHHHHHhccc--hhhhHHHH
Confidence            9999999999999999999999999999999999887789999999999999999999999999998653  89999999


Q ss_pred             HHHHHhhccCCCCCHHHHHHHHHHHHH
Q psy15334        397 VHGLRRQRVLMVQSLAQYAFIYECVQV  423 (451)
Q Consensus       397 v~~lR~qR~~~Vqt~~QY~fiy~~l~~  423 (451)
                      +..||.||+|||+|.+||.|.+.||.+
T Consensus       967 lEHlRDQR~GmVaTkdQFef~l~aVAe  993 (1004)
T KOG0793|consen  967 LEHLRDQRPGMVATKDQFEFALTAVAE  993 (1004)
T ss_pred             HHHHhhcCCcceeehhhhHHHHHHHHH
Confidence            999999999999999999999999985


No 13 
>PF00102 Y_phosphatase:  Protein-tyrosine phosphatase;  InterPro: IPR000242 Protein tyrosine (pTyr) phosphorylation is a common post-translational modification which can create novel recognition motifs for protein interactions and cellular localisation, affect protein stability, and regulate enzyme activity. Consequently, maintaining an appropriate level of protein tyrosine phosphorylation is essential for many cellular functions. Tyrosine-specific protein phosphatases (PTPase; 3.1.3.48 from EC) catalyse the removal of a phosphate group attached to a tyrosine residue, using a cysteinyl-phosphate enzyme intermediate. These enzymes are key regulatory components in signal transduction pathways (such as the MAP kinase pathway) and cell cycle control, and are important in the control of cell growth, proliferation, differentiation and transformation [, ]. The PTP superfamily can be divided into four subfamilies []:   (1) pTyr-specific phosphatases (2) dual specificity phosphatases (dTyr and dSer/dThr) (3) Cdc25 phosphatases (dTyr and/or dThr) (4) LMW (low molecular weight) phosphatases   Based on their cellular localisation, PTPases are also classified as:   Receptor-like, which are transmembrane receptors that contain PTPase domains [] Non-receptor (intracellular) PTPases []   All PTPases carry the highly conserved active site motif C(X)5R (PTP signature motif), employ a common catalytic mechanism, and share a similar core structure made of a central parallel beta-sheet with flanking alpha-helices containing a beta-loop-alpha-loop that encompasses the PTP signature motif []. Functional diversity between PTPases is endowed by regulatory domains and subunits.  This entry repesents several receptor and non-receptor protein-tyrosine phosphatases. Structurally, all known receptor PTPases, are made up of a variable length extracellular domain, followed by a transmembrane region and a C-terminal catalytic cytoplasmic domain. Some of the receptor PTPases contain fibronectin type III (FN-III) repeats, immunoglobulin-like domains, MAM domains or carbonic anhydrase-like domains in their extracellular region. The cytoplasmic region generally contains two copies of the PTPase domain. The first seems to have enzymatic activity, while the second is inactive. The inactive domains of tandem phosphatases can be divided into two classes. Those which bind phosphorylated tyrosine residues may recruit multi-phosphorylated substrates for the adjacent active domains and are more conserved, while the other class have accumulated several variable amino acid substitutions and have a complete loss of tyrosine binding capability. The second class shows a release of evolutionary constraint for the sites around the catalytic centre, which emphasises a difference in function from the first group. There is a region of higher conservation common to both classes, suggesting a new regulatory centre []. PTPase domains consist of about 300 amino acids. There are two conserved cysteines, the second one has been shown to be absolutely required for activity. Furthermore, a number of conserved residues in its immediate vicinity have also been shown to be important.; GO: 0004725 protein tyrosine phosphatase activity, 0006470 protein dephosphorylation; PDB: 3O4T_A 3O4S_A 3O4U_A 2A3K_A 2QDP_A 2QDC_A 2QDM_A 2HVL_A 1ZC0_A 3D44_A ....
Probab=100.00  E-value=6.9e-59  Score=443.26  Aligned_cols=229  Identities=40%  Similarity=0.804  Sum_probs=208.1

Q ss_pred             CCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEeeeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEecccee
Q psy15334        190 NRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIE  269 (451)
Q Consensus       190 N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E  269 (451)
                      |+.|||+.|++|||++||+|.+.  .+++||||||||+++...++||+||+|+++|++|||+||||++|.+||||+...|
T Consensus         1 n~~knR~~~~~~~d~~rv~l~~~--~~~~~yi~A~~v~~~~~~~~~I~~q~P~~~t~~~FW~mv~~~~~~~Iv~L~~~~e   78 (235)
T PF00102_consen    1 NREKNRSPDILPYDDSRVKLNPS--PGDSDYINASYVDGYKNGKKFIVTQAPMPDTIEDFWQMVWEQKVQIIVMLCSFDE   78 (235)
T ss_dssp             GGGGSSSTTS-BBTTTEEETTTT--TTSTTEESEEEEEESSSTEEEEEEES-SGGGHHHHHHHHHHTTBSEEEEESBSEE
T ss_pred             CCccCCCCCCCcCcceEEEecCC--CCccchhhhhhhccccchhhheeecccccccccceehheeeccccceeccccccc
Confidence            78999999999999999999542  2336799999999998899999999999999999999999999999999999999


Q ss_pred             cccccccccCC-CCCCCceeeeEEEEEe-eeE-eecceEEEEEEEEeCCe---eeEEEEEEeecCCCCCCCCChhHHHHH
Q psy15334        270 KHRDKCYKYWP-SDQHPMRCADIEISLQ-DVC-HYGSWVLSELRMCRGGE---HRMISHYHFTTWPDSGVPSSPNTLVSF  343 (451)
Q Consensus       270 ~~~~kc~~YwP-~~~~~~~~g~~~V~~~-~~~-~~~~~~~r~l~v~~~~~---~r~V~h~~~~~Wpd~gvP~~~~~ll~f  343 (451)
                      .+..+|.+||| ..+..+.+|+++|++. +.. ....+++|+|.++..+.   .+.++||||..||++++|.+...+++|
T Consensus        79 ~~~~~~~~y~P~~~~~~~~~g~~~V~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~v~~~~~~~W~~~~~P~~~~~~~~~  158 (235)
T PF00102_consen   79 SGDEKCDQYWPLKEGESLKFGDYTVKCKEKIEENSKHYTVRKLEVTNGKSKKKSRTVTHFHYTNWPDDGVPPSPESFLDF  158 (235)
T ss_dssp             TTEESS--TSTSSSSSEEEETTEEEEEEEEEEEEESSEEEEEEEEEETTTTEEEEEEEEEEEESSSSSSSGSSSHHHHHH
T ss_pred             ccccccccccccccccccccccccccccccccccccceEEeeccccccccccccccccceeeeeccccccccccchhhhh
Confidence            99999999999 5567889999999999 666 67899999999987543   589999999999999999999999999


Q ss_pred             HHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHH
Q psy15334        344 VLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQV  423 (451)
Q Consensus       344 i~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~  423 (451)
                      ++.+++......+|++|||++|+||||+||+++++++++..+   +.+||.++++.||++|+++|++.+||.|||+++++
T Consensus       159 ~~~v~~~~~~~~~pivVhc~~G~gRsg~f~~~~~~~~~~~~~---~~~~v~~~~~~lR~~R~~~i~~~~qy~f~~~~~~e  235 (235)
T PF00102_consen  159 IRKVNKSKDDPNGPIVVHCSDGVGRSGTFCAIDILIEQLKKE---GEVDVFEIVKKLRQQRPGAIQSPEQYRFCYMAVLE  235 (235)
T ss_dssp             HHHHHHHHSTTSSEEEEESSSSSHHHHHHHHHHHHHHHHHHH---SEECHHHHHHHHHTTSTTSSSSHHHHHHHHHHHHH
T ss_pred             hhhccccccCCccceEeecccccccccccccchhhccccccc---cchhhHHHHHHHHhhCCCccCCHHHHHHHHHHHhC
Confidence            999998876667899999999999999999999999999998   88999999999999999999999999999999986


No 14 
>COG5599 PTP2 Protein tyrosine phosphatase [Signal transduction mechanisms]
Probab=100.00  E-value=2.3e-59  Score=433.30  Aligned_cols=238  Identities=37%  Similarity=0.638  Sum_probs=196.5

Q ss_pred             ccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEeeeecCCCCCccEEEecCCCCCCHHHHHhHhhhccC--C
Q psy15334        182 TTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRT--P  259 (451)
Q Consensus       182 ~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v--~  259 (451)
                      +.....+.|-.||||.||+||+++||+|...  ...+||||||||+...  .+|||||||.++|+.|||+|||+...  .
T Consensus        45 ~~~sls~~~~~rNRYsnIvPYe~tRV~Lk~~--k~~~DYINAS~ik~~~--~~yIAtQgP~~~t~ddFW~mvw~n~~~~g  120 (302)
T COG5599          45 DDTSLSSTNYARNRYSNIVPYEHTRVHLKYG--KSINDYINASYIKTPR--GKYIATQGPKPETIDDFWKMVWHNVPNNG  120 (302)
T ss_pred             cccccCccccccccccccccccCceeeccCC--CchhhcccceeeecCC--CceEEecCCCCchHHHHHHHHHhcCCCce
Confidence            3445667899999999999999999999875  4568999999999855  45999999999999999999999998  9


Q ss_pred             eEEEeccceecccccccccCCCC-CCCceeeeEEEEEeeeEeec--ceEEEEEEEEe-CCeeeEEEEEEeecCCCCCCCC
Q psy15334        260 AIVMLTQCIEKHRDKCYKYWPSD-QHPMRCADIEISLQDVCHYG--SWVLSELRMCR-GGEHRMISHYHFTTWPDSGVPS  335 (451)
Q Consensus       260 ~IVmLt~~~E~~~~kc~~YwP~~-~~~~~~g~~~V~~~~~~~~~--~~~~r~l~v~~-~~~~r~V~h~~~~~Wpd~gvP~  335 (451)
                      +||||+++.|.+++||.+|||.. ++...+|...+.-++.+...  .+.++.++++. ++..+.|+||+|.+|+|.++| 
T Consensus       121 vIVmLt~l~E~~rekc~qYWp~~~~~~~~~G~~v~~~~~~e~~~d~~~~~~~f~L~~~~~~~k~Ihhf~y~nW~D~~~p-  199 (302)
T COG5599         121 VIVMLTSLPEYNREKCDQYWPLGYDDTLIIGLRVIKQKKYELFNDNIVNVHNFELTSINGPPKKIHHFQYINWVDFNVP-  199 (302)
T ss_pred             EEEEecChHHHhHHHHhhhCCCCcCcceeeeeEEEEEecccccccceeeeeecccccCCCCccEEEEEEecCccccCCc-
Confidence            99999999999999999999942 34567886555555444433  35678888874 578999999999999999999 


Q ss_pred             ChhHHHHHHHHHHHhcCC-CCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccC-C----CCCC-HHHHHHHHHhhccCCC
Q psy15334        336 SPNTLVSFVLSFRTHISP-AQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEG-A----GSVD-ILRIVHGLRRQRVLMV  408 (451)
Q Consensus       336 ~~~~ll~fi~~v~~~~~~-~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~-~----~~vd-i~~~v~~lR~qR~~~V  408 (451)
                      +...+.++++.+...  + ..+|++||||||+||||||+|+|.++++...... .    ...| |+++|..||+||+.||
T Consensus       200 ~i~sl~~~~~sl~~s--p~~t~piiVHCSAGvGRTGTFIalD~ll~~~~~~~~~t~~~~~t~D~if~iV~~LRsQRmkmV  277 (302)
T COG5599         200 DIRSLTEVIHSLNDS--PVRTGPIIVHCSAGVGRTGTFIALDILLRMPNDTLNHTDTWEDTQDLIFQIVLSLRSQRMKMV  277 (302)
T ss_pred             CHHHHHHHHHHhhcC--cCCCCCEEEEeccCCCCcceeeeHHHHHhccccccCCCchhhhhhhHHHHHHHHHHHHHHHHH
Confidence            566666666665432  2 5789999999999999999999999987643210 0    1123 8999999999999999


Q ss_pred             CCHHHHHHHHHHHHHHHh
Q psy15334        409 QSLAQYAFIYECVQVYLQ  426 (451)
Q Consensus       409 qt~~QY~fiy~~l~~~l~  426 (451)
                      |+.+||.|||++++++.+
T Consensus       278 Qn~~Qf~flY~~~~~l~~  295 (302)
T COG5599         278 QNKTQFKFLYDAFLELNK  295 (302)
T ss_pred             HhHHHHHHHHHHHHHHHH
Confidence            999999999999999873


No 15 
>KOG4228|consensus
Probab=100.00  E-value=5.3e-59  Score=500.16  Aligned_cols=277  Identities=36%  Similarity=0.653  Sum_probs=241.8

Q ss_pred             cccCCCcceeccchh----hhcc-CC--c--chHHHhHHhhcCCCCC-cccccccccCCCCCCCCCCCCCCCcceEecCC
Q psy15334        143 TLHNCSHPIPVTLFC----TMTL-DP--D--WIAVEFEDLKSVGRGQ-PTTAAELRCNRPKNRYTNIFPYDSSRYKLQPV  212 (451)
Q Consensus       143 ~~~~~~~~i~~~~f~----~~~~-~~--~--~~~~Ef~~l~~~~~~~-~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~  212 (451)
                      ....+.++|++.+|.    .+.. ++  +  .+++||+.|....... ....+.+++|+.|||+.+|+|||++||.|.++
T Consensus       796 ~~~~G~T~i~~~~~~~~~~~l~~~~p~~~~t~le~EF~~L~~~~~~~~~~~~~~l~~N~~KNR~~~i~P~d~~rv~L~~~  875 (1087)
T KOG4228|consen  796 ACLCGDTEVPASTLAPYSQKLKRIDPPENKTGLEEEFKTLNSCKPRTRMMICGNLPENKSKNRQVNILPYDRNRVILIPT  875 (1087)
T ss_pred             HHhcCCccccHHHHHHHHHHhccCCCCCCCcchHHHHHHHhhccccchhhhccccchhcccccccccCCchhcccceecc
Confidence            345788999999992    2222 32  2  2999999998765432 22348999999999999999999999999999


Q ss_pred             CCCCCCCcEEeeeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEE
Q psy15334        213 DDEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIE  292 (451)
Q Consensus       213 ~~~~~~dYINAs~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~  292 (451)
                      .|. ++||||||+||||..+..|||||+||.+|++||||||||++|.+||||++..+.  .+|.||||.++ ...||.|.
T Consensus       876 ~G~-~sdYINAs~idgy~~~~~fivtq~PL~~T~~DFWrmi~d~~~tsiVmL~~l~~~--~~C~qyw~~~g-~~~yg~i~  951 (1087)
T KOG4228|consen  876 HGE-SSDYINASFIDGYRQPKAFIVTQGPLAETVEDFWRMIWDQNVTSIVMLTELKHP--EKCPQYWPPEG-SQRYGPIE  951 (1087)
T ss_pred             CCC-cccccchhhhcccCCcceEEEecCCcccchHHHHHHhhccceeEEEEecccCcc--cccccccCCcC-ceecCcEE
Confidence            887 899999999999999999999999999999999999999999999999998776  89999999854 57899999


Q ss_pred             EEEeeeEeecceEEEEEEEEe--CCeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHH-HhcCC-CCCCEEEEcCCCCch
Q psy15334        293 ISLQDVCHYGSWVLSELRMCR--GGEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFR-THISP-AQHPIVVHCSAGVGR  368 (451)
Q Consensus       293 V~~~~~~~~~~~~~r~l~v~~--~~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~-~~~~~-~~~PivVHCs~GvGR  368 (451)
                      |+.++......++.|+|.+++  .+..|+|+||||++||..++|+.....+.++-.+. +.... ..+|++|||++|+||
T Consensus       952 Ve~~~~~~~~~~t~r~f~i~n~~~~~~r~v~qfq~~~WP~~~~~p~~~~~~~~i~~~~~~~q~~~~~~P~~Vhc~nG~~r 1031 (1087)
T KOG4228|consen  952 VEDMNEHINPQYTAREFGVTNEREKQSRTVRQFQFTGWPEYGKPPQSKGPISKIPSVASKWQQLGADGPIIVHCLNGVGR 1031 (1087)
T ss_pred             EEecccccchhhhhhhheeeeccccCceEEEEEEecCCcccCcCCCCcchhhhHHHHHHHHHhhcCCCCEEEEEcCCCcc
Confidence            999999999999999999986  46799999999999999888876666655554443 32222 278999999999999


Q ss_pred             hhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHHHHh
Q psy15334        369 TGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQVYLQ  426 (451)
Q Consensus       369 tG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~~l~  426 (451)
                      ||+|||+.+++++++.+   +.||||++|+.||.+|++||++.+||.|||+++++|+.
T Consensus      1032 sg~f~ai~~l~e~~~~e---~~vDVfq~vk~Lr~~rp~mv~t~~QY~fcYdv~~~y~~ 1086 (1087)
T KOG4228|consen 1032 TGTFCAISILLERMRKE---GVVDVFQTVKTLRFQRPGMVDTSDQYQFCYDVALEYLG 1086 (1087)
T ss_pred             eeehHHHHHHHHHHhhc---CceeeehhhhhhhhcCccccCcHHHHHHHHHHHHHhhc
Confidence            99999999999999999   99999999999999999999999999999999999985


No 16 
>PRK15375 pathogenicity island 1 effector protein StpP; Provisional
Probab=100.00  E-value=7.4e-53  Score=425.30  Aligned_cols=215  Identities=19%  Similarity=0.258  Sum_probs=183.9

Q ss_pred             CCCCCCCCCCcceEecCCCCCCCCCcEEeeeecCCCCCccEEEecCCCC--CCHHHHHhHhhhccCCeEEEeccceeccc
Q psy15334        195 RYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPLC--STVDDFWRMCWESRTPAIVMLTQCIEKHR  272 (451)
Q Consensus       195 R~~~i~pyD~sRV~L~~~~~~~~~dYINAs~i~g~~~~~~yI~tQ~Pl~--~T~~dFWrMVwe~~v~~IVmLt~~~E~~~  272 (451)
                      --.+|+|+|||||+|.      +++|||||||+ +..++.+|++|.|+.  +|++||||||||++|.+|||||+..|.++
T Consensus       303 ~~~~I~~~~~TrV~ls------~g~yI~ANyI~-~~~k~~aIa~QYP~nt~eTieDFWrMVWEn~~~VIVMLT~l~E~g~  375 (535)
T PRK15375        303 AVSSIPINQQTQVKLS------DGMPVPVNTLT-FDGKPVALAGSYPKNTPDALEAHMKMLLEKECSCLVVLTSEDQMQA  375 (535)
T ss_pred             HhcccCccccceEEcC------CCCccccceee-ecCccceEeccCCCCchhhHHHHHHHHHHcCCCEEEEeeccccCcc
Confidence            3479999999999995      27999999999 888889999999976  89999999999999999999999999999


Q ss_pred             ccccccCCCCCCCceeeeEEEEEeeeEee-cceEEEEEEE--EeCCeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHH
Q psy15334        273 DKCYKYWPSDQHPMRCADIEISLQDVCHY-GSWVLSELRM--CRGGEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRT  349 (451)
Q Consensus       273 ~kc~~YwP~~~~~~~~g~~~V~~~~~~~~-~~~~~r~l~v--~~~~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~  349 (451)
                      .+|.+||+   ++..||+|+|++.+++.. .++.+|.+.+  ...+..+.|+||||++|||||+|+++..++.|++.|+.
T Consensus       376 ~KC~pYW~---~s~tYGdItVts~seesls~g~iIR~f~L~Ik~~E~R~tV~QFHyTnWPDHGVPpST~~LleLvr~Vr~  452 (535)
T PRK15375        376 KQLPPYFR---GSYTFGEVHTNSQKVSSASQGEAIDQYNMQLSCGEKRYTIPVLHVKNWPDHQPLPSTDQLEYLADRVKN  452 (535)
T ss_pred             cccCccCC---CcceeccEEEEEEEEEeccCCceEEEEEEEEecCCceeEEEEEEeCCCCCCCCCCChHHHHHHHHHHHH
Confidence            99999996   356899999999988765 4577776644  44445568999999999999999988889999999876


Q ss_pred             hcCC--------CCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccC-CCCCHHHHHHHHHH
Q psy15334        350 HISP--------AQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVL-MVQSLAQYAFIYEC  420 (451)
Q Consensus       350 ~~~~--------~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~-~Vqt~~QY~fiy~~  420 (451)
                      ....        .++++|||||||+||||+|||++.    +..+   +.+|+.++|+.||.+|++ |||+.+||.|++..
T Consensus       453 ~~q~~~~~~~~~nk~~PVVHCSAGVGRTGTFIAi~l----lk~~---~~~sle~IV~dlR~qRng~MVQt~eQy~~l~~~  525 (535)
T PRK15375        453 SNQNGAPGRSSSDKHLPMIHCLGGVGRTGTMAAALV----LKDN---PHSNLEQVRADFRNSRNNRMLEDASQFVQLKAM  525 (535)
T ss_pred             hhhcccccccccCCCCceEEcCCCCchHHHHHHHHH----Hhcc---ccCCHHHHHHHHHhcCCccccccHHHHHHHHHH
Confidence            5311        123448999999999999999975    3434   678999999999999999 99999999999988


Q ss_pred             HHHHHh
Q psy15334        421 VQVYLQ  426 (451)
Q Consensus       421 l~~~l~  426 (451)
                      -...|.
T Consensus       526 ~~~~~~  531 (535)
T PRK15375        526 QAQLLM  531 (535)
T ss_pred             HHHHhh
Confidence            776654


No 17 
>KOG0789|consensus
Probab=100.00  E-value=4.5e-51  Score=422.45  Aligned_cols=243  Identities=32%  Similarity=0.527  Sum_probs=196.4

Q ss_pred             ccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEeeeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEE
Q psy15334        184 AAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVM  263 (451)
Q Consensus       184 ~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVm  263 (451)
                      ......|..||||.||.|||++||+|....   ..||||||||+++...+.||+||||+++|.+|||+||||+++..|||
T Consensus       121 ~~~~~~~~~kNr~~dv~~~d~~rV~l~~~~---~~dyI~A~~v~~~~~~~~~I~tQ~Pl~~t~~dFW~mv~~~k~~~Ivm  197 (415)
T KOG0789|consen  121 CDAFLKNMDKNRYSDVQCLDHTRVKLKPPQ---DKDYIHASKIYFPKGPRTAILTQAPLSKTTEDFWRMVFEEKVESVVL  197 (415)
T ss_pred             hhhhhccccccccCCcCcccCCeeecCCCC---CcccchhhccCCcCCCcCceecCCCCcccHHHHHHHHHHcCCCEEEE
Confidence            344567888999999999999999997532   67999999999976448999999999999999999999999999999


Q ss_pred             eccceecccccccccCCCCC-CCceeeeEEEEEeee---Eee---cceEEEEEEE-EeCCe--eeEEEEEEeecCCCCCC
Q psy15334        264 LTQCIEKHRDKCYKYWPSDQ-HPMRCADIEISLQDV---CHY---GSWVLSELRM-CRGGE--HRMISHYHFTTWPDSGV  333 (451)
Q Consensus       264 Lt~~~E~~~~kc~~YwP~~~-~~~~~g~~~V~~~~~---~~~---~~~~~r~l~v-~~~~~--~r~V~h~~~~~Wpd~gv  333 (451)
                      ||+..|.|+.+|.+|||.+. ....||.+...+...   ...   ..+....+.+ ...+.  .+.|.|+||++|||+|+
T Consensus       198 l~~~~E~~~~kc~~Y~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~WPd~~~  277 (415)
T KOG0789|consen  198 LCSDEELGAAECASYFPSKRGVKLTFLEFGKKFVENRKVEMKQQDQTSTVGLLLLVLPEGLEISSSVVHYHYINWPDHGA  277 (415)
T ss_pred             ecchhhccHHHHhhcCcccCCCeeeecccceeEEEeeeeecccccccceeeEEEeecCCCccccceEEEEeeCCCccccC
Confidence            99999999999999999875 455677664433332   221   2222222222 22333  46999999999999999


Q ss_pred             CCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHH
Q psy15334        334 PSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQ  413 (451)
Q Consensus       334 P~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~Q  413 (451)
                      |.+...++.+++.-.....+..+|++||||+|+||||+|+++++++.++....  ...++..+++.+|.||+++|||..|
T Consensus       278 p~~~~~~l~~~~~~~~~~~~~~~P~vVhcsaG~gRtgt~v~~~~~~~~~~~~~--~~~~~~~~~~~iR~qR~~~vqt~~Q  355 (415)
T KOG0789|consen  278 PDSVKSILPLLRQSVLELRPKQEPIEVHCSAGAGRAGTLVLIEHALIELQGPE--GEPPIDEILREIRYQRPGAVQSPLQ  355 (415)
T ss_pred             CcchHHHHHHHHhhhhhhcCCCCCeEEECCCCCCccchHHHHHHHHHHHhcCC--CCccHHHHHHHHHHHhhhcccchhH
Confidence            99999999998751112233468999999999999999999998888876432  5678999999999999999999999


Q ss_pred             HHHHHHHHHHHHhcCCCC
Q psy15334        414 YAFIYECVQVYLQGGVRH  431 (451)
Q Consensus       414 Y~fiy~~l~~~l~~~~~~  431 (451)
                      |.|+|+++++|+......
T Consensus       356 y~f~~~~~~~~~~~~~~~  373 (415)
T KOG0789|consen  356 YLFIYAATLKYIKKVSYW  373 (415)
T ss_pred             HHHHHHHHHHHHHHccCC
Confidence            999999999998874433


No 18 
>smart00012 PTPc_DSPc Protein tyrosine phosphatase, catalytic domain, undefined specificity. Protein tyrosine phosphatases. Homologues detected by this profile and not by those of "PTPc" or  "DSPc" are predicted to be protein phosphatases with a similar fold to DSPs and PTPs, yet with unpredicted specificities.
Probab=99.90  E-value=1.9e-23  Score=173.69  Aligned_cols=102  Identities=49%  Similarity=0.954  Sum_probs=92.0

Q ss_pred             EEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcC--CCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHH
Q psy15334        319 MISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHIS--PAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRI  396 (451)
Q Consensus       319 ~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~--~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~  396 (451)
                      .+.||||++|||+++|.....+++|+..+++...  +.++||+|||++|+||||+||+++.++.++..+.  +.+|+.++
T Consensus         1 ~~~~~~~~~Wpd~~~P~~~~~~~~~~~~v~~~~~~~~~~~pvlVHC~~G~gRtg~~~~~~~~~~~~~~~~--~~~~~~~~   78 (105)
T smart00012        1 TVKHYHYTGWPDHGVPESPDSILEFLRAVKKNLNQSSSSGPVVVHCSAGVGRTGTFVALDILLQQLESET--GEVDIFQT   78 (105)
T ss_pred             CeEEEeeCCCCCCCCCCCHHHHHHHHHHHHHHHHhcCCCCCEEEEeCCCCChhhHHHHHHHHHHHHHhcC--CCCCHHHH
Confidence            3789999999999999998999999999887643  2367999999999999999999999999887542  57999999


Q ss_pred             HHHHHhhccCCCCCHHHHHHHHHHHH
Q psy15334        397 VHGLRRQRVLMVQSLAQYAFIYECVQ  422 (451)
Q Consensus       397 v~~lR~qR~~~Vqt~~QY~fiy~~l~  422 (451)
                      ++.+|++|++++++.+||.|+|.+++
T Consensus        79 ~~~ir~~r~~~~~~~~q~~~~~~~~~  104 (105)
T smart00012       79 VKELRKQRPGMVQTFEQYLFLYRALL  104 (105)
T ss_pred             HHHHHhhhhhhCCcHHHHHHHHHHHh
Confidence            99999999999999999999999986


No 19 
>smart00404 PTPc_motif Protein tyrosine phosphatase, catalytic domain motif.
Probab=99.90  E-value=1.9e-23  Score=173.69  Aligned_cols=102  Identities=49%  Similarity=0.954  Sum_probs=92.0

Q ss_pred             EEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcC--CCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHH
Q psy15334        319 MISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHIS--PAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRI  396 (451)
Q Consensus       319 ~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~--~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~  396 (451)
                      .+.||||++|||+++|.....+++|+..+++...  +.++||+|||++|+||||+||+++.++.++..+.  +.+|+.++
T Consensus         1 ~~~~~~~~~Wpd~~~P~~~~~~~~~~~~v~~~~~~~~~~~pvlVHC~~G~gRtg~~~~~~~~~~~~~~~~--~~~~~~~~   78 (105)
T smart00404        1 TVKHYHYTGWPDHGVPESPDSILEFLRAVKKNLNQSSSSGPVVVHCSAGVGRTGTFVALDILLQQLESET--GEVDIFQT   78 (105)
T ss_pred             CeEEEeeCCCCCCCCCCCHHHHHHHHHHHHHHHHhcCCCCCEEEEeCCCCChhhHHHHHHHHHHHHHhcC--CCCCHHHH
Confidence            3789999999999999998999999999887643  2367999999999999999999999999887542  57999999


Q ss_pred             HHHHHhhccCCCCCHHHHHHHHHHHH
Q psy15334        397 VHGLRRQRVLMVQSLAQYAFIYECVQ  422 (451)
Q Consensus       397 v~~lR~qR~~~Vqt~~QY~fiy~~l~  422 (451)
                      ++.+|++|++++++.+||.|+|.+++
T Consensus        79 ~~~ir~~r~~~~~~~~q~~~~~~~~~  104 (105)
T smart00404       79 VKELRKQRPGMVQTFEQYLFLYRALL  104 (105)
T ss_pred             HHHHHhhhhhhCCcHHHHHHHHHHHh
Confidence            99999999999999999999999986


No 20 
>PTZ00242 protein tyrosine phosphatase; Provisional
Probab=99.66  E-value=5.2e-16  Score=140.52  Aligned_cols=135  Identities=19%  Similarity=0.287  Sum_probs=99.2

Q ss_pred             ccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEeecceEEEEEEEE
Q psy15334        233 RQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYGSWVLSELRMC  312 (451)
Q Consensus       233 ~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~~~~~r~l~v~  312 (451)
                      -+||++-+|+++++.+|....-+.++..||.++.          ++|+.+                              
T Consensus        16 ~r~~~~~~P~~~~~~~~l~~L~~~gI~~Iv~l~~----------~~~~~~------------------------------   55 (166)
T PTZ00242         16 FKFLILDAPSPSNLPLYIKELQRYNVTHLVRVCG----------PTYDAE------------------------------   55 (166)
T ss_pred             eEEEEecCCCcccHHHHHHHHHhCCCeEEEecCC----------CCCCHH------------------------------
Confidence            3799999999999999999999999999999864          122111                              


Q ss_pred             eCCeeeEEEEEEeecCC--CCCCCCChh---HHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccC
Q psy15334        313 RGGEHRMISHYHFTTWP--DSGVPSSPN---TLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEG  387 (451)
Q Consensus       313 ~~~~~r~V~h~~~~~Wp--d~gvP~~~~---~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~  387 (451)
                          .....+++|..||  |+++| +..   .+++++..+........+||+|||.+|+||||+||++..+    +.   
T Consensus        56 ----~~~~~gi~~~~~p~~D~~~P-~~~~i~~~~~~i~~~~~~~~~~g~~V~VHC~aGigRSgt~~a~yL~----~~---  123 (166)
T PTZ00242         56 ----LLEKNGIEVHDWPFDDGAPP-PKAVIDNWLRLLDQEFAKQSTPPETIAVHCVAGLGRAPILVALALV----EY---  123 (166)
T ss_pred             ----HHHHCCCEEEecCCCCCCCC-CHHHHHHHHHHHHHHHHhhccCCCeEEEECCCCCCHHHHHHHHHHH----Hh---
Confidence                0112234455566  65655 444   4455555544332335789999999999999999998732    22   


Q ss_pred             CCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHH
Q psy15334        388 AGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECV  421 (451)
Q Consensus       388 ~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l  421 (451)
                       +.+++.++++.+|++|++++ +..|+.|+.+..
T Consensus       124 -~~~s~~eAi~~vr~~R~~~i-~~~Q~~~l~~~~  155 (166)
T PTZ00242        124 -GGMEPLDAVGFVREKRKGAI-NQTQLQFLKKYK  155 (166)
T ss_pred             -CCCCHHHHHHHHHHHCCCCc-hHHHHHHHHHHH
Confidence             44699999999999999999 589999987654


No 21 
>PTZ00393 protein tyrosine phosphatase; Provisional
Probab=99.46  E-value=3.8e-13  Score=126.36  Aligned_cols=135  Identities=15%  Similarity=0.265  Sum_probs=98.4

Q ss_pred             ccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEeecceEEEEEEEE
Q psy15334        233 RQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYGSWVLSELRMC  312 (451)
Q Consensus       233 ~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~~~~~r~l~v~  312 (451)
                      -+||++-+|+..++.+|..-..+.++..||-+|.          .+.+.+  .+                          
T Consensus        92 ~rfLi~~~P~~~~~~~yl~eLk~~gV~~lVrlcE----------~~Yd~~--~~--------------------------  133 (241)
T PTZ00393         92 IKILILDAPTNDLLPLYIKEMKNYNVTDLVRTCE----------RTYNDG--EI--------------------------  133 (241)
T ss_pred             eeEEEeCCCCHHHHHHHHHHHHHcCCCEEEECCC----------CCCCHH--HH--------------------------
Confidence            4899999999999999999999999999999864          111211  00                          


Q ss_pred             eCCeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCC
Q psy15334        313 RGGEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVD  392 (451)
Q Consensus       313 ~~~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vd  392 (451)
                         +..-+.+++ ..|||+++|+. ..+.+|+..+..... ..++|+|||.+|+||||+++|+..+    . .   + .+
T Consensus       134 ---~~~GI~~~~-lpipDg~aPs~-~~i~~~l~~i~~~l~-~g~~VaVHC~AGlGRTGtl~AayLI----~-~---G-ms  198 (241)
T PTZ00393        134 ---TSAGINVHE-LIFPDGDAPTV-DIVSNWLTIVNNVIK-NNRAVAVHCVAGLGRAPVLASIVLI----E-F---G-MD  198 (241)
T ss_pred             ---HHcCCeEEE-eecCCCCCCCH-HHHHHHHHHHHHHHh-cCCeEEEECCCCCCHHHHHHHHHHH----H-c---C-CC
Confidence               011123333 46899998863 444455555544332 3578999999999999999987643    2 1   2 58


Q ss_pred             HHHHHHHHHhhccCCCCCHHHHHHHHHHH
Q psy15334        393 ILRIVHGLRRQRVLMVQSLAQYAFIYECV  421 (451)
Q Consensus       393 i~~~v~~lR~qR~~~Vqt~~QY~fiy~~l  421 (451)
                      ..+++..+|+.|++++ +..|+.|+.+.-
T Consensus       199 peeAI~~VR~~RPgAI-n~~Q~~fL~~y~  226 (241)
T PTZ00393        199 PIDAIVFIRDRRKGAI-NKRQLQFLKAYK  226 (241)
T ss_pred             HHHHHHHHHHHCCCCC-CHHHHHHHHHHH
Confidence            8899999999999999 578999977553


No 22 
>KOG1720|consensus
Probab=99.31  E-value=2.1e-11  Score=111.02  Aligned_cols=97  Identities=26%  Similarity=0.338  Sum_probs=77.5

Q ss_pred             EEEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHH
Q psy15334        321 SHYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGL  400 (451)
Q Consensus       321 ~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~l  400 (451)
                      .|+++ -.||.+.| ++..+.+|++.+.....  .|-|.|||-+|.||||++||..+|..        -.+.--+++..|
T Consensus       117 ~h~~l-~f~Dg~tP-~~~~v~~fv~i~e~~~~--~g~iaVHCkaGlGRTG~liAc~lmy~--------~g~ta~eaI~~l  184 (225)
T KOG1720|consen  117 DHHDL-FFADGSTP-TDAIVKEFVKIVENAEK--GGKIAVHCKAGLGRTGTLIACYLMYE--------YGMTAGEAIAWL  184 (225)
T ss_pred             eeeee-ecCCCCCC-CHHHHHHHHHHHHHHHh--cCeEEEEeccCCCchhHHHHHHHHHH--------hCCCHHHHHHHH
Confidence            34443 24788877 46888999998887654  67899999999999999999987764        235567999999


Q ss_pred             HhhccCCCCCHHHHHHHHHHHHHHHhcCC
Q psy15334        401 RRQRVLMVQSLAQYAFIYECVQVYLQGGV  429 (451)
Q Consensus       401 R~qR~~~Vqt~~QY~fiy~~l~~~l~~~~  429 (451)
                      |..|||||++.+|+..+++-..-++.+..
T Consensus       185 R~~RpG~V~gpqQ~~l~~~q~~~~~~~~~  213 (225)
T KOG1720|consen  185 RICRPGAVIGPQQHKLLHKQRDLWLAGDL  213 (225)
T ss_pred             HhcCCccccCHHHHHHHHHHHHHHhhhcc
Confidence            99999999999999999987765665543


No 23 
>KOG2836|consensus
Probab=99.16  E-value=5.3e-10  Score=94.56  Aligned_cols=131  Identities=21%  Similarity=0.399  Sum_probs=95.2

Q ss_pred             ccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEeecceEEEEEEEE
Q psy15334        233 RQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYGSWVLSELRMC  312 (451)
Q Consensus       233 ~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~~~~~r~l~v~  312 (451)
                      -+|.+|..|...|+..|-.-.-.+++.++|-.|...         |   +..++.-.                       
T Consensus        17 MrFLIThnPtnaTln~fieELkKygvttvVRVCe~T---------Y---dt~~lek~-----------------------   61 (173)
T KOG2836|consen   17 MRFLITHNPTNATLNKFIEELKKYGVTTVVRVCEPT---------Y---DTTPLEKE-----------------------   61 (173)
T ss_pred             eEEEEecCCCchhHHHHHHHHHhcCCeEEEEecccc---------c---CCchhhhc-----------------------
Confidence            479999999999999999999999999999998521         1   11111100                       


Q ss_pred             eCCeeeEEEEEEeecCC-CCCCCCChhHHHHHHHHHHHhcC-CCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCC
Q psy15334        313 RGGEHRMISHYHFTTWP-DSGVPSSPNTLVSFVLSFRTHIS-PAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGS  390 (451)
Q Consensus       313 ~~~~~r~V~h~~~~~Wp-d~gvP~~~~~ll~fi~~v~~~~~-~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~  390 (451)
                               -++..+|| |.|.|+..+-+-+.+..+..... ....-|-|||-||+||+-+++|+..+-.-|+.+     
T Consensus        62 ---------GI~Vldw~f~dg~ppp~qvv~~w~~l~~~~f~e~p~~cvavhcvaglgrapvlvalalie~gmkye-----  127 (173)
T KOG2836|consen   62 ---------GITVLDWPFDDGAPPPNQVVDDWLSLVKTKFREEPGCCVAVHCVAGLGRAPVLVALALIEAGMKYE-----  127 (173)
T ss_pred             ---------CceEeecccccCCCCchHHHHHHHHHHHHHHhhCCCCeEEEEeecccCcchHHHHHHHHHccccHH-----
Confidence                     12345798 88988776655555555554332 224569999999999999999987665555444     


Q ss_pred             CCHHHHHHHHHhhccCCCCCHHHHHHH
Q psy15334        391 VDILRIVHGLRRQRVLMVQSLAQYAFI  417 (451)
Q Consensus       391 vdi~~~v~~lR~qR~~~Vqt~~QY~fi  417 (451)
                          ++|..+|++|.|++. ..|..|+
T Consensus       128 ----dave~ir~krrga~n-~kql~~l  149 (173)
T KOG2836|consen  128 ----DAVEMIRQKRRGAIN-SKQLLYL  149 (173)
T ss_pred             ----HHHHHHHHHhhcccc-HHHHHHH
Confidence                889999999999995 4676663


No 24 
>cd00127 DSPc Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like.
Probab=99.07  E-value=7.6e-10  Score=96.78  Aligned_cols=88  Identities=15%  Similarity=0.198  Sum_probs=65.6

Q ss_pred             EEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHH
Q psy15334        319 MISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVH  398 (451)
Q Consensus       319 ~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~  398 (451)
                      ...|+.+.+|++...+.....+++|+......    ++||+|||.+|.||||++++...+..        +..+..++++
T Consensus        49 ~~~~~~~~D~~~~~~~~~~~~~~~~i~~~~~~----~~~vlVHC~~G~~Rs~~~~~~~l~~~--------~~~~~~~a~~  116 (139)
T cd00127          49 NYLYVPILDLPSQDISKYFDEAVDFIDDAREK----GGKVLVHCLAGVSRSATLVIAYLMKT--------LGLSLREAYE  116 (139)
T ss_pred             eEEEEEceeCCCCChHHHHHHHHHHHHHHHhc----CCcEEEECCCCCchhHHHHHHHHHHH--------cCCCHHHHHH
Confidence            46777887887555444444556666554432    57999999999999999998664432        3468899999


Q ss_pred             HHHhhccCCCCCHHHHHHHH
Q psy15334        399 GLRRQRVLMVQSLAQYAFIY  418 (451)
Q Consensus       399 ~lR~qR~~~Vqt~~QY~fiy  418 (451)
                      .+|++|+.+..+..+...+.
T Consensus       117 ~vr~~r~~~~~~~~~~~~l~  136 (139)
T cd00127         117 FVKSRRPIISPNAGFMRQLK  136 (139)
T ss_pred             HHHHHCCccCCCHHHHHHHH
Confidence            99999999988888777654


No 25 
>COG2453 CDC14 Predicted protein-tyrosine phosphatase [Signal transduction mechanisms]
Probab=98.93  E-value=1.2e-08  Score=93.86  Aligned_cols=90  Identities=29%  Similarity=0.276  Sum_probs=68.1

Q ss_pred             cCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccC
Q psy15334        327 TWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVL  406 (451)
Q Consensus       327 ~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~  406 (451)
                      .++|+.+|+. ..+-+++..+...... .+.|+|||.+|+|||||++|+..|+.+       +.-...+++.-+|..|++
T Consensus        79 ~~~D~~~p~~-~~l~~~v~~i~~~~~~-g~kVvVHC~~GigRSgtviaA~lm~~~-------~~~~~~~~i~~~~~~r~~  149 (180)
T COG2453          79 PILDGTVPDL-EDLDKIVDFIEEALSK-GKKVVVHCQGGIGRSGTVIAAYLMLYG-------GLSLADEAIAVKRRRRPG  149 (180)
T ss_pred             eecCCCCCcH-HHHHHHHHHHHHHHhc-CCeEEEEcCCCCchHHHHHHHHHHHHc-------CCCCHHHHHHHHHhcCCc
Confidence            5689998864 4555555555444433 348999999999999999998877642       344566888889999999


Q ss_pred             CCCCHHHHHHHHHHHHHHH
Q psy15334        407 MVQSLAQYAFIYECVQVYL  425 (451)
Q Consensus       407 ~Vqt~~QY~fiy~~l~~~l  425 (451)
                      ++.+..|+.|..+.....-
T Consensus       150 ~v~~~~q~~~~~e~~~~~~  168 (180)
T COG2453         150 AVVTEIQHLFELEQELFRK  168 (180)
T ss_pred             ccccHHHHHHHHHHHHHHh
Confidence            9999999999888766443


No 26 
>smart00195 DSPc Dual specificity phosphatase, catalytic domain.
Probab=98.71  E-value=1.6e-07  Score=82.20  Aligned_cols=57  Identities=18%  Similarity=0.292  Sum_probs=47.2

Q ss_pred             CCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHH
Q psy15334        354 AQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIY  418 (451)
Q Consensus       354 ~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy  418 (451)
                      ..+||+|||.+|.||||+++++..+..        ...++.+++..+|+.|+.+..+..|..-+.
T Consensus        77 ~~~~VlVHC~~G~~RS~~v~~~yl~~~--------~~~~~~~A~~~v~~~R~~~~p~~~~~~qL~  133 (138)
T smart00195       77 KGGKVLVHCQAGVSRSATLIIAYLMKY--------RNLSLNDAYDFVKDRRPIISPNFGFLRQLI  133 (138)
T ss_pred             CCCeEEEECCCCCchHHHHHHHHHHHH--------hCCCHHHHHHHHHHHCCccCCCHhHHHHHH
Confidence            367999999999999999998774432        235889999999999999999988876654


No 27 
>PF00782 DSPc:  Dual specificity phosphatase, catalytic domain;  InterPro: IPR000340 Protein tyrosine (pTyr) phosphorylation is a common post-translational modification which can create novel recognition motifs for protein interactions and cellular localisation, affect protein stability, and regulate enzyme activity. Consequently, maintaining an appropriate level of protein tyrosine phosphorylation is essential for many cellular functions. Tyrosine-specific protein phosphatases (PTPase; 3.1.3.48 from EC) catalyse the removal of a phosphate group attached to a tyrosine residue, using a cysteinyl-phosphate enzyme intermediate. These enzymes are key regulatory components in signal transduction pathways (such as the MAP kinase pathway) and cell cycle control, and are important in the control of cell growth, proliferation, differentiation and transformation [, ]. The PTP superfamily can be divided into four subfamilies []:   (1) pTyr-specific phosphatases (2) dual specificity phosphatases (dTyr and dSer/dThr) (3) Cdc25 phosphatases (dTyr and/or dThr) (4) LMW (low molecular weight) phosphatases   Based on their cellular localisation, PTPases are also classified as:   Receptor-like, which are transmembrane receptors that contain PTPase domains [] Non-receptor (intracellular) PTPases []   All PTPases carry the highly conserved active site motif C(X)5R (PTP signature motif), employ a common catalytic mechanism, and share a similar core structure made of a central parallel beta-sheet with flanking alpha-helices containing a beta-loop-alpha-loop that encompasses the PTP signature motif []. Functional diversity between PTPases is endowed by regulatory domains and subunits.  This entry represents dual specificity protein-tyrosine phosphatases. Ser/Thr and Tyr dual specificity phosphatases are a group of enzymes with both Ser/Thr (3.1.3.16 from EC) and tyrosine specific protein phosphatase (3.1.3.48 from EC) activity able to remove both the serine/threonine or tyrosine-bound phosphate group from a wide range of phosphoproteins, including a number of enzymes which have been phosphorylated under the action of a kinase. Dual specificity protein phosphatases (DSPs) regulate mitogenic signal transduction and control the cell cycle. The crystal structure of a human DSP, vaccinia H1-related phosphatase (or VHR), has been determined at 2.1 angstrom resolution []. A shallow active site pocket in VHR allows for the hydrolysis of phosphorylated serine, threonine, or tyrosine protein residues, whereas the deeper active site of protein tyrosine phosphatases (PTPs) restricts substrate specificity to only phosphotyrosine. Positively charged crevices near the active site may explain the enzyme's preference for substrates with two phosphorylated residues. The VHR structure defines a conserved structural scaffold for both DSPs and PTPs. A "recognition region" connecting helix alpha1 to strand beta1, may determine differences in substrate specificity between VHR, the PTPs, and other DSPs. These proteins may also have inactive phosphatase domains, and dependent on the domain composition this loss of catalytic activity has different effects on protein function. Inactive single domain phosphatases can still specifically bind substrates, and protect again dephosphorylation, while the inactive domains of tandem phosphatases can be further subdivided into two classes. Those which bind phosphorylated tyrosine residues may recruit multi-phosphorylated substrates for the adjacent active domains and are more conserved, while the other class have accumulated several variable amino acid substitutions and have a complete loss of tyrosine binding capability. The second class shows a release of evolutionary constraint for the sites around the catalytic centre, which emphasises a difference in function from the first group. There is a region of higher conservation common to both classes, suggesting a new regulatory centre [].; GO: 0008138 protein tyrosine/serine/threonine phosphatase activity, 0006470 protein dephosphorylation; PDB: 2G6Z_A 1MKP_A 1YZ4_A 2P4D_A 1M3G_A 1ZZW_A 2OUD_A 2HXP_A 3LJ8_A 1OHD_A ....
Probab=98.58  E-value=2e-07  Score=80.89  Aligned_cols=70  Identities=20%  Similarity=0.349  Sum_probs=52.9

Q ss_pred             hHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHH
Q psy15334        338 NTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFI  417 (451)
Q Consensus       338 ~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fi  417 (451)
                      ..+.+|+.....    .+++|+|||.+|+||||+++++..+..        ...++.++++.+|+.|+.+..+..|+..+
T Consensus        60 ~~~~~~i~~~~~----~~~~VlVHC~~G~~RS~~v~~ayLm~~--------~~~~~~~A~~~v~~~rp~~~~~~~~~~~L  127 (133)
T PF00782_consen   60 DQAVEFIENAIS----EGGKVLVHCKAGLSRSGAVAAAYLMKK--------NGMSLEEAIEYVRSRRPQINPNPSFIRQL  127 (133)
T ss_dssp             HHHHHHHHHHHH----TTSEEEEEESSSSSHHHHHHHHHHHHH--------HTSSHHHHHHHHHHHSTTSTHHHHHHHHH
T ss_pred             HHHHHhhhhhhc----ccceeEEEeCCCcccchHHHHHHHHHH--------cCCCHHHHHHHHHHHCCCCCCCHHHHHHH
Confidence            345555554432    267999999999999999998765432        23588899999999999998888887776


Q ss_pred             HH
Q psy15334        418 YE  419 (451)
Q Consensus       418 y~  419 (451)
                      +.
T Consensus       128 ~~  129 (133)
T PF00782_consen  128 YE  129 (133)
T ss_dssp             HH
T ss_pred             HH
Confidence            53


No 28 
>PRK12361 hypothetical protein; Provisional
Probab=98.03  E-value=3.2e-05  Score=83.27  Aligned_cols=82  Identities=18%  Similarity=0.280  Sum_probs=59.7

Q ss_pred             CCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCC
Q psy15334        329 PDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMV  408 (451)
Q Consensus       329 pd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~V  408 (451)
                      .|+..| +.+.+-+.++.+.+... ..++|+|||.+|.|||+++++...+.    ..   ...++.+++..+|+.||.+.
T Consensus       151 ~D~~~p-~~~~l~~a~~~i~~~~~-~~~~VlVHC~~G~sRSa~vv~ayLm~----~~---~~~~~~eA~~~vr~~Rp~v~  221 (547)
T PRK12361        151 LDHSVP-TLAQLNQAINWIHRQVR-ANKSVVVHCALGRGRSVLVLAAYLLC----KD---PDLTVEEVLQQIKQIRKTAR  221 (547)
T ss_pred             CCCCCC-cHHHHHHHHHHHHHHHH-CCCeEEEECCCCCCcHHHHHHHHHHH----hc---cCCCHHHHHHHHHHHCCCCC
Confidence            366666 44555444444443332 25799999999999999998876432    22   35688999999999999999


Q ss_pred             CCHHHHHHHHH
Q psy15334        409 QSLAQYAFIYE  419 (451)
Q Consensus       409 qt~~QY~fiy~  419 (451)
                      .+..|...+-+
T Consensus       222 ~n~~q~~~l~~  232 (547)
T PRK12361        222 LNKRQLRALEK  232 (547)
T ss_pred             CCHHHHHHHHH
Confidence            99999877554


No 29 
>PF05706 CDKN3:  Cyclin-dependent kinase inhibitor 3 (CDKN3);  InterPro: IPR022778  This entry represents a domain found in cyclin-dependent kinase inhibitor 3 or kinase associated phosphatase proteins from several mammalian species. The cyclin-dependent kinase (Cdk)-associated protein phosphatase (KAP) is a human dual specificity protein phosphatase that dephosphorylates Cdk2 on threonine 160 in a cyclin-dependent manner [], []. This domain is also found in MAP kinase phosphatase and esterases. This entry contains both eukaryotic and bacterial proteins.; GO: 0004721 phosphoprotein phosphatase activity, 0004725 protein tyrosine phosphatase activity; PDB: 1FQ1_A 1FPZ_F.
Probab=97.96  E-value=1.4e-05  Score=71.46  Aligned_cols=89  Identities=22%  Similarity=0.280  Sum_probs=47.6

Q ss_pred             HhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEeecceEEEEEEEEeCCeeeEEEEEEeecCCCC
Q psy15334        252 MCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYGSWVLSELRMCRGGEHRMISHYHFTTWPDS  331 (451)
Q Consensus       252 MVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~~~~~r~l~v~~~~~~r~V~h~~~~~Wpd~  331 (451)
                      -+..+.+..||.|+...|..+-+    +|.-++..                             +.+-+..+|| -.||.
T Consensus        66 ~Lk~~G~~~Vvtl~~~~EL~~l~----Vp~L~~~~-----------------------------~~~Gi~~~h~-PI~D~  111 (168)
T PF05706_consen   66 RLKDWGAQDVVTLLTDHELARLG----VPDLGEAA-----------------------------QARGIAWHHL-PIPDG  111 (168)
T ss_dssp             HHHHTT--EEEE-S-HHHHHHTT-----TTHHHHH-----------------------------HHTT-EEEE-----TT
T ss_pred             HHHHCCCCEEEEeCcHHHHHHcC----CccHHHHH-----------------------------HHcCCEEEec-CccCC
Confidence            46678999999998777765432    23211000                             1122333444 46899


Q ss_pred             CCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHH
Q psy15334        332 GVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALD  376 (451)
Q Consensus       332 gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~  376 (451)
                      ++|+ ...+.+++..+.+.... ...|+|||..|.||||+++|..
T Consensus       112 ~aPd-~~~~~~i~~eL~~~L~~-g~~V~vHC~GGlGRtGlvAAcL  154 (168)
T PF05706_consen  112 SAPD-FAAAWQILEELAARLEN-GRKVLVHCRGGLGRTGLVAACL  154 (168)
T ss_dssp             S----HHHHHHHHHHHHHHHHT-T--EEEE-SSSSSHHHHHHHHH
T ss_pred             CCCC-HHHHHHHHHHHHHHHHc-CCEEEEECCCCCCHHHHHHHHH
Confidence            9995 56677777777766554 5689999999999999988643


No 30 
>PF03162 Y_phosphatase2:  Tyrosine phosphatase family;  InterPro: IPR004861 Protein tyrosine (pTyr) phosphorylation is a common post-translational modification which can create novel recognition motifs for protein interactions and cellular localisation, affect protein stability, and regulate enzyme activity. Consequently, maintaining an appropriate level of protein tyrosine phosphorylation is essential for many cellular functions. Tyrosine-specific protein phosphatases (PTPase; 3.1.3.48 from EC) catalyse the removal of a phosphate group attached to a tyrosine residue, using a cysteinyl-phosphate enzyme intermediate. These enzymes are key regulatory components in signal transduction pathways (such as the MAP kinase pathway) and cell cycle control, and are important in the control of cell growth, proliferation, differentiation and transformation [, ]. The PTP superfamily can be divided into four subfamilies []:   (1) pTyr-specific phosphatases (2) dual specificity phosphatases (dTyr and dSer/dThr) (3) Cdc25 phosphatases (dTyr and/or dThr) (4) LMW (low molecular weight) phosphatases   Based on their cellular localisation, PTPases are also classified as:   Receptor-like, which are transmembrane receptors that contain PTPase domains [] Non-receptor (intracellular) PTPases []   All PTPases carry the highly conserved active site motif C(X)5R (PTP signature motif), employ a common catalytic mechanism, and share a similar core structure made of a central parallel beta-sheet with flanking alpha-helices containing a beta-loop-alpha-loop that encompasses the PTP signature motif []. Functional diversity between PTPases is endowed by regulatory domains and subunits.  This entry represents protein-tyrosine phosphatases predominantly from fungi, plants and bacteria, several of which are putative enzymes. These proteins are closely related to the Y-phosphatase and DSPc families. This entry includes the PTPase SIW14 from Saccharomyces cerevisiae (Baker's yeast), which plays a role in actin filament organisation and endocytosis.; PDB: 2Q47_A 1XRI_A.
Probab=97.62  E-value=9.3e-05  Score=66.95  Aligned_cols=72  Identities=22%  Similarity=0.329  Sum_probs=40.8

Q ss_pred             EEEEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHH
Q psy15334        320 ISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHG  399 (451)
Q Consensus       320 V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~  399 (451)
                      +.|+....+.+..++.+.+.+.+.++.+...   .+.||+|||..|..|||+.+|+...   +      ..-++.+++.+
T Consensus        59 l~~~~~~~~~~~~~~~~~~~v~~aL~~ild~---~n~PvLiHC~~G~~rTG~vvg~lRk---~------Q~W~~~~i~~E  126 (164)
T PF03162_consen   59 LIHIPMSSSKDPWVPISEEQVAEALEIILDP---RNYPVLIHCNHGKDRTGLVVGCLRK---L------QGWSLSSIFDE  126 (164)
T ss_dssp             EEE-------GGG----HHHHHHHHHHHH-G---GG-SEEEE-SSSSSHHHHHHHHHHH---H------TTB-HHHHHHH
T ss_pred             EEEeccccccCccccCCHHHHHHHHHHHhCC---CCCCEEEEeCCCCcchhhHHHHHHH---H------cCCCHHHHHHH
Confidence            4455555555545666677777777655432   3679999999999999999986653   3      23466788888


Q ss_pred             HHhh
Q psy15334        400 LRRQ  403 (451)
Q Consensus       400 lR~q  403 (451)
                      +|.-
T Consensus       127 y~~f  130 (164)
T PF03162_consen  127 YRRF  130 (164)
T ss_dssp             HHHH
T ss_pred             HHHh
Confidence            8763


No 31 
>PF14566 PTPlike_phytase:  Inositol hexakisphosphate; PDB: 1U24_A 2PSZ_B 3MOZ_A 3D1H_B 2B4P_B 3D1Q_A 2B4O_A 3MMJ_B 1U25_A 1U26_B ....
Probab=97.28  E-value=0.00045  Score=61.48  Aligned_cols=49  Identities=29%  Similarity=0.572  Sum_probs=34.6

Q ss_pred             CCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHH
Q psy15334        328 WPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVL  379 (451)
Q Consensus       328 Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l  379 (451)
                      .+|+..|. ++.+-.|+..++..  +....+++||.+|.|||.+|.++..|+
T Consensus       100 itd~~~P~-~~~iD~fi~~v~~~--p~~~~l~fhC~~G~GRTTt~Mv~~~li  148 (149)
T PF14566_consen  100 ITDHQAPD-PEDIDAFINFVKSL--PKDTWLHFHCQAGRGRTTTFMVMYDLI  148 (149)
T ss_dssp             E-TTS----HHHHHHHHHHHHTS---TT-EEEEE-SSSSHHHHHHHHHHHHH
T ss_pred             CCCcCCCC-HHHHHHHHHHHHhC--CCCCeEEEECCCCCCHHHHHHHHHHHh
Confidence            36777674 67788888888765  446789999999999999999987665


No 32 
>KOG2283|consensus
Probab=96.98  E-value=0.0012  Score=68.41  Aligned_cols=89  Identities=24%  Similarity=0.315  Sum_probs=59.4

Q ss_pred             ecCCCCCCCCChhHHHHHHHHHHHhcCCCCC-CEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhc
Q psy15334        326 TTWPDSGVPSSPNTLVSFVLSFRTHISPAQH-PIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQR  404 (451)
Q Consensus       326 ~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~-PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR  404 (451)
                      .+|+||.+|. -+.|..|.+.+...+..+.. =++|||.+|-||||+.+++..+...+.... ...+|.+..-   |-..
T Consensus        78 ~~~~Dh~~P~-L~~l~~~c~~~~~WL~~d~~nVvvvHCk~Gkgrtg~~icA~L~~~~~~~ta-~eald~~~~k---R~~~  152 (434)
T KOG2283|consen   78 FGFDDHNPPP-LELLCPFCKSMDNWLSEDPKNVVVVHCKAGKGRTGVMICAYLIYSGISATA-EEALDYFNEK---RFDE  152 (434)
T ss_pred             cCCCCCCCCc-HHHHHHHHHCHHHHHhcCccceEEEEccCCCcceEEEEeHHHHhhhhcCCH-HHHHHHHhhh---hccc
Confidence            4899999884 56777888888777665443 479999999999999988887776665421 0234444322   2111


Q ss_pred             c--CCCCCHHHHHHHHH
Q psy15334        405 V--LMVQSLAQYAFIYE  419 (451)
Q Consensus       405 ~--~~Vqt~~QY~fiy~  419 (451)
                      .  +-+-...|-.|++.
T Consensus       153 ~~~~~~~~PSq~RYv~Y  169 (434)
T KOG2283|consen  153 GKSKGVTIPSQRRYVGY  169 (434)
T ss_pred             cccCCccCchhhHHHHH
Confidence            2  34566777777654


No 33 
>KOG1718|consensus
Probab=96.50  E-value=0.0057  Score=54.55  Aligned_cols=56  Identities=18%  Similarity=0.276  Sum_probs=37.9

Q ss_pred             HHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccC
Q psy15334        339 TLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVL  406 (451)
Q Consensus       339 ~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~  406 (451)
                      .+.+.+..|...    .|-++|||-|||+||+.+|.+..+    +.+    ..++.++-..+...||-
T Consensus        82 ~vAD~I~~v~~~----gG~TLvHC~AGVSRSAsLClAYLm----K~~----~msLreAy~~vKa~Rpi  137 (198)
T KOG1718|consen   82 PVADKIHSVIMR----GGKTLVHCVAGVSRSASLCLAYLM----KYH----CMSLREAYHWVKARRPI  137 (198)
T ss_pred             HHHHHHHHHHhc----CCcEEEEEccccchhHHHHHHHHH----HHc----cchHHHHHHHHHhhCce
Confidence            344555555443    688999999999999999977643    332    45566666666666653


No 34 
>PF13350 Y_phosphatase3:  Tyrosine phosphatase family; PDB: 1YWF_A 2OZ5_B.
Probab=96.50  E-value=0.004  Score=56.17  Aligned_cols=39  Identities=21%  Similarity=0.138  Sum_probs=24.0

Q ss_pred             cEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccc
Q psy15334        234 QYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRD  273 (451)
Q Consensus       234 ~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~  273 (451)
                      .++=+=.|..-|-+|.+.+. +.++++||=|-...|..+.
T Consensus        20 ~lyRS~~l~~lt~~d~~~L~-~lgI~tIiDLRs~~E~~~~   58 (164)
T PF13350_consen   20 RLYRSGNLSNLTEADLERLR-ELGIRTIIDLRSPTERERA   58 (164)
T ss_dssp             SEEEES--TT--HHHHHHHH-HTT--EEEE-S-HHHHHHH
T ss_pred             cEEecCCcCcCCHHHHHHHH-hCCCCEEEECCCccccccC
Confidence            45556667778999999888 9999999999887776543


No 35 
>TIGR01244 conserved hypothetical protein TIGR01244. No member of this family is characterized. The member from Xylella fastidiosa is a longer protein with an N-terminal region described by this model, followed by a metallo-beta-lactamase family domain and an additional C-terminal region. Members scoring above the trusted cutoff are limited to the proteobacteria.
Probab=96.40  E-value=0.0082  Score=52.45  Aligned_cols=56  Identities=20%  Similarity=0.427  Sum_probs=38.4

Q ss_pred             ChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhcc
Q psy15334        336 SPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRV  405 (451)
Q Consensus       336 ~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~  405 (451)
                      +...+..|.+.+..    ..+||++||.+|- |||++.++....     .   + ++...++...|..-.
T Consensus        71 ~~~~v~~f~~~~~~----~~~pvL~HC~sG~-Rt~~l~al~~~~-----~---g-~~~~~i~~~~~~~G~  126 (135)
T TIGR01244        71 TPDDVETFRAAIGA----AEGPVLAYCRSGT-RSSLLWGFRQAA-----E---G-VPVEEIVRRAQAAGY  126 (135)
T ss_pred             CHHHHHHHHHHHHh----CCCCEEEEcCCCh-HHHHHHHHHHHH-----c---C-CCHHHHHHHHHHcCC
Confidence            44556666665542    2589999999999 999988765332     1   3 567777777766543


No 36 
>COG5350 Predicted protein tyrosine phosphatase [General function prediction only]
Probab=96.38  E-value=0.024  Score=49.83  Aligned_cols=82  Identities=18%  Similarity=0.344  Sum_probs=49.5

Q ss_pred             EEEeecCCCCC-CCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHH
Q psy15334        322 HYHFTTWPDSG-VPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGL  400 (451)
Q Consensus       322 h~~~~~Wpd~g-vP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~l  400 (451)
                      ||+=..-||.| .-.....+..++.-+++.  +...|+||||.+|++||-....+..+ . +  .   ...|=-++...|
T Consensus        61 ~fnDI~~~~~g~~ap~e~Hv~~i~DF~~~w--p~~apllIHC~aGISRStA~A~i~a~-a-l--a---~~~de~ela~~L  131 (172)
T COG5350          61 HFNDIAEPDDGWIAPGEAHVRAIIDFADEW--PRFAPLLIHCYAGISRSTAAALIAAL-A-L--A---PDMDETELAERL  131 (172)
T ss_pred             eeccccCCCccccCCCHHHHHHHHHHHhcC--ccccceeeeeccccccchHHHHHHHH-h-h--c---cccChHHHHHHH
Confidence            44444556555 333334433333333333  34689999999999999765443222 1 1  1   345666788899


Q ss_pred             HhhccCCCCCHH
Q psy15334        401 RRQRVLMVQSLA  412 (451)
Q Consensus       401 R~qR~~~Vqt~~  412 (451)
                      |.-+|.+-.+.-
T Consensus       132 ra~sp~atPN~R  143 (172)
T COG5350         132 RALSPYATPNPR  143 (172)
T ss_pred             HhcCcccCCChh
Confidence            999998877643


No 37 
>KOG1716|consensus
Probab=96.31  E-value=0.013  Score=57.95  Aligned_cols=61  Identities=20%  Similarity=0.354  Sum_probs=46.4

Q ss_pred             hHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCC
Q psy15334        338 NTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQS  410 (451)
Q Consensus       338 ~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt  410 (451)
                      ...++|+..++..    .+-|+|||.+|++||.+++.+++|..        ....+.++.+.++..|+....+
T Consensus       142 ~~~~~fI~~a~~~----~~~vlVHC~~GvSRSat~viAYlM~~--------~~~~l~~A~~~vk~~R~~i~PN  202 (285)
T KOG1716|consen  142 PEAISFIEKAREK----GGKVLVHCQAGVSRSATLVIAYLMKY--------EGLSLEDAYELVKSRRPIISPN  202 (285)
T ss_pred             HHHHHHHHHHHhC----CCeEEEEcCCccchhHHHHHHHHHHH--------cCCCHHHHHHHHHHhCCccCCC
Confidence            4456677766654    67899999999999999998776542        2456778888888888877664


No 38 
>KOG1719|consensus
Probab=96.03  E-value=0.09  Score=46.44  Aligned_cols=55  Identities=24%  Similarity=0.310  Sum_probs=43.8

Q ss_pred             CCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHH
Q psy15334        355 QHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFI  417 (451)
Q Consensus       355 ~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fi  417 (451)
                      ..-+-|||-||-|||.|.++...+.    .    +.-....++..+|+.|+...=-..|..-+
T Consensus       109 GktvYVHCKAGRtRSaTvV~cYLmq----~----~~wtpe~A~~~vr~iRp~VlL~~~Qw~~l  163 (183)
T KOG1719|consen  109 GKTVYVHCKAGRTRSATVVACYLMQ----H----KNWTPEAAVEHVRKIRPRVLLRPAQWDVL  163 (183)
T ss_pred             CCeEEEEecCCCccchhhhhhhhhh----h----cCCCHHHHHHHHHhcCcceeecHHHHHHH
Confidence            4578999999999999998876432    1    34566789999999999988888887543


No 39 
>PF04273 DUF442:  Putative phosphatase (DUF442);  InterPro: IPR005939 Although this domain is uncharacterised it seems likely that it performs a phosphatase function.; GO: 0016787 hydrolase activity; PDB: 2F46_A 3GXH_B 3GXG_B.
Probab=95.28  E-value=0.034  Score=46.77  Aligned_cols=38  Identities=21%  Similarity=0.525  Sum_probs=27.7

Q ss_pred             ChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHH
Q psy15334        336 SPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTV  378 (451)
Q Consensus       336 ~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~  378 (451)
                      +.+.+..|.+.+...    .+||++||..|. |++++.++.-.
T Consensus        71 ~~~~v~~f~~~l~~~----~~Pvl~hC~sG~-Ra~~l~~l~~~  108 (110)
T PF04273_consen   71 TEEDVEAFADALESL----PKPVLAHCRSGT-RASALWALAQA  108 (110)
T ss_dssp             -HHHHHHHHHHHHTT----TTSEEEE-SCSH-HHHHHHHHHHH
T ss_pred             CHHHHHHHHHHHHhC----CCCEEEECCCCh-hHHHHHHHHhh
Confidence            567777777766653    579999999998 99988876543


No 40 
>COG2365 Protein tyrosine/serine phosphatase [Signal transduction mechanisms]
Probab=95.09  E-value=0.038  Score=53.44  Aligned_cols=27  Identities=41%  Similarity=0.695  Sum_probs=23.5

Q ss_pred             CCCEEEEcCCCCchhhhHHHHHHHHHH
Q psy15334        355 QHPIVVHCSAGVGRTGTFIALDTVLQR  381 (451)
Q Consensus       355 ~~PivVHCs~GvGRtG~fiai~~~l~~  381 (451)
                      ++|+++||.+|.-|||..+|+...+..
T Consensus       136 ~~PvL~HC~~GkdRTGl~~al~r~~~~  162 (249)
T COG2365         136 NGPVLIHCTAGKDRTGLVAALYRKLVG  162 (249)
T ss_pred             cCCEEEecCCCCcchHHHHHHHHHHhC
Confidence            689999999999999999998766543


No 41 
>PLN02727 NAD kinase
Probab=94.66  E-value=0.071  Score=59.51  Aligned_cols=42  Identities=21%  Similarity=0.316  Sum_probs=34.0

Q ss_pred             CCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHH
Q psy15334        334 PSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTV  378 (451)
Q Consensus       334 P~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~  378 (451)
                      +.+.+.+.+|.+.+++.   ..+||++||..|.+|||+++|....
T Consensus       323 apt~EqVe~fa~~l~~s---lpkPVLvHCKSGarRAGamvA~yl~  364 (986)
T PLN02727        323 APSAEQVEKFASLVSDS---SKKPIYLHSKEGVWRTSAMVSRWKQ  364 (986)
T ss_pred             CCCHHHHHHHHHHHHhh---cCCCEEEECCCCCchHHHHHHHHHH
Confidence            34568888888888543   2679999999999999999998765


No 42 
>KOG0196|consensus
Probab=93.82  E-value=0.14  Score=56.19  Aligned_cols=21  Identities=43%  Similarity=0.457  Sum_probs=18.5

Q ss_pred             EcCCCCCCCeeeEEEEEeeCC
Q psy15334         47 CNGPLKQGTTYRVKVRAFTSP   67 (451)
Q Consensus        47 ~NgpL~p~~~Y~i~vrA~t~~   67 (451)
                      .--.|+||+.|.|.|||.|..
T Consensus       501 ti~gL~p~t~YvfqVRarT~a  521 (996)
T KOG0196|consen  501 TITGLKPGTVYVFQVRARTAA  521 (996)
T ss_pred             EeeccCCCcEEEEEEEEeccc
Confidence            345799999999999999986


No 43 
>KOG4471|consensus
Probab=93.81  E-value=0.086  Score=55.61  Aligned_cols=33  Identities=27%  Similarity=0.612  Sum_probs=26.4

Q ss_pred             HHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHH
Q psy15334        347 FRTHISPAQHPIVVHCSAGVGRTGTFIALDTVL  379 (451)
Q Consensus       347 v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l  379 (451)
                      |.......+.||+||||+|=.||..++++..++
T Consensus       366 Ia~kVe~~~~sVlVHCSDGWDRT~QlvsLA~Ll  398 (717)
T KOG4471|consen  366 IADKVESESRSVLVHCSDGWDRTAQLVSLAMLL  398 (717)
T ss_pred             HHHHHhcCCceEEEEcCCCccchHHHHHHHHHH
Confidence            333445567899999999999999999987764


No 44 
>KOG1717|consensus
Probab=93.60  E-value=0.18  Score=48.40  Aligned_cols=68  Identities=19%  Similarity=0.359  Sum_probs=42.0

Q ss_pred             HHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHH
Q psy15334        339 TLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIY  418 (451)
Q Consensus       339 ~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy  418 (451)
                      ..+.||...|..    +.-++|||-+|+.||-|.+++..| +.+.       +..-++...+.+.+.+.-.+   |-|++
T Consensus       239 EAIsfIdeArsk----~cgvLVHClaGISRSvTvtvaYLM-qkl~-------lslndAyd~Vk~kksnisPN---FnFMg  303 (343)
T KOG1717|consen  239 EAISFIDEARSK----NCGVLVHCLAGISRSVTVTVAYLM-QKLN-------LSLNDAYDFVKHKKSNISPN---FNFMG  303 (343)
T ss_pred             HHHHHHHHhhcc----CCcEEEeeeccccchhHHHHHHHH-HHhc-------cchhhHHHHHHHhccCCCCC---cchhH
Confidence            346677666543    457999999999999999887644 3332       23334444444445554444   45666


Q ss_pred             HHH
Q psy15334        419 ECV  421 (451)
Q Consensus       419 ~~l  421 (451)
                      +.+
T Consensus       304 QLl  306 (343)
T KOG1717|consen  304 QLL  306 (343)
T ss_pred             HHH
Confidence            654


No 45 
>PF07353 Uroplakin_II:  Uroplakin II;  InterPro: IPR009952 This family contains uroplakin II, which is approximately 180 residues long and seems to be restricted to mammals. Uroplakin II is an integral membrane protein, and is one of the components of the apical plaques of mammalian urothelium formed by the asymmetric unit membrane - this is believed to play a role in strengthening the urothelial apical surface to prevent the cells from rupturing during bladder distension [].; GO: 0016044 cellular membrane organization, 0030176 integral to endoplasmic reticulum membrane
Probab=90.87  E-value=0.29  Score=43.28  Aligned_cols=25  Identities=20%  Similarity=0.239  Sum_probs=21.1

Q ss_pred             CCCceEcCCCCCCCeeeEEEEEeeC
Q psy15334         42 NEAHYCNGPLKQGTTYRVKVRAFTS   66 (451)
Q Consensus        42 ~~~~~~NgpL~p~~~Y~i~vrA~t~   66 (451)
                      +-.+|++++|.||+.|.++...-.+
T Consensus       100 rlsaYqVtNL~pGTkY~isY~Vtkg  124 (184)
T PF07353_consen  100 RLSAYQVTNLQPGTKYYISYLVTKG  124 (184)
T ss_pred             cceeEEeeccCCCcEEEEEEEEecC
Confidence            4578999999999999999876444


No 46 
>KOG1572|consensus
Probab=88.40  E-value=1.3  Score=42.02  Aligned_cols=28  Identities=29%  Similarity=0.585  Sum_probs=22.9

Q ss_pred             cCCCCCCEEEEcCCCCchhhhHHHHHHH
Q psy15334        351 ISPAQHPIVVHCSAGVGRTGTFIALDTV  378 (451)
Q Consensus       351 ~~~~~~PivVHCs~GvGRtG~fiai~~~  378 (451)
                      ....+.|+++||.-|--|||++++...-
T Consensus       144 ld~~N~P~Lihc~rGkhRtg~lVgclRk  171 (249)
T KOG1572|consen  144 LDKRNYPILIHCKRGKHRTGCLVGCLRK  171 (249)
T ss_pred             hcccCCceEEecCCCCcchhhhHHHHHH
Confidence            3445789999999999999999985543


No 47 
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=87.78  E-value=0.17  Score=43.29  Aligned_cols=25  Identities=16%  Similarity=0.186  Sum_probs=13.1

Q ss_pred             chhhhHH--HHHHHHhhheeeeeeccc
Q psy15334         87 WLLVMGF--LVLIVLPVLFIIIRRKIV  111 (451)
Q Consensus        87 ~ii~gv~--Liliil~vll~~~rrr~~  111 (451)
                      +||+||+  +|+++++++|+++|+|++
T Consensus        68 ~Ii~gv~aGvIg~Illi~y~irR~~Kk   94 (122)
T PF01102_consen   68 GIIFGVMAGVIGIILLISYCIRRLRKK   94 (122)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHS--
T ss_pred             ehhHHHHHHHHHHHHHHHHHHHHHhcc
Confidence            5556665  455555555666655553


No 48 
>PF08693 SKG6:  Transmembrane alpha-helix domain;  InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=87.26  E-value=0.39  Score=32.51  Aligned_cols=21  Identities=29%  Similarity=0.407  Sum_probs=9.7

Q ss_pred             hhhhHHHHHHHHhhhee-eeee
Q psy15334         88 LLVMGFLVLIVLPVLFI-IIRR  108 (451)
Q Consensus        88 ii~gv~Liliil~vll~-~~rr  108 (451)
                      +++++..|++++++++| ++||
T Consensus        17 VvVPV~vI~~vl~~~l~~~~rR   38 (40)
T PF08693_consen   17 VVVPVGVIIIVLGAFLFFWYRR   38 (40)
T ss_pred             EEechHHHHHHHHHHhheEEec
Confidence            33444444555544444 4444


No 49 
>PF06602 Myotub-related:  Myotubularin-like phosphatase domain;  InterPro: IPR010569 This family represents a region within eukaryotic myotubularin-related proteins that is sometimes found with IPR004182 from INTERPRO. Myotubularin is a dual-specific lipid phosphatase that dephosphorylates phosphatidylinositol 3-phosphate and phosphatidylinositol (3,5)-bi-phosphate []. Mutations in gene encoding myotubularin-related proteins have been associated with disease [].; GO: 0016791 phosphatase activity, 0016311 dephosphorylation; PDB: 1M7R_B 1LW3_A 1ZVR_A 1ZSQ_A 2YF0_A.
Probab=86.69  E-value=0.88  Score=46.36  Aligned_cols=41  Identities=24%  Similarity=0.413  Sum_probs=26.4

Q ss_pred             HHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHH
Q psy15334        339 TLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVL  379 (451)
Q Consensus       339 ~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l  379 (451)
                      .++.-...+-+.......+|+|||++|-.||..++++..++
T Consensus       215 ~~L~~a~~i~~~l~~~~~~Vlvh~~dGwDrt~q~~sL~ql~  255 (353)
T PF06602_consen  215 SILSGASRIADLLHDEGSSVLVHCSDGWDRTSQLSSLAQLL  255 (353)
T ss_dssp             HHHHHHHHHHHHHHTT--EEEEECTTSSSHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHhhccCceEEEEcCCCCcccHHHHHHHHHH
Confidence            34443334444443456799999999999999988765543


No 50 
>KOG2386|consensus
Probab=85.39  E-value=1.1  Score=45.85  Aligned_cols=88  Identities=19%  Similarity=0.256  Sum_probs=58.9

Q ss_pred             eecCCCCCCCCChhHHHHHHHHHHHhcC---CCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHH
Q psy15334        325 FTTWPDSGVPSSPNTLVSFVLSFRTHIS---PAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLR  401 (451)
Q Consensus       325 ~~~Wpd~gvP~~~~~ll~fi~~v~~~~~---~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR  401 (451)
                      ...-+-+++-++....-.|++.+.....   -...=|.|||..|..|||-+|+...+    ..    +...+-++++.+-
T Consensus        91 K~~c~g~~~vp~~~~v~~fv~~v~~f~~~~~~~~~LI~vhcthG~NrtgyLI~~yL~----~~----~~~s~~~aik~f~  162 (393)
T KOG2386|consen   91 KRNCPGRGVVPRTELVDKFVKLVKGFVDDTKLDDELIGVHCTHGLNRTGYLICAYLA----DV----GGYSSSEAIKRFA  162 (393)
T ss_pred             EeccCCcccCCCccchHHHHHHHHHHHhcccCCCCEEEEeCCCcccccceeeeeeee----ec----cCccHHHHHHHHH
Confidence            3455666633344455566666655433   23445999999999999987654322    22    3378889999999


Q ss_pred             hhccCCCCCHHHHHHHHHH
Q psy15334        402 RQRVLMVQSLAQYAFIYEC  420 (451)
Q Consensus       402 ~qR~~~Vqt~~QY~fiy~~  420 (451)
                      ..|+..+.-.+-+.=+|..
T Consensus       163 ~~r~~gi~k~dyi~~L~~~  181 (393)
T KOG2386|consen  163 DARPPGIEKQDYIDALYSR  181 (393)
T ss_pred             HhCCCccCchHHHHHHhhc
Confidence            9999998887766655543


No 51 
>KOG1089|consensus
Probab=82.40  E-value=1.9  Score=46.02  Aligned_cols=30  Identities=30%  Similarity=0.498  Sum_probs=23.2

Q ss_pred             HhcCCCCCCEEEEcCCCCchhhhHHHHHHH
Q psy15334        349 THISPAQHPIVVHCSAGVGRTGTFIALDTV  378 (451)
Q Consensus       349 ~~~~~~~~PivVHCs~GvGRtG~fiai~~~  378 (451)
                      +.+...+.+||||||+|-.||...+.+.-+
T Consensus       338 ~~l~~~~~sVlvhcsdGwDrT~qV~SLaQl  367 (573)
T KOG1089|consen  338 KCLSSEGASVLVHCSDGWDRTCQVSSLAQL  367 (573)
T ss_pred             HHHHhCCCeEEEEccCCcchhHHHHHHHHH
Confidence            334445679999999999999998876543


No 52 
>PF01299 Lamp:  Lysosome-associated membrane glycoprotein (Lamp);  InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below.   +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+  In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100.  Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail.   Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=80.79  E-value=3.8  Score=40.86  Aligned_cols=54  Identities=17%  Similarity=0.242  Sum_probs=26.3

Q ss_pred             CCeeeEEEEEeeCCCCcccccCCCCcccCCC----ccchhhhHHHH--HHHHhhheeeeeeccc
Q psy15334         54 GTTYRVKVRAFTSPELFTDTIYSPPVSTGKD----VSWLLVMGFLV--LIVLPVLFIIIRRKIV  111 (451)
Q Consensus        54 ~~~Y~i~vrA~t~~~~~t~s~~S~pi~t~~~----~~~ii~gv~Li--liil~vll~~~rrr~~  111 (451)
                      -....+.|+||-..+.    .|+....+..|    -.+|++|+.|.  +||+++.+++.|||.+
T Consensus       241 l~~~~lqvQaF~~~~~----~Fg~a~~C~~D~~~~~vPIaVG~~La~lvlivLiaYli~Rrr~~  300 (306)
T PF01299_consen  241 LNLSDLQVQAFRVKNN----TFGTAEECSSDDTSDLVPIAVGAALAGLVLIVLIAYLIGRRRSR  300 (306)
T ss_pred             EEEeEEEEEEEEecCC----CCCChhcCCcCCccchHHHHHHHHHHHHHHHHHHhheeEecccc
Confidence            4555678888865322    14444334332    23556676533  2222333555555543


No 53 
>PF00041 fn3:  Fibronectin type III domain;  InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes []. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis []. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface [], although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene []. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure []. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface []. The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=79.40  E-value=3.1  Score=31.76  Aligned_cols=24  Identities=42%  Similarity=0.594  Sum_probs=21.1

Q ss_pred             CceEcCCCCCCCeeeEEEEEeeCC
Q psy15334         44 AHYCNGPLKQGTTYRVKVRAFTSP   67 (451)
Q Consensus        44 ~~~~NgpL~p~~~Y~i~vrA~t~~   67 (451)
                      ..+.-..|+|++.|.|.|+|++..
T Consensus        56 ~~~~i~~L~p~t~Y~~~v~a~~~~   79 (85)
T PF00041_consen   56 TSYTITGLQPGTTYEFRVRAVNSD   79 (85)
T ss_dssp             SEEEEESCCTTSEEEEEEEEEETT
T ss_pred             eeeeeccCCCCCEEEEEEEEEeCC
Confidence            367888999999999999999874


No 54 
>PF12273 RCR:  Chitin synthesis regulation, resistance to Congo red;  InterPro: IPR020999  RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 []. 
Probab=77.83  E-value=0.63  Score=40.27  Aligned_cols=7  Identities=29%  Similarity=0.757  Sum_probs=3.1

Q ss_pred             chhhhHH
Q psy15334         87 WLLVMGF   93 (451)
Q Consensus        87 ~ii~gv~   93 (451)
                      |++.+++
T Consensus         2 W~l~~ii    8 (130)
T PF12273_consen    2 WVLFAII    8 (130)
T ss_pred             eeeHHHH
Confidence            3444444


No 55 
>PF01034 Syndecan:  Syndecan domain;  InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains:   A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains;  A transmembrane region;  A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins.    The proteins known to belong to this family are:    Syndecan 1.  Syndecan 2 or fibroglycan.  Syndecan 3 or neuroglycan or N-syndecan.  Syndecan 4 or amphiglycan or ryudocan.  Drosophila syndecan.   Caenorhabditis elegans probable syndecan (F57C7.3).    Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=73.85  E-value=1.1  Score=33.61  Aligned_cols=24  Identities=29%  Similarity=0.555  Sum_probs=0.4

Q ss_pred             chhhhHH--HHHHHHhhheeeeeecc
Q psy15334         87 WLLVMGF--LVLIVLPVLFIIIRRKI  110 (451)
Q Consensus        87 ~ii~gv~--Liliil~vll~~~rrr~  110 (451)
                      ++|+|++  +++++++++|++.|.|+
T Consensus        13 avIaG~Vvgll~ailLIlf~iyR~rk   38 (64)
T PF01034_consen   13 AVIAGGVVGLLFAILLILFLIYRMRK   38 (64)
T ss_dssp             ------------------------S-
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            4555554  44444445555555433


No 56 
>PF15102 TMEM154:  TMEM154 protein family
Probab=69.07  E-value=7.8  Score=34.10  Aligned_cols=8  Identities=38%  Similarity=0.879  Sum_probs=3.6

Q ss_pred             hHHHhHHh
Q psy15334        166 IAVEFEDL  173 (451)
Q Consensus       166 ~~~Ef~~l  173 (451)
                      +.-|-|+|
T Consensus       123 meiEmeel  130 (146)
T PF15102_consen  123 MEIEMEEL  130 (146)
T ss_pred             hhhhHHHH
Confidence            33444555


No 57 
>PF02439 Adeno_E3_CR2:  Adenovirus E3 region protein CR2;  InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=67.20  E-value=2.5  Score=28.23  Aligned_cols=22  Identities=9%  Similarity=0.245  Sum_probs=11.7

Q ss_pred             cchhhhHH--HHHHHHhhheeeee
Q psy15334         86 SWLLVMGF--LVLIVLPVLFIIIR  107 (451)
Q Consensus        86 ~~ii~gv~--Liliil~vll~~~r  107 (451)
                      .++|+||+  ++++++.+++|.+.
T Consensus         6 IaIIv~V~vg~~iiii~~~~YaCc   29 (38)
T PF02439_consen    6 IAIIVAVVVGMAIIIICMFYYACC   29 (38)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHH
Confidence            35677776  44444555554433


No 58 
>PF04478 Mid2:  Mid2 like cell wall stress sensor;  InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=63.59  E-value=5.5  Score=35.26  Aligned_cols=17  Identities=29%  Similarity=0.567  Sum_probs=8.2

Q ss_pred             HHHHHHhhheeeeeecc
Q psy15334         94 LVLIVLPVLFIIIRRKI  110 (451)
Q Consensus        94 Liliil~vll~~~rrr~  110 (451)
                      |||++++++|++++|++
T Consensus        62 ill~il~lvf~~c~r~k   78 (154)
T PF04478_consen   62 ILLGILALVFIFCIRRK   78 (154)
T ss_pred             HHHHHHHhheeEEEecc
Confidence            44444555544544444


No 59 
>PF05283 MGC-24:  Multi-glycosylated core protein 24 (MGC-24);  InterPro: IPR007947 CD164 is a mucin-like receptor, or sialomucin, with specificity in receptor/ ligand interactions that depends on the structural characteristics of the mucin-like receptor. Its functions include mediating, or regulating, haematopoietic progenitor cell adhesion and the negative regulation of their growth and/or-differentiation. It exists in the native state as a disulphide- linked homodimer of two 80-85kDa subunits. It is usually expressed by CD34+ and CD341o/- haematopoietic stem cells and associated microenvironmental cells. It contains, in its extracellular region, two mucin domains (I and II) linked by a non-mucin domain, which has been predicted to contain intra- disulphide bridges. This receptor may play a key role in haematopoiesis by facilitating the adhesion of human CD34+ cells to bone marrow stroma and by negatively regulating CD34+ CD341o/- haematopoietic progenitor cell proliferation. These effects involve the CD164 class I and/or II epitopes recognised by the monoclonal antibodies (mAbs) 105A5 and 103B2/9E10. These epitopes are carbohydrate-dependent and are located on the N-terminal mucin domain I [, ]. It has been found that murine MGC-24v and rat endolyn share significant sequence similarities with human CD164. However, CD164 lacks the consensus glycosaminoglycan (GAG)-attachment site found in MGC-24; it is possible that GAG-association is responsible for the high molecular weight of the epithelial-derived MGC-24 glycoprotein [].  Genomic structure studies have placed CD164 within the mucin-subgroup that comprises multiple exons, and demonstrate the diverse chromosomal distribution of this family of molecules. Molecules with such multiple exons may have sophisticated regulatory mechanisms that involve not only post-translational modifications of the oligosaccharide side chains, but also differential exon usage. Although differences in the intron and exon sizes are seen between the mouse and human genes, the predicted proteins are similar in size and structure, maintaining functionally important motifs that regulate cell proliferation or subcellular distribution [].  CD164 is a gene whose expression depends on differential usage of poly- adenylation sites within the 3'-UTR. The conserved distribution of the 3.2- and 1.2-kb CD164 transcripts between mouse and human suggests that (i) a mechanism may exist to regulate tissue-specific polyadenylation, and (ii) differences in polyadenylation are important for the expression and function of CD164 in different tissues. Two other aspects of the structure of CD164 are of particular interest. First, it shares one of several conserved features of a cytokine-binding pocket - in this respect, it is notable that evidence exists for a class of cell-surface sialomucin modulators that directly interact with growth factor receptors to regulate their response to physiological ligands. Second, its cytoplasmic tail contains a C-terminal YHTL motif found in many endocytic membrane proteins or receptors. These Tyr-based motifs bind to adaptor proteins, which mediate the sorting of membrane proteins into transport vesicles from the plasma membrane to the endosomes, and between intracellular compartments. 
Probab=63.40  E-value=4.9  Score=36.96  Aligned_cols=24  Identities=25%  Similarity=0.364  Sum_probs=19.1

Q ss_pred             CccchhhhHHHHHHHHhhheeeee
Q psy15334         84 DVSWLLVMGFLVLIVLPVLFIIIR  107 (451)
Q Consensus        84 ~~~~ii~gv~Liliil~vll~~~r  107 (451)
                      |....|+||||.|-+++|+||++|
T Consensus       159 D~~SFiGGIVL~LGv~aI~ff~~K  182 (186)
T PF05283_consen  159 DAASFIGGIVLTLGVLAIIFFLYK  182 (186)
T ss_pred             chhhhhhHHHHHHHHHHHHHHHhh
Confidence            345679999999999888887765


No 60 
>PF12877 DUF3827:  Domain of unknown function (DUF3827);  InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells. 
Probab=62.56  E-value=3.2  Score=44.71  Aligned_cols=12  Identities=8%  Similarity=0.301  Sum_probs=9.1

Q ss_pred             CCCccchhhhHH
Q psy15334         82 GKDVSWLLVMGF   93 (451)
Q Consensus        82 ~~~~~~ii~gv~   93 (451)
                      ...++|||+||+
T Consensus       265 ~~~NlWII~gVl  276 (684)
T PF12877_consen  265 PPNNLWIIAGVL  276 (684)
T ss_pred             CCCCeEEEehHh
Confidence            456789988885


No 61 
>PF05808 Podoplanin:  Podoplanin;  InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes [].; GO: 0016021 integral to membrane; PDB: 3IET_X.
Probab=58.26  E-value=3.3  Score=36.97  Aligned_cols=28  Identities=21%  Similarity=0.295  Sum_probs=0.0

Q ss_pred             chhhhHHHHHHHHhhheeeeeecccCcC
Q psy15334         87 WLLVMGFLVLIVLPVLFIIIRRKIVGVT  114 (451)
Q Consensus        87 ~ii~gv~Liliil~vll~~~rrr~~~~~  114 (451)
                      +||+|+++-+.+|+.++++.-||.++|+
T Consensus       133 GIIVGVLlaIG~igGIIivvvRKmSGRy  160 (162)
T PF05808_consen  133 GIIVGVLLAIGFIGGIIIVVVRKMSGRY  160 (162)
T ss_dssp             ----------------------------
T ss_pred             eehhhHHHHHHHHhheeeEEeehhcccc
Confidence            4556665433334444444444555554


No 62 
>PF14575 EphA2_TM:  Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=55.59  E-value=5.6  Score=30.96  Aligned_cols=11  Identities=27%  Similarity=0.543  Sum_probs=4.4

Q ss_pred             hhhHHHHHHHH
Q psy15334         89 LVMGFLVLIVL   99 (451)
Q Consensus        89 i~gv~Liliil   99 (451)
                      ++|++++++++
T Consensus         7 ~~g~~~ll~~v   17 (75)
T PF14575_consen    7 IVGVLLLLVLV   17 (75)
T ss_dssp             HHHHHHHHHHH
T ss_pred             HHHHHHHHHhh
Confidence            34444333333


No 63 
>COG3453 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=54.92  E-value=22  Score=30.36  Aligned_cols=27  Identities=15%  Similarity=0.453  Sum_probs=19.5

Q ss_pred             ChhHHHHHHHHHHHhcCCCCCCEEEEcCCCC
Q psy15334        336 SPNTLVSFVLSFRTHISPAQHPIVVHCSAGV  366 (451)
Q Consensus       336 ~~~~ll~fi~~v~~~~~~~~~PivVHCs~Gv  366 (451)
                      +...+-.|.+.+.+.    .+||+.||..|.
T Consensus        72 T~~dV~~f~~Al~ea----egPVlayCrsGt   98 (130)
T COG3453          72 TEADVEAFQRALDEA----EGPVLAYCRSGT   98 (130)
T ss_pred             CHHHHHHHHHHHHHh----CCCEEeeecCCc
Confidence            455666666666554    789999998775


No 64 
>COG1470 Predicted membrane protein [Function unknown]
Probab=54.63  E-value=9.7  Score=39.73  Aligned_cols=52  Identities=17%  Similarity=0.275  Sum_probs=25.8

Q ss_pred             CCeeeEEEEEeeCCCCcccccCCCCcccCCCccchhhhHHHHHHHHhhheeeeee
Q psy15334         54 GTTYRVKVRAFTSPELFTDTIYSPPVSTGKDVSWLLVMGFLVLIVLPVLFIIIRR  108 (451)
Q Consensus        54 ~~~Y~i~vrA~t~~~~~t~s~~S~pi~t~~~~~~ii~gv~Liliil~vll~~~rr  108 (451)
                      ...|++.++|-+..  .+ +.-+.-+.......+-.+|+.+|++++++++|+.||
T Consensus       458 aGdY~i~i~~ksDq--~s-~e~tlrV~V~~sS~st~iGI~Ii~~~v~~L~fviRK  509 (513)
T COG1470         458 AGDYRITITAKSDQ--AS-SEDTLRVVVGQSSTSTYIGIAIIVLVVLGLIFVIRK  509 (513)
T ss_pred             CCcEEEEEEEeecc--cc-ccceEEEEEeccccchhhhHHHHHHHHHHHHhhhHH
Confidence            34577777765541  11 111111112223345566776666677777776664


No 65 
>KOG4221|consensus
Probab=47.37  E-value=31  Score=40.27  Aligned_cols=20  Identities=25%  Similarity=0.217  Sum_probs=17.4

Q ss_pred             cCCCCCCCeeeEEEEEeeCC
Q psy15334         48 NGPLKQGTTYRVKVRAFTSP   67 (451)
Q Consensus        48 NgpL~p~~~Y~i~vrA~t~~   67 (451)
                      -.+|.|++.|-|+++|-+..
T Consensus       986 v~~l~p~t~yffkiQAr~~k 1005 (1381)
T KOG4221|consen  986 VPNLDPDTGYFFKIQARNEK 1005 (1381)
T ss_pred             cCCCCCCCceEEEEEeeccC
Confidence            36899999999999998764


No 66 
>PF05568 ASFV_J13L:  African swine fever virus J13L protein;  InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=46.38  E-value=7  Score=34.11  Aligned_cols=23  Identities=17%  Similarity=0.408  Sum_probs=11.0

Q ss_pred             hhhhHHHHHHHHhhheeeeeecc
Q psy15334         88 LLVMGFLVLIVLPVLFIIIRRKI  110 (451)
Q Consensus        88 ii~gv~Liliil~vll~~~rrr~  110 (451)
                      |++++++++|++.++++|+-+|+
T Consensus        34 ILiaIvVliiiiivli~lcssRK   56 (189)
T PF05568_consen   34 ILIAIVVLIIIIIVLIYLCSSRK   56 (189)
T ss_pred             HHHHHHHHHHHHHHHHHHHhhhh
Confidence            44555544444445555544444


No 67 
>smart00060 FN3 Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition  sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Probab=44.79  E-value=24  Score=25.25  Aligned_cols=23  Identities=39%  Similarity=0.588  Sum_probs=20.0

Q ss_pred             CceEcCCCCCCCeeeEEEEEeeC
Q psy15334         44 AHYCNGPLKQGTTYRVKVRAFTS   66 (451)
Q Consensus        44 ~~~~NgpL~p~~~Y~i~vrA~t~   66 (451)
                      ..+.-..|.|++.|.+.|+|++.
T Consensus        57 ~~~~i~~L~~~~~Y~v~v~a~~~   79 (83)
T smart00060       57 TSYTLTGLKPGTEYEFRVRAVNG   79 (83)
T ss_pred             cEEEEeCcCCCCEEEEEEEEEcc
Confidence            45777889999999999999875


No 68 
>PF05454 DAG1:  Dystroglycan (Dystrophin-associated glycoprotein 1);  InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=43.66  E-value=7.7  Score=38.29  Aligned_cols=8  Identities=13%  Similarity=0.202  Sum_probs=0.0

Q ss_pred             eeeeeecc
Q psy15334        103 FIIIRRKI  110 (451)
Q Consensus       103 l~~~rrr~  110 (451)
                      +|++|||+
T Consensus       168 icyrrkR~  175 (290)
T PF05454_consen  168 ICYRRKRK  175 (290)
T ss_dssp             --------
T ss_pred             Hhhhhhhc
Confidence            34444333


No 69 
>PF02061 Lambda_CIII:  Lambda Phage CIII;  InterPro: IPR013056  Bacteriophage lambda regulatory protein CIII is a small protein that plays a role in stabilising the CII transcriptional activator, via a mechanism that is not yet fully understood [, ]. Stabilised CII activates CI, the gene for the repressor protein that prevents transcription of proteins required for lytic development. The central portion of the protein is well conserved and is both necessary and sufficient for the activity of the protein []. Comparative analysis of the CIII sequence in lambda, Bacteriophage HK022 and the lambdoid Enterobacteria phage P22 has led to the suggestion that this central region assumes an amphipathic alpha-helical structure []. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=42.10  E-value=34  Score=23.21  Aligned_cols=24  Identities=25%  Similarity=0.678  Sum_probs=20.2

Q ss_pred             EEEeecCCCCCCCCChhHHHHHHH
Q psy15334        322 HYHFTTWPDSGVPSSPNTLVSFVL  345 (451)
Q Consensus       322 h~~~~~Wpd~gvP~~~~~ll~fi~  345 (451)
                      +|++.+||-.|+|.-.++|++-+.
T Consensus         2 ~~~~AG~~~~G~~ql~ESLLdrIt   25 (45)
T PF02061_consen    2 QYAIAGWPRMGCPQLSESLLDRIT   25 (45)
T ss_pred             eeeecCccccCCchhhHHHHHHHH
Confidence            688999999999977788887653


No 70 
>cd01533 4RHOD_Repeat_2 Member of the Rhodanese Homology Domain superfamily, repeat 2. This CD includes putative rhodanese-related sulfurtransferases which contain 4 copies of the Rhodanese Homology Domain. This CD aligns the 2nd repeat which does contain the putative catalytic Cys residue.
Probab=41.37  E-value=44  Score=27.21  Aligned_cols=15  Identities=40%  Similarity=0.769  Sum_probs=12.0

Q ss_pred             CCCEEEEcCCCCchhh
Q psy15334        355 QHPIVVHCSAGVGRTG  370 (451)
Q Consensus       355 ~~PivVHCs~GvGRtG  370 (451)
                      ..||+|+|..|. ||.
T Consensus        66 ~~~ivv~C~~G~-rs~   80 (109)
T cd01533          66 RTPIVVNCAGRT-RSI   80 (109)
T ss_pred             CCeEEEECCCCc-hHH
Confidence            579999999886 663


No 71 
>cd01518 RHOD_YceA Member of the Rhodanese Homology Domain superfamily. This CD includes Escherichia coli YceA, Bacillus subtilis YbfQ, and similar uncharacterized proteins.
Probab=39.25  E-value=31  Score=27.65  Aligned_cols=18  Identities=17%  Similarity=0.366  Sum_probs=13.8

Q ss_pred             CCCCEEEEcCCCCchhhhH
Q psy15334        354 AQHPIVVHCSAGVGRTGTF  372 (451)
Q Consensus       354 ~~~PivVHCs~GvGRtG~f  372 (451)
                      ...||||+|..| .||...
T Consensus        60 ~~~~ivvyC~~G-~rs~~a   77 (101)
T cd01518          60 KGKKVLMYCTGG-IRCEKA   77 (101)
T ss_pred             CCCEEEEECCCc-hhHHHH
Confidence            357999999876 677654


No 72 
>cd01523 RHOD_Lact_B Member of the Rhodanese Homology Domain superfamily. This CD includes predicted proteins with rhodanese-like domains found N-terminal of the metallo-beta-lactamase domain.
Probab=38.07  E-value=31  Score=27.61  Aligned_cols=17  Identities=12%  Similarity=0.233  Sum_probs=12.9

Q ss_pred             CCCCEEEEcCCCCchhhh
Q psy15334        354 AQHPIVVHCSAGVGRTGT  371 (451)
Q Consensus       354 ~~~PivVHCs~GvGRtG~  371 (451)
                      ...||||+|..|. ||..
T Consensus        60 ~~~~ivv~C~~G~-rs~~   76 (100)
T cd01523          60 DDQEVTVICAKEG-SSQF   76 (100)
T ss_pred             CCCeEEEEcCCCC-cHHH
Confidence            3579999999884 6643


No 73 
>TIGR01167 LPXTG_anchor LPXTG-motif cell wall anchor domain. A common feature of this proteins containing this domain appears to be a high proportion of charged and zwitterionic residues immediatedly upstream of the LPXTG motif. This model differs from other descriptions of the LPXTG region by including a portion of that upstream charged region.
Probab=37.85  E-value=34  Score=21.67  Aligned_cols=8  Identities=25%  Similarity=0.173  Sum_probs=3.1

Q ss_pred             hhheeeee
Q psy15334        100 PVLFIIIR  107 (451)
Q Consensus       100 ~vll~~~r  107 (451)
                      +++++.+|
T Consensus        24 ~~~~~~~r   31 (34)
T TIGR01167        24 GGLLLRKR   31 (34)
T ss_pred             HHHHheec
Confidence            33344443


No 74 
>PF02009 Rifin_STEVOR:  Rifin/stevor family;  InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=37.76  E-value=9.4  Score=37.91  Aligned_cols=8  Identities=25%  Similarity=0.413  Sum_probs=3.5

Q ss_pred             eeeeeecc
Q psy15334        103 FIIIRRKI  110 (451)
Q Consensus       103 l~~~rrr~  110 (451)
                      ++||.||+
T Consensus       277 LILRYRRK  284 (299)
T PF02009_consen  277 LILRYRRK  284 (299)
T ss_pred             HHHHHHHH
Confidence            44444443


No 75 
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=37.67  E-value=20  Score=29.29  Aligned_cols=6  Identities=17%  Similarity=0.573  Sum_probs=2.6

Q ss_pred             eeeeee
Q psy15334        103 FIIIRR  108 (451)
Q Consensus       103 l~~~rr  108 (451)
                      +|++||
T Consensus        89 ~f~~r~   94 (96)
T PTZ00382         89 WFVCRG   94 (96)
T ss_pred             eeEEee
Confidence            444443


No 76 
>PF07204 Orthoreo_P10:  Orthoreovirus membrane fusion protein p10;  InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences. p10 is thought to be a multifunctional protein that plays a key role in virus-host interaction [].
Probab=36.72  E-value=14  Score=29.83  Aligned_cols=20  Identities=15%  Similarity=0.222  Sum_probs=9.4

Q ss_pred             hHHHHHHHHhhheeeeeecc
Q psy15334         91 MGFLVLIVLPVLFIIIRRKI  110 (451)
Q Consensus        91 gv~Liliil~vll~~~rrr~  110 (451)
                      |++|++|+++++++.+.|++
T Consensus        50 G~iLilIii~Lv~CC~~K~K   69 (98)
T PF07204_consen   50 GLILILIIIALVCCCRAKHK   69 (98)
T ss_pred             hhhhHHHHHHHHHHhhhhhh
Confidence            44455555544444444333


No 77 
>PF05337 CSF-1:  Macrophage colony stimulating factor-1 (CSF-1);  InterPro: IPR008001 Colony stimulating factor 1 (CSF-1) is a homodimeric polypeptide growth factor whose primary function is to regulate the survival, proliferation, differentiation, and function of cells of the mononuclear phagocytic lineage. This lineage includes mononuclear phagocytic precursors, blood monocytes, tissue macrophages, osteoclasts, and microglia of the brain, all of which possess cell surface receptors for CSF-1. The protein has also been linked with male fertility [] and mutations in the Csf-1 gene have been found to cause osteopetrosis and failure of tooth eruption [].; GO: 0005125 cytokine activity, 0008083 growth factor activity, 0016021 integral to membrane; PDB: 3EJJ_A.
Probab=35.78  E-value=12  Score=36.33  Aligned_cols=23  Identities=26%  Similarity=0.275  Sum_probs=0.0

Q ss_pred             hhhHHHHHHHHhhheeeeeeccc
Q psy15334         89 LVMGFLVLIVLPVLFIIIRRKIV  111 (451)
Q Consensus        89 i~gv~Liliil~vll~~~rrr~~  111 (451)
                      +.+|+|+|+.++.|+|.+|||..
T Consensus       232 VPSiILVLLaVGGLLfYr~rrRs  254 (285)
T PF05337_consen  232 VPSIILVLLAVGGLLFYRRRRRS  254 (285)
T ss_dssp             -----------------------
T ss_pred             ccchhhhhhhccceeeecccccc
Confidence            34455677777777666665543


No 78 
>PF06084 Cytomega_TRL10:  Cytomegalovirus TRL10 protein;  InterPro: IPR009284 This family consists of several Cytomegalovirus TRL10 proteins. TRL10 represents a structural component of the virus particle and like the other HCMV envelope glycoproteins, is present in a disulphide-linked complex [].
Probab=34.66  E-value=29  Score=29.09  Aligned_cols=33  Identities=33%  Similarity=0.398  Sum_probs=18.4

Q ss_pred             CCCCcccCCCccchhhhHH-HHHHHHhhheeeee
Q psy15334         75 YSPPVSTGKDVSWLLVMGF-LVLIVLPVLFIIIR  107 (451)
Q Consensus        75 ~S~pi~t~~~~~~ii~gv~-Liliil~vll~~~r  107 (451)
                      +.+||+.-..-.+|-+|.+ ..||+++++||+.-
T Consensus        47 ~tepis~lg~ysawgagsfiatliillviffviy   80 (150)
T PF06084_consen   47 YTEPISMLGIYSAWGAGSFIATLIILLVIFFVIY   80 (150)
T ss_pred             ccCcchhhhhhhhcccchHHHHHHHHHHHhheeE
Confidence            5567766555556667765 34555555544443


No 79 
>COG3889 Predicted solute binding protein [General function prediction only]
Probab=33.02  E-value=39  Score=37.55  Aligned_cols=15  Identities=27%  Similarity=0.698  Sum_probs=7.2

Q ss_pred             HHHHHhhh-eeeeeec
Q psy15334         95 VLIVLPVL-FIIIRRK  109 (451)
Q Consensus        95 iliil~vl-l~~~rrr  109 (451)
                      ||++|++| ++++|||
T Consensus       856 iiv~LaAla~lLrRRr  871 (872)
T COG3889         856 IIVGLAALALLLRRRR  871 (872)
T ss_pred             HHHHHHHHHHHHHhhc
Confidence            33444443 5565554


No 80 
>PF11770 GAPT:  GRB2-binding adapter (GAPT);  InterPro: IPR021082  This entry represents a family of transmembrane proteins which bind the growth factor receptor-bound protein 2 (GRB2) in B cells []. In contrast to other transmembrane adaptor proteins, GAPT, which this entry represents, is not phosphorylated upon BCR ligation. It associates with GRB2 constitutively through its proline-rich region []. 
Probab=32.55  E-value=40  Score=29.80  Aligned_cols=25  Identities=8%  Similarity=-0.065  Sum_probs=12.8

Q ss_pred             CCccchhhhHH--HHHHHHhhheeeee
Q psy15334         83 KDVSWLLVMGF--LVLIVLPVLFIIIR  107 (451)
Q Consensus        83 ~~~~~ii~gv~--Liliil~vll~~~r  107 (451)
                      ...+++.+|++  |+|++.|+-++|.+
T Consensus         7 n~sv~i~igi~Ll~lLl~cgiGcvwhw   33 (158)
T PF11770_consen    7 NTSVAISIGISLLLLLLLCGIGCVWHW   33 (158)
T ss_pred             CchHHHHHHHHHHHHHHHHhcceEEEe
Confidence            34456777776  33333444455543


No 81 
>PF11353 DUF3153:  Protein of unknown function (DUF3153);  InterPro: IPR021499  This family of proteins with unknown function appear to be restricted to Cyanobacteria. Some members are annotated as membrane proteins however this cannot be confirmed. 
Probab=32.06  E-value=42  Score=31.42  Aligned_cols=15  Identities=13%  Similarity=0.313  Sum_probs=10.2

Q ss_pred             CCCCCCeeeEEEEEe
Q psy15334         50 PLKQGTTYRVKVRAF   64 (451)
Q Consensus        50 pL~p~~~Y~i~vrA~   64 (451)
                      +|+||..-++.+.+.
T Consensus       166 ~L~pGe~N~L~~~~w  180 (209)
T PF11353_consen  166 KLQPGEINHLEASFW  180 (209)
T ss_pred             ecCCCceeEEEEEEE
Confidence            677777777766653


No 82 
>cd01519 RHOD_HSP67B2 Member of the Rhodanese Homology Domain superfamily. This CD includes the heat shock protein 67B2 of Drosophila melanogaster and other similar proteins, many of which are uncharacterized.
Probab=31.85  E-value=59  Score=26.04  Aligned_cols=16  Identities=31%  Similarity=0.698  Sum_probs=12.0

Q ss_pred             CCCCEEEEcCCCCchhh
Q psy15334        354 AQHPIVVHCSAGVGRTG  370 (451)
Q Consensus       354 ~~~PivVHCs~GvGRtG  370 (451)
                      ...||||+|..|. |+.
T Consensus        65 ~~~~ivv~c~~g~-~s~   80 (106)
T cd01519          65 KDKELIFYCKAGV-RSK   80 (106)
T ss_pred             CCCeEEEECCCcH-HHH
Confidence            3679999999875 553


No 83 
>TIGR03000 plancto_dom_1 Planctomycetes uncharacterized domain TIGR03000. Domains described by this model are found, so far, only in the Planctomycetes (Pirellula sp. strain 1 and Gemmata obscuriglobus), in up to six proteins per genome, and may be duplicated within a protein. The function is unknown.
Probab=31.69  E-value=65  Score=25.08  Aligned_cols=28  Identities=21%  Similarity=0.303  Sum_probs=23.7

Q ss_pred             CCCceEcCCCCCCCeeeEEEEEeeCCCC
Q psy15334         42 NEAHYCNGPLKQGTTYRVKVRAFTSPEL   69 (451)
Q Consensus        42 ~~~~~~NgpL~p~~~Y~i~vrA~t~~~~   69 (451)
                      ....|.-.+|++|..|++.|+|-...++
T Consensus        27 ~~R~F~T~~L~~G~~y~Y~v~a~~~~dG   54 (75)
T TIGR03000        27 TVRTFTTPPLEAGKEYEYTVTAEYDRDG   54 (75)
T ss_pred             cEEEEECCCCCCCCEEEEEEEEEEecCC
Confidence            4567999999999999999999776544


No 84 
>PF08374 Protocadherin:  Protocadherin;  InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated []. 
Probab=31.12  E-value=14  Score=34.64  Aligned_cols=11  Identities=36%  Similarity=0.307  Sum_probs=5.1

Q ss_pred             EEEecCCCCCC
Q psy15334        235 YIATQGPLCST  245 (451)
Q Consensus       235 yI~tQ~Pl~~T  245 (451)
                      -|+---|+.+|
T Consensus       181 ~~iQeLP~~NT  191 (221)
T PF08374_consen  181 HIIQELPLDNT  191 (221)
T ss_pred             hhhhhcCCcce
Confidence            34444455554


No 85 
>cd00063 FN3 Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Probab=30.75  E-value=59  Score=23.92  Aligned_cols=23  Identities=39%  Similarity=0.588  Sum_probs=18.6

Q ss_pred             CceEcCCCCCCCeeeEEEEEeeC
Q psy15334         44 AHYCNGPLKQGTTYRVKVRAFTS   66 (451)
Q Consensus        44 ~~~~NgpL~p~~~Y~i~vrA~t~   66 (451)
                      ..+--..|.|++.|.+.|+|+..
T Consensus        57 ~~~~i~~l~p~~~Y~~~v~a~~~   79 (93)
T cd00063          57 TSYTLTGLKPGTEYEFRVRAVNG   79 (93)
T ss_pred             cEEEEccccCCCEEEEEEEEECC
Confidence            34555569999999999999875


No 86 
>PRK14750 kdpF potassium-transporting ATPase subunit F; Provisional
Probab=30.24  E-value=34  Score=21.34  Aligned_cols=18  Identities=28%  Similarity=0.351  Sum_probs=10.8

Q ss_pred             cchhhhHHHHHHHHhhhe
Q psy15334         86 SWLLVMGFLVLIVLPVLF  103 (451)
Q Consensus        86 ~~ii~gv~Liliil~vll  103 (451)
                      ..+|.|++|++++++-++
T Consensus         3 ~~vi~g~llv~lLl~YLv   20 (29)
T PRK14750          3 FSIVCGALLVLLLLGYLV   20 (29)
T ss_pred             HHHHHHHHHHHHHHHHHH
Confidence            356777776666655443


No 87 
>PF06365 CD34_antigen:  CD34/Podocalyxin family;  InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=30.01  E-value=26  Score=32.66  Aligned_cols=14  Identities=14%  Similarity=0.090  Sum_probs=6.9

Q ss_pred             HhhheeeeeecccC
Q psy15334         99 LPVLFIIIRRKIVG  112 (451)
Q Consensus        99 l~vll~~~rrr~~~  112 (451)
                      +++.+|++.||..+
T Consensus       118 ~~~~Y~~~~Rrs~~  131 (202)
T PF06365_consen  118 LGAGYCCHQRRSWS  131 (202)
T ss_pred             HHHHHHhhhhccCC
Confidence            33345555555543


No 88 
>cd01448 TST_Repeat_1 Thiosulfate sulfurtransferase (TST), N-terminal, inactive domain. TST contains 2 copies of the Rhodanese Homology Domain; this is the 1st repeat, which does not contain the catalytically active Cys residue. The role of the 1st repeat is uncertain, but it is believed to be involved in protein interaction.
Probab=29.33  E-value=89  Score=25.82  Aligned_cols=33  Identities=21%  Similarity=0.233  Sum_probs=19.7

Q ss_pred             ChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhH
Q psy15334        336 SPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTF  372 (451)
Q Consensus       336 ~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~f  372 (451)
                      +++.+.+++...   -.....||||+|..| |+....
T Consensus        63 ~~~~~~~~~~~~---~~~~~~~vv~~c~~g-~~~a~~   95 (122)
T cd01448          63 SPEEFAELLGSL---GISNDDTVVVYDDGG-GFFAAR   95 (122)
T ss_pred             CHHHHHHHHHHc---CCCCCCEEEEECCCC-CccHHH
Confidence            455666665432   123367999999998 444433


No 89 
>COG5495 Uncharacterized conserved protein [Function unknown]
Probab=29.22  E-value=56  Score=31.30  Aligned_cols=18  Identities=39%  Similarity=0.700  Sum_probs=11.8

Q ss_pred             CCCCCEEEEcCCCCchhhh
Q psy15334        353 PAQHPIVVHCSAGVGRTGT  371 (451)
Q Consensus       353 ~~~~PivVHCs~GvGRtG~  371 (451)
                      ...+.||+||| |.+-+|.
T Consensus        89 ~rpg~iv~HcS-ga~~~~i  106 (289)
T COG5495          89 NRPGTIVAHCS-GANGSGI  106 (289)
T ss_pred             cCCCeEEEEcc-CCCchhh
Confidence            34678999999 4444443


No 90 
>PLN02160 thiosulfate sulfurtransferase
Probab=28.50  E-value=51  Score=28.47  Aligned_cols=18  Identities=28%  Similarity=0.503  Sum_probs=13.6

Q ss_pred             CCCCEEEEcCCCCchhhhH
Q psy15334        354 AQHPIVVHCSAGVGRTGTF  372 (451)
Q Consensus       354 ~~~PivVHCs~GvGRtG~f  372 (451)
                      ...||+|||..| +||..-
T Consensus        80 ~~~~IivyC~sG-~RS~~A   97 (136)
T PLN02160         80 PADDILVGCQSG-ARSLKA   97 (136)
T ss_pred             CCCcEEEECCCc-HHHHHH
Confidence            357999999987 677643


No 91 
>PRK11493 sseA 3-mercaptopyruvate sulfurtransferase; Provisional
Probab=25.77  E-value=61  Score=31.65  Aligned_cols=49  Identities=16%  Similarity=0.308  Sum_probs=24.4

Q ss_pred             EEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHH
Q psy15334        322 HYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFI  373 (451)
Q Consensus       322 h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fi  373 (451)
                      |+.+..+.+.+...++..|..++..   .-.....|||++|..|.--+.++.
T Consensus       201 ~i~~~~~~~~~~~~~~~~l~~~~~~---~g~~~~~~ii~yC~~G~~A~~~~~  249 (281)
T PRK11493        201 NVPWTELVREGELKTTDELDAIFFG---RGVSFDRPIIASCGSGVTAAVVVL  249 (281)
T ss_pred             CCCHHHhcCCCCcCCHHHHHHHHHh---cCCCCCCCEEEECCcHHHHHHHHH
Confidence            3444444443333344444444322   111235699999988774444333


No 92 
>TIGR03865 PQQ_CXXCW PQQ-dependent catabolism-associated CXXCW motif protein. Members of this protein family have a CXXXCW motif, consistent with a possible role in redox cofactor binding. This protein family shows strong relationships by phylogenetic profiling and conserved gene neighborhoods with a transport system for alcohols metabolized by PQQ-dependent enzymes.
Probab=25.36  E-value=1.1e+02  Score=27.25  Aligned_cols=18  Identities=22%  Similarity=0.316  Sum_probs=14.9

Q ss_pred             CCCCEEEEcCCCCchhhh
Q psy15334        354 AQHPIVVHCSAGVGRTGT  371 (451)
Q Consensus       354 ~~~PivVHCs~GvGRtG~  371 (451)
                      ...||||.|..|..|+..
T Consensus       115 ~d~~IVvYC~~G~~~S~~  132 (162)
T TIGR03865       115 KDRPLVFYCLADCWMSWN  132 (162)
T ss_pred             CCCEEEEEECCCCHHHHH
Confidence            367999999998878875


No 93 
>PF14610 DUF4448:  Protein of unknown function (DUF4448)
Probab=25.24  E-value=40  Score=30.96  Aligned_cols=13  Identities=15%  Similarity=0.235  Sum_probs=5.1

Q ss_pred             hhHHHHHHHHhhh
Q psy15334         90 VMGFLVLIVLPVL  102 (451)
Q Consensus        90 ~gv~Liliil~vl  102 (451)
                      +++++++++++++
T Consensus       164 lPvvv~~~~~~~~  176 (189)
T PF14610_consen  164 LPVVVVVLALIMY  176 (189)
T ss_pred             ccHHHHHHHHHHH
Confidence            3344443333333


No 94 
>PF10727 Rossmann-like:  Rossmann-like domain;  InterPro: IPR019665 This entry represents an NAD/NADP-binding domain with a core Rossmann-type fold, found in an uncharacterised protein family thought to be putative NADP oxidoreductase coenzyme F420-dependent proteins and/or NAD-dependent glycerol-3-phosphate dehydrogenase-like proteins. This Rossmann-fold domain consists of 3-layers alpha/beta/alpha, where the six beta strands are parallel in the order 321456.; PDB: 3DFU_A 3C24_A.
Probab=24.74  E-value=81  Score=27.10  Aligned_cols=29  Identities=21%  Similarity=0.204  Sum_probs=13.3

Q ss_pred             HHHHHHHHHHHhcCCCCCCEEEEcCCCCc
Q psy15334        339 TLVSFVLSFRTHISPAQHPIVVHCSAGVG  367 (451)
Q Consensus       339 ~ll~fi~~v~~~~~~~~~PivVHCs~GvG  367 (451)
                      .+-.....+...-....+.+|||||...+
T Consensus        80 aI~~va~~La~~~~~~~g~iVvHtSGa~~  108 (127)
T PF10727_consen   80 AIAEVAEQLAQYGAWRPGQIVVHTSGALG  108 (127)
T ss_dssp             HHHHHHHHHHCC--S-TT-EEEES-SS--
T ss_pred             HHHHHHHHHHHhccCCCCcEEEECCCCCh
Confidence            34444555544311235789999996554


No 95 
>PF03684 UPF0179:  Uncharacterised protein family (UPF0179);  InterPro: IPR005369 The function of this family is unknown, however the proteins contain two cysteine clusters that may be iron sulphur redox centres.
Probab=24.66  E-value=44  Score=29.37  Aligned_cols=29  Identities=38%  Similarity=0.733  Sum_probs=22.0

Q ss_pred             eCC-CCCC--CCCceEcCCCCCCCeeeEE-EEE
Q psy15334         35 IGR-ETCP--NEAHYCNGPLKQGTTYRVK-VRA   63 (451)
Q Consensus        35 iG~-~~c~--~~~~~~NgpL~p~~~Y~i~-vrA   63 (451)
                      .|. +.|.  .....|.|.|++|..|+|. ||.
T Consensus        19 ~G~~~eCe~Crlk~~C~~nLe~Gr~YrI~~VR~   51 (142)
T PF03684_consen   19 YGPAEECEGCRLKNVCLGNLEPGRRYRIVEVRN   51 (142)
T ss_pred             ecCccccCCCCChHHhcccCCCCceEEEEEEcC
Confidence            454 4576  5667899999999999965 664


No 96 
>KOG1530|consensus
Probab=24.59  E-value=64  Score=27.97  Aligned_cols=29  Identities=24%  Similarity=0.496  Sum_probs=22.0

Q ss_pred             HHHHHHHHHhcCCCCCCEEEEcCCCCchhh
Q psy15334        341 VSFVLSFRTHISPAQHPIVVHCSAGVGRTG  370 (451)
Q Consensus       341 l~fi~~v~~~~~~~~~PivVHCs~GvGRtG  370 (451)
                      .+|++.+-....+....||++|..|. |+.
T Consensus        75 ~eF~kqvg~~kp~~d~eiIf~C~SG~-Rs~  103 (136)
T KOG1530|consen   75 PEFLKQVGSSKPPHDKEIIFGCASGV-RSL  103 (136)
T ss_pred             HHHHHHhcccCCCCCCcEEEEeccCc-chh
Confidence            35778777666666779999999986 554


No 97 
>KOG0208|consensus
Probab=24.27  E-value=1.6e+02  Score=34.14  Aligned_cols=67  Identities=22%  Similarity=0.304  Sum_probs=52.0

Q ss_pred             CCEEEEcCCCCchhhhHHHHHHHHHHHHccc------CCCCCCHHHHHHHHHhhccCCCCCHH--HHHHHHHHHH
Q psy15334        356 HPIVVHCSAGVGRTGTFIALDTVLQRLYNSE------GAGSVDILRIVHGLRRQRVLMVQSLA--QYAFIYECVQ  422 (451)
Q Consensus       356 ~PivVHCs~GvGRtG~fiai~~~l~~l~~~~------~~~~vdi~~~v~~lR~qR~~~Vqt~~--QY~fiy~~l~  422 (451)
                      +-.|-.|.+|+.-+|++-|+|..+..-+.|.      ..+..+|..+..-+|..|..+|.+..  ||.-+|.++.
T Consensus       852 ~y~VgfCGDGANDCgALKaAdvGISLSeaEASvAApFTSk~~~I~cVp~vIrEGRaALVTSf~~FkYMalYs~iq  926 (1140)
T KOG0208|consen  852 GYKVGFCGDGANDCGALKAADVGISLSEAEASVAAPFTSKTPSISCVPDVIREGRAALVTSFACFKYMALYSAIQ  926 (1140)
T ss_pred             CcEEEecCCCcchhhhhhhcccCcchhhhhHhhcCccccCCCchhhHhHHHhhhhhhhhhhHHHHHHHHHHHHHH
Confidence            5688999999999999999999877555442      12456888888889999999999864  6666665543


No 98 
>PRK10287 thiosulfate:cyanide sulfurtransferase; Provisional
Probab=24.20  E-value=1.2e+02  Score=25.01  Aligned_cols=17  Identities=24%  Similarity=0.587  Sum_probs=12.5

Q ss_pred             CCCEEEEcCCCCchhhhH
Q psy15334        355 QHPIVVHCSAGVGRTGTF  372 (451)
Q Consensus       355 ~~PivVHCs~GvGRtG~f  372 (451)
                      ..||+++|.+| .||...
T Consensus        60 ~~~IVlyC~~G-~rS~~a   76 (104)
T PRK10287         60 NDTVKLYCNAG-RQSGQA   76 (104)
T ss_pred             CCeEEEEeCCC-hHHHHH
Confidence            56899999887 555544


No 99 
>PF10577 UPF0560:  Uncharacterised protein family UPF0560;  InterPro: IPR018890  This family of proteins has no known function. 
Probab=24.20  E-value=37  Score=37.96  Aligned_cols=18  Identities=50%  Similarity=0.680  Sum_probs=10.9

Q ss_pred             HHHHHHhhheeeeeeccc
Q psy15334         94 LVLIVLPVLFIIIRRKIV  111 (451)
Q Consensus        94 Liliil~vll~~~rrr~~  111 (451)
                      |+|+++.+|+|.||||..
T Consensus       285 ivl~lL~vLl~yCrrkc~  302 (807)
T PF10577_consen  285 IVLILLCVLLCYCRRKCL  302 (807)
T ss_pred             HHHHHHHHHHHhhhcccC
Confidence            444555667777776554


No 100
>cd01520 RHOD_YbbB Member of the Rhodanese Homology Domain superfamily. This CD includes several putative ATP /GTP binding proteins including E. coli YbbB.
Probab=23.24  E-value=1.7e+02  Score=24.68  Aligned_cols=20  Identities=15%  Similarity=0.398  Sum_probs=14.6

Q ss_pred             CCCCEEEEcCCCCchhhhHH
Q psy15334        354 AQHPIVVHCSAGVGRTGTFI  373 (451)
Q Consensus       354 ~~~PivVHCs~GvGRtG~fi  373 (451)
                      ...||||.|..|..||...+
T Consensus        85 ~~~~vvvyC~~~G~rs~~a~  104 (128)
T cd01520          85 RDPKLLIYCARGGMRSQSLA  104 (128)
T ss_pred             CCCeEEEEeCCCCccHHHHH
Confidence            46799999986666776544


No 101
>TIGR03024 arch_pef_cterm PEF-C-terminal archaeal protein sorting domain. This domain, distantly related to the PEP-Cterm domain described in model TIGR02595, is found in Methanosarcina mazei in four different proteins, as well as in other archaea such as Methanococcoides burtonii. Several proteins with this domain have their genes only a short distance from a distant homology of EpsH, a proposed integral membrane transpeptidase.
Probab=22.98  E-value=36  Score=20.85  Aligned_cols=9  Identities=33%  Similarity=0.718  Sum_probs=3.4

Q ss_pred             Hhhheeeee
Q psy15334         99 LPVLFIIIR  107 (451)
Q Consensus        99 l~vll~~~r  107 (451)
                      +++++++.|
T Consensus        15 l~i~~i~~r   23 (26)
T TIGR03024        15 LAIIVILRR   23 (26)
T ss_pred             HHHHHHHhh
Confidence            333333333


No 102
>PTZ00208 65 kDa invariant surface glycoprotein; Provisional
Probab=22.98  E-value=58  Score=33.46  Aligned_cols=11  Identities=18%  Similarity=0.452  Sum_probs=6.2

Q ss_pred             hheeeeeeccc
Q psy15334        101 VLFIIIRRKIV  111 (451)
Q Consensus       101 vll~~~rrr~~  111 (451)
                      +++|++|||..
T Consensus       406 ~~~~v~rrr~~  416 (436)
T PTZ00208        406 FFIMVKRRRNS  416 (436)
T ss_pred             hheeeeeccCC
Confidence            44556666553


No 103
>PF12768 Rax2:  Cortical protein marker for cell polarity
Probab=22.95  E-value=52  Score=32.45  Aligned_cols=20  Identities=20%  Similarity=0.562  Sum_probs=10.2

Q ss_pred             hhHHHHHHHHhhheeeeeec
Q psy15334         90 VMGFLVLIVLPVLFIIIRRK  109 (451)
Q Consensus        90 ~gv~Liliil~vll~~~rrr  109 (451)
                      +|++++|+++++++.+.|||
T Consensus       238 LG~v~ll~l~Gii~~~~~r~  257 (281)
T PF12768_consen  238 LGTVFLLVLIGIILAYIRRR  257 (281)
T ss_pred             HHHHHHHHHHHHHHHHHHhh
Confidence            34445556666655444443


No 104
>PF02035 Coagulin:  Coagulin;  InterPro: IPR000275 Coagulogen is a gel-forming protein of hemolymph that hinders the spread of invaders by immobilising them [, ]. The protein contains a single 175- residue polypeptide chain; this is cleaved after Arg-18 and Arg-46 by a clotting enzyme contained in the hemocyte and activated by a bacterial endotoxin (lipopolysaccharide). Cleavage releases two chains of coagulin, A and B, linked by two disulphide bonds, together with the peptide C [, ]. Gel formation results from interlinking of coagulin molecules. Secondary structure prediction suggests the C peptide forms an alpha- helix, which is released during the proteolytic conversion of coagulogen to coagulin gel []. The beta-sheet structure and 16 half-cystines found in the molecule appear to yield a compact protein stable to acid and heat. Mammalian blood coagulation is based on the proteolytically induced polymerisation of fibrinogens. Initially, fibrin monomers noncovalently interact with each other. The resulting homopolymers are further stabilised when the plasma transglutaminase (TGase) intermolecularly cross-links epsilon-(gamma-glutamyl)lysine bonds. In crustaceans, hemolymph coagulation depends on the TGase-mediated cross-linking of specific plasma-clotting proteins, but without the proteolytic cascade. In horseshoe crabs, the proteolytic coagulation cascade triggered by lipopolysaccharides and beta-1,3-glucans leads to the conversion of coagulogen into coagulin, resulting in noncovalent coagulin homopolymers through head-to-tail interaction. Horseshoe crab TGase, however, does not cross-link coagulins intermolecularly. Recently, we found that coagulins are cross-linked on hemocyte cell surface proteins called proxins. This indicates that a cross-linking reaction at the final stage of hemolymph coagulation is an important innate immune system of horseshoe crabs [].; GO: 0042381 hemolymph coagulation, 0005576 extracellular region; PDB: 1AOC_A.
Probab=22.13  E-value=30  Score=29.62  Aligned_cols=28  Identities=21%  Similarity=0.628  Sum_probs=15.7

Q ss_pred             CCCCCccE-EEecCCCCCCHHHHHhHhhhccCC
Q psy15334        228 GYLSPRQY-IATQGPLCSTVDDFWRMCWESRTP  259 (451)
Q Consensus       228 g~~~~~~y-I~tQ~Pl~~T~~dFWrMVwe~~v~  259 (451)
                      ||....+| |+.|+|..    -|-+.|||++++
T Consensus        98 gyt~a~efrvivqapra----gfrqcvwqhkcr  126 (174)
T PF02035_consen   98 GYTVAGEFRVIVQAPRA----GFRQCVWQHKCR  126 (174)
T ss_dssp             EE-TTS-EEEE--BCCC----TB-B---EEEET
T ss_pred             cceecceEEEEEeCchh----hHHHHHHHhhhc
Confidence            66666676 88999986    688999999875


No 105
>TIGR02981 phageshock_pspE phage shock operon rhodanese PspE. Members of this very narrowly defined protein family are proteins active as rhodanese (EC 2.8.1.1) and found in the extended variants of the phage shock protein (psp operon) in Escherichia coli and a few closely related species. Note that the designation phage shock protein PspE has been applied, incorrectly, in many instances where the genome lacks the phage shock regulon entirely.
Probab=21.23  E-value=1.5e+02  Score=24.14  Aligned_cols=18  Identities=22%  Similarity=0.517  Sum_probs=13.2

Q ss_pred             CCCCEEEEcCCCCchhhhH
Q psy15334        354 AQHPIVVHCSAGVGRTGTF  372 (451)
Q Consensus       354 ~~~PivVHCs~GvGRtG~f  372 (451)
                      ...||+|+|..|. ||...
T Consensus        57 ~~~~vvlyC~~G~-rS~~a   74 (101)
T TIGR02981        57 KNDTVKLYCNAGR-QSGMA   74 (101)
T ss_pred             CCCeEEEEeCCCH-HHHHH
Confidence            3568999999884 66654


No 106
>PF13908 Shisa:  Wnt and FGF inhibitory regulator
Probab=21.04  E-value=88  Score=28.36  Aligned_cols=8  Identities=13%  Similarity=0.031  Sum_probs=4.3

Q ss_pred             cchhhhHH
Q psy15334         86 SWLLVMGF   93 (451)
Q Consensus        86 ~~ii~gv~   93 (451)
                      .++|+||+
T Consensus        78 ~~iivgvi   85 (179)
T PF13908_consen   78 TGIIVGVI   85 (179)
T ss_pred             eeeeeehh
Confidence            45555554


No 107
>TIGR03154 sulfolob_CbsA cytochrome b558/566, subunit A. Members of this protein family are CbsA, one subunit of a highly glycosylated, heterodimeric, mono-heme cytochrome b558/566, found in Sulfolobus acidocaldarius and several other members of the Sulfolobales, a branch of the Crenarchaeota.
Probab=20.50  E-value=1.3e+02  Score=30.32  Aligned_cols=22  Identities=18%  Similarity=0.534  Sum_probs=13.8

Q ss_pred             chhhhHHHHHHHHhhheeeeee
Q psy15334         87 WLLVMGFLVLIVLPVLFIIIRR  108 (451)
Q Consensus        87 ~ii~gv~Liliil~vll~~~rr  108 (451)
                      +-|+|+++.++++++++.++||
T Consensus       444 wTVaGVvIAiVALV~l~~V~rr  465 (465)
T TIGR03154       444 VTIIGVVIAIVALVILYVVFRR  465 (465)
T ss_pred             EEeehhHHHHHHHhheeEEecC
Confidence            4467776666666666666554


No 108
>PF11344 DUF3146:  Protein of unknown function (DUF3146);  InterPro: IPR021492  This family of proteins with unknown function appear to be restricted to Cyanobacteria. 
Probab=20.12  E-value=71  Score=24.87  Aligned_cols=15  Identities=33%  Similarity=0.541  Sum_probs=13.5

Q ss_pred             CCCCCCCeeeEEEEE
Q psy15334         49 GPLKQGTTYRVKVRA   63 (451)
Q Consensus        49 gpL~p~~~Y~i~vrA   63 (451)
                      =.|+||..|.|.|||
T Consensus        65 Y~LEpGgdY~Ftira   79 (80)
T PF11344_consen   65 YQLEPGGDYSFTIRA   79 (80)
T ss_pred             eeccCCCceEEEEec
Confidence            369999999999997


Done!