Query psy15334
Match_columns 451
No_of_seqs 355 out of 1855
Neff 7.7
Searched_HMMs 46136
Date Fri Aug 16 18:17:39 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15334.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15334hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4228|consensus 100.0 3.3E-80 7.2E-85 663.5 20.9 280 150-436 522-809 (1087)
2 PHA02742 protein tyrosine phos 100.0 1.7E-76 3.6E-81 585.7 30.5 276 148-428 6-299 (303)
3 PHA02740 protein tyrosine phos 100.0 5.9E-76 1.3E-80 578.4 29.4 269 149-428 7-291 (298)
4 KOG0791|consensus 100.0 1.7E-76 3.6E-81 571.8 23.2 292 148-442 75-373 (374)
5 PHA02747 protein tyrosine phos 100.0 1.7E-74 3.7E-79 573.2 30.1 277 148-428 6-302 (312)
6 PHA02746 protein tyrosine phos 100.0 3.1E-74 6.7E-79 573.6 30.8 276 148-427 6-316 (323)
7 PHA02738 hypothetical protein; 100.0 4.3E-73 9.4E-78 564.6 28.4 277 146-429 4-298 (320)
8 KOG0790|consensus 100.0 1E-73 2.2E-78 558.1 13.0 278 166-443 246-540 (600)
9 KOG0792|consensus 100.0 1.6E-68 3.4E-73 567.0 25.7 266 163-432 867-1137(1144)
10 smart00194 PTPc Protein tyrosi 100.0 1.1E-67 2.3E-72 513.4 28.9 253 166-422 2-257 (258)
11 cd00047 PTPc Protein tyrosine 100.0 4E-61 8.7E-66 459.6 27.3 227 192-422 1-230 (231)
12 KOG0793|consensus 100.0 2.4E-62 5.3E-67 498.0 15.7 259 162-423 730-993 (1004)
13 PF00102 Y_phosphatase: Protei 100.0 6.9E-59 1.5E-63 443.3 24.3 229 190-423 1-235 (235)
14 COG5599 PTP2 Protein tyrosine 100.0 2.3E-59 5E-64 433.3 16.8 238 182-426 45-295 (302)
15 KOG4228|consensus 100.0 5.3E-59 1.2E-63 500.2 18.3 277 143-426 796-1086(1087)
16 PRK15375 pathogenicity island 100.0 7.4E-53 1.6E-57 425.3 24.2 215 195-426 303-531 (535)
17 KOG0789|consensus 100.0 4.5E-51 9.7E-56 422.4 24.3 243 184-431 121-373 (415)
18 smart00012 PTPc_DSPc Protein t 99.9 1.9E-23 4E-28 173.7 11.9 102 319-422 1-104 (105)
19 smart00404 PTPc_motif Protein 99.9 1.9E-23 4E-28 173.7 11.9 102 319-422 1-104 (105)
20 PTZ00242 protein tyrosine phos 99.7 5.2E-16 1.1E-20 140.5 11.7 135 233-421 16-155 (166)
21 PTZ00393 protein tyrosine phos 99.5 3.8E-13 8.3E-18 126.4 11.1 135 233-421 92-226 (241)
22 KOG1720|consensus 99.3 2.1E-11 4.5E-16 111.0 11.9 97 321-429 117-213 (225)
23 KOG2836|consensus 99.2 5.3E-10 1.2E-14 94.6 12.1 131 233-417 17-149 (173)
24 cd00127 DSPc Dual specificity 99.1 7.6E-10 1.6E-14 96.8 10.2 88 319-418 49-136 (139)
25 COG2453 CDC14 Predicted protei 98.9 1.2E-08 2.6E-13 93.9 12.5 90 327-425 79-168 (180)
26 smart00195 DSPc Dual specifici 98.7 1.6E-07 3.4E-12 82.2 11.6 57 354-418 77-133 (138)
27 PF00782 DSPc: Dual specificit 98.6 2E-07 4.4E-12 80.9 8.3 70 338-419 60-129 (133)
28 PRK12361 hypothetical protein; 98.0 3.2E-05 7E-10 83.3 11.5 82 329-419 151-232 (547)
29 PF05706 CDKN3: Cyclin-depende 98.0 1.4E-05 3E-10 71.5 5.9 89 252-376 66-154 (168)
30 PF03162 Y_phosphatase2: Tyros 97.6 9.3E-05 2E-09 66.9 5.6 72 320-403 59-130 (164)
31 PF14566 PTPlike_phytase: Inos 97.3 0.00045 9.8E-09 61.5 5.6 49 328-379 100-148 (149)
32 KOG2283|consensus 97.0 0.0012 2.6E-08 68.4 6.0 89 326-419 78-169 (434)
33 KOG1718|consensus 96.5 0.0057 1.2E-07 54.6 5.9 56 339-406 82-137 (198)
34 PF13350 Y_phosphatase3: Tyros 96.5 0.004 8.6E-08 56.2 5.1 39 234-273 20-58 (164)
35 TIGR01244 conserved hypothetic 96.4 0.0082 1.8E-07 52.4 6.3 56 336-405 71-126 (135)
36 COG5350 Predicted protein tyro 96.4 0.024 5.2E-07 49.8 8.9 82 322-412 61-143 (172)
37 KOG1716|consensus 96.3 0.013 2.7E-07 57.9 7.8 61 338-410 142-202 (285)
38 KOG1719|consensus 96.0 0.09 1.9E-06 46.4 10.6 55 355-417 109-163 (183)
39 PF04273 DUF442: Putative phos 95.3 0.034 7.5E-07 46.8 5.2 38 336-378 71-108 (110)
40 COG2365 Protein tyrosine/serin 95.1 0.038 8.3E-07 53.4 5.6 27 355-381 136-162 (249)
41 PLN02727 NAD kinase 94.7 0.071 1.5E-06 59.5 6.9 42 334-378 323-364 (986)
42 KOG0196|consensus 93.8 0.14 3E-06 56.2 6.7 21 47-67 501-521 (996)
43 KOG4471|consensus 93.8 0.086 1.9E-06 55.6 5.1 33 347-379 366-398 (717)
44 KOG1717|consensus 93.6 0.18 3.9E-06 48.4 6.3 68 339-421 239-306 (343)
45 PF07353 Uroplakin_II: Uroplak 90.9 0.29 6.2E-06 43.3 3.8 25 42-66 100-124 (184)
46 KOG1572|consensus 88.4 1.3 2.8E-05 42.0 6.4 28 351-378 144-171 (249)
47 PF01102 Glycophorin_A: Glycop 87.8 0.17 3.6E-06 43.3 0.1 25 87-111 68-94 (122)
48 PF08693 SKG6: Transmembrane a 87.3 0.39 8.5E-06 32.5 1.6 21 88-108 17-38 (40)
49 PF06602 Myotub-related: Myotu 86.7 0.88 1.9E-05 46.4 4.6 41 339-379 215-255 (353)
50 KOG2386|consensus 85.4 1.1 2.4E-05 45.8 4.5 88 325-420 91-181 (393)
51 KOG1089|consensus 82.4 1.9 4.2E-05 46.0 4.9 30 349-378 338-367 (573)
52 PF01299 Lamp: Lysosome-associ 80.8 3.8 8.2E-05 40.9 6.2 54 54-111 241-300 (306)
53 PF00041 fn3: Fibronectin type 79.4 3.1 6.8E-05 31.8 4.2 24 44-67 56-79 (85)
54 PF12273 RCR: Chitin synthesis 77.8 0.63 1.4E-05 40.3 -0.3 7 87-93 2-8 (130)
55 PF01034 Syndecan: Syndecan do 73.8 1.1 2.3E-05 33.6 0.1 24 87-110 13-38 (64)
56 PF15102 TMEM154: TMEM154 prot 69.1 7.8 0.00017 34.1 4.3 8 166-173 123-130 (146)
57 PF02439 Adeno_E3_CR2: Adenovi 67.2 2.5 5.4E-05 28.2 0.7 22 86-107 6-29 (38)
58 PF04478 Mid2: Mid2 like cell 63.6 5.5 0.00012 35.3 2.3 17 94-110 62-78 (154)
59 PF05283 MGC-24: Multi-glycosy 63.4 4.9 0.00011 37.0 2.1 24 84-107 159-182 (186)
60 PF12877 DUF3827: Domain of un 62.6 3.2 7E-05 44.7 0.8 12 82-93 265-276 (684)
61 PF05808 Podoplanin: Podoplani 58.3 3.3 7.1E-05 37.0 0.0 28 87-114 133-160 (162)
62 PF14575 EphA2_TM: Ephrin type 55.6 5.6 0.00012 31.0 0.9 11 89-99 7-17 (75)
63 COG3453 Uncharacterized protei 54.9 22 0.00047 30.4 4.3 27 336-366 72-98 (130)
64 COG1470 Predicted membrane pro 54.6 9.7 0.00021 39.7 2.7 52 54-108 458-509 (513)
65 KOG4221|consensus 47.4 31 0.00067 40.3 5.3 20 48-67 986-1005(1381)
66 PF05568 ASFV_J13L: African sw 46.4 7 0.00015 34.1 0.1 23 88-110 34-56 (189)
67 smart00060 FN3 Fibronectin typ 44.8 24 0.00053 25.3 3.0 23 44-66 57-79 (83)
68 PF05454 DAG1: Dystroglycan (D 43.7 7.7 0.00017 38.3 0.0 8 103-110 168-175 (290)
69 PF02061 Lambda_CIII: Lambda P 42.1 34 0.00074 23.2 2.8 24 322-345 2-25 (45)
70 cd01533 4RHOD_Repeat_2 Member 41.4 44 0.00096 27.2 4.3 15 355-370 66-80 (109)
71 cd01518 RHOD_YceA Member of th 39.3 31 0.00068 27.6 3.0 18 354-372 60-77 (101)
72 cd01523 RHOD_Lact_B Member of 38.1 31 0.00066 27.6 2.8 17 354-371 60-76 (100)
73 TIGR01167 LPXTG_anchor LPXTG-m 37.9 34 0.00075 21.7 2.4 8 100-107 24-31 (34)
74 PF02009 Rifin_STEVOR: Rifin/s 37.8 9.4 0.0002 37.9 -0.4 8 103-110 277-284 (299)
75 PTZ00382 Variant-specific surf 37.7 20 0.00044 29.3 1.6 6 103-108 89-94 (96)
76 PF07204 Orthoreo_P10: Orthore 36.7 14 0.00031 29.8 0.5 20 91-110 50-69 (98)
77 PF05337 CSF-1: Macrophage col 35.8 12 0.00026 36.3 0.0 23 89-111 232-254 (285)
78 PF06084 Cytomega_TRL10: Cytom 34.7 29 0.00062 29.1 2.0 33 75-107 47-80 (150)
79 COG3889 Predicted solute bindi 33.0 39 0.00085 37.5 3.2 15 95-109 856-871 (872)
80 PF11770 GAPT: GRB2-binding ad 32.5 40 0.00086 29.8 2.6 25 83-107 7-33 (158)
81 PF11353 DUF3153: Protein of u 32.1 42 0.0009 31.4 3.0 15 50-64 166-180 (209)
82 cd01519 RHOD_HSP67B2 Member of 31.8 59 0.0013 26.0 3.5 16 354-370 65-80 (106)
83 TIGR03000 plancto_dom_1 Planct 31.7 65 0.0014 25.1 3.4 28 42-69 27-54 (75)
84 PF08374 Protocadherin: Protoc 31.1 14 0.0003 34.6 -0.4 11 235-245 181-191 (221)
85 cd00063 FN3 Fibronectin type 3 30.8 59 0.0013 23.9 3.2 23 44-66 57-79 (93)
86 PRK14750 kdpF potassium-transp 30.2 34 0.00073 21.3 1.3 18 86-103 3-20 (29)
87 PF06365 CD34_antigen: CD34/Po 30.0 26 0.00057 32.7 1.2 14 99-112 118-131 (202)
88 cd01448 TST_Repeat_1 Thiosulfa 29.3 89 0.0019 25.8 4.3 33 336-372 63-95 (122)
89 COG5495 Uncharacterized conser 29.2 56 0.0012 31.3 3.2 18 353-371 89-106 (289)
90 PLN02160 thiosulfate sulfurtra 28.5 51 0.0011 28.5 2.7 18 354-372 80-97 (136)
91 PRK11493 sseA 3-mercaptopyruva 25.8 61 0.0013 31.7 3.0 49 322-373 201-249 (281)
92 TIGR03865 PQQ_CXXCW PQQ-depend 25.4 1.1E+02 0.0024 27.2 4.4 18 354-371 115-132 (162)
93 PF14610 DUF4448: Protein of u 25.2 40 0.00087 31.0 1.5 13 90-102 164-176 (189)
94 PF10727 Rossmann-like: Rossma 24.7 81 0.0018 27.1 3.2 29 339-367 80-108 (127)
95 PF03684 UPF0179: Uncharacteri 24.7 44 0.00096 29.4 1.6 29 35-63 19-51 (142)
96 KOG1530|consensus 24.6 64 0.0014 28.0 2.5 29 341-370 75-103 (136)
97 KOG0208|consensus 24.3 1.6E+02 0.0034 34.1 6.0 67 356-422 852-926 (1140)
98 PRK10287 thiosulfate:cyanide s 24.2 1.2E+02 0.0025 25.0 4.0 17 355-372 60-76 (104)
99 PF10577 UPF0560: Uncharacteri 24.2 37 0.0008 38.0 1.2 18 94-111 285-302 (807)
100 cd01520 RHOD_YbbB Member of th 23.2 1.7E+02 0.0036 24.7 4.9 20 354-373 85-104 (128)
101 TIGR03024 arch_pef_cterm PEF-C 23.0 36 0.00078 20.8 0.5 9 99-107 15-23 (26)
102 PTZ00208 65 kDa invariant surf 23.0 58 0.0013 33.5 2.2 11 101-111 406-416 (436)
103 PF12768 Rax2: Cortical protei 22.9 52 0.0011 32.4 1.9 20 90-109 238-257 (281)
104 PF02035 Coagulin: Coagulin; 22.1 30 0.00066 29.6 0.1 28 228-259 98-126 (174)
105 TIGR02981 phageshock_pspE phag 21.2 1.5E+02 0.0033 24.1 4.1 18 354-372 57-74 (101)
106 PF13908 Shisa: Wnt and FGF in 21.0 88 0.0019 28.4 2.9 8 86-93 78-85 (179)
107 TIGR03154 sulfolob_CbsA cytoch 20.5 1.3E+02 0.0029 30.3 4.1 22 87-108 444-465 (465)
108 PF11344 DUF3146: Protein of u 20.1 71 0.0015 24.9 1.7 15 49-63 65-79 (80)
No 1
>KOG4228|consensus
Probab=100.00 E-value=3.3e-80 Score=663.50 Aligned_cols=280 Identities=41% Similarity=0.712 Sum_probs=261.4
Q ss_pred ceeccch----hhhccCCcc-hHHHhHHhhcCCCCCcccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEee
Q psy15334 150 PIPVTLF----CTMTLDPDW-IAVEFEDLKSVGRGQPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNAN 224 (451)
Q Consensus 150 ~i~~~~f----~~~~~~~~~-~~~Ef~~l~~~~~~~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs 224 (451)
||++++| ..+..|.+. |++|||.|.+. ..++..+.+++|+.||||.||++|||+||+|.+++|++++||||||
T Consensus 522 pI~v~dl~~~i~~~~~~~~~~F~~Eyesl~~~--~~~~~~s~~~en~~KNRY~nilayD~sRV~L~~i~Gd~~sDYINAn 599 (1087)
T KOG4228|consen 522 PIPVADLLEHIERRKADDGYGFKQEYESLPEG--QFTWEHSNKKENKQKNRYENILAYDHSRVILPPIEGDPNSDYINAN 599 (1087)
T ss_pred CccHHHHHHHHHHhhcccccchHHHHhhcccc--CCccccCccccccccccCCcchhhhcceeeecccCCCccccceeee
Confidence 8999999 445555555 99999999876 6678899999999999999999999999999999999999999999
Q ss_pred eecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEeecce
Q psy15334 225 YIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYGSW 304 (451)
Q Consensus 225 ~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~~~ 304 (451)
|||||+.+++|||||||+++|+.||||||||+++.+|||+|+++|.++.||++||| .++..||++.|++.+.....+|
T Consensus 600 yIdGy~e~n~yIaaQgP~~eTv~DFWRMVWEq~S~~IVMvTnl~E~~r~kC~qYWP--~~t~~yGdi~V~~~~~~~~a~y 677 (1087)
T KOG4228|consen 600 YIDGYKEPNAYIAAQGPRPETVGDFWRMVWEQKSAGIVMVTNLEEFSRVKCAQYWP--EGTETYGDIKVTLVQTKPLAEY 677 (1087)
T ss_pred eeecccccccceeccCCcccchHHHHHHheeccCCcEEEEecccccccccccccCC--CCccccccccccceeeeeeccc
Confidence 99999999999999999999999999999999999999999999999999999999 3467899999999999999999
Q ss_pred EEEEEEEEeC--CeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHH
Q psy15334 305 VLSELRMCRG--GEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRL 382 (451)
Q Consensus 305 ~~r~l~v~~~--~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l 382 (451)
.+|+|.+.+. ...|.|+||||++|||||+|..+..+|.|++.++....+..||||||||||+||||+|++||.|++++
T Consensus 678 ~iRtf~l~~~g~~~~R~v~qfhFt~Wpd~gvPe~~t~lL~f~rrvk~~~p~~aGPiVVHCSAGvGRTG~fi~iDaml~~~ 757 (1087)
T KOG4228|consen 678 GIRTFALKKQGENPKREVRQFHFTAWPDHGVPETPTGLLKFRRRVKTFNPPDAGPIVVHCSAGVGRTGCFIVIDAMLDRL 757 (1087)
T ss_pred eEEeeeccccCCCCCceeeeeeeccCCCCCCcccchHHHHHHHHhccCCCcCCCCEEEECCCCCCCcceEEEeHHHHHHH
Confidence 9999999854 46899999999999999999999999999999999887788999999999999999999999999999
Q ss_pred HcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHH-HHhcCCCCchhhh
Q psy15334 383 YNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQV-YLQGGVRHEERQI 436 (451)
Q Consensus 383 ~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~-~l~~~~~~~~~~~ 436 (451)
+.+ +.+||++.|+.||.||+.||||++||+|||+++++ ++.|.++++...+
T Consensus 758 ~~e---~~vdiy~~v~~lR~QR~~mVQt~eQYiFi~~AllE~~~~G~T~i~~~~~ 809 (1087)
T KOG4228|consen 758 ECE---GKVDIYGHVKTLRRQRNNMVQTEEQYIFIHEALLEACLCGDTEVPASTL 809 (1087)
T ss_pred Hhh---CccceechhHHHHhccccccccHHHHHHHHHHHHHHHhcCCccccHHHH
Confidence 999 99999999999999999999999999999999997 5677777766543
No 2
>PHA02742 protein tyrosine phosphatase; Provisional
Probab=100.00 E-value=1.7e-76 Score=585.68 Aligned_cols=276 Identities=27% Similarity=0.504 Sum_probs=250.2
Q ss_pred Cccee-ccchhhhccCCcc---hHHHhHHhhcCCCCCcccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEe
Q psy15334 148 SHPIP-VTLFCTMTLDPDW---IAVEFEDLKSVGRGQPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNA 223 (451)
Q Consensus 148 ~~~i~-~~~f~~~~~~~~~---~~~Ef~~l~~~~~~~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINA 223 (451)
.+.+. +.+|.....+++. +.+||+.|.......+|..+..++|..||||.||+|||||||+|+..+ +++|||||
T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~i~~E~~~i~~~~~~~~~~~~~~~~N~~KNRy~~i~~~D~tRV~L~~~~--~~~dYINA 83 (303)
T PHA02742 6 SKKNSFAKNCEQLIEESNLAEILKEEHEHIMQEIVAFSCNESLELKNMKKCRYPDAPCFDRNRVILKIED--GGDDFINA 83 (303)
T ss_pred ccccCcHHHHHHHHhcCCHHHHHHHHHHHHhcCCCCCchhhccChhhhhcCCCCCCCCCCCCEEEeCCCC--CCCCcEEE
Confidence 34677 7888666666665 899999998766678889999999999999999999999999998633 36899999
Q ss_pred eeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCC-CCCCceeeeEEEEEeeeEeec
Q psy15334 224 NYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPS-DQHPMRCADIEISLQDVCHYG 302 (451)
Q Consensus 224 s~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~-~~~~~~~g~~~V~~~~~~~~~ 302 (451)
|||+|+..+++|||||||+++|++|||+||||+++.+|||||+..|.|+.||.+|||. +++...||+++|+++++....
T Consensus 84 syI~g~~~~~~fIatQ~Pl~~T~~dFW~MVw~~~~~vIVmLt~~~E~g~~kc~~YWp~~~~~~~~~g~~~V~~~~~~~~~ 163 (303)
T PHA02742 84 SYVDGHNAKGRFICTQAPLEETALDFWQAIFQDQVRVIVMITKIMEDGKEACYPYWMPHERGKATHGEFKIKTKKIKSFR 163 (303)
T ss_pred eeecCCCCCCeEEEECCCCcccHHHHHHHHhhcCCCeEEEcchhhhCCeeeccccCCCCCCCceEEEEEEEEEEEEEEcC
Confidence 9999999899999999999999999999999999999999999999999999999985 456788999999999998889
Q ss_pred ceEEEEEEEEe--CCeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcC-----------CCCCCEEEEcCCCCchh
Q psy15334 303 SWVLSELRMCR--GGEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHIS-----------PAQHPIVVHCSAGVGRT 369 (451)
Q Consensus 303 ~~~~r~l~v~~--~~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~-----------~~~~PivVHCs~GvGRt 369 (451)
++++++|.+++ .++.|.|+||||++|||+|+|.++..+++|+..+++... ...+|||||||+|+|||
T Consensus 164 ~~~~~~l~l~~~~~~~~r~V~h~~y~~Wpd~gvP~~~~~~l~~i~~v~~~~~~~~~~~~~~~~~~~~PIvVHCsaGvGRT 243 (303)
T PHA02742 164 NYAVTNLCLTDTNTGASLDIKHFAYEDWPHGGLPRDPNKFLDFVLAVREADLKADVDIKGENIVKEPPILVHCSAGLDRA 243 (303)
T ss_pred CEEEEEEEEEECCCCceEEEEEEEECCCCCCCcCCCHHHHHHHHHHHHHHhhhccccccccccCCCCCeEEECCCCCchh
Confidence 99999999986 468999999999999999999999999999999876321 12479999999999999
Q ss_pred hhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHHHHhcC
Q psy15334 370 GTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQVYLQGG 428 (451)
Q Consensus 370 G~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~~l~~~ 428 (451)
|+|||+|++++++..+ +.+||+++|+.||+||++||||.+||.|||++|++|++..
T Consensus 244 GtF~aid~~i~~~~~~---~~v~v~~~V~~lR~qR~~~Vqt~~QY~F~y~~l~~y~~~~ 299 (303)
T PHA02742 244 GAFCAIDICISKYNER---AIIPLLSIVRDLRKQRHNCLSLPQQYIFCYFIVLIFAKLM 299 (303)
T ss_pred HHHHHHHHHHHHHHhc---CCCCHHHHHHHHHhhcccccCCHHHHHHHHHHHHHHHHHh
Confidence 9999999999999887 8999999999999999999999999999999999998754
No 3
>PHA02740 protein tyrosine phosphatase; Provisional
Probab=100.00 E-value=5.9e-76 Score=578.40 Aligned_cols=269 Identities=19% Similarity=0.404 Sum_probs=241.1
Q ss_pred cceeccchhhhccCCcc---hHHHhHHhhcCCCCCcccccccccCCCCCC--CCCCCCCCCcceEecCCCCCCCCCcEEe
Q psy15334 149 HPIPVTLFCTMTLDPDW---IAVEFEDLKSVGRGQPTTAAELRCNRPKNR--YTNIFPYDSSRYKLQPVDDEEGSDYVNA 223 (451)
Q Consensus 149 ~~i~~~~f~~~~~~~~~---~~~Ef~~l~~~~~~~~~~~~~~~~N~~KNR--~~~i~pyD~sRV~L~~~~~~~~~dYINA 223 (451)
.++++++|......++. +.+||++|.......++..+.+++|+.||| |.||+|||||||+|++ ++|||||
T Consensus 7 ~~~~~~~f~~~~~~~~~~~~i~~E~~~i~~~~~~~~~~~~~~~~n~~knr~r~~~i~~~D~sRV~L~~-----~~~YINA 81 (298)
T PHA02740 7 VDINGMDFINFINKPDLLSCIIKEYRAIVPEHEDEANKACAQAENKAKDENLALHITRLLHRRIKLFN-----DEKVLDA 81 (298)
T ss_pred cccCHHHHHHHHhCCCHHHHHHHHHHHHhccCccchhhhhhchhhhcccccccCCcCCccCceEEecC-----CCCcEEE
Confidence 46788899555455544 999999998776677888899999999996 6899999999999964 4589999
Q ss_pred eeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceeccccccc-ccCCCC-CCCceeeeEEEEEeeeEee
Q psy15334 224 NYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCY-KYWPSD-QHPMRCADIEISLQDVCHY 301 (451)
Q Consensus 224 s~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~-~YwP~~-~~~~~~g~~~V~~~~~~~~ 301 (451)
||||||..+++|||||||+++|++|||+||||+++.+|||||+..| .||. +|||.+ +....||+|.|++.++...
T Consensus 82 nyVdG~~~~~~fIaTQ~Pl~~T~~dFW~MVwe~~~~iIVMLt~~~e---~kc~~~YWP~~~~~~~~~g~f~V~~~~~~~~ 158 (298)
T PHA02740 82 RFVDGYDFEQKFICIINLCEDACDKFLQALSDNKVQIIVLISRHAD---KKCFNQFWSLKEGCVITSDKFQIETLEIIIK 158 (298)
T ss_pred EEecCCCCCCcEEEecCCchhhHHHHHHHHHhcCCCEEEEcccccc---ccccccCCCCCCCCeEEECCEEEEEEEEEec
Confidence 9999999999999999999999999999999999999999999877 4798 999975 4457899999999998888
Q ss_pred cceEEEEEEEEe-CCeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhc--------CCCCCCEEEEcCCCCchhhhH
Q psy15334 302 GSWVLSELRMCR-GGEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHI--------SPAQHPIVVHCSAGVGRTGTF 372 (451)
Q Consensus 302 ~~~~~r~l~v~~-~~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~--------~~~~~PivVHCs~GvGRtG~f 372 (451)
.++++|.|.+++ .+++|.|+||||++|||+|+|.++..+++|+..|++.. ....+|||||||||+||||+|
T Consensus 159 ~~~~~~~l~l~~~~~~~r~V~Hfqyt~WPd~gvP~~~~~~l~fi~~V~~~~~~~~~~~~~~~~~PIVVHCSaGvGRTGtF 238 (298)
T PHA02740 159 PHFNLTLLSLTDKFGQAQKISHFQYTAWPADGFSHDPDAFIDFFCNIDDLCADLEKHKADGKIAPIIIDCIDGISSSAVF 238 (298)
T ss_pred CCEEEEEEEEEcCCCCcEEEEEEeecCCCCCCcCCCHHHHHHHHHHHHHHHHHHHHhhccCCCCCEEEECCCCCchhHHH
Confidence 999999999975 46799999999999999999999999999998887531 223589999999999999999
Q ss_pred HHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHHHHhcC
Q psy15334 373 IALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQVYLQGG 428 (451)
Q Consensus 373 iai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~~l~~~ 428 (451)
||+|++++++..+ +.|||+++|++||+||++||||.+||.|||++|++|+...
T Consensus 239 caiDi~l~~~~~~---~~vdi~~~V~~lR~qR~~~Vqt~~QY~F~y~~l~~yl~~~ 291 (298)
T PHA02740 239 CVFDICATEFDKT---GMLSIANALKKVRQKKYGCMNCLDDYVFCYHLIAAYLKEK 291 (298)
T ss_pred HHHHHHHHHHHhc---CcccHHHHHHHHHhhCccccCCHHHHHHHHHHHHHHHHHh
Confidence 9999999999988 9999999999999999999999999999999999998764
No 4
>KOG0791|consensus
Probab=100.00 E-value=1.7e-76 Score=571.76 Aligned_cols=292 Identities=48% Similarity=0.855 Sum_probs=275.8
Q ss_pred Ccceeccch----hhhccCCcc-hHHHhHHhhcCCCCCcccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEE
Q psy15334 148 SHPIPVTLF----CTMTLDPDW-IAVEFEDLKSVGRGQPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVN 222 (451)
Q Consensus 148 ~~~i~~~~f----~~~~~~~~~-~~~Ef~~l~~~~~~~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYIN 222 (451)
+.+|++++| +.+..|++. |++||++|+..+.++++.+|.+++|.+||||.||+|||+|||+|.|+.+++++||||
T Consensus 75 s~~i~len~e~~~~~~~~dsn~~f~eey~~lk~~g~~~~~~aA~l~~n~~knr~~nilpYd~Srvkl~~~~e~~~~DYin 154 (374)
T KOG0791|consen 75 TNPVSLENFELHVKKMSADSNYKFSEEYNELKLVGLDQSKLAALLPLNRPKNRYTNILPYDNSRVKLSPVNEEEGSDYIN 154 (374)
T ss_pred CCceeeccchhhhhhcccccchhhHHHHhhhhhhcccchhhhhhcCCCCCcCCcCCCCCcccccceeccccccchhhhhh
Confidence 458889998 677889888 999999999999999999999999999999999999999999999999999999999
Q ss_pred eeeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEeec
Q psy15334 223 ANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYG 302 (451)
Q Consensus 223 As~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~ 302 (451)
||||+|+.+++.|||||||+++|..|||+||||+++.+|||||.+.|.++.||++|||.+..++.||+++|+.++++...
T Consensus 155 A~~~Pg~~s~~~fIAtQGPlp~t~~~fW~mvwq~~~~~IVmlt~~~e~~R~kc~~ywP~~~~~~~~gdi~V~~v~e~~~~ 234 (374)
T KOG0791|consen 155 ASYIPGYNSPREFIATQGPLPETRDDFWKMVWQQKSHIIVMLTKCNEKGRVKCDEYWPDEEVPVAYGDITVTMVSEESLD 234 (374)
T ss_pred hhhcCCCCCcceEEEeeCCCCCChhHHHHHHHHhhcceEEEEeecccccchhhhhhcccccccceeccEEEEEechhhcC
Confidence 99999999999999999999999999999999999999999999999999999999999888899999999999999999
Q ss_pred ceEEEEEEEEeCCeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHH
Q psy15334 303 SWVLSELRMCRGGEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRL 382 (451)
Q Consensus 303 ~~~~r~l~v~~~~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l 382 (451)
+|++|+|.+...+++|.++||||+.|||||+|.++..+++|++.+++......+|++|||||||||||||+|+|.+++|+
T Consensus 235 ~w~ir~~~l~~~~~sr~ir~f~y~~wPd~gvp~~~~sl~~f~~~~r~~~~~~~~p~iVhCSAGVgRTGTFiald~LLqq~ 314 (374)
T KOG0791|consen 235 EWTIREFRLNYAGESRKIRHFHYTAWPDFGVPSSTESLLQFVRMVRQSLDTSKGPTIVHCSAGVGRTGTFIALDRLLQQI 314 (374)
T ss_pred CceEEEEeeecccccceeEEEEEeeccccCCCCCchhHHHHHHHHHhhcccCCCceeEEeecccccccchHhHHHHHHHh
Confidence 99999999998899999999999999999999999999999999999988788999999999999999999999999999
Q ss_pred HcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHHHHhcCCC--Cchhhhhhcccc
Q psy15334 383 YNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQVYLQGGVR--HEERQIQNNEAI 442 (451)
Q Consensus 383 ~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~~l~~~~~--~~~~~~~~~~~~ 442 (451)
..+ +.+|++++|.+||.+|+.||||++||+|+|+|+++.|.++.. .....++.|.++
T Consensus 315 ~~~---~~vdi~~iv~~lR~~R~~mVqte~Qyvfl~~c~~~~l~~~~~~~~~s~~~~~n~~~ 373 (374)
T KOG0791|consen 315 DSE---ETVDIFGVVLELRSARMLMVQTEDQYVFLHQCVLESLQGKKPQLCISDVIYENVGP 373 (374)
T ss_pred ccc---ccccHHHHHHHhhhccccccchHHHHHHHHHHHHHHHhCCCcccccchhcccCCCC
Confidence 888 899999999999999999999999999999999999998864 344455555443
No 5
>PHA02747 protein tyrosine phosphatase; Provisional
Probab=100.00 E-value=1.7e-74 Score=573.24 Aligned_cols=277 Identities=29% Similarity=0.504 Sum_probs=247.9
Q ss_pred CcceeccchhhhccCCcc---hHHHhHHhhcCCCCCcccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEee
Q psy15334 148 SHPIPVTLFCTMTLDPDW---IAVEFEDLKSVGRGQPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNAN 224 (451)
Q Consensus 148 ~~~i~~~~f~~~~~~~~~---~~~Ef~~l~~~~~~~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs 224 (451)
...+.+.||.+...+++. +.+||+.|.....+..+..+..++|..||||.||+||||+||+|.+.++ +.+||||||
T Consensus 6 ~~~~~~~~f~~~~~~~~~~~~i~~E~~~i~~~~~~~~~~~~~~~~N~~kNRy~di~p~D~sRV~L~~~~~-~~~dYINAs 84 (312)
T PHA02747 6 FAECRAIDFLKRRNQLNCFGIIRDEHHQIILKPFDGLIANFEKPENQPKNRYWDIPCWDHNRVILDSGGG-STSDYIHAN 84 (312)
T ss_pred ccccCHHHHHHHHhCCCHHHHHHHHHHHHhcCCCCChhhhhhCchhhccCCCCCCCCCCCCEeEecCCCC-CCCCcEEee
Confidence 456788999666666665 8999999976655667778889999999999999999999999987443 468999999
Q ss_pred eecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccce-ecccccccccCCC-CCCCceeeeEEEEEeeeEeec
Q psy15334 225 YIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCI-EKHRDKCYKYWPS-DQHPMRCADIEISLQDVCHYG 302 (451)
Q Consensus 225 ~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~-E~~~~kc~~YwP~-~~~~~~~g~~~V~~~~~~~~~ 302 (451)
||+||..+++|||||||+++|++|||+||||++|.+|||||... |.|+.||.+|||. +++.+.+|++.|++.+.....
T Consensus 85 yV~g~~~~~~yIaTQ~Pl~~T~~dFW~MVwe~~~~~IVMLt~~~~e~g~~kc~~YWp~~~~~~~~~g~~~V~~~~~~~~~ 164 (312)
T PHA02747 85 WIDGFEDDKKFIATQGPFAETCADFWKAVWQEHCSIIVMLTPTKGTNGEEKCYQYWCLNEDGNIDMEDFRIETLKTSVRA 164 (312)
T ss_pred eecCCCCCCeEEEeCCCcchhHHHHHHHHHhccCCEEEEcccccccCCcccccccCCCCCCCCeEeeeEEEEEEEEEecC
Confidence 99999999999999999999999999999999999999999999 8999999999986 455788999999999998889
Q ss_pred ceEEEEEEEEeC--CeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhc---C-------CCCCCEEEEcCCCCchhh
Q psy15334 303 SWVLSELRMCRG--GEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHI---S-------PAQHPIVVHCSAGVGRTG 370 (451)
Q Consensus 303 ~~~~r~l~v~~~--~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~---~-------~~~~PivVHCs~GvGRtG 370 (451)
++++|.|.++.. ++.|.|+||||++|||+|+|.++..+++|+..++... . ...+|||||||||+||||
T Consensus 165 ~~~~~~l~l~~~~~~~~r~V~h~~y~~Wpd~~~P~~~~~~l~fi~~v~~~~~~~~~~~~~~~~~~~PIvVHCsaGvGRtG 244 (312)
T PHA02747 165 KYILTLIEITDKILKDSRKISHFQCSEWFEDETPSDHPDFIKFIKIIDINRKKSGKLFNPKDALLCPIVVHCSDGVGKTG 244 (312)
T ss_pred CeEEEEEEEEECCCCCceEEEEEEECCCCCCCCCCChHHHHHHHHHHHHHHHHhhccccccccCCCCEEEEecCCCcchh
Confidence 999999999863 6899999999999999999999999999987765421 1 123799999999999999
Q ss_pred hHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHH---HHHHHHHHhcC
Q psy15334 371 TFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFI---YECVQVYLQGG 428 (451)
Q Consensus 371 ~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fi---y~~l~~~l~~~ 428 (451)
+|||+|++++++..+ +.+||+++|+.||+||++||||.+||.|| |++|.+||...
T Consensus 245 tfcaidi~i~~l~~~---~~v~v~~~V~~lR~qR~~~Vqt~~QY~F~~~~Y~~l~~~~~~~ 302 (312)
T PHA02747 245 IFCAVDICLNQLVKR---KAICLAKTAEKIREQRHAGIMNFDDYLFIQPGYEVLHYFLSKI 302 (312)
T ss_pred HHHHHHHHHHHHHhc---CCCCHHHHHHHHHhccccccCCHHHHHHHHHHHHHHHHHHHHh
Confidence 999999999999888 89999999999999999999999999999 99999998764
No 6
>PHA02746 protein tyrosine phosphatase; Provisional
Probab=100.00 E-value=3.1e-74 Score=573.55 Aligned_cols=276 Identities=26% Similarity=0.481 Sum_probs=246.6
Q ss_pred CcceeccchhhhccCCcc---hHHHhHHhhcCCCCCcccccccccCCCCCCCCCCCCCCCcceEecCCC-----------
Q psy15334 148 SHPIPVTLFCTMTLDPDW---IAVEFEDLKSVGRGQPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVD----------- 213 (451)
Q Consensus 148 ~~~i~~~~f~~~~~~~~~---~~~Ef~~l~~~~~~~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~----------- 213 (451)
...+.+.+|......+++ +.+||.+|.......++..+..++|+.||||.||+|||||||+|.+..
T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~i~~~~~~~t~~~~~~~~N~~KNRy~di~p~D~sRV~L~~~~~~~~~~~~~~~ 85 (323)
T PHA02746 6 FEIFNAFDFFDKTNHAKFCEFVLLEHAEVMDIPIRGTTNHFLKKENLKKNRFHDIPCWDHSRVVINAHESLKMFDVGDSD 85 (323)
T ss_pred cccCCHHHHHHHhcCCCHHHHHHHHHHHHhcCCCCcchhhccChhhccccCCCCCCCCcCCEEEecCCCCcccccccccc
Confidence 345778888666666666 889999998877777888899999999999999999999999998743
Q ss_pred --------CCCCCCcEEeeeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCC-CC
Q psy15334 214 --------DEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSD-QH 284 (451)
Q Consensus 214 --------~~~~~dYINAs~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~-~~ 284 (451)
+++++||||||||+||..+++|||||||+++|++|||+||||+++.+|||||+.. .++.||.+|||.+ +.
T Consensus 86 ~~~~~~~~~~~~~dYINAsyV~g~~~~~~fIaTQ~Pl~~T~~dFW~MIwe~~v~vIVMLt~~~-e~~~kc~~YWP~~~~~ 164 (323)
T PHA02746 86 GKKIEVTSEDNAENYIHANFVDGFKEANKFICAQGPKEDTSEDFFKLISEHESQVIVSLTDID-DDDEKCFELWTKEEDS 164 (323)
T ss_pred ccccccccCCCCCCeEEEecccCCCCCCeEEEeCCCChhhHHHHHHHHHhhCCCEEEEecccc-cCCccCCCCCCCCCCC
Confidence 2345899999999999999999999999999999999999999999999999975 5678999999976 44
Q ss_pred CceeeeEEEEEeeeEeecceEEEEEEEEeC--CeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhc----------C
Q psy15334 285 PMRCADIEISLQDVCHYGSWVLSELRMCRG--GEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHI----------S 352 (451)
Q Consensus 285 ~~~~g~~~V~~~~~~~~~~~~~r~l~v~~~--~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~----------~ 352 (451)
.+.||++.|++.+.....+++.+.|.+++. ++.|.|+||||++|||+|+|.++..+++|++.|++.. .
T Consensus 165 ~~~~g~~~V~~~~~~~~~~~~~~~l~l~~~~~~~~r~V~h~~y~~Wpd~gvP~~~~~~l~~i~~v~~~~~~~~~~~~~~~ 244 (323)
T PHA02746 165 ELAFGRFVAKILDIIEELSFTKTRLMITDKISDTSREIHHFWFPDWPDNGIPTGMAEFLELINKVNEEQAELIKQADNDP 244 (323)
T ss_pred CeEEcCEEEEEEEEEEcCCeEEEEEEEEECCCCCceEEEEEEECCCCCCCcCCCHHHHHHHHHHHHHHHHHHhhhhccCC
Confidence 678999999999888888899999999853 5789999999999999999999999999999887542 1
Q ss_pred CCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHHHHhc
Q psy15334 353 PAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQVYLQG 427 (451)
Q Consensus 353 ~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~~l~~ 427 (451)
+..+|||||||+|+||||+|||+|++++++..+ +.+||+++|+.||+||++||||.+||.|||++|.+||..
T Consensus 245 ~~~~PIvVHCsaGvGRTGtfcaid~~l~~l~~~---~~vdv~~~V~~lR~qR~~~Vqt~~QY~F~y~~l~~~l~~ 316 (323)
T PHA02746 245 QTLGPIVVHCSAGIGRAGTFCAIDNALEQLEKE---KEVCLGEIVLKIRKQRHSSVFLPEQYAFCYKALKYAIIE 316 (323)
T ss_pred CCCCCEEEEcCCCCCcchhHHHHHHHHHHHHhc---CCCCHHHHHHHHHhcccccCCCHHHHHHHHHHHHHHHHH
Confidence 124899999999999999999999999999888 899999999999999999999999999999999998754
No 7
>PHA02738 hypothetical protein; Provisional
Probab=100.00 E-value=4.3e-73 Score=564.63 Aligned_cols=277 Identities=32% Similarity=0.536 Sum_probs=244.4
Q ss_pred CCCcceeccchhhhccCCcc---hHHHhHHhhcCCCCCcccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEE
Q psy15334 146 NCSHPIPVTLFCTMTLDPDW---IAVEFEDLKSVGRGQPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVN 222 (451)
Q Consensus 146 ~~~~~i~~~~f~~~~~~~~~---~~~Ef~~l~~~~~~~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYIN 222 (451)
++.+.+.+++|......++. +.+||+.|.....+.++..+. .|+.||||.||+|||++||+|++. .+++||||
T Consensus 4 ~~~~~~~~~~fl~~~~~~~~~~~i~~E~~~i~~~~~~~t~~~~~--~N~~kNR~~~v~~~d~sRV~L~~~--~~~~dYIN 79 (320)
T PHA02738 4 CKFRELKYAEFLALMEKSDCEEVITREHQKVISEKVDGTFNAEK--KNRKLNRYLDAVCFDHSRVILPAE--RNRGDYIN 79 (320)
T ss_pred cccccCCHHHHHHHHhCCCHHHHHHHHHHHHhcCCCCcchhHHH--HhhhhCCCCCcCCCcCCEEEeCCC--CCCCCceE
Confidence 44556778888544445544 899999997655555555543 499999999999999999999753 34689999
Q ss_pred eeeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCC-CCCceeeeEEEEEeeeEee
Q psy15334 223 ANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSD-QHPMRCADIEISLQDVCHY 301 (451)
Q Consensus 223 As~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~-~~~~~~g~~~V~~~~~~~~ 301 (451)
||||+|+..+++|||||||+++|++|||+||||++|.+|||||+..|.++.+|.+|||.+ +..+.||+|+|++.+....
T Consensus 80 AsyVdg~~~~~kfI~TQ~Pl~~T~~dFW~MVwe~~v~~IVmL~~~~E~~~~kc~~YWp~~~~~~~~~g~f~V~~~~~~~~ 159 (320)
T PHA02738 80 ANYVDGFEYKKKFICGQAPTRQTCYDFYRMLWMEHVQIIVMLCKKKENGREKCFPYWSDVEQGSIRFGKFKITTTQVETH 159 (320)
T ss_pred eEEecCCCCCceeEEecCCChHHHHHHHHHHHHhCCcEEEEeeehhhCCeeeccccCCCCCCCceEeccEEEEEEEEEec
Confidence 999999999999999999999999999999999999999999999999999999999974 5678999999999999989
Q ss_pred cceEEEEEEEEeC-CeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcC-------------CCCCCEEEEcCCCCc
Q psy15334 302 GSWVLSELRMCRG-GEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHIS-------------PAQHPIVVHCSAGVG 367 (451)
Q Consensus 302 ~~~~~r~l~v~~~-~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~-------------~~~~PivVHCs~GvG 367 (451)
.+++.|.|.++.+ ++.|.|+||||++|||+|+|.++..+++|+..|++... ...+|||||||+|+|
T Consensus 160 ~~~~~~~l~l~~~~~~~r~V~h~~y~~Wpd~gvP~~~~~~l~fi~~V~~~~~~~~~~~~~~~~~~~~~~PIVVHCs~GiG 239 (320)
T PHA02738 160 PHYVKSTLLLTDGTSATQTVTHFNFTAWPDHDVPKNTSEFLNFVLEVRQCQKELAQESLQIGHNRLQPPPIVVHCNAGLG 239 (320)
T ss_pred CCeEEEEEEEEeCCCCcEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHhhhhhcccCccccCCCCeEEEcCCCCC
Confidence 9999999999864 57899999999999999999999999999998875321 124799999999999
Q ss_pred hhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHHHHhcCC
Q psy15334 368 RTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQVYLQGGV 429 (451)
Q Consensus 368 RtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~~l~~~~ 429 (451)
|||+|||+|++++++..+ +.+||+++|+.||+||++||||.+||.|||++|++|+....
T Consensus 240 RtGtFcaidi~i~~~~~~---~~vdv~~~V~~lR~qR~~~vqt~~QY~F~y~~l~~y~~~~~ 298 (320)
T PHA02738 240 RTPCYCVVDISISRFDAC---ATVSIPSIVSSIRNQRYYSLFIPFQYFFCYRAVKRYVNLTV 298 (320)
T ss_pred hhhhhhHHHHHHHHHHhc---CCcCHHHHHHHHHhhhhhccCCHHHHHHHHHHHHHHHHhcC
Confidence 999999999999999988 89999999999999999999999999999999999988654
No 8
>KOG0790|consensus
Probab=100.00 E-value=1e-73 Score=558.13 Aligned_cols=278 Identities=40% Similarity=0.654 Sum_probs=244.6
Q ss_pred hHHHhHHhhcCCCC--CcccccccccCCCCCCCCCCCCCCCcceEecCCC-CCCCCCcEEeeeecC------C--CCCcc
Q psy15334 166 IAVEFEDLKSVGRG--QPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVD-DEEGSDYVNANYIGG------Y--LSPRQ 234 (451)
Q Consensus 166 ~~~Ef~~l~~~~~~--~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~-~~~~~dYINAs~i~g------~--~~~~~ 234 (451)
|.+||+.|++.... .+...+..|+|+.||||.||+||||+||+|...+ ..+++||||||||-- . .-++.
T Consensus 246 fwEEFesLqq~~~~~~~sr~EG~r~eN~~KNRYkNIlPfDHtRViL~d~~~n~pgsdYINAnyi~~~~q~~~~~~~~kKs 325 (600)
T KOG0790|consen 246 FWEEFESLQQQEVKNLHSRLEGQRPENKGKNRYKNILPFDHTRVILQDRDPNIPGSDYINANYIMIEFQRLCNNSKPKKS 325 (600)
T ss_pred hhHHHHHhhhhhhHhhhhhhccccchhhccccccccCcccceeeEeecCCCCCccchhcccchhhhhhhhccCccccccc
Confidence 99999999765432 2344677899999999999999999999998663 468999999999831 1 12357
Q ss_pred EEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEeecceEEEEEEEE--
Q psy15334 235 YIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYGSWVLSELRMC-- 312 (451)
Q Consensus 235 yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~~~~~r~l~v~-- 312 (451)
||||||-|.+|+.|||+||||+|+++|||-|...|.|+.||.+|||+++....||.+.|++..+....+|++|+|.++
T Consensus 326 yIAtQGCL~nTVnDFW~MvwQENsrVIVMtTkE~ERgK~KC~~YWPee~~~e~~G~~~v~~v~E~~t~dY~LR~l~vs~~ 405 (600)
T KOG0790|consen 326 YIATQGCLQNTVNDFWRMVWQENSRVIVMTTKEVERGKSKCVKYWPEEGALEEYGVMRVRNVKESDTHDYTLRELKVSKL 405 (600)
T ss_pred eeehhhHHHHHHHHHHHHHHhccceEEEEehhhhhcccccccccCCcccchhhcCceEEEeccccccccceehheeeccc
Confidence 999999999999999999999999999999999999999999999998888899999999999999999999999998
Q ss_pred -eCCeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcC--CCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCC
Q psy15334 313 -RGGEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHIS--PAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAG 389 (451)
Q Consensus 313 -~~~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~--~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~ 389 (451)
+++..|.|+||||..|||||||.+|..+|+|+..|..... ...||||||||||+|||||||+||++|+++.+..-.-
T Consensus 406 ~~g~~~R~I~~yh~~tWPDHGvP~dPg~vLnFLe~V~~rq~~l~~AgpIvVHCSAGIGrTGTfiViD~lld~I~~~Gldc 485 (600)
T KOG0790|consen 406 GNGNLEREIWHYHYLTWPDHGVPSDPGGVLNFLEEVNHRQESLMDAGPIVVHCSAGIGRTGTFIVIDMLLDQIREKGLDC 485 (600)
T ss_pred cCCcchhhhhhhheeecccCCCcCCccHHHHHHHHhhhhhccccccCcEEEEccCCcCCcceEEEhHHHHHHHHhcCCCC
Confidence 5678999999999999999999999999999999976543 3578999999999999999999999999999874445
Q ss_pred CCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHHHHhcCC-CCchhhhhhccccc
Q psy15334 390 SVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQVYLQGGV-RHEERQIQNNEAIE 443 (451)
Q Consensus 390 ~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~~l~~~~-~~~~~~~~~~~~~~ 443 (451)
.+||..+|+.+|.||.|||||+.||.|||.||.+||+... ...+.+..+-+++|
T Consensus 486 ~iDi~ktIqmVRsqRSGmVQTEaQYkFiY~Avqhyietl~~r~e~eq~~~~~g~e 540 (600)
T KOG0790|consen 486 DIDIQKTIQMVRSQRSGMVQTEAQYKFIYVAVQHYIETLQKRIEEEQKSKRKGHE 540 (600)
T ss_pred cccHHHHHHHHHHHhcchhhhHHhHHHHHHHHHHHHHHHHHHHHHHHhhcccccc
Confidence 7999999999999999999999999999999999999754 34444555555544
No 9
>KOG0792|consensus
Probab=100.00 E-value=1.6e-68 Score=567.01 Aligned_cols=266 Identities=39% Similarity=0.666 Sum_probs=243.9
Q ss_pred CcchHHHhHHhhcCCCCCcccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEeeeecCC--CCCccEEEecC
Q psy15334 163 PDWIAVEFEDLKSVGRGQPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGY--LSPRQYIATQG 240 (451)
Q Consensus 163 ~~~~~~Ef~~l~~~~~~~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs~i~g~--~~~~~yI~tQ~ 240 (451)
++.+..|||.|....+...|+.|.+|+|..||||.||+|||.+||.|.+..+. +..|||||+|.-. +..+.||||||
T Consensus 867 ~g~v~~efE~i~rkkp~~~~~~a~lpeN~~KNRyrdvlPYD~tRV~L~~~~~~-nagYINAS~i~ipvgg~e~~YIA~QG 945 (1144)
T KOG0792|consen 867 DGMVFTEFEQIYRKKPGMACSIARLPENAPKNRYRDVLPYDDTRVELQPSKDN-NAGYINASHIEIPVGGIENRYIACQG 945 (1144)
T ss_pred cCchHHHHHHHhhhCCCceeeeccccccccccccCCcCCCcCceeeeccCCCC-CcccccceeeEeeecceeEEEEEecC
Confidence 34467799998776688899999999999999999999999999999986544 7799999999754 44578999999
Q ss_pred CCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCC-CCCceeeeEEEEEeeeEeecceEEEEEEEEe--CCee
Q psy15334 241 PLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSD-QHPMRCADIEISLQDVCHYGSWVLSELRMCR--GGEH 317 (451)
Q Consensus 241 Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~-~~~~~~g~~~V~~~~~~~~~~~~~r~l~v~~--~~~~ 317 (451)
||+.|+.|||+|||||++.+|||||..+|.|+.||.+|||.. .+.+.||.|.|++..+.....|++|+|++.+ .++.
T Consensus 946 PLp~T~~DFWQMVWEQ~~~lIvMlT~e~EggR~KchqYWPr~~~~t~~ygrf~v~~~~~~~t~~y~tr~m~l~~~~t~ee 1025 (1144)
T KOG0792|consen 946 PLPHTCTDFWQMVWEQGSTLIVMLTTEVEGGRVKCHQYWPRLGHETMEYGRFQVTCVFEQQTTCYVTREMTLKDLQTREE 1025 (1144)
T ss_pred CCcchHHHHHHHHHhcCceEEEEEeehhhcCeeccccccCCCCccceeccceEEEEEEecccccEEEEeEEEeeccCCce
Confidence 999999999999999999999999999999999999999987 4678999999999999999999999999986 5799
Q ss_pred eEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHH
Q psy15334 318 RMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIV 397 (451)
Q Consensus 318 r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v 397 (451)
|+|+|+||++|||||+|+++..||+|+..++......+.|||||||||+||||++|.+++++..|+.+ ..+||.++|
T Consensus 1026 R~V~hLQYtaWPDHg~P~D~~~FL~FleevrsvR~~t~pPilvHCSAGiGRTGVlIl~e~~l~lle~N---e~vdi~div 1102 (1144)
T KOG0792|consen 1026 RTVWHLQYTAWPDHGVPDDPNDFLDFLEEVRSVRRGTNPPILVHCSAGIGRTGVLILMETALCLLEHN---EPVDILDIV 1102 (1144)
T ss_pred eeeeeeeecccccCCCCCChHHHHHHHHHHHHHhccCCCCeEEEccCCCCcceehHHHHHHHHHHhcC---CCCCHHHHH
Confidence 99999999999999999999999999999987655445699999999999999999999999999988 899999999
Q ss_pred HHHHhhccCCCCCHHHHHHHHHHHHHHHhcCCCCc
Q psy15334 398 HGLRRQRVLMVQSLAQYAFIYECVQVYLQGGVRHE 432 (451)
Q Consensus 398 ~~lR~qR~~~Vqt~~QY~fiy~~l~~~l~~~~~~~ 432 (451)
+.||.||..||||..||.|||++|+.+++.....+
T Consensus 1103 r~mR~QR~~mVQT~~QYkFVyevil~~l~~~~l~~ 1137 (1144)
T KOG0792|consen 1103 RTMRDQRAMMVQTLSQYKFVYEVILRVLKRGRLIP 1137 (1144)
T ss_pred HHHHHHHhhhccchHHhhHHHHHHHHHHHhccccc
Confidence 99999999999999999999999999998765443
No 10
>smart00194 PTPc Protein tyrosine phosphatase, catalytic domain.
Probab=100.00 E-value=1.1e-67 Score=513.36 Aligned_cols=253 Identities=49% Similarity=0.871 Sum_probs=236.1
Q ss_pred hHHHhHHhhcCCCC-CcccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEeeeecCCCCCccEEEecCCCCC
Q psy15334 166 IAVEFEDLKSVGRG-QPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPLCS 244 (451)
Q Consensus 166 ~~~Ef~~l~~~~~~-~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs~i~g~~~~~~yI~tQ~Pl~~ 244 (451)
|.+||+.|.+.... .++..+..++|..||||.||+|||++||+|.+.++ +++||||||||+|+..+++||+||+|+++
T Consensus 2 ~~~e~~~i~~~~~~~~~~~~~~~~~n~~kNR~~~v~~~d~~RV~l~~~~~-~~~~YInAs~v~~~~~~~~fI~tQ~P~~~ 80 (258)
T smart00194 2 LEEEFEKLDRLKPDDESCTVAARPENRDKNRYKDVLPYDHTRVKLTPPPG-EGSDYINASYIDGPNGPKAYIATQGPLPS 80 (258)
T ss_pred hHHHHHHhhccCCCCccccccCCccccccCCCCCCCCCCCCEeEccCCCC-CCCCceeeeeEecCCCccceEEeCCCchH
Confidence 67899999876655 67888999999999999999999999999986443 57899999999999999999999999999
Q ss_pred CHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEeecceEEEEEEEEeCC--eeeEEEE
Q psy15334 245 TVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYGSWVLSELRMCRGG--EHRMISH 322 (451)
Q Consensus 245 T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~~~~~r~l~v~~~~--~~r~V~h 322 (451)
|++|||+||||+++.+||||+...|.++.+|.+|||.+.+.+.||.++|++.+.....+++.|+|.++..+ +.|.|+|
T Consensus 81 t~~dFW~mv~~~~~~~IVmL~~~~e~~~~~~~~Ywp~~~~~~~~g~~~V~~~~~~~~~~~~~~~l~v~~~~~~~~~~v~~ 160 (258)
T smart00194 81 TVEDFWRMVWEEKVTTIVMLTELEEKGREKCAQYWPEEEEPLTYGDITVTLKSVEKVDDYTIRTLEVTNTGCSETRTVTH 160 (258)
T ss_pred HHHHHHHHHHhcCCCEEEEeccceeCCEeeccccCCCCCCcceECCEEEEEEEEEecCCEEEEEEEEEECCCCCcEEEEE
Confidence 99999999999999999999999999999999999987767899999999999999999999999998754 8999999
Q ss_pred EEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHh
Q psy15334 323 YHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRR 402 (451)
Q Consensus 323 ~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~ 402 (451)
|||++|||+++|.++..+++|++.++.......+||+|||++|+||||+|||++++++++..+ +.+|++++++.||+
T Consensus 161 ~~y~~W~d~~~P~~~~~~~~~i~~v~~~~~~~~~pivVHC~~G~gRsg~f~a~~~~~~~l~~~---~~v~v~~~v~~lR~ 237 (258)
T smart00194 161 YHYTNWPDHGVPESPKSILDLVRAVRKSQSTSTGPIVVHCSAGVGRTGTFIAIDILLQQLEAG---KEVDIFEIVKELRS 237 (258)
T ss_pred EeeCCCCCCCCCCCHHHHHHHHHHHHHhhccCCCCEEEEeCCCCCccchhhHHHHHHHHHHHc---CCCCHHHHHHHHHh
Confidence 999999999999999999999999998765446899999999999999999999999999888 89999999999999
Q ss_pred hccCCCCCHHHHHHHHHHHH
Q psy15334 403 QRVLMVQSLAQYAFIYECVQ 422 (451)
Q Consensus 403 qR~~~Vqt~~QY~fiy~~l~ 422 (451)
||++||++.+||.|||++++
T Consensus 238 ~R~~~v~~~~Qy~f~~~~l~ 257 (258)
T smart00194 238 QRPGMVQTEEQYIFLYRAIL 257 (258)
T ss_pred ccccccCCHHHHHHHHHHHh
Confidence 99999999999999999986
No 11
>cd00047 PTPc Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
Probab=100.00 E-value=4e-61 Score=459.63 Aligned_cols=227 Identities=51% Similarity=0.959 Sum_probs=211.4
Q ss_pred CCCCCCCCCCCCCcceEecCCCCCCCCCcEEeeeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecc
Q psy15334 192 PKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKH 271 (451)
Q Consensus 192 ~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~ 271 (451)
.||||+|++|||++||+|.... .+++||||||||+|+..+++||+||+|+.+|++|||+||||+++.+||||+...|.+
T Consensus 1 ~knR~~~~~~~d~~rV~l~~~~-~~~~~yinA~~v~~~~~~~~fI~tQ~P~~~t~~~FW~mv~~~~~~~IV~l~~~~e~~ 79 (231)
T cd00047 1 KKNRYKDILPYDHTRVKLKPDD-DEGSDYINASYIDGYNPPKAYIATQGPLPNTVEDFWRMVWEQKVPVIVMLTELVEKG 79 (231)
T ss_pred CCCCCCCCCCCCCCEEEccCCC-CCCCCcEEeeeEecCCCCcceEEcCCCChhhHHHHHHHHHhcCCCEEEEccccccCC
Confidence 4899999999999999997632 256899999999999989999999999999999999999999999999999999999
Q ss_pred cccccccCCCCCCCceeeeEEEEEeeeEeecceEEEEEEEEeC--CeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHH
Q psy15334 272 RDKCYKYWPSDQHPMRCADIEISLQDVCHYGSWVLSELRMCRG--GEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRT 349 (451)
Q Consensus 272 ~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~~~~~r~l~v~~~--~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~ 349 (451)
..+|.+|||.+.+...||.++|++++.....+++.|+|.|+.. +..|.|+||||++|||+++|.+...+++|+..+++
T Consensus 80 ~~~~~~Ywp~~~~~~~~g~~~V~~~~~~~~~~~~~~~l~i~~~~~~~~~~V~~~~~~~W~d~~~p~~~~~~~~~~~~v~~ 159 (231)
T cd00047 80 REKCAQYWPEEEGSLTYGDITVTLVSEEKLDDYTVRTLKLSNTGTGETRTVTHFQYTGWPDHGVPESPDSLLDLLRKVRK 159 (231)
T ss_pred CccCccCCCCCCCCeEecCEEEEEEEEEEcCCEEEEEEEEEECCCCCceEEEEEeECCCCCCCccCChHHHHHHHHHHHH
Confidence 9999999998756789999999999999999999999999875 48999999999999999999999999999999987
Q ss_pred hcC-CCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHH
Q psy15334 350 HIS-PAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQ 422 (451)
Q Consensus 350 ~~~-~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~ 422 (451)
... ...+||+|||++|+||||+|||++++++++..+ +.+|+.++++.||+||++||+|.+||.|||++++
T Consensus 160 ~~~~~~~~pivVHC~~G~gRsg~~~a~~~~~~~~~~~---~~~~~~~~v~~iR~~R~~~v~~~~Qy~f~~~~l~ 230 (231)
T cd00047 160 SQQQPGSGPIVVHCSAGVGRTGTFIAIDILLQRLEAE---GVVDIFQTVKELRSQRPGMVQTEEQYIFLYRAIL 230 (231)
T ss_pred HhccCCCCCeEEECCCCCCccchHHHHHHHHHHHHhc---CCCCHHHHHHHHHhccccccCCHHHHHHHHHHHh
Confidence 643 336799999999999999999999999999887 8999999999999999999999999999999986
No 12
>KOG0793|consensus
Probab=100.00 E-value=2.4e-62 Score=498.05 Aligned_cols=259 Identities=36% Similarity=0.607 Sum_probs=235.3
Q ss_pred CCcchHHHhHHhhcCCC-CCcccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEeeeecCCC-CCccEEEec
Q psy15334 162 DPDWIAVEFEDLKSVGR-GQPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYL-SPRQYIATQ 239 (451)
Q Consensus 162 ~~~~~~~Ef~~l~~~~~-~~~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs~i~g~~-~~~~yI~tQ 239 (451)
+.+.+++|++.|..... ..++..+..++|..|||...++||||+||.|.-.....++||||||+|-..+ ..-.|||||
T Consensus 730 nk~rlekeWealc~YqaEpn~r~~aq~e~~a~kNR~~a~LpyDhsrV~l~~~~s~s~~dYiNAS~I~DhDPR~paYIAtQ 809 (1004)
T KOG0793|consen 730 NKNRLEKEWEALCAYQAEPNSRFVAQREENAPKNRSLAVLPYDHSRVLLKAENSHSHSDYINASPIMDHDPRNPAYIATQ 809 (1004)
T ss_pred CchHHHHHHHHHHHhhcCcchhhhhcccccccccCCCCccccccceeeeccccCccccccccccccccCCCCccceeecc
Confidence 34459999999976543 3457788899999999999999999999999987777889999999996532 345799999
Q ss_pred CCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEee-cceEEEEEEEE--eCCe
Q psy15334 240 GPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHY-GSWVLSELRMC--RGGE 316 (451)
Q Consensus 240 ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~-~~~~~r~l~v~--~~~~ 316 (451)
||+++|+.|||+||||+.|.+|||||.+.|+|...|.+|||+++.. .|+-+.|+++++... .||.+|.|-++ .+++
T Consensus 810 gPl~stiA~FWQmvWe~G~~vIV~Lt~l~Engv~qc~rYWPdeGse-lyhiyEV~LVSEHIWceDfLVRSFYLKNlqtse 888 (1004)
T KOG0793|consen 810 GPLPSTIADFWQMVWESGCVVIVMLTPLAENGVRQCYRYWPDEGSE-LYHIYEVNLVSEHIWCEDFLVRSFYLKNLQTSE 888 (1004)
T ss_pred CCCchHHHHHHHHHHHcCcEEEEEecChhhcchhhhhhcCCCCCcc-eeeeEEeeeehhhhhhhhHHHHHHHHhhccccc
Confidence 9999999999999999999999999999999999999999998764 599999999998774 67889988776 4789
Q ss_pred eeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHH
Q psy15334 317 HRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRI 396 (451)
Q Consensus 317 ~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~ 396 (451)
+|+|+||||.+||+.|+|.+..+|++|.++|+++......|||||||+|.|||||+|+||+++.+|.+.. +.+||..+
T Consensus 889 TRTvTQFHfLSWp~egvPasarslLdFRRKVNK~YRGRScpIiVH~sdGaGRTG~YiliDmvl~Rm~kGa--keIDIaAT 966 (1004)
T KOG0793|consen 889 TRTVTQFHFLSWPDEGVPASARSLLDFRRKVNKCYRGRSCPIIVHCSDGAGRTGTYILIDMVLNRMAKGA--KEIDIAAT 966 (1004)
T ss_pred ceeeeeeeeecccccCCccchHHHHHHHHHhhhhccCCCCceEEEccCCCCccceeeeHHHHHHHHhccc--hhhhHHHH
Confidence 9999999999999999999999999999999999887789999999999999999999999999998653 89999999
Q ss_pred HHHHHhhccCCCCCHHHHHHHHHHHHH
Q psy15334 397 VHGLRRQRVLMVQSLAQYAFIYECVQV 423 (451)
Q Consensus 397 v~~lR~qR~~~Vqt~~QY~fiy~~l~~ 423 (451)
+..||.||+|||+|.+||.|.+.||.+
T Consensus 967 lEHlRDQR~GmVaTkdQFef~l~aVAe 993 (1004)
T KOG0793|consen 967 LEHLRDQRPGMVATKDQFEFALTAVAE 993 (1004)
T ss_pred HHHHhhcCCcceeehhhhHHHHHHHHH
Confidence 999999999999999999999999985
No 13
>PF00102 Y_phosphatase: Protein-tyrosine phosphatase; InterPro: IPR000242 Protein tyrosine (pTyr) phosphorylation is a common post-translational modification which can create novel recognition motifs for protein interactions and cellular localisation, affect protein stability, and regulate enzyme activity. Consequently, maintaining an appropriate level of protein tyrosine phosphorylation is essential for many cellular functions. Tyrosine-specific protein phosphatases (PTPase; 3.1.3.48 from EC) catalyse the removal of a phosphate group attached to a tyrosine residue, using a cysteinyl-phosphate enzyme intermediate. These enzymes are key regulatory components in signal transduction pathways (such as the MAP kinase pathway) and cell cycle control, and are important in the control of cell growth, proliferation, differentiation and transformation [, ]. The PTP superfamily can be divided into four subfamilies []: (1) pTyr-specific phosphatases (2) dual specificity phosphatases (dTyr and dSer/dThr) (3) Cdc25 phosphatases (dTyr and/or dThr) (4) LMW (low molecular weight) phosphatases Based on their cellular localisation, PTPases are also classified as: Receptor-like, which are transmembrane receptors that contain PTPase domains [] Non-receptor (intracellular) PTPases [] All PTPases carry the highly conserved active site motif C(X)5R (PTP signature motif), employ a common catalytic mechanism, and share a similar core structure made of a central parallel beta-sheet with flanking alpha-helices containing a beta-loop-alpha-loop that encompasses the PTP signature motif []. Functional diversity between PTPases is endowed by regulatory domains and subunits. This entry repesents several receptor and non-receptor protein-tyrosine phosphatases. Structurally, all known receptor PTPases, are made up of a variable length extracellular domain, followed by a transmembrane region and a C-terminal catalytic cytoplasmic domain. Some of the receptor PTPases contain fibronectin type III (FN-III) repeats, immunoglobulin-like domains, MAM domains or carbonic anhydrase-like domains in their extracellular region. The cytoplasmic region generally contains two copies of the PTPase domain. The first seems to have enzymatic activity, while the second is inactive. The inactive domains of tandem phosphatases can be divided into two classes. Those which bind phosphorylated tyrosine residues may recruit multi-phosphorylated substrates for the adjacent active domains and are more conserved, while the other class have accumulated several variable amino acid substitutions and have a complete loss of tyrosine binding capability. The second class shows a release of evolutionary constraint for the sites around the catalytic centre, which emphasises a difference in function from the first group. There is a region of higher conservation common to both classes, suggesting a new regulatory centre []. PTPase domains consist of about 300 amino acids. There are two conserved cysteines, the second one has been shown to be absolutely required for activity. Furthermore, a number of conserved residues in its immediate vicinity have also been shown to be important.; GO: 0004725 protein tyrosine phosphatase activity, 0006470 protein dephosphorylation; PDB: 3O4T_A 3O4S_A 3O4U_A 2A3K_A 2QDP_A 2QDC_A 2QDM_A 2HVL_A 1ZC0_A 3D44_A ....
Probab=100.00 E-value=6.9e-59 Score=443.26 Aligned_cols=229 Identities=40% Similarity=0.804 Sum_probs=208.1
Q ss_pred CCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEeeeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEecccee
Q psy15334 190 NRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIE 269 (451)
Q Consensus 190 N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E 269 (451)
|+.|||+.|++|||++||+|.+. .+++||||||||+++...++||+||+|+++|++|||+||||++|.+||||+...|
T Consensus 1 n~~knR~~~~~~~d~~rv~l~~~--~~~~~yi~A~~v~~~~~~~~~I~~q~P~~~t~~~FW~mv~~~~~~~Iv~L~~~~e 78 (235)
T PF00102_consen 1 NREKNRSPDILPYDDSRVKLNPS--PGDSDYINASYVDGYKNGKKFIVTQAPMPDTIEDFWQMVWEQKVQIIVMLCSFDE 78 (235)
T ss_dssp GGGGSSSTTS-BBTTTEEETTTT--TTSTTEESEEEEEESSSTEEEEEEES-SGGGHHHHHHHHHHTTBSEEEEESBSEE
T ss_pred CCccCCCCCCCcCcceEEEecCC--CCccchhhhhhhccccchhhheeecccccccccceehheeeccccceeccccccc
Confidence 78999999999999999999542 2336799999999998899999999999999999999999999999999999999
Q ss_pred cccccccccCC-CCCCCceeeeEEEEEe-eeE-eecceEEEEEEEEeCCe---eeEEEEEEeecCCCCCCCCChhHHHHH
Q psy15334 270 KHRDKCYKYWP-SDQHPMRCADIEISLQ-DVC-HYGSWVLSELRMCRGGE---HRMISHYHFTTWPDSGVPSSPNTLVSF 343 (451)
Q Consensus 270 ~~~~kc~~YwP-~~~~~~~~g~~~V~~~-~~~-~~~~~~~r~l~v~~~~~---~r~V~h~~~~~Wpd~gvP~~~~~ll~f 343 (451)
.+..+|.+||| ..+..+.+|+++|++. +.. ....+++|+|.++..+. .+.++||||..||++++|.+...+++|
T Consensus 79 ~~~~~~~~y~P~~~~~~~~~g~~~V~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~v~~~~~~~W~~~~~P~~~~~~~~~ 158 (235)
T PF00102_consen 79 SGDEKCDQYWPLKEGESLKFGDYTVKCKEKIEENSKHYTVRKLEVTNGKSKKKSRTVTHFHYTNWPDDGVPPSPESFLDF 158 (235)
T ss_dssp TTEESS--TSTSSSSSEEEETTEEEEEEEEEEEEESSEEEEEEEEEETTTTEEEEEEEEEEEESSSSSSSGSSSHHHHHH
T ss_pred ccccccccccccccccccccccccccccccccccccceEEeeccccccccccccccccceeeeeccccccccccchhhhh
Confidence 99999999999 5567889999999999 666 67899999999987543 589999999999999999999999999
Q ss_pred HHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHH
Q psy15334 344 VLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQV 423 (451)
Q Consensus 344 i~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~ 423 (451)
++.+++......+|++|||++|+||||+||+++++++++..+ +.+||.++++.||++|+++|++.+||.|||+++++
T Consensus 159 ~~~v~~~~~~~~~pivVhc~~G~gRsg~f~~~~~~~~~~~~~---~~~~v~~~~~~lR~~R~~~i~~~~qy~f~~~~~~e 235 (235)
T PF00102_consen 159 IRKVNKSKDDPNGPIVVHCSDGVGRSGTFCAIDILIEQLKKE---GEVDVFEIVKKLRQQRPGAIQSPEQYRFCYMAVLE 235 (235)
T ss_dssp HHHHHHHHSTTSSEEEEESSSSSHHHHHHHHHHHHHHHHHHH---SEECHHHHHHHHHTTSTTSSSSHHHHHHHHHHHHH
T ss_pred hhhccccccCCccceEeecccccccccccccchhhccccccc---cchhhHHHHHHHHhhCCCccCCHHHHHHHHHHHhC
Confidence 999998876667899999999999999999999999999998 88999999999999999999999999999999986
No 14
>COG5599 PTP2 Protein tyrosine phosphatase [Signal transduction mechanisms]
Probab=100.00 E-value=2.3e-59 Score=433.30 Aligned_cols=238 Identities=37% Similarity=0.638 Sum_probs=196.5
Q ss_pred ccccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEeeeecCCCCCccEEEecCCCCCCHHHHHhHhhhccC--C
Q psy15334 182 TTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRT--P 259 (451)
Q Consensus 182 ~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v--~ 259 (451)
+.....+.|-.||||.||+||+++||+|... ...+||||||||+... .+|||||||.++|+.|||+|||+... .
T Consensus 45 ~~~sls~~~~~rNRYsnIvPYe~tRV~Lk~~--k~~~DYINAS~ik~~~--~~yIAtQgP~~~t~ddFW~mvw~n~~~~g 120 (302)
T COG5599 45 DDTSLSSTNYARNRYSNIVPYEHTRVHLKYG--KSINDYINASYIKTPR--GKYIATQGPKPETIDDFWKMVWHNVPNNG 120 (302)
T ss_pred cccccCccccccccccccccccCceeeccCC--CchhhcccceeeecCC--CceEEecCCCCchHHHHHHHHHhcCCCce
Confidence 3445667899999999999999999999875 4568999999999855 45999999999999999999999998 9
Q ss_pred eEEEeccceecccccccccCCCC-CCCceeeeEEEEEeeeEeec--ceEEEEEEEEe-CCeeeEEEEEEeecCCCCCCCC
Q psy15334 260 AIVMLTQCIEKHRDKCYKYWPSD-QHPMRCADIEISLQDVCHYG--SWVLSELRMCR-GGEHRMISHYHFTTWPDSGVPS 335 (451)
Q Consensus 260 ~IVmLt~~~E~~~~kc~~YwP~~-~~~~~~g~~~V~~~~~~~~~--~~~~r~l~v~~-~~~~r~V~h~~~~~Wpd~gvP~ 335 (451)
+||||+++.|.+++||.+|||.. ++...+|...+.-++.+... .+.++.++++. ++..+.|+||+|.+|+|.++|
T Consensus 121 vIVmLt~l~E~~rekc~qYWp~~~~~~~~~G~~v~~~~~~e~~~d~~~~~~~f~L~~~~~~~k~Ihhf~y~nW~D~~~p- 199 (302)
T COG5599 121 VIVMLTSLPEYNREKCDQYWPLGYDDTLIIGLRVIKQKKYELFNDNIVNVHNFELTSINGPPKKIHHFQYINWVDFNVP- 199 (302)
T ss_pred EEEEecChHHHhHHHHhhhCCCCcCcceeeeeEEEEEecccccccceeeeeecccccCCCCccEEEEEEecCccccCCc-
Confidence 99999999999999999999942 34567886555555444433 35678888874 578999999999999999999
Q ss_pred ChhHHHHHHHHHHHhcCC-CCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccC-C----CCCC-HHHHHHHHHhhccCCC
Q psy15334 336 SPNTLVSFVLSFRTHISP-AQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEG-A----GSVD-ILRIVHGLRRQRVLMV 408 (451)
Q Consensus 336 ~~~~ll~fi~~v~~~~~~-~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~-~----~~vd-i~~~v~~lR~qR~~~V 408 (451)
+...+.++++.+... + ..+|++||||||+||||||+|+|.++++...... . ...| |+++|..||+||+.||
T Consensus 200 ~i~sl~~~~~sl~~s--p~~t~piiVHCSAGvGRTGTFIalD~ll~~~~~~~~~t~~~~~t~D~if~iV~~LRsQRmkmV 277 (302)
T COG5599 200 DIRSLTEVIHSLNDS--PVRTGPIIVHCSAGVGRTGTFIALDILLRMPNDTLNHTDTWEDTQDLIFQIVLSLRSQRMKMV 277 (302)
T ss_pred CHHHHHHHHHHhhcC--cCCCCCEEEEeccCCCCcceeeeHHHHHhccccccCCCchhhhhhhHHHHHHHHHHHHHHHHH
Confidence 566666666665432 2 5789999999999999999999999987643210 0 1123 8999999999999999
Q ss_pred CCHHHHHHHHHHHHHHHh
Q psy15334 409 QSLAQYAFIYECVQVYLQ 426 (451)
Q Consensus 409 qt~~QY~fiy~~l~~~l~ 426 (451)
|+.+||.|||++++++.+
T Consensus 278 Qn~~Qf~flY~~~~~l~~ 295 (302)
T COG5599 278 QNKTQFKFLYDAFLELNK 295 (302)
T ss_pred HhHHHHHHHHHHHHHHHH
Confidence 999999999999999873
No 15
>KOG4228|consensus
Probab=100.00 E-value=5.3e-59 Score=500.16 Aligned_cols=277 Identities=36% Similarity=0.653 Sum_probs=241.8
Q ss_pred cccCCCcceeccchh----hhcc-CC--c--chHHHhHHhhcCCCCC-cccccccccCCCCCCCCCCCCCCCcceEecCC
Q psy15334 143 TLHNCSHPIPVTLFC----TMTL-DP--D--WIAVEFEDLKSVGRGQ-PTTAAELRCNRPKNRYTNIFPYDSSRYKLQPV 212 (451)
Q Consensus 143 ~~~~~~~~i~~~~f~----~~~~-~~--~--~~~~Ef~~l~~~~~~~-~~~~~~~~~N~~KNR~~~i~pyD~sRV~L~~~ 212 (451)
....+.++|++.+|. .+.. ++ + .+++||+.|....... ....+.+++|+.|||+.+|+|||++||.|.++
T Consensus 796 ~~~~G~T~i~~~~~~~~~~~l~~~~p~~~~t~le~EF~~L~~~~~~~~~~~~~~l~~N~~KNR~~~i~P~d~~rv~L~~~ 875 (1087)
T KOG4228|consen 796 ACLCGDTEVPASTLAPYSQKLKRIDPPENKTGLEEEFKTLNSCKPRTRMMICGNLPENKSKNRQVNILPYDRNRVILIPT 875 (1087)
T ss_pred HHhcCCccccHHHHHHHHHHhccCCCCCCCcchHHHHHHHhhccccchhhhccccchhcccccccccCCchhcccceecc
Confidence 345788999999992 2222 32 2 2999999998765432 22348999999999999999999999999999
Q ss_pred CCCCCCCcEEeeeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEE
Q psy15334 213 DDEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIE 292 (451)
Q Consensus 213 ~~~~~~dYINAs~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~ 292 (451)
.|. ++||||||+||||..+..|||||+||.+|++||||||||++|.+||||++..+. .+|.||||.++ ...||.|.
T Consensus 876 ~G~-~sdYINAs~idgy~~~~~fivtq~PL~~T~~DFWrmi~d~~~tsiVmL~~l~~~--~~C~qyw~~~g-~~~yg~i~ 951 (1087)
T KOG4228|consen 876 HGE-SSDYINASFIDGYRQPKAFIVTQGPLAETVEDFWRMIWDQNVTSIVMLTELKHP--EKCPQYWPPEG-SQRYGPIE 951 (1087)
T ss_pred CCC-cccccchhhhcccCCcceEEEecCCcccchHHHHHHhhccceeEEEEecccCcc--cccccccCCcC-ceecCcEE
Confidence 887 899999999999999999999999999999999999999999999999998776 89999999854 57899999
Q ss_pred EEEeeeEeecceEEEEEEEEe--CCeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHH-HhcCC-CCCCEEEEcCCCCch
Q psy15334 293 ISLQDVCHYGSWVLSELRMCR--GGEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFR-THISP-AQHPIVVHCSAGVGR 368 (451)
Q Consensus 293 V~~~~~~~~~~~~~r~l~v~~--~~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~-~~~~~-~~~PivVHCs~GvGR 368 (451)
|+.++......++.|+|.+++ .+..|+|+||||++||..++|+.....+.++-.+. +.... ..+|++|||++|+||
T Consensus 952 Ve~~~~~~~~~~t~r~f~i~n~~~~~~r~v~qfq~~~WP~~~~~p~~~~~~~~i~~~~~~~q~~~~~~P~~Vhc~nG~~r 1031 (1087)
T KOG4228|consen 952 VEDMNEHINPQYTAREFGVTNEREKQSRTVRQFQFTGWPEYGKPPQSKGPISKIPSVASKWQQLGADGPIIVHCLNGVGR 1031 (1087)
T ss_pred EEecccccchhhhhhhheeeeccccCceEEEEEEecCCcccCcCCCCcchhhhHHHHHHHHHhhcCCCCEEEEEcCCCcc
Confidence 999999999999999999986 46799999999999999888876666655554443 32222 278999999999999
Q ss_pred hhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHHHHHHh
Q psy15334 369 TGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQVYLQ 426 (451)
Q Consensus 369 tG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l~~~l~ 426 (451)
||+|||+.+++++++.+ +.||||++|+.||.+|++||++.+||.|||+++++|+.
T Consensus 1032 sg~f~ai~~l~e~~~~e---~~vDVfq~vk~Lr~~rp~mv~t~~QY~fcYdv~~~y~~ 1086 (1087)
T KOG4228|consen 1032 TGTFCAISILLERMRKE---GVVDVFQTVKTLRFQRPGMVDTSDQYQFCYDVALEYLG 1086 (1087)
T ss_pred eeehHHHHHHHHHHhhc---CceeeehhhhhhhhcCccccCcHHHHHHHHHHHHHhhc
Confidence 99999999999999999 99999999999999999999999999999999999985
No 16
>PRK15375 pathogenicity island 1 effector protein StpP; Provisional
Probab=100.00 E-value=7.4e-53 Score=425.30 Aligned_cols=215 Identities=19% Similarity=0.258 Sum_probs=183.9
Q ss_pred CCCCCCCCCCcceEecCCCCCCCCCcEEeeeecCCCCCccEEEecCCCC--CCHHHHHhHhhhccCCeEEEeccceeccc
Q psy15334 195 RYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPLC--STVDDFWRMCWESRTPAIVMLTQCIEKHR 272 (451)
Q Consensus 195 R~~~i~pyD~sRV~L~~~~~~~~~dYINAs~i~g~~~~~~yI~tQ~Pl~--~T~~dFWrMVwe~~v~~IVmLt~~~E~~~ 272 (451)
--.+|+|+|||||+|. +++|||||||+ +..++.+|++|.|+. +|++||||||||++|.+|||||+..|.++
T Consensus 303 ~~~~I~~~~~TrV~ls------~g~yI~ANyI~-~~~k~~aIa~QYP~nt~eTieDFWrMVWEn~~~VIVMLT~l~E~g~ 375 (535)
T PRK15375 303 AVSSIPINQQTQVKLS------DGMPVPVNTLT-FDGKPVALAGSYPKNTPDALEAHMKMLLEKECSCLVVLTSEDQMQA 375 (535)
T ss_pred HhcccCccccceEEcC------CCCccccceee-ecCccceEeccCCCCchhhHHHHHHHHHHcCCCEEEEeeccccCcc
Confidence 3479999999999995 27999999999 888889999999976 89999999999999999999999999999
Q ss_pred ccccccCCCCCCCceeeeEEEEEeeeEee-cceEEEEEEE--EeCCeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHH
Q psy15334 273 DKCYKYWPSDQHPMRCADIEISLQDVCHY-GSWVLSELRM--CRGGEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRT 349 (451)
Q Consensus 273 ~kc~~YwP~~~~~~~~g~~~V~~~~~~~~-~~~~~r~l~v--~~~~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~ 349 (451)
.+|.+||+ ++..||+|+|++.+++.. .++.+|.+.+ ...+..+.|+||||++|||||+|+++..++.|++.|+.
T Consensus 376 ~KC~pYW~---~s~tYGdItVts~seesls~g~iIR~f~L~Ik~~E~R~tV~QFHyTnWPDHGVPpST~~LleLvr~Vr~ 452 (535)
T PRK15375 376 KQLPPYFR---GSYTFGEVHTNSQKVSSASQGEAIDQYNMQLSCGEKRYTIPVLHVKNWPDHQPLPSTDQLEYLADRVKN 452 (535)
T ss_pred cccCccCC---CcceeccEEEEEEEEEeccCCceEEEEEEEEecCCceeEEEEEEeCCCCCCCCCCChHHHHHHHHHHHH
Confidence 99999996 356899999999988765 4577776644 44445568999999999999999988889999999876
Q ss_pred hcCC--------CCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccC-CCCCHHHHHHHHHH
Q psy15334 350 HISP--------AQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVL-MVQSLAQYAFIYEC 420 (451)
Q Consensus 350 ~~~~--------~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~-~Vqt~~QY~fiy~~ 420 (451)
.... .++++|||||||+||||+|||++. +..+ +.+|+.++|+.||.+|++ |||+.+||.|++..
T Consensus 453 ~~q~~~~~~~~~nk~~PVVHCSAGVGRTGTFIAi~l----lk~~---~~~sle~IV~dlR~qRng~MVQt~eQy~~l~~~ 525 (535)
T PRK15375 453 SNQNGAPGRSSSDKHLPMIHCLGGVGRTGTMAAALV----LKDN---PHSNLEQVRADFRNSRNNRMLEDASQFVQLKAM 525 (535)
T ss_pred hhhcccccccccCCCCceEEcCCCCchHHHHHHHHH----Hhcc---ccCCHHHHHHHHHhcCCccccccHHHHHHHHHH
Confidence 5311 123448999999999999999975 3434 678999999999999999 99999999999988
Q ss_pred HHHHHh
Q psy15334 421 VQVYLQ 426 (451)
Q Consensus 421 l~~~l~ 426 (451)
-...|.
T Consensus 526 ~~~~~~ 531 (535)
T PRK15375 526 QAQLLM 531 (535)
T ss_pred HHHHhh
Confidence 776654
No 17
>KOG0789|consensus
Probab=100.00 E-value=4.5e-51 Score=422.45 Aligned_cols=243 Identities=32% Similarity=0.527 Sum_probs=196.4
Q ss_pred ccccccCCCCCCCCCCCCCCCcceEecCCCCCCCCCcEEeeeecCCCCCccEEEecCCCCCCHHHHHhHhhhccCCeEEE
Q psy15334 184 AAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVM 263 (451)
Q Consensus 184 ~~~~~~N~~KNR~~~i~pyD~sRV~L~~~~~~~~~dYINAs~i~g~~~~~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVm 263 (451)
......|..||||.||.|||++||+|.... ..||||||||+++...+.||+||||+++|.+|||+||||+++..|||
T Consensus 121 ~~~~~~~~~kNr~~dv~~~d~~rV~l~~~~---~~dyI~A~~v~~~~~~~~~I~tQ~Pl~~t~~dFW~mv~~~k~~~Ivm 197 (415)
T KOG0789|consen 121 CDAFLKNMDKNRYSDVQCLDHTRVKLKPPQ---DKDYIHASKIYFPKGPRTAILTQAPLSKTTEDFWRMVFEEKVESVVL 197 (415)
T ss_pred hhhhhccccccccCCcCcccCCeeecCCCC---CcccchhhccCCcCCCcCceecCCCCcccHHHHHHHHHHcCCCEEEE
Confidence 344567888999999999999999997532 67999999999976448999999999999999999999999999999
Q ss_pred eccceecccccccccCCCCC-CCceeeeEEEEEeee---Eee---cceEEEEEEE-EeCCe--eeEEEEEEeecCCCCCC
Q psy15334 264 LTQCIEKHRDKCYKYWPSDQ-HPMRCADIEISLQDV---CHY---GSWVLSELRM-CRGGE--HRMISHYHFTTWPDSGV 333 (451)
Q Consensus 264 Lt~~~E~~~~kc~~YwP~~~-~~~~~g~~~V~~~~~---~~~---~~~~~r~l~v-~~~~~--~r~V~h~~~~~Wpd~gv 333 (451)
||+..|.|+.+|.+|||.+. ....||.+...+... ... ..+....+.+ ...+. .+.|.|+||++|||+|+
T Consensus 198 l~~~~E~~~~kc~~Y~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~WPd~~~ 277 (415)
T KOG0789|consen 198 LCSDEELGAAECASYFPSKRGVKLTFLEFGKKFVENRKVEMKQQDQTSTVGLLLLVLPEGLEISSSVVHYHYINWPDHGA 277 (415)
T ss_pred ecchhhccHHHHhhcCcccCCCeeeecccceeEEEeeeeecccccccceeeEEEeecCCCccccceEEEEeeCCCccccC
Confidence 99999999999999999875 455677664433332 221 2222222222 22333 46999999999999999
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHH
Q psy15334 334 PSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQ 413 (451)
Q Consensus 334 P~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~Q 413 (451)
|.+...++.+++.-.....+..+|++||||+|+||||+|+++++++.++.... ...++..+++.+|.||+++|||..|
T Consensus 278 p~~~~~~l~~~~~~~~~~~~~~~P~vVhcsaG~gRtgt~v~~~~~~~~~~~~~--~~~~~~~~~~~iR~qR~~~vqt~~Q 355 (415)
T KOG0789|consen 278 PDSVKSILPLLRQSVLELRPKQEPIEVHCSAGAGRAGTLVLIEHALIELQGPE--GEPPIDEILREIRYQRPGAVQSPLQ 355 (415)
T ss_pred CcchHHHHHHHHhhhhhhcCCCCCeEEECCCCCCccchHHHHHHHHHHHhcCC--CCccHHHHHHHHHHHhhhcccchhH
Confidence 99999999998751112233468999999999999999999998888876432 5678999999999999999999999
Q ss_pred HHHHHHHHHHHHhcCCCC
Q psy15334 414 YAFIYECVQVYLQGGVRH 431 (451)
Q Consensus 414 Y~fiy~~l~~~l~~~~~~ 431 (451)
|.|+|+++++|+......
T Consensus 356 y~f~~~~~~~~~~~~~~~ 373 (415)
T KOG0789|consen 356 YLFIYAATLKYIKKVSYW 373 (415)
T ss_pred HHHHHHHHHHHHHHccCC
Confidence 999999999998874433
No 18
>smart00012 PTPc_DSPc Protein tyrosine phosphatase, catalytic domain, undefined specificity. Protein tyrosine phosphatases. Homologues detected by this profile and not by those of "PTPc" or "DSPc" are predicted to be protein phosphatases with a similar fold to DSPs and PTPs, yet with unpredicted specificities.
Probab=99.90 E-value=1.9e-23 Score=173.69 Aligned_cols=102 Identities=49% Similarity=0.954 Sum_probs=92.0
Q ss_pred EEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcC--CCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHH
Q psy15334 319 MISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHIS--PAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRI 396 (451)
Q Consensus 319 ~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~--~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~ 396 (451)
.+.||||++|||+++|.....+++|+..+++... +.++||+|||++|+||||+||+++.++.++..+. +.+|+.++
T Consensus 1 ~~~~~~~~~Wpd~~~P~~~~~~~~~~~~v~~~~~~~~~~~pvlVHC~~G~gRtg~~~~~~~~~~~~~~~~--~~~~~~~~ 78 (105)
T smart00012 1 TVKHYHYTGWPDHGVPESPDSILEFLRAVKKNLNQSSSSGPVVVHCSAGVGRTGTFVALDILLQQLESET--GEVDIFQT 78 (105)
T ss_pred CeEEEeeCCCCCCCCCCCHHHHHHHHHHHHHHHHhcCCCCCEEEEeCCCCChhhHHHHHHHHHHHHHhcC--CCCCHHHH
Confidence 3789999999999999998999999999887643 2367999999999999999999999999887542 57999999
Q ss_pred HHHHHhhccCCCCCHHHHHHHHHHHH
Q psy15334 397 VHGLRRQRVLMVQSLAQYAFIYECVQ 422 (451)
Q Consensus 397 v~~lR~qR~~~Vqt~~QY~fiy~~l~ 422 (451)
++.+|++|++++++.+||.|+|.+++
T Consensus 79 ~~~ir~~r~~~~~~~~q~~~~~~~~~ 104 (105)
T smart00012 79 VKELRKQRPGMVQTFEQYLFLYRALL 104 (105)
T ss_pred HHHHHhhhhhhCCcHHHHHHHHHHHh
Confidence 99999999999999999999999986
No 19
>smart00404 PTPc_motif Protein tyrosine phosphatase, catalytic domain motif.
Probab=99.90 E-value=1.9e-23 Score=173.69 Aligned_cols=102 Identities=49% Similarity=0.954 Sum_probs=92.0
Q ss_pred EEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcC--CCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHH
Q psy15334 319 MISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHIS--PAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRI 396 (451)
Q Consensus 319 ~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~--~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~ 396 (451)
.+.||||++|||+++|.....+++|+..+++... +.++||+|||++|+||||+||+++.++.++..+. +.+|+.++
T Consensus 1 ~~~~~~~~~Wpd~~~P~~~~~~~~~~~~v~~~~~~~~~~~pvlVHC~~G~gRtg~~~~~~~~~~~~~~~~--~~~~~~~~ 78 (105)
T smart00404 1 TVKHYHYTGWPDHGVPESPDSILEFLRAVKKNLNQSSSSGPVVVHCSAGVGRTGTFVALDILLQQLESET--GEVDIFQT 78 (105)
T ss_pred CeEEEeeCCCCCCCCCCCHHHHHHHHHHHHHHHHhcCCCCCEEEEeCCCCChhhHHHHHHHHHHHHHhcC--CCCCHHHH
Confidence 3789999999999999998999999999887643 2367999999999999999999999999887542 57999999
Q ss_pred HHHHHhhccCCCCCHHHHHHHHHHHH
Q psy15334 397 VHGLRRQRVLMVQSLAQYAFIYECVQ 422 (451)
Q Consensus 397 v~~lR~qR~~~Vqt~~QY~fiy~~l~ 422 (451)
++.+|++|++++++.+||.|+|.+++
T Consensus 79 ~~~ir~~r~~~~~~~~q~~~~~~~~~ 104 (105)
T smart00404 79 VKELRKQRPGMVQTFEQYLFLYRALL 104 (105)
T ss_pred HHHHHhhhhhhCCcHHHHHHHHHHHh
Confidence 99999999999999999999999986
No 20
>PTZ00242 protein tyrosine phosphatase; Provisional
Probab=99.66 E-value=5.2e-16 Score=140.52 Aligned_cols=135 Identities=19% Similarity=0.287 Sum_probs=99.2
Q ss_pred ccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEeecceEEEEEEEE
Q psy15334 233 RQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYGSWVLSELRMC 312 (451)
Q Consensus 233 ~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~~~~~r~l~v~ 312 (451)
-+||++-+|+++++.+|....-+.++..||.++. ++|+.+
T Consensus 16 ~r~~~~~~P~~~~~~~~l~~L~~~gI~~Iv~l~~----------~~~~~~------------------------------ 55 (166)
T PTZ00242 16 FKFLILDAPSPSNLPLYIKELQRYNVTHLVRVCG----------PTYDAE------------------------------ 55 (166)
T ss_pred eEEEEecCCCcccHHHHHHHHHhCCCeEEEecCC----------CCCCHH------------------------------
Confidence 3799999999999999999999999999999864 122111
Q ss_pred eCCeeeEEEEEEeecCC--CCCCCCChh---HHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccC
Q psy15334 313 RGGEHRMISHYHFTTWP--DSGVPSSPN---TLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEG 387 (451)
Q Consensus 313 ~~~~~r~V~h~~~~~Wp--d~gvP~~~~---~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~ 387 (451)
.....+++|..|| |+++| +.. .+++++..+........+||+|||.+|+||||+||++..+ +.
T Consensus 56 ----~~~~~gi~~~~~p~~D~~~P-~~~~i~~~~~~i~~~~~~~~~~g~~V~VHC~aGigRSgt~~a~yL~----~~--- 123 (166)
T PTZ00242 56 ----LLEKNGIEVHDWPFDDGAPP-PKAVIDNWLRLLDQEFAKQSTPPETIAVHCVAGLGRAPILVALALV----EY--- 123 (166)
T ss_pred ----HHHHCCCEEEecCCCCCCCC-CHHHHHHHHHHHHHHHHhhccCCCeEEEECCCCCCHHHHHHHHHHH----Hh---
Confidence 0112234455566 65655 444 4455555544332335789999999999999999998732 22
Q ss_pred CCCCCHHHHHHHHHhhccCCCCCHHHHHHHHHHH
Q psy15334 388 AGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECV 421 (451)
Q Consensus 388 ~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy~~l 421 (451)
+.+++.++++.+|++|++++ +..|+.|+.+..
T Consensus 124 -~~~s~~eAi~~vr~~R~~~i-~~~Q~~~l~~~~ 155 (166)
T PTZ00242 124 -GGMEPLDAVGFVREKRKGAI-NQTQLQFLKKYK 155 (166)
T ss_pred -CCCCHHHHHHHHHHHCCCCc-hHHHHHHHHHHH
Confidence 44699999999999999999 589999987654
No 21
>PTZ00393 protein tyrosine phosphatase; Provisional
Probab=99.46 E-value=3.8e-13 Score=126.36 Aligned_cols=135 Identities=15% Similarity=0.265 Sum_probs=98.4
Q ss_pred ccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEeecceEEEEEEEE
Q psy15334 233 RQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYGSWVLSELRMC 312 (451)
Q Consensus 233 ~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~~~~~r~l~v~ 312 (451)
-+||++-+|+..++.+|..-..+.++..||-+|. .+.+.+ .+
T Consensus 92 ~rfLi~~~P~~~~~~~yl~eLk~~gV~~lVrlcE----------~~Yd~~--~~-------------------------- 133 (241)
T PTZ00393 92 IKILILDAPTNDLLPLYIKEMKNYNVTDLVRTCE----------RTYNDG--EI-------------------------- 133 (241)
T ss_pred eeEEEeCCCCHHHHHHHHHHHHHcCCCEEEECCC----------CCCCHH--HH--------------------------
Confidence 4899999999999999999999999999999864 111211 00
Q ss_pred eCCeeeEEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCC
Q psy15334 313 RGGEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVD 392 (451)
Q Consensus 313 ~~~~~r~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vd 392 (451)
+..-+.+++ ..|||+++|+. ..+.+|+..+..... ..++|+|||.+|+||||+++|+..+ . . + .+
T Consensus 134 ---~~~GI~~~~-lpipDg~aPs~-~~i~~~l~~i~~~l~-~g~~VaVHC~AGlGRTGtl~AayLI----~-~---G-ms 198 (241)
T PTZ00393 134 ---TSAGINVHE-LIFPDGDAPTV-DIVSNWLTIVNNVIK-NNRAVAVHCVAGLGRAPVLASIVLI----E-F---G-MD 198 (241)
T ss_pred ---HHcCCeEEE-eecCCCCCCCH-HHHHHHHHHHHHHHh-cCCeEEEECCCCCCHHHHHHHHHHH----H-c---C-CC
Confidence 011123333 46899998863 444455555544332 3578999999999999999987643 2 1 2 58
Q ss_pred HHHHHHHHHhhccCCCCCHHHHHHHHHHH
Q psy15334 393 ILRIVHGLRRQRVLMVQSLAQYAFIYECV 421 (451)
Q Consensus 393 i~~~v~~lR~qR~~~Vqt~~QY~fiy~~l 421 (451)
..+++..+|+.|++++ +..|+.|+.+.-
T Consensus 199 peeAI~~VR~~RPgAI-n~~Q~~fL~~y~ 226 (241)
T PTZ00393 199 PIDAIVFIRDRRKGAI-NKRQLQFLKAYK 226 (241)
T ss_pred HHHHHHHHHHHCCCCC-CHHHHHHHHHHH
Confidence 8899999999999999 578999977553
No 22
>KOG1720|consensus
Probab=99.31 E-value=2.1e-11 Score=111.02 Aligned_cols=97 Identities=26% Similarity=0.338 Sum_probs=77.5
Q ss_pred EEEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHH
Q psy15334 321 SHYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGL 400 (451)
Q Consensus 321 ~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~l 400 (451)
.|+++ -.||.+.| ++..+.+|++.+..... .|-|.|||-+|.||||++||..+|.. -.+.--+++..|
T Consensus 117 ~h~~l-~f~Dg~tP-~~~~v~~fv~i~e~~~~--~g~iaVHCkaGlGRTG~liAc~lmy~--------~g~ta~eaI~~l 184 (225)
T KOG1720|consen 117 DHHDL-FFADGSTP-TDAIVKEFVKIVENAEK--GGKIAVHCKAGLGRTGTLIACYLMYE--------YGMTAGEAIAWL 184 (225)
T ss_pred eeeee-ecCCCCCC-CHHHHHHHHHHHHHHHh--cCeEEEEeccCCCchhHHHHHHHHHH--------hCCCHHHHHHHH
Confidence 34443 24788877 46888999998887654 67899999999999999999987764 235567999999
Q ss_pred HhhccCCCCCHHHHHHHHHHHHHHHhcCC
Q psy15334 401 RRQRVLMVQSLAQYAFIYECVQVYLQGGV 429 (451)
Q Consensus 401 R~qR~~~Vqt~~QY~fiy~~l~~~l~~~~ 429 (451)
|..|||||++.+|+..+++-..-++.+..
T Consensus 185 R~~RpG~V~gpqQ~~l~~~q~~~~~~~~~ 213 (225)
T KOG1720|consen 185 RICRPGAVIGPQQHKLLHKQRDLWLAGDL 213 (225)
T ss_pred HhcCCccccCHHHHHHHHHHHHHHhhhcc
Confidence 99999999999999999987765665543
No 23
>KOG2836|consensus
Probab=99.16 E-value=5.3e-10 Score=94.56 Aligned_cols=131 Identities=21% Similarity=0.399 Sum_probs=95.2
Q ss_pred ccEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEeecceEEEEEEEE
Q psy15334 233 RQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYGSWVLSELRMC 312 (451)
Q Consensus 233 ~~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~~~~~r~l~v~ 312 (451)
-+|.+|..|...|+..|-.-.-.+++.++|-.|... | +..++.-.
T Consensus 17 MrFLIThnPtnaTln~fieELkKygvttvVRVCe~T---------Y---dt~~lek~----------------------- 61 (173)
T KOG2836|consen 17 MRFLITHNPTNATLNKFIEELKKYGVTTVVRVCEPT---------Y---DTTPLEKE----------------------- 61 (173)
T ss_pred eEEEEecCCCchhHHHHHHHHHhcCCeEEEEecccc---------c---CCchhhhc-----------------------
Confidence 479999999999999999999999999999998521 1 11111100
Q ss_pred eCCeeeEEEEEEeecCC-CCCCCCChhHHHHHHHHHHHhcC-CCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCC
Q psy15334 313 RGGEHRMISHYHFTTWP-DSGVPSSPNTLVSFVLSFRTHIS-PAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGS 390 (451)
Q Consensus 313 ~~~~~r~V~h~~~~~Wp-d~gvP~~~~~ll~fi~~v~~~~~-~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~ 390 (451)
-++..+|| |.|.|+..+-+-+.+..+..... ....-|-|||-||+||+-+++|+..+-.-|+.+
T Consensus 62 ---------GI~Vldw~f~dg~ppp~qvv~~w~~l~~~~f~e~p~~cvavhcvaglgrapvlvalalie~gmkye----- 127 (173)
T KOG2836|consen 62 ---------GITVLDWPFDDGAPPPNQVVDDWLSLVKTKFREEPGCCVAVHCVAGLGRAPVLVALALIEAGMKYE----- 127 (173)
T ss_pred ---------CceEeecccccCCCCchHHHHHHHHHHHHHHhhCCCCeEEEEeecccCcchHHHHHHHHHccccHH-----
Confidence 12345798 88988776655555555554332 224569999999999999999987665555444
Q ss_pred CCHHHHHHHHHhhccCCCCCHHHHHHH
Q psy15334 391 VDILRIVHGLRRQRVLMVQSLAQYAFI 417 (451)
Q Consensus 391 vdi~~~v~~lR~qR~~~Vqt~~QY~fi 417 (451)
++|..+|++|.|++. ..|..|+
T Consensus 128 ----dave~ir~krrga~n-~kql~~l 149 (173)
T KOG2836|consen 128 ----DAVEMIRQKRRGAIN-SKQLLYL 149 (173)
T ss_pred ----HHHHHHHHHhhcccc-HHHHHHH
Confidence 889999999999995 4676663
No 24
>cd00127 DSPc Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like.
Probab=99.07 E-value=7.6e-10 Score=96.78 Aligned_cols=88 Identities=15% Similarity=0.198 Sum_probs=65.6
Q ss_pred EEEEEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHH
Q psy15334 319 MISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVH 398 (451)
Q Consensus 319 ~V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~ 398 (451)
...|+.+.+|++...+.....+++|+...... ++||+|||.+|.||||++++...+.. +..+..++++
T Consensus 49 ~~~~~~~~D~~~~~~~~~~~~~~~~i~~~~~~----~~~vlVHC~~G~~Rs~~~~~~~l~~~--------~~~~~~~a~~ 116 (139)
T cd00127 49 NYLYVPILDLPSQDISKYFDEAVDFIDDAREK----GGKVLVHCLAGVSRSATLVIAYLMKT--------LGLSLREAYE 116 (139)
T ss_pred eEEEEEceeCCCCChHHHHHHHHHHHHHHHhc----CCcEEEECCCCCchhHHHHHHHHHHH--------cCCCHHHHHH
Confidence 46777887887555444444556666554432 57999999999999999998664432 3468899999
Q ss_pred HHHhhccCCCCCHHHHHHHH
Q psy15334 399 GLRRQRVLMVQSLAQYAFIY 418 (451)
Q Consensus 399 ~lR~qR~~~Vqt~~QY~fiy 418 (451)
.+|++|+.+..+..+...+.
T Consensus 117 ~vr~~r~~~~~~~~~~~~l~ 136 (139)
T cd00127 117 FVKSRRPIISPNAGFMRQLK 136 (139)
T ss_pred HHHHHCCccCCCHHHHHHHH
Confidence 99999999988888777654
No 25
>COG2453 CDC14 Predicted protein-tyrosine phosphatase [Signal transduction mechanisms]
Probab=98.93 E-value=1.2e-08 Score=93.86 Aligned_cols=90 Identities=29% Similarity=0.276 Sum_probs=68.1
Q ss_pred cCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccC
Q psy15334 327 TWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVL 406 (451)
Q Consensus 327 ~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~ 406 (451)
.++|+.+|+. ..+-+++..+...... .+.|+|||.+|+|||||++|+..|+.+ +.-...+++.-+|..|++
T Consensus 79 ~~~D~~~p~~-~~l~~~v~~i~~~~~~-g~kVvVHC~~GigRSgtviaA~lm~~~-------~~~~~~~~i~~~~~~r~~ 149 (180)
T COG2453 79 PILDGTVPDL-EDLDKIVDFIEEALSK-GKKVVVHCQGGIGRSGTVIAAYLMLYG-------GLSLADEAIAVKRRRRPG 149 (180)
T ss_pred eecCCCCCcH-HHHHHHHHHHHHHHhc-CCeEEEEcCCCCchHHHHHHHHHHHHc-------CCCCHHHHHHHHHhcCCc
Confidence 5689998864 4555555555444433 348999999999999999998877642 344566888889999999
Q ss_pred CCCCHHHHHHHHHHHHHHH
Q psy15334 407 MVQSLAQYAFIYECVQVYL 425 (451)
Q Consensus 407 ~Vqt~~QY~fiy~~l~~~l 425 (451)
++.+..|+.|..+.....-
T Consensus 150 ~v~~~~q~~~~~e~~~~~~ 168 (180)
T COG2453 150 AVVTEIQHLFELEQELFRK 168 (180)
T ss_pred ccccHHHHHHHHHHHHHHh
Confidence 9999999999888766443
No 26
>smart00195 DSPc Dual specificity phosphatase, catalytic domain.
Probab=98.71 E-value=1.6e-07 Score=82.20 Aligned_cols=57 Identities=18% Similarity=0.292 Sum_probs=47.2
Q ss_pred CCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHH
Q psy15334 354 AQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIY 418 (451)
Q Consensus 354 ~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy 418 (451)
..+||+|||.+|.||||+++++..+.. ...++.+++..+|+.|+.+..+..|..-+.
T Consensus 77 ~~~~VlVHC~~G~~RS~~v~~~yl~~~--------~~~~~~~A~~~v~~~R~~~~p~~~~~~qL~ 133 (138)
T smart00195 77 KGGKVLVHCQAGVSRSATLIIAYLMKY--------RNLSLNDAYDFVKDRRPIISPNFGFLRQLI 133 (138)
T ss_pred CCCeEEEECCCCCchHHHHHHHHHHHH--------hCCCHHHHHHHHHHHCCccCCCHhHHHHHH
Confidence 367999999999999999998774432 235889999999999999999988876654
No 27
>PF00782 DSPc: Dual specificity phosphatase, catalytic domain; InterPro: IPR000340 Protein tyrosine (pTyr) phosphorylation is a common post-translational modification which can create novel recognition motifs for protein interactions and cellular localisation, affect protein stability, and regulate enzyme activity. Consequently, maintaining an appropriate level of protein tyrosine phosphorylation is essential for many cellular functions. Tyrosine-specific protein phosphatases (PTPase; 3.1.3.48 from EC) catalyse the removal of a phosphate group attached to a tyrosine residue, using a cysteinyl-phosphate enzyme intermediate. These enzymes are key regulatory components in signal transduction pathways (such as the MAP kinase pathway) and cell cycle control, and are important in the control of cell growth, proliferation, differentiation and transformation [, ]. The PTP superfamily can be divided into four subfamilies []: (1) pTyr-specific phosphatases (2) dual specificity phosphatases (dTyr and dSer/dThr) (3) Cdc25 phosphatases (dTyr and/or dThr) (4) LMW (low molecular weight) phosphatases Based on their cellular localisation, PTPases are also classified as: Receptor-like, which are transmembrane receptors that contain PTPase domains [] Non-receptor (intracellular) PTPases [] All PTPases carry the highly conserved active site motif C(X)5R (PTP signature motif), employ a common catalytic mechanism, and share a similar core structure made of a central parallel beta-sheet with flanking alpha-helices containing a beta-loop-alpha-loop that encompasses the PTP signature motif []. Functional diversity between PTPases is endowed by regulatory domains and subunits. This entry represents dual specificity protein-tyrosine phosphatases. Ser/Thr and Tyr dual specificity phosphatases are a group of enzymes with both Ser/Thr (3.1.3.16 from EC) and tyrosine specific protein phosphatase (3.1.3.48 from EC) activity able to remove both the serine/threonine or tyrosine-bound phosphate group from a wide range of phosphoproteins, including a number of enzymes which have been phosphorylated under the action of a kinase. Dual specificity protein phosphatases (DSPs) regulate mitogenic signal transduction and control the cell cycle. The crystal structure of a human DSP, vaccinia H1-related phosphatase (or VHR), has been determined at 2.1 angstrom resolution []. A shallow active site pocket in VHR allows for the hydrolysis of phosphorylated serine, threonine, or tyrosine protein residues, whereas the deeper active site of protein tyrosine phosphatases (PTPs) restricts substrate specificity to only phosphotyrosine. Positively charged crevices near the active site may explain the enzyme's preference for substrates with two phosphorylated residues. The VHR structure defines a conserved structural scaffold for both DSPs and PTPs. A "recognition region" connecting helix alpha1 to strand beta1, may determine differences in substrate specificity between VHR, the PTPs, and other DSPs. These proteins may also have inactive phosphatase domains, and dependent on the domain composition this loss of catalytic activity has different effects on protein function. Inactive single domain phosphatases can still specifically bind substrates, and protect again dephosphorylation, while the inactive domains of tandem phosphatases can be further subdivided into two classes. Those which bind phosphorylated tyrosine residues may recruit multi-phosphorylated substrates for the adjacent active domains and are more conserved, while the other class have accumulated several variable amino acid substitutions and have a complete loss of tyrosine binding capability. The second class shows a release of evolutionary constraint for the sites around the catalytic centre, which emphasises a difference in function from the first group. There is a region of higher conservation common to both classes, suggesting a new regulatory centre [].; GO: 0008138 protein tyrosine/serine/threonine phosphatase activity, 0006470 protein dephosphorylation; PDB: 2G6Z_A 1MKP_A 1YZ4_A 2P4D_A 1M3G_A 1ZZW_A 2OUD_A 2HXP_A 3LJ8_A 1OHD_A ....
Probab=98.58 E-value=2e-07 Score=80.89 Aligned_cols=70 Identities=20% Similarity=0.349 Sum_probs=52.9
Q ss_pred hHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHH
Q psy15334 338 NTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFI 417 (451)
Q Consensus 338 ~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fi 417 (451)
..+.+|+..... .+++|+|||.+|+||||+++++..+.. ...++.++++.+|+.|+.+..+..|+..+
T Consensus 60 ~~~~~~i~~~~~----~~~~VlVHC~~G~~RS~~v~~ayLm~~--------~~~~~~~A~~~v~~~rp~~~~~~~~~~~L 127 (133)
T PF00782_consen 60 DQAVEFIENAIS----EGGKVLVHCKAGLSRSGAVAAAYLMKK--------NGMSLEEAIEYVRSRRPQINPNPSFIRQL 127 (133)
T ss_dssp HHHHHHHHHHHH----TTSEEEEEESSSSSHHHHHHHHHHHHH--------HTSSHHHHHHHHHHHSTTSTHHHHHHHHH
T ss_pred HHHHHhhhhhhc----ccceeEEEeCCCcccchHHHHHHHHHH--------cCCCHHHHHHHHHHHCCCCCCCHHHHHHH
Confidence 345555554432 267999999999999999998765432 23588899999999999998888887776
Q ss_pred HH
Q psy15334 418 YE 419 (451)
Q Consensus 418 y~ 419 (451)
+.
T Consensus 128 ~~ 129 (133)
T PF00782_consen 128 YE 129 (133)
T ss_dssp HH
T ss_pred HH
Confidence 53
No 28
>PRK12361 hypothetical protein; Provisional
Probab=98.03 E-value=3.2e-05 Score=83.27 Aligned_cols=82 Identities=18% Similarity=0.280 Sum_probs=59.7
Q ss_pred CCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCC
Q psy15334 329 PDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMV 408 (451)
Q Consensus 329 pd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~V 408 (451)
.|+..| +.+.+-+.++.+.+... ..++|+|||.+|.|||+++++...+. .. ...++.+++..+|+.||.+.
T Consensus 151 ~D~~~p-~~~~l~~a~~~i~~~~~-~~~~VlVHC~~G~sRSa~vv~ayLm~----~~---~~~~~~eA~~~vr~~Rp~v~ 221 (547)
T PRK12361 151 LDHSVP-TLAQLNQAINWIHRQVR-ANKSVVVHCALGRGRSVLVLAAYLLC----KD---PDLTVEEVLQQIKQIRKTAR 221 (547)
T ss_pred CCCCCC-cHHHHHHHHHHHHHHHH-CCCeEEEECCCCCCcHHHHHHHHHHH----hc---cCCCHHHHHHHHHHHCCCCC
Confidence 366666 44555444444443332 25799999999999999998876432 22 35688999999999999999
Q ss_pred CCHHHHHHHHH
Q psy15334 409 QSLAQYAFIYE 419 (451)
Q Consensus 409 qt~~QY~fiy~ 419 (451)
.+..|...+-+
T Consensus 222 ~n~~q~~~l~~ 232 (547)
T PRK12361 222 LNKRQLRALEK 232 (547)
T ss_pred CCHHHHHHHHH
Confidence 99999877554
No 29
>PF05706 CDKN3: Cyclin-dependent kinase inhibitor 3 (CDKN3); InterPro: IPR022778 This entry represents a domain found in cyclin-dependent kinase inhibitor 3 or kinase associated phosphatase proteins from several mammalian species. The cyclin-dependent kinase (Cdk)-associated protein phosphatase (KAP) is a human dual specificity protein phosphatase that dephosphorylates Cdk2 on threonine 160 in a cyclin-dependent manner [], []. This domain is also found in MAP kinase phosphatase and esterases. This entry contains both eukaryotic and bacterial proteins.; GO: 0004721 phosphoprotein phosphatase activity, 0004725 protein tyrosine phosphatase activity; PDB: 1FQ1_A 1FPZ_F.
Probab=97.96 E-value=1.4e-05 Score=71.46 Aligned_cols=89 Identities=22% Similarity=0.280 Sum_probs=47.6
Q ss_pred HhhhccCCeEEEeccceecccccccccCCCCCCCceeeeEEEEEeeeEeecceEEEEEEEEeCCeeeEEEEEEeecCCCC
Q psy15334 252 MCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYGSWVLSELRMCRGGEHRMISHYHFTTWPDS 331 (451)
Q Consensus 252 MVwe~~v~~IVmLt~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~~~~~r~l~v~~~~~~r~V~h~~~~~Wpd~ 331 (451)
-+..+.+..||.|+...|..+-+ +|.-++.. +.+-+..+|| -.||.
T Consensus 66 ~Lk~~G~~~Vvtl~~~~EL~~l~----Vp~L~~~~-----------------------------~~~Gi~~~h~-PI~D~ 111 (168)
T PF05706_consen 66 RLKDWGAQDVVTLLTDHELARLG----VPDLGEAA-----------------------------QARGIAWHHL-PIPDG 111 (168)
T ss_dssp HHHHTT--EEEE-S-HHHHHHTT-----TTHHHHH-----------------------------HHTT-EEEE-----TT
T ss_pred HHHHCCCCEEEEeCcHHHHHHcC----CccHHHHH-----------------------------HHcCCEEEec-CccCC
Confidence 46678999999998777765432 23211000 1122333444 46899
Q ss_pred CCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHH
Q psy15334 332 GVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALD 376 (451)
Q Consensus 332 gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~ 376 (451)
++|+ ...+.+++..+.+.... ...|+|||..|.||||+++|..
T Consensus 112 ~aPd-~~~~~~i~~eL~~~L~~-g~~V~vHC~GGlGRtGlvAAcL 154 (168)
T PF05706_consen 112 SAPD-FAAAWQILEELAARLEN-GRKVLVHCRGGLGRTGLVAACL 154 (168)
T ss_dssp S----HHHHHHHHHHHHHHHHT-T--EEEE-SSSSSHHHHHHHHH
T ss_pred CCCC-HHHHHHHHHHHHHHHHc-CCEEEEECCCCCCHHHHHHHHH
Confidence 9995 56677777777766554 5689999999999999988643
No 30
>PF03162 Y_phosphatase2: Tyrosine phosphatase family; InterPro: IPR004861 Protein tyrosine (pTyr) phosphorylation is a common post-translational modification which can create novel recognition motifs for protein interactions and cellular localisation, affect protein stability, and regulate enzyme activity. Consequently, maintaining an appropriate level of protein tyrosine phosphorylation is essential for many cellular functions. Tyrosine-specific protein phosphatases (PTPase; 3.1.3.48 from EC) catalyse the removal of a phosphate group attached to a tyrosine residue, using a cysteinyl-phosphate enzyme intermediate. These enzymes are key regulatory components in signal transduction pathways (such as the MAP kinase pathway) and cell cycle control, and are important in the control of cell growth, proliferation, differentiation and transformation [, ]. The PTP superfamily can be divided into four subfamilies []: (1) pTyr-specific phosphatases (2) dual specificity phosphatases (dTyr and dSer/dThr) (3) Cdc25 phosphatases (dTyr and/or dThr) (4) LMW (low molecular weight) phosphatases Based on their cellular localisation, PTPases are also classified as: Receptor-like, which are transmembrane receptors that contain PTPase domains [] Non-receptor (intracellular) PTPases [] All PTPases carry the highly conserved active site motif C(X)5R (PTP signature motif), employ a common catalytic mechanism, and share a similar core structure made of a central parallel beta-sheet with flanking alpha-helices containing a beta-loop-alpha-loop that encompasses the PTP signature motif []. Functional diversity between PTPases is endowed by regulatory domains and subunits. This entry represents protein-tyrosine phosphatases predominantly from fungi, plants and bacteria, several of which are putative enzymes. These proteins are closely related to the Y-phosphatase and DSPc families. This entry includes the PTPase SIW14 from Saccharomyces cerevisiae (Baker's yeast), which plays a role in actin filament organisation and endocytosis.; PDB: 2Q47_A 1XRI_A.
Probab=97.62 E-value=9.3e-05 Score=66.95 Aligned_cols=72 Identities=22% Similarity=0.329 Sum_probs=40.8
Q ss_pred EEEEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHH
Q psy15334 320 ISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHG 399 (451)
Q Consensus 320 V~h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~ 399 (451)
+.|+....+.+..++.+.+.+.+.++.+... .+.||+|||..|..|||+.+|+... + ..-++.+++.+
T Consensus 59 l~~~~~~~~~~~~~~~~~~~v~~aL~~ild~---~n~PvLiHC~~G~~rTG~vvg~lRk---~------Q~W~~~~i~~E 126 (164)
T PF03162_consen 59 LIHIPMSSSKDPWVPISEEQVAEALEIILDP---RNYPVLIHCNHGKDRTGLVVGCLRK---L------QGWSLSSIFDE 126 (164)
T ss_dssp EEE-------GGG----HHHHHHHHHHHH-G---GG-SEEEE-SSSSSHHHHHHHHHHH---H------TTB-HHHHHHH
T ss_pred EEEeccccccCccccCCHHHHHHHHHHHhCC---CCCCEEEEeCCCCcchhhHHHHHHH---H------cCCCHHHHHHH
Confidence 4455555555545666677777777655432 3679999999999999999986653 3 23466788888
Q ss_pred HHhh
Q psy15334 400 LRRQ 403 (451)
Q Consensus 400 lR~q 403 (451)
+|.-
T Consensus 127 y~~f 130 (164)
T PF03162_consen 127 YRRF 130 (164)
T ss_dssp HHHH
T ss_pred HHHh
Confidence 8763
No 31
>PF14566 PTPlike_phytase: Inositol hexakisphosphate; PDB: 1U24_A 2PSZ_B 3MOZ_A 3D1H_B 2B4P_B 3D1Q_A 2B4O_A 3MMJ_B 1U25_A 1U26_B ....
Probab=97.28 E-value=0.00045 Score=61.48 Aligned_cols=49 Identities=29% Similarity=0.572 Sum_probs=34.6
Q ss_pred CCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHH
Q psy15334 328 WPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVL 379 (451)
Q Consensus 328 Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l 379 (451)
.+|+..|. ++.+-.|+..++.. +....+++||.+|.|||.+|.++..|+
T Consensus 100 itd~~~P~-~~~iD~fi~~v~~~--p~~~~l~fhC~~G~GRTTt~Mv~~~li 148 (149)
T PF14566_consen 100 ITDHQAPD-PEDIDAFINFVKSL--PKDTWLHFHCQAGRGRTTTFMVMYDLI 148 (149)
T ss_dssp E-TTS----HHHHHHHHHHHHTS---TT-EEEEE-SSSSHHHHHHHHHHHHH
T ss_pred CCCcCCCC-HHHHHHHHHHHHhC--CCCCeEEEECCCCCCHHHHHHHHHHHh
Confidence 36777674 67788888888765 446789999999999999999987665
No 32
>KOG2283|consensus
Probab=96.98 E-value=0.0012 Score=68.41 Aligned_cols=89 Identities=24% Similarity=0.315 Sum_probs=59.4
Q ss_pred ecCCCCCCCCChhHHHHHHHHHHHhcCCCCC-CEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhc
Q psy15334 326 TTWPDSGVPSSPNTLVSFVLSFRTHISPAQH-PIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQR 404 (451)
Q Consensus 326 ~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~-PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR 404 (451)
.+|+||.+|. -+.|..|.+.+...+..+.. =++|||.+|-||||+.+++..+...+.... ...+|.+..- |-..
T Consensus 78 ~~~~Dh~~P~-L~~l~~~c~~~~~WL~~d~~nVvvvHCk~Gkgrtg~~icA~L~~~~~~~ta-~eald~~~~k---R~~~ 152 (434)
T KOG2283|consen 78 FGFDDHNPPP-LELLCPFCKSMDNWLSEDPKNVVVVHCKAGKGRTGVMICAYLIYSGISATA-EEALDYFNEK---RFDE 152 (434)
T ss_pred cCCCCCCCCc-HHHHHHHHHCHHHHHhcCccceEEEEccCCCcceEEEEeHHHHhhhhcCCH-HHHHHHHhhh---hccc
Confidence 4899999884 56777888888777665443 479999999999999988887776665421 0234444322 2111
Q ss_pred c--CCCCCHHHHHHHHH
Q psy15334 405 V--LMVQSLAQYAFIYE 419 (451)
Q Consensus 405 ~--~~Vqt~~QY~fiy~ 419 (451)
. +-+-...|-.|++.
T Consensus 153 ~~~~~~~~PSq~RYv~Y 169 (434)
T KOG2283|consen 153 GKSKGVTIPSQRRYVGY 169 (434)
T ss_pred cccCCccCchhhHHHHH
Confidence 2 34566777777654
No 33
>KOG1718|consensus
Probab=96.50 E-value=0.0057 Score=54.55 Aligned_cols=56 Identities=18% Similarity=0.276 Sum_probs=37.9
Q ss_pred HHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccC
Q psy15334 339 TLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVL 406 (451)
Q Consensus 339 ~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~ 406 (451)
.+.+.+..|... .|-++|||-|||+||+.+|.+..+ +.+ ..++.++-..+...||-
T Consensus 82 ~vAD~I~~v~~~----gG~TLvHC~AGVSRSAsLClAYLm----K~~----~msLreAy~~vKa~Rpi 137 (198)
T KOG1718|consen 82 PVADKIHSVIMR----GGKTLVHCVAGVSRSASLCLAYLM----KYH----CMSLREAYHWVKARRPI 137 (198)
T ss_pred HHHHHHHHHHhc----CCcEEEEEccccchhHHHHHHHHH----HHc----cchHHHHHHHHHhhCce
Confidence 344555555443 688999999999999999977643 332 45566666666666653
No 34
>PF13350 Y_phosphatase3: Tyrosine phosphatase family; PDB: 1YWF_A 2OZ5_B.
Probab=96.50 E-value=0.004 Score=56.17 Aligned_cols=39 Identities=21% Similarity=0.138 Sum_probs=24.0
Q ss_pred cEEEecCCCCCCHHHHHhHhhhccCCeEEEeccceecccc
Q psy15334 234 QYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRD 273 (451)
Q Consensus 234 ~yI~tQ~Pl~~T~~dFWrMVwe~~v~~IVmLt~~~E~~~~ 273 (451)
.++=+=.|..-|-+|.+.+. +.++++||=|-...|..+.
T Consensus 20 ~lyRS~~l~~lt~~d~~~L~-~lgI~tIiDLRs~~E~~~~ 58 (164)
T PF13350_consen 20 RLYRSGNLSNLTEADLERLR-ELGIRTIIDLRSPTERERA 58 (164)
T ss_dssp SEEEES--TT--HHHHHHHH-HTT--EEEE-S-HHHHHHH
T ss_pred cEEecCCcCcCCHHHHHHHH-hCCCCEEEECCCccccccC
Confidence 45556667778999999888 9999999999887776543
No 35
>TIGR01244 conserved hypothetical protein TIGR01244. No member of this family is characterized. The member from Xylella fastidiosa is a longer protein with an N-terminal region described by this model, followed by a metallo-beta-lactamase family domain and an additional C-terminal region. Members scoring above the trusted cutoff are limited to the proteobacteria.
Probab=96.40 E-value=0.0082 Score=52.45 Aligned_cols=56 Identities=20% Similarity=0.427 Sum_probs=38.4
Q ss_pred ChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhcc
Q psy15334 336 SPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRV 405 (451)
Q Consensus 336 ~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~ 405 (451)
+...+..|.+.+.. ..+||++||.+|- |||++.++.... . + ++...++...|..-.
T Consensus 71 ~~~~v~~f~~~~~~----~~~pvL~HC~sG~-Rt~~l~al~~~~-----~---g-~~~~~i~~~~~~~G~ 126 (135)
T TIGR01244 71 TPDDVETFRAAIGA----AEGPVLAYCRSGT-RSSLLWGFRQAA-----E---G-VPVEEIVRRAQAAGY 126 (135)
T ss_pred CHHHHHHHHHHHHh----CCCCEEEEcCCCh-HHHHHHHHHHHH-----c---C-CCHHHHHHHHHHcCC
Confidence 44556666665542 2589999999999 999988765332 1 3 567777777766543
No 36
>COG5350 Predicted protein tyrosine phosphatase [General function prediction only]
Probab=96.38 E-value=0.024 Score=49.83 Aligned_cols=82 Identities=18% Similarity=0.344 Sum_probs=49.5
Q ss_pred EEEeecCCCCC-CCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHH
Q psy15334 322 HYHFTTWPDSG-VPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGL 400 (451)
Q Consensus 322 h~~~~~Wpd~g-vP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~l 400 (451)
||+=..-||.| .-.....+..++.-+++. +...|+||||.+|++||-....+..+ . + . ...|=-++...|
T Consensus 61 ~fnDI~~~~~g~~ap~e~Hv~~i~DF~~~w--p~~apllIHC~aGISRStA~A~i~a~-a-l--a---~~~de~ela~~L 131 (172)
T COG5350 61 HFNDIAEPDDGWIAPGEAHVRAIIDFADEW--PRFAPLLIHCYAGISRSTAAALIAAL-A-L--A---PDMDETELAERL 131 (172)
T ss_pred eeccccCCCccccCCCHHHHHHHHHHHhcC--ccccceeeeeccccccchHHHHHHHH-h-h--c---cccChHHHHHHH
Confidence 44444556555 333334433333333333 34689999999999999765443222 1 1 1 345666788899
Q ss_pred HhhccCCCCCHH
Q psy15334 401 RRQRVLMVQSLA 412 (451)
Q Consensus 401 R~qR~~~Vqt~~ 412 (451)
|.-+|.+-.+.-
T Consensus 132 ra~sp~atPN~R 143 (172)
T COG5350 132 RALSPYATPNPR 143 (172)
T ss_pred HhcCcccCCChh
Confidence 999998877643
No 37
>KOG1716|consensus
Probab=96.31 E-value=0.013 Score=57.95 Aligned_cols=61 Identities=20% Similarity=0.354 Sum_probs=46.4
Q ss_pred hHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCC
Q psy15334 338 NTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQS 410 (451)
Q Consensus 338 ~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt 410 (451)
...++|+..++.. .+-|+|||.+|++||.+++.+++|.. ....+.++.+.++..|+....+
T Consensus 142 ~~~~~fI~~a~~~----~~~vlVHC~~GvSRSat~viAYlM~~--------~~~~l~~A~~~vk~~R~~i~PN 202 (285)
T KOG1716|consen 142 PEAISFIEKAREK----GGKVLVHCQAGVSRSATLVIAYLMKY--------EGLSLEDAYELVKSRRPIISPN 202 (285)
T ss_pred HHHHHHHHHHHhC----CCeEEEEcCCccchhHHHHHHHHHHH--------cCCCHHHHHHHHHHhCCccCCC
Confidence 4456677766654 67899999999999999998776542 2456778888888888877664
No 38
>KOG1719|consensus
Probab=96.03 E-value=0.09 Score=46.44 Aligned_cols=55 Identities=24% Similarity=0.310 Sum_probs=43.8
Q ss_pred CCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHH
Q psy15334 355 QHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFI 417 (451)
Q Consensus 355 ~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fi 417 (451)
..-+-|||-||-|||.|.++...+. . +.-....++..+|+.|+...=-..|..-+
T Consensus 109 GktvYVHCKAGRtRSaTvV~cYLmq----~----~~wtpe~A~~~vr~iRp~VlL~~~Qw~~l 163 (183)
T KOG1719|consen 109 GKTVYVHCKAGRTRSATVVACYLMQ----H----KNWTPEAAVEHVRKIRPRVLLRPAQWDVL 163 (183)
T ss_pred CCeEEEEecCCCccchhhhhhhhhh----h----cCCCHHHHHHHHHhcCcceeecHHHHHHH
Confidence 4578999999999999998876432 1 34566789999999999988888887543
No 39
>PF04273 DUF442: Putative phosphatase (DUF442); InterPro: IPR005939 Although this domain is uncharacterised it seems likely that it performs a phosphatase function.; GO: 0016787 hydrolase activity; PDB: 2F46_A 3GXH_B 3GXG_B.
Probab=95.28 E-value=0.034 Score=46.77 Aligned_cols=38 Identities=21% Similarity=0.525 Sum_probs=27.7
Q ss_pred ChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHH
Q psy15334 336 SPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTV 378 (451)
Q Consensus 336 ~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~ 378 (451)
+.+.+..|.+.+... .+||++||..|. |++++.++.-.
T Consensus 71 ~~~~v~~f~~~l~~~----~~Pvl~hC~sG~-Ra~~l~~l~~~ 108 (110)
T PF04273_consen 71 TEEDVEAFADALESL----PKPVLAHCRSGT-RASALWALAQA 108 (110)
T ss_dssp -HHHHHHHHHHHHTT----TTSEEEE-SCSH-HHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHhC----CCCEEEECCCCh-hHHHHHHHHhh
Confidence 567777777766653 579999999998 99988876543
No 40
>COG2365 Protein tyrosine/serine phosphatase [Signal transduction mechanisms]
Probab=95.09 E-value=0.038 Score=53.44 Aligned_cols=27 Identities=41% Similarity=0.695 Sum_probs=23.5
Q ss_pred CCCEEEEcCCCCchhhhHHHHHHHHHH
Q psy15334 355 QHPIVVHCSAGVGRTGTFIALDTVLQR 381 (451)
Q Consensus 355 ~~PivVHCs~GvGRtG~fiai~~~l~~ 381 (451)
++|+++||.+|.-|||..+|+...+..
T Consensus 136 ~~PvL~HC~~GkdRTGl~~al~r~~~~ 162 (249)
T COG2365 136 NGPVLIHCTAGKDRTGLVAALYRKLVG 162 (249)
T ss_pred cCCEEEecCCCCcchHHHHHHHHHHhC
Confidence 689999999999999999998766543
No 41
>PLN02727 NAD kinase
Probab=94.66 E-value=0.071 Score=59.51 Aligned_cols=42 Identities=21% Similarity=0.316 Sum_probs=34.0
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHH
Q psy15334 334 PSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTV 378 (451)
Q Consensus 334 P~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~ 378 (451)
+.+.+.+.+|.+.+++. ..+||++||..|.+|||+++|....
T Consensus 323 apt~EqVe~fa~~l~~s---lpkPVLvHCKSGarRAGamvA~yl~ 364 (986)
T PLN02727 323 APSAEQVEKFASLVSDS---SKKPIYLHSKEGVWRTSAMVSRWKQ 364 (986)
T ss_pred CCCHHHHHHHHHHHHhh---cCCCEEEECCCCCchHHHHHHHHHH
Confidence 34568888888888543 2679999999999999999998765
No 42
>KOG0196|consensus
Probab=93.82 E-value=0.14 Score=56.19 Aligned_cols=21 Identities=43% Similarity=0.457 Sum_probs=18.5
Q ss_pred EcCCCCCCCeeeEEEEEeeCC
Q psy15334 47 CNGPLKQGTTYRVKVRAFTSP 67 (451)
Q Consensus 47 ~NgpL~p~~~Y~i~vrA~t~~ 67 (451)
.--.|+||+.|.|.|||.|..
T Consensus 501 ti~gL~p~t~YvfqVRarT~a 521 (996)
T KOG0196|consen 501 TITGLKPGTVYVFQVRARTAA 521 (996)
T ss_pred EeeccCCCcEEEEEEEEeccc
Confidence 345799999999999999986
No 43
>KOG4471|consensus
Probab=93.81 E-value=0.086 Score=55.61 Aligned_cols=33 Identities=27% Similarity=0.612 Sum_probs=26.4
Q ss_pred HHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHH
Q psy15334 347 FRTHISPAQHPIVVHCSAGVGRTGTFIALDTVL 379 (451)
Q Consensus 347 v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l 379 (451)
|.......+.||+||||+|=.||..++++..++
T Consensus 366 Ia~kVe~~~~sVlVHCSDGWDRT~QlvsLA~Ll 398 (717)
T KOG4471|consen 366 IADKVESESRSVLVHCSDGWDRTAQLVSLAMLL 398 (717)
T ss_pred HHHHHhcCCceEEEEcCCCccchHHHHHHHHHH
Confidence 333445567899999999999999999987764
No 44
>KOG1717|consensus
Probab=93.60 E-value=0.18 Score=48.40 Aligned_cols=68 Identities=19% Similarity=0.359 Sum_probs=42.0
Q ss_pred HHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHHhhccCCCCCHHHHHHHH
Q psy15334 339 TLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIY 418 (451)
Q Consensus 339 ~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR~qR~~~Vqt~~QY~fiy 418 (451)
..+.||...|.. +.-++|||-+|+.||-|.+++..| +.+. +..-++...+.+.+.+.-.+ |-|++
T Consensus 239 EAIsfIdeArsk----~cgvLVHClaGISRSvTvtvaYLM-qkl~-------lslndAyd~Vk~kksnisPN---FnFMg 303 (343)
T KOG1717|consen 239 EAISFIDEARSK----NCGVLVHCLAGISRSVTVTVAYLM-QKLN-------LSLNDAYDFVKHKKSNISPN---FNFMG 303 (343)
T ss_pred HHHHHHHHhhcc----CCcEEEeeeccccchhHHHHHHHH-HHhc-------cchhhHHHHHHHhccCCCCC---cchhH
Confidence 346677666543 457999999999999999887644 3332 23334444444445554444 45666
Q ss_pred HHH
Q psy15334 419 ECV 421 (451)
Q Consensus 419 ~~l 421 (451)
+.+
T Consensus 304 QLl 306 (343)
T KOG1717|consen 304 QLL 306 (343)
T ss_pred HHH
Confidence 654
No 45
>PF07353 Uroplakin_II: Uroplakin II; InterPro: IPR009952 This family contains uroplakin II, which is approximately 180 residues long and seems to be restricted to mammals. Uroplakin II is an integral membrane protein, and is one of the components of the apical plaques of mammalian urothelium formed by the asymmetric unit membrane - this is believed to play a role in strengthening the urothelial apical surface to prevent the cells from rupturing during bladder distension [].; GO: 0016044 cellular membrane organization, 0030176 integral to endoplasmic reticulum membrane
Probab=90.87 E-value=0.29 Score=43.28 Aligned_cols=25 Identities=20% Similarity=0.239 Sum_probs=21.1
Q ss_pred CCCceEcCCCCCCCeeeEEEEEeeC
Q psy15334 42 NEAHYCNGPLKQGTTYRVKVRAFTS 66 (451)
Q Consensus 42 ~~~~~~NgpL~p~~~Y~i~vrA~t~ 66 (451)
+-.+|++++|.||+.|.++...-.+
T Consensus 100 rlsaYqVtNL~pGTkY~isY~Vtkg 124 (184)
T PF07353_consen 100 RLSAYQVTNLQPGTKYYISYLVTKG 124 (184)
T ss_pred cceeEEeeccCCCcEEEEEEEEecC
Confidence 4578999999999999999876444
No 46
>KOG1572|consensus
Probab=88.40 E-value=1.3 Score=42.02 Aligned_cols=28 Identities=29% Similarity=0.585 Sum_probs=22.9
Q ss_pred cCCCCCCEEEEcCCCCchhhhHHHHHHH
Q psy15334 351 ISPAQHPIVVHCSAGVGRTGTFIALDTV 378 (451)
Q Consensus 351 ~~~~~~PivVHCs~GvGRtG~fiai~~~ 378 (451)
....+.|+++||.-|--|||++++...-
T Consensus 144 ld~~N~P~Lihc~rGkhRtg~lVgclRk 171 (249)
T KOG1572|consen 144 LDKRNYPILIHCKRGKHRTGCLVGCLRK 171 (249)
T ss_pred hcccCCceEEecCCCCcchhhhHHHHHH
Confidence 3445789999999999999999985543
No 47
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=87.78 E-value=0.17 Score=43.29 Aligned_cols=25 Identities=16% Similarity=0.186 Sum_probs=13.1
Q ss_pred chhhhHH--HHHHHHhhheeeeeeccc
Q psy15334 87 WLLVMGF--LVLIVLPVLFIIIRRKIV 111 (451)
Q Consensus 87 ~ii~gv~--Liliil~vll~~~rrr~~ 111 (451)
+||+||+ +|+++++++|+++|+|++
T Consensus 68 ~Ii~gv~aGvIg~Illi~y~irR~~Kk 94 (122)
T PF01102_consen 68 GIIFGVMAGVIGIILLISYCIRRLRKK 94 (122)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHS--
T ss_pred ehhHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 5556665 455555555666655553
No 48
>PF08693 SKG6: Transmembrane alpha-helix domain; InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=87.26 E-value=0.39 Score=32.51 Aligned_cols=21 Identities=29% Similarity=0.407 Sum_probs=9.7
Q ss_pred hhhhHHHHHHHHhhhee-eeee
Q psy15334 88 LLVMGFLVLIVLPVLFI-IIRR 108 (451)
Q Consensus 88 ii~gv~Liliil~vll~-~~rr 108 (451)
+++++..|++++++++| ++||
T Consensus 17 VvVPV~vI~~vl~~~l~~~~rR 38 (40)
T PF08693_consen 17 VVVPVGVIIIVLGAFLFFWYRR 38 (40)
T ss_pred EEechHHHHHHHHHHhheEEec
Confidence 33444444555544444 4444
No 49
>PF06602 Myotub-related: Myotubularin-like phosphatase domain; InterPro: IPR010569 This family represents a region within eukaryotic myotubularin-related proteins that is sometimes found with IPR004182 from INTERPRO. Myotubularin is a dual-specific lipid phosphatase that dephosphorylates phosphatidylinositol 3-phosphate and phosphatidylinositol (3,5)-bi-phosphate []. Mutations in gene encoding myotubularin-related proteins have been associated with disease [].; GO: 0016791 phosphatase activity, 0016311 dephosphorylation; PDB: 1M7R_B 1LW3_A 1ZVR_A 1ZSQ_A 2YF0_A.
Probab=86.69 E-value=0.88 Score=46.36 Aligned_cols=41 Identities=24% Similarity=0.413 Sum_probs=26.4
Q ss_pred HHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHHHHHHHH
Q psy15334 339 TLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVL 379 (451)
Q Consensus 339 ~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fiai~~~l 379 (451)
.++.-...+-+.......+|+|||++|-.||..++++..++
T Consensus 215 ~~L~~a~~i~~~l~~~~~~Vlvh~~dGwDrt~q~~sL~ql~ 255 (353)
T PF06602_consen 215 SILSGASRIADLLHDEGSSVLVHCSDGWDRTSQLSSLAQLL 255 (353)
T ss_dssp HHHHHHHHHHHHHHTT--EEEEECTTSSSHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhhccCceEEEEcCCCCcccHHHHHHHHHH
Confidence 34443334444443456799999999999999988765543
No 50
>KOG2386|consensus
Probab=85.39 E-value=1.1 Score=45.85 Aligned_cols=88 Identities=19% Similarity=0.256 Sum_probs=58.9
Q ss_pred eecCCCCCCCCChhHHHHHHHHHHHhcC---CCCCCEEEEcCCCCchhhhHHHHHHHHHHHHcccCCCCCCHHHHHHHHH
Q psy15334 325 FTTWPDSGVPSSPNTLVSFVLSFRTHIS---PAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLR 401 (451)
Q Consensus 325 ~~~Wpd~gvP~~~~~ll~fi~~v~~~~~---~~~~PivVHCs~GvGRtG~fiai~~~l~~l~~~~~~~~vdi~~~v~~lR 401 (451)
...-+-+++-++....-.|++.+..... -...=|.|||..|..|||-+|+...+ .. +...+-++++.+-
T Consensus 91 K~~c~g~~~vp~~~~v~~fv~~v~~f~~~~~~~~~LI~vhcthG~NrtgyLI~~yL~----~~----~~~s~~~aik~f~ 162 (393)
T KOG2386|consen 91 KRNCPGRGVVPRTELVDKFVKLVKGFVDDTKLDDELIGVHCTHGLNRTGYLICAYLA----DV----GGYSSSEAIKRFA 162 (393)
T ss_pred EeccCCcccCCCccchHHHHHHHHHHHhcccCCCCEEEEeCCCcccccceeeeeeee----ec----cCccHHHHHHHHH
Confidence 3455666633344455566666655433 23445999999999999987654322 22 3378889999999
Q ss_pred hhccCCCCCHHHHHHHHHH
Q psy15334 402 RQRVLMVQSLAQYAFIYEC 420 (451)
Q Consensus 402 ~qR~~~Vqt~~QY~fiy~~ 420 (451)
..|+..+.-.+-+.=+|..
T Consensus 163 ~~r~~gi~k~dyi~~L~~~ 181 (393)
T KOG2386|consen 163 DARPPGIEKQDYIDALYSR 181 (393)
T ss_pred HhCCCccCchHHHHHHhhc
Confidence 9999998887766655543
No 51
>KOG1089|consensus
Probab=82.40 E-value=1.9 Score=46.02 Aligned_cols=30 Identities=30% Similarity=0.498 Sum_probs=23.2
Q ss_pred HhcCCCCCCEEEEcCCCCchhhhHHHHHHH
Q psy15334 349 THISPAQHPIVVHCSAGVGRTGTFIALDTV 378 (451)
Q Consensus 349 ~~~~~~~~PivVHCs~GvGRtG~fiai~~~ 378 (451)
+.+...+.+||||||+|-.||...+.+.-+
T Consensus 338 ~~l~~~~~sVlvhcsdGwDrT~qV~SLaQl 367 (573)
T KOG1089|consen 338 KCLSSEGASVLVHCSDGWDRTCQVSSLAQL 367 (573)
T ss_pred HHHHhCCCeEEEEccCCcchhHHHHHHHHH
Confidence 334445679999999999999998876543
No 52
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below. +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+ In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100. Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail. Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=80.79 E-value=3.8 Score=40.86 Aligned_cols=54 Identities=17% Similarity=0.242 Sum_probs=26.3
Q ss_pred CCeeeEEEEEeeCCCCcccccCCCCcccCCC----ccchhhhHHHH--HHHHhhheeeeeeccc
Q psy15334 54 GTTYRVKVRAFTSPELFTDTIYSPPVSTGKD----VSWLLVMGFLV--LIVLPVLFIIIRRKIV 111 (451)
Q Consensus 54 ~~~Y~i~vrA~t~~~~~t~s~~S~pi~t~~~----~~~ii~gv~Li--liil~vll~~~rrr~~ 111 (451)
-....+.|+||-..+. .|+....+..| -.+|++|+.|. +||+++.+++.|||.+
T Consensus 241 l~~~~lqvQaF~~~~~----~Fg~a~~C~~D~~~~~vPIaVG~~La~lvlivLiaYli~Rrr~~ 300 (306)
T PF01299_consen 241 LNLSDLQVQAFRVKNN----TFGTAEECSSDDTSDLVPIAVGAALAGLVLIVLIAYLIGRRRSR 300 (306)
T ss_pred EEEeEEEEEEEEecCC----CCCChhcCCcCCccchHHHHHHHHHHHHHHHHHHhheeEecccc
Confidence 4555678888865322 14444334332 23556676533 2222333555555543
No 53
>PF00041 fn3: Fibronectin type III domain; InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes []. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis []. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface [], although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene []. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure []. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface []. The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=79.40 E-value=3.1 Score=31.76 Aligned_cols=24 Identities=42% Similarity=0.594 Sum_probs=21.1
Q ss_pred CceEcCCCCCCCeeeEEEEEeeCC
Q psy15334 44 AHYCNGPLKQGTTYRVKVRAFTSP 67 (451)
Q Consensus 44 ~~~~NgpL~p~~~Y~i~vrA~t~~ 67 (451)
..+.-..|+|++.|.|.|+|++..
T Consensus 56 ~~~~i~~L~p~t~Y~~~v~a~~~~ 79 (85)
T PF00041_consen 56 TSYTITGLQPGTTYEFRVRAVNSD 79 (85)
T ss_dssp SEEEEESCCTTSEEEEEEEEEETT
T ss_pred eeeeeccCCCCCEEEEEEEEEeCC
Confidence 367888999999999999999874
No 54
>PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 [].
Probab=77.83 E-value=0.63 Score=40.27 Aligned_cols=7 Identities=29% Similarity=0.757 Sum_probs=3.1
Q ss_pred chhhhHH
Q psy15334 87 WLLVMGF 93 (451)
Q Consensus 87 ~ii~gv~ 93 (451)
|++.+++
T Consensus 2 W~l~~ii 8 (130)
T PF12273_consen 2 WVLFAII 8 (130)
T ss_pred eeeHHHH
Confidence 3444444
No 55
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains: A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains; A transmembrane region; A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins. The proteins known to belong to this family are: Syndecan 1. Syndecan 2 or fibroglycan. Syndecan 3 or neuroglycan or N-syndecan. Syndecan 4 or amphiglycan or ryudocan. Drosophila syndecan. Caenorhabditis elegans probable syndecan (F57C7.3). Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=73.85 E-value=1.1 Score=33.61 Aligned_cols=24 Identities=29% Similarity=0.555 Sum_probs=0.4
Q ss_pred chhhhHH--HHHHHHhhheeeeeecc
Q psy15334 87 WLLVMGF--LVLIVLPVLFIIIRRKI 110 (451)
Q Consensus 87 ~ii~gv~--Liliil~vll~~~rrr~ 110 (451)
++|+|++ +++++++++|++.|.|+
T Consensus 13 avIaG~Vvgll~ailLIlf~iyR~rk 38 (64)
T PF01034_consen 13 AVIAGGVVGLLFAILLILFLIYRMRK 38 (64)
T ss_dssp ------------------------S-
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4555554 44444445555555433
No 56
>PF15102 TMEM154: TMEM154 protein family
Probab=69.07 E-value=7.8 Score=34.10 Aligned_cols=8 Identities=38% Similarity=0.879 Sum_probs=3.6
Q ss_pred hHHHhHHh
Q psy15334 166 IAVEFEDL 173 (451)
Q Consensus 166 ~~~Ef~~l 173 (451)
+.-|-|+|
T Consensus 123 meiEmeel 130 (146)
T PF15102_consen 123 MEIEMEEL 130 (146)
T ss_pred hhhhHHHH
Confidence 33444555
No 57
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=67.20 E-value=2.5 Score=28.23 Aligned_cols=22 Identities=9% Similarity=0.245 Sum_probs=11.7
Q ss_pred cchhhhHH--HHHHHHhhheeeee
Q psy15334 86 SWLLVMGF--LVLIVLPVLFIIIR 107 (451)
Q Consensus 86 ~~ii~gv~--Liliil~vll~~~r 107 (451)
.++|+||+ ++++++.+++|.+.
T Consensus 6 IaIIv~V~vg~~iiii~~~~YaCc 29 (38)
T PF02439_consen 6 IAIIVAVVVGMAIIIICMFYYACC 29 (38)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHH
Confidence 35677776 44444555554433
No 58
>PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=63.59 E-value=5.5 Score=35.26 Aligned_cols=17 Identities=29% Similarity=0.567 Sum_probs=8.2
Q ss_pred HHHHHHhhheeeeeecc
Q psy15334 94 LVLIVLPVLFIIIRRKI 110 (451)
Q Consensus 94 Liliil~vll~~~rrr~ 110 (451)
|||++++++|++++|++
T Consensus 62 ill~il~lvf~~c~r~k 78 (154)
T PF04478_consen 62 ILLGILALVFIFCIRRK 78 (154)
T ss_pred HHHHHHHhheeEEEecc
Confidence 44444555544544444
No 59
>PF05283 MGC-24: Multi-glycosylated core protein 24 (MGC-24); InterPro: IPR007947 CD164 is a mucin-like receptor, or sialomucin, with specificity in receptor/ ligand interactions that depends on the structural characteristics of the mucin-like receptor. Its functions include mediating, or regulating, haematopoietic progenitor cell adhesion and the negative regulation of their growth and/or-differentiation. It exists in the native state as a disulphide- linked homodimer of two 80-85kDa subunits. It is usually expressed by CD34+ and CD341o/- haematopoietic stem cells and associated microenvironmental cells. It contains, in its extracellular region, two mucin domains (I and II) linked by a non-mucin domain, which has been predicted to contain intra- disulphide bridges. This receptor may play a key role in haematopoiesis by facilitating the adhesion of human CD34+ cells to bone marrow stroma and by negatively regulating CD34+ CD341o/- haematopoietic progenitor cell proliferation. These effects involve the CD164 class I and/or II epitopes recognised by the monoclonal antibodies (mAbs) 105A5 and 103B2/9E10. These epitopes are carbohydrate-dependent and are located on the N-terminal mucin domain I [, ]. It has been found that murine MGC-24v and rat endolyn share significant sequence similarities with human CD164. However, CD164 lacks the consensus glycosaminoglycan (GAG)-attachment site found in MGC-24; it is possible that GAG-association is responsible for the high molecular weight of the epithelial-derived MGC-24 glycoprotein []. Genomic structure studies have placed CD164 within the mucin-subgroup that comprises multiple exons, and demonstrate the diverse chromosomal distribution of this family of molecules. Molecules with such multiple exons may have sophisticated regulatory mechanisms that involve not only post-translational modifications of the oligosaccharide side chains, but also differential exon usage. Although differences in the intron and exon sizes are seen between the mouse and human genes, the predicted proteins are similar in size and structure, maintaining functionally important motifs that regulate cell proliferation or subcellular distribution []. CD164 is a gene whose expression depends on differential usage of poly- adenylation sites within the 3'-UTR. The conserved distribution of the 3.2- and 1.2-kb CD164 transcripts between mouse and human suggests that (i) a mechanism may exist to regulate tissue-specific polyadenylation, and (ii) differences in polyadenylation are important for the expression and function of CD164 in different tissues. Two other aspects of the structure of CD164 are of particular interest. First, it shares one of several conserved features of a cytokine-binding pocket - in this respect, it is notable that evidence exists for a class of cell-surface sialomucin modulators that directly interact with growth factor receptors to regulate their response to physiological ligands. Second, its cytoplasmic tail contains a C-terminal YHTL motif found in many endocytic membrane proteins or receptors. These Tyr-based motifs bind to adaptor proteins, which mediate the sorting of membrane proteins into transport vesicles from the plasma membrane to the endosomes, and between intracellular compartments.
Probab=63.40 E-value=4.9 Score=36.96 Aligned_cols=24 Identities=25% Similarity=0.364 Sum_probs=19.1
Q ss_pred CccchhhhHHHHHHHHhhheeeee
Q psy15334 84 DVSWLLVMGFLVLIVLPVLFIIIR 107 (451)
Q Consensus 84 ~~~~ii~gv~Liliil~vll~~~r 107 (451)
|....|+||||.|-+++|+||++|
T Consensus 159 D~~SFiGGIVL~LGv~aI~ff~~K 182 (186)
T PF05283_consen 159 DAASFIGGIVLTLGVLAIIFFLYK 182 (186)
T ss_pred chhhhhhHHHHHHHHHHHHHHHhh
Confidence 345679999999999888887765
No 60
>PF12877 DUF3827: Domain of unknown function (DUF3827); InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells.
Probab=62.56 E-value=3.2 Score=44.71 Aligned_cols=12 Identities=8% Similarity=0.301 Sum_probs=9.1
Q ss_pred CCCccchhhhHH
Q psy15334 82 GKDVSWLLVMGF 93 (451)
Q Consensus 82 ~~~~~~ii~gv~ 93 (451)
...++|||+||+
T Consensus 265 ~~~NlWII~gVl 276 (684)
T PF12877_consen 265 PPNNLWIIAGVL 276 (684)
T ss_pred CCCCeEEEehHh
Confidence 456789988885
No 61
>PF05808 Podoplanin: Podoplanin; InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes [].; GO: 0016021 integral to membrane; PDB: 3IET_X.
Probab=58.26 E-value=3.3 Score=36.97 Aligned_cols=28 Identities=21% Similarity=0.295 Sum_probs=0.0
Q ss_pred chhhhHHHHHHHHhhheeeeeecccCcC
Q psy15334 87 WLLVMGFLVLIVLPVLFIIIRRKIVGVT 114 (451)
Q Consensus 87 ~ii~gv~Liliil~vll~~~rrr~~~~~ 114 (451)
+||+|+++-+.+|+.++++.-||.++|+
T Consensus 133 GIIVGVLlaIG~igGIIivvvRKmSGRy 160 (162)
T PF05808_consen 133 GIIVGVLLAIGFIGGIIIVVVRKMSGRY 160 (162)
T ss_dssp ----------------------------
T ss_pred eehhhHHHHHHHHhheeeEEeehhcccc
Confidence 4556665433334444444444555554
No 62
>PF14575 EphA2_TM: Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=55.59 E-value=5.6 Score=30.96 Aligned_cols=11 Identities=27% Similarity=0.543 Sum_probs=4.4
Q ss_pred hhhHHHHHHHH
Q psy15334 89 LVMGFLVLIVL 99 (451)
Q Consensus 89 i~gv~Liliil 99 (451)
++|++++++++
T Consensus 7 ~~g~~~ll~~v 17 (75)
T PF14575_consen 7 IVGVLLLLVLV 17 (75)
T ss_dssp HHHHHHHHHHH
T ss_pred HHHHHHHHHhh
Confidence 34444333333
No 63
>COG3453 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=54.92 E-value=22 Score=30.36 Aligned_cols=27 Identities=15% Similarity=0.453 Sum_probs=19.5
Q ss_pred ChhHHHHHHHHHHHhcCCCCCCEEEEcCCCC
Q psy15334 336 SPNTLVSFVLSFRTHISPAQHPIVVHCSAGV 366 (451)
Q Consensus 336 ~~~~ll~fi~~v~~~~~~~~~PivVHCs~Gv 366 (451)
+...+-.|.+.+.+. .+||+.||..|.
T Consensus 72 T~~dV~~f~~Al~ea----egPVlayCrsGt 98 (130)
T COG3453 72 TEADVEAFQRALDEA----EGPVLAYCRSGT 98 (130)
T ss_pred CHHHHHHHHHHHHHh----CCCEEeeecCCc
Confidence 455666666666554 789999998775
No 64
>COG1470 Predicted membrane protein [Function unknown]
Probab=54.63 E-value=9.7 Score=39.73 Aligned_cols=52 Identities=17% Similarity=0.275 Sum_probs=25.8
Q ss_pred CCeeeEEEEEeeCCCCcccccCCCCcccCCCccchhhhHHHHHHHHhhheeeeee
Q psy15334 54 GTTYRVKVRAFTSPELFTDTIYSPPVSTGKDVSWLLVMGFLVLIVLPVLFIIIRR 108 (451)
Q Consensus 54 ~~~Y~i~vrA~t~~~~~t~s~~S~pi~t~~~~~~ii~gv~Liliil~vll~~~rr 108 (451)
...|++.++|-+.. .+ +.-+.-+.......+-.+|+.+|++++++++|+.||
T Consensus 458 aGdY~i~i~~ksDq--~s-~e~tlrV~V~~sS~st~iGI~Ii~~~v~~L~fviRK 509 (513)
T COG1470 458 AGDYRITITAKSDQ--AS-SEDTLRVVVGQSSTSTYIGIAIIVLVVLGLIFVIRK 509 (513)
T ss_pred CCcEEEEEEEeecc--cc-ccceEEEEEeccccchhhhHHHHHHHHHHHHhhhHH
Confidence 34577777765541 11 111111112223345566776666677777776664
No 65
>KOG4221|consensus
Probab=47.37 E-value=31 Score=40.27 Aligned_cols=20 Identities=25% Similarity=0.217 Sum_probs=17.4
Q ss_pred cCCCCCCCeeeEEEEEeeCC
Q psy15334 48 NGPLKQGTTYRVKVRAFTSP 67 (451)
Q Consensus 48 NgpL~p~~~Y~i~vrA~t~~ 67 (451)
-.+|.|++.|-|+++|-+..
T Consensus 986 v~~l~p~t~yffkiQAr~~k 1005 (1381)
T KOG4221|consen 986 VPNLDPDTGYFFKIQARNEK 1005 (1381)
T ss_pred cCCCCCCCceEEEEEeeccC
Confidence 36899999999999998764
No 66
>PF05568 ASFV_J13L: African swine fever virus J13L protein; InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=46.38 E-value=7 Score=34.11 Aligned_cols=23 Identities=17% Similarity=0.408 Sum_probs=11.0
Q ss_pred hhhhHHHHHHHHhhheeeeeecc
Q psy15334 88 LLVMGFLVLIVLPVLFIIIRRKI 110 (451)
Q Consensus 88 ii~gv~Liliil~vll~~~rrr~ 110 (451)
|++++++++|++.++++|+-+|+
T Consensus 34 ILiaIvVliiiiivli~lcssRK 56 (189)
T PF05568_consen 34 ILIAIVVLIIIIIVLIYLCSSRK 56 (189)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhh
Confidence 44555544444445555544444
No 67
>smart00060 FN3 Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Probab=44.79 E-value=24 Score=25.25 Aligned_cols=23 Identities=39% Similarity=0.588 Sum_probs=20.0
Q ss_pred CceEcCCCCCCCeeeEEEEEeeC
Q psy15334 44 AHYCNGPLKQGTTYRVKVRAFTS 66 (451)
Q Consensus 44 ~~~~NgpL~p~~~Y~i~vrA~t~ 66 (451)
..+.-..|.|++.|.+.|+|++.
T Consensus 57 ~~~~i~~L~~~~~Y~v~v~a~~~ 79 (83)
T smart00060 57 TSYTLTGLKPGTEYEFRVRAVNG 79 (83)
T ss_pred cEEEEeCcCCCCEEEEEEEEEcc
Confidence 45777889999999999999875
No 68
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=43.66 E-value=7.7 Score=38.29 Aligned_cols=8 Identities=13% Similarity=0.202 Sum_probs=0.0
Q ss_pred eeeeeecc
Q psy15334 103 FIIIRRKI 110 (451)
Q Consensus 103 l~~~rrr~ 110 (451)
+|++|||+
T Consensus 168 icyrrkR~ 175 (290)
T PF05454_consen 168 ICYRRKRK 175 (290)
T ss_dssp --------
T ss_pred Hhhhhhhc
Confidence 34444333
No 69
>PF02061 Lambda_CIII: Lambda Phage CIII; InterPro: IPR013056 Bacteriophage lambda regulatory protein CIII is a small protein that plays a role in stabilising the CII transcriptional activator, via a mechanism that is not yet fully understood [, ]. Stabilised CII activates CI, the gene for the repressor protein that prevents transcription of proteins required for lytic development. The central portion of the protein is well conserved and is both necessary and sufficient for the activity of the protein []. Comparative analysis of the CIII sequence in lambda, Bacteriophage HK022 and the lambdoid Enterobacteria phage P22 has led to the suggestion that this central region assumes an amphipathic alpha-helical structure []. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=42.10 E-value=34 Score=23.21 Aligned_cols=24 Identities=25% Similarity=0.678 Sum_probs=20.2
Q ss_pred EEEeecCCCCCCCCChhHHHHHHH
Q psy15334 322 HYHFTTWPDSGVPSSPNTLVSFVL 345 (451)
Q Consensus 322 h~~~~~Wpd~gvP~~~~~ll~fi~ 345 (451)
+|++.+||-.|+|.-.++|++-+.
T Consensus 2 ~~~~AG~~~~G~~ql~ESLLdrIt 25 (45)
T PF02061_consen 2 QYAIAGWPRMGCPQLSESLLDRIT 25 (45)
T ss_pred eeeecCccccCCchhhHHHHHHHH
Confidence 688999999999977788887653
No 70
>cd01533 4RHOD_Repeat_2 Member of the Rhodanese Homology Domain superfamily, repeat 2. This CD includes putative rhodanese-related sulfurtransferases which contain 4 copies of the Rhodanese Homology Domain. This CD aligns the 2nd repeat which does contain the putative catalytic Cys residue.
Probab=41.37 E-value=44 Score=27.21 Aligned_cols=15 Identities=40% Similarity=0.769 Sum_probs=12.0
Q ss_pred CCCEEEEcCCCCchhh
Q psy15334 355 QHPIVVHCSAGVGRTG 370 (451)
Q Consensus 355 ~~PivVHCs~GvGRtG 370 (451)
..||+|+|..|. ||.
T Consensus 66 ~~~ivv~C~~G~-rs~ 80 (109)
T cd01533 66 RTPIVVNCAGRT-RSI 80 (109)
T ss_pred CCeEEEECCCCc-hHH
Confidence 579999999886 663
No 71
>cd01518 RHOD_YceA Member of the Rhodanese Homology Domain superfamily. This CD includes Escherichia coli YceA, Bacillus subtilis YbfQ, and similar uncharacterized proteins.
Probab=39.25 E-value=31 Score=27.65 Aligned_cols=18 Identities=17% Similarity=0.366 Sum_probs=13.8
Q ss_pred CCCCEEEEcCCCCchhhhH
Q psy15334 354 AQHPIVVHCSAGVGRTGTF 372 (451)
Q Consensus 354 ~~~PivVHCs~GvGRtG~f 372 (451)
...||||+|..| .||...
T Consensus 60 ~~~~ivvyC~~G-~rs~~a 77 (101)
T cd01518 60 KGKKVLMYCTGG-IRCEKA 77 (101)
T ss_pred CCCEEEEECCCc-hhHHHH
Confidence 357999999876 677654
No 72
>cd01523 RHOD_Lact_B Member of the Rhodanese Homology Domain superfamily. This CD includes predicted proteins with rhodanese-like domains found N-terminal of the metallo-beta-lactamase domain.
Probab=38.07 E-value=31 Score=27.61 Aligned_cols=17 Identities=12% Similarity=0.233 Sum_probs=12.9
Q ss_pred CCCCEEEEcCCCCchhhh
Q psy15334 354 AQHPIVVHCSAGVGRTGT 371 (451)
Q Consensus 354 ~~~PivVHCs~GvGRtG~ 371 (451)
...||||+|..|. ||..
T Consensus 60 ~~~~ivv~C~~G~-rs~~ 76 (100)
T cd01523 60 DDQEVTVICAKEG-SSQF 76 (100)
T ss_pred CCCeEEEEcCCCC-cHHH
Confidence 3579999999884 6643
No 73
>TIGR01167 LPXTG_anchor LPXTG-motif cell wall anchor domain. A common feature of this proteins containing this domain appears to be a high proportion of charged and zwitterionic residues immediatedly upstream of the LPXTG motif. This model differs from other descriptions of the LPXTG region by including a portion of that upstream charged region.
Probab=37.85 E-value=34 Score=21.67 Aligned_cols=8 Identities=25% Similarity=0.173 Sum_probs=3.1
Q ss_pred hhheeeee
Q psy15334 100 PVLFIIIR 107 (451)
Q Consensus 100 ~vll~~~r 107 (451)
+++++.+|
T Consensus 24 ~~~~~~~r 31 (34)
T TIGR01167 24 GGLLLRKR 31 (34)
T ss_pred HHHHheec
Confidence 33344443
No 74
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=37.76 E-value=9.4 Score=37.91 Aligned_cols=8 Identities=25% Similarity=0.413 Sum_probs=3.5
Q ss_pred eeeeeecc
Q psy15334 103 FIIIRRKI 110 (451)
Q Consensus 103 l~~~rrr~ 110 (451)
++||.||+
T Consensus 277 LILRYRRK 284 (299)
T PF02009_consen 277 LILRYRRK 284 (299)
T ss_pred HHHHHHHH
Confidence 44444443
No 75
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=37.67 E-value=20 Score=29.29 Aligned_cols=6 Identities=17% Similarity=0.573 Sum_probs=2.6
Q ss_pred eeeeee
Q psy15334 103 FIIIRR 108 (451)
Q Consensus 103 l~~~rr 108 (451)
+|++||
T Consensus 89 ~f~~r~ 94 (96)
T PTZ00382 89 WFVCRG 94 (96)
T ss_pred eeEEee
Confidence 444443
No 76
>PF07204 Orthoreo_P10: Orthoreovirus membrane fusion protein p10; InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences. p10 is thought to be a multifunctional protein that plays a key role in virus-host interaction [].
Probab=36.72 E-value=14 Score=29.83 Aligned_cols=20 Identities=15% Similarity=0.222 Sum_probs=9.4
Q ss_pred hHHHHHHHHhhheeeeeecc
Q psy15334 91 MGFLVLIVLPVLFIIIRRKI 110 (451)
Q Consensus 91 gv~Liliil~vll~~~rrr~ 110 (451)
|++|++|+++++++.+.|++
T Consensus 50 G~iLilIii~Lv~CC~~K~K 69 (98)
T PF07204_consen 50 GLILILIIIALVCCCRAKHK 69 (98)
T ss_pred hhhhHHHHHHHHHHhhhhhh
Confidence 44455555544444444333
No 77
>PF05337 CSF-1: Macrophage colony stimulating factor-1 (CSF-1); InterPro: IPR008001 Colony stimulating factor 1 (CSF-1) is a homodimeric polypeptide growth factor whose primary function is to regulate the survival, proliferation, differentiation, and function of cells of the mononuclear phagocytic lineage. This lineage includes mononuclear phagocytic precursors, blood monocytes, tissue macrophages, osteoclasts, and microglia of the brain, all of which possess cell surface receptors for CSF-1. The protein has also been linked with male fertility [] and mutations in the Csf-1 gene have been found to cause osteopetrosis and failure of tooth eruption [].; GO: 0005125 cytokine activity, 0008083 growth factor activity, 0016021 integral to membrane; PDB: 3EJJ_A.
Probab=35.78 E-value=12 Score=36.33 Aligned_cols=23 Identities=26% Similarity=0.275 Sum_probs=0.0
Q ss_pred hhhHHHHHHHHhhheeeeeeccc
Q psy15334 89 LVMGFLVLIVLPVLFIIIRRKIV 111 (451)
Q Consensus 89 i~gv~Liliil~vll~~~rrr~~ 111 (451)
+.+|+|+|+.++.|+|.+|||..
T Consensus 232 VPSiILVLLaVGGLLfYr~rrRs 254 (285)
T PF05337_consen 232 VPSIILVLLAVGGLLFYRRRRRS 254 (285)
T ss_dssp -----------------------
T ss_pred ccchhhhhhhccceeeecccccc
Confidence 34455677777777666665543
No 78
>PF06084 Cytomega_TRL10: Cytomegalovirus TRL10 protein; InterPro: IPR009284 This family consists of several Cytomegalovirus TRL10 proteins. TRL10 represents a structural component of the virus particle and like the other HCMV envelope glycoproteins, is present in a disulphide-linked complex [].
Probab=34.66 E-value=29 Score=29.09 Aligned_cols=33 Identities=33% Similarity=0.398 Sum_probs=18.4
Q ss_pred CCCCcccCCCccchhhhHH-HHHHHHhhheeeee
Q psy15334 75 YSPPVSTGKDVSWLLVMGF-LVLIVLPVLFIIIR 107 (451)
Q Consensus 75 ~S~pi~t~~~~~~ii~gv~-Liliil~vll~~~r 107 (451)
+.+||+.-..-.+|-+|.+ ..||+++++||+.-
T Consensus 47 ~tepis~lg~ysawgagsfiatliillviffviy 80 (150)
T PF06084_consen 47 YTEPISMLGIYSAWGAGSFIATLIILLVIFFVIY 80 (150)
T ss_pred ccCcchhhhhhhhcccchHHHHHHHHHHHhheeE
Confidence 5567766555556667765 34555555544443
No 79
>COG3889 Predicted solute binding protein [General function prediction only]
Probab=33.02 E-value=39 Score=37.55 Aligned_cols=15 Identities=27% Similarity=0.698 Sum_probs=7.2
Q ss_pred HHHHHhhh-eeeeeec
Q psy15334 95 VLIVLPVL-FIIIRRK 109 (451)
Q Consensus 95 iliil~vl-l~~~rrr 109 (451)
||++|++| ++++|||
T Consensus 856 iiv~LaAla~lLrRRr 871 (872)
T COG3889 856 IIVGLAALALLLRRRR 871 (872)
T ss_pred HHHHHHHHHHHHHhhc
Confidence 33444443 5565554
No 80
>PF11770 GAPT: GRB2-binding adapter (GAPT); InterPro: IPR021082 This entry represents a family of transmembrane proteins which bind the growth factor receptor-bound protein 2 (GRB2) in B cells []. In contrast to other transmembrane adaptor proteins, GAPT, which this entry represents, is not phosphorylated upon BCR ligation. It associates with GRB2 constitutively through its proline-rich region [].
Probab=32.55 E-value=40 Score=29.80 Aligned_cols=25 Identities=8% Similarity=-0.065 Sum_probs=12.8
Q ss_pred CCccchhhhHH--HHHHHHhhheeeee
Q psy15334 83 KDVSWLLVMGF--LVLIVLPVLFIIIR 107 (451)
Q Consensus 83 ~~~~~ii~gv~--Liliil~vll~~~r 107 (451)
...+++.+|++ |+|++.|+-++|.+
T Consensus 7 n~sv~i~igi~Ll~lLl~cgiGcvwhw 33 (158)
T PF11770_consen 7 NTSVAISIGISLLLLLLLCGIGCVWHW 33 (158)
T ss_pred CchHHHHHHHHHHHHHHHHhcceEEEe
Confidence 34456777776 33333444455543
No 81
>PF11353 DUF3153: Protein of unknown function (DUF3153); InterPro: IPR021499 This family of proteins with unknown function appear to be restricted to Cyanobacteria. Some members are annotated as membrane proteins however this cannot be confirmed.
Probab=32.06 E-value=42 Score=31.42 Aligned_cols=15 Identities=13% Similarity=0.313 Sum_probs=10.2
Q ss_pred CCCCCCeeeEEEEEe
Q psy15334 50 PLKQGTTYRVKVRAF 64 (451)
Q Consensus 50 pL~p~~~Y~i~vrA~ 64 (451)
+|+||..-++.+.+.
T Consensus 166 ~L~pGe~N~L~~~~w 180 (209)
T PF11353_consen 166 KLQPGEINHLEASFW 180 (209)
T ss_pred ecCCCceeEEEEEEE
Confidence 677777777766653
No 82
>cd01519 RHOD_HSP67B2 Member of the Rhodanese Homology Domain superfamily. This CD includes the heat shock protein 67B2 of Drosophila melanogaster and other similar proteins, many of which are uncharacterized.
Probab=31.85 E-value=59 Score=26.04 Aligned_cols=16 Identities=31% Similarity=0.698 Sum_probs=12.0
Q ss_pred CCCCEEEEcCCCCchhh
Q psy15334 354 AQHPIVVHCSAGVGRTG 370 (451)
Q Consensus 354 ~~~PivVHCs~GvGRtG 370 (451)
...||||+|..|. |+.
T Consensus 65 ~~~~ivv~c~~g~-~s~ 80 (106)
T cd01519 65 KDKELIFYCKAGV-RSK 80 (106)
T ss_pred CCCeEEEECCCcH-HHH
Confidence 3679999999875 553
No 83
>TIGR03000 plancto_dom_1 Planctomycetes uncharacterized domain TIGR03000. Domains described by this model are found, so far, only in the Planctomycetes (Pirellula sp. strain 1 and Gemmata obscuriglobus), in up to six proteins per genome, and may be duplicated within a protein. The function is unknown.
Probab=31.69 E-value=65 Score=25.08 Aligned_cols=28 Identities=21% Similarity=0.303 Sum_probs=23.7
Q ss_pred CCCceEcCCCCCCCeeeEEEEEeeCCCC
Q psy15334 42 NEAHYCNGPLKQGTTYRVKVRAFTSPEL 69 (451)
Q Consensus 42 ~~~~~~NgpL~p~~~Y~i~vrA~t~~~~ 69 (451)
....|.-.+|++|..|++.|+|-...++
T Consensus 27 ~~R~F~T~~L~~G~~y~Y~v~a~~~~dG 54 (75)
T TIGR03000 27 TVRTFTTPPLEAGKEYEYTVTAEYDRDG 54 (75)
T ss_pred cEEEEECCCCCCCCEEEEEEEEEEecCC
Confidence 4567999999999999999999776544
No 84
>PF08374 Protocadherin: Protocadherin; InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated [].
Probab=31.12 E-value=14 Score=34.64 Aligned_cols=11 Identities=36% Similarity=0.307 Sum_probs=5.1
Q ss_pred EEEecCCCCCC
Q psy15334 235 YIATQGPLCST 245 (451)
Q Consensus 235 yI~tQ~Pl~~T 245 (451)
-|+---|+.+|
T Consensus 181 ~~iQeLP~~NT 191 (221)
T PF08374_consen 181 HIIQELPLDNT 191 (221)
T ss_pred hhhhhcCCcce
Confidence 34444455554
No 85
>cd00063 FN3 Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Probab=30.75 E-value=59 Score=23.92 Aligned_cols=23 Identities=39% Similarity=0.588 Sum_probs=18.6
Q ss_pred CceEcCCCCCCCeeeEEEEEeeC
Q psy15334 44 AHYCNGPLKQGTTYRVKVRAFTS 66 (451)
Q Consensus 44 ~~~~NgpL~p~~~Y~i~vrA~t~ 66 (451)
..+--..|.|++.|.+.|+|+..
T Consensus 57 ~~~~i~~l~p~~~Y~~~v~a~~~ 79 (93)
T cd00063 57 TSYTLTGLKPGTEYEFRVRAVNG 79 (93)
T ss_pred cEEEEccccCCCEEEEEEEEECC
Confidence 34555569999999999999875
No 86
>PRK14750 kdpF potassium-transporting ATPase subunit F; Provisional
Probab=30.24 E-value=34 Score=21.34 Aligned_cols=18 Identities=28% Similarity=0.351 Sum_probs=10.8
Q ss_pred cchhhhHHHHHHHHhhhe
Q psy15334 86 SWLLVMGFLVLIVLPVLF 103 (451)
Q Consensus 86 ~~ii~gv~Liliil~vll 103 (451)
..+|.|++|++++++-++
T Consensus 3 ~~vi~g~llv~lLl~YLv 20 (29)
T PRK14750 3 FSIVCGALLVLLLLGYLV 20 (29)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 356777776666655443
No 87
>PF06365 CD34_antigen: CD34/Podocalyxin family; InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=30.01 E-value=26 Score=32.66 Aligned_cols=14 Identities=14% Similarity=0.090 Sum_probs=6.9
Q ss_pred HhhheeeeeecccC
Q psy15334 99 LPVLFIIIRRKIVG 112 (451)
Q Consensus 99 l~vll~~~rrr~~~ 112 (451)
+++.+|++.||..+
T Consensus 118 ~~~~Y~~~~Rrs~~ 131 (202)
T PF06365_consen 118 LGAGYCCHQRRSWS 131 (202)
T ss_pred HHHHHHhhhhccCC
Confidence 33345555555543
No 88
>cd01448 TST_Repeat_1 Thiosulfate sulfurtransferase (TST), N-terminal, inactive domain. TST contains 2 copies of the Rhodanese Homology Domain; this is the 1st repeat, which does not contain the catalytically active Cys residue. The role of the 1st repeat is uncertain, but it is believed to be involved in protein interaction.
Probab=29.33 E-value=89 Score=25.82 Aligned_cols=33 Identities=21% Similarity=0.233 Sum_probs=19.7
Q ss_pred ChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhH
Q psy15334 336 SPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTF 372 (451)
Q Consensus 336 ~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~f 372 (451)
+++.+.+++... -.....||||+|..| |+....
T Consensus 63 ~~~~~~~~~~~~---~~~~~~~vv~~c~~g-~~~a~~ 95 (122)
T cd01448 63 SPEEFAELLGSL---GISNDDTVVVYDDGG-GFFAAR 95 (122)
T ss_pred CHHHHHHHHHHc---CCCCCCEEEEECCCC-CccHHH
Confidence 455666665432 123367999999998 444433
No 89
>COG5495 Uncharacterized conserved protein [Function unknown]
Probab=29.22 E-value=56 Score=31.30 Aligned_cols=18 Identities=39% Similarity=0.700 Sum_probs=11.8
Q ss_pred CCCCCEEEEcCCCCchhhh
Q psy15334 353 PAQHPIVVHCSAGVGRTGT 371 (451)
Q Consensus 353 ~~~~PivVHCs~GvGRtG~ 371 (451)
...+.||+||| |.+-+|.
T Consensus 89 ~rpg~iv~HcS-ga~~~~i 106 (289)
T COG5495 89 NRPGTIVAHCS-GANGSGI 106 (289)
T ss_pred cCCCeEEEEcc-CCCchhh
Confidence 34678999999 4444443
No 90
>PLN02160 thiosulfate sulfurtransferase
Probab=28.50 E-value=51 Score=28.47 Aligned_cols=18 Identities=28% Similarity=0.503 Sum_probs=13.6
Q ss_pred CCCCEEEEcCCCCchhhhH
Q psy15334 354 AQHPIVVHCSAGVGRTGTF 372 (451)
Q Consensus 354 ~~~PivVHCs~GvGRtG~f 372 (451)
...||+|||..| +||..-
T Consensus 80 ~~~~IivyC~sG-~RS~~A 97 (136)
T PLN02160 80 PADDILVGCQSG-ARSLKA 97 (136)
T ss_pred CCCcEEEECCCc-HHHHHH
Confidence 357999999987 677643
No 91
>PRK11493 sseA 3-mercaptopyruvate sulfurtransferase; Provisional
Probab=25.77 E-value=61 Score=31.65 Aligned_cols=49 Identities=16% Similarity=0.308 Sum_probs=24.4
Q ss_pred EEEeecCCCCCCCCChhHHHHHHHHHHHhcCCCCCCEEEEcCCCCchhhhHH
Q psy15334 322 HYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFI 373 (451)
Q Consensus 322 h~~~~~Wpd~gvP~~~~~ll~fi~~v~~~~~~~~~PivVHCs~GvGRtG~fi 373 (451)
|+.+..+.+.+...++..|..++.. .-.....|||++|..|.--+.++.
T Consensus 201 ~i~~~~~~~~~~~~~~~~l~~~~~~---~g~~~~~~ii~yC~~G~~A~~~~~ 249 (281)
T PRK11493 201 NVPWTELVREGELKTTDELDAIFFG---RGVSFDRPIIASCGSGVTAAVVVL 249 (281)
T ss_pred CCCHHHhcCCCCcCCHHHHHHHHHh---cCCCCCCCEEEECCcHHHHHHHHH
Confidence 3444444443333344444444322 111235699999988774444333
No 92
>TIGR03865 PQQ_CXXCW PQQ-dependent catabolism-associated CXXCW motif protein. Members of this protein family have a CXXXCW motif, consistent with a possible role in redox cofactor binding. This protein family shows strong relationships by phylogenetic profiling and conserved gene neighborhoods with a transport system for alcohols metabolized by PQQ-dependent enzymes.
Probab=25.36 E-value=1.1e+02 Score=27.25 Aligned_cols=18 Identities=22% Similarity=0.316 Sum_probs=14.9
Q ss_pred CCCCEEEEcCCCCchhhh
Q psy15334 354 AQHPIVVHCSAGVGRTGT 371 (451)
Q Consensus 354 ~~~PivVHCs~GvGRtG~ 371 (451)
...||||.|..|..|+..
T Consensus 115 ~d~~IVvYC~~G~~~S~~ 132 (162)
T TIGR03865 115 KDRPLVFYCLADCWMSWN 132 (162)
T ss_pred CCCEEEEEECCCCHHHHH
Confidence 367999999998878875
No 93
>PF14610 DUF4448: Protein of unknown function (DUF4448)
Probab=25.24 E-value=40 Score=30.96 Aligned_cols=13 Identities=15% Similarity=0.235 Sum_probs=5.1
Q ss_pred hhHHHHHHHHhhh
Q psy15334 90 VMGFLVLIVLPVL 102 (451)
Q Consensus 90 ~gv~Liliil~vl 102 (451)
+++++++++++++
T Consensus 164 lPvvv~~~~~~~~ 176 (189)
T PF14610_consen 164 LPVVVVVLALIMY 176 (189)
T ss_pred ccHHHHHHHHHHH
Confidence 3344443333333
No 94
>PF10727 Rossmann-like: Rossmann-like domain; InterPro: IPR019665 This entry represents an NAD/NADP-binding domain with a core Rossmann-type fold, found in an uncharacterised protein family thought to be putative NADP oxidoreductase coenzyme F420-dependent proteins and/or NAD-dependent glycerol-3-phosphate dehydrogenase-like proteins. This Rossmann-fold domain consists of 3-layers alpha/beta/alpha, where the six beta strands are parallel in the order 321456.; PDB: 3DFU_A 3C24_A.
Probab=24.74 E-value=81 Score=27.10 Aligned_cols=29 Identities=21% Similarity=0.204 Sum_probs=13.3
Q ss_pred HHHHHHHHHHHhcCCCCCCEEEEcCCCCc
Q psy15334 339 TLVSFVLSFRTHISPAQHPIVVHCSAGVG 367 (451)
Q Consensus 339 ~ll~fi~~v~~~~~~~~~PivVHCs~GvG 367 (451)
.+-.....+...-....+.+|||||...+
T Consensus 80 aI~~va~~La~~~~~~~g~iVvHtSGa~~ 108 (127)
T PF10727_consen 80 AIAEVAEQLAQYGAWRPGQIVVHTSGALG 108 (127)
T ss_dssp HHHHHHHHHHCC--S-TT-EEEES-SS--
T ss_pred HHHHHHHHHHHhccCCCCcEEEECCCCCh
Confidence 34444555544311235789999996554
No 95
>PF03684 UPF0179: Uncharacterised protein family (UPF0179); InterPro: IPR005369 The function of this family is unknown, however the proteins contain two cysteine clusters that may be iron sulphur redox centres.
Probab=24.66 E-value=44 Score=29.37 Aligned_cols=29 Identities=38% Similarity=0.733 Sum_probs=22.0
Q ss_pred eCC-CCCC--CCCceEcCCCCCCCeeeEE-EEE
Q psy15334 35 IGR-ETCP--NEAHYCNGPLKQGTTYRVK-VRA 63 (451)
Q Consensus 35 iG~-~~c~--~~~~~~NgpL~p~~~Y~i~-vrA 63 (451)
.|. +.|. .....|.|.|++|..|+|. ||.
T Consensus 19 ~G~~~eCe~Crlk~~C~~nLe~Gr~YrI~~VR~ 51 (142)
T PF03684_consen 19 YGPAEECEGCRLKNVCLGNLEPGRRYRIVEVRN 51 (142)
T ss_pred ecCccccCCCCChHHhcccCCCCceEEEEEEcC
Confidence 454 4576 5667899999999999965 664
No 96
>KOG1530|consensus
Probab=24.59 E-value=64 Score=27.97 Aligned_cols=29 Identities=24% Similarity=0.496 Sum_probs=22.0
Q ss_pred HHHHHHHHHhcCCCCCCEEEEcCCCCchhh
Q psy15334 341 VSFVLSFRTHISPAQHPIVVHCSAGVGRTG 370 (451)
Q Consensus 341 l~fi~~v~~~~~~~~~PivVHCs~GvGRtG 370 (451)
.+|++.+-....+....||++|..|. |+.
T Consensus 75 ~eF~kqvg~~kp~~d~eiIf~C~SG~-Rs~ 103 (136)
T KOG1530|consen 75 PEFLKQVGSSKPPHDKEIIFGCASGV-RSL 103 (136)
T ss_pred HHHHHHhcccCCCCCCcEEEEeccCc-chh
Confidence 35778777666666779999999986 554
No 97
>KOG0208|consensus
Probab=24.27 E-value=1.6e+02 Score=34.14 Aligned_cols=67 Identities=22% Similarity=0.304 Sum_probs=52.0
Q ss_pred CCEEEEcCCCCchhhhHHHHHHHHHHHHccc------CCCCCCHHHHHHHHHhhccCCCCCHH--HHHHHHHHHH
Q psy15334 356 HPIVVHCSAGVGRTGTFIALDTVLQRLYNSE------GAGSVDILRIVHGLRRQRVLMVQSLA--QYAFIYECVQ 422 (451)
Q Consensus 356 ~PivVHCs~GvGRtG~fiai~~~l~~l~~~~------~~~~vdi~~~v~~lR~qR~~~Vqt~~--QY~fiy~~l~ 422 (451)
+-.|-.|.+|+.-+|++-|+|..+..-+.|. ..+..+|..+..-+|..|..+|.+.. ||.-+|.++.
T Consensus 852 ~y~VgfCGDGANDCgALKaAdvGISLSeaEASvAApFTSk~~~I~cVp~vIrEGRaALVTSf~~FkYMalYs~iq 926 (1140)
T KOG0208|consen 852 GYKVGFCGDGANDCGALKAADVGISLSEAEASVAAPFTSKTPSISCVPDVIREGRAALVTSFACFKYMALYSAIQ 926 (1140)
T ss_pred CcEEEecCCCcchhhhhhhcccCcchhhhhHhhcCccccCCCchhhHhHHHhhhhhhhhhhHHHHHHHHHHHHHH
Confidence 5688999999999999999999877555442 12456888888889999999999864 6666665543
No 98
>PRK10287 thiosulfate:cyanide sulfurtransferase; Provisional
Probab=24.20 E-value=1.2e+02 Score=25.01 Aligned_cols=17 Identities=24% Similarity=0.587 Sum_probs=12.5
Q ss_pred CCCEEEEcCCCCchhhhH
Q psy15334 355 QHPIVVHCSAGVGRTGTF 372 (451)
Q Consensus 355 ~~PivVHCs~GvGRtG~f 372 (451)
..||+++|.+| .||...
T Consensus 60 ~~~IVlyC~~G-~rS~~a 76 (104)
T PRK10287 60 NDTVKLYCNAG-RQSGQA 76 (104)
T ss_pred CCeEEEEeCCC-hHHHHH
Confidence 56899999887 555544
No 99
>PF10577 UPF0560: Uncharacterised protein family UPF0560; InterPro: IPR018890 This family of proteins has no known function.
Probab=24.20 E-value=37 Score=37.96 Aligned_cols=18 Identities=50% Similarity=0.680 Sum_probs=10.9
Q ss_pred HHHHHHhhheeeeeeccc
Q psy15334 94 LVLIVLPVLFIIIRRKIV 111 (451)
Q Consensus 94 Liliil~vll~~~rrr~~ 111 (451)
|+|+++.+|+|.||||..
T Consensus 285 ivl~lL~vLl~yCrrkc~ 302 (807)
T PF10577_consen 285 IVLILLCVLLCYCRRKCL 302 (807)
T ss_pred HHHHHHHHHHHhhhcccC
Confidence 444555667777776554
No 100
>cd01520 RHOD_YbbB Member of the Rhodanese Homology Domain superfamily. This CD includes several putative ATP /GTP binding proteins including E. coli YbbB.
Probab=23.24 E-value=1.7e+02 Score=24.68 Aligned_cols=20 Identities=15% Similarity=0.398 Sum_probs=14.6
Q ss_pred CCCCEEEEcCCCCchhhhHH
Q psy15334 354 AQHPIVVHCSAGVGRTGTFI 373 (451)
Q Consensus 354 ~~~PivVHCs~GvGRtG~fi 373 (451)
...||||.|..|..||...+
T Consensus 85 ~~~~vvvyC~~~G~rs~~a~ 104 (128)
T cd01520 85 RDPKLLIYCARGGMRSQSLA 104 (128)
T ss_pred CCCeEEEEeCCCCccHHHHH
Confidence 46799999986666776544
No 101
>TIGR03024 arch_pef_cterm PEF-C-terminal archaeal protein sorting domain. This domain, distantly related to the PEP-Cterm domain described in model TIGR02595, is found in Methanosarcina mazei in four different proteins, as well as in other archaea such as Methanococcoides burtonii. Several proteins with this domain have their genes only a short distance from a distant homology of EpsH, a proposed integral membrane transpeptidase.
Probab=22.98 E-value=36 Score=20.85 Aligned_cols=9 Identities=33% Similarity=0.718 Sum_probs=3.4
Q ss_pred Hhhheeeee
Q psy15334 99 LPVLFIIIR 107 (451)
Q Consensus 99 l~vll~~~r 107 (451)
+++++++.|
T Consensus 15 l~i~~i~~r 23 (26)
T TIGR03024 15 LAIIVILRR 23 (26)
T ss_pred HHHHHHHhh
Confidence 333333333
No 102
>PTZ00208 65 kDa invariant surface glycoprotein; Provisional
Probab=22.98 E-value=58 Score=33.46 Aligned_cols=11 Identities=18% Similarity=0.452 Sum_probs=6.2
Q ss_pred hheeeeeeccc
Q psy15334 101 VLFIIIRRKIV 111 (451)
Q Consensus 101 vll~~~rrr~~ 111 (451)
+++|++|||..
T Consensus 406 ~~~~v~rrr~~ 416 (436)
T PTZ00208 406 FFIMVKRRRNS 416 (436)
T ss_pred hheeeeeccCC
Confidence 44556666553
No 103
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=22.95 E-value=52 Score=32.45 Aligned_cols=20 Identities=20% Similarity=0.562 Sum_probs=10.2
Q ss_pred hhHHHHHHHHhhheeeeeec
Q psy15334 90 VMGFLVLIVLPVLFIIIRRK 109 (451)
Q Consensus 90 ~gv~Liliil~vll~~~rrr 109 (451)
+|++++|+++++++.+.|||
T Consensus 238 LG~v~ll~l~Gii~~~~~r~ 257 (281)
T PF12768_consen 238 LGTVFLLVLIGIILAYIRRR 257 (281)
T ss_pred HHHHHHHHHHHHHHHHHHhh
Confidence 34445556666655444443
No 104
>PF02035 Coagulin: Coagulin; InterPro: IPR000275 Coagulogen is a gel-forming protein of hemolymph that hinders the spread of invaders by immobilising them [, ]. The protein contains a single 175- residue polypeptide chain; this is cleaved after Arg-18 and Arg-46 by a clotting enzyme contained in the hemocyte and activated by a bacterial endotoxin (lipopolysaccharide). Cleavage releases two chains of coagulin, A and B, linked by two disulphide bonds, together with the peptide C [, ]. Gel formation results from interlinking of coagulin molecules. Secondary structure prediction suggests the C peptide forms an alpha- helix, which is released during the proteolytic conversion of coagulogen to coagulin gel []. The beta-sheet structure and 16 half-cystines found in the molecule appear to yield a compact protein stable to acid and heat. Mammalian blood coagulation is based on the proteolytically induced polymerisation of fibrinogens. Initially, fibrin monomers noncovalently interact with each other. The resulting homopolymers are further stabilised when the plasma transglutaminase (TGase) intermolecularly cross-links epsilon-(gamma-glutamyl)lysine bonds. In crustaceans, hemolymph coagulation depends on the TGase-mediated cross-linking of specific plasma-clotting proteins, but without the proteolytic cascade. In horseshoe crabs, the proteolytic coagulation cascade triggered by lipopolysaccharides and beta-1,3-glucans leads to the conversion of coagulogen into coagulin, resulting in noncovalent coagulin homopolymers through head-to-tail interaction. Horseshoe crab TGase, however, does not cross-link coagulins intermolecularly. Recently, we found that coagulins are cross-linked on hemocyte cell surface proteins called proxins. This indicates that a cross-linking reaction at the final stage of hemolymph coagulation is an important innate immune system of horseshoe crabs [].; GO: 0042381 hemolymph coagulation, 0005576 extracellular region; PDB: 1AOC_A.
Probab=22.13 E-value=30 Score=29.62 Aligned_cols=28 Identities=21% Similarity=0.628 Sum_probs=15.7
Q ss_pred CCCCCccE-EEecCCCCCCHHHHHhHhhhccCC
Q psy15334 228 GYLSPRQY-IATQGPLCSTVDDFWRMCWESRTP 259 (451)
Q Consensus 228 g~~~~~~y-I~tQ~Pl~~T~~dFWrMVwe~~v~ 259 (451)
||....+| |+.|+|.. -|-+.|||++++
T Consensus 98 gyt~a~efrvivqapra----gfrqcvwqhkcr 126 (174)
T PF02035_consen 98 GYTVAGEFRVIVQAPRA----GFRQCVWQHKCR 126 (174)
T ss_dssp EE-TTS-EEEE--BCCC----TB-B---EEEET
T ss_pred cceecceEEEEEeCchh----hHHHHHHHhhhc
Confidence 66666676 88999986 688999999875
No 105
>TIGR02981 phageshock_pspE phage shock operon rhodanese PspE. Members of this very narrowly defined protein family are proteins active as rhodanese (EC 2.8.1.1) and found in the extended variants of the phage shock protein (psp operon) in Escherichia coli and a few closely related species. Note that the designation phage shock protein PspE has been applied, incorrectly, in many instances where the genome lacks the phage shock regulon entirely.
Probab=21.23 E-value=1.5e+02 Score=24.14 Aligned_cols=18 Identities=22% Similarity=0.517 Sum_probs=13.2
Q ss_pred CCCCEEEEcCCCCchhhhH
Q psy15334 354 AQHPIVVHCSAGVGRTGTF 372 (451)
Q Consensus 354 ~~~PivVHCs~GvGRtG~f 372 (451)
...||+|+|..|. ||...
T Consensus 57 ~~~~vvlyC~~G~-rS~~a 74 (101)
T TIGR02981 57 KNDTVKLYCNAGR-QSGMA 74 (101)
T ss_pred CCCeEEEEeCCCH-HHHHH
Confidence 3568999999884 66654
No 106
>PF13908 Shisa: Wnt and FGF inhibitory regulator
Probab=21.04 E-value=88 Score=28.36 Aligned_cols=8 Identities=13% Similarity=0.031 Sum_probs=4.3
Q ss_pred cchhhhHH
Q psy15334 86 SWLLVMGF 93 (451)
Q Consensus 86 ~~ii~gv~ 93 (451)
.++|+||+
T Consensus 78 ~~iivgvi 85 (179)
T PF13908_consen 78 TGIIVGVI 85 (179)
T ss_pred eeeeeehh
Confidence 45555554
No 107
>TIGR03154 sulfolob_CbsA cytochrome b558/566, subunit A. Members of this protein family are CbsA, one subunit of a highly glycosylated, heterodimeric, mono-heme cytochrome b558/566, found in Sulfolobus acidocaldarius and several other members of the Sulfolobales, a branch of the Crenarchaeota.
Probab=20.50 E-value=1.3e+02 Score=30.32 Aligned_cols=22 Identities=18% Similarity=0.534 Sum_probs=13.8
Q ss_pred chhhhHHHHHHHHhhheeeeee
Q psy15334 87 WLLVMGFLVLIVLPVLFIIIRR 108 (451)
Q Consensus 87 ~ii~gv~Liliil~vll~~~rr 108 (451)
+-|+|+++.++++++++.++||
T Consensus 444 wTVaGVvIAiVALV~l~~V~rr 465 (465)
T TIGR03154 444 VTIIGVVIAIVALVILYVVFRR 465 (465)
T ss_pred EEeehhHHHHHHHhheeEEecC
Confidence 4467776666666666666554
No 108
>PF11344 DUF3146: Protein of unknown function (DUF3146); InterPro: IPR021492 This family of proteins with unknown function appear to be restricted to Cyanobacteria.
Probab=20.12 E-value=71 Score=24.87 Aligned_cols=15 Identities=33% Similarity=0.541 Sum_probs=13.5
Q ss_pred CCCCCCCeeeEEEEE
Q psy15334 49 GPLKQGTTYRVKVRA 63 (451)
Q Consensus 49 gpL~p~~~Y~i~vrA 63 (451)
=.|+||..|.|.|||
T Consensus 65 Y~LEpGgdY~Ftira 79 (80)
T PF11344_consen 65 YQLEPGGDYSFTIRA 79 (80)
T ss_pred eeccCCCceEEEEec
Confidence 369999999999997
Done!