RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4348
(355 letters)
>gnl|CDD|214550 smart00194, PTPc, Protein tyrosine phosphatase, catalytic domain.
Length = 259
Score = 304 bits (782), Expect = e-103
Identities = 117/278 (42%), Positives = 164/278 (58%), Gaps = 23/278 (8%)
Query: 10 IELEFEDIDSKNLWAVVYQRIRTVSNAYDYTLDDAKRSENRHLNRYRDVIPYDHSRIALK 69
+E EFE +D D + A ENR NRY+DV+PYDH+R+ LK
Sbjct: 2 LEEEFEKLDRLKPD--------------DESCTVAAFPENRDKNRYKDVLPYDHTRVKLK 47
Query: 70 RC---SNDYINANLVEIEQANRKYILTQGPLPNTIAHFWVMVWEQNCKAIVMLNKIIEKN 126
+DYINA+ ++ + YI TQGPLP+T+ FW MVWEQ IVML +++EK
Sbjct: 48 PPPGEGSDYINASYIDGPNGPKAYIATQGPLPSTVEDFWRMVWEQKVTVIVMLTELVEKG 107
Query: 127 QLKCSQYWPTSSSSDLEFPDVNLSVHLDSEVNHSYFITRNIRVTDKESSKERHVILFHYT 186
+ KC+QYWP L + D ++V L S + R + VT+ S+ R V +HYT
Sbjct: 108 REKCAQYWPDEEGEPLTYGD--ITVTLKSVEKVDDYTIRTLEVTNTGCSETRTVTHYHYT 165
Query: 187 TWPDFGVPQSPTALLRFIRAVRKSGALDENMGPPIVHCSAGIGRSGTFILVDCVLKLISD 246
WPD GVP+SP ++L IRAVRKS + GP +VHCSAG+GR+GTFI +D +L+ +
Sbjct: 166 NWPDHGVPESPESILDLIRAVRKSQSTST--GPIVVHCSAGVGRTGTFIAIDILLQQLEA 223
Query: 247 GEINNVSVQEILLEMRHYRMGLIQTPDQLRFSYQAIIE 284
G+ V + EI+ E+R R G++QT +Q F Y+AI+E
Sbjct: 224 GK--EVDIFEIVKELRSQRPGMVQTEEQYIFLYRAILE 259
>gnl|CDD|238006 cd00047, PTPc, Protein tyrosine phosphatases (PTP) catalyze the
dephosphorylation of phosphotyrosine peptides; they
regulate phosphotyrosine levels in signal transduction
pathways. The depth of the active site cleft renders the
enzyme specific for phosphorylated Tyr (pTyr) residues,
instead of pSer or pThr. This family has a distinctive
active site signature motif, HCSAGxGRxG. Characterized
as either transmembrane, receptor-like or
non-transmembrane (soluble) PTPs. Receptor-like PTP
domains tend to occur in two copies in the cytoplasmic
region of the transmembrane proteins, only one copy may
be active.
Length = 231
Score = 302 bits (776), Expect = e-103
Identities = 101/237 (42%), Positives = 149/237 (62%), Gaps = 9/237 (3%)
Query: 51 HLNRYRDVIPYDHSRIALK---RCSNDYINANLVEIEQANRKYILTQGPLPNTIAHFWVM 107
NRY+D++PYDH+R+ LK +DYINA+ ++ + YI TQGPLPNT+ FW M
Sbjct: 1 KKNRYKDILPYDHTRVKLKPDDDEGSDYINASYIDGYNPPKAYIATQGPLPNTVEDFWRM 60
Query: 108 VWEQNCKAIVMLNKIIEKNQLKCSQYWPTSSSSDLEFPDVNLSVHLDSEVNHSYFITRNI 167
VWEQ IVML +++EK + KC+QYWP S L + +++V L SE + R +
Sbjct: 61 VWEQKVPVIVMLTELVEKGREKCAQYWPEEEGS-LTY--GDITVTLVSEEKLDDYTVRTL 117
Query: 168 RVTDKESSKERHVILFHYTTWPDFGVPQSPTALLRFIRAVRKSGALDENMGPPIVHCSAG 227
++++ + + R V F YT WPD GVP+SP +LL +R VRKS + GP +VHCSAG
Sbjct: 118 KLSNTGTGETRTVTHFQYTGWPDHGVPESPDSLLDLLRKVRKSQQQPGS-GPIVVHCSAG 176
Query: 228 IGRSGTFILVDCVLKLISDGEINNVSVQEILLEMRHYRMGLIQTPDQLRFSYQAIIE 284
+GR+GTFI +D +L+ + V + + + E+R R G++QT +Q F Y+AI+E
Sbjct: 177 VGRTGTFIAIDILLQRLEAE--GVVDIFQTVKELRSQRPGMVQTEEQYIFLYRAILE 231
>gnl|CDD|215717 pfam00102, Y_phosphatase, Protein-tyrosine phosphatase.
Length = 233
Score = 293 bits (752), Expect = 2e-99
Identities = 106/238 (44%), Positives = 145/238 (60%), Gaps = 7/238 (2%)
Query: 49 NRHLNRYRDVIPYDHSRIALKRC-SNDYINANLVEIEQANRKYILTQGPLPNTIAHFWVM 107
N NRY+D++PYDH+R+ L +DYINA+ ++ + R +I TQGPLPNTI FW M
Sbjct: 1 NLEKNRYKDILPYDHTRVKLTPGGPSDYINASYIDGYRKPRAFIATQGPLPNTIEDFWRM 60
Query: 108 VWEQNCKAIVMLNKIIEKNQLKCSQYWPTSSSSDLEFPDVNLSVHL-DSEVNHSYFITRN 166
VWEQ IVML ++ EK + KC QYWP L + ++SV L E + R
Sbjct: 61 VWEQKVTTIVMLTELEEKGREKCDQYWPEEEEGSLTY--GDISVTLVKEEEEEDDYTVRT 118
Query: 167 IRVTDKESSKERHVILFHYTTWPDFGVPQSPTALLRFIRAVRKSGALDENMGPPIVHCSA 226
+ + S + R V FHYT WPD GVP+SP +LL IR VRKS A + GP +VHCSA
Sbjct: 119 FELKNGGSEETRTVTHFHYTGWPDHGVPESPKSLLDLIRKVRKSQADPRS-GPIVVHCSA 177
Query: 227 GIGRSGTFILVDCVLKLISDGEINNVSVQEILLEMRHYRMGLIQTPDQLRFSYQAIIE 284
G+GR+GTFI +D +L+ + V V +I+ E+R R G++QT +Q F Y A++E
Sbjct: 178 GVGRTGTFIAIDILLQQLEAEGE--VDVFQIVKELRSQRPGMVQTEEQYIFLYDAVLE 233
>gnl|CDD|227886 COG5599, PTP2, Protein tyrosine phosphatase [Signal transduction
mechanisms].
Length = 302
Score = 172 bits (437), Expect = 3e-51
Identities = 89/259 (34%), Positives = 129/259 (49%), Gaps = 20/259 (7%)
Query: 38 DYTLDDAKRSENRHLNRYRDVIPYDHSRIALKRCS--NDYINANLVEIEQANRKYILTQG 95
++ D + S N NRY +++PY+H+R+ LK NDYINA+ ++ + KYI TQG
Sbjct: 42 RWSDDTSLSSTNYARNRYSNIVPYEHTRVHLKYGKSINDYINASYIKTPRG--KYIATQG 99
Query: 96 PLPNTIAHFWVMVWEQNC---KAIVMLNKIIEKNQLKCSQYWPTSSSSDLEFPDVNLSVH 152
P P TI FW MVW N IVML + E N+ KC QYWP L +
Sbjct: 100 PKPETIDDFWKMVW-HNVPNNGVIVMLTSLPEYNREKCDQYWPLGYDDTLIIGLRVIKQK 158
Query: 153 LDSEVNHSYFITRNIRVTDKESSKERHVILFHYTTWPDFGVPQSPTALLRFIRAVRKSGA 212
N + N +T + + F Y W DF VP +L I ++ S
Sbjct: 159 KYELFNDNIVNVHNFELTSINGPP-KKIHHFQYINWVDFNVP-DIRSLTEVIHSLNDSPV 216
Query: 213 LDENMGPPIVHCSAGIGRSGTFILVDCVLKLISDGEINNVSVQ-------EILLEMRHYR 265
GP IVHCSAG+GR+GTFI +D +L++ +D + + + +I+L +R R
Sbjct: 217 ---RTGPIIVHCSAGVGRTGTFIALDILLRMPNDTLNHTDTWEDTQDLIFQIVLSLRSQR 273
Query: 266 MGLIQTPDQLRFSYQAIIE 284
M ++Q Q +F Y A +E
Sbjct: 274 MKMVQNKTQFKFLYDAFLE 292
>gnl|CDD|165114 PHA02747, PHA02747, protein tyrosine phosphatase; Provisional.
Length = 312
Score = 146 bits (370), Expect = 2e-41
Identities = 77/258 (29%), Positives = 131/258 (50%), Gaps = 18/258 (6%)
Query: 33 VSNAYDYTLDDAKRSENRHLNRYRDVIPYDHSRIALKR---CSNDYINANLVEIEQANRK 89
+ +D + + ++ EN+ NRY D+ +DH+R+ L ++DYI+AN ++ + ++K
Sbjct: 35 ILKPFDGLIANFEKPENQPKNRYWDIPCWDHNRVILDSGGGSTSDYIHANWIDGFEDDKK 94
Query: 90 YILTQGPLPNTIAHFWVMVWEQNCKAIVMLNKIIEKN-QLKCSQYWPTSSSSDLEFPDVN 148
+I TQGP T A FW VW+++C IVML N + KC QYW + +++ D
Sbjct: 95 FIATQGPFAETCADFWKAVWQEHCSIIVMLTPTKGTNGEEKCYQYWCLNEDGNIDMEDFR 154
Query: 149 LSVHLDSEVNHSYFITRNIRVTDKESSKERHVILFHYTTWPDFGVPQSPTALLRFIRAV- 207
+ L + V Y +T I +TDK R + F + W + P ++FI+ +
Sbjct: 155 IET-LKTSVRAKYILT-LIEITDKILKDSRKISHFQCSEWFEDETPSDHPDFIKFIKIID 212
Query: 208 ---RKSGAL----DENMGPPIVHCSAGIGRSGTFILVD-CVLKLISDGEINNVSVQEILL 259
+KSG L D + P +VHCS G+G++G F VD C+ +L+ + + + +
Sbjct: 213 INRKKSGKLFNPKDALLCPIVVHCSDGVGKTGIFCAVDICLNQLV---KRKAICLAKTAE 269
Query: 260 EMRHYRMGLIQTPDQLRF 277
++R R I D F
Sbjct: 270 KIREQRHAGIMNFDDYLF 287
>gnl|CDD|222923 PHA02738, PHA02738, hypothetical protein; Provisional.
Length = 320
Score = 141 bits (356), Expect = 3e-39
Identities = 81/267 (30%), Positives = 131/267 (49%), Gaps = 24/267 (8%)
Query: 31 RTVSNAYDYTLDDAKRSENRHLNRYRDVIPYDHSRIAL--KRCSNDYINANLVEIEQANR 88
+ +S D T + K+ NR LNRY D + +DHSR+ L +R DYINAN V+ + +
Sbjct: 33 KVISEKVDGTFNAEKK--NRKLNRYLDAVCFDHSRVILPAERNRGDYINANYVDGFEYKK 90
Query: 89 KYILTQGPLPNTIAHFWVMVWEQNCKAIVMLNKIIEKNQLKCSQYWPTSSSSDLEFPDVN 148
K+I Q P T F+ M+W ++ + IVML K E + KC YW + F
Sbjct: 91 KFICGQAPTRQTCYDFYRMLWMEHVQIIVMLCKKKENGREKCFPYWSDVEQGSIRFGKFK 150
Query: 149 LSVHLDSEVNHSYFITRNIRVTDKESSKERHVILFHYTTWPDFGVPQSPTALLRFIRAVR 208
++ H +++ + +TD S+ + V F++T WPD VP++ + L F+ VR
Sbjct: 151 ITT--TQVETHPHYVKSTLLLTDGTSATQT-VTHFNFTAWPDHDVPKNTSEFLNFVLEVR 207
Query: 209 -----------KSGALDENMGPP--IVHCSAGIGRSGTFILVDCVLKLISDGEINNVSVQ 255
+ G + PP +VHC+AG+GR+ + +VD + VS+
Sbjct: 208 QCQKELAQESLQIG--HNRLQPPPIVVHCNAGLGRTPCYCVVDISISRFD--ACATVSIP 263
Query: 256 EILLEMRHYRMGLIQTPDQLRFSYQAI 282
I+ +R+ R + P Q F Y+A+
Sbjct: 264 SIVSSIRNQRYYSLFIPFQYFFCYRAV 290
>gnl|CDD|214649 smart00404, PTPc_motif, Protein tyrosine phosphatase, catalytic
domain motif.
Length = 105
Score = 125 bits (316), Expect = 8e-36
Identities = 46/105 (43%), Positives = 72/105 (68%), Gaps = 1/105 (0%)
Query: 180 VILFHYTTWPDFGVPQSPTALLRFIRAVRKSGALDENMGPPIVHCSAGIGRSGTFILVDC 239
V +HYT WPD GVP+SP ++L +RAV+K+ E+ GP +VHCSAG+GR+GTF+ +D
Sbjct: 2 VKHYHYTGWPDHGVPESPDSILELLRAVKKNLNQSESSGPVVVHCSAGVGRTGTFVAIDI 61
Query: 240 VLKLISDGEINNVSVQEILLEMRHYRMGLIQTPDQLRFSYQAIIE 284
+L+ + E V + + + E+R R G++QT +Q F Y+A++E
Sbjct: 62 LLQQLEA-EAGEVDIFDTVKELRSQRPGMVQTEEQYLFLYRALLE 105
>gnl|CDD|214469 smart00012, PTPc_DSPc, Protein tyrosine phosphatase, catalytic
domain, undefined specificity. Protein tyrosine
phosphatases. Homologues detected by this profile and
not by those of "PTPc" or "DSPc" are predicted to be
protein phosphatases with a similar fold to DSPs and
PTPs, yet with unpredicted specificities.
Length = 105
Score = 125 bits (316), Expect = 8e-36
Identities = 46/105 (43%), Positives = 72/105 (68%), Gaps = 1/105 (0%)
Query: 180 VILFHYTTWPDFGVPQSPTALLRFIRAVRKSGALDENMGPPIVHCSAGIGRSGTFILVDC 239
V +HYT WPD GVP+SP ++L +RAV+K+ E+ GP +VHCSAG+GR+GTF+ +D
Sbjct: 2 VKHYHYTGWPDHGVPESPDSILELLRAVKKNLNQSESSGPVVVHCSAGVGRTGTFVAIDI 61
Query: 240 VLKLISDGEINNVSVQEILLEMRHYRMGLIQTPDQLRFSYQAIIE 284
+L+ + E V + + + E+R R G++QT +Q F Y+A++E
Sbjct: 62 LLQQLEA-EAGEVDIFDTVKELRSQRPGMVQTEEQYLFLYRALLE 105
>gnl|CDD|165113 PHA02746, PHA02746, protein tyrosine phosphatase; Provisional.
Length = 323
Score = 131 bits (330), Expect = 2e-35
Identities = 81/301 (26%), Positives = 135/301 (44%), Gaps = 37/301 (12%)
Query: 14 FEDIDSKNLWAVVYQRIRTVSN-AYDYTLDDAKRSENRHLNRYRDVIPYDHSRIAL---- 68
F+ + V V + T + + EN NR+ D+ +DHSR+ +
Sbjct: 15 FDKTNHAKFCEFVLLEHAEVMDIPIRGTTNHFLKKENLKKNRFHDIPCWDHSRVVINAHE 74
Query: 69 -------------------KRCSNDYINANLVEIEQANRKYILTQGPLPNTIAHFWVMVW 109
+ + +YI+AN V+ + K+I QGP +T F+ ++
Sbjct: 75 SLKMFDVGDSDGKKIEVTSEDNAENYIHANFVDGFKEANKFICAQGPKEDTSEDFFKLIS 134
Query: 110 EQNCKAIVMLNKIIEKNQLKCSQYWPTSSSSDLEFPDVNLSVHLDSEVNHSYFITRNIRV 169
E + IV L I + ++ KC + W S+L F + LD S+ TR + +
Sbjct: 135 EHESQVIVSLTDIDDDDE-KCFELWTKEEDSELAFGRFVAKI-LDIIEELSFTKTR-LMI 191
Query: 170 TDKESSKERHVILFHYTTWPDFGVPQSPTALLRFI--------RAVRKSGALDENMGPPI 221
TDK S R + F + WPD G+P L I ++++ + +GP +
Sbjct: 192 TDKISDTSREIHHFWFPDWPDNGIPTGMAEFLELINKVNEEQAELIKQADNDPQTLGPIV 251
Query: 222 VHCSAGIGRSGTFILVDCVLKLISDGEINNVSVQEILLEMRHYRMGLIQTPDQLRFSYQA 281
VHCSAGIGR+GTF +D L+ + + V + EI+L++R R + P+Q F Y+A
Sbjct: 252 VHCSAGIGRAGTFCAIDNALEQLEKEK--EVCLGEIVLKIRKQRHSSVFLPEQYAFCYKA 309
Query: 282 I 282
+
Sbjct: 310 L 310
>gnl|CDD|165109 PHA02742, PHA02742, protein tyrosine phosphatase; Provisional.
Length = 303
Score = 128 bits (322), Expect = 2e-34
Identities = 67/252 (26%), Positives = 115/252 (45%), Gaps = 15/252 (5%)
Query: 39 YTLDDAKRSENRHLNRYRDVIPYDHSRIALK--RCSNDYINANLVEIEQANRKYILTQGP 96
++ +++ +N RY D +D +R+ LK +D+INA+ V+ A ++I TQ P
Sbjct: 42 FSCNESLELKNMKKCRYPDAPCFDRNRVILKIEDGGDDFINASYVDGHNAKGRFICTQAP 101
Query: 97 LPNTIAHFWVMVWEQNCKAIVMLNKIIEKNQLKCSQYWPTSSSSDLEFPDVNLSVHLDSE 156
L T FW +++ + IVM+ KI+E + C YW + +
Sbjct: 102 LEETALDFWQAIFQDQVRVIVMITKIMEDGKEACYPYWMPHERGKATHGE--FKIKTKKI 159
Query: 157 VNHSYFITRNIRVTDKESSKERHVILFHYTTWPDFGVPQSPTALLRFIRAVRKSGALDE- 215
+ + N+ +TD + + F Y WP G+P+ P L F+ AVR++ +
Sbjct: 160 KSFRNYAVTNLCLTDTNTGASLDIKHFAYEDWPHGGLPRDPNKFLDFVLAVREADLKADV 219
Query: 216 -------NMGPPI-VHCSAGIGRSGTFILVDCVLKLISDGEINNVSVQEILLEMRHYRMG 267
PPI VHCSAG+ R+G F +D + ++ I + + I+ ++R R
Sbjct: 220 DIKGENIVKEPPILVHCSAGLDRAGAFCAIDICISKYNERAI--IPLLSIVRDLRKQRHN 277
Query: 268 LIQTPDQLRFSY 279
+ P Q F Y
Sbjct: 278 CLSLPQQYIFCY 289
>gnl|CDD|165107 PHA02740, PHA02740, protein tyrosine phosphatase; Provisional.
Length = 298
Score = 73.5 bits (180), Expect = 9e-15
Identities = 51/254 (20%), Positives = 95/254 (37%), Gaps = 22/254 (8%)
Query: 42 DDAKRSENRHLNRYRD------VIPYDHSRIALKRCSNDYINANLVEIEQANRKYILTQG 95
D+A ++ + N+ +D + H RI L ++A V+ +K+I
Sbjct: 40 DEANKACAQAENKAKDENLALHITRLLHRRIKL-FNDEKVLDARFVDGYDFEQKFICIIN 98
Query: 96 PLPNTIAHFWVMVWEQNCKAIVMLNKIIEKNQLKCSQYWPTSSSSDLEFPDVNLSVHLDS 155
+ F + + + IV++++ +K +Q+W S +
Sbjct: 99 LCEDACDKFLQALSDNKVQIIVLISRHADKKCF--NQFW--SLKEGCVITSDKFQIETLE 154
Query: 156 EVNHSYFITRNIRVTDKESSKERHVILFHYTTWPDFGVPQSPTALLRFIRAVR------K 209
+ +F + +TDK + + + F YT WP G P A + F + +
Sbjct: 155 IIIKPHFNLTLLSLTDKF-GQAQKISHFQYTAWPADGFSHDPDAFIDFFCNIDDLCADLE 213
Query: 210 SGALDENMGPPIVHCSAGIGRSGTFILVD-CVLKLISDGEINNVSVQEILLEMRHYRMGL 268
D + P I+ C GI S F + D C + G + S+ L ++R + G
Sbjct: 214 KHKADGKIAPIIIDCIDGISSSAVFCVFDICATEFDKTGML---SIANALKKVRQKKYGC 270
Query: 269 IQTPDQLRFSYQAI 282
+ D F Y I
Sbjct: 271 MNCLDDYVFCYHLI 284
>gnl|CDD|225297 COG2453, CDC14, Predicted protein-tyrosine phosphatase [Signal
transduction mechanisms].
Length = 180
Score = 47.1 bits (112), Expect = 3e-06
Identities = 29/136 (21%), Positives = 43/136 (31%), Gaps = 23/136 (16%)
Query: 147 VNLSVHLDSEVNHSYFITRNIRVTDKESSKERHVILFHYTTWPDFGVPQSP--TALLRFI 204
+N V + E+ + + + +L D VP ++ FI
Sbjct: 48 INALVIEELELLGALY-------NVAIEENDGIQVLHLP--ILDGTVPDLEDLDKIVDFI 98
Query: 205 RAVRKSGALDENMGPPIVHCSAGIGRSGTFILVDCVLKLISDGEINNVSVQEILLEMRHY 264
G +VHC GIGRSGT I +L E + R
Sbjct: 99 EEALSKG------KKVVVHCQGGIGRSGTVIAAYLMLYG------GLSLADEAIAVKRRR 146
Query: 265 RMGLIQTPDQLRFSYQ 280
R G + T Q F +
Sbjct: 147 RPGAVVTEIQHLFELE 162
>gnl|CDD|214551 smart00195, DSPc, Dual specificity phosphatase, catalytic domain.
Length = 138
Score = 36.9 bits (86), Expect = 0.004
Identities = 13/37 (35%), Positives = 16/37 (43%), Gaps = 6/37 (16%)
Query: 199 ALLRFIRAVRKSGALDENMGPPIVHCSAGIGRSGTFI 235
+ FI G G +VHC AG+ RS T I
Sbjct: 66 EAVEFIEDAESKG------GKVLVHCQAGVSRSATLI 96
>gnl|CDD|238073 cd00127, DSPc, Dual specificity phosphatases (DSP); Ser/Thr and Tyr
protein phosphatases. Structurally similar to
tyrosine-specific phosphatases but with a shallower
active site cleft and a distinctive active site
signature motif, HCxxGxxR. Characterized as VHR- or
Cdc25-like.
Length = 139
Score = 36.4 bits (85), Expect = 0.005
Identities = 22/94 (23%), Positives = 38/94 (40%), Gaps = 11/94 (11%)
Query: 148 NLSVHLDSEVNHSYFITRNIRVTDKESSKERHVILFHYTTWPDFGVPQSPTA-----LLR 202
+ D E+ IT + V + ++ + F+Y P +P + +
Sbjct: 13 SYPAASDKELLKKLGITHVLNVAKEVPNENLFLSDFNYLYVPILDLPSQDISKYFDEAVD 72
Query: 203 FIRAVRKSGALDENMGPPIVHCSAGIGRSGTFIL 236
FI R+ G G +VHC AG+ RS T ++
Sbjct: 73 FIDDAREKG------GKVLVHCLAGVSRSATLVI 100
>gnl|CDD|216117 pfam00782, DSPc, Dual specificity phosphatase, catalytic domain.
Ser/Thr and Tyr protein phosphatases. The enzyme's
tertiary fold is highly similar to that of
tyrosine-specific phosphatases, except for a
"recognition" region.
Length = 131
Score = 35.3 bits (82), Expect = 0.012
Identities = 14/34 (41%), Positives = 17/34 (50%), Gaps = 6/34 (17%)
Query: 203 FIRAVRKSGALDENMGPPIVHCSAGIGRSGTFIL 236
FI + G G +VHC AGI RS T I+
Sbjct: 63 FIDDAIQKG------GKVLVHCQAGISRSATLII 90
>gnl|CDD|185273 PRK15375, PRK15375, pathogenicity island 1 effector protein StpP;
Provisional.
Length = 535
Score = 35.2 bits (80), Expect = 0.046
Identities = 35/147 (23%), Positives = 63/147 (42%), Gaps = 19/147 (12%)
Query: 98 PNTIAHFWVMVWEQNCKAIVMLNKIIEKNQLKCSQYWPTSSSSDLEFPDVNLSVHLDSEV 157
P+ + M+ E+ C +V+L ++Q++ Q P S F +V+ + S
Sbjct: 346 PDALEAHMKMLLEKECSCLVVLT---SEDQMQAKQLPPYFRGS-YTFGEVHTNSQKVSSA 401
Query: 158 NHSYFITR-NIRVTDKESSKERHVILFHYTTWPDFGVPQSPTALLRFIRAVRKSGALDEN 216
+ I + N++++ E K + + H WPD S L V+ S ++N
Sbjct: 402 SQGEAIDQYNMQLSCGE--KRYTIPVLHVKNWPDHQPLPSTDQLEYLADRVKNS---NQN 456
Query: 217 MGP---------PIVHCSAGIGRSGTF 234
P P++HC G+GR+GT
Sbjct: 457 GAPGRSSSDKHLPMIHCLGGVGRTGTM 483
>gnl|CDD|185524 PTZ00242, PTZ00242, protein tyrosine phosphatase; Provisional.
Length = 166
Score = 33.8 bits (78), Expect = 0.065
Identities = 31/103 (30%), Positives = 44/103 (42%), Gaps = 12/103 (11%)
Query: 177 ERHVILFHYTTWP-DFGVPQSPTALLRFIRAVRKSGALDENMGPPI-VHCSAGIGRSGTF 234
E++ I H WP D G P + ++R + + A I VHC AG+GR
Sbjct: 58 EKNGIEVH--DWPFDDGAPPPKAVIDNWLRLLDQEFAKQSTPPETIAVHCVAGLGR--AP 113
Query: 235 ILVDCVLKLISDGEINNVSVQEILLEMRHYRMGLIQTPDQLRF 277
ILV L L+ G + + + E R G I QL+F
Sbjct: 114 ILV--ALALVEYGGMEPLDAVGFVREKRK---GAINQT-QLQF 150
>gnl|CDD|203311 pfam05706, CDKN3, Cyclin-dependent kinase inhibitor 3 (CDKN3).
This family consists of cyclin-dependent kinase
inhibitor 3 or kinase associated phosphatase proteins
from several mammalian species. The cyclin-dependent
kinase (Cdk)-associated protein phosphatase (KAP) is a
human dual specificity protein phosphatase that
dephosphorylates Cdk2 on threonine 160 in a
cyclin-dependent manner.
Length = 168
Score = 33.1 bits (75), Expect = 0.097
Identities = 31/132 (23%), Positives = 55/132 (41%), Gaps = 17/132 (12%)
Query: 126 NQLKCSQYWPTSSSSDLEFPDVNLSVHLDSEVNHSY-------FITR----NIRVTDKES 174
+++ CSQ+ + +F DV +V D+E S F TR RV +
Sbjct: 35 SRVNCSQFLGLCALPGCKFKDVRRNVQKDTEELKSCGIQDIFVFCTRGELSKYRVPNLLD 94
Query: 175 SKERHVILFHYTTWPDFGVPQSPTALLRFIRAVRKSGALDENMGPPIVHCSAGIGRSGTF 234
++ I+ H+ D G P + + + +N ++HC G+GRS
Sbjct: 95 LYQQCGIITHHHPIADGGTPDIASCC----EIMEELTTCLKNYRKTLIHCYGGLGRS--C 148
Query: 235 ILVDCVLKLISD 246
++ C+L +SD
Sbjct: 149 LVAACLLLYLSD 160
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this family
(Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 31.5 bits (72), Expect = 0.14
Identities = 13/32 (40%), Positives = 20/32 (62%)
Query: 321 RKRKTEEKAERIAEKVKQIKKKQSEAEDKKKK 352
++RK EEKA+R E + K+ E E+K +K
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREEIEEKLEK 32
>gnl|CDD|233208 TIGR00956, 3a01205, Pleiotropic Drug Resistance (PDR) Family
protein. [Transport and binding proteins, Other].
Length = 1394
Score = 31.6 bits (72), Expect = 0.79
Identities = 21/69 (30%), Positives = 30/69 (43%), Gaps = 6/69 (8%)
Query: 135 PTSSSSDL--EFPDVNLSVHLDSEVNHSYFITRNIRVTDKESSKERHVILFHYTTWPDFG 192
S+DL E DVN ++ E F RN+ K ++R VIL + W
Sbjct: 731 EVLGSTDLTDESDDVNDEKDMEKESGEDIFHWRNLTYEVKIKKEKR-VILNNVDGWV--- 786
Query: 193 VPQSPTALL 201
P + TAL+
Sbjct: 787 KPGTLTALM 795
>gnl|CDD|183703 PRK12724, PRK12724, flagellar biosynthesis regulator FlhF;
Provisional.
Length = 432
Score = 31.1 bits (70), Expect = 0.83
Identities = 13/36 (36%), Positives = 25/36 (69%)
Query: 320 PRKRKTEEKAERIAEKVKQIKKKQSEAEDKKKKSLE 355
P K+ + EK ER + +K++ K++S E ++KK+L+
Sbjct: 57 PEKQASREKIERKLQDLKELLKQKSYTEPERKKTLQ 92
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 31.0 bits (70), Expect = 0.97
Identities = 11/33 (33%), Positives = 17/33 (51%), Gaps = 2/33 (6%)
Query: 322 KRKTEEKAERIAEKVKQI--KKKQSEAEDKKKK 352
++ E+ A++ EK KQ K + AE K K
Sbjct: 104 AKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKA 136
Score = 31.0 bits (70), Expect = 1.0
Identities = 13/44 (29%), Positives = 23/44 (52%), Gaps = 12/44 (27%)
Query: 321 RKRKTEEKAERIAEKVKQIK------------KKQSEAEDKKKK 352
++K +E+A++ AE+ + K KK++EAE K K
Sbjct: 140 AEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKA 183
Score = 29.4 bits (66), Expect = 2.8
Identities = 13/39 (33%), Positives = 20/39 (51%), Gaps = 4/39 (10%)
Query: 321 RKRKTEEKAER----IAEKVKQIKKKQSEAEDKKKKSLE 355
K K E +AE+ A+K + + K A + KKK+ E
Sbjct: 132 AKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAE 170
Score = 29.4 bits (66), Expect = 3.0
Identities = 14/36 (38%), Positives = 21/36 (58%), Gaps = 1/36 (2%)
Query: 321 RKRKTEEKAERIAEKV-KQIKKKQSEAEDKKKKSLE 355
+R EKA + AE+ KQ ++KQ +AE+ K K
Sbjct: 95 EQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAA 130
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 31.1 bits (71), Expect = 0.99
Identities = 12/31 (38%), Positives = 16/31 (51%)
Query: 322 KRKTEEKAERIAEKVKQIKKKQSEAEDKKKK 352
+RK E+KAE+ + KKK A K K
Sbjct: 418 QRKAEKKAEKEEAEKAAAKKKAEAAAKKAKG 448
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 30.9 bits (70), Expect = 1.1
Identities = 15/37 (40%), Positives = 21/37 (56%), Gaps = 3/37 (8%)
Query: 322 KRKTEEKAERIAEKVKQIK---KKQSEAEDKKKKSLE 355
K K E +A+R A K+ KK++EAE KK + E
Sbjct: 146 KAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAE 182
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 29.7 bits (68), Expect = 2.3
Identities = 9/35 (25%), Positives = 24/35 (68%)
Query: 321 RKRKTEEKAERIAEKVKQIKKKQSEAEDKKKKSLE 355
R+ + E+K + + +K ++++KK+ E E+ ++ L+
Sbjct: 108 REEELEKKEKELEQKQQELEKKEEELEELIEEQLQ 142
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex together
with SecB and SecD on the surface of Salmonella. All
these proteins are secreted by the type III secretion
system. Many mucosal pathogens use type III secretion
systems for the injection of effector proteins into
target cells. SecB, SseC and SecD are inserted into the
target cell membrane. where they form a small pore or
translocon. In addition to SseC, this family includes
the bacterial secreted proteins PopB, PepB, YopB and
EspD which are thought to be directly involved in pore
formation, and type III secretion system translocon.
Length = 303
Score = 29.4 bits (66), Expect = 2.7
Identities = 10/33 (30%), Positives = 20/33 (60%)
Query: 322 KRKTEEKAERIAEKVKQIKKKQSEAEDKKKKSL 354
+R +++ ++ E +QIKK +AE+ KK +
Sbjct: 25 ERARDKQEKKAEEYQEQIKKAIEKAEEAKKSGI 57
>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
production and conversion].
Length = 108
Score = 27.8 bits (62), Expect = 3.7
Identities = 12/37 (32%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 321 RKRKTEEKAERIAEKVKQ--IKKKQSEAEDKKKKSLE 355
R+ E+AE AEK+ Q +++ + EAE++ ++ L
Sbjct: 40 EAREIIEEAEEEAEKLAQEILEEAREEAEEEAEEILA 76
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 28.1 bits (63), Expect = 3.7
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 326 EEKAERIAEKVKQIKKKQSEAEDKKKKS 353
E+K + K K +KKQ + + KKKK
Sbjct: 97 EQKIVKQVLKEKAKQKKQKKKKKKKKKK 124
>gnl|CDD|217555 pfam03433, EspA, EspA-like secreted protein. EspA is the
prototypical member of this family. EspA, together with
EspB, EspD and Tir are exported by a type III secretion
system. These proteins are essential for attaching and
effacing lesion formation. EspA is a structural protein
and a major component of a large, transiently expressed,
filamentous surface organelle which forms a direct link
between the bacterium and the host cell.
Length = 188
Score = 28.3 bits (63), Expect = 4.4
Identities = 13/46 (28%), Positives = 21/46 (45%), Gaps = 4/46 (8%)
Query: 309 NGSTAGVTNNNPRKRKTEEKAERIAEKVKQIKKKQSEAEDKKKKSL 354
N A + R R +E A R+ + ++ K ++DKKK L
Sbjct: 58 NAKFAQMQKKAERSRNAQEMANRVDAVIAKVAK----SDDKKKVKL 99
>gnl|CDD|220979 pfam11101, DUF2884, Protein of unknown function (DUF2884). Some
members in this bacterial family of proteins are
annotated as YggN which currently has no known function.
Length = 228
Score = 28.5 bits (64), Expect = 5.1
Identities = 16/85 (18%), Positives = 34/85 (40%), Gaps = 3/85 (3%)
Query: 273 DQLRFSYQAI---IEGIHTDWETDDEFFVFHSDSSLINGNGSTAGVTNNNPRKRKTEEKA 329
D QA + ++E E V S S++ GS + E++
Sbjct: 116 DGFVLGAQAFSSLDDTFDEEFEQAIEQLVQQSIGSILIALGSEMMSSEGGNNLTAFEQRM 175
Query: 330 ERIAEKVKQIKKKQSEAEDKKKKSL 354
+ + +++++ + QS+A + K L
Sbjct: 176 DGLGQEIEEEMESQSQALEAKADEL 200
>gnl|CDD|197913 smart00845, GatB_Yqey, GatB domain. This domain is found in GatB
and proteins related to bacterial Yqey. It is about 140
amino acid residues long. This domain is found at the C
terminus of GatB which transamidates Glu-tRNA to
Gln-tRNA. The function of this domain is uncertain. It
does however suggest that Yqey and its relatives have a
role in tRNA metabolism.
Length = 147
Score = 27.9 bits (63), Expect = 5.1
Identities = 10/22 (45%), Positives = 15/22 (68%)
Query: 240 VLKLISDGEINNVSVQEILLEM 261
+LKLI DG I+ +E+L E+
Sbjct: 47 LLKLIDDGTISGKIAKEVLEEL 68
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 27.8 bits (62), Expect = 5.3
Identities = 10/32 (31%), Positives = 19/32 (59%)
Query: 324 KTEEKAERIAEKVKQIKKKQSEAEDKKKKSLE 355
K +++ + E+ + IK K+ + E K+KK E
Sbjct: 97 KEQKEVSKETEEKEAIKAKKEKKEKKEKKVAE 128
>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN. All proteins in this
family for which functions are known are ATP binding
proteins involved in the initiation of recombination and
recombinational repair [DNA metabolism, DNA replication,
recombination, and repair].
Length = 563
Score = 28.5 bits (64), Expect = 5.4
Identities = 16/58 (27%), Positives = 26/58 (44%), Gaps = 12/58 (20%)
Query: 33 VSNAYDYTLDDAKRSENRHLNRYRDVIPYDHSRIALKRCSNDYINANLVEIEQANRKY 90
V NA +++A R L Y D + +D R+ + I L +I++ RKY
Sbjct: 275 VGNAL-TEVEEATRE----LQNYLDELEFDPERL-------NEIEERLAQIKRLKRKY 320
>gnl|CDD|144538 pfam00981, Rota_NS53, Rotavirus RNA-binding Protein 53 (NS53).
This protein is also known as NSP1. NS53 is encoded by
gene 5. It is made in low levels in the infected cells
and is a component of early replication. The protein is
known to accumulate on the cytoskeleton of the infected
cell. NS53 is an RNA binding protein that contains a
characteristic cysteine rich region.
Length = 488
Score = 28.5 bits (64), Expect = 5.9
Identities = 16/45 (35%), Positives = 22/45 (48%), Gaps = 7/45 (15%)
Query: 181 ILFHYTTWPDFGVPQSPTALLRFIRAVRKSGALDENMGPPIVHCS 225
I + W DF + + L+ FIRA+ KS N+G HCS
Sbjct: 326 INNSFKVWNDFRIRKIYNNLMSFIRALVKSNV---NVG----HCS 363
>gnl|CDD|217159 pfam02637, GatB_Yqey, GatB domain. This domain is found in GatB.
It is about 140 amino acid residues long. This domain is
found at the C terminus of GatB, which transamidates
Glu-tRNA to Gln-tRNA.
Length = 148
Score = 27.5 bits (62), Expect = 6.9
Identities = 8/22 (36%), Positives = 16/22 (72%)
Query: 240 VLKLISDGEINNVSVQEILLEM 261
++KLI +G+I+ +E+L E+
Sbjct: 48 LIKLIDEGKISGKIAKEVLEEL 69
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 27.6 bits (62), Expect = 7.3
Identities = 5/28 (17%), Positives = 16/28 (57%)
Query: 326 EEKAERIAEKVKQIKKKQSEAEDKKKKS 353
+E+ +IA+ + + ++ + EA+ +
Sbjct: 36 DERQAKIADDLAEAERLKEEAQALLAEY 63
>gnl|CDD|219633 pfam07905, PucR, Purine catabolism regulatory protein-like family.
The bacterial proteins found in this family are similar
to the purine catabolism regulatory protein expressed by
Bacillus subtilis (PucR). PucR is thought to be a
transcriptional activator involved in the induction of
the purine degradation pathway, and may contain a
LysR-like DNA-binding domain. It is similar to LysR-type
regulators in that it represses its own expression. The
other members of this family are also annotated as being
putative regulatory proteins.
Length = 122
Score = 27.1 bits (61), Expect = 7.5
Identities = 19/98 (19%), Positives = 37/98 (37%), Gaps = 29/98 (29%)
Query: 191 FGVPQSPTALLRFIRAVRKSGALDENMGPPIVHCSAGIGRSGTFILVDCVLKLISDGEIN 250
+G+ P AL F+R + ++GA AG+G I + I +
Sbjct: 51 YGLKDDPEALREFVRELAEAGA-------------AGLG-----IKTGRYIPEIPE---- 88
Query: 251 NVSVQEILLEMRHYRMGLIQTPDQLRFSYQAIIEGIHT 288
E++ + LI+ P ++ F I + + +
Sbjct: 89 -----ELIAAANRLGLPLIELPREVPFV--DITQEVMS 119
>gnl|CDD|131972 TIGR02926, AhaH, ATP synthase archaeal, H subunit. he A1/A0 ATP
synthase is homologous to the V-type (V1/V0, vacuolar)
ATPase, but functions in the ATP synthetic direction as
does the F1/F0 ATPase of bacteria. The hydrophilic A1
"stalk" complex (AhaABCDEFG) is the site of ATP
generation and is coupled to the membrane-embedded
proton translocating A0 complex. It is unclear precisely
where AhaH fits into these complexes.
Length = 85
Score = 26.3 bits (58), Expect = 7.7
Identities = 12/37 (32%), Positives = 20/37 (54%), Gaps = 2/37 (5%)
Query: 321 RKRKTEEKAERIAEKVKQ--IKKKQSEAEDKKKKSLE 355
R+ E+AE A K+ + IK+ + E E + +K E
Sbjct: 32 EARELLEEAEEEASKLGEEIIKEAEEEIEKEAEKIRE 68
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 27.5 bits (61), Expect = 7.8
Identities = 12/32 (37%), Positives = 20/32 (62%)
Query: 321 RKRKTEEKAERIAEKVKQIKKKQSEAEDKKKK 352
++ K K ++ EK++ KKKQ E E ++KK
Sbjct: 88 QREKELAKRQKELEKIELSKKKQKERERRRKK 119
>gnl|CDD|237205 PRK12792, flhA, flagellar biosynthesis protein FlhA; Reviewed.
Length = 694
Score = 28.2 bits (63), Expect = 8.1
Identities = 15/46 (32%), Positives = 23/46 (50%), Gaps = 1/46 (2%)
Query: 310 GSTAGVTNNNPRKRKTEEKAERIAEKVKQIKKKQSEAEDKKKKSLE 355
G A V PR+R AE A+ ++ + Q+EA+D K+ L
Sbjct: 314 GVMAFVAYTIPRRRAARAAAEA-AKVKREEESAQAEAKDSVKEQLR 358
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 27.5 bits (62), Expect = 8.5
Identities = 8/35 (22%), Positives = 22/35 (62%)
Query: 321 RKRKTEEKAERIAEKVKQIKKKQSEAEDKKKKSLE 355
++ EEK + +A + +Q+++K+ E E+ ++ +
Sbjct: 104 KEESLEEKEKELAARQQQLEEKEEELEELIEEQQQ 138
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 26.8 bits (60), Expect = 9.2
Identities = 8/37 (21%), Positives = 18/37 (48%)
Query: 316 TNNNPRKRKTEEKAERIAEKVKQIKKKQSEAEDKKKK 352
+ +E + IA++ + +KKQ E + + +K
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEK 37
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 27.7 bits (62), Expect = 9.3
Identities = 11/35 (31%), Positives = 20/35 (57%)
Query: 321 RKRKTEEKAERIAEKVKQIKKKQSEAEDKKKKSLE 355
+R+ E + E E +Q++ KQ E++K +S E
Sbjct: 126 EERRVERRRELGLEDPEQLRLKQKAKEEQKAESEE 160
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.132 0.389
Gapped
Lambda K H
0.267 0.0833 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,988,139
Number of extensions: 1738597
Number of successful extensions: 2892
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2809
Number of HSP's successfully gapped: 100
Length of query: 355
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 257
Effective length of database: 6,590,910
Effective search space: 1693863870
Effective search space used: 1693863870
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (26.7 bits)