RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy15334
(451 letters)
>gnl|CDD|238006 cd00047, PTPc, Protein tyrosine phosphatases (PTP) catalyze the
dephosphorylation of phosphotyrosine peptides; they
regulate phosphotyrosine levels in signal transduction
pathways. The depth of the active site cleft renders the
enzyme specific for phosphorylated Tyr (pTyr) residues,
instead of pSer or pThr. This family has a distinctive
active site signature motif, HCSAGxGRxG. Characterized
as either transmembrane, receptor-like or
non-transmembrane (soluble) PTPs. Receptor-like PTP
domains tend to occur in two copies in the cytoplasmic
region of the transmembrane proteins, only one copy may
be active.
Length = 231
Score = 309 bits (794), Expect = e-104
Identities = 116/234 (49%), Positives = 153/234 (65%), Gaps = 7/234 (2%)
Query: 192 PKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDFWR 251
KNRY +I PYD +R KL+P DD+EGSDY+NA+YI GY P+ YIATQGPL +TV+DFWR
Sbjct: 1 KKNRYKDILPYDHTRVKLKP-DDDEGSDYINASYIDGYNPPKAYIATQGPLPNTVEDFWR 59
Query: 252 MCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYGSWVLSELRM 311
M WE + P IVMLT+ +EK R+KC +YWP ++ + DI ++L + + L++
Sbjct: 60 MVWEQKVPVIVMLTELVEKGREKCAQYWPEEEGSLTYGDITVTLVSEEKLDDYTVRTLKL 119
Query: 312 CRGG--EHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHIS-PAQHPIVVHCSAGVGR 368
G E R ++H+ +T WPD GVP SP++L+ + R P PIVVHCSAGVGR
Sbjct: 120 SNTGTGETRTVTHFQYTGWPDHGVPESPDSLLDLLRKVRKSQQQPGSGPIVVHCSAGVGR 179
Query: 369 TGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQ 422
TGTFIA+D +LQRL G VDI + V LR QR MVQ+ QY F+Y +
Sbjct: 180 TGTFIAIDILLQRLEAE---GVVDIFQTVKELRSQRPGMVQTEEQYIFLYRAIL 230
>gnl|CDD|214550 smart00194, PTPc, Protein tyrosine phosphatase, catalytic domain.
Length = 259
Score = 308 bits (790), Expect = e-103
Identities = 121/257 (47%), Positives = 160/257 (62%), Gaps = 8/257 (3%)
Query: 169 EFEDLKSVGRG-QPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIG 227
EFE L + + T A NR KNRY ++ PYD +R KL+P EGSDY+NA+YI
Sbjct: 5 EFEKLDRLKPDDESCTVAAFPENRDKNRYKDVLPYDHTRVKLKP-PPGEGSDYINASYID 63
Query: 228 GYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQH-PM 286
G P+ YIATQGPL STV+DFWRM WE + IVMLT+ +EK R+KC +YWP ++ P+
Sbjct: 64 GPNGPKAYIATQGPLPSTVEDFWRMVWEQKVTVIVMLTELVEKGREKCAQYWPDEEGEPL 123
Query: 287 RCADIEISLQDVCHYGSWVLS--ELRMCRGGEHRMISHYHFTTWPDSGVPSSPNTLVSFV 344
DI ++L+ V + + E+ E R ++HYH+T WPD GVP SP +++ +
Sbjct: 124 TYGDITVTLKSVEKVDDYTIRTLEVTNTGCSETRTVTHYHYTNWPDHGVPESPESILDLI 183
Query: 345 LSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQR 404
+ R S + PIVVHCSAGVGRTGTFIA+D +LQ+L VDI IV LR QR
Sbjct: 184 RAVRKSQSTSTGPIVVHCSAGVGRTGTFIAIDILLQQLEAG---KEVDIFEIVKELRSQR 240
Query: 405 VLMVQSLAQYAFIYECV 421
MVQ+ QY F+Y +
Sbjct: 241 PGMVQTEEQYIFLYRAI 257
>gnl|CDD|215717 pfam00102, Y_phosphatase, Protein-tyrosine phosphatase.
Length = 233
Score = 284 bits (729), Expect = 1e-94
Identities = 113/238 (47%), Positives = 148/238 (62%), Gaps = 11/238 (4%)
Query: 190 NRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDF 249
N KNRY +I PYD +R KL P SDY+NA+YI GY PR +IATQGPL +T++DF
Sbjct: 1 NLEKNRYKDILPYDHTRVKLTPGGP---SDYINASYIDGYRKPRAFIATQGPLPNTIEDF 57
Query: 250 WRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQH-PMRCADIEISLQDVC-HYGSWVLS 307
WRM WE + IVMLT+ EK R+KC +YWP ++ + DI ++L + +
Sbjct: 58 WRMVWEQKVTTIVMLTELEEKGREKCDQYWPEEEEGSLTYGDISVTLVKEEEEEDDYTVR 117
Query: 308 ELRMCRGG--EHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTH-ISPAQHPIVVHCSA 364
+ GG E R ++H+H+T WPD GVP SP +L+ + R P PIVVHCSA
Sbjct: 118 TFELKNGGSEETRTVTHFHYTGWPDHGVPESPKSLLDLIRKVRKSQADPRSGPIVVHCSA 177
Query: 365 GVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQ 422
GVGRTGTFIA+D +LQ+L E G VD+ +IV LR QR MVQ+ QY F+Y+ V
Sbjct: 178 GVGRTGTFIAIDILLQQL---EAEGEVDVFQIVKELRSQRPGMVQTEEQYIFLYDAVL 232
>gnl|CDD|227886 COG5599, PTP2, Protein tyrosine phosphatase [Signal transduction
mechanisms].
Length = 302
Score = 177 bits (451), Expect = 2e-52
Identities = 86/252 (34%), Positives = 124/252 (49%), Gaps = 30/252 (11%)
Query: 193 KNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDFWRM 252
+NRY+NI PY+ +R L+ + +DY+NA+YI +YIATQGP T+DDFW+M
Sbjct: 56 RNRYSNIVPYEHTRVHLKYG--KSINDYINASYIK--TPRGKYIATQGPKPETIDDFWKM 111
Query: 253 CWESRTPA--IVMLTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCHYGSWVLSELR 310
W + IVMLT E +R+KC +YWP + + Q + + +
Sbjct: 112 VWHNVPNNGVIVMLTSLPEYNREKCDQYWPLGYDDTLIIGLRVIKQKKYELFNDNIVNVH 171
Query: 311 MCR----GGEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHI-----SPAQ-HPIVV 360
G + I H+ + W D VP + S I SP + PI+V
Sbjct: 172 NFELTSINGPPKKIHHFQYINWVDFNVPD--------IRSLTEVIHSLNDSPVRTGPIIV 223
Query: 361 HCSAGVGRTGTFIALDTVLQRLYNSEGAGSVD------ILRIVHGLRRQRVLMVQSLAQY 414
HCSAGVGRTGTFIALD +L+ ++ I +IV LR QR+ MVQ+ Q+
Sbjct: 224 HCSAGVGRTGTFIALDILLRMPNDTLNHTDTWEDTQDLIFQIVLSLRSQRMKMVQNKTQF 283
Query: 415 AFIYECVQVYLQ 426
F+Y+ +
Sbjct: 284 KFLYDAFLELNK 295
>gnl|CDD|222923 PHA02738, PHA02738, hypothetical protein; Provisional.
Length = 320
Score = 165 bits (418), Expect = 3e-47
Identities = 85/259 (32%), Positives = 125/259 (48%), Gaps = 20/259 (7%)
Query: 183 TAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPL 242
T + NR NRY + +D SR L P + G DY+NANY+ G+ +++I Q P
Sbjct: 42 TFNAEKKNRKLNRYLDAVCFDHSRVIL-PAERNRG-DYINANYVDGFEYKKKFICGQAPT 99
Query: 243 CSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPS-DQHPMRCADIEISLQDVCHY 301
T DF+RM W IVML + E R+KC+ YW +Q +R +I+ V +
Sbjct: 100 RQTCYDFYRMLWMEHVQIIVMLCKKKENGREKCFPYWSDVEQGSIRFGKFKITTTQVETH 159
Query: 302 GSWVLSELRMCRG-GEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRT----------- 349
+V S L + G + ++H++FT WPD VP + + ++FVL R
Sbjct: 160 PHYVKSTLLLTDGTSATQTVTHFNFTAWPDHDVPKNTSEFLNFVLEVRQCQKELAQESLQ 219
Query: 350 --HISPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLM 407
H PIVVHC+AG+GRT + +D + R +V I IV +R QR
Sbjct: 220 IGHNRLQPPPIVVHCNAGLGRTPCYCVVDISISRFDACA---TVSIPSIVSSIRNQRYYS 276
Query: 408 VQSLAQYAFIYECVQVYLQ 426
+ QY F Y V+ Y+
Sbjct: 277 LFIPFQYFFCYRAVKRYVN 295
>gnl|CDD|165109 PHA02742, PHA02742, protein tyrosine phosphatase; Provisional.
Length = 303
Score = 148 bits (375), Expect = 3e-41
Identities = 76/249 (30%), Positives = 126/249 (50%), Gaps = 19/249 (7%)
Query: 190 NRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDF 249
N K RY + +D +R L ++D G D++NA+Y+ G+ + ++I TQ PL T DF
Sbjct: 52 NMKKCRYPDAPCFDRNRVIL-KIEDG-GDDFINASYVDGHNAKGRFICTQAPLEETALDF 109
Query: 250 WRMCWESRTPAIVMLTQCIEKHRDKCYKYW-PSDQHPMRCADIEISLQDVCHYGSWVLSE 308
W+ ++ + IVM+T+ +E ++ CY YW P ++ + +I + + + ++ ++
Sbjct: 110 WQAIFQDQVRVIVMITKIMEDGKEACYPYWMPHERGKATHGEFKIKTKKIKSFRNYAVTN 169
Query: 309 LRMCRG--GEHRMISHYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPA-----------Q 355
L + G I H+ + WP G+P PN + FVL+ R A +
Sbjct: 170 LCLTDTNTGASLDIKHFAYEDWPHGGLPRDPNKFLDFVLAVREADLKADVDIKGENIVKE 229
Query: 356 HPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYA 415
PI+VHCSAG+ R G F A+D + YN + +L IV LR+QR + QY
Sbjct: 230 PPILVHCSAGLDRAGAFCAIDICIS-KYNE--RAIIPLLSIVRDLRKQRHNCLSLPQQYI 286
Query: 416 FIYECVQVY 424
F Y V ++
Sbjct: 287 FCYFIVLIF 295
>gnl|CDD|165114 PHA02747, PHA02747, protein tyrosine phosphatase; Provisional.
Length = 312
Score = 143 bits (362), Expect = 2e-39
Identities = 74/250 (29%), Positives = 117/250 (46%), Gaps = 18/250 (7%)
Query: 190 NRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDF 249
N+PKNRY +I +D +R L SDY++AN+I G+ +++IATQGP T DF
Sbjct: 51 NQPKNRYWDIPCWDHNRVILD-SGGGSTSDYIHANWIDGFEDDKKFIATQGPFAETCADF 109
Query: 250 WRMCWESRTPAIVMLTQCIEKH-RDKCYKYW-PSDQHPMRCADIEISLQDVCHYGSWVLS 307
W+ W+ IVMLT + +KCY+YW ++ + D I ++L+
Sbjct: 110 WKAVWQEHCSIIVMLTPTKGTNGEEKCYQYWCLNEDGNIDMEDFRIETLKTSVRAKYILT 169
Query: 308 ELRMCR--GGEHRMISHYHFTTWPDSGVPSSPNTLVSFV-------LSFRTHISPAQH-- 356
+ + + R ISH+ + W + PS + F+ +P
Sbjct: 170 LIEITDKILKDSRKISHFQCSEWFEDETPSDHPDFIKFIKIIDINRKKSGKLFNPKDALL 229
Query: 357 -PIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYA 415
PIVVHCS GVG+TG F A+D L +L + ++ + + +R QR + + Y
Sbjct: 230 CPIVVHCSDGVGKTGIFCAVDICLNQLVKRK---AICLAKTAEKIREQRHAGIMNFDDYL 286
Query: 416 FIYECVQVYL 425
FI +V
Sbjct: 287 FIQPGYEVLH 296
>gnl|CDD|214649 smart00404, PTPc_motif, Protein tyrosine phosphatase, catalytic
domain motif.
Length = 105
Score = 130 bits (328), Expect = 9e-37
Identities = 50/105 (47%), Positives = 69/105 (65%), Gaps = 4/105 (3%)
Query: 319 MISHYHFTTWPDSGVPSSPNTLVSFVLSFRTH--ISPAQHPIVVHCSAGVGRTGTFIALD 376
+ HYH+T WPD GVP SP++++ + + + + S + P+VVHCSAGVGRTGTF+A+D
Sbjct: 1 TVKHYHYTGWPDHGVPESPDSILELLRAVKKNLNQSESSGPVVVHCSAGVGRTGTFVAID 60
Query: 377 TVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECV 421
+LQ+L AG VDI V LR QR MVQ+ QY F+Y +
Sbjct: 61 ILLQQLEA--EAGEVDIFDTVKELRSQRPGMVQTEEQYLFLYRAL 103
>gnl|CDD|214469 smart00012, PTPc_DSPc, Protein tyrosine phosphatase, catalytic
domain, undefined specificity. Protein tyrosine
phosphatases. Homologues detected by this profile and
not by those of "PTPc" or "DSPc" are predicted to be
protein phosphatases with a similar fold to DSPs and
PTPs, yet with unpredicted specificities.
Length = 105
Score = 130 bits (328), Expect = 9e-37
Identities = 50/105 (47%), Positives = 69/105 (65%), Gaps = 4/105 (3%)
Query: 319 MISHYHFTTWPDSGVPSSPNTLVSFVLSFRTH--ISPAQHPIVVHCSAGVGRTGTFIALD 376
+ HYH+T WPD GVP SP++++ + + + + S + P+VVHCSAGVGRTGTF+A+D
Sbjct: 1 TVKHYHYTGWPDHGVPESPDSILELLRAVKKNLNQSESSGPVVVHCSAGVGRTGTFVAID 60
Query: 377 TVLQRLYNSEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECV 421
+LQ+L AG VDI V LR QR MVQ+ QY F+Y +
Sbjct: 61 ILLQQLEA--EAGEVDIFDTVKELRSQRPGMVQTEEQYLFLYRAL 103
>gnl|CDD|165113 PHA02746, PHA02746, protein tyrosine phosphatase; Provisional.
Length = 323
Score = 125 bits (315), Expect = 1e-32
Identities = 77/265 (29%), Positives = 125/265 (47%), Gaps = 36/265 (13%)
Query: 190 NRPKNRYTNIFPYDSSR--------YKLQPVDDEEG-----------SDYVNANYIGGYL 230
N KNR+ +I +D SR K+ V D +G +Y++AN++ G+
Sbjct: 51 NLKKNRFHDIPCWDHSRVVINAHESLKMFDVGDSDGKKIEVTSEDNAENYIHANFVDGFK 110
Query: 231 SPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCYKYWPSDQ-HPMRCA 289
++I QGP T +DF+++ E + IV LT I+ +KC++ W ++ +
Sbjct: 111 EANKFICAQGPKEDTSEDFFKLISEHESQVIVSLTD-IDDDDEKCFELWTKEEDSELAFG 169
Query: 290 DIEISLQDVCHYGSWVLSELRMCR--GGEHRMISHYHFTTWPDSGVPSSPN---TLVSFV 344
+ D+ S+ + L + R I H+ F WPD+G+P+ L++ V
Sbjct: 170 RFVAKILDIIEELSFTKTRLMITDKISDTSREIHHFWFPDWPDNGIPTGMAEFLELINKV 229
Query: 345 LSFRTHI-------SPAQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIV 397
+ + PIVVHCSAG+GR GTF A+D L++L E V + IV
Sbjct: 230 NEEQAELIKQADNDPQTLGPIVVHCSAGIGRAGTFCAIDNALEQL---EKEKEVCLGEIV 286
Query: 398 HGLRRQRVLMVQSLAQYAFIYECVQ 422
+R+QR V QYAF Y+ ++
Sbjct: 287 LKIRKQRHSSVFLPEQYAFCYKALK 311
>gnl|CDD|165107 PHA02740, PHA02740, protein tyrosine phosphatase; Provisional.
Length = 298
Score = 80.4 bits (198), Expect = 8e-17
Identities = 42/217 (19%), Positives = 93/217 (42%), Gaps = 17/217 (7%)
Query: 221 VNANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHRDKCY-KYW 279
++A ++ GY +++I D F + +++ IV++++ +K KC+ ++W
Sbjct: 79 LDARFVDGYDFEQKFICIINLCEDACDKFLQALSDNKVQIIVLISRHADK---KCFNQFW 135
Query: 280 PSDQHPMRCAD-IEISLQDVCHYGSWVLSELRMC-RGGEHRMISHYHFTTWPDSGVPSSP 337
+ + +D +I ++ + L+ L + + G+ + ISH+ +T WP G P
Sbjct: 136 SLKEGCVITSDKFQIETLEIIIKPHFNLTLLSLTDKFGQAQKISHFQYTAWPADGFSHDP 195
Query: 338 NTLVSF-------VLSFRTHISPAQ-HPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAG 389
+ + F H + + PI++ C G+ + F D + G
Sbjct: 196 DAFIDFFCNIDDLCADLEKHKADGKIAPIIIDCIDGISSSAVFCVFDICATEF---DKTG 252
Query: 390 SVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQVYLQ 426
+ I + +R+++ + L Y F Y + YL+
Sbjct: 253 MLSIANALKKVRQKKYGCMNCLDDYVFCYHLIAAYLK 289
>gnl|CDD|225297 COG2453, CDC14, Predicted protein-tyrosine phosphatase [Signal
transduction mechanisms].
Length = 180
Score = 42.4 bits (100), Expect = 1e-04
Identities = 28/98 (28%), Positives = 42/98 (42%), Gaps = 15/98 (15%)
Query: 328 WPDSGVPSSPNT--LVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIALDTVL-QRLYN 384
D VP + +V F+ +S +VVHC G+GR+GT IA +L L
Sbjct: 80 ILDGTVPDLEDLDKIVDFI---EEALS-KGKKVVVHCQGGIGRSGTVIAAYLMLYGGLSL 135
Query: 385 SEGAGSVDILRIVHGLRRQRVLMVQSLAQYAFIYECVQ 422
++ A +V RR+R V + Q+ F E
Sbjct: 136 ADEAIAV--------KRRRRPGAVVTEIQHLFELEQEL 165
>gnl|CDD|222063 pfam13350, Y_phosphatase3, Tyrosine phosphatase family. This
family is closely related to the pfam00102 and pfam00782
families.
Length = 163
Score = 35.9 bits (84), Expect = 0.016
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 349 THISPAQHPIVVHCSAGVGRTGTFIAL 375
++ P++ HC+AG RTG AL
Sbjct: 117 ELLADPDGPVLFHCTAGKDRTGVAAAL 143
>gnl|CDD|214551 smart00195, DSPc, Dual specificity phosphatase, catalytic domain.
Length = 138
Score = 33.8 bits (78), Expect = 0.065
Identities = 9/20 (45%), Positives = 12/20 (60%)
Query: 355 QHPIVVHCSAGVGRTGTFIA 374
++VHC AGV R+ T I
Sbjct: 78 GGKVLVHCQAGVSRSATLII 97
>gnl|CDD|216117 pfam00782, DSPc, Dual specificity phosphatase, catalytic domain.
Ser/Thr and Tyr protein phosphatases. The enzyme's
tertiary fold is highly similar to that of
tyrosine-specific phosphatases, except for a
"recognition" region.
Length = 131
Score = 32.6 bits (75), Expect = 0.13
Identities = 8/20 (40%), Positives = 12/20 (60%)
Query: 355 QHPIVVHCSAGVGRTGTFIA 374
++VHC AG+ R+ T I
Sbjct: 71 GGKVLVHCQAGISRSATLII 90
>gnl|CDD|225240 COG2365, COG2365, Protein tyrosine/serine phosphatase [Signal
transduction mechanisms].
Length = 249
Score = 34.0 bits (78), Expect = 0.13
Identities = 14/56 (25%), Positives = 22/56 (39%), Gaps = 9/56 (16%)
Query: 351 ISPAQHPIVVHCSAGVGRTGTFIAL---------DTVLQRLYNSEGAGSVDILRIV 397
P+++HC+AG RTG AL +TV + G + I+
Sbjct: 132 ADAENGPVLIHCTAGKDRTGLVAALYRKLVGGSDETVAADYLLTNRYGEPERRGIL 187
>gnl|CDD|223751 COG0679, COG0679, Predicted permeases [General function prediction
only].
Length = 311
Score = 33.0 bits (76), Expect = 0.24
Identities = 10/58 (17%), Positives = 22/58 (37%), Gaps = 1/58 (1%)
Query: 64 FTSPELFTDTIYSPPVSTGKDVSWLLVMGFLVLIVLPVLFIIIRRKIVGVTSPRAGGI 121
P L ++I + +S D+ L+V + ++ L +I R + +
Sbjct: 43 VALPALLFNSIATADLSGLADL-GLIVASLVATLLAFFLLALIGRFLFKLDKRETVIF 99
>gnl|CDD|181208 PRK08049, PRK08049, F0F1 ATP synthase subunit I; Validated.
Length = 124
Score = 30.4 bits (69), Expect = 0.68
Identities = 16/53 (30%), Positives = 24/53 (45%), Gaps = 4/53 (7%)
Query: 78 PVS--TGKDVSWLLVMGFLVLIVLPVLFIIIRRKIVGVTSPRAGGIRGWLPRM 128
VS +GK LL++ L + LF + + S AGG+ WLP +
Sbjct: 2 SVSLYSGKVARKLLLLQLLTFVASGALFSL--KDPFWGVSALAGGLAAWLPNV 52
>gnl|CDD|197379 cd09970, UP_IIIa, Uroplakin IIIa. Uroplakin IIIa, mayor isoform of
the dimerization partner of uroplakin Ib, is a members
of the uroplakin family. Uroplakins (UPs) are a family
of proteins that associate with each other to form
plaques on the apical surface of the urothelium, the
pseudo-stratified epithelium lining the urinary tract
from renal pelvis to the bladder outlet. UPs are
classified into 3 types: UPIa and UPIb, UPII, and
UPIIIa and IIIb. UPIs are tetraspanins that have four
transmembrane domains seperating one large and one small
extracellular domain while UPII and UPIIIs are
single-pass transmembrane proteins. UPIa and UPIb form
specific heterodimers with UPII and UPIII, respectively,
which allows them to exit the endoplasmatic rediculum.
UPII/UPIa and UPIIIs/UPIb form heterotetramers and six
of these tetramers form the 16nm particle, seen in the
hexagonal array of the asymmetric unit membrane, which
is believed to form a urinary tract barrier. Uroplakins
are also believed to play a role during urinary tract
morphogenesis.
Length = 212
Score = 31.5 bits (71), Expect = 0.73
Identities = 18/43 (41%), Positives = 21/43 (48%), Gaps = 2/43 (4%)
Query: 41 PNEAHYCNGPLKQGTTYRVK-VRAFTSPELFTD-TIYSPPVST 81
PN CN PL T YR K V S L D T++S P+ T
Sbjct: 130 PNFKGLCNPPLSAATEYRFKYVLVNMSTGLVEDQTLWSDPIRT 172
>gnl|CDD|179544 PRK03140, PRK03140, phosphatidylserine decarboxylase; Provisional.
Length = 259
Score = 31.1 bits (71), Expect = 1.1
Identities = 11/38 (28%), Positives = 17/38 (44%)
Query: 213 DDEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDFW 250
++E Y Y+ YLSP Y P+ TV + +
Sbjct: 116 NEERAQRYAGGTYMVLYLSPSHYHRIHSPISGTVTEQF 153
>gnl|CDD|219177 pfam06789, UPF0258, Uncharacterized protein family (UPF0258).
Length = 159
Score = 30.4 bits (68), Expect = 1.2
Identities = 10/35 (28%), Positives = 18/35 (51%)
Query: 148 SHPIPVTLFCTMTLDPDWIAVEFEDLKSVGRGQPT 182
H P L + ++P + A E+ + GRG+P+
Sbjct: 38 KHISPQALSSRVKINPLFPAQRSEEAEDGGRGRPS 72
>gnl|CDD|197377 cd09968, UP_III, Uroplakin III. Uroplakin IIIa and IIIb, the
dimerization partners of uroplakin Ib, are a members of
the uroplakin family. Uroplakins (UPs) are a family of
proteins that associate with each other to form plaques
on the apical surface of the urothelium, the
pseudo-stratified epithelium lining the urinary tract
from renal pelvis to the bladder outlet. UPs are
classified into 3 types: UPIa and UPIb, UPII, and
UPIIIa and IIIb. UPIs are tetraspanins that have four
transmembrane domains seperating one large and one small
extracellular domain while UPII and UPIIIs are
single-pass transmembrane proteins. UPIa and UPIb form
specific heterodimers with UPII and UPIII, respectively,
which allows them to exit the endoplasmatic rediculum.
UPII/UPIa and UPIIIs/UPIb form heterotetramers and six
of these tetramers form the 16nm particle, seen in the
hexagonal array of the asymmetric unit membrane, which
is believed to form a urinary tract barrier. Uroplakins
are also believed to play a role during urinary tract
morphogenesis.
Length = 187
Score = 30.4 bits (69), Expect = 1.4
Identities = 13/49 (26%), Positives = 19/49 (38%)
Query: 33 FTIGRETCPNEAHYCNGPLKQGTTYRVKVRAFTSPELFTDTIYSPPVST 81
+G + + CNGPL YRVK + T +S P+
Sbjct: 99 LRVGADGNCTDNGNCNGPLPGPGPYRVKYLVMNGSGVVAQTNWSDPIRL 147
>gnl|CDD|197375 cd09966, UP_III_II, Uroplakin IIIb, IIIa and II. Uroplakins (UPs)
are a family of proteins that associate with each other
to form plaques on the apical surface of the urothelium,
the pseudo-stratified epithelium lining the urinary
tract from renal pelvis to the bladder outlet. UPs are
classified into 3 types: UPIa and UPIb, UPII, and
UPIIIa and IIIb. UPIs are tetraspanins that have four
transmembrane domains separating one large and one small
extracellular domain while UPII and UPIIIs are
single-pass transmembrane proteins. UPIa and UPIb form
specific heterodimers with UPII and UPIII, respectively,
which allows them to exit the endoplasmatic rediculum.
UPII/UPIa and UPIIIs/UPIb form heterotetramers; six of
these tetramers form the 16nm particle, seen in the
hexagonal array of the asymmetric unit membrane, which
is believed to form a urinary tract barrier. Uroplakins
are also believed to play a role during urinary tract
morphogenesis.
Length = 181
Score = 30.1 bits (68), Expect = 1.5
Identities = 9/15 (60%), Positives = 9/15 (60%)
Query: 46 YCNGPLKQGTTYRVK 60
YCN PL YRVK
Sbjct: 108 YCNTPLPGPGPYRVK 122
>gnl|CDD|111998 pfam03162, Y_phosphatase2, Tyrosine phosphatase family. This
family is closely related to the pfam00102 and pfam00782
families.
Length = 150
Score = 29.6 bits (67), Expect = 1.8
Identities = 13/53 (24%), Positives = 23/53 (43%), Gaps = 3/53 (5%)
Query: 322 HYHFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIA 374
H H D V + L + + ++ +P+++HC+ G RTG I
Sbjct: 61 HIHMEGNKDPFVNIPSHLLRR---ALKLLLNKDNYPVLIHCNRGKHRTGLVIG 110
>gnl|CDD|187643 cd08939, KDSR-like_SDR_c, 3-ketodihydrosphingosine reductase (KDSR)
and related proteins, classical (c) SDR. These proteins
include members identified as KDSR, ribitol type
dehydrogenase, and others. The group shows strong
conservation of the active site tetrad and glycine rich
NAD-binding motif of the classical SDRs. SDRs are a
functionally diverse family of oxidoreductases that have
a single domain with a structurally conserved Rossmann
fold (alpha/beta folding pattern with a central
beta-sheet), an NAD(P)(H)-binding region, and a
structurally diverse C-terminal region. Classical SDRs
are typically about 250 residues long, while extended
SDRs are approximately 350 residues. Sequence identity
between different SDR enzymes are typically in the
15-30% range, but the enzymes share the Rossmann fold
NAD-binding motif and characteristic NAD-binding and
catalytic sequence patterns. These enzymes catalyze a
wide range of activities including the metabolism of
steroids, cofactors, carbohydrates, lipids, aromatic
compounds, and amino acids, and act in redox sensing.
Classical SDRs have an TGXXX[AG]XG cofactor binding
motif and a YXXXK active site motif, with the Tyr
residue of the active site motif serving as a critical
catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase (15-PGDH)
numbering). In addition to the Tyr and Lys, there is
often an upstream Ser (Ser-138, 15-PGDH numbering)
and/or an Asn (Asn-107, 15-PGDH numbering) contributing
to the active site; while substrate binding is in the
C-terminal region, which determines specificity. The
standard reaction mechanism is a 4-pro-S hydride
transfer and proton relay involving the conserved Tyr
and Lys, a water molecule stabilized by Asn, and
nicotinamide. Extended SDRs have additional elements in
the C-terminal region, and typically have a TGXXGXXG
cofactor binding motif. Complex (multidomain) SDRs such
as ketoreductase domains of fatty acid synthase have a
GGXGXXG NAD(P)-binding motif and an altered active site
motif (YXXXN). Fungal type ketoacyl reductases have a
TGXXXGX(1-2)G NAD(P)-binding motif. Some atypical SDRs
have lost catalytic activity and/or have an unusual
NAD(P)-binding motif and missing or unusual active site
residues. Reactions catalyzed within the SDR family
include isomerization, decarboxylation, epimerization,
C=N bond reduction, dehydratase activity,
dehalogenation, Enoyl-CoA reduction, and
carbonyl-alcohol oxidoreduction.
Length = 239
Score = 29.9 bits (68), Expect = 2.1
Identities = 16/66 (24%), Positives = 28/66 (42%), Gaps = 7/66 (10%)
Query: 358 IVVHCSAGVGRTGTFIALD-TVLQRLYNSEGAGSVD----ILRIVHGLRRQRVLMVQS-L 411
+VV+C AG+ G F L +R + GS++ +L ++ R ++ V S
Sbjct: 85 LVVNC-AGISIPGLFEDLTAEEFERGMDVNYFGSLNVAHAVLPLMKEQRPGHIVFVSSQA 143
Query: 412 AQYAFI 417
A
Sbjct: 144 ALVGIY 149
>gnl|CDD|223279 COG0201, SecY, Preprotein translocase subunit SecY [Intracellular
trafficking and secretion].
Length = 436
Score = 30.3 bits (69), Expect = 2.3
Identities = 13/42 (30%), Positives = 19/42 (45%), Gaps = 4/42 (9%)
Query: 88 LLVMGFLVLIVLPVLFIIIRRKIVGVTSPR---AGGIRGWLP 126
LLV+ L +I L V RR+I + + G +LP
Sbjct: 221 LLVLLTLAIIFLVVYVEQARRRIP-IQYAKRQSYRGQSSYLP 261
>gnl|CDD|215416 PLN02779, PLN02779, haloacid dehalogenase-like hydrolase family
protein.
Length = 286
Score = 29.3 bits (66), Expect = 4.0
Identities = 13/43 (30%), Positives = 19/43 (44%), Gaps = 5/43 (11%)
Query: 297 DVCHYGSWVLSELRMCRGGEHRMISHYHFTTWPDSGVPSSPNT 339
DV Y EL GG+ RM +++ WP S + +P
Sbjct: 78 DVELYD-----ELLNIGGGKERMTWYFNENGWPTSTIEKAPKD 115
>gnl|CDD|185159 PRK15247, PRK15247, putative fimbrial usher protein StbD;
Provisional.
Length = 441
Score = 29.5 bits (66), Expect = 4.3
Identities = 13/50 (26%), Positives = 23/50 (46%), Gaps = 6/50 (12%)
Query: 192 PKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGYLSPRQYIATQGP 241
P + ++N+ Y++ +D + Y N + PR YIA +GP
Sbjct: 181 PASAFSNVL------YEMFKIDSRKYFAYQNPMDRDTWTQPRGYIAFKGP 224
>gnl|CDD|181406 PRK08377, PRK08377, NADH dehydrogenase subunit N; Validated.
Length = 494
Score = 29.3 bits (66), Expect = 4.4
Identities = 26/90 (28%), Positives = 38/90 (42%), Gaps = 15/90 (16%)
Query: 354 AQHPIVVHCSAGVGRTGTFIALDTVLQRLYNSEGAGSVDILRIVHGL----------RRQ 403
A HPI V SA V + G + A+ +L + N+ G SV L I+ RQ
Sbjct: 236 APHPITVMFSAFVVKAGLY-AMARILYLMQNANGWSSVLKLVIIMATLTVFFAELSALRQ 294
Query: 404 ----RVLMVQSLAQYAFIYECVQVYLQGGV 429
R++ S+ Q I + + Q GV
Sbjct: 295 KNVKRMIAYSSIGQVGMIALALALGTQAGV 324
>gnl|CDD|222193 pfam13520, AA_permease_2, Amino acid permease.
Length = 425
Score = 29.2 bits (66), Expect = 4.5
Identities = 11/56 (19%), Positives = 20/56 (35%), Gaps = 3/56 (5%)
Query: 74 IYSPPVSTGKDVSWLLVMGFLVLIVLPVLFIIIRRKIVGVTSPRAGGIRGWLPRMS 129
+P +++ + L ++ L V + + PR GGI WL
Sbjct: 20 FVTPLIASAGAAAILWGWIAALIFSLAVALVYA---ELSSAFPRNGGIYVWLKNAF 72
>gnl|CDD|215312 PLN02574, PLN02574, 4-coumarate--CoA ligase-like.
Length = 560
Score = 29.4 bits (66), Expect = 4.9
Identities = 22/76 (28%), Positives = 30/76 (39%), Gaps = 16/76 (21%)
Query: 326 TTWPDSGVPSSPN-TLVSFVLSFRTH--------------ISPAQ-HPIVVHCSAGVGRT 369
+ P +PS PN VSF+ S H IS ++ P+V +AG+
Sbjct: 27 SKHPPVPLPSDPNLDAVSFIFSHHNHNGDTALIDSSTGFSISYSELQPLVKSMAAGLYHV 86
Query: 370 GTFIALDTVLQRLYNS 385
D VL L NS
Sbjct: 87 MGVRQGDVVLLLLPNS 102
>gnl|CDD|214463 MTH00218, ND1, NADH dehydrogenase subunit 1; Provisional.
Length = 311
Score = 28.7 bits (65), Expect = 5.8
Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 5/38 (13%)
Query: 87 WLLVMGFLVLIVLPVL-----FIIIRRKIVGVTSPRAG 119
LL + L+L +LPVL ++ RK++G R G
Sbjct: 1 MLLSLINLLLYILPVLLAVAFLTLLERKVLGYMQLRKG 38
>gnl|CDD|233877 TIGR02466, TIGR02466, conserved hypothetical protein. This family
consists of uncharacterized proteins in Caulobacter
crescentus CB15, Bdellovibrio bacteriovorus HD100,
Synechococcus sp. WH 8102 (2), Silicibacter pomeroyi
DSS-3 (2), and Hyphomonas neptunium ATCC 15444. The
context of nearby genes differs substantially between
members and does point to any specific biological role
[Hypothetical proteins, Conserved].
Length = 201
Score = 28.6 bits (64), Expect = 6.0
Identities = 27/114 (23%), Positives = 38/114 (33%), Gaps = 33/114 (28%)
Query: 213 DDEEGSDYVNANYIGGYLSPRQYIATQGPLCSTVDDFWRMCWESRTPAIVMLTQCIEKHR 272
DDE G ++ N N +GGY S S +D R P L Q ++KH
Sbjct: 34 DDEAGQEWSNRNGLGGYTSYA----------SLLDLPHRF------PEFAELVQQLDKHV 77
Query: 273 DKCYK--YWPSDQHPMRCADIEISLQDVCHYGSWVLSELRMCRGGEHRMISHYH 324
K + +D +R WV + +GG H H
Sbjct: 78 AKFARDLEGDNDGLELRIQKA------------WVNI---LPQGGTHSPHLHPG 116
>gnl|CDD|217681 pfam03706, UPF0104, Uncharacterized protein family (UPF0104). This
family of proteins are integral membrane proteins. These
proteins are uncharacterized but contain a conserved PG
motif. Some members of this family are annotated as
dolichol-P-glucose synthetase and contain a pfam00535
domain.
Length = 292
Score = 28.5 bits (64), Expect = 6.8
Identities = 10/43 (23%), Positives = 21/43 (48%)
Query: 85 VSWLLVMGFLVLIVLPVLFIIIRRKIVGVTSPRAGGIRGWLPR 127
+ LL++G L+L ++ +L +++R R + L R
Sbjct: 136 LLLLLLLGVLLLALVLLLLLLLRLLRKRRRLLRLERLPERLRR 178
>gnl|CDD|222890 PHA02584, 34, long tail fiber, proximal subunit; Provisional.
Length = 1229
Score = 29.0 bits (65), Expect = 6.8
Identities = 13/56 (23%), Positives = 27/56 (48%), Gaps = 3/56 (5%)
Query: 174 KSVGRGQPTTAAELRCNRPKNRYTNIFPYDSSRYKLQPVDDEEGSDYVNANYIGGY 229
+S+ R + +A +R P + TN+F ++ + +++ E S YV G+
Sbjct: 158 QSIVRFRGNQSASVRITHPYS--TNVFIFNGNLWQVYEAGYERRSVYVEPT-GAGF 210
>gnl|CDD|240399 PTZ00393, PTZ00393, protein tyrosine phosphatase; Provisional.
Length = 241
Score = 28.4 bits (63), Expect = 7.2
Identities = 14/52 (26%), Positives = 23/52 (44%), Gaps = 2/52 (3%)
Query: 324 HFTTWPDSGVPSSPNTLVSFVLSFRTHISPAQHPIVVHCSAGVGRTGTFIAL 375
H +PD P +VS L+ ++ + VHC AG+GR ++
Sbjct: 141 HELIFPDGDAP--TVDIVSNWLTIVNNVIKNNRAVAVHCVAGLGRAPVLASI 190
>gnl|CDD|235700 PRK06112, PRK06112, acetolactate synthase catalytic subunit;
Validated.
Length = 578
Score = 28.2 bits (63), Expect = 9.9
Identities = 18/63 (28%), Positives = 22/63 (34%), Gaps = 11/63 (17%)
Query: 264 LTQCIEKHRDKCYKYWPSDQHPMRCADIEISLQDVCH----------YGS-WVLSELRMC 312
+ E HR+ SD P+R I LQ V Y S WV + L
Sbjct: 365 IAAGREAHREDSAPVALSDASPIRPERIMAELQAVLTGDTIVVADASYSSIWVANFLTAR 424
Query: 313 RGG 315
R G
Sbjct: 425 RAG 427
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.323 0.138 0.447
Gapped
Lambda K H
0.267 0.0705 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 23,302,932
Number of extensions: 2242557
Number of successful extensions: 2175
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2133
Number of HSP's successfully gapped: 47
Length of query: 451
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 351
Effective length of database: 6,502,202
Effective search space: 2282272902
Effective search space used: 2282272902
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 61 (27.3 bits)