RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17757
(225 letters)
>gnl|CDD|224276 COG1357, COG1357, Pentapeptide repeats containing protein [Function
unknown].
Length = 238
Score = 78.8 bits (194), Expect = 7e-18
Identities = 49/131 (37%), Positives = 63/131 (48%), Gaps = 9/131 (6%)
Query: 91 LSRRDVINALILTPI-TAELRFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACS------ 143
LS D+ NA + + G +L G++L +L N +
Sbjct: 61 LSGADLSNANLRGADLSKGADLSGADLEGANLRGANLSGANLSGANLSNATRANLSGADL 120
Query: 144 --VNLKGANLEGSNMAGVNLRVATLKNANLQNCDLRAAVLAGADLENCDLSGSDLHEANL 201
NL ANL G+N++G NL A L NANL DL A L+GADL +LSG+DL ANL
Sbjct: 121 SGANLSNANLTGANLSGANLNGANLSNANLSGADLSGADLSGADLSGANLSGADLRGANL 180
Query: 202 RGANLKDAALE 212
GANL A L
Sbjct: 181 SGANLSGADLR 191
Score = 74.9 bits (184), Expect = 2e-16
Identities = 40/101 (39%), Positives = 60/101 (59%)
Query: 111 FQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKNAN 170
G +L+G++LS +L N + NL GA+L G++++G +L A L A+
Sbjct: 115 LSGADLSGANLSNANLTGANLSGANLNGANLSNANLSGADLSGADLSGADLSGANLSGAD 174
Query: 171 LQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDAAL 211
L+ +L A L+GADL N +LSG+DL A+L GANL +A L
Sbjct: 175 LRGANLSGANLSGADLRNANLSGADLSGADLTGANLVNANL 215
Score = 74.1 bits (182), Expect = 4e-16
Identities = 47/119 (39%), Positives = 59/119 (49%), Gaps = 18/119 (15%)
Query: 109 LRFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACS-------------VNLKGANLEGSN 155
L G NL+G+DLS +LR + KG L NL GANL +
Sbjct: 54 LDLSGANLSGADLSNANLRGADL--SKGADLSGADLEGANLRGANLSGANLSGANLSNAT 111
Query: 156 MA---GVNLRVATLKNANLQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDAAL 211
A G +L A L NANL +L A L GA+L N +LSG+DL A+L GA+L A L
Sbjct: 112 RANLSGADLSGANLSNANLTGANLSGANLNGANLSNANLSGADLSGADLSGADLSGANL 170
Score = 65.3 bits (159), Expect = 6e-13
Identities = 43/105 (40%), Positives = 57/105 (54%), Gaps = 10/105 (9%)
Query: 110 RFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKNA 169
NL+G+DLS DL + NL GA+L G+N++G NL A L+NA
Sbjct: 144 NLSNANLSGADLSGADLSGADLS----------GANLSGADLRGANLSGANLSGADLRNA 193
Query: 170 NLQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDAALELM 214
NL DL A L GA+L N +L G++L ANL GA+L A L +
Sbjct: 194 NLSGADLSGADLTGANLVNANLGGANLFGANLLGADLSGATLGDV 238
Score = 54.9 bits (132), Expect = 3e-09
Identities = 39/103 (37%), Positives = 52/103 (50%), Gaps = 12/103 (11%)
Query: 110 RFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKNA 169
R QG+ ++L DL+ + +N + ANL G +++G NL A L NA
Sbjct: 18 RLQGIRELKANLLGADLQGSELQ--------LGDLNFENANLSGLDLSGANLSGADLSNA 69
Query: 170 NLQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDAALE 212
NL+ DL GADL DL G++L ANL GANL A L
Sbjct: 70 NLRGADLS----KGADLSGADLEGANLRGANLSGANLSGANLS 108
Score = 41.4 bits (97), Expect = 1e-04
Identities = 25/69 (36%), Positives = 35/69 (50%), Gaps = 3/69 (4%)
Query: 142 CSVNLKGANLEGSNMAGVNLRVATLKNANLQNCDLRAAVLAGADLENCDLSGSDLHEANL 201
S L+G +N+ G +L L + + A L+G DL +LSG+DL ANL
Sbjct: 15 ESSRLQGIRELKANLLGADL---QGSELQLGDLNFENANLSGLDLSGANLSGADLSNANL 71
Query: 202 RGANLKDAA 210
RGA+L A
Sbjct: 72 RGADLSKGA 80
Score = 35.2 bits (81), Expect = 0.013
Identities = 26/72 (36%), Positives = 36/72 (50%), Gaps = 2/72 (2%)
Query: 142 CSVNLKGANLEGSNMAGVNLRVATLKNANLQ--NCDLRAAVLAGADLENCDLSGSDLHEA 199
C + L +LE S + G+ A L A+LQ L A+L DLSG++L A
Sbjct: 5 CEIFLADLDLESSRLQGIRELKANLLGADLQGSELQLGDLNFENANLSGLDLSGANLSGA 64
Query: 200 NLRGANLKDAAL 211
+L ANL+ A L
Sbjct: 65 DLSNANLRGADL 76
Score = 29.8 bits (67), Expect = 0.88
Identities = 26/84 (30%), Positives = 35/84 (41%), Gaps = 15/84 (17%)
Query: 91 LSRRDVINALILTPITAELRFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGAN 150
LS D+ A + + + NL+G+DLS DL NL AN
Sbjct: 170 LSGADLRGANLSGANLSGADLRNANLSGADLSGADLTGA---------------NLVNAN 214
Query: 151 LEGSNMAGVNLRVATLKNANLQNC 174
L G+N+ G NL A L A L +
Sbjct: 215 LGGANLFGANLLGADLSGATLGDV 238
>gnl|CDD|185119 PRK15197, PRK15197, secreted effector protein PipB; Provisional.
Length = 291
Score = 68.0 bits (166), Expect = 1e-13
Identities = 47/121 (38%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 101 ILTPITAELRFQGVNLA-----GSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSN 155
++T L +GVNLA G DLS +D N +F+ TL +VNL GANL +N
Sbjct: 147 LITE-DGFLNLRGVNLAHKDFQGEDLSDIDASNADFR---ETTLS--NVNLVGANLCCAN 200
Query: 156 MAGVNLRVATLKNANLQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDAALELML 215
+ VNL + + ANL + DL A ++G +L L GSDL + L GA L AL L
Sbjct: 201 LHAVNLMGSNMTKANLTHADLTCANMSGVNLTAAILFGSDLTDTKLNGAKLDKIALTLAK 260
Query: 216 T 216
Sbjct: 261 A 261
Score = 41.5 bits (97), Expect = 1e-04
Identities = 25/87 (28%), Positives = 39/87 (44%), Gaps = 10/87 (11%)
Query: 130 NFKVRKGLTLPACSVNLKGANL-----EGSNMAGVNLRVATLKNANLQNCDLRAAVLAGA 184
N + L +NL+G NL +G +++ ++ A + L N +L A L A
Sbjct: 140 NIQQPDSLITEDGFLNLRGVNLAHKDFQGEDLSDIDASNADFRETTLSNVNLVGANLCCA 199
Query: 185 DLENCDLSGSDLHEANLRGANLKDAAL 211
+L +L GS N+ ANL A L
Sbjct: 200 NLHAVNLMGS-----NMTKANLTHADL 221
>gnl|CDD|202161 pfam02214, K_tetra, K+ channel tetramerisation domain. The
N-terminal, cytoplasmic tetramerisation domain (T1) of
voltage-gated K+ channels encodes molecular
determinants for subfamily-specific assembly of
alpha-subunits into functional tetrameric channels. It
is distantly related to the BTB/POZ domain pfam00651.
Length = 92
Score = 63.3 bits (155), Expect = 2e-13
Identities = 21/58 (36%), Positives = 28/58 (48%), Gaps = 1/58 (1%)
Query: 16 MNPGDVDSTGAYLIDRSPTYFEPILNYLR-NGQLVLDPNINPEGVLEEARFFGIESLV 72
D T Y DRSP +FE ILN+ R G+L + + LEE F+G+ L
Sbjct: 31 RCDFYDDDTNEYFFDRSPKHFETILNFYRTGGKLHRPEEVCLDSFLEELEFYGLGELA 88
>gnl|CDD|109845 pfam00805, Pentapeptide, Pentapeptide repeats (8 copies). These
repeats are found in many cyanobacterial proteins. The
repeats were first identified in hglK. The function of
these repeats is unknown. The structure of this repeat
has been predicted to be a beta-helix. The repeat can be
approximately described as A(D/N)LXX, where X can be any
amino acid.
Length = 40
Score = 53.5 bits (130), Expect = 3e-10
Identities = 22/40 (55%), Positives = 26/40 (65%)
Query: 169 ANLQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKD 208
ANL +LR A L GADL DL+G++L ANL GANL
Sbjct: 1 ANLSGANLRGADLRGADLSGADLTGANLSGANLSGANLSG 40
Score = 48.9 bits (118), Expect = 2e-08
Identities = 21/36 (58%), Positives = 23/36 (63%)
Query: 174 CDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDA 209
+L A L GADL DLSG+DL ANL GANL A
Sbjct: 1 ANLSGANLRGADLRGADLSGADLTGANLSGANLSGA 36
Score = 45.1 bits (108), Expect = 4e-07
Identities = 19/40 (47%), Positives = 27/40 (67%)
Query: 154 SNMAGVNLRVATLKNANLQNCDLRAAVLAGADLENCDLSG 193
+N++G NLR A L+ A+L DL A L+GA+L +LSG
Sbjct: 1 ANLSGANLRGADLRGADLSGADLTGANLSGANLSGANLSG 40
Score = 43.5 bits (104), Expect = 1e-06
Identities = 17/37 (45%), Positives = 24/37 (64%)
Query: 160 NLRVATLKNANLQNCDLRAAVLAGADLENCDLSGSDL 196
NL A L+ A+L+ DL A L GA+L +LSG++L
Sbjct: 2 NLSGANLRGADLRGADLSGADLTGANLSGANLSGANL 38
Score = 43.5 bits (104), Expect = 1e-06
Identities = 18/34 (52%), Positives = 22/34 (64%)
Query: 179 AVLAGADLENCDLSGSDLHEANLRGANLKDAALE 212
A L+GA+L DL G+DL A+L GANL A L
Sbjct: 1 ANLSGANLRGADLRGADLSGADLTGANLSGANLS 34
Score = 40.4 bits (96), Expect = 2e-05
Identities = 14/34 (41%), Positives = 22/34 (64%)
Query: 144 VNLKGANLEGSNMAGVNLRVATLKNANLQNCDLR 177
NL+GA+L G++++G +L A L ANL +L
Sbjct: 6 ANLRGADLRGADLSGADLTGANLSGANLSGANLS 39
Score = 35.8 bits (84), Expect = 8e-04
Identities = 17/48 (35%), Positives = 22/48 (45%), Gaps = 10/48 (20%)
Query: 111 FQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAG 158
G NL G+DL DL + NL GANL G+N++G
Sbjct: 3 LSGANLRGADLRGADLSGADLT----------GANLSGANLSGANLSG 40
Score = 31.6 bits (73), Expect = 0.024
Identities = 17/48 (35%), Positives = 25/48 (52%), Gaps = 10/48 (20%)
Query: 115 NLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLR 162
NL+G++L DLR + +L GANL G+N++G NL
Sbjct: 2 NLSGANLRGADLRGADL----------SGADLTGANLSGANLSGANLS 39
>gnl|CDD|205777 pfam13599, Pentapeptide_4, Pentapeptide repeats (9 copies).
Length = 77
Score = 54.6 bits (132), Expect = 3e-10
Identities = 20/69 (28%), Positives = 31/69 (44%)
Query: 144 VNLKGANLEGSNMAGVNLRVATLKNANLQNCDLRAAVLAGADLENCDLSGSDLHEANLRG 203
+ + + E ++ VN ATLKN N NC L + DL + D + +DL A
Sbjct: 1 ADFRNCSFENCDLKNVNFSNATLKNCNFSNCKLFNVNFSDCDLRDVDFTEADLKNAKFSN 60
Query: 204 ANLKDAALE 212
+LK+
Sbjct: 61 CDLKNVNFS 69
Score = 50.4 bits (121), Expect = 1e-08
Identities = 27/87 (31%), Positives = 36/87 (41%), Gaps = 11/87 (12%)
Query: 125 DLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKNANLQNCDLRAAVLAGA 184
D RN +F+ + +LK N + + N L N N +CDLR A
Sbjct: 2 DFRNCSFE----------NCDLKNVNFSNATLKNCNFSNCKLFNVNFSDCDLRDVDFTEA 51
Query: 185 DLENCDLSGSDLHEANLRGANLKDAAL 211
DL+N S DL N G +LKD
Sbjct: 52 DLKNAKFSNCDLKNVNFSG-SLKDCDF 77
Score = 46.9 bits (112), Expect = 2e-07
Identities = 23/82 (28%), Positives = 35/82 (42%), Gaps = 11/82 (13%)
Query: 110 RFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKNA 169
F+ +L + S L+N NF + L N ++ V+ A LKNA
Sbjct: 7 SFENCDLKNVNFSNATLKNCNFS----------NCKLFNVNFSDCDLRDVDFTEADLKNA 56
Query: 170 NLQNCDLRAAVLAGADLENCDL 191
NCDL+ +G L++CD
Sbjct: 57 KFSNCDLKNVNFSG-SLKDCDF 77
Score = 46.5 bits (111), Expect = 3e-07
Identities = 21/85 (24%), Positives = 33/85 (38%), Gaps = 15/85 (17%)
Query: 110 RFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKNA 169
F+ + DL N+NF + LK N + VN L++
Sbjct: 2 DFRNCSFENCDLK-----NVNF----------SNATLKNCNFSNCKLFNVNFSDCDLRDV 46
Query: 170 NLQNCDLRAAVLAGADLENCDLSGS 194
+ DL+ A + DL+N + SGS
Sbjct: 47 DFTEADLKNAKFSNCDLKNVNFSGS 71
Score = 26.5 bits (59), Expect = 3.7
Identities = 12/47 (25%), Positives = 18/47 (38%), Gaps = 10/47 (21%)
Query: 108 ELRFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGS 154
+ F +L D + DL+N F + +LK N GS
Sbjct: 35 NVNFSDCDLRDVDFTEADLKNAKFS----------NCDLKNVNFSGS 71
>gnl|CDD|197585 smart00225, BTB, Broad-Complex, Tramtrack and Bric a brac.
Domain in Broad-Complex, Tramtrack and Bric a brac.
Also known as POZ (poxvirus and zinc finger) domain.
Known to be a protein-protein interaction motif found
at the N-termini of several C2H2-type transcription
factors as well as Shaw-type potassium channels. Known
structure reveals a tightly intertwined dimer formed
via interactions between N-terminal strand and helix
structures. However in a subset of BTB/POZ domains,
these two secondary structures appear to be missing. Be
aware SMART predicts BTB/POZ domains without the beta1-
and alpha1-secondary structures.
Length = 97
Score = 52.7 bits (127), Expect = 2e-09
Identities = 22/67 (32%), Positives = 32/67 (47%), Gaps = 1/67 (1%)
Query: 17 NPGDVDSTGAYLIDRSPTYFEPILNYLRNGQLVLDPNINPEGVLEEARFFGIESLVPQLM 76
+ + D + YL D SP F +LN+L G+L L N E +LE A + I LV
Sbjct: 32 DFKESDKSEIYLDDVSPEDFRALLNFLYTGKLDLPEE-NVEELLELADYLQIPGLVELCE 90
Query: 77 EIIMSRE 83
E ++
Sbjct: 91 EFLLKLL 97
>gnl|CDD|185118 PRK15196, PRK15196, secreted effector protein PipB2; Provisional.
Length = 350
Score = 51.9 bits (124), Expect = 5e-08
Identities = 39/107 (36%), Positives = 54/107 (50%), Gaps = 11/107 (10%)
Query: 113 GVNLAGSDLSRLDLRNINFKVRKGLTLPACSV---NLKGANLEGSNMAGVNLRVATLKNA 169
G NL G+ L D NFK G L S+ N K A LE S M G L A L A
Sbjct: 176 GANLEGAVLFMADCEGANFK---GANLSGTSLGDSNFKNACLEDSIMCGATLDHANLTGA 232
Query: 170 NLQN-----CDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDAAL 211
NLQ+ C + +GA++++ +LSG+ L A++ GA L+ A +
Sbjct: 233 NLQHASLLGCSMIECNCSGANMDHTNLSGATLIRADMSGATLQGATI 279
Score = 51.2 bits (122), Expect = 1e-07
Identities = 38/131 (29%), Positives = 60/131 (45%), Gaps = 30/131 (22%)
Query: 111 FQGVNLAGSDL-----------------SRLDLRNINFKVRKGLTLPACSV---NLKGAN 150
F+G NL+G+ L + LD N+ + +L CS+ N GAN
Sbjct: 194 FKGANLSGTSLGDSNFKNACLEDSIMCGATLDHANLTGANLQHASLLGCSMIECNCSGAN 253
Query: 151 LEGSNMAG-----VNLRVATLKNANLQNCDLRAAVLAGADLE-----NCDLSGSDLHEAN 200
++ +N++G ++ ATL+ A + + AVL A+L + +L G+DL EAN
Sbjct: 254 MDHTNLSGATLIRADMSGATLQGATIMAAIMEGAVLTRANLRKASFISTNLDGADLAEAN 313
Query: 201 LRGANLKDAAL 211
L KD L
Sbjct: 314 LNNTCFKDCTL 324
Score = 48.5 bits (115), Expect = 9e-07
Identities = 35/106 (33%), Positives = 50/106 (47%), Gaps = 17/106 (16%)
Query: 115 NLAGSDLSRLDLRNINFKVRKGLTLPACS-VNLKGANLEGSNMAGVNLRV---------- 163
NL +DLS +L L + C N KGANL G+++ N +
Sbjct: 168 NLCDADLSGANLEG------AVLFMADCEGANFKGANLSGTSLGDSNFKNACLEDSIMCG 221
Query: 164 ATLKNANLQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDA 209
ATL +ANL +L+ A L G + C+ SG+++ NL GA L A
Sbjct: 222 ATLDHANLTGANLQHASLLGCSMIECNCSGANMDHTNLSGATLIRA 267
Score = 48.1 bits (114), Expect = 1e-06
Identities = 32/106 (30%), Positives = 55/106 (51%), Gaps = 20/106 (18%)
Query: 113 GVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGS----------NMAGVNLR 162
G+NL G+ L+ +L N C +L GANLEG+ N G NL
Sbjct: 151 GMNLKGAVLTGANLTAENL----------CDADLSGANLEGAVLFMADCEGANFKGANLS 200
Query: 163 VATLKNANLQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKD 208
+L ++N +N L +++ GA L++ +L+G++L A+L G ++ +
Sbjct: 201 GTSLGDSNFKNACLEDSIMCGATLDHANLTGANLQHASLLGCSMIE 246
Score = 47.3 bits (112), Expect = 2e-06
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 3/82 (3%)
Query: 134 RKGLTLPACSVNLK---GANLEGSNMAGVNLRVATLKNANLQNCDLRAAVLAGADLENCD 190
R +LP SV L G NL+G+ + G NL L +A+L +L AVL AD E +
Sbjct: 134 RCEFSLPQDSVILTAQGGMNLKGAVLTGANLTAENLCDADLSGANLEGAVLFMADCEGAN 193
Query: 191 LSGSDLHEANLRGANLKDAALE 212
G++L +L +N K+A LE
Sbjct: 194 FKGANLSGTSLGDSNFKNACLE 215
Score = 43.1 bits (101), Expect = 6e-05
Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 5/72 (6%)
Query: 145 NLKGANLEGSNMAGVNLRVATLKNANLQNCDLRAAVLAGADLENCDLS-----GSDLHEA 199
NL A+L G+N+ G L +A + AN + +L L ++ +N L G+ L A
Sbjct: 168 NLCDADLSGANLEGAVLFMADCEGANFKGANLSGTSLGDSNFKNACLEDSIMCGATLDHA 227
Query: 200 NLRGANLKDAAL 211
NL GANL+ A+L
Sbjct: 228 NLTGANLQHASL 239
Score = 37.3 bits (86), Expect = 0.004
Identities = 22/87 (25%), Positives = 37/87 (42%), Gaps = 10/87 (11%)
Query: 115 NLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKNANLQNC 174
NL+G+ L R D+ + + A +EG+ + NLR A+ + NL
Sbjct: 258 NLSGATLIRADMSGATLQ----------GATIMAAIMEGAVLTRANLRKASFISTNLDGA 307
Query: 175 DLRAAVLAGADLENCDLSGSDLHEANL 201
DL A L ++C L+ +A +
Sbjct: 308 DLAEANLNNTCFKDCTLTHLRTEDATM 334
>gnl|CDD|185275 PRK15377, PRK15377, E3 ubiquitin-protein ligase SopA; Provisional.
Length = 782
Score = 48.5 bits (115), Expect = 1e-06
Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 4/91 (4%)
Query: 114 VNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKNANLQN 173
+ A D S D RNINF P+ + G ++G N + + + + +
Sbjct: 190 LTYADRDFSNADFRNINFSK----INPSGFMLRDGDIIKGFNFSNSKFTYSDISHLHFDE 245
Query: 174 CDLRAAVLAGADLENCDLSGSDLHEANLRGA 204
C + L+ N S SD++EA+L+ +
Sbjct: 246 CRFTYSTLSDVVCSNTKFSNSDMNEASLKYS 276
>gnl|CDD|216043 pfam00651, BTB, BTB/POZ domain. The BTB (for BR-C, ttk and bab) or
POZ (for Pox virus and Zinc finger) domain is present
near the N-terminus of a fraction of zinc finger
(pfam00096) proteins and in proteins that contain the
pfam01344 motif such as Kelch and a family of pox virus
proteins. The BTB/POZ domain mediates homomeric
dimerisation and in some instances heteromeric
dimerisation. The structure of the dimerised PLZF
BTB/POZ domain has been solved and consists of a tightly
intertwined homodimer. The central scaffolding of the
protein is made up of a cluster of alpha-helices flanked
by short beta-sheets at both the top and bottom of the
molecule. POZ domains from several zinc finger proteins
have been shown to mediate transcriptional repression
and to interact with components of histone deacetylase
co-repressor complexes including N-CoR and SMRT. The POZ
or BTB domain is also known as BR-C/Ttk or ZiN.
Length = 101
Score = 31.4 bits (72), Expect = 0.090
Identities = 15/53 (28%), Positives = 26/53 (49%), Gaps = 1/53 (1%)
Query: 27 YLIDRSPTYFEPILNYLRNGQLVLDPNINPEGVLEEARFFGIESLVPQLMEII 79
L D SP FE +L ++ G+L + N + +L A I +L+ + E +
Sbjct: 49 TLEDVSPEDFEALLEFIYTGKLEITE-ENVDDLLALADKLQIPALIDKCEEFL 100
>gnl|CDD|224055 COG1132, MdlB, ABC-type multidrug transport system, ATPase and
permease components [Defense mechanisms].
Length = 567
Score = 32.4 bits (74), Expect = 0.18
Identities = 11/75 (14%), Positives = 26/75 (34%), Gaps = 1/75 (1%)
Query: 65 FFGIESLVPQLMEIIMSRERSRDMMPLSRRDVINALILTPITAELRFQGVNLAGSDLSRL 124
+ +V L + ER +++ L + F+ V+ +
Sbjct: 285 ILQLGEVVSLLQRASAAAERLFELLDEEPEVEDPPDPLKDTIGSIEFENVSF-SYPGKKP 343
Query: 125 DLRNINFKVRKGLTL 139
L++I+F + G +
Sbjct: 344 VLKDISFSIEPGEKV 358
>gnl|CDD|177853 PLN02202, PLN02202, carbonate dehydratase.
Length = 284
Score = 32.0 bits (72), Expect = 0.20
Identities = 17/45 (37%), Positives = 25/45 (55%), Gaps = 6/45 (13%)
Query: 108 ELRFQGVNLAGSDLSRLDLRNINFKVRK------GLTLPACSVNL 146
E RF+G + A ++ ++D R+I K RK LT P CS N+
Sbjct: 176 EERFKGNHTAQVEVGKIDTRHIERKTRKYFRYIGSLTTPPCSENV 220
>gnl|CDD|200495 cd09295, Sema, The Sema domain, a protein interacting module, of
semaphorins and plexins. Both semaphorins and plexins
have a Sema domain on their N-termini. Plexins function
as receptors for the semaphorins. Evolutionarily,
plexins may be the ancestor of semaphorins. Semaphorins
are regulatory molecules in the development of the
nervous system and in axonal guidance. They also play
important roles in other biological processes, such as
angiogenesis, immune regulation, respiration systems,
and cancer. Semaphorins can be divided into 7 classes.
Vertebrates have members in classes 3-7, whereas classes
1 and 2 are known only in invertebrates. Class 2 and 3
semaphorins are secreted; classes 1 and 4 through 6 are
transmembrane proteins; and class 7 is membrane
associated via glycosylphosphatidylinositol (GPI)
linkage. Plexins are a large family of transmembrane
proteins, which are divided into four types (A-D)
according to sequence similarity. In vertebrates, type A
plexins serve as co-receptors for neuropilins to mediate
the signalling of class 3 semaphorins. Plexins serve as
direct receptors for several other members of the
semaphorin family: class 6 semaphorins signal through
type A plexins and class 4 semaphorins through type B
plexins. This family also includes the MET and RON
receptor tyrosine kinases. The Sema domain is located at
the N-terminus and contains four disulfide bonds formed
by eight conserved cysteine residues. It serves to
recognize and bind receptors.
Length = 392
Score = 30.6 bits (69), Expect = 0.55
Identities = 19/69 (27%), Positives = 26/69 (37%), Gaps = 6/69 (8%)
Query: 101 ILTPITAELRFQ--GVNLA--GSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNM 156
+L G N A RLD+ KVR P C ++ K +N+ G N+
Sbjct: 69 VLQQKGDLDILAVCGSNAAQPSCGSYRLDVLVELGKVRWPSGRPRCPIDNKHSNM-GVNV 127
Query: 157 AGVNLRVAT 165
L AT
Sbjct: 128 DSK-LYSAT 135
>gnl|CDD|221494 pfam12257, DUF3608, Protein of unknown function (DUF3608). This
domain family is found in eukaryotes, and is
approximately 280 amino acids in length. The family is
found in association with pfam00610.
Length = 281
Score = 29.7 bits (67), Expect = 1.2
Identities = 20/83 (24%), Positives = 30/83 (36%), Gaps = 10/83 (12%)
Query: 95 DVINALILTPITAELRFQGVNLAGSDLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGS 154
D +N + I LR + N + + V +G LP +K LE
Sbjct: 154 DQVNIIHWDEIMETLRKEFANFKRDLMLNKTSEDKGKSVIRGRFLPV----IKSNILEAI 209
Query: 155 NMAGVNLRVATLKNANLQNCDLR 177
N+A TL N + DL+
Sbjct: 210 NLA------TTLVNDPFRQTDLK 226
>gnl|CDD|181028 PRK07559, PRK07559, 2'-deoxycytidine 5'-triphosphate deaminase;
Provisional.
Length = 365
Score = 28.7 bits (65), Expect = 2.3
Identities = 12/34 (35%), Positives = 20/34 (58%), Gaps = 4/34 (11%)
Query: 20 DVDSTGAYLIDRSPTYFEPILNYLRNGQLVLDPN 53
DVD GA+ + ++EP+ +G+L+LDP
Sbjct: 224 DVDRIGAHDVLD---FWEPLYAR-GDGELILDPG 253
>gnl|CDD|233117 TIGR00759, aceE, pyruvate dehydrogenase E1 component, homodimeric
type. Most members of this family are pyruvate
dehydrogenase complex, E1 component. Note: this family
was classified as subfamily rather than equivalog
because it includes a counterexample from Pseudomonas
putida, MdeB, that is active as an E1 component of an
alpha-ketoglutarate dehydrogenase complex rather than a
pyruvate dehydrogase complex. The second pyruvate
dehydrogenase complex E1 protein from Alcaligenes
eutrophus, PdhE, complements an aceE mutant of E. coli
but is not part of a pyruvate dehydrogenase complex
operon, is more similar to the Pseudomonas putida MdeB
than to E. coli AceE, and may have also have a different
primary specificity.
Length = 885
Score = 29.0 bits (65), Expect = 2.4
Identities = 12/33 (36%), Positives = 17/33 (51%), Gaps = 3/33 (9%)
Query: 46 GQLVLDPNINPEGV---LEEARFFGIESLVPQL 75
+L+ D I V +EAR FG+E L Q+
Sbjct: 500 NKLLKDKEIGKRIVPIVPDEARTFGMEGLFRQI 532
>gnl|CDD|182045 PRK09718, PRK09718, hypothetical protein; Validated.
Length = 512
Score = 28.9 bits (64), Expect = 2.8
Identities = 25/107 (23%), Positives = 39/107 (36%), Gaps = 15/107 (14%)
Query: 120 DLSRLDLRNINFKVRKGLTLPACSVNLKGANLEGSNMAGVNLRVATLKN----------- 168
DLS LDL + +FK + S NL+ E + N A + N
Sbjct: 91 DLSDLDLTSCHFKGDVISKVSFLSSNLQHVTFECKEIGDCNFTTAIVDNVIFKCRRLHNV 150
Query: 169 ----ANLQNCDLRAAVLAGADLENCDLSGSDLHEANLRGANLKDAAL 211
A+ + D +L D L+ S+ E +R +N + L
Sbjct: 151 IFIKASGEYVDFSKNILDTVDFSQSQLTHSNFRECQIRNSNFDNCYL 197
>gnl|CDD|220128 pfam09157, TruB-C_2, Pseudouridine synthase II TruB, C-terminal.
Members of this family adopt a secondary structure
consisting of a four-stranded beta sheet and one alpha
helix. They are predominantly RNA-binding domains,
mostly found in Pseudouridine synthase II TruB.
Length = 57
Score = 26.0 bits (58), Expect = 3.4
Identities = 11/31 (35%), Positives = 13/31 (41%), Gaps = 4/31 (12%)
Query: 42 YLRNGQLVLDPNINPEGVL----EEARFFGI 68
GQ V EG++ EE RF GI
Sbjct: 12 RFLQGQPVRVDGAPAEGLVRVYGEEGRFLGI 42
>gnl|CDD|100001 cd04955, GT1_like_6, This family is most closely related to the GT1
family of glycosyltransferases. Glycosyltransferases
catalyze the transfer of sugar moieties from activated
donor molecules to specific acceptor molecules, forming
glycosidic bonds. The acceptor molecule can be a lipid,
a protein, a heterocyclic compound, or another
carbohydrate residue. This group of glycosyltransferases
is most closely related to the previously defined
glycosyltransferase family 1 (GT1). The members of this
family may transfer UDP, ADP, GDP, or CMP linked sugars.
The diverse enzymatic activities among members of this
family reflect a wide range of biological functions. The
protein structure available for this family has the GTB
topology, one of the two protein topologies observed for
nucleotide-sugar-dependent glycosyltransferases. GTB
proteins have distinct N- and C- terminal domains each
containing a typical Rossmann fold. The two domains have
high structural homology despite minimal sequence
homology. The large cleft that separates the two domains
includes the catalytic center and permits a high degree
of flexibility. The members of this family are found in
certain bacteria and Archaea.
Length = 363
Score = 28.3 bits (64), Expect = 3.6
Identities = 11/53 (20%), Positives = 21/53 (39%), Gaps = 1/53 (1%)
Query: 48 LVLDPNINPEGVLEEARFFGIESLVPQLM-EIIMSRERSRDMMPLSRRDVINA 99
L D N E + ++A +F + + L+ E+ E M +R +
Sbjct: 296 LASDNPFNREVLGDKAIYFKVGDDLASLLEELEADPEEVSAMAKAARERIREK 348
>gnl|CDD|226443 COG3933, COG3933, Transcriptional antiterminator [Transcription].
Length = 470
Score = 28.1 bits (63), Expect = 3.7
Identities = 21/82 (25%), Positives = 35/82 (42%), Gaps = 1/82 (1%)
Query: 28 LIDR-SPTYFEPILNYLRNGQLVLDPNINPEGVLEEARFFGIESLVPQLMEIIMSRERSR 86
L+D S T F I++ + + PN++ VLE R + + Q+ + I
Sbjct: 171 LVDMGSLTSFGSIISEEFGIPVKVIPNVSTSMVLEAGRKISLGLNLEQIYQSIQKDFPCY 230
Query: 87 DMMPLSRRDVINALILTPITAE 108
P ++ AL+ T IT E
Sbjct: 231 WCTPWPAKNKKKALVTTCITGE 252
>gnl|CDD|235339 PRK05084, xerS, site-specific tyrosine recombinase XerS; Reviewed.
Length = 357
Score = 28.0 bits (63), Expect = 4.1
Identities = 19/50 (38%), Positives = 23/50 (46%), Gaps = 17/50 (34%)
Query: 94 RDV-INALILTPITAELRFQGVNLAGSDLSRLDLRNINFK------VRKG 136
RD+ I ALIL G L S+L LDL ++N K RKG
Sbjct: 196 RDLAIIALIL----------GSGLRVSELVNLDLSDLNLKQMTIDVTRKG 235
>gnl|CDD|163637 cd07394, MPP_Vps29, Homo sapiens Vps29 and related proteins,
metallophosphatase domain. Vps29 (vacuolar sorting
protein 29), also known as vacuolar membrane protein
Pep11, is a subunit of the retromer complex which is
responsible for the retrieval of mannose-6-phosphate
receptors (MPRs) from the endosomes for retrograde
transport back to the Golgi. Vps29 has a phosphoesterase
fold that acts as a protein interaction scaffold for
retromer complex assembly as well as a phosphatase with
specificity for the cytoplasmic tail of the MPR. The
retromer includes the following 5 subunits: Vps35,
Vps26, Vps29, and a dimer of the sorting nexins Vps5
(Snx1), and Vps17 (Snx2). Vps29 belongs to the
metallophosphatase (MPP) superfamily. MPPs are
functionally diverse, but all share a conserved domain
with an active site consisting of two metal ions
(usually manganese, iron, or zinc) coordinated with
octahedral geometry by a cage of histidine, aspartate,
and asparagine residues. The MPP superfamily includes:
Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
debranching enzymes, YfcE-like phosphodiesterases,
purple acid phosphatases (PAPs), YbbF-like
UDP-2,3-diacylglucosamine hydrolases, and acid
sphingomyelinases (ASMases). The conserved domain is a
double beta-sheet sandwich with a di-metal active site
made up of residues located at the C-terminal side of
the sheets. This domain is thought to allow for
productive metal coordination.
Length = 178
Score = 27.2 bits (61), Expect = 5.1
Identities = 9/17 (52%), Positives = 12/17 (70%), Gaps = 2/17 (11%)
Query: 11 EGGYFMNPGDVDSTGAY 27
EG +F+NPG TGA+
Sbjct: 124 EGKFFINPGSA--TGAF 138
>gnl|CDD|234514 TIGR04231, seadorna_VP5, seadornavirus VP5 protein. This protein
family occurs in the seadornavirus virus group, with
designations VP5 in Banna virus, and VP6 in Kadipiro
virus and Liao ning virus. The function is unassigned.
Length = 505
Score = 28.0 bits (62), Expect = 5.3
Identities = 12/42 (28%), Positives = 16/42 (38%)
Query: 4 KMFAEGSEGGYFMNPGDVDSTGAYLIDRSPTYFEPILNYLRN 45
K F E E + + S A+L S F + N RN
Sbjct: 19 KFFDEKEEKDVILTASEFKSNNAFLFKDSGNLFLGLENIFRN 60
>gnl|CDD|236500 PRK09405, aceE, pyruvate dehydrogenase subunit E1; Reviewed.
Length = 891
Score = 27.8 bits (63), Expect = 5.4
Identities = 12/32 (37%), Positives = 16/32 (50%), Gaps = 3/32 (9%)
Query: 47 QLVLDPNINPEGVL---EEARFFGIESLVPQL 75
L+ D I V +EAR FG+E L Q+
Sbjct: 507 ILLKDKEIGKRIVPIIPDEARTFGMEGLFRQI 538
>gnl|CDD|222805 PHA00527, PHA00527, hypothetical protein.
Length = 129
Score = 26.9 bits (59), Expect = 5.9
Identities = 10/26 (38%), Positives = 15/26 (57%)
Query: 13 GYFMNPGDVDSTGAYLIDRSPTYFEP 38
G PGD + T Y++DR ++F P
Sbjct: 93 GVTTEPGDANETYCYMLDRMFSHFLP 118
>gnl|CDD|237267 PRK13012, PRK13012, 2-oxoacid dehydrogenase subunit E1;
Provisional.
Length = 896
Score = 27.2 bits (61), Expect = 8.3
Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 3/33 (9%)
Query: 46 GQLVLDPNINPEGV---LEEARFFGIESLVPQL 75
G L+ D + P V +EAR FG+ +L Q+
Sbjct: 514 GNLLKDKALGPRIVPIVADEARTFGMANLFRQV 546
>gnl|CDD|224857 COG1946, TesB, Acyl-CoA thioesterase [Lipid metabolism].
Length = 289
Score = 26.9 bits (60), Expect = 9.2
Identities = 11/37 (29%), Positives = 16/37 (43%), Gaps = 11/37 (29%)
Query: 10 SEGGYFMNPGDVDSTGAYLIDRSPTYFEPILNYLRNG 46
S YF+ PGD + Y ++R LR+G
Sbjct: 59 SLHSYFLRPGDPEQPIIYDVER-----------LRDG 84
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 27.3 bits (60), Expect = 9.5
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 68 IESLVPQLMEIIMSRERSRDM 88
IESL L+EII SR + R+
Sbjct: 1241 IESLAHALVEIIKSRSQQRNF 1261
>gnl|CDD|223971 COG1041, COG1041, Predicted DNA modification methylase [DNA
replication, recombination, and repair].
Length = 347
Score = 27.0 bits (60), Expect = 9.8
Identities = 14/45 (31%), Positives = 19/45 (42%), Gaps = 7/45 (15%)
Query: 32 SPTYFEPI-------LNYLRNGQLVLDPNINPEGVLEEARFFGIE 69
P +P L ++ G+LVLDP G+L EA G
Sbjct: 177 RPGSMDPRLARAMVNLARVKRGELVLDPFCGTGGILIEAGLMGAR 221
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.136 0.385
Gapped
Lambda K H
0.267 0.0732 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,532,966
Number of extensions: 1119026
Number of successful extensions: 1113
Number of sequences better than 10.0: 1
Number of HSP's gapped: 978
Number of HSP's successfully gapped: 67
Length of query: 225
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 132
Effective length of database: 6,812,680
Effective search space: 899273760
Effective search space used: 899273760
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 57 (25.7 bits)