RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy13710
(615 letters)
>gnl|CDD|179668 PRK03893, PRK03893, putative sialic acid transporter; Provisional.
Length = 496
Score = 35.1 bits (81), Expect = 0.11
Identities = 24/84 (28%), Positives = 37/84 (44%), Gaps = 6/84 (7%)
Query: 164 RWLFFVNFI-LALGLILFVTIPE--EWQHSKRELANV---MNLLIPWELRIKEIESHFGS 217
R LFF+ + + L L +PE +W+ A V +++L E RI I +
Sbjct: 174 RALFFIGILPIIFALWLRKNLPEAEDWKEKHAGKAPVRTMVDILYRGEHRIINILLTLAA 233
Query: 218 AVASYFTFLRWLFFVNFILALGLI 241
A A +F F L + LGL+
Sbjct: 234 AAALWFCFAGNLQNAAIVAVLGLL 257
>gnl|CDD|224125 COG1204, COG1204, Superfamily II helicase [General function
prediction only].
Length = 766
Score = 35.1 bits (81), Expect = 0.13
Identities = 30/165 (18%), Positives = 50/165 (30%), Gaps = 14/165 (8%)
Query: 155 AVASYFTFLRWL----FFVNFILALGLILFVTIPEEWQHSKRELANVMNLLIPWELRIKE 210
V S + WL F+ + EE S R L +L +
Sbjct: 442 GVISVGDAVSWLELTDFYERTFYNPQTYGEGMLREEILASLRYLEENGLILDADWEALHA 501
Query: 211 IESHFGSAVASYF-TFLRWLFFVNFILALGLILFVTIPELLSNPTDCREMKKPLPEEEKE 269
E G V+ + F + + L L T LL + ++ P+ E+E
Sbjct: 502 TE--LGKLVSRLYIDPESAKIFRDLLAELALEP--TEIGLLYLISLTPDL-MPIKLRERE 556
Query: 270 SRKLYTLFEFEGILKYSPIFYGYYNNQDNSRYKTPLAFFIVTLLL 314
S +L + + + S G ++ Y L L
Sbjct: 557 SSELV----LDELEEQSDYLLGERLDELAVEYNLLLQALKTAARL 597
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 30.8 bits (70), Expect = 0.18
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 591 EEEKEEDLEEMEKEEVEEEKKEMKK 615
EEEK ED EE E EE EE+ E+K+
Sbjct: 22 EEEKREDEEENEDEEEGEEQSEVKR 46
>gnl|CDD|183863 PRK13107, PRK13107, preprotein translocase subunit SecA; Reviewed.
Length = 908
Score = 34.6 bits (79), Expect = 0.18
Identities = 25/92 (27%), Positives = 44/92 (47%), Gaps = 10/92 (10%)
Query: 522 LLEKLGNL--EQLHPRKHLRMLLARNEYSESIGCQIIKQDDVTQFQRIIDSLDSSVDLTR 579
+L+ L L E L HLR + Y++ Q K++ FQ+++++L V
Sbjct: 767 MLQTLDGLWKEHLAAMDHLRQGIHLRGYAQKNPKQEYKRESFELFQQLLETLKHDV---- 822
Query: 580 YACQIYLIKKQEEEKEEDLEEMEKEEVEEEKK 611
I ++ K + + + D+EEME EE+ K
Sbjct: 823 ----ISVLSKVQVQAQSDVEEMEARRREEDAK 850
>gnl|CDD|223805 COG0733, COG0733, Na+-dependent transporters of the SNF family
[General function prediction only].
Length = 439
Score = 33.3 bits (77), Expect = 0.36
Identities = 18/95 (18%), Positives = 36/95 (37%), Gaps = 24/95 (25%)
Query: 161 TFLRWL-FFV-NFILALGLILFVTIPEEWQHSKRELANVMNLLIPWELRIKEIESHFGSA 218
+ + F V N ++ LG +L + + W L+ + + +
Sbjct: 364 SIFDLVDFVVSNILMPLGALLIA------------------IFVGWVLKKELLREELNAG 405
Query: 219 VASYFTFLRWLFFVNFI--LALGLILFVTIPELLS 251
A F WL+ V +I + + ++L + ELL
Sbjct: 406 SAKLFKI--WLYLVRYIAPIVILVVLILGFYELLF 438
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 33.4 bits (76), Expect = 0.45
Identities = 14/24 (58%), Positives = 21/24 (87%)
Query: 589 KQEEEKEEDLEEMEKEEVEEEKKE 612
++EEE+EE+ EE E+EE EEE++E
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEE 887
Score = 32.7 bits (74), Expect = 0.70
Identities = 14/24 (58%), Positives = 21/24 (87%)
Query: 589 KQEEEKEEDLEEMEKEEVEEEKKE 612
++EEE+EE+ EE E+EE EEE++E
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEE 889
Score = 32.3 bits (73), Expect = 0.87
Identities = 13/25 (52%), Positives = 21/25 (84%)
Query: 588 KKQEEEKEEDLEEMEKEEVEEEKKE 612
+++EEE+EE+ EE E+EE EEE++
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEEN 890
Score = 32.3 bits (73), Expect = 0.89
Identities = 14/25 (56%), Positives = 21/25 (84%)
Query: 588 KKQEEEKEEDLEEMEKEEVEEEKKE 612
+++EEE+EE+ EE E+EE EEE +E
Sbjct: 868 EEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 31.9 bits (72), Expect = 1.1
Identities = 14/25 (56%), Positives = 21/25 (84%)
Query: 588 KKQEEEKEEDLEEMEKEEVEEEKKE 612
+++EEE+EE+ EE E+EE EEE+ E
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENE 891
Score = 31.9 bits (72), Expect = 1.2
Identities = 14/25 (56%), Positives = 22/25 (88%)
Query: 588 KKQEEEKEEDLEEMEKEEVEEEKKE 612
+++EEE+EE+ EE E+EE EEE++E
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEE 888
Score = 29.2 bits (65), Expect = 7.7
Identities = 12/22 (54%), Positives = 18/22 (81%)
Query: 591 EEEKEEDLEEMEKEEVEEEKKE 612
+ E+EE+ EE E+EE EEE++E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEE 883
>gnl|CDD|225348 COG2738, COG2738, Predicted Zn-dependent protease [General function
prediction only].
Length = 226
Score = 32.0 bits (73), Expect = 0.65
Identities = 29/95 (30%), Positives = 40/95 (42%), Gaps = 10/95 (10%)
Query: 152 FGSAVASYFTFLRWLFFVNFILALGLILF--------VTIPEEWQHSKRELANVMNLLIP 203
FGS +A L L +L LG+ILF VT+P E+ S R L + +L I
Sbjct: 125 FGSNLAPLLFILGILLGSTGLLWLGIILFSAAVLFQLVTLPVEFDASSRALKQLESLGII 184
Query: 204 WELRIKEIESHFGSAVASYF--TFLRWLFFVNFIL 236
E K +A +Y + L + FIL
Sbjct: 185 RENEEKGARKVLSAAALTYVAAALMALLQLLRFIL 219
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 32.8 bits (74), Expect = 0.68
Identities = 26/117 (22%), Positives = 51/117 (43%), Gaps = 4/117 (3%)
Query: 498 RPQTVWHKNEAVVVIWIIGVTFPRLLEKLGNLEQLHPRKHLRMLLARNEYSESIGCQIIK 557
V +++ + ++ F L +K +L QL R L E E ++I+
Sbjct: 542 DVSKVANESISSNHEKLMESEFEELKKKWSSLAQLKSRFQKDATLDSIEGEE----ELIQ 597
Query: 558 QDDVTQFQRIIDSLDSSVDLTRYACQIYLIKKQEEEKEEDLEEMEKEEVEEEKKEMK 614
D+ F+ + D +SS + + + + EE +E E E+EE +K+E++
Sbjct: 598 DDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREENARKKEELR 654
>gnl|CDD|178497 PLN02909, PLN02909, Endoglucanase.
Length = 486
Score = 32.2 bits (73), Expect = 0.84
Identities = 11/22 (50%), Positives = 15/22 (68%)
Query: 291 GYYNNQDNSRYKTPLAFFIVTL 312
GYY+ DN +Y P+AF + TL
Sbjct: 81 GYYDAGDNVKYGLPMAFTVTTL 102
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 30.3 bits (69), Expect = 0.90
Identities = 10/21 (47%), Positives = 14/21 (66%)
Query: 589 KQEEEKEEDLEEMEKEEVEEE 609
EEK+E+ EE E++E EE
Sbjct: 75 AAAEEKKEEEEEEEEKEESEE 95
Score = 29.2 bits (66), Expect = 2.2
Identities = 14/22 (63%), Positives = 16/22 (72%)
Query: 588 KKQEEEKEEDLEEMEKEEVEEE 609
EE+KEE+ EE EKEE EEE
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEE 96
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 32.2 bits (73), Expect = 0.91
Identities = 14/29 (48%), Positives = 18/29 (62%)
Query: 587 IKKQEEEKEEDLEEMEKEEVEEEKKEMKK 615
K E E++ED EE E+E+ EEE KK
Sbjct: 309 PAKPEIEQDEDSEESEEEKNEEEGGLSKK 337
Score = 29.2 bits (65), Expect = 8.7
Identities = 9/31 (29%), Positives = 20/31 (64%), Gaps = 1/31 (3%)
Query: 586 LIKKQEEEKEEDLEEMEKEEVE-EEKKEMKK 615
I++ E+ +E + E+ E+E ++ K++KK
Sbjct: 313 EIEQDEDSEESEEEKNEEEGGLSKKGKKLKK 343
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 32.0 bits (73), Expect = 0.96
Identities = 13/26 (50%), Positives = 19/26 (73%)
Query: 588 KKQEEEKEEDLEEMEKEEVEEEKKEM 613
K +EEE+E++ EE E+EE EE + E
Sbjct: 347 KDEEEEQEDEEEEEEEEEPEEPEPEE 372
>gnl|CDD|151656 pfam11214, Med2, Mediator complex subunit 2. This family of
mediator complex subunit 2 proteins is conserved in
fungi. Cyclin-dependent kinase CDK8 or Srb10 interacts
with and phosphorylates Med2. Post-translational
modifications of Mediator subunits are important for
regulation of gene expression.
Length = 99
Score = 29.3 bits (66), Expect = 1.4
Identities = 14/47 (29%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
Query: 559 DDVTQFQRIIDSLDSSVDLTR-YACQIYLIKKQEEEKEEDLEEMEKE 604
+ +T+F I+D +S ++ T+ Y + KK++EE+E +E E++
Sbjct: 52 EKITKFHSILDDTESKLNDTKWYVDTLVENKKKQEEEERKKKEEEEK 98
>gnl|CDD|233099 TIGR00710, efflux_Bcr_CflA, drug resistance transporter, Bcr/CflA
subfamily. This subfamily of drug efflux proteins, a
part of the major faciliator family, is predicted to
have 12 membrane-spanning regions. Members with known
activity include Bcr (bicyclomycin resistance protein)
in E. coli, Flor (chloramphenicol and florfenicol
resistance) in Salmonella typhimurium DT104, and CmlA
(chloramphenicol resistance) in Pseudomonas sp. plasmid
R1033.
Length = 385
Score = 31.2 bits (71), Expect = 1.5
Identities = 18/108 (16%), Positives = 42/108 (38%), Gaps = 10/108 (9%)
Query: 126 HSKRELANVMNLLIPWELRIKEIESHFGSAVASYFTF--LRWLFFVNFILALGLILFVTI 183
+ EL+ + ++L+P + G + + ++ + + IL LI F+ +
Sbjct: 122 YPGEELSRIYSILMPVLALAPAVAPLLGGYILVWLSWHAIFAFLSLAGILLSALIFFI-L 180
Query: 184 PEEWQHSKRELAN--VMNLLIPWELRIKEIESH---FGSAVASYFTFL 226
PE + R + L+ + K + +G++ +F F
Sbjct: 181 PETLPYEARIQISTYFETFLLLL--KSKMFWGYALIYGASFGGFFAFF 226
Score = 29.3 bits (66), Expect = 7.4
Identities = 25/156 (16%), Positives = 47/156 (30%), Gaps = 42/156 (26%)
Query: 189 HSKRELANVMNLLIPWELRIKEIESHFGSAVASYFTF--LRWLFFVNFILALGLILFVTI 246
+ EL+ + ++L+P + G + + ++ + + IL LI F+ +
Sbjct: 122 YPGEELSRIYSILMPVLALAPAVAPLLGGYILVWLSWHAIFAFLSLAGILLSALIFFI-L 180
Query: 247 PELL------SNPTDCREMKKPLPEEEKESRKLYTL-----------------FEFEGIL 283
PE L T L + K Y L F + I+
Sbjct: 181 PETLPYEARIQISTYFETFLLLL--KSKMFW-GYALIYGASFGGFFAFFSGAPFVYIDIM 237
Query: 284 KYSPIFYGYYNNQDNSRYKTPLAFFIVTLLLYIYSF 319
SP +G F + + + F
Sbjct: 238 GVSPSVFGLL-------------FALNIIAMIFGGF 260
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 29.6 bits (67), Expect = 1.6
Identities = 12/22 (54%), Positives = 15/22 (68%)
Query: 588 KKQEEEKEEDLEEMEKEEVEEE 609
EEE+EE+ EE E+EE EE
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEE 94
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 30.8 bits (70), Expect = 1.6
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 587 IKKQEEEKEEDLEEMEKEEVEEEKKE 612
IKK+ E++E EE + E E+ ++E
Sbjct: 154 IKKERAEEKEREEEEKAAEEEKAREE 179
>gnl|CDD|217633 pfam03600, CitMHS, Citrate transporter.
Length = 377
Score = 31.1 bits (71), Expect = 1.8
Identities = 24/127 (18%), Positives = 39/127 (30%), Gaps = 21/127 (16%)
Query: 132 ANVMNLLIPWELRIKEIESHFGS---AVASYFTFLRWLFFVNFILALGL-ILFVTIPEEW 187
AN+ +L PW G A+ F FF I + +L V + +
Sbjct: 143 ANIGGMLTPW-----------GGPQNRAAAGLNFFPTDFFAPMIPGVIAGLLIVILGAYY 191
Query: 188 QHSKRELANVMNLLIPWELRIKEIESHFGSAVASYFTFLRWLFFVNFILALGLILFVTIP 247
K L ++E+E+ L + N L L L+L + +
Sbjct: 192 LGKKERKRLGEGELELERDTLEELEAE---YEIRKRP---LLSWSNLALVLVLMLVLLLL 245
Query: 248 ELLSNPT 254
L
Sbjct: 246 GLWPGWV 252
>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein. This entry includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 106
Score = 29.3 bits (66), Expect = 1.9
Identities = 13/33 (39%), Positives = 17/33 (51%), Gaps = 7/33 (21%)
Query: 590 QEEE-------KEEDLEEMEKEEVEEEKKEMKK 615
QEEE + ED EE E+ EE +K +K
Sbjct: 72 QEEERRSARLARLEDREEERLEKEEEREKRARK 104
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 31.3 bits (71), Expect = 2.0
Identities = 11/28 (39%), Positives = 22/28 (78%)
Query: 588 KKQEEEKEEDLEEMEKEEVEEEKKEMKK 615
++++EE+E+ ++ E+ + EEEK+E KK
Sbjct: 45 EEKKEEEEKTTDKEEEVDEEEEKEEKKK 72
Score = 30.1 bits (68), Expect = 3.7
Identities = 12/28 (42%), Positives = 21/28 (75%)
Query: 588 KKQEEEKEEDLEEMEKEEVEEEKKEMKK 615
K++EE+ + EE+++EE +EEKK+ K
Sbjct: 48 KEEEEKTTDKEEEVDEEEEKEEKKKKTK 75
Score = 30.1 bits (68), Expect = 4.1
Identities = 13/28 (46%), Positives = 21/28 (75%)
Query: 588 KKQEEEKEEDLEEMEKEEVEEEKKEMKK 615
K++++E+EE + E+E EEE+KE KK
Sbjct: 44 KEEKKEEEEKTTDKEEEVDEEEEKEEKK 71
Score = 29.4 bits (66), Expect = 6.3
Identities = 14/32 (43%), Positives = 22/32 (68%)
Query: 584 IYLIKKQEEEKEEDLEEMEKEEVEEEKKEMKK 615
IYL ++E EKE EE E+E+ E++++E K
Sbjct: 23 IYLWVEKEVEKEVPDEEEEEEKEEKKEEEEKT 54
Score = 29.0 bits (65), Expect = 9.7
Identities = 15/28 (53%), Positives = 19/28 (67%)
Query: 588 KKQEEEKEEDLEEMEKEEVEEEKKEMKK 615
KK+EEEK D EE EE E+E+K+ K
Sbjct: 47 KKEEEEKTTDKEEEVDEEEEKEEKKKKT 74
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 29.5 bits (67), Expect = 2.0
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 588 KKQEEEKEEDLEEMEKEEVEEEKKEMKK 615
+K E E E+ +E+E+ E E ++E+K
Sbjct: 45 EKAEYELEKLEKELEELEAELARRELKA 72
Score = 27.6 bits (62), Expect = 8.2
Identities = 13/31 (41%), Positives = 21/31 (67%), Gaps = 3/31 (9%)
Query: 588 KKQEEEKEE--DLEEM-EKEEVEEEKKEMKK 615
KKQEE+K E L +M +E+ E E ++++K
Sbjct: 26 KKQEEKKSEAEKLAKMSAEEKAEYELEKLEK 56
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 29.0 bits (65), Expect = 2.1
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 589 KQEEEKEEDLEEMEKEEVEEEKKE 612
EE+ EE EE +KEE +EE++E
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEE 96
Score = 29.0 bits (65), Expect = 2.5
Identities = 13/22 (59%), Positives = 17/22 (77%)
Query: 588 KKQEEEKEEDLEEMEKEEVEEE 609
+K EE++EE +E EKEE EEE
Sbjct: 76 EKAEEKEEEKKKEEEKEEEEEE 97
>gnl|CDD|218140 pfam04547, Anoctamin, Calcium-activated chloride channel. The
family carries eight putative transmembrane domains,
and, although it has no similarity to other known
channel proteins, it is clearly a calcium-activated
ionic channel. It is expressed in various secretory
epithelia, the retina and sensory neurons, and mediates
receptor-activated chloride currents in diverse
physiological processes.
Length = 449
Score = 30.6 bits (70), Expect = 2.3
Identities = 15/35 (42%), Positives = 20/35 (57%), Gaps = 1/35 (2%)
Query: 148 IESHFGSAVASYFTFLRWLFFVNFILA-LGLILFV 181
I S+FG +A YF FL + + A LGLI F+
Sbjct: 1 IRSYFGEKIALYFAFLGFYTKMLLPPAVLGLICFL 35
Score = 30.6 bits (70), Expect = 2.3
Identities = 15/35 (42%), Positives = 20/35 (57%), Gaps = 1/35 (2%)
Query: 211 IESHFGSAVASYFTFLRWLFFVNFILA-LGLILFV 244
I S+FG +A YF FL + + A LGLI F+
Sbjct: 1 IRSYFGEKIALYFAFLGFYTKMLLPPAVLGLICFL 35
>gnl|CDD|215115 PLN02171, PLN02171, endoglucanase.
Length = 629
Score = 30.5 bits (69), Expect = 2.8
Identities = 20/66 (30%), Positives = 31/66 (46%), Gaps = 12/66 (18%)
Query: 291 GYYNNQDNSRYKTPLAFFIVTLLLYIYSFVAILKRMAANSKMSKLADKDDECVFTWKLFS 350
GYY+ DN ++ P+AF T+ + +S + K+MAA ++ D W
Sbjct: 77 GYYDAGDNVKFGLPMAF---TVTMMSWSIIEYGKQMAAAGELGHAMD-----AVKW---- 124
Query: 351 GWDYMI 356
G DY I
Sbjct: 125 GTDYFI 130
>gnl|CDD|233126 TIGR00785, dass, anion transporter. The Divalent Anion:Na+
Symporter (DASS) Family (TC 2.A.47) Functionally
characterized proteins of the DASS family transport (1)
organic di- and tricarboxylates of the Krebs Cycle as
well as dicarboxylate amino acid, (2) inorganic sulfate
and (3) phosphate. The animal NaDC-1 cotransport 3 Na+
with each dicarboxylate. Protonated tricarboxylates are
also cotransported with 3Na+ [Transport and binding
proteins, Anions, Transport and binding proteins,
Cations and iron carrying compounds].
Length = 444
Score = 30.4 bits (69), Expect = 2.9
Identities = 20/87 (22%), Positives = 32/87 (36%), Gaps = 27/87 (31%)
Query: 165 WLFFVNFILA----------LGLILFVTIPEEWQHSKRELANVMNLLIPWELRIKEIESH 214
WL F+ A L + + IP ++ +L+ K
Sbjct: 11 WLLLAIFVAAIVGWILEPLPLPVTALLAIP------------IIAVLLGVLSAFK----- 53
Query: 215 FGSAVASYFTFLRWLFFVNFILALGLI 241
F +A++ + WLFF FILA L+
Sbjct: 54 FKNALSGFADPTIWLFFGAFILATALV 80
>gnl|CDD|225437 COG2882, FliJ, Flagellar biosynthesis chaperone [Cell motility and
secretion / Intracellular trafficking and secretion /
Posttranslational modification, protein turnover,
chaperones].
Length = 148
Score = 29.2 bits (66), Expect = 3.7
Identities = 24/97 (24%), Positives = 45/97 (46%), Gaps = 17/97 (17%)
Query: 536 KHLRMLLA-RNEYSESIGCQIIKQDDVTQ---FQRIIDSLDSSVDLTRYAC--------- 582
+ L+ML RNEY +++ ++ Q +Q+ I L+ ++D +
Sbjct: 37 EQLKMLSGYRNEYEQNLNEKLKSGVSAAQWQNYQQFISQLEVAIDQQQSQLSKLRKQVEQ 96
Query: 583 --QIYLIKKQEEEKEEDLEEMEKEEV--EEEKKEMKK 615
+I+ K+ E + E L+E +K E EE ++E K
Sbjct: 97 KREIWQEKQIELKALEKLKERQKTEFLLEENRREQKI 133
>gnl|CDD|218013 pfam04298, Zn_peptidase_2, Putative neutral zinc metallopeptidase.
Zinc metallopeptidase zinc binding regions have been
predicted in some family members by a pattern match
(Prosite:PS00142).
Length = 222
Score = 29.8 bits (68), Expect = 3.7
Identities = 23/89 (25%), Positives = 39/89 (43%), Gaps = 23/89 (25%)
Query: 165 WLFFVNFILA-------LGLILF--------VTIPEEWQHSKRELA--NVMNLLIPWELR 207
LF + +L +G+ILF VT+P E+ S+R LA +L P EL+
Sbjct: 129 ILFIIGLLLGLGPTLLLIGIILFSAAVLFQLVTLPVEFDASRRALAILESGGILSPEELK 188
Query: 208 -IKEIES-----HFGSAVASYFTFLRWLF 230
K++ S + +A+ + L +
Sbjct: 189 GAKKVLSAAALTYVAAALTAIAQLLYLIL 217
>gnl|CDD|146805 pfam04358, DsrC, DsrC like protein. Family member dsvC has been
observed to co-purify with Desulfovibrio vulgaris
dissimilatory sulfite reductase, and many members of
this family are annotated as the third (gamma) subunit
of dissimilatory sulphite reductase. However, this
protein appears to be only loosely associated to the
sulfite reductase, which suggests that DsrC may not be
an integral part of the dissimilatory sulphite
reductase. Members of this family are found in organisms
such as E. coli and H. influenzae which do not contain
dissimilatory sulphite reductases but can synthesise
assimilatory sirohaem sulphite and nitrite reductases.
It is speculated that DsrC may be involved in the
assembly, folding or stabilisation of sirohaem proteins.
The strictly conserved cysteine in the C terminus
suggests that DsrC may have a catalytic function in the
metabolism of sulphur compounds.
Length = 109
Score = 28.3 bits (64), Expect = 3.9
Identities = 8/22 (36%), Positives = 13/22 (59%)
Query: 256 CREMKKPLPEEEKESRKLYTLF 277
+ + K L E++ S+ LY LF
Sbjct: 67 VKALGKKLGEDKGNSKYLYQLF 88
>gnl|CDD|235551 PRK05667, dnaG, DNA primase; Validated.
Length = 580
Score = 30.2 bits (69), Expect = 3.9
Identities = 21/98 (21%), Positives = 36/98 (36%), Gaps = 8/98 (8%)
Query: 518 TFPRLLEKLGNLEQLHPRKHLRMLLARNEYSESIGCQIIKQDDVTQFQRIIDSLDSSVDL 577
T +LLE L + L L E SE + K+ + ++ L +
Sbjct: 490 TGSQLLEHLRDAGLEELAALLESLAVWEEISEEDIAALEKE-----LKDALEKLRDQLLE 544
Query: 578 TRYACQIYLIKKQEEEKEEDLEEMEKEEVEEEKKEMKK 615
R LI K+ + L E+ E+ + E+K+
Sbjct: 545 ERLEE---LIAKERLLEGHGLSSEERLELLQLLIELKR 579
>gnl|CDD|163564 TIGR03852, sucrose_gtfA, sucrose phosphorylase. In the forward
direction, this enzyme uses phosphate to cleave sucrose
into D-fructose + alpha-D-glucose 1-phosphate.
Characterized representatives from Streptococcus mutans
and Bifidobacterium adolescentis represent
well-separated branches of a molecular phylogenetic
tree. In S. mutans, the region including this gene has
been associated with neighboring transporter genes and
multiple sugar metabolism.
Length = 470
Score = 29.7 bits (67), Expect = 4.7
Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 2/38 (5%)
Query: 267 EKESRKLYTLFEF--EGILKYSPIFYGYYNNQDNSRYK 302
E S K Y +F+F I + S + + +DNS+YK
Sbjct: 67 EALSEKYYLMFDFMINHISRQSEYYQDFLEKKDNSKYK 104
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 29.8 bits (67), Expect = 4.7
Identities = 9/25 (36%), Positives = 20/25 (80%)
Query: 588 KKQEEEKEEDLEEMEKEEVEEEKKE 612
+++E+EK + L++M ++E E+E+ E
Sbjct: 288 EEKEKEKRKRLKKMMEDEDEDEEME 312
>gnl|CDD|204614 pfam11221, Med21, Subunit 21 of Mediator complex. Med21 has been
known as Srb7 in yeasts, hSrb7 in humans and Trap 19 in
Drosophila. The heterodimer of the two subunits Med7 and
Med21 appears to act as a hinge between the middle and
the tail regions of Mediator.
Length = 132
Score = 28.4 bits (64), Expect = 5.0
Identities = 24/70 (34%), Positives = 39/70 (55%), Gaps = 13/70 (18%)
Query: 552 GCQIIKQDDVTQFQRI---IDSL---DSSVDLTRYACQIYLIKKQEEEKEEDLEEMEKEE 605
G + + +D + + Q+I IDSL +SS + Q+ IK+ EEE E E E+EE
Sbjct: 58 GQRELARDIILKAQQIEYLIDSLPGIESSEEE-----QLRRIKELEEELRE--VEAEREE 110
Query: 606 VEEEKKEMKK 615
+EK+++ K
Sbjct: 111 AVKEKEKLLK 120
>gnl|CDD|200378 TIGR04127, flavo_near_exo, exosortase F-associated protein.
Members of this protein family are always found next to
an exosortase/archaeosortase-like protein, and occur so
far only in the flavobacteria, within the Bacteroidetes.
Members do not have an obvious PEP-CTERM-like C-terminal
protein sorting domain.
Length = 136
Score = 28.3 bits (64), Expect = 5.2
Identities = 12/45 (26%), Positives = 22/45 (48%), Gaps = 14/45 (31%)
Query: 273 LYTLFEFEGILKYSPIFYGYYNNQDNSRYKTPLAFFIVTLLLYIY 317
+Y LF+ +GI+K+S I Y L F++ ++L+
Sbjct: 67 IYILFKDKGIIKFSAILY--------------LVVFVILIILFYI 97
>gnl|CDD|222791 PHA00431, PHA00431, internal virion protein C.
Length = 746
Score = 29.8 bits (67), Expect = 5.2
Identities = 12/61 (19%), Positives = 28/61 (45%)
Query: 555 IIKQDDVTQFQRIIDSLDSSVDLTRYACQIYLIKKQEEEKEEDLEEMEKEEVEEEKKEMK 614
+ Q+D + + L + +D + ++ ++ + EE +++ K E + KEM
Sbjct: 318 ALNQEDPSTGWEQLQGLKAELDKLQPGEEMTPQRQWLIQAEEQMQDRMKRETAAQAKEMD 377
Query: 615 K 615
K
Sbjct: 378 K 378
>gnl|CDD|219516 pfam07690, MFS_1, Major Facilitator Superfamily.
Length = 346
Score = 29.3 bits (66), Expect = 5.8
Identities = 24/109 (22%), Positives = 37/109 (33%), Gaps = 10/109 (9%)
Query: 152 FGSAVASYFTFLRWLFFVNFILAL------GLILFVTIPEEWQHSKRELANVMNLLIPWE 205
G +AS F + R F + ILAL L+L PE + E A +
Sbjct: 140 LGGLLASLFGW-RAAFLILAILALLAAVLAALLLPRPPPESKRPKPAEEAPAPLVPAWKL 198
Query: 206 LRIKEIES---HFGSAVASYFTFLRWLFFVNFILALGLILFVTIPELLS 251
L + ++F L +L +L L +L + L
Sbjct: 199 LLRDPVLWLLLALLLFGFAFFALLTYLPLYQEVLGLSALLAGLLLGLAG 247
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 26.9 bits (60), Expect = 6.3
Identities = 12/28 (42%), Positives = 18/28 (64%)
Query: 588 KKQEEEKEEDLEEMEKEEVEEEKKEMKK 615
KK+ + +EE EE E+ EE+KE +K
Sbjct: 26 KKELKAEEEKREEEEEARKREERKEREK 53
>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
Length = 627
Score = 29.3 bits (67), Expect = 6.5
Identities = 11/32 (34%), Positives = 19/32 (59%)
Query: 584 IYLIKKQEEEKEEDLEEMEKEEVEEEKKEMKK 615
IY +K +E + + EKE++E KE+K+
Sbjct: 536 IYQTEKTLKELGDKVPADEKEKIEAAIKELKE 567
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 29.1 bits (65), Expect = 7.1
Identities = 14/36 (38%), Positives = 24/36 (66%), Gaps = 9/36 (25%)
Query: 587 IKKQEEEKEEDLEEMEKE---------EVEEEKKEM 613
+KK+EE ++LEE+EKE E++EEK+++
Sbjct: 75 LKKEEERLLDELEELEKEDDDLDGELVELQEEKEQL 110
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 28.9 bits (64), Expect = 7.6
Identities = 20/98 (20%), Positives = 44/98 (44%), Gaps = 5/98 (5%)
Query: 521 RLLEKLGNLEQLHPRKHLRMLLARNEYSESIGCQIIKQDDVTQFQRIIDSLDSSVDLTRY 580
R +K +E K + LL ++ +ES+ + + ++ + +R+ + +
Sbjct: 192 RFRKKSSKIEIEEVEKKVDDLLEKDMKAESVSVVLKDEKELARQERVSSWENFKEEPGEP 251
Query: 581 ACQIYLIKKQEEEKEEDLEEMEKEE-----VEEEKKEM 613
+ L K+++ +EE E M +E+ E E KE+
Sbjct: 252 LSRPALKKEKQGAEEEGEEGMSEEDLDVGAAEIENKEV 289
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 29.1 bits (66), Expect = 7.7
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 589 KQEEEKEEDLEEMEKEEVEEE 609
EE+EE+ EE E+EE EE
Sbjct: 299 AAAEEEEEEEEEEEEEEPSEE 319
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 29.3 bits (65), Expect = 7.8
Identities = 9/25 (36%), Positives = 15/25 (60%)
Query: 588 KKQEEEKEEDLEEMEKEEVEEEKKE 612
K+ EE ++ EE +K E +E+K
Sbjct: 1667 AKKAEEDKKKAEEAKKAEEDEKKAA 1691
Score = 29.3 bits (65), Expect = 8.8
Identities = 15/31 (48%), Positives = 20/31 (64%), Gaps = 2/31 (6%)
Query: 587 IKKQEEEKEEDLEEMEKEEVEEEKK--EMKK 615
+KK EEE + EE +KE E++KK E KK
Sbjct: 1721 LKKAEEENKIKAEEAKKEAEEDKKKAEEAKK 1751
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 29.4 bits (66), Expect = 8.4
Identities = 15/30 (50%), Positives = 23/30 (76%)
Query: 586 LIKKQEEEKEEDLEEMEKEEVEEEKKEMKK 615
L +K+EE+ EE EE++ EEV+EE +E +K
Sbjct: 279 LKEKKEEKDEEKSEEVKTEEVDEEFEEEEK 308
>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein.
Length = 122
Score = 27.6 bits (62), Expect = 8.4
Identities = 29/110 (26%), Positives = 46/110 (41%), Gaps = 18/110 (16%)
Query: 521 RLLEKLGNLEQLHPRKHLRMLLA-RNEYSESI--GCQIIKQDDVTQFQRIIDSLDSSVDL 577
L E LEQ + L L R EY + + Q I ++ +Q I +LD ++
Sbjct: 6 NLAEAQQELEQA--EEKLEELQQYRLEYRQQLSGSGQGISAAELRNYQAFISALDEAIAQ 63
Query: 578 TR-----------YACQIYLIKKQEEEKEEDLEE--MEKEEVEEEKKEMK 614
+ A + + QE +K E L E +KE+ EE ++E K
Sbjct: 64 QQQELEQAEKQVEQAREQWQEANQERKKLEKLLEREKKKEQKEENRREQK 113
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 28.6 bits (64), Expect = 8.5
Identities = 12/29 (41%), Positives = 15/29 (51%)
Query: 587 IKKQEEEKEEDLEEMEKEEVEEEKKEMKK 615
K+ E KEED + E E EE +KE
Sbjct: 17 CCKENENKEEDKGKEEDLEFEEIEKEEII 45
>gnl|CDD|218534 pfam05278, PEARLI-4, Arabidopsis phospholipase-like protein (PEARLI
4). This family contains several phospholipase-like
proteins from Arabidopsis thaliana which are homologous
to PEARLI 4.
Length = 234
Score = 28.6 bits (64), Expect = 8.9
Identities = 13/30 (43%), Positives = 19/30 (63%), Gaps = 1/30 (3%)
Query: 586 LIKKQE-EEKEEDLEEMEKEEVEEEKKEMK 614
L+KKQE E++E +L EKE E +K +
Sbjct: 168 LLKKQEMEKQEAELVRKEKEVKEFREKVEE 197
>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
and diverse group of secondary transporters that
includes uniporters, symporters, and antiporters. MFS
proteins facilitate the transport across cytoplasmic or
internal membranes of a variety of substrates including
ions, sugar phosphates, drugs, neurotransmitters,
nucleosides, amino acids, and peptides. They do so using
the electrochemical potential of the transported
substrates. Uniporters transport a single substrate,
while symporters and antiporters transport two
substrates in the same or in opposite directions,
respectively, across membranes. MFS proteins are
typically 400 to 600 amino acids in length, and the
majority contain 12 transmembrane alpha helices (TMs)
connected by hydrophilic loops. The N- and C-terminal
halves of these proteins display weak similarity and may
be the result of a gene duplication/fusion event. Based
on kinetic studies and the structures of a few bacterial
superfamily members, GlpT (glycerol-3-phosphate
transporter), LacY (lactose permease), and EmrD
(multidrug transporter), MFS proteins are thought to
function through a single substrate binding site,
alternating-access mechanism involving a rocker-switch
type of movement. Bacterial members function primarily
for nutrient uptake, and as drug-efflux pumps to confer
antibiotic resistance. Some MFS proteins have medical
significance in humans such as the glucose transporter
Glut4, which is impaired in type II diabetes, and
glucose-6-phosphate transporter (G6PT), which causes
glycogen storage disease when mutated.
Length = 352
Score = 28.8 bits (65), Expect = 8.9
Identities = 26/106 (24%), Positives = 38/106 (35%), Gaps = 24/106 (22%)
Query: 152 FGSAVASYFTFLRWLFFVNFILALG-----LILFVTIPEEWQHSKRELANVMN------- 199
GS + ++ + L L F+L LG I E + +R A +
Sbjct: 76 LGSLLLAFASSLWLLLVGRFLLGLGGGALYPAAAALIAEWFPPKERGRALGLFSAGFGLG 135
Query: 200 -LLIPWELRIKEIESHFGSAVASYFTFLRWLFFVNFILALGLILFV 244
LL P G +A + RWLF + IL L L L +
Sbjct: 136 ALLGPL----------LGGLLAESLGW-RWLFLILAILGLLLALLL 170
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 28.5 bits (64), Expect = 9.0
Identities = 8/40 (20%), Positives = 25/40 (62%)
Query: 576 DLTRYACQIYLIKKQEEEKEEDLEEMEKEEVEEEKKEMKK 615
+ + Q + +KK ++++++ EE K+ +++++E +K
Sbjct: 178 EEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERK 217
>gnl|CDD|220750 pfam10432, bact-PGI_C, Bacterial phospho-glucose isomerase
C-terminal region. This is the C-terminal half of a
bacterial phospho-glucose isomerase EC:5.3.1.9 protein
which is similar to eukaryote homologues to the extent
that the sequence includes the cluster of threonines and
serines that forms the sugar phosphate-binding site in
conventional PGI. This domain contributes a good
proportion of the active catalytic site residues. This
PGI uses the same catalytic mechanisms for both glucose
ring-opening and isomerisation for the interconversion
of glucose 6-phosphate to fructose 6-phosphate. It is
associated with family SIS, pfam01380.
Length = 154
Score = 28.1 bits (63), Expect = 9.1
Identities = 12/63 (19%), Positives = 25/63 (39%)
Query: 178 ILFVTIPEEWQHSKRELANVMNLLIPWELRIKEIESHFGSAVASYFTFLRWLFFVNFILA 237
++ + E+ +KR + + + EIE+ GS + + + F + LA
Sbjct: 74 VVLLRDREDDPRTKRRADITEEIAEDRGVNVIEIEAEGGSPLERLASLIYLGDFASVYLA 133
Query: 238 LGL 240
L
Sbjct: 134 LIR 136
>gnl|CDD|215625 PLN03192, PLN03192, Voltage-dependent potassium channel;
Provisional.
Length = 823
Score = 29.1 bits (65), Expect = 9.4
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 460 LHPRKHLRMLLASFHLLFLLGISAYTI 486
LH + M+ F++LF LG++AY I
Sbjct: 270 LHAVNTIEMIFIIFYMLFNLGLTAYLI 296
>gnl|CDD|215573 PLN03098, LPA1, LOW PSII ACCUMULATION1; Provisional.
Length = 453
Score = 28.7 bits (64), Expect = 9.7
Identities = 13/34 (38%), Positives = 19/34 (55%)
Query: 217 SAVASYFTFLRWLFFVNFILALGLILFVTIPELL 250
S V + F +R F+V F A G+ F T+P L+
Sbjct: 190 SEVQAPFRGVRKFFYVAFTAAAGISTFFTVPRLI 223
>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
Length = 222
Score = 28.5 bits (63), Expect = 9.9
Identities = 10/26 (38%), Positives = 19/26 (73%)
Query: 587 IKKQEEEKEEDLEEMEKEEVEEEKKE 612
K Q++E +E E++E+ E++ EK+E
Sbjct: 150 SKNQKQENQEKQEKLEESEIQSEKQE 175
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.138 0.409
Gapped
Lambda K H
0.267 0.0857 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 32,907,422
Number of extensions: 3402821
Number of successful extensions: 7653
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7466
Number of HSP's successfully gapped: 205
Length of query: 615
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 512
Effective length of database: 6,369,140
Effective search space: 3260999680
Effective search space used: 3260999680
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 62 (27.4 bits)