RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16002
(229 letters)
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 35.4 bits (82), Expect = 0.020
Identities = 31/151 (20%), Positives = 50/151 (33%), Gaps = 11/151 (7%)
Query: 46 LESYHVAISEAPKVPKSSSEARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLE 105
L + V + EA +E V L +A + + + + D +
Sbjct: 152 LRGLKFLLVRLGLVRREKLEALVGVIEDEV---ALYG-ENVEASVVIVVAHGAEDLDKVS 207
Query: 106 MKLNSLACEELTQHAGDTDDKSVENKLRERLKEMSQTLERV-SRTYELRQMQSTQLTNEL 164
LN L E D + ++L E + E+ LE + S L + + EL
Sbjct: 208 KILNELGFELYEVPEFDGGPSELISELEEVIAEIQDELESLRSELEALAEK----IAEEL 263
Query: 165 KMANNALMNSFEKGKKKSQSRLKRLENEIVI 195
L EK S+L R E + I
Sbjct: 264 LAVREIL--EIEKALGDVLSKLARTEYTLAI 292
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 32.6 bits (75), Expect = 0.18
Identities = 35/210 (16%), Positives = 75/210 (35%), Gaps = 31/210 (14%)
Query: 29 IDKLDKERASIGNTTVKLESY----HVAISEAPKVPKSSSE--ARKMDLEMAVLMQELLS 82
+ KL + KL SY + E KV S E + ++ E+ + +E+
Sbjct: 45 LRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKE 104
Query: 83 MREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLKEMSQT 142
+ E+ +EL I LE E + LE + ++ L K +S
Sbjct: 105 LEEEISELENEIKELEQEIERLE-PWGNF---------------DLDLSLLLGFKYVSVF 148
Query: 143 LERVSRTYELRQMQSTQLTNELKMANN------ALMNSFEKGKKKSQSRLKRLENE-IVI 195
+ V + + N ++ + ++ E + LK+L E + +
Sbjct: 149 VGTVPEDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVE-EELKKLGFERLEL 207
Query: 196 MSERHATQV-QHLKQRIVSLEEHIARYNSK 224
E +++ + +K+ + +E+ +
Sbjct: 208 EEEGTPSELIREIKEELEEIEKERESLLEE 237
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 32.5 bits (74), Expect = 0.20
Identities = 20/69 (28%), Positives = 36/69 (52%), Gaps = 8/69 (11%)
Query: 29 IDKLDKERASIGNTTVKLESYHVAISE-----APKVPKSSSEARKMD--LEMAVLMQELL 81
I+K+DK A+ +L Y++ I E P +P S+S+ +D LE +L+ E+
Sbjct: 354 INKIDKANANTERIKQQLAKYNL-IPEKWGGDTPMIPISASQGTNIDKLLETILLLAEIE 412
Query: 82 SMREDKAEL 90
++ D +L
Sbjct: 413 DLKADPTQL 421
>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
[Posttranslational modification, protein turnover,
chaperones].
Length = 782
Score = 31.4 bits (72), Expect = 0.40
Identities = 31/125 (24%), Positives = 53/125 (42%), Gaps = 36/125 (28%)
Query: 78 QELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLK 137
QE+L + K L + LE E D+L+ +E ++R+++K
Sbjct: 182 QEILETLDVKERLEKLLDLLEKEIDLLQ----------------------LEKRIRKKVK 219
Query: 138 EMSQTLERVSRTYELR-QMQSTQ----LTNELKMANNALMNSFEKG------KKKSQSRL 186
E +E+ R Y LR Q+++ Q ++ K L EK K+K++ L
Sbjct: 220 E---QMEKSQREYYLREQLKAIQKELGEDDDDKDEVEELREKIEKLKLPKEAKEKAEKEL 276
Query: 187 KRLEN 191
K+LE
Sbjct: 277 KKLET 281
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 31.5 bits (72), Expect = 0.42
Identities = 34/147 (23%), Positives = 65/147 (44%), Gaps = 9/147 (6%)
Query: 76 LMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDD--KSVENKLR 133
L+ + S E E I + E E ++ + EEL ++ + KL
Sbjct: 290 LIDKYESHIEKALEELESILDTEKENSEFKLDV-----EELKALLEALEEILEKNLQKLE 344
Query: 134 ERLKEMSQTLERVSRTYELRQMQSTQLT-NELKMANNALMNSFEKGKKKSQSRLKRLE-N 191
E+LK+ S ++E S T + + NEL +N +++ +K K K++ +L
Sbjct: 345 EKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNKAKKKLWLHLVA 404
Query: 192 EIVIMSERHATQVQHLKQRIVSLEEHI 218
E+ + + + + L++ I SLE+ I
Sbjct: 405 ELKEDIDAYQKEKKGLEKAINSLEKEI 431
>gnl|CDD|220664 pfam10267, Tmemb_cc2, Predicted transmembrane and coiled-coil 2
protein. This family of transmembrane coiled-coil
containing proteins is conserved from worms to humans.
Its function is unknown.
Length = 387
Score = 30.9 bits (70), Expect = 0.50
Identities = 29/113 (25%), Positives = 53/113 (46%), Gaps = 12/113 (10%)
Query: 114 EELTQHAGDTDDKSVENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMN 173
A +KL E L+E+ + R+ +YE + QL + + +L
Sbjct: 192 GSGPGSAVAQQQSLNLDKLLEELREIKEGQSRLEESYERLK---EQLQRDYQYITQSL-- 246
Query: 174 SFEKGKKKSQSRLKRLENEIVIMSERHATQVQHLKQRIVSLEEHIARYNSKER 226
++ + R +RLE ++ ++E H ++ +LKQ + S+EE +A Y S ER
Sbjct: 247 ------QEERYRYERLEEQLNDLTELHQNEIANLKQELASMEEKVA-YQSYER 292
>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
exopolysaccharide biosynthesis [Cell envelope
biogenesis, outer membrane].
Length = 458
Score = 30.9 bits (70), Expect = 0.50
Identities = 21/160 (13%), Positives = 54/160 (33%), Gaps = 28/160 (17%)
Query: 70 DLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVE 129
+ +++ L +L S R A+ AR+ +L + + +
Sbjct: 238 EQQLSALNTQLQSARARLAQAEARLASLLQLLPLGREAAALR----------EVLESPTI 287
Query: 130 NKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRL 189
LR++ ++ Q + +S + Q L +L +++
Sbjct: 288 QDLRQQYAQVRQQIADLSTELGAKHPQLVALEAQLAELR------------------QQI 329
Query: 190 ENEIVIMSERHATQVQHLKQRIVSLEEHIARYNSKERNTS 229
E+ + ++ L+Q+ +LE+ +A+ +
Sbjct: 330 AAELRQILASLPNELALLEQQEAALEKELAQLKGRLSKLP 369
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 31.2 bits (71), Expect = 0.50
Identities = 33/165 (20%), Positives = 75/165 (45%), Gaps = 11/165 (6%)
Query: 70 DLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQH-AGDTDDKSV 128
+LE+A+L+ L +RE+ EL+ + E E + L +L L E+L + ++ +
Sbjct: 224 ELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELE-EKLEELRLEVSELEEE 282
Query: 129 ENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKG------KKKS 182
+L++ L ++ + R+ + ++ + + L +L+ L K +
Sbjct: 283 IEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAEL 342
Query: 183 QSRLKRLENEIVIMSERHAT---QVQHLKQRIVSLEEHIARYNSK 224
+ +L+ L+ E+ + +++ L+ R+ LEE + SK
Sbjct: 343 EEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSK 387
Score = 30.8 bits (70), Expect = 0.74
Identities = 42/208 (20%), Positives = 93/208 (44%), Gaps = 21/208 (10%)
Query: 20 AEMQLLLNNIDKLDKERASIGNTTVKLESYHVAISEAPKVPKSSSEARKMDLEMAVLMQE 79
E+ LL+ +++L +E + + E ++ +E ++++ ++ L E
Sbjct: 225 LELALLVLRLEELREELEELQEELKEAEEELEELT---------AELQELEEKLEELRLE 275
Query: 80 LLSMREDKAELRARIYN-------LEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKL 132
+ + E+ EL+ +Y LE +K +L +L +L + A + +S ++L
Sbjct: 276 VSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDEL 335
Query: 133 RERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENE 192
E L E+ + LE + E + + +L EL+ + L E+ + S++ +LE +
Sbjct: 336 AEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLR-SKVAQLELQ 394
Query: 193 IVIMSERHATQVQHLKQRIVSLEEHIAR 220
I +++ L+ R+ LE+ R
Sbjct: 395 I----ASLNNEIERLEARLERLEDRRER 418
Score = 29.3 bits (66), Expect = 1.8
Identities = 39/196 (19%), Positives = 68/196 (34%), Gaps = 37/196 (18%)
Query: 64 SEARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDT 123
E ++ E+ L +++ ++R+D A L A + LE L +L L E
Sbjct: 712 EELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEER-- 769
Query: 124 DDKSVENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNAL------------ 171
+ E L E +E + E + + L L L
Sbjct: 770 -----LEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRE 824
Query: 172 -MNSFEKGKKKSQSRLKRLENEIVIMSER-----------------HATQVQHLKQRIVS 213
+ S E+ ++ RL+ LE +I +SE ++++ L S
Sbjct: 825 RLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERAS 884
Query: 214 LEEHIARYNSKERNTS 229
LEE +A S+ S
Sbjct: 885 LEEALALLRSELEELS 900
>gnl|CDD|176470 cd01598, PurB, PurB_like adenylosuccinases (adenylsuccinate lyase,
ASL). This subgroup contains EcASL, the product of the
purB gene in Escherichia coli, and related proteins. It
is a member of the Lyase class I family of the Lyase_I
superfamily. Members of the Lyase class I family
function as homotetramers to catalyze similar
beta-elimination reactions in which a Calpha-N or
Calpha-O bond is cleaved with the subsequent release of
fumarate as one of the products. The four active sites
of the homotetrameric enzyme are each formed by residues
from three different subunits. ASL catalyzes two
non-sequential steps in the de novo purine biosynthesis
pathway: the conversion of
5-aminoimidazole-(N-succinylocarboxamide) ribotide
(SAICAR) into 5-aminoimidazole-4-carboxamide ribotide
(AICAR) and; the conversion of adenylsuccinate (SAMP)
into adenosine monophosphate (AMP).
Length = 425
Score = 30.7 bits (70), Expect = 0.66
Identities = 18/55 (32%), Positives = 26/55 (47%), Gaps = 9/55 (16%)
Query: 85 EDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLKEM 139
E+ LRA I E+D L +K E T H D K+VE L+E+ + +
Sbjct: 40 EELKFLRA-IIENFSEEDALRIK----EIEATTNH----DVKAVEYFLKEKFETL 85
>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Proteins. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs, all of which are expressed
during embryonic and early development in the nervous
system but with different localization and timing.
srGAPs contain an N-terminal F-BAR domain, a Rho GAP
domain, and a C-terminal SH3 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 241
Score = 30.0 bits (68), Expect = 0.80
Identities = 13/70 (18%), Positives = 28/70 (40%), Gaps = 2/70 (2%)
Query: 126 KSVENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALM--NSFEKGKKKSQ 183
+ + ++L + L + L+ +TY +S +LK A + K +
Sbjct: 116 REIGSQLHDELLRVLNELQTAMKTYHTYHAESKSAERKLKEAEKQEEKQEQSPEKKLERS 175
Query: 184 SRLKRLENEI 193
K++E E+
Sbjct: 176 RSSKKIEKEV 185
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 30.1 bits (68), Expect = 1.0
Identities = 23/96 (23%), Positives = 44/96 (45%), Gaps = 3/96 (3%)
Query: 135 RLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENEI- 193
+LK++ + + + + +Q Q +L +LK + S E ++ LK+L +I
Sbjct: 39 QLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETE-IASLEAQLIETADDLKKLRKQIA 97
Query: 194 VIMSERHATQVQHLKQRIVSLEEHIARYNSKERNTS 229
+ + +A +VQ +QR L E +A RN
Sbjct: 98 DLNARLNALEVQEREQRRR-LAEQLAALQRSGRNPP 132
Score = 27.0 bits (60), Expect = 9.5
Identities = 18/91 (19%), Positives = 31/91 (34%), Gaps = 8/91 (8%)
Query: 130 NKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRL 189
++L + + EL + S Q + K+A L+ +K + S L
Sbjct: 167 KATLKQLAAVRAEIAAEQA--ELTTLLSEQRAQQAKLA--QLLEERKKTLAQLNSELSAD 222
Query: 190 ENEIVIMSERHATQVQHLKQRIVSLEEHIAR 220
+ + E LK I S E A+
Sbjct: 223 QKK----LEELRANESRLKNEIASAEAAAAK 249
>gnl|CDD|146095 pfam03288, Pox_D5, Poxvirus D5 protein-like. This family includes
D5 from Poxviruses which is necessary for viral DNA
replication, and is a nucleic acid independent
nucleoside triphosphatase. Members of this family are
also found outside of poxviruses. This domain is a
DNA-binding winged HTH domain.
Length = 88
Score = 28.1 bits (63), Expect = 1.1
Identities = 11/41 (26%), Positives = 14/41 (34%)
Query: 155 MQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENEIVI 195
+ L N + SFEK KKK + I I
Sbjct: 43 LSKRTFQQRLAKHLNERIESFEKKKKKPGNVPNEYLEYIFI 83
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 30.0 bits (68), Expect = 1.2
Identities = 27/190 (14%), Positives = 63/190 (33%), Gaps = 28/190 (14%)
Query: 66 ARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACE------ELTQH 119
K + E L++E ++ K + ++ +LE E + L +++ L L +
Sbjct: 218 KEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEEL 277
Query: 120 AGDTDDKSVE--NKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMAN--------- 168
D E +++E++ E+ + + R+ ++ + L
Sbjct: 278 NKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAE 337
Query: 169 -----------NALMNSFEKGKKKSQSRLKRLENEIVIMSERHATQVQHLKQRIVSLEEH 217
+ + + + L+ L E+ + + A LK LE+
Sbjct: 338 IEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKL 397
Query: 218 IARYNSKERN 227
N +R
Sbjct: 398 KREINELKRE 407
Score = 28.5 bits (64), Expect = 3.9
Identities = 42/235 (17%), Positives = 89/235 (37%), Gaps = 27/235 (11%)
Query: 16 PDDKAEMQLLLNNIDKLDKERASIGNTTVKLESYHVAISEAPKVPKSSSEARKMDLEMA- 74
+ AE+Q L ++ L +E +S+ + ++E+ +S+ + ++E
Sbjct: 670 RSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLE 729
Query: 75 ------------------VLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEEL 116
L QE+ +++ + EL ARI LE + LE LN L
Sbjct: 730 QEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLS 789
Query: 117 TQHAGDTDDKSVE-NKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNAL---M 172
+ + + + R++ + +E+ L + + EL+ L +
Sbjct: 790 HSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQI 849
Query: 173 NSFEKGKKKSQSRLKRLENEIVIMSERHATQVQHLKQRIVSLEEHIARYNSKERN 227
S EK + + + LE E+ E ++ L+ R+ L++ ++ R
Sbjct: 850 KSIEKEIENLNGKKEELEEEL----EELEAALRDLESRLGDLKKERDELEAQLRE 900
Score = 28.1 bits (63), Expect = 4.9
Identities = 33/167 (19%), Positives = 74/167 (44%), Gaps = 7/167 (4%)
Query: 12 LQDRPDDKAEMQLLLNNI-DKLDKER-ASIGNTTVKLESYHVAISEA-----PKVPKSSS 64
+++ +D +++ LN++ +L R I KLE I K+ + +
Sbjct: 767 IEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTL 826
Query: 65 EARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTD 124
E ++ E+ L ++ + ++E + I NL +K+ LE +L L + D
Sbjct: 827 EKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGD 886
Query: 125 DKSVENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNAL 171
K ++L +L+E+ + +E + E ++ + ++L +L+ L
Sbjct: 887 LKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEEL 933
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 29.7 bits (67), Expect = 1.4
Identities = 41/170 (24%), Positives = 72/170 (42%), Gaps = 3/170 (1%)
Query: 25 LLNNIDKLDKERASIGNTTVKLESYHVAISEAPKVPKSSSEARKMDLE--MAVLMQELLS 82
L I+ L +E +LE + EA + + A +LE L +EL
Sbjct: 328 LKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAE 387
Query: 83 MREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLKEMSQT 142
+ + AE+R + L+ E + LE +L L+ + ++ +L+ L+E+++
Sbjct: 388 LEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEE 447
Query: 143 LERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENE 192
LE + E + + +L EL L EK ++RL RLE E
Sbjct: 448 LEELEEQLEELRDRLKELERELAELQEELQR-LEKELSSLEARLDRLEAE 496
Score = 29.7 bits (67), Expect = 1.4
Identities = 39/229 (17%), Positives = 89/229 (38%), Gaps = 15/229 (6%)
Query: 3 ETYKVLLHTLQDRPDDKAEMQLLLNNIDKLDKERASIGNTTVKLESYHVAISEAPKVPKS 62
L + + +++ L +++L +E A++ +L+S + E
Sbjct: 692 SLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEE------- 744
Query: 63 SSEARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGD 122
E +++ E+ L + L + E+ L + L+ E + LE K +L E
Sbjct: 745 --ELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEEL 802
Query: 123 TDDKSVENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNAL-MNSFEKGKKK 181
+ + + L L+ + Q ER+ + E+ +++ E K+ + EK ++
Sbjct: 803 EEAERRLDALERELESLEQRRERLEQ--EIEELEEEIEELEEKLDELEEELEELEKELEE 860
Query: 182 SQSRLKRLENEIVIMSERHATQVQH---LKQRIVSLEEHIARYNSKERN 227
+ L+ LE E + + + L++ + LE +A +
Sbjct: 861 LKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEK 909
>gnl|CDD|233162 TIGR00871, zwf, glucose-6-phosphate 1-dehydrogenase. This enzyme
(EC 1.1.1.49) acts on glucose 6-phospate and reduces
NADP(+). An alternate name appearing in the literature
for the human enzyme, based on a slower activity with
beta-D-glucose, is glucose 1-dehydrogenase (EC
1.1.1.47), but that name more properly describes a
subfamily of the short chain dehydrogenases/reductases
family. This is a well-studied enzyme family, with
sequences available from well over 50 species. The
trusted cutoff is set above the score for the Drosophila
melanogaster CG7140 gene product, a homolog of unknown
function. G6PD homologs from the bacteria Aquifex
aeolicus and Helicobacter pylori lack several motifs
well conserved most other members, were omitted from the
seed alignment, and score well below the trusted cutoff
[Energy metabolism, Pentose phosphate pathway].
Length = 482
Score = 29.6 bits (67), Expect = 1.5
Identities = 12/60 (20%), Positives = 30/60 (50%), Gaps = 3/60 (5%)
Query: 86 DKAELRARIYN--LEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLKEMSQTL 143
+ R ++ ++ E + ++ +L+ + L+ +GD DD + L E L+++ +T
Sbjct: 47 SVEDFRKQVREAIIKFETEEIDEQLDDF-AQRLSYVSGDYDDDESYDSLNEHLEQLDKTR 105
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 28.8 bits (65), Expect = 2.0
Identities = 21/89 (23%), Positives = 38/89 (42%), Gaps = 5/89 (5%)
Query: 132 LRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQ--SRLKRL 189
RERL E+ Q L + Q Q +L E + L + ++ + +R+K+L
Sbjct: 64 ARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQL 123
Query: 190 ENEIVIMSERHA---TQVQHLKQRIVSLE 215
+ + E + ++ LKQ +LE
Sbjct: 124 SANAIELDEENRELREELAELKQENEALE 152
Score = 26.5 bits (59), Expect = 9.0
Identities = 24/107 (22%), Positives = 46/107 (42%), Gaps = 4/107 (3%)
Query: 60 PKSSSEARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQH 119
P + ++ E+A L +EL ++E AEL+ L+ E LE +L L E
Sbjct: 62 PSARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKE--LAR 119
Query: 120 AGDTDDKSVENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKM 166
++E L E +E+ + L + + E + ++ +L +
Sbjct: 120 IKQLSANAIE--LDEENRELREELAELKQENEALEAENERLQENEQR 164
>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
Length = 874
Score = 29.3 bits (67), Expect = 2.1
Identities = 33/140 (23%), Positives = 61/140 (43%), Gaps = 20/140 (14%)
Query: 19 KAEMQ--------LLLNNIDKLDKERASIGNTTV----KLESYHVAISEAPKVPKSSSEA 66
+AEM LLL D D+ R + +LES + ++ + P+ ++ A
Sbjct: 734 RAEMNIPPSKKLPLLLKGADAEDRARLEANEAYIKRLARLESLEIL-ADDEEAPEGAASA 792
Query: 67 RKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDK 126
E+ + ++ L+ + + A L + LE E + +E KL++ E A +
Sbjct: 793 VVGGAELFLPLEGLIDVEAELARLEKELAKLEKEIERVEKKLSN---EGFVAKA----PE 845
Query: 127 SVENKLRERLKEMSQTLERV 146
V K RE+L E + L ++
Sbjct: 846 EVVEKEREKLAEYEEKLAKL 865
>gnl|CDD|235599 PRK05769, PRK05769, 4-aminobutyrate aminotransferase; Provisional.
Length = 441
Score = 28.7 bits (65), Expect = 2.6
Identities = 18/86 (20%), Positives = 34/86 (39%), Gaps = 21/86 (24%)
Query: 109 NSLAC-----------EELTQHAGDTDDKSVENKLRERLKEMSQTLERVSRTYELRQMQS 157
N +A E L ++A + + LR+ LKE+ + E + L M
Sbjct: 322 NPVAAAAALATLEELEEGLLENA-----QKLGEYLRKELKELKEKYEFIGDVRGLGLMIG 376
Query: 158 TQLTNELK-----MANNALMNSFEKG 178
+L + K + + L +F++G
Sbjct: 377 VELVKDRKEPDPKLRDKVLYEAFKRG 402
>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
envelope biogenesis, outer membrane].
Length = 835
Score = 28.9 bits (65), Expect = 2.9
Identities = 23/106 (21%), Positives = 37/106 (34%), Gaps = 3/106 (2%)
Query: 69 MDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAG-DTDDKS 127
+ L ++QELL R + L A LE +L+ +NS EE A
Sbjct: 1 LALAGNNVLQELLQSRRE--LLTAESAQLEAALQLLQEAVNSKRQEEAEPAAEEAELQAE 58
Query: 128 VENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMN 173
+ + ++SQ L + + QL N L +
Sbjct: 59 LIQQELAINDQLSQALNQQTERLNALASDDRQLANLLLQLLQSSRT 104
>gnl|CDD|218417 pfam05075, DUF684, Protein of unknown function (DUF684). This
family contains several uncharacterized proteins from
Caenorhabditis elegans. The GO annotation suggests that
the protein is involved in nematode larval development
and has a positive regulation on growth rate.
Length = 346
Score = 28.6 bits (64), Expect = 2.9
Identities = 20/69 (28%), Positives = 33/69 (47%), Gaps = 5/69 (7%)
Query: 116 LTQHAGDTDDKSVENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNA--LMN 173
L ++ DT K + K +E +E+ Q + ++ + ++TN LKMA A L
Sbjct: 45 LMKYMQDTV-KHPDKKTKEIFREVYQKHSPLELAEKMIILLEQEVTNPLKMAMKADPLRT 103
Query: 174 S--FEKGKK 180
FEK +K
Sbjct: 104 KETFEKWRK 112
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 28.8 bits (64), Expect = 3.1
Identities = 20/91 (21%), Positives = 40/91 (43%)
Query: 87 KAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLKEMSQTLERV 146
+AE + RI LE + L K + + + ++ +L++RLK+M+
Sbjct: 215 RAENQQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKMTIQRRDE 274
Query: 147 SRTYELRQMQSTQLTNELKMANNALMNSFEK 177
Q+++ QL +L+ L +S +K
Sbjct: 275 ETERIDLQLENEQLHEDLRTLQERLESSQQK 305
>gnl|CDD|178106 PLN02488, PLN02488, probable pectinesterase/pectinesterase
inhibitor.
Length = 509
Score = 28.4 bits (63), Expect = 3.5
Identities = 26/86 (30%), Positives = 37/86 (43%), Gaps = 13/86 (15%)
Query: 55 EAPKVPKSSSEARKMDLEMAVLMQE-------LLSMREDKAELRARIYNLEHEKDVLEMK 107
E V S SE R DL + VL ++ + ED L E E D+L +K
Sbjct: 20 ELKTVSSSLSEMRCRDLLIIVLKNSVWRIDMAMIGVMEDTKLLE------EMENDMLGVK 73
Query: 108 LNSLACEELTQHAGDTDDKSVENKLR 133
++ EE+ + A D +SVE L
Sbjct: 74 EDTNLFEEMMESAKDRMIRSVEELLG 99
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 28.4 bits (64), Expect = 3.5
Identities = 27/109 (24%), Positives = 43/109 (39%), Gaps = 8/109 (7%)
Query: 81 LSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLKEMS 140
+++ A LRA + E E+ L+ L LA G + + E L
Sbjct: 77 QDLQDSVANLRASLSAAEAERSRLQALLAELA-GAGAAAEGRAGELAQE------LDSEK 129
Query: 141 QTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRL 189
Q R EL Q L +L AL ++ EK ++SQ+++ L
Sbjct: 130 QVSARALAQVELLNQQIAALRRQLAALEAAL-DASEKRDRESQAKIADL 177
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 28.1 bits (63), Expect = 3.7
Identities = 24/122 (19%), Positives = 41/122 (33%), Gaps = 15/122 (12%)
Query: 75 VLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACE--ELTQHAGDTDDKSVE--N 130
L + L M+ D+ L + + K L KL +L E L Q A + +
Sbjct: 151 GLEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELE 210
Query: 131 KLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLE 190
K R+ L+ +S + + EL+ L + E K L+ +
Sbjct: 211 KARQELRSLSVKISE-----------KRKQLEELQQELQELTIAIEALTNKKSELLEEIA 259
Query: 191 NE 192
Sbjct: 260 EA 261
>gnl|CDD|223294 COG0216, PrfA, Protein chain release factor A [Translation,
ribosomal structure and biogenesis].
Length = 363
Score = 27.9 bits (63), Expect = 4.4
Identities = 23/103 (22%), Positives = 46/103 (44%), Gaps = 21/103 (20%)
Query: 107 KLNSLAC--EELTQHAGDTDDKSVENKLRERLKEMSQTLERVSRTY-----------ELR 153
KL SL EEL D + S ++ R+ KE ++ LE + Y + +
Sbjct: 8 KLESLLERYEELEALLSDPEVISDPDEYRKLSKEYAE-LEPIVEKYREYKKAQEDLEDAK 66
Query: 154 QMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENEIVIM 196
+M + + E++ E+ K+ +++++ LE E+ I+
Sbjct: 67 EMLAEEKDPEMRE-------MAEEEIKELEAKIEELEEELKIL 102
>gnl|CDD|151799 pfam11358, DUF3158, Protein of unknown function (DUF3158). Some
members in this family of proteins are annotated as
integrase regulator R however this cannot be confirmed.
This family of proteins with unknown function appear to
be restricted to Proteobacteria.
Length = 160
Score = 27.3 bits (61), Expect = 4.6
Identities = 23/80 (28%), Positives = 37/80 (46%), Gaps = 6/80 (7%)
Query: 67 RKMD-LEMAVLM-QELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTD 124
R D M V + QEL++ ELR +Y +E ++ VL M+++ L L + A +
Sbjct: 83 RSHDRSRMGVAVWQELVADPATPVELRDDLYAMEKQRIVLNMQISLL--HSLGRQAAECA 140
Query: 125 DK--SVENKLRERLKEMSQT 142
K E RL+ S +
Sbjct: 141 SKLAQAEAVYLRRLQSPSNS 160
>gnl|CDD|189038 cd09868, PIN_XPG, PIN domain of Xeroderma pigmentosum
complementation group G (XPG) nuclease, a
structure-specific, divalent-metal-ion dependent, 5'
nuclease and homologs. The Xeroderma pigmentosum
complementation group G (XPG) nuclease plays a central
role in nucleotide excision repair (NER) in cleaving DNA
bubble structures or loops. XPG is a member of the
structure-specific, 5' nuclease family that catalyzes
hydrolysis of DNA duplex-containing nucleic acid
structures during DNA replication, repair, and
recombination. These nucleases contain a PIN (PilT N
terminus) domain with a helical arch/clamp region (I
domain). In XPG PIN domains, this arch region can be
quite variable and extensive (400 - 800 residues) in
length and is required for NER activity and for
efficient processing of bubble substrates. Inserted
within the PIN domain of these 5' nucleases is a H3TH
(helix-3-turn-helix) domain, an atypical
helix-hairpin-helix-2-like region. Both the H3TH domain
(not included here) and the helical arch/clamp region
are involved in DNA binding. Nucleases within this group
also have a carboxylate-rich active site that is
involved in binding essential divalent metal ion
cofactors (Mg2+/Mn2+).
Length = 249
Score = 27.6 bits (62), Expect = 4.8
Identities = 14/48 (29%), Positives = 21/48 (43%), Gaps = 2/48 (4%)
Query: 148 RTYELRQMQSTQLTNEL-KMANNALMNSFEKGKKKSQSR-LKRLENEI 193
RT R+ Q + K A L + +K KSQ + +R +EI
Sbjct: 86 RTLAKRRQQREKAEINAKKTAEKLLASQLKKSAVKSQQQQQERDADEI 133
>gnl|CDD|220879 pfam10805, DUF2730, Protein of unknown function (DUF2730). This
family of proteins with unknown function appears to be
restricted to Gammaproteobacteria.
Length = 106
Score = 26.6 bits (59), Expect = 4.8
Identities = 22/70 (31%), Positives = 32/70 (45%), Gaps = 7/70 (10%)
Query: 84 REDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVE-NKLRERLKEMSQT 142
RED +L I + + +E KL SL A D +E +LR LKE+S
Sbjct: 34 REDVEKLIDIIASNDKRLTEMETKLESLPT------AKDVHALKIEITELRGELKELSAQ 87
Query: 143 LERVSRTYEL 152
++ +S EL
Sbjct: 88 IQSISHQLEL 97
>gnl|CDD|204102 pfam08942, DUF1919, Domain of unknown function (DUF1919). This
domain has no known function. It is found in various
hypothetical and putative bacterial proteins.
Length = 198
Score = 27.3 bits (61), Expect = 5.1
Identities = 11/29 (37%), Positives = 16/29 (55%), Gaps = 2/29 (6%)
Query: 173 NSFEKGKKKSQSRLKRL--ENEIVIMSER 199
SFE+ K K R KR+ +N V ++R
Sbjct: 100 ASFEEAKNKWDERKKRINWDNLFVKFTDR 128
>gnl|CDD|234714 PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed.
Length = 438
Score = 27.9 bits (63), Expect = 5.2
Identities = 14/82 (17%), Positives = 35/82 (42%), Gaps = 1/82 (1%)
Query: 133 RERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENE 192
++RL + Q L+R LR + ++ E+ +++ + +RL
Sbjct: 311 QQRLDRLQQRLQRALERR-LRLAKQRLERLSQRLQQQNPQRRIERAQQRLEQLEQRLRRA 369
Query: 193 IVIMSERHATQVQHLKQRIVSL 214
+ +R +++ L Q++ +L
Sbjct: 370 MRRQLKRKRQRLEALAQQLEAL 391
>gnl|CDD|221296 pfam11894, DUF3414, Protein of unknown function (DUF3414). This
family of proteins are functionally uncharacterized.
This protein is found in eukaryotes. Proteins in this
family are typically between 764 to 2011 amino acids in
length. This protein has a conserved LLG sequence motif.
Length = 1612
Score = 27.7 bits (62), Expect = 6.0
Identities = 15/82 (18%), Positives = 33/82 (40%), Gaps = 5/82 (6%)
Query: 134 ERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENEI 193
E K++ +++ +S L EL+ +N + K +SR ++LE+
Sbjct: 5 EPFKDLYNSIQALSD---SDPDLLNDLEVELEKHKPDFLNLLDNPPKNDKSR-QQLESGK 60
Query: 194 VIMSERHATQV-QHLKQRIVSL 214
+ + + + Q Q + L
Sbjct: 61 ITLGDGDEYSLNQEFIQEALIL 82
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 27.3 bits (61), Expect = 7.2
Identities = 24/123 (19%), Positives = 47/123 (38%), Gaps = 16/123 (13%)
Query: 89 ELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLKEMSQTLERVSR 148
ELR + L E + L+ K+ + + KL L + + R
Sbjct: 24 ELRLDLARLLLENEELKQKVEEA----------LEGATNEDGKLAADLL-KLEVARKKER 72
Query: 149 TYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENEIVIMSERHATQVQHLK 208
++R + +QL E++ E+ K+ R L + + +R A+Q++ L+
Sbjct: 73 LNQIRA-RISQLKEEIE----QKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQ 127
Query: 209 QRI 211
I
Sbjct: 128 DEI 130
Score = 26.9 bits (60), Expect = 9.8
Identities = 15/89 (16%), Positives = 34/89 (38%), Gaps = 6/89 (6%)
Query: 79 ELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLKE 138
E+ +E ++RARI L+ E + ++ L + A + + + K
Sbjct: 65 EVARKKERLNQIRARISQLKEEIEQKRERIEEL------KRALAQRRSDLSSASYQLEKR 118
Query: 139 MSQTLERVSRTYELRQMQSTQLTNELKMA 167
+ LE++ + + + L + L
Sbjct: 119 RASQLEKLQDEIKRTRSKLNALHSLLAEK 147
>gnl|CDD|226228 COG3705, HisZ, ATP phosphoribosyltransferase involved in histidine
biosynthesis [Amino acid transport and metabolism].
Length = 390
Score = 27.3 bits (61), Expect = 7.9
Identities = 11/55 (20%), Positives = 16/55 (29%), Gaps = 4/55 (7%)
Query: 51 VAISEAPKVPKSSSEARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLE 105
+ + S E R E+ LL RE R + L + L
Sbjct: 190 LGLELLVLAAPLSPELRGRLSELL----ALLGGREVLERARGLLDELMAQGIALN 240
>gnl|CDD|215966 pfam00517, GP41, Retroviral envelope protein. This family includes
envelope protein from a variety of retroviruses. It
includes the GP41 subunit of the envelope protein
complex from human and simian immunodeficiency viruses
(HIV and SIV) which mediate membrane fusion during viral
entry. The family also includes bovine immunodeficiency
virus, feline immunodeficiency virus and Equine
infectious anaemia (EIAV). The family also includes the
Gp36 protein from mouse mammary tumour virus (MMTV) and
human endogenous retroviruses (HERVs).
Length = 204
Score = 27.0 bits (60), Expect = 7.9
Identities = 15/57 (26%), Positives = 29/57 (50%), Gaps = 4/57 (7%)
Query: 154 QMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENEIVIMSERHATQVQHLKQR 210
+QS Q+ + L+N+ E+ K Q R+ L+N ++++ R VQ L++
Sbjct: 9 SVQSAQVLAGIVQNQTNLLNAQEQIDKLLQLRIWGLKNVVLVLGAR----VQALEKY 61
>gnl|CDD|184150 PRK13569, PRK13569, anthranilate synthase component I; Provisional.
Length = 506
Score = 27.3 bits (61), Expect = 8.3
Identities = 24/103 (23%), Positives = 39/103 (37%), Gaps = 3/103 (2%)
Query: 52 AISEAPKVPKSSSEARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSL 111
AIS KVPK S +M + L++ + EL Y + ++ E K+
Sbjct: 127 AISLIEKVPKHHSRDTEMPTCHFFFCETLIAYDHETKELHFIHYVRLNGQETEEEKIEKY 186
Query: 112 --ACEELTQHAGDTDDKSVENKL-RERLKEMSQTLERVSRTYE 151
A E+ + E +L E + + E V+ YE
Sbjct: 187 KEAQAEIETLIEKLARRKAEKELLLPADSERTVSFEGVTSNYE 229
>gnl|CDD|239947 cd04506, SGNH_hydrolase_YpmR_like, Members of the SGNH-hydrolase
superfamily, a diverse family of lipases and esterases.
The tertiary fold of the enzyme is substantially
different from that of the alpha/beta hydrolase family
and unique among all known hydrolases; its active site
closely resembles the Ser-His-Asp(Glu) triad from other
serine hydrolases, but may lack the carboxlic acid. This
subfamily contains sequences similar to Bacillus YpmR.
Length = 204
Score = 26.8 bits (60), Expect = 8.5
Identities = 25/107 (23%), Positives = 45/107 (42%), Gaps = 29/107 (27%)
Query: 116 LTQHAGDTDDKS----VENKLRERLKEMSQTLE----RVSRTYE-LRQMQSTQLTNELKM 166
LT+ GD K +KL E T++ R+ + L+++++ ++ ELK
Sbjct: 9 LTEGVGDETGKGGYVGRLDKLIETKTVKKVTVQNFGVSGDRSDQLLKRLKTKKVQKELKK 68
Query: 167 AN--------NALMN------------SFEKGKKKSQSRLKRLENEI 193
A+ N LM F+K ++ Q+ LK++ EI
Sbjct: 69 ADVITITIGGNDLMQVLEKNFLSLDVEDFKKAEETYQNNLKKIFKEI 115
>gnl|CDD|204632 pfam11365, DUF3166, Protein of unknown function (DUF3166). This
eukaryotic family of proteins has no known function.
Length = 97
Score = 26.0 bits (57), Expect = 8.6
Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 5/67 (7%)
Query: 76 LMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSL-----ACEELTQHAGDTDDKSVEN 130
L +L ++E+ A +R ++ LE EKD +E +L + +HA D + E
Sbjct: 6 LRCQLQFVKEEAALMRKKLAKLEEEKDKVEGELQKYKSKYGDLDSGAEHAAGGPDTTREA 65
Query: 131 KLRERLK 137
+L+ RLK
Sbjct: 66 ELQLRLK 72
>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
Members of this protein family are homologs of ClpB, an
ATPase associated with chaperone-related functions.
These ClpB homologs, designated ClpV1, are a key
component of the bacterial pathogenicity-associated type
VI secretion system [Protein fate, Protein and peptide
secretion and trafficking, Cellular processes,
Pathogenesis].
Length = 852
Score = 27.2 bits (61), Expect = 9.0
Identities = 20/73 (27%), Positives = 33/73 (45%), Gaps = 3/73 (4%)
Query: 69 MDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSV 128
D +A L EL ++ + A L AR + EK+++E L A E A DD ++
Sbjct: 448 HDERLAELRAELAALEAELAALEAR---WQQEKELVEAILALRAELEADADAPADDDAAL 504
Query: 129 ENKLRERLKEMSQ 141
+L E ++
Sbjct: 505 RAQLAELEAALAS 517
>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein. This family of proteins
is found in bacteria. Proteins in this family are
typically between 397 and 1543 amino acids in length.
This family is the SidE protein in the Dot/Icm pathway of
Legionella pneumophila bacteria. There is little
literature describing the family.
Length = 1443
Score = 27.2 bits (60), Expect = 9.2
Identities = 25/86 (29%), Positives = 43/86 (50%), Gaps = 8/86 (9%)
Query: 63 SSEARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGD 122
S EAR+ +M + Q M E+K L A L +D LE + + A E L + +
Sbjct: 1011 SGEARQAKRQMDAIKQ----MVENKVVLDA----LTQCQDALEKQNIAGALEALKKIPSE 1062
Query: 123 TDDKSVENKLRERLKEMSQTLERVSR 148
+ ++ +LRE+++ + Q LE + R
Sbjct: 1063 KEMGTISRELREQIQSVRQELESLQR 1088
>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP). This family consists of
several PV-1 (PLVAP) proteins which seem to be specific
to mammals. PV-1 is a novel protein component of the
endothelial fenestral and stomatal diaphragms. The
function of this family is unknown.
Length = 442
Score = 26.9 bits (59), Expect = 9.2
Identities = 24/90 (26%), Positives = 43/90 (47%), Gaps = 1/90 (1%)
Query: 25 LLNNIDKLDKERASIGNTTVKLESYHVAISEAPKVPKSSSEARKMDLEMAVLMQELLSMR 84
L +I+++ +E A + ++ E A EA + ++AR+ L+ Q L++
Sbjct: 290 LRADIERVARENADLQRQKLEAEQAAQAAQEAKAKAEKEAQAREAKLQAECARQTQLAL- 348
Query: 85 EDKAELRARIYNLEHEKDVLEMKLNSLACE 114
E+KA LRA NL E + + + L E
Sbjct: 349 EEKAALRAERDNLAKELEAKKREAEQLRME 378
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.124 0.317
Gapped
Lambda K H
0.267 0.0704 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,968,738
Number of extensions: 1011854
Number of successful extensions: 1476
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1382
Number of HSP's successfully gapped: 292
Length of query: 229
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 135
Effective length of database: 6,768,326
Effective search space: 913724010
Effective search space used: 913724010
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 57 (25.8 bits)