RPS-BLAST 2.2.22 [Sep-27-2009]
Database: CddA
21,609 sequences; 6,263,737 total letters
Searching..................................................done
Query= gi|254781019|ref|YP_003065432.1| hydroxymethylglutaryl-coenzyme
A synthase [Candidatus Liberibacter asiaticus str. psy62]
(396 letters)
>gnl|CDD|33231 COG3425, PksG, 3-hydroxy-3-methylglutaryl CoA synthase [Lipid
metabolism].
Length = 377
Score = 334 bits (859), Expect = 2e-92
Identities = 125/388 (32%), Positives = 179/388 (46%), Gaps = 15/388 (3%)
Query: 1 MAIGIEDISFYTTNQYLDLSVIAEKYRLDVAKFYVGIGQERMSVLNPDEDIVTMAAAAAL 60
M IGI I Y + L +A + +D K G+G E SV DED VTMA AA
Sbjct: 1 MMIGIVGIGAYIPRYRIKLEELARAWGVDPEKIKKGLGVEEKSVPPWDEDAVTMAVEAAR 60
Query: 61 PIM--QNQDKNLIDTLFFATESSVDQSKSAAIWLHKLLGLNSSCRVVELKQACYSATCAL 118
+ + D + I + +ES D K A + + LGLN S R +++ ACY+ T AL
Sbjct: 61 NALKRADIDPSKIGAVIVGSESGPDAVKPTATIVAEALGLNPSARAADVEFACYAGTAAL 120
Query: 119 HMACALVAKSPERKVLIVASDVARYDLGSSGEPTQGCGAVAILISSQTSILEIEDITGIY 178
A V L++ +D A+Y G + E T G GAVA LI I+ + TG Y
Sbjct: 121 QAAIGWVESGMISYGLVIGADTAQYAPGDALEYTAGAGAVAFLIGKNPPIVAEFEGTGSY 180
Query: 179 TNDCMDFWRPNYRRTALVDGKYSTKIYLQSLEAVWHDYQKNKGHDFNDFQYFCYHQPFTR 238
T D DFWRP+ + VDG++S Y + +E Y + G +DF Y +HQP +
Sbjct: 181 TTDTPDFWRPDGQPYPYVDGRFSEPAYFKHVENAAKGYMEKTGLSPDDFDYIVFHQPNGK 240
Query: 239 MAEKAHIRLSKIVHQNLSVAEIEKAIGITTLYNRLIGNSYTASLYIALISLLDHSSENLA 298
+KA L E+ + +Y + IGN+YT SL + L SLLD++ L
Sbjct: 241 FPKKAAKSL----------GFKEEQVKPGLVYPQRIGNTYTGSLLLGLASLLDNAK--LP 288
Query: 299 GKRVGFFSYGSGCVAEFFSGIVQKNYRKKSHQKRHQHIINSRIPISYQVYCDLHQDIIPS 358
G R+ FSYGSG +E FS V ++ + ++ R +SY Y +
Sbjct: 289 GDRILLFSYGSGAGSEAFSITVTDGIEERRKKLAPTVLLEQRNRLSYIDYELYFKRRKKI 348
Query: 359 VEGNCTIPHTTTGPFRLATIQNHKRIYE 386
G +G + L I +H R Y
Sbjct: 349 RLGK-DNEFYDSGDYYLEEIDDHFRRYR 375
>gnl|CDD|29414 cd00827, init_cond_enzymes, "initiating" condensing enzymes are a
subclass of decarboxylating condensing enzymes,
including beta-ketoacyl [ACP] synthase, type III and
polyketide synthases, type III, which include chalcone
synthase and related enzymes. They are characterized by
the utlization of CoA substrate primers, as well as the
nature of their active site residues..
Length = 324
Score = 200 bits (510), Expect = 5e-52
Identities = 114/332 (34%), Positives = 152/332 (45%), Gaps = 25/332 (7%)
Query: 3 IGIEDISFYTTNQYLDLSVIAEKYRLDVAKFYVGIGQERMSVLNPDEDIVTMAAAAALPI 62
+GIE I Y +D +AE +D K+ GIGQ M+ DED+ TMA AA
Sbjct: 2 VGIEAIGAYLPRYRVDNEELAEGLGVDPGKYTTGIGQRHMA--GDDEDVPTMAVEAARRA 59
Query: 63 MQN--QDKNLIDTLFFATESSVDQSKSAAIWLHKLLGLNSSCRVVELKQACYSATCALHM 120
++ D + I L ATES +D+ KSAA +L +LLGL + +LKQACY T AL +
Sbjct: 60 LERAGIDPDDIGLLIVATESPIDKGKSAATYLAELLGLTN-AEAFDLKQACYGGTAALQL 118
Query: 121 ACALVAKSPERKVLIVASDVARY--DLGSSGEPTQGCGAVAILISSQTSILEIEDITGIY 178
A LV P R L+VASD+A Y D GS+ EPT G GA A+L+S IL ++
Sbjct: 119 AANLVESGPWRYALVVASDIASYLLDEGSALEPTLGDGAAAMLVSRNPGILAAGIVSTHS 178
Query: 179 TND---------CMDFWRPNYRRTALVDGKYSTKIYLQSLEAVWHDYQK---NKGHDFND 226
T+D MD P + A + EA K
Sbjct: 179 TSDPGYDFSPYPVMDGGYPKPCKLAYAIRLTAEPAGRAVFEAAHKLIAKVVRKALDRAGL 238
Query: 227 FQYFCYHQPFTRMAEKAHIRLSKIVHQNLSVAEIEKAIGITTLYNRLIGNSYTASLYIAL 286
+ Y P +K ++K + A + I R +GN Y AS+ + L
Sbjct: 239 SEDIDYFVPHQPNGKKILEAVAKKLGGPPEKASQTRWI-----LLRRVGNMYAASILLGL 293
Query: 287 ISLLDHSSENLAGKRVGFFSYGSGCVAEFFSG 318
SLL+ S + AG RV FSYGSG AE F
Sbjct: 294 ASLLE-SGKLKAGDRVLLFSYGSGFTAEAFVL 324
>gnl|CDD|36607 KOG1393, KOG1393, KOG1393, Hydroxymethylglutaryl-CoA synthase
[Lipid transport and metabolism].
Length = 462
Score = 172 bits (438), Expect = 1e-43
Identities = 110/398 (27%), Positives = 183/398 (45%), Gaps = 53/398 (13%)
Query: 3 IGIEDISFYTTNQYLDLSVIAEKYRLDVAKFYVGIGQERMSVLNPDEDIVTMAAAAALPI 62
+GI I Y +QY+D + + + K+ +G+GQ +M + EDI++++ +
Sbjct: 12 VGIIAIEIYFPSQYVDQEELEKFDGVSAGKYTIGLGQTQMGFCSDREDIISLSLTVVSRL 71
Query: 63 M--QNQDKNLIDTLFFATESSVDQSKSAAIWLHKLLGL--NSSCRVVELKQACYSATCAL 118
M N D + I L TE+ +D+SKS L +L N+ ++ ACY T AL
Sbjct: 72 MERYNIDPDSIGRLEVGTETIIDKSKSVKTVLMQLFEESGNTDIEGIDTTNACYGGTAAL 131
Query: 119 HMACALVAKSP--ERKVLIVASDVARYDLGSSGEPTQGCGAVAILISSQTSILEIEDITG 176
A + S R ++V D+A Y G++ PT G GAVA+LI I+ +
Sbjct: 132 FNAVNWIESSSWDGRYAIVVCGDIAVYAKGNA-RPTGGAGAVAMLIGPNAPIIFERGLRA 190
Query: 177 IYTNDCMDFWRPNY-RRTALVDGKYSTKIYLQSLEAVWHDY--------QKNKGHDFNDF 227
+ DF++P+ +VDGK S + YL +L+ + Y QK+ + N F
Sbjct: 191 THMQHAYDFYKPDLLSEYPVVDGKLSIQCYLSALDRCYTVYRKKIAAQWQKSGSDNLNSF 250
Query: 228 QYFCYHQPFTRMAEKAHIRL--------------------SKIVHQNLSVAEIEKA---- 263
Y +H P+ ++ +K+ RL + + ++ + ++EKA
Sbjct: 251 DYIIFHSPYCKLVQKSLARLLYNDFLLNPSQLPESAGLESTLKLDESYTDRDLEKAFIKI 310
Query: 264 --------IGITTLYNRLIGNSYTASLYIALISLLD-HSSENLAGKRVGFFSYGSGCVAE 314
+ L R GN YTASLY +L SLL ++ LAGKR+G FSYGSG A
Sbjct: 311 SKPIFKKKTKPSLLAPRRNGNMYTASLYASLASLLSAVPADELAGKRIGMFSYGSGLAAS 370
Query: 315 FFSGIVQKNY----RKKSHQKRHQHIINSRIPISYQVY 348
FS V ++ + + + ++SR ++ + +
Sbjct: 371 MFSLRVTQDATPLDKIIASLTDLKKRLDSRKCVTPEEF 408
>gnl|CDD|110176 pfam01154, HMG_CoA_synt_N, Hydroxymethylglutaryl-coenzyme A
synthase N terminal.
Length = 171
Score = 84.9 bits (210), Expect = 3e-17
Identities = 56/167 (33%), Positives = 85/167 (50%), Gaps = 9/167 (5%)
Query: 3 IGIEDISFYTTNQYLDLSVIAEKY-RLDVAKFYVGIGQERMSVLNPDEDIVTMAAAAALP 61
+GI + Y +QY+D + + EKY + K+ +G+GQ +M EDI ++
Sbjct: 2 VGIVALEIYFPSQYVDQAEL-EKYDGVSAGKYTIGLGQTKMGFCTDREDINSLCLTVVQK 60
Query: 62 IMQ--NQDKNLIDTLFFATESSVDQSKSAAIWLHKLLG--LNSSCRVVELKQACYSATCA 117
+M+ N D + I L TE+ +D+SKS L +L N+ ++ ACY T A
Sbjct: 61 LMERNNLDYDCIGRLEVGTETIIDKSKSVKTVLMQLFEESGNTDIEGIDTTNACYGGTAA 120
Query: 118 LHMACALVAKSP--ERKVLIVASDVARYDLGSSGEPTQGCGAVAILI 162
L A + S R L+VA D+A Y G++ PT G GAVA+LI
Sbjct: 121 LFNALNWIESSSWDGRYALVVAGDIAIYAKGNA-RPTGGAGAVAMLI 166
>gnl|CDD|29417 cd00830, KAS_III, Ketoacyl-acyl carrier protein synthase III
(KASIII) initiates the elongation in type II fatty acid
synthase systems. It is found in bacteria and plants.
Elongation of fatty acids in the type II systems occurs
by Claisen condensation of malonyl-acyl carrier protein
(ACP) with acyl-ACP. KASIII initiates this process by
specifically using acetyl-CoA over acyl-CoA..
Length = 320
Score = 66.7 bits (163), Expect = 1e-11
Identities = 58/295 (19%), Positives = 107/295 (36%), Gaps = 35/295 (11%)
Query: 35 VGIGQERMSVLNPDEDIVTMAAAAALPIMQNQ--DKNLIDTLFFATESSVDQSKSAAIWL 92
GI + R++ +P E +A AA +++ D + ID + AT + + A +
Sbjct: 36 TGIRERRIA--DPGETTSDLAVEAAKKALEDAGIDADDIDLIIVATSTPDYLFPATACLV 93
Query: 93 HKLLGLNSSCRVVELKQACYSATCALHMACALVAKSPERKVLIVASDVAR---YDLGSSG 149
LG + ++ AC L A L+ + VL+V ++ S
Sbjct: 94 QARLGA-KNAAAFDINAACSGFLYGLSTAAGLIRSGGAKNVLVVGAETLSRILDWTDRST 152
Query: 150 EPTQGCGAVAILISSQTSILEI-----------EDITGIYTNDCMDFWRPNYRRTALV-- 196
G GA A+++ + I D+ I + +
Sbjct: 153 AVLFGDGAGAVVLEATEEDPGILDSVLGSDGSGADLLTIPAGGSRSPFEDAEGGDPYLVM 212
Query: 197 DGKYSTKIYLQSLEAVWHDYQKNKGHDFNDFQYFCYHQPFTRMAEKAHIRLSKIVHQNLS 256
DG+ K ++ + + + G +D +F HQ A++R+ + V + L
Sbjct: 213 DGREVFKFAVRLMPESIEEALEKAGLTPDDIDWFVPHQ--------ANLRIIEAVAKRLG 264
Query: 257 VAEIEKAIGITTLYNRLIGNSYTASLYIALISLLDHSSENLAGKRVGFFSYGSGC 311
+ E EK + Y GN+ AS+ +AL + G V +G+G
Sbjct: 265 LPE-EKVVVNLDRY----GNTSAASIPLAL-DEAIEEGKLKKGDLVLLLGFGAGL 313
>gnl|CDD|30680 COG0332, FabH, 3-oxoacyl-[acyl-carrier-protein].
Length = 323
Score = 58.3 bits (141), Expect = 4e-09
Identities = 59/297 (19%), Positives = 106/297 (35%), Gaps = 45/297 (15%)
Query: 36 GIGQERMSVLNPDEDIVTMAAAAALPIMQNQ--DKNLIDTLFFATESSVDQSKSAAIWLH 93
GI + R++ DE +A AA +++ + ID + AT + S A +
Sbjct: 39 GIRERRIA--ADDETTSDLAVEAARKALEDAGISPDDIDLIIVATSTPDHLFPSTACLVQ 96
Query: 94 KLLGLNSSCRVVELKQACYSATCALHMACALVAKSPERKVLIVASDVA--RYDLGS-SGE 150
LGL +L+ AC AL +A L+ + VL+V ++ D
Sbjct: 97 ARLGLGG-APAFDLQAACSGFLYALSVADGLIRSGGYKNVLVVGAETLSRILDWTDRDTC 155
Query: 151 PTQGCGAVAILISS---QTSILEIEDITGIYTNDCMDFWRPNYRRTALVDGKYSTKIYLQ 207
G GA A+++ + IL+ + + D + + P + + +
Sbjct: 156 VLFGDGAGAVVLEATEDDNGILDTDLGSDGSQGDLL--YLPGGGSATPKEESGGGLLVMD 213
Query: 208 ----------SLEAVWHDYQKNKGHDFNDFQYFCYHQPFTRM----AEKAHIRLSKIVHQ 253
++ + + G D +F HQ R+ A+K I K+V
Sbjct: 214 GREVFKFAVRAMPKAIEEVLEKAGLTPEDIDWFVPHQANLRIIEAIAKKLGIPEEKVV-- 271
Query: 254 NLSVAEIEKAIGITTLYNRLIGNSYTASLYIALISLLDHSSENLAGKRVGFFSYGSG 310
++V + GN+ AS+ +AL L G V ++G G
Sbjct: 272 -VTVDK--------------YGNTSAASIPLALDEALREGRIK-PGDLVLLEAFGGG 312
>gnl|CDD|109260 pfam00195, Chal_sti_synt_N, Chalcone and stilbene synthases,
N-terminal domain. The C-terminal domain of Chalcone
synthase is reported to be structurally similar to
domains in thiolase and beta-ketoacyl synthase. The
differences in activity are accounted for by differences
in this N-terminal domain.
Length = 228
Score = 43.1 bits (102), Expect = 1e-04
Identities = 34/104 (32%), Positives = 50/104 (48%), Gaps = 9/104 (8%)
Query: 45 LNPDEDIVT-----MAAAAALPIMQ--NQDKNLIDTLFFATESSVDQSKSAAIWLHKLLG 97
L+ +DI + AAL ++ Q K+ I L F T S VD A L KLLG
Sbjct: 91 LDARQDIAVVEVPKLGKEAALKAIKEWGQPKSKITHLVFCTTSGVDMP-GADYQLTKLLG 149
Query: 98 LNSSC-RVVELKQACYSATCALHMACALVAKSPERKVLIVASDV 140
L S RV+ +Q C++ L +A L + +VL+V S++
Sbjct: 150 LRPSVKRVMLYQQGCFAGGTVLRLAKDLAENNKGARVLVVCSEI 193
>gnl|CDD|29418 cd00831, CHS_like, Chalcone and stilbene synthases; plant-specific
polyketide synthases (PKS) and related enzymes, also
called type III PKSs. PKS generate an array of different
products, dependent on the nature of the starter
molecule. They share a common chemical strategy, after
the starter molecule is loaded onto the active site
cysteine, a carboxylative condensation reation extends
the polyketide chain. Plant-specific PKS are dimeric
iterative PKSs, using coenzyme A esters to deliver
substrate to the active site, but they differ in the
choice of starter molecule and the number of
condensation reactions..
Length = 361
Score = 42.5 bits (100), Expect = 2e-04
Identities = 62/322 (19%), Positives = 108/322 (33%), Gaps = 39/322 (12%)
Query: 15 QYLDLSVIAEKYRLDVAKFYVGIGQERMSVLNPDEDIVTMAAAAALPIMQN--QDKNLID 72
+YL L E Y ER + E+ +A AA + + ID
Sbjct: 53 RYLVLPGGEETYAPR--PEMSPSLDERNDI--ALEEARELAEEAARGALDEAGLRPSDID 108
Query: 73 TLFFATESSVDQSKSAAIWLHKLLGLNSSCRVVELKQ-ACYSATCALHMACALVAKSPER 131
L T + + S L LGL + L C + AL +A L+ +P
Sbjct: 109 HLVVNTSTGNP-TPSLDAMLINRLGLRPDVKRYNLGGMGCSAGAIALDLAKDLLEANPGA 167
Query: 132 KVLIVASD-----VARYDLGSSGEPTQ--GCGAVAILISS-------QTSILEIEDI-TG 176
+VL+V+++ D S G GA A+L+S+ + + E+ +
Sbjct: 168 RVLVVSTELCSLWYRGPDHRSMLVGNALFGDGAAAVLLSNDPRDRRRERPLFELVRAAST 227
Query: 177 IY--TNDCMDFWRPNYRRTALVDGKYSTKIYLQSLEAVWHDYQK--NKGHDFNDFQYFCY 232
+ + D M W V + ++ ++LE V G F ++C
Sbjct: 228 LLPDSEDAMG-WHLGEEGLTFVLSRDVPRLVEKNLERVLRKLLARLGIGLFKLAFDHWCV 286
Query: 233 HQPFTRMAEKAHIRLSKIVHQNLSVAEIEKAIGITTLYNRLIGNSYTASLYIALISLLDH 292
H + + V + L ++ + L GN ++S+ L +
Sbjct: 287 HPGGRAVLDA--------VEKALGLSPEDLEASRMVLRR--YGNMSSSSVLYVLAYMEAK 336
Query: 293 SSENLAGKRVGFFSYGSGCVAE 314
G R ++G G E
Sbjct: 337 GRVK-RGDRGLLIAFGPGFTCE 357
>gnl|CDD|73194 cd00327, cond_enzymes, Condensing enzymes; Family of enzymes that
catalyze a (decarboxylating or non-decarboxylating)
Claisen-like condensation reaction. Members are share
strong structural similarity, and are involved in the
synthesis and degradation of fatty acids, and the
production of polyketides, a diverse group of natural
products..
Length = 254
Score = 36.1 bits (83), Expect = 0.018
Identities = 22/97 (22%), Positives = 31/97 (31%), Gaps = 10/97 (10%)
Query: 68 KNLIDTLFFATESSVDQSKSAAIWLHKLLGLNSSCRVVELKQACYSATCALHMACALVAK 127
K I + T + AA L LG+ S + QAC + AL +A V
Sbjct: 26 KGPIVGVIVGTTGGSGEFSGAAGQLAYHLGI-SGGPAYSVNQACATGLTALALAVQQVQN 84
Query: 128 SPERKVLIVASDVARYDLGSSGEPTQGCGAVAILISS 164
VL S+ A A++ S
Sbjct: 85 GKADIVLAGGSEEF---------VFGDGAAAAVVESE 112
>gnl|CDD|73238 cd00825, decarbox_cond_enzymes, decarboxylating condensing enzymes;
Family of enzymes that catalyze the formation of a new
carbon-carbon bond by a decarboxylating Claisen-like
condensation reaction. Members are involved in the
synthesis of fatty acids and polyketides, a diverse
group of natural products. Both pathways are an
iterative series of additions of small carbon units,
usually acetate, to a nascent acyl group. There are 2
classes of decarboxylating condensing enzymes, which can
be distinguished by sequence similarity, type of active
site residues and type of primer units (acetyl CoA or
acyl carrier protein (ACP) linked units)..
Length = 332
Score = 32.4 bits (73), Expect = 0.21
Identities = 22/127 (17%), Positives = 36/127 (28%), Gaps = 29/127 (22%)
Query: 35 VGIGQERMSVLNPDEDIVTMAAAAALPIMQNQDKNLIDTLFFATESSVD---QSKSAAIW 91
+G + + I A A L + KN I + T + A+
Sbjct: 7 LGSYVSILGFEAAERAI----ADAGLS--REYQKNPIVGVVVGTGGGSPRFQVFGADAMR 60
Query: 92 ------------------LHKLLGLNSSCRVVELKQACYSATCALHMACALVAKSPERKV 133
+ LG++ V AC + AL +A V + V
Sbjct: 61 AVGPYVVTKAMFPGASGQIATPLGIHGPAYDV--SAACAGSLHALSLAADAVQNGKQDIV 118
Query: 134 LIVASDV 140
L S+
Sbjct: 119 LAGGSEE 125
>gnl|CDD|29416 cd00829, SCP-x_thiolase, Thiolase domain associated with sterol
carrier protein (SCP)-x isoform and related proteins;
SCP-2 has multiple roles in intracellular lipid
circulation and metabolism. The N-terminal presequence
in the SCP-x isoform represents a peroxisomal
3-ketacyl-Coa thiolase specific for branched-chain acyl
CoAs, which is proteolytically cleaved from the sterol
carrier protein..
Length = 375
Score = 31.4 bits (71), Expect = 0.42
Identities = 26/108 (24%), Positives = 40/108 (37%), Gaps = 6/108 (5%)
Query: 35 VGIGQERMSVLNPDEDIVTMAAAAALPIMQNQ--DKNLIDTLFFAT-ESSVDQSKSAAIW 91
VG+G D + +AA AA + + + ID + QS A+
Sbjct: 1 VGVGMTPFGRR-SDRSPLELAAEAARAALDDAGLEPADIDAVVVGNAAGGRFQSFPGALI 59
Query: 92 LHKLLGLNSSCRVVELKQACYSATCALHMACALVAKSPERKVLIVASD 139
L L VE A S + A+ A A +A VL+V ++
Sbjct: 60 AEYLGLLGKPATRVEA--AGASGSAAVRAAAAAIASGLADVVLVVGAE 105
>gnl|CDD|146146 pfam03358, FMN_red, NADPH-dependent FMN reductase.
Length = 147
Score = 30.6 bits (70), Expect = 0.85
Identities = 13/42 (30%), Positives = 15/42 (35%), Gaps = 9/42 (21%)
Query: 270 YNRLIGNSYTASLYIALISLLDHSSEN-LAGKRVGFFSYGSG 310
YN S AL + +D S L GK V S G
Sbjct: 79 YNG----SVPG----ALKNAIDWLSRKELRGKPVAIVSVSGG 112
>gnl|CDD|33230 COG3424, BcsA, Predicted naringenin-chalcone synthase [Secondary
metabolites biosynthesis, transport, and catabolism].
Length = 356
Score = 29.9 bits (67), Expect = 1.4
Identities = 16/68 (23%), Positives = 26/68 (38%), Gaps = 2/68 (2%)
Query: 71 IDTLFFATESSVDQSKSAAIWLHKLLGLNSSC-RVVELKQACYSATCALHMACALVAKSP 129
ID + T + + + S + LGL+ RV C + L A +P
Sbjct: 98 IDAVVTVTSTGL-ATPSLDARIVGELGLSPDTRRVPVWGLGCAAGAAGLARARDYCRAAP 156
Query: 130 ERKVLIVA 137
+ VL+V
Sbjct: 157 DANVLVVC 164
>gnl|CDD|32882 COG3068, COG3068, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 194
Score = 28.8 bits (64), Expect = 3.0
Identities = 22/97 (22%), Positives = 36/97 (37%), Gaps = 20/97 (20%)
Query: 188 PNYRRTALVDGKYSTKIYLQSLEAVWH-----------DYQKNKGH----DFNDFQYFCY 232
PNY +IY + L+ VW + Q K +DF +
Sbjct: 31 PNYAMFCQQTEFGDGQIYRRILDLVWETLTVKDAKVNFESQLEKLEEAIPSADDFDLYGV 90
Query: 233 HQPFTRMAEKAHIRLSKIVHQNLSVAEIEKAIGITTL 269
+ A A + LS+++H LS +E AI ++
Sbjct: 91 YP-----AIDACVALSELLHSRLSGETLEHAIEVSKT 122
>gnl|CDD|37176 KOG1965, KOG1965, KOG1965, Sodium/hydrogen exchanger protein
[Inorganic ion transport and metabolism].
Length = 575
Score = 28.7 bits (64), Expect = 3.3
Identities = 22/63 (34%), Positives = 29/63 (46%), Gaps = 6/63 (9%)
Query: 263 AIG-ITTLYNRLIGNSYTASLYIALISLLDHSSENLAGKRVGFFSYGSGCVAEFFSGIVQ 321
AIG I+ L + + T SL AL+ L+ + S LA SG V F GIV
Sbjct: 248 AIGLISALVLKFLYLRRTPSLESALMLLMSYLSYLLAE----GCGL-SGIVTVLFCGIVM 302
Query: 322 KNY 324
+Y
Sbjct: 303 SHY 305
>gnl|CDD|145954 pfam03078, ATHILA, ATHILA ORF-1 family. ATHILA is a group of
Arabidopsis thaliana retrotransposons belonging to the
Ty3/gypsy family of the long terminal repeat (LTR) class
of eukaryotic retrotransposons. The central region of
ATHILA retrotransposons contains two or three open
reading frames (ORFs). This family represents the ORF1
product. The function of ORF1 is unknown.
Length = 456
Score = 28.4 bits (63), Expect = 3.6
Identities = 20/87 (22%), Positives = 37/87 (42%), Gaps = 5/87 (5%)
Query: 168 ILEIEDITGIYTNDCMDFWRPNYRRTALVDGKYSTKIYLQSLEAVWHDYQKNKGHDFNDF 227
+L ++IT I D +DF A D +++ + + ++ D + F+
Sbjct: 339 LLPCKEITTIREGDNIDFKPSKEILGATDDIEHTEEATEEDVDEEDEDDEAELE-KFDTS 397
Query: 228 QY-FCYHQPFTRMAE---KAHIRLSKI 250
Y F H P R ++ +AH SK+
Sbjct: 398 MYHFEEHVPPARESKSLSEAHKNNSKL 424
>gnl|CDD|164495 CHL00091, apcE, phycobillisome linker protein.
Length = 877
Score = 27.9 bits (62), Expect = 4.6
Identities = 15/47 (31%), Positives = 26/47 (55%), Gaps = 5/47 (10%)
Query: 231 CYHQPFTRMAEKAH-IRL----SKIVHQNLSVAEIEKAIGITTLYNR 272
CY Q F R KA+ + L SK+ + +S+ E +A+G + +Y +
Sbjct: 279 CYRQIFERDIAKAYGLSLSELESKVKNGQISIKEFIRALGKSEIYRK 325
>gnl|CDD|144198 pfam00517, GP41, Retroviral envelope protein. This family includes
envelope protein from a variety of retroviruses. It
includes the GP41 subunit of the envelope protein
complex from human and simian immunodeficiency viruses
(HIV and SIV) which mediate membrane fusion during viral
entry. The family also includes bovine immunodeficiency
virus, feline immunodeficiency virus and Equine
infectious anaemia (EIAV). The family also includes the
Gp36 protein from mouse mammary tumour virus (MMTV) and
human endogenous retroviruses (HERVs).
Length = 204
Score = 28.2 bits (63), Expect = 4.6
Identities = 10/43 (23%), Positives = 18/43 (41%)
Query: 313 AEFFSGIVQKNYRKKSHQKRHQHIINSRIPISYQVYCDLHQDI 355
A+ +GIVQ + Q++ ++ RI V L +
Sbjct: 13 AQVLAGIVQNQTNLLNAQEQIDKLLQLRIWGLKNVVLVLGARV 55
Database: CddA
Posted date: Feb 4, 2011 9:38 PM
Number of letters in database: 6,263,737
Number of sequences in database: 21,609
Lambda K H
0.321 0.134 0.399
Gapped
Lambda K H
0.267 0.0852 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 21609
Number of Hits to DB: 4,714,560
Number of extensions: 241920
Number of successful extensions: 501
Number of sequences better than 10.0: 1
Number of HSP's gapped: 481
Number of HSP's successfully gapped: 22
Length of query: 396
Length of database: 6,263,737
Length adjustment: 96
Effective length of query: 300
Effective length of database: 4,189,273
Effective search space: 1256781900
Effective search space used: 1256781900
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 58 (25.9 bits)