RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254781019|ref|YP_003065432.1| hydroxymethylglutaryl-coenzyme A synthase [Candidatus Liberibacter asiaticus str. psy62] (396 letters) >gnl|CDD|33231 COG3425, PksG, 3-hydroxy-3-methylglutaryl CoA synthase [Lipid metabolism]. Length = 377 Score = 334 bits (859), Expect = 2e-92 Identities = 125/388 (32%), Positives = 179/388 (46%), Gaps = 15/388 (3%) Query: 1 MAIGIEDISFYTTNQYLDLSVIAEKYRLDVAKFYVGIGQERMSVLNPDEDIVTMAAAAAL 60 M IGI I Y + L +A + +D K G+G E SV DED VTMA AA Sbjct: 1 MMIGIVGIGAYIPRYRIKLEELARAWGVDPEKIKKGLGVEEKSVPPWDEDAVTMAVEAAR 60 Query: 61 PIM--QNQDKNLIDTLFFATESSVDQSKSAAIWLHKLLGLNSSCRVVELKQACYSATCAL 118 + + D + I + +ES D K A + + LGLN S R +++ ACY+ T AL Sbjct: 61 NALKRADIDPSKIGAVIVGSESGPDAVKPTATIVAEALGLNPSARAADVEFACYAGTAAL 120 Query: 119 HMACALVAKSPERKVLIVASDVARYDLGSSGEPTQGCGAVAILISSQTSILEIEDITGIY 178 A V L++ +D A+Y G + E T G GAVA LI I+ + TG Y Sbjct: 121 QAAIGWVESGMISYGLVIGADTAQYAPGDALEYTAGAGAVAFLIGKNPPIVAEFEGTGSY 180 Query: 179 TNDCMDFWRPNYRRTALVDGKYSTKIYLQSLEAVWHDYQKNKGHDFNDFQYFCYHQPFTR 238 T D DFWRP+ + VDG++S Y + +E Y + G +DF Y +HQP + Sbjct: 181 TTDTPDFWRPDGQPYPYVDGRFSEPAYFKHVENAAKGYMEKTGLSPDDFDYIVFHQPNGK 240 Query: 239 MAEKAHIRLSKIVHQNLSVAEIEKAIGITTLYNRLIGNSYTASLYIALISLLDHSSENLA 298 +KA L E+ + +Y + IGN+YT SL + L SLLD++ L Sbjct: 241 FPKKAAKSL----------GFKEEQVKPGLVYPQRIGNTYTGSLLLGLASLLDNAK--LP 288 Query: 299 GKRVGFFSYGSGCVAEFFSGIVQKNYRKKSHQKRHQHIINSRIPISYQVYCDLHQDIIPS 358 G R+ FSYGSG +E FS V ++ + ++ R +SY Y + Sbjct: 289 GDRILLFSYGSGAGSEAFSITVTDGIEERRKKLAPTVLLEQRNRLSYIDYELYFKRRKKI 348 Query: 359 VEGNCTIPHTTTGPFRLATIQNHKRIYE 386 G +G + L I +H R Y Sbjct: 349 RLGK-DNEFYDSGDYYLEEIDDHFRRYR 375 >gnl|CDD|29414 cd00827, init_cond_enzymes, "initiating" condensing enzymes are a subclass of decarboxylating condensing enzymes, including beta-ketoacyl [ACP] synthase, type III and polyketide synthases, type III, which include chalcone synthase and related enzymes. They are characterized by the utlization of CoA substrate primers, as well as the nature of their active site residues.. Length = 324 Score = 200 bits (510), Expect = 5e-52 Identities = 114/332 (34%), Positives = 152/332 (45%), Gaps = 25/332 (7%) Query: 3 IGIEDISFYTTNQYLDLSVIAEKYRLDVAKFYVGIGQERMSVLNPDEDIVTMAAAAALPI 62 +GIE I Y +D +AE +D K+ GIGQ M+ DED+ TMA AA Sbjct: 2 VGIEAIGAYLPRYRVDNEELAEGLGVDPGKYTTGIGQRHMA--GDDEDVPTMAVEAARRA 59 Query: 63 MQN--QDKNLIDTLFFATESSVDQSKSAAIWLHKLLGLNSSCRVVELKQACYSATCALHM 120 ++ D + I L ATES +D+ KSAA +L +LLGL + +LKQACY T AL + Sbjct: 60 LERAGIDPDDIGLLIVATESPIDKGKSAATYLAELLGLTN-AEAFDLKQACYGGTAALQL 118 Query: 121 ACALVAKSPERKVLIVASDVARY--DLGSSGEPTQGCGAVAILISSQTSILEIEDITGIY 178 A LV P R L+VASD+A Y D GS+ EPT G GA A+L+S IL ++ Sbjct: 119 AANLVESGPWRYALVVASDIASYLLDEGSALEPTLGDGAAAMLVSRNPGILAAGIVSTHS 178 Query: 179 TND---------CMDFWRPNYRRTALVDGKYSTKIYLQSLEAVWHDYQK---NKGHDFND 226 T+D MD P + A + EA K Sbjct: 179 TSDPGYDFSPYPVMDGGYPKPCKLAYAIRLTAEPAGRAVFEAAHKLIAKVVRKALDRAGL 238 Query: 227 FQYFCYHQPFTRMAEKAHIRLSKIVHQNLSVAEIEKAIGITTLYNRLIGNSYTASLYIAL 286 + Y P +K ++K + A + I R +GN Y AS+ + L Sbjct: 239 SEDIDYFVPHQPNGKKILEAVAKKLGGPPEKASQTRWI-----LLRRVGNMYAASILLGL 293 Query: 287 ISLLDHSSENLAGKRVGFFSYGSGCVAEFFSG 318 SLL+ S + AG RV FSYGSG AE F Sbjct: 294 ASLLE-SGKLKAGDRVLLFSYGSGFTAEAFVL 324 >gnl|CDD|36607 KOG1393, KOG1393, KOG1393, Hydroxymethylglutaryl-CoA synthase [Lipid transport and metabolism]. Length = 462 Score = 172 bits (438), Expect = 1e-43 Identities = 110/398 (27%), Positives = 183/398 (45%), Gaps = 53/398 (13%) Query: 3 IGIEDISFYTTNQYLDLSVIAEKYRLDVAKFYVGIGQERMSVLNPDEDIVTMAAAAALPI 62 +GI I Y +QY+D + + + K+ +G+GQ +M + EDI++++ + Sbjct: 12 VGIIAIEIYFPSQYVDQEELEKFDGVSAGKYTIGLGQTQMGFCSDREDIISLSLTVVSRL 71 Query: 63 M--QNQDKNLIDTLFFATESSVDQSKSAAIWLHKLLGL--NSSCRVVELKQACYSATCAL 118 M N D + I L TE+ +D+SKS L +L N+ ++ ACY T AL Sbjct: 72 MERYNIDPDSIGRLEVGTETIIDKSKSVKTVLMQLFEESGNTDIEGIDTTNACYGGTAAL 131 Query: 119 HMACALVAKSP--ERKVLIVASDVARYDLGSSGEPTQGCGAVAILISSQTSILEIEDITG 176 A + S R ++V D+A Y G++ PT G GAVA+LI I+ + Sbjct: 132 FNAVNWIESSSWDGRYAIVVCGDIAVYAKGNA-RPTGGAGAVAMLIGPNAPIIFERGLRA 190 Query: 177 IYTNDCMDFWRPNY-RRTALVDGKYSTKIYLQSLEAVWHDY--------QKNKGHDFNDF 227 + DF++P+ +VDGK S + YL +L+ + Y QK+ + N F Sbjct: 191 THMQHAYDFYKPDLLSEYPVVDGKLSIQCYLSALDRCYTVYRKKIAAQWQKSGSDNLNSF 250 Query: 228 QYFCYHQPFTRMAEKAHIRL--------------------SKIVHQNLSVAEIEKA---- 263 Y +H P+ ++ +K+ RL + + ++ + ++EKA Sbjct: 251 DYIIFHSPYCKLVQKSLARLLYNDFLLNPSQLPESAGLESTLKLDESYTDRDLEKAFIKI 310 Query: 264 --------IGITTLYNRLIGNSYTASLYIALISLLD-HSSENLAGKRVGFFSYGSGCVAE 314 + L R GN YTASLY +L SLL ++ LAGKR+G FSYGSG A Sbjct: 311 SKPIFKKKTKPSLLAPRRNGNMYTASLYASLASLLSAVPADELAGKRIGMFSYGSGLAAS 370 Query: 315 FFSGIVQKNY----RKKSHQKRHQHIINSRIPISYQVY 348 FS V ++ + + + ++SR ++ + + Sbjct: 371 MFSLRVTQDATPLDKIIASLTDLKKRLDSRKCVTPEEF 408 >gnl|CDD|110176 pfam01154, HMG_CoA_synt_N, Hydroxymethylglutaryl-coenzyme A synthase N terminal. Length = 171 Score = 84.9 bits (210), Expect = 3e-17 Identities = 56/167 (33%), Positives = 85/167 (50%), Gaps = 9/167 (5%) Query: 3 IGIEDISFYTTNQYLDLSVIAEKY-RLDVAKFYVGIGQERMSVLNPDEDIVTMAAAAALP 61 +GI + Y +QY+D + + EKY + K+ +G+GQ +M EDI ++ Sbjct: 2 VGIVALEIYFPSQYVDQAEL-EKYDGVSAGKYTIGLGQTKMGFCTDREDINSLCLTVVQK 60 Query: 62 IMQ--NQDKNLIDTLFFATESSVDQSKSAAIWLHKLLG--LNSSCRVVELKQACYSATCA 117 +M+ N D + I L TE+ +D+SKS L +L N+ ++ ACY T A Sbjct: 61 LMERNNLDYDCIGRLEVGTETIIDKSKSVKTVLMQLFEESGNTDIEGIDTTNACYGGTAA 120 Query: 118 LHMACALVAKSP--ERKVLIVASDVARYDLGSSGEPTQGCGAVAILI 162 L A + S R L+VA D+A Y G++ PT G GAVA+LI Sbjct: 121 LFNALNWIESSSWDGRYALVVAGDIAIYAKGNA-RPTGGAGAVAMLI 166 >gnl|CDD|29417 cd00830, KAS_III, Ketoacyl-acyl carrier protein synthase III (KASIII) initiates the elongation in type II fatty acid synthase systems. It is found in bacteria and plants. Elongation of fatty acids in the type II systems occurs by Claisen condensation of malonyl-acyl carrier protein (ACP) with acyl-ACP. KASIII initiates this process by specifically using acetyl-CoA over acyl-CoA.. Length = 320 Score = 66.7 bits (163), Expect = 1e-11 Identities = 58/295 (19%), Positives = 107/295 (36%), Gaps = 35/295 (11%) Query: 35 VGIGQERMSVLNPDEDIVTMAAAAALPIMQNQ--DKNLIDTLFFATESSVDQSKSAAIWL 92 GI + R++ +P E +A AA +++ D + ID + AT + + A + Sbjct: 36 TGIRERRIA--DPGETTSDLAVEAAKKALEDAGIDADDIDLIIVATSTPDYLFPATACLV 93 Query: 93 HKLLGLNSSCRVVELKQACYSATCALHMACALVAKSPERKVLIVASDVAR---YDLGSSG 149 LG + ++ AC L A L+ + VL+V ++ S Sbjct: 94 QARLGA-KNAAAFDINAACSGFLYGLSTAAGLIRSGGAKNVLVVGAETLSRILDWTDRST 152 Query: 150 EPTQGCGAVAILISSQTSILEI-----------EDITGIYTNDCMDFWRPNYRRTALV-- 196 G GA A+++ + I D+ I + + Sbjct: 153 AVLFGDGAGAVVLEATEEDPGILDSVLGSDGSGADLLTIPAGGSRSPFEDAEGGDPYLVM 212 Query: 197 DGKYSTKIYLQSLEAVWHDYQKNKGHDFNDFQYFCYHQPFTRMAEKAHIRLSKIVHQNLS 256 DG+ K ++ + + + G +D +F HQ A++R+ + V + L Sbjct: 213 DGREVFKFAVRLMPESIEEALEKAGLTPDDIDWFVPHQ--------ANLRIIEAVAKRLG 264 Query: 257 VAEIEKAIGITTLYNRLIGNSYTASLYIALISLLDHSSENLAGKRVGFFSYGSGC 311 + E EK + Y GN+ AS+ +AL + G V +G+G Sbjct: 265 LPE-EKVVVNLDRY----GNTSAASIPLAL-DEAIEEGKLKKGDLVLLLGFGAGL 313 >gnl|CDD|30680 COG0332, FabH, 3-oxoacyl-[acyl-carrier-protein]. Length = 323 Score = 58.3 bits (141), Expect = 4e-09 Identities = 59/297 (19%), Positives = 106/297 (35%), Gaps = 45/297 (15%) Query: 36 GIGQERMSVLNPDEDIVTMAAAAALPIMQNQ--DKNLIDTLFFATESSVDQSKSAAIWLH 93 GI + R++ DE +A AA +++ + ID + AT + S A + Sbjct: 39 GIRERRIA--ADDETTSDLAVEAARKALEDAGISPDDIDLIIVATSTPDHLFPSTACLVQ 96 Query: 94 KLLGLNSSCRVVELKQACYSATCALHMACALVAKSPERKVLIVASDVA--RYDLGS-SGE 150 LGL +L+ AC AL +A L+ + VL+V ++ D Sbjct: 97 ARLGLGG-APAFDLQAACSGFLYALSVADGLIRSGGYKNVLVVGAETLSRILDWTDRDTC 155 Query: 151 PTQGCGAVAILISS---QTSILEIEDITGIYTNDCMDFWRPNYRRTALVDGKYSTKIYLQ 207 G GA A+++ + IL+ + + D + + P + + + Sbjct: 156 VLFGDGAGAVVLEATEDDNGILDTDLGSDGSQGDLL--YLPGGGSATPKEESGGGLLVMD 213 Query: 208 ----------SLEAVWHDYQKNKGHDFNDFQYFCYHQPFTRM----AEKAHIRLSKIVHQ 253 ++ + + G D +F HQ R+ A+K I K+V Sbjct: 214 GREVFKFAVRAMPKAIEEVLEKAGLTPEDIDWFVPHQANLRIIEAIAKKLGIPEEKVV-- 271 Query: 254 NLSVAEIEKAIGITTLYNRLIGNSYTASLYIALISLLDHSSENLAGKRVGFFSYGSG 310 ++V + GN+ AS+ +AL L G V ++G G Sbjct: 272 -VTVDK--------------YGNTSAASIPLALDEALREGRIK-PGDLVLLEAFGGG 312 >gnl|CDD|109260 pfam00195, Chal_sti_synt_N, Chalcone and stilbene synthases, N-terminal domain. The C-terminal domain of Chalcone synthase is reported to be structurally similar to domains in thiolase and beta-ketoacyl synthase. The differences in activity are accounted for by differences in this N-terminal domain. Length = 228 Score = 43.1 bits (102), Expect = 1e-04 Identities = 34/104 (32%), Positives = 50/104 (48%), Gaps = 9/104 (8%) Query: 45 LNPDEDIVT-----MAAAAALPIMQ--NQDKNLIDTLFFATESSVDQSKSAAIWLHKLLG 97 L+ +DI + AAL ++ Q K+ I L F T S VD A L KLLG Sbjct: 91 LDARQDIAVVEVPKLGKEAALKAIKEWGQPKSKITHLVFCTTSGVDMP-GADYQLTKLLG 149 Query: 98 LNSSC-RVVELKQACYSATCALHMACALVAKSPERKVLIVASDV 140 L S RV+ +Q C++ L +A L + +VL+V S++ Sbjct: 150 LRPSVKRVMLYQQGCFAGGTVLRLAKDLAENNKGARVLVVCSEI 193 >gnl|CDD|29418 cd00831, CHS_like, Chalcone and stilbene synthases; plant-specific polyketide synthases (PKS) and related enzymes, also called type III PKSs. PKS generate an array of different products, dependent on the nature of the starter molecule. They share a common chemical strategy, after the starter molecule is loaded onto the active site cysteine, a carboxylative condensation reation extends the polyketide chain. Plant-specific PKS are dimeric iterative PKSs, using coenzyme A esters to deliver substrate to the active site, but they differ in the choice of starter molecule and the number of condensation reactions.. Length = 361 Score = 42.5 bits (100), Expect = 2e-04 Identities = 62/322 (19%), Positives = 108/322 (33%), Gaps = 39/322 (12%) Query: 15 QYLDLSVIAEKYRLDVAKFYVGIGQERMSVLNPDEDIVTMAAAAALPIMQN--QDKNLID 72 +YL L E Y ER + E+ +A AA + + ID Sbjct: 53 RYLVLPGGEETYAPR--PEMSPSLDERNDI--ALEEARELAEEAARGALDEAGLRPSDID 108 Query: 73 TLFFATESSVDQSKSAAIWLHKLLGLNSSCRVVELKQ-ACYSATCALHMACALVAKSPER 131 L T + + S L LGL + L C + AL +A L+ +P Sbjct: 109 HLVVNTSTGNP-TPSLDAMLINRLGLRPDVKRYNLGGMGCSAGAIALDLAKDLLEANPGA 167 Query: 132 KVLIVASD-----VARYDLGSSGEPTQ--GCGAVAILISS-------QTSILEIEDI-TG 176 +VL+V+++ D S G GA A+L+S+ + + E+ + Sbjct: 168 RVLVVSTELCSLWYRGPDHRSMLVGNALFGDGAAAVLLSNDPRDRRRERPLFELVRAAST 227 Query: 177 IY--TNDCMDFWRPNYRRTALVDGKYSTKIYLQSLEAVWHDYQK--NKGHDFNDFQYFCY 232 + + D M W V + ++ ++LE V G F ++C Sbjct: 228 LLPDSEDAMG-WHLGEEGLTFVLSRDVPRLVEKNLERVLRKLLARLGIGLFKLAFDHWCV 286 Query: 233 HQPFTRMAEKAHIRLSKIVHQNLSVAEIEKAIGITTLYNRLIGNSYTASLYIALISLLDH 292 H + + V + L ++ + L GN ++S+ L + Sbjct: 287 HPGGRAVLDA--------VEKALGLSPEDLEASRMVLRR--YGNMSSSSVLYVLAYMEAK 336 Query: 293 SSENLAGKRVGFFSYGSGCVAE 314 G R ++G G E Sbjct: 337 GRVK-RGDRGLLIAFGPGFTCE 357 >gnl|CDD|73194 cd00327, cond_enzymes, Condensing enzymes; Family of enzymes that catalyze a (decarboxylating or non-decarboxylating) Claisen-like condensation reaction. Members are share strong structural similarity, and are involved in the synthesis and degradation of fatty acids, and the production of polyketides, a diverse group of natural products.. Length = 254 Score = 36.1 bits (83), Expect = 0.018 Identities = 22/97 (22%), Positives = 31/97 (31%), Gaps = 10/97 (10%) Query: 68 KNLIDTLFFATESSVDQSKSAAIWLHKLLGLNSSCRVVELKQACYSATCALHMACALVAK 127 K I + T + AA L LG+ S + QAC + AL +A V Sbjct: 26 KGPIVGVIVGTTGGSGEFSGAAGQLAYHLGI-SGGPAYSVNQACATGLTALALAVQQVQN 84 Query: 128 SPERKVLIVASDVARYDLGSSGEPTQGCGAVAILISS 164 VL S+ A A++ S Sbjct: 85 GKADIVLAGGSEEF---------VFGDGAAAAVVESE 112 >gnl|CDD|73238 cd00825, decarbox_cond_enzymes, decarboxylating condensing enzymes; Family of enzymes that catalyze the formation of a new carbon-carbon bond by a decarboxylating Claisen-like condensation reaction. Members are involved in the synthesis of fatty acids and polyketides, a diverse group of natural products. Both pathways are an iterative series of additions of small carbon units, usually acetate, to a nascent acyl group. There are 2 classes of decarboxylating condensing enzymes, which can be distinguished by sequence similarity, type of active site residues and type of primer units (acetyl CoA or acyl carrier protein (ACP) linked units).. Length = 332 Score = 32.4 bits (73), Expect = 0.21 Identities = 22/127 (17%), Positives = 36/127 (28%), Gaps = 29/127 (22%) Query: 35 VGIGQERMSVLNPDEDIVTMAAAAALPIMQNQDKNLIDTLFFATESSVD---QSKSAAIW 91 +G + + I A A L + KN I + T + A+ Sbjct: 7 LGSYVSILGFEAAERAI----ADAGLS--REYQKNPIVGVVVGTGGGSPRFQVFGADAMR 60 Query: 92 ------------------LHKLLGLNSSCRVVELKQACYSATCALHMACALVAKSPERKV 133 + LG++ V AC + AL +A V + V Sbjct: 61 AVGPYVVTKAMFPGASGQIATPLGIHGPAYDV--SAACAGSLHALSLAADAVQNGKQDIV 118 Query: 134 LIVASDV 140 L S+ Sbjct: 119 LAGGSEE 125 >gnl|CDD|29416 cd00829, SCP-x_thiolase, Thiolase domain associated with sterol carrier protein (SCP)-x isoform and related proteins; SCP-2 has multiple roles in intracellular lipid circulation and metabolism. The N-terminal presequence in the SCP-x isoform represents a peroxisomal 3-ketacyl-Coa thiolase specific for branched-chain acyl CoAs, which is proteolytically cleaved from the sterol carrier protein.. Length = 375 Score = 31.4 bits (71), Expect = 0.42 Identities = 26/108 (24%), Positives = 40/108 (37%), Gaps = 6/108 (5%) Query: 35 VGIGQERMSVLNPDEDIVTMAAAAALPIMQNQ--DKNLIDTLFFAT-ESSVDQSKSAAIW 91 VG+G D + +AA AA + + + ID + QS A+ Sbjct: 1 VGVGMTPFGRR-SDRSPLELAAEAARAALDDAGLEPADIDAVVVGNAAGGRFQSFPGALI 59 Query: 92 LHKLLGLNSSCRVVELKQACYSATCALHMACALVAKSPERKVLIVASD 139 L L VE A S + A+ A A +A VL+V ++ Sbjct: 60 AEYLGLLGKPATRVEA--AGASGSAAVRAAAAAIASGLADVVLVVGAE 105 >gnl|CDD|146146 pfam03358, FMN_red, NADPH-dependent FMN reductase. Length = 147 Score = 30.6 bits (70), Expect = 0.85 Identities = 13/42 (30%), Positives = 15/42 (35%), Gaps = 9/42 (21%) Query: 270 YNRLIGNSYTASLYIALISLLDHSSEN-LAGKRVGFFSYGSG 310 YN S AL + +D S L GK V S G Sbjct: 79 YNG----SVPG----ALKNAIDWLSRKELRGKPVAIVSVSGG 112 >gnl|CDD|33230 COG3424, BcsA, Predicted naringenin-chalcone synthase [Secondary metabolites biosynthesis, transport, and catabolism]. Length = 356 Score = 29.9 bits (67), Expect = 1.4 Identities = 16/68 (23%), Positives = 26/68 (38%), Gaps = 2/68 (2%) Query: 71 IDTLFFATESSVDQSKSAAIWLHKLLGLNSSC-RVVELKQACYSATCALHMACALVAKSP 129 ID + T + + + S + LGL+ RV C + L A +P Sbjct: 98 IDAVVTVTSTGL-ATPSLDARIVGELGLSPDTRRVPVWGLGCAAGAAGLARARDYCRAAP 156 Query: 130 ERKVLIVA 137 + VL+V Sbjct: 157 DANVLVVC 164 >gnl|CDD|32882 COG3068, COG3068, Uncharacterized protein conserved in bacteria [Function unknown]. Length = 194 Score = 28.8 bits (64), Expect = 3.0 Identities = 22/97 (22%), Positives = 36/97 (37%), Gaps = 20/97 (20%) Query: 188 PNYRRTALVDGKYSTKIYLQSLEAVWH-----------DYQKNKGH----DFNDFQYFCY 232 PNY +IY + L+ VW + Q K +DF + Sbjct: 31 PNYAMFCQQTEFGDGQIYRRILDLVWETLTVKDAKVNFESQLEKLEEAIPSADDFDLYGV 90 Query: 233 HQPFTRMAEKAHIRLSKIVHQNLSVAEIEKAIGITTL 269 + A A + LS+++H LS +E AI ++ Sbjct: 91 YP-----AIDACVALSELLHSRLSGETLEHAIEVSKT 122 >gnl|CDD|37176 KOG1965, KOG1965, KOG1965, Sodium/hydrogen exchanger protein [Inorganic ion transport and metabolism]. Length = 575 Score = 28.7 bits (64), Expect = 3.3 Identities = 22/63 (34%), Positives = 29/63 (46%), Gaps = 6/63 (9%) Query: 263 AIG-ITTLYNRLIGNSYTASLYIALISLLDHSSENLAGKRVGFFSYGSGCVAEFFSGIVQ 321 AIG I+ L + + T SL AL+ L+ + S LA SG V F GIV Sbjct: 248 AIGLISALVLKFLYLRRTPSLESALMLLMSYLSYLLAE----GCGL-SGIVTVLFCGIVM 302 Query: 322 KNY 324 +Y Sbjct: 303 SHY 305 >gnl|CDD|145954 pfam03078, ATHILA, ATHILA ORF-1 family. ATHILA is a group of Arabidopsis thaliana retrotransposons belonging to the Ty3/gypsy family of the long terminal repeat (LTR) class of eukaryotic retrotransposons. The central region of ATHILA retrotransposons contains two or three open reading frames (ORFs). This family represents the ORF1 product. The function of ORF1 is unknown. Length = 456 Score = 28.4 bits (63), Expect = 3.6 Identities = 20/87 (22%), Positives = 37/87 (42%), Gaps = 5/87 (5%) Query: 168 ILEIEDITGIYTNDCMDFWRPNYRRTALVDGKYSTKIYLQSLEAVWHDYQKNKGHDFNDF 227 +L ++IT I D +DF A D +++ + + ++ D + F+ Sbjct: 339 LLPCKEITTIREGDNIDFKPSKEILGATDDIEHTEEATEEDVDEEDEDDEAELE-KFDTS 397 Query: 228 QY-FCYHQPFTRMAE---KAHIRLSKI 250 Y F H P R ++ +AH SK+ Sbjct: 398 MYHFEEHVPPARESKSLSEAHKNNSKL 424 >gnl|CDD|164495 CHL00091, apcE, phycobillisome linker protein. Length = 877 Score = 27.9 bits (62), Expect = 4.6 Identities = 15/47 (31%), Positives = 26/47 (55%), Gaps = 5/47 (10%) Query: 231 CYHQPFTRMAEKAH-IRL----SKIVHQNLSVAEIEKAIGITTLYNR 272 CY Q F R KA+ + L SK+ + +S+ E +A+G + +Y + Sbjct: 279 CYRQIFERDIAKAYGLSLSELESKVKNGQISIKEFIRALGKSEIYRK 325 >gnl|CDD|144198 pfam00517, GP41, Retroviral envelope protein. This family includes envelope protein from a variety of retroviruses. It includes the GP41 subunit of the envelope protein complex from human and simian immunodeficiency viruses (HIV and SIV) which mediate membrane fusion during viral entry. The family also includes bovine immunodeficiency virus, feline immunodeficiency virus and Equine infectious anaemia (EIAV). The family also includes the Gp36 protein from mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). Length = 204 Score = 28.2 bits (63), Expect = 4.6 Identities = 10/43 (23%), Positives = 18/43 (41%) Query: 313 AEFFSGIVQKNYRKKSHQKRHQHIINSRIPISYQVYCDLHQDI 355 A+ +GIVQ + Q++ ++ RI V L + Sbjct: 13 AQVLAGIVQNQTNLLNAQEQIDKLLQLRIWGLKNVVLVLGARV 55 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.321 0.134 0.399 Gapped Lambda K H 0.267 0.0852 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 4,714,560 Number of extensions: 241920 Number of successful extensions: 501 Number of sequences better than 10.0: 1 Number of HSP's gapped: 481 Number of HSP's successfully gapped: 22 Length of query: 396 Length of database: 6,263,737 Length adjustment: 96 Effective length of query: 300 Effective length of database: 4,189,273 Effective search space: 1256781900 Effective search space used: 1256781900 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 58 (25.9 bits)