RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254780558|ref|YP_003064971.1| hypothetical protein CLIBASIA_02225 [Candidatus Liberibacter asiaticus str. psy62] (396 letters) >gnl|CDD|32688 COG2861, COG2861, Uncharacterized protein conserved in bacteria [Function unknown]. Length = 250 Score = 184 bits (468), Expect = 4e-47 Identities = 73/229 (31%), Positives = 125/229 (54%), Gaps = 3/229 (1%) Query: 162 KNFCSNASGARIAIVVSGLGISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQ 221 S A G ++AIV+ LG+SQTGTQ AI LP +TLAFA W ++A+ G Sbjct: 20 ARPWSGARGPKLAIVIDDLGLSQTGTQ-AILALPPEVTLAFAPFAPHAREWAQKARNAGH 78 Query: 222 EAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSN 281 E ++ +PM+ F E TL+ + +++L RLR ++ + G+ N+ G+ SN Sbjct: 79 EVLIHMPMEPFSYPKIEPG--TLRPGMSAEEILRRLRKAMNKVPDAVGLNNHMGSRFTSN 136 Query: 282 KESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKL 341 +++ E + + +RGL F D G+ +L +A ++ +P + D++LDD+ + ++L Sbjct: 137 EDAMEKLMEALKERGLYFLDSGTIANSLAGKIAKEIGVPVIKRDVFLDDEDTEAAVLKQL 196 Query: 342 KGLEEIARTTGQAIGVAVAFDESIEVISQWLQQEHVRDVSVVPLSCLAK 390 E++AR G AIG+ ++ V+ QWL + R + +VP+S L Sbjct: 197 DAAEKLARKNGSAIGIGHPHKNTVAVLQQWLDELPARGIELVPVSALLG 245 >gnl|CDD|147083 pfam04748, Polysacc_deac_2, Divergent polysaccharide deacetylase. This family is divergently related to pfam01522 (personal obs:Yeats C). Length = 213 Score = 182 bits (464), Expect = 2e-46 Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 3/216 (1%) Query: 175 IVVSGLGISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDE 234 IV+ LG + T A+ LPA +TLA + A+ G E +L +PM+ Sbjct: 1 IVIDDLGYN-LATDEALLALPAPVTLAILPYAPHAAELAEAARAAGHEVLLHLPMEPLG- 58 Query: 235 SYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAK 294 Y + TL V + +++ RL +L R GV N+ G+ +++ + + +E K Sbjct: 59 -YKDPGPGTLTVGMSAEEIEKRLEAALSRVPYAVGVNNHMGSRFTADRAAMRWVMEELKK 117 Query: 295 RGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGLEEIARTTGQA 354 RGL F D +S R++ LA +L +P D++LD++ IR +L +AR G A Sbjct: 118 RGLFFVDSRTSGRSVAPKLAKELGVPAAKRDVFLDNEQTEAAIRRQLDQAAALARKEGSA 177 Query: 355 IGVAVAFDESIEVISQWLQQEHVRDVSVVPLSCLAK 390 I + E+I + +WL + R + +VP+S L + Sbjct: 178 IAIGHPHPETIAALKEWLPELRARGIELVPVSALLR 213 >gnl|CDD|38804 KOG3598, KOG3598, KOG3598, Thyroid hormone receptor-associated protein complex, subunit TRAP230 [Transcription]. Length = 2220 Score = 30.8 bits (69), Expect = 0.65 Identities = 19/95 (20%), Positives = 33/95 (34%), Gaps = 7/95 (7%) Query: 302 DGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGLEEIARTTGQAIGVAVAF 361 DG R+ + N P +Y D+++ KIREK E T I + Sbjct: 431 DGEEERDKNKDTDVADNEP---EKIYTDEELAEIKIREK----EMKIATLASHIKEKMVS 483 Query: 362 DESIEVISQWLQQEHVRDVSVVPLSCLAKLSSPSS 396 D+ + + + S L L+ + + Sbjct: 484 DKKWKEKTSHAGWRTLDQCSQFALHLLSSMVQYMA 518 >gnl|CDD|39047 KOG3843, KOG3843, KOG3843, Predicted serine hydroxymethyltransferase SLA/LP (autoimmune hepatitis marker in humans) [Translation, ribosomal structure and biogenesis]. Length = 432 Score = 30.5 bits (68), Expect = 0.76 Identities = 25/97 (25%), Positives = 40/97 (41%), Gaps = 7/97 (7%) Query: 146 AKNKVGREDTEVPAMDKNFCSNASGARIAIVVSGL--GISQTGTQRAINLLPANITLAFA 203 A + GR D V ++DKNF GA IA I++ RA ++ + Sbjct: 209 AAAECGRIDAFVQSLDKNFMVPVGGAIIAAFKDNFIQEIAKMYPGRASASPSLDLLITLL 268 Query: 204 SNGNSLDRWMKEAKKKGQEAILQIPMQ--AFDESYNE 238 S G + KE + +E L++ + F E+ E Sbjct: 269 SLGQNA---FKELFGEQKEMFLKLRNKIIKFAEAIGE 302 >gnl|CDD|144597 pfam01062, Bestrophin, Bestrophin. Bestrophin is a 68-kDa basolateral plasma membrane protein expressed in retinal pigment epithelial cells (RPE). It is encoded by the VMD2 gene, which is mutated in Best macular dystrophy, a disease characterized by a depressed light peak in the electrooculogram. VMD2 encodes a 585-amino acid protein with an approximate mass of 68 kDa which has been designated bestrophin. Bestrophin shares homology with the Caenorhabditis elegans RFP gene family, named for the presence of a conserved arginine (R), phenylalanine (F), proline (P), amino acid sequence motif. Bestrophin is a plasma membrane protein, localized to the basolateral surface of RPE cells consistent with a role for bestrophin in the generation or regulation of the EOG light peak. Bestrophin and other RFP family members represent a new class of chloride channels, indicating a direct role for bestrophin in generating the light peak. The VMD2 gene underlying Best disease was shown to represent the first human member of the RFP-TM protein family. More than 97% of the disease-causing mutations are located in the N-terminal RFP-TM domain implying important functional properties. Length = 275 Score = 30.2 bits (69), Expect = 1.1 Identities = 17/58 (29%), Positives = 26/58 (44%), Gaps = 8/58 (13%) Query: 16 RKSFYSQIISRLGLFLLFCTFIVGLSIYILISHAFVGTISEMIPYSVIREIAPIPLTI 73 R S I RL LFLL +V + Y+L F+ P++++ I L+I Sbjct: 8 RGSVLKSIWPRLLLFLLLSALVVVIYRYLLSQKIFL----PAAPFTLL----GIALSI 57 >gnl|CDD|32760 COG2937, PlsB, Glycerol-3-phosphate O-acyltransferase [Lipid metabolism]. Length = 810 Score = 29.9 bits (67), Expect = 1.3 Identities = 21/77 (27%), Positives = 32/77 (41%), Gaps = 13/77 (16%) Query: 261 LRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFA----KRG--LLFFDDGSSPRNLTRVLA 314 RRG +F ++G L S +F+E+ RG L +F +G R R+L Sbjct: 342 FRRGGAFFIRRTFKGNPLYS------TVFREYLGELFSRGYSLEYFIEGGRSRT-GRLLP 394 Query: 315 PKLNLPYMVADLYLDDQ 331 PK + M L + Sbjct: 395 PKTGMLSMTLQAMLRGR 411 >gnl|CDD|30386 COG0037, MesJ, Predicted ATPase of the PP-loop superfamily implicated in cell cycle control [Cell division and chromosome partitioning]. Length = 298 Score = 29.4 bits (65), Expect = 1.9 Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 1/87 (1%) Query: 306 PRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGLEEIARTTGQAIGVAVAFD-ES 364 + A + LPY+ + D + R++IRE+L LEE ++ A E Sbjct: 173 REKEIELYAKEKGLPYIEDESNYDLRYTRNRIREELLPLEEERPGLKFSLARAFELLREL 232 Query: 365 IEVISQWLQQEHVRDVSVVPLSCLAKL 391 ++ + ++ +R ++ L K+ Sbjct: 233 RLLLEKLSDEKLLRCLACGAKPSLGKI 259 >gnl|CDD|37313 KOG2102, KOG2102, KOG2102, Exosomal 3'-5' exoribonuclease complex, subunit Rrp44/Dis3 [Translation, ribosomal structure and biogenesis]. Length = 941 Score = 29.2 bits (65), Expect = 1.9 Identities = 14/42 (33%), Positives = 18/42 (42%) Query: 312 VLAPKLNLPYMVADLYLDDQVDRDKIREKLKGLEEIARTTGQ 353 V+ K L Y A LD D + L+ L EIA+ Q Sbjct: 587 VIRSKAALSYEEAQDILDTPNSSDPLTSSLRLLHEIAKILRQ 628 >gnl|CDD|33662 COG3872, COG3872, Predicted metal-dependent enzyme [General function prediction only]. Length = 318 Score = 29.1 bits (65), Expect = 2.4 Identities = 20/79 (25%), Positives = 35/79 (44%), Gaps = 7/79 (8%) Query: 13 TPKRKSFYSQIISRLGLFLLFCTFIVGLSIYILISHAFVGTISEMIPYSVIREIAPIPLT 72 TP+ S++ R G L T IVG+ +Y L V + + V+ I +P+ Sbjct: 194 TPENVQKQSRLHPRCGTSFLLFTMIVGIFVYSL-----VPFDN--LWLRVVDRIILLPVV 246 Query: 73 IPLNIEDKQSPSKRDNNTV 91 + ++ E + +K N V Sbjct: 247 LGISYEVLKLTNKVRNKPV 265 >gnl|CDD|36416 KOG1202, KOG1202, KOG1202, Animal-type fatty acid synthase and related proteins [Lipid transport and metabolism]. Length = 2376 Score = 28.0 bits (62), Expect = 5.1 Identities = 18/83 (21%), Positives = 34/83 (40%), Gaps = 1/83 (1%) Query: 213 MKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMN 272 + K + + L + AF + D +L+ ++ + L T Y+ +N Sbjct: 1399 FRHLKLEEDKPELPVE-HAFVNTLTRGDLSSLRWIESPLRHAQPTCPGLELCTVYYAPLN 1457 Query: 273 YRGAMLLSNKESAEVIFKEFAKR 295 +R ML S K S + I + A + Sbjct: 1458 FRDIMLASGKLSPDAIPGDLASQ 1480 >gnl|CDD|177158 MTH00095, ND5, NADH dehydrogenase subunit 5; Provisional. Length = 527 Score = 27.2 bits (61), Expect = 8.1 Identities = 9/27 (33%), Positives = 15/27 (55%), Gaps = 1/27 (3%) Query: 25 SRLGL-FLLFCTFIVGLSIYILISHAF 50 S++G L F + + +S L+SHA Sbjct: 261 SQMGFSMLTFGSGLSFVSFIHLLSHAL 287 >gnl|CDD|35730 KOG0510, KOG0510, KOG0510, Ankyrin repeat protein [General function prediction only]. Length = 929 Score = 27.3 bits (60), Expect = 8.8 Identities = 13/41 (31%), Positives = 18/41 (43%) Query: 5 LNHPLRKKTPKRKSFYSQIISRLGLFLLFCTFIVGLSIYIL 45 L HPL K L L++ F+V L+IY+L Sbjct: 488 LEHPLSSSFLFMKWNRYGKYFHLNNLLIYSLFLVSLTIYVL 528 >gnl|CDD|31822 COG1635, THI4, Flavoprotein involved in thiazole biosynthesis [Coenzyme metabolism]. Length = 262 Score = 27.1 bits (60), Expect = 8.9 Identities = 9/21 (42%), Positives = 14/21 (66%) Query: 275 GAMLLSNKESAEVIFKEFAKR 295 G MLLS K++AE I ++ + Sbjct: 242 GGMLLSGKKAAEEILEKLKLK 262 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.317 0.134 0.373 Gapped Lambda K H 0.267 0.0812 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 4,572,319 Number of extensions: 240470 Number of successful extensions: 727 Number of sequences better than 10.0: 1 Number of HSP's gapped: 723 Number of HSP's successfully gapped: 27 Length of query: 396 Length of database: 6,263,737 Length adjustment: 96 Effective length of query: 300 Effective length of database: 4,189,273 Effective search space: 1256781900 Effective search space used: 1256781900 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 58 (26.0 bits)