RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254780883|ref|YP_003065296.1| hypothetical protein CLIBASIA_03900 [Candidatus Liberibacter asiaticus str. psy62] (139 letters) >gnl|CDD|145133 pfam01809, DUF37, Domain of unknown function DUF37. This domain is found in short (70 amino acid) hypothetical proteins from various bacteria. The domain contains three conserved cysteine residues. A member from Aeromonas hydrophila has been found to have hemolytic activity (unpublished). Length = 68 Score = 109 bits (275), Expect = 3e-25 Identities = 33/67 (49%), Positives = 44/67 (65%) Query: 56 RLIGIGIIRIYQLIFSNVFGNSCRYLPTCSEYGYEAIARYGLWIGSWLTLLRLIKCNPLG 115 + + IG+IR YQ S + G SCR+ PTCSEY EAI R+G G WL L R+++C+P Sbjct: 2 KRLLIGLIRFYQRFISPLLGPSCRFYPTCSEYAIEAIERHGALKGGWLALKRILRCHPFH 61 Query: 116 SDGFDPV 122 G+DPV Sbjct: 62 PGGYDPV 68 >gnl|CDD|31102 COG0759, COG0759, Uncharacterized conserved protein [Function unknown]. Length = 92 Score = 99.3 bits (247), Expect = 3e-22 Identities = 33/73 (45%), Positives = 47/73 (64%) Query: 58 IGIGIIRIYQLIFSNVFGNSCRYLPTCSEYGYEAIARYGLWIGSWLTLLRLIKCNPLGSD 117 + I +IR YQ S + SCR+ PTCSEY EA+ ++GL G WLTL R+++C+P Sbjct: 9 LLIFLIRFYQRFISPLLPPSCRFYPTCSEYAIEALKKHGLLKGLWLTLKRILRCHPFHKG 68 Query: 118 GFDPVPDDLPPKK 130 G+DPVP+ L + Sbjct: 69 GYDPVPERLSLRS 81 >gnl|CDD|36398 KOG1184, KOG1184, KOG1184, Thiamine pyrophosphate-requiring enzyme [Amino acid transport and metabolism, Coenzyme transport and metabolism]. Length = 561 Score = 27.9 bits (62), Expect = 0.96 Identities = 11/26 (42%), Positives = 16/26 (61%) Query: 113 PLGSDGFDPVPDDLPPKKNNPKLLSK 138 P+ + G PVP DL PK +N + L + Sbjct: 175 PVPAFGLLPVPLDLSPKPSNKEGLEE 200 >gnl|CDD|144107 pfam00389, 2-Hacid_dh, D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain. Length = 313 Score = 26.9 bits (60), Expect = 2.0 Identities = 11/42 (26%), Positives = 16/42 (38%), Gaps = 7/42 (16%) Query: 42 KSRNWNGKFP-------KTLGRLIGIGIIRIYQLIFSNVFGN 76 ++ +W P KTLG + G GI I I + Sbjct: 121 RAGDWKKGGPIGLELRGKTLGVIGGGGIGGIGAAIAKALGMG 162 >gnl|CDD|38830 KOG3624, KOG3624, KOG3624, M13 family peptidase [Amino acid transport and metabolism]. Length = 687 Score = 26.3 bits (57), Expect = 2.8 Identities = 18/96 (18%), Positives = 27/96 (28%), Gaps = 4/96 (4%) Query: 1 MYYECKDSTSSDRYLTTFSQSSYALPIQKKGS-PLLLKNNTPKSRNWNGKFPKTLGRLIG 59 Y C D+ + + + IQ G PLL N N N L R Sbjct: 122 FYESCLDAKALESSG---ALQLLFRIIQSIGGWPLLEGNWDESKFNLNEMLANLLRRYGL 178 Query: 60 IGIIRIYQLIFSNVFGNSCRYLPTCSEYGYEAIARY 95 + + + LP+ S+Y Sbjct: 179 TTLFLLEVALDYKNSSQPGLILPSRSKYLATDSDSE 214 >gnl|CDD|144198 pfam00517, GP41, Retroviral envelope protein. This family includes envelope protein from a variety of retroviruses. It includes the GP41 subunit of the envelope protein complex from human and simian immunodeficiency viruses (HIV and SIV) which mediate membrane fusion during viral entry. The family also includes bovine immunodeficiency virus, feline immunodeficiency virus and Equine infectious anaemia (EIAV). The family also includes the Gp36 protein from mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). Length = 204 Score = 25.8 bits (57), Expect = 3.8 Identities = 8/39 (20%), Positives = 14/39 (35%) Query: 44 RNWNGKFPKTLGRLIGIGIIRIYQLIFSNVFGNSCRYLP 82 +W + L+ I ++RI + N Y P Sbjct: 151 TSWISYIKIGILILLIIIVLRILPGVLRNCRRVRQGYSP 189 >gnl|CDD|35696 KOG0475, KOG0475, KOG0475, Cl- channel CLC-3 and related proteins (CLC superfamily) [Inorganic ion transport and metabolism]. Length = 696 Score = 25.3 bits (55), Expect = 5.5 Identities = 10/26 (38%), Positives = 15/26 (57%) Query: 53 TLGRLIGIGIIRIYQLIFSNVFGNSC 78 GR +GI + ++ Q N+FG SC Sbjct: 435 IAGRAVGIIVEQLAQNPDFNLFGLSC 460 >gnl|CDD|33390 COG3590, PepO, Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]. Length = 654 Score = 24.5 bits (53), Expect = 9.6 Identities = 10/25 (40%), Positives = 12/25 (48%) Query: 2 YYECKDSTSSDRYLTTFSQSSYALP 26 + D S RY+ FSQS LP Sbjct: 136 FSVSPDFKDSTRYVLYFSQSGLGLP 160 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.319 0.140 0.443 Gapped Lambda K H 0.267 0.0748 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 1,860,658 Number of extensions: 94816 Number of successful extensions: 172 Number of sequences better than 10.0: 1 Number of HSP's gapped: 172 Number of HSP's successfully gapped: 12 Length of query: 139 Length of database: 6,263,737 Length adjustment: 84 Effective length of query: 55 Effective length of database: 4,448,581 Effective search space: 244671955 Effective search space used: 244671955 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 52 (23.8 bits)