RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= 537021.9.peg.473_1 (165 letters) >gnl|CDD|32462 COG2308, COG2308, Uncharacterized conserved protein [Function unknown]. Length = 488 Score = 33.7 bits (77), Expect = 0.022 Identities = 26/84 (30%), Positives = 35/84 (41%), Gaps = 19/84 (22%) Query: 28 LPRNISSDECFPLAEAGITEKQDVLNRLSACTPDNYTKQLI--------ELCEKNHDYKM 79 +PR IS+ E L EAG+ ++ LN A D Y Q I EL N + Sbjct: 77 IPRIISAAEWARL-EAGLVQRVRALNAFLA---DIYGDQRILRDGVIPPELVASNPGFLR 132 Query: 80 VEML------QRFLHVVSFELGRL 97 EM+ +LHV +L R Sbjct: 133 -EMVGIRPPGGHYLHVCGTDLIRD 155 >gnl|CDD|143967 pfam00210, Ferritin, Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins. Length = 142 Score = 28.8 bits (65), Expect = 0.70 Identities = 10/24 (41%), Positives = 15/24 (62%) Query: 65 KQLIELCEKNHDYKMVEMLQRFLH 88 ++LIEL E+ DY + L+ FL Sbjct: 100 RELIELAEEEGDYATADFLEWFLE 123 >gnl|CDD|36882 KOG1669, KOG1669, KOG1669, Predicted mRNA cap-binding protein related to eIF-4E [Translation, ribosomal structure and biogenesis]. Length = 208 Score = 28.5 bits (63), Expect = 0.94 Identities = 18/83 (21%), Positives = 31/83 (37%), Gaps = 11/83 (13%) Query: 60 PDNYTKQLIELCEKNHDYKMVEMLQRFL-HVVSFEL--GRLEVSFLKEIPEDFIENLATK 116 +Y K + + ++ VE H+ + + ++ F KE + E+ A Sbjct: 49 KQSYEKNIKSIGT----FESVEQFWYLYHHLKRPDRMDRQFDIHFFKEGIKPVWEDKA-- 102 Query: 117 LKNWTGEDWEIRFSLGRCCEYFE 139 N G W IR G Y+E Sbjct: 103 --NCNGGKWIIRLKKGVSQRYWE 123 >gnl|CDD|146681 pfam04169, DUF404, Domain of unknown function (DUF404). Length = 253 Score = 28.6 bits (65), Expect = 0.97 Identities = 24/82 (29%), Positives = 35/82 (42%), Gaps = 17/82 (20%) Query: 28 LPRNISSDECFPLAEAGITEKQDVLNRLSACTPDNYTKQLI--------ELCEKNHDYK- 78 +PR IS+DE L E G+ ++ LN A D Y +Q I EL N + Sbjct: 49 VPRIISADEWARL-ERGLAQRARALNAFLA---DIYGEQRILKDGVLPPELVYGNPGFLR 104 Query: 79 -MVEM---LQRFLHVVSFELGR 96 M + +LH+ +L R Sbjct: 105 PMRGVKPPGGVYLHIYGIDLVR 126 >gnl|CDD|37944 KOG2733, KOG2733, KOG2733, Uncharacterized membrane protein [Function unknown]. Length = 423 Score = 26.4 bits (58), Expect = 3.5 Identities = 14/53 (26%), Positives = 28/53 (52%), Gaps = 2/53 (3%) Query: 40 LAEAGITEK--QDVLNRLSACTPDNYTKQLIELCEKNHDYKMVEMLQRFLHVV 90 LA AG EK Q+VL ++ T + + +I + + ++ + EM ++ +V Sbjct: 36 LAVAGRNEKKLQEVLEKVGEKTGTDLSSSVILIADSANEASLDEMAKQARVIV 88 >gnl|CDD|29823 cd00778, ProRS_core_arch_euk, Prolyl-tRNA synthetase (ProRS) class II core catalytic domain. ProRS is a homodimer. It is responsible for the attachment of proline to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs in the core domain. This subfamily contains the core domain of ProRS from archaea, the cytoplasm of eukaryotes and some bacteria.. Length = 261 Score = 26.7 bits (59), Expect = 3.7 Identities = 13/51 (25%), Positives = 22/51 (43%) Query: 73 KNHDYKMVEMLQRFLHVVSFELGRLEVSFLKEIPEDFIENLATKLKNWTGE 123 + + Y + E +Q+ L E G V F IPE +E ++ + E Sbjct: 28 RPYGYAIWENIQKILDKEIKETGHENVYFPLLIPESELEKEKEHIEGFAPE 78 >gnl|CDD|36107 KOG0889, KOG0889, KOG0889, Histone acetyltransferase SAGA, TRRAP/TRA1 component, PI-3 kinase superfamily [Signal transduction mechanisms, Chromatin structure and dynamics, Replication, recombination and repair, Cell cycle control, cell division, chromosome partitioning]. Length = 3550 Score = 26.4 bits (58), Expect = 3.8 Identities = 14/66 (21%), Positives = 25/66 (37%), Gaps = 3/66 (4%) Query: 62 NYTKQLIELCEKNHDYKMVEMLQRFLHVV--SFELGRLEVSFLKEIPEDFI-ENLATKLK 118 N + L+E C K+ + K+ E L + L + F + L ++ I E L + Sbjct: 1975 NQIQALLEKCLKSGNLKVTEALNQLLSRLLSIFPNTHESLELLYQVVSKLIDELLTSVTT 2034 Query: 119 NWTGED 124 Sbjct: 2035 VPLQLA 2040 >gnl|CDD|176895 cd08886, RHO_alpha_C_2, C-terminal catalytic domain of the oxygenase alpha subunit of an uncharacterized subgroup of Rieske-type non-heme iron aromatic ring-hydroxylating oxygenases. C-terminal catalytic domain of the oxygenase alpha subunit of a functionally uncharacterized subgroup of the Rieske-type non-heme iron aromatic ring-hydroxylating oxygenase (RHO) family. RHOs, also known as aromatic ring hydroxylating dioxygenases, utilize non-heme Fe(II) to catalyze the addition of hydroxyl groups to the aromatic ring, an initial step in the oxidative degradation of aromatic compounds. RHOs are composed of either two or three protein components, and are comprised of an electron transport chain (ETC) and an oxygenase. The ETC transfers reducing equivalents from the electron donor to the oxygenase component, which in turn transfers electrons to the oxygen molecules. The oxygenase components are oligomers, either (alpha)n or (alpha)n(beta)n. The alpha subunits are the catalytic components and have an N-terminal domain, which binds a Rieske-like 2Fe-2S cluster, and a C-terminal domain which binds the non-heme Fe(II). The Fe(II) is co-ordinated by conserved His and Asp residues. This subfamily belongs to the SRPBCC (START/RHO_alpha_C/PITP/Bet_v1/CoxG/CalC) domain superfamily of proteins that bind hydrophobic ligands. SRPBCC domains have a deep hydrophobic ligand-binding pocket. Length = 182 Score = 25.5 bits (56), Expect = 6.7 Identities = 13/42 (30%), Positives = 21/42 (50%), Gaps = 3/42 (7%) Query: 59 TPDNYTKQLIELCE---KNHDYKMVEMLQRFLHVVSFELGRL 97 + K+LIE + D ++VE +QR L +F GR+ Sbjct: 117 ELTDEEKELIEYYRQVLQPEDLELVESVQRGLKSRAFGQGRI 158 >gnl|CDD|37551 KOG2340, KOG2340, KOG2340, Uncharacterized conserved protein [Function unknown]. Length = 698 Score = 25.3 bits (55), Expect = 8.1 Identities = 8/42 (19%), Positives = 13/42 (30%) Query: 91 SFELGRLEVSFLKEIPEDFIENLATKLKNWTGEDWEIRFSLG 132 + F E+ I +L + K + I SL Sbjct: 128 NESEDPFWSHFDDELGSKHIADLIARTKTTVKLEKSILGSLI 169 >gnl|CDD|144612 pfam01084, Ribosomal_S18, Ribosomal protein S18. Length = 54 Score = 25.1 bits (56), Expect = 9.7 Identities = 8/12 (66%), Positives = 11/12 (91%) Query: 76 DYKMVEMLQRFL 87 DYK VE+L+RF+ Sbjct: 8 DYKDVELLRRFI 19 >gnl|CDD|144566 pfam01025, GrpE, GrpE. Length = 162 Score = 25.2 bits (56), Expect = 9.9 Identities = 19/74 (25%), Positives = 32/74 (43%), Gaps = 3/74 (4%) Query: 41 AEAGITEKQDVLNRLSACTPDNYTKQLIELCEKNHDYKMVEMLQRFLHVV-SFELGRLEV 99 E I E +D L RL A +NY K+ E+ Y + + + L V+ + E L Sbjct: 20 LEEEIEELKDRLLRLLAEF-ENYRKRTEREREEAKKYAIEKFAKDLLPVLDNLERA-LAA 77 Query: 100 SFLKEIPEDFIENL 113 + E + +E + Sbjct: 78 AKENEDVKSLVEGV 91 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.321 0.137 0.414 Gapped Lambda K H 0.267 0.0766 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 2,034,885 Number of extensions: 99261 Number of successful extensions: 270 Number of sequences better than 10.0: 1 Number of HSP's gapped: 270 Number of HSP's successfully gapped: 18 Length of query: 165 Length of database: 6,263,737 Length adjustment: 87 Effective length of query: 78 Effective length of database: 4,383,754 Effective search space: 341932812 Effective search space used: 341932812 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 54 (24.5 bits)