Query gi|254780253|ref|YP_003064666.1| ribosomal protein L29 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 67 No_of_seqs 100 out of 1117 Neff 5.8 Searched_HMMs 39220 Date Tue May 24 10:00:49 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780253.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK00461 rpmC 50S ribosomal pr 99.9 4.8E-22 1.2E-26 152.2 9.1 65 1-65 1-65 (122) 2 PRK00306 50S ribosomal protein 99.8 3E-19 7.6E-24 135.6 9.2 65 2-66 1-65 (66) 3 PRK01450 rpmC 50S ribosomal pr 99.8 2.4E-19 6.2E-24 136.1 8.1 65 2-66 6-70 (71) 4 COG0255 RpmC Ribosomal protein 99.8 1.6E-18 4E-23 131.3 8.6 64 2-65 1-66 (69) 5 TIGR00012 L29 ribosomal protei 99.7 1.3E-18 3.2E-23 131.9 5.6 56 6-61 1-57 (57) 6 cd00427 Ribosomal_L29_HIP Ribo 99.7 1.4E-17 3.6E-22 125.6 6.5 57 5-61 1-57 (57) 7 pfam00831 Ribosomal_L29 Riboso 99.7 2.2E-17 5.7E-22 124.4 7.2 58 4-61 1-58 (58) 8 CHL00154 rpl29 ribosomal prote 99.5 1.3E-13 3.4E-18 101.9 8.2 63 2-64 4-66 (67) 9 KOG3436 consensus 99.2 2.9E-11 7.5E-16 87.9 7.8 65 1-65 1-68 (123) 10 KOG3331 consensus 98.0 3.3E-05 8.5E-10 51.9 8.2 64 3-66 67-135 (213) 11 pfam06984 MRP-L47 Mitochondria 97.9 7.2E-05 1.8E-09 49.9 7.3 59 3-61 24-87 (87) 12 COG5493 Uncharacterized conser 60.8 15 0.00038 18.2 5.4 54 8-61 44-107 (231) 13 pfam10655 DUF2482 Hypothetical 58.2 8.7 0.00022 19.6 2.5 27 5-31 3-29 (100) 14 PRK10613 hypothetical protein; 36.3 37 0.00094 15.8 3.2 49 11-59 11-73 (74) 15 pfam10769 DUF2594 Protein of u 35.5 38 0.00098 15.7 3.1 49 11-59 11-73 (74) 16 pfam06698 DUF1192 Protein of u 35.0 39 0.00099 15.7 3.0 27 5-31 15-41 (58) 17 smart00540 LEM in nuclear memb 25.1 46 0.0012 15.3 1.8 16 6-21 2-17 (44) 18 pfam08984 DUF1858 Domain of un 24.5 59 0.0015 14.6 3.1 45 9-61 4-48 (59) 19 PRK11449 putative deoxyribonuc 23.5 62 0.0016 14.5 2.6 25 4-28 233-257 (258) 20 pfam03020 LEM LEM domain. The 22.4 51 0.0013 15.0 1.6 16 6-21 2-17 (43) 21 COG1154 Dxs Deoxyxylulose-5-ph 21.7 68 0.0017 14.3 4.2 31 2-32 9-39 (627) 22 TIGR01597 PYST-B Plasmodium yo 20.6 59 0.0015 14.6 1.6 22 9-30 112-133 (274) 23 PRK04351 hypothetical protein; 20.3 43 0.0011 15.4 0.9 26 10-35 1-26 (151) No 1 >PRK00461 rpmC 50S ribosomal protein L29; Reviewed Probab=99.87 E-value=4.8e-22 Score=152.23 Aligned_cols=65 Identities=29% Similarity=0.598 Sum_probs=63.4 Q ss_pred CCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 98889997299999999999999999999999953786546999999999999999999999853 Q gi|254780253|r 1 MLKFKDISVMSIDQLTEKLIQLKKDQMSLRFQKASGQIEKPFRMREVSRDIARIKTMMNSRVFKN 65 (67) Q Consensus 1 mmk~~elr~~s~~eL~~~l~~l~~el~~lrfq~~~gql~np~~ir~~Rk~IARi~Tvl~er~~~~ 65 (67) ||+++|||.+|++||..++.+++.|||+||||.++|||+|||+|+.+|||||||+||++||.+++ T Consensus 1 MMk~~elR~~s~eEL~~~l~~lK~ELFnLRFQ~AtGQLen~~ri~~VRkdIARi~Til~ERkl~~ 65 (122) T PRK00461 1 MMKMKELRKKSVEELIKLVEKLKGELFELRFKLAVGELDQTHKIKEIRKTIARILTILTERKLEE 65 (122) T ss_pred CCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99779988479999999999999999998999986566451589999999999999999998610 No 2 >PRK00306 50S ribosomal protein L29; Reviewed Probab=99.80 E-value=3e-19 Score=135.61 Aligned_cols=65 Identities=48% Similarity=0.776 Sum_probs=62.9 Q ss_pred CCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 88899972999999999999999999999999537865469999999999999999999998532 Q gi|254780253|r 2 LKFKDISVMSIDQLTEKLIQLKKDQMSLRFQKASGQIEKPFRMREVSRDIARIKTMMNSRVFKNN 66 (67) Q Consensus 2 mk~~elr~~s~~eL~~~l~~l~~el~~lrfq~~~gql~np~~ir~~Rk~IARi~Tvl~er~~~~~ 66 (67) |+++|||+||.+||.+++.+++++||+||||+++||++|||+|+.+||+||||+||++|++.... T Consensus 1 Mk~~eir~ls~~eL~~~l~~lk~el~~lR~q~~~gql~n~~~i~~~Rk~IARi~Tvl~ek~~~~~ 65 (66) T PRK00306 1 MKAKELREKSVEELNEKLLELKKELFNLRFQKATGQLENTHRLREVRRDIARIKTVLREKELAAK 65 (66) T ss_pred CCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 98798876899999999999999999999999877876638999989999999999999986544 No 3 >PRK01450 rpmC 50S ribosomal protein L29; Provisional Probab=99.79 E-value=2.4e-19 Score=136.14 Aligned_cols=65 Identities=35% Similarity=0.489 Sum_probs=62.3 Q ss_pred CCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 88899972999999999999999999999999537865469999999999999999999998532 Q gi|254780253|r 2 LKFKDISVMSIDQLTEKLIQLKKDQMSLRFQKASGQIEKPFRMREVSRDIARIKTMMNSRVFKNN 66 (67) Q Consensus 2 mk~~elr~~s~~eL~~~l~~l~~el~~lrfq~~~gql~np~~ir~~Rk~IARi~Tvl~er~~~~~ 66 (67) |.++|||.+|.|||...|..|++|+|+||||+++||++||++++.+||+||||+||++|+..++. T Consensus 6 ~~~selr~Kt~deL~~~L~~lKkE~FNLRFQ~AtGQLeNtaR~r~VrrdIARIkTvL~eka~a~~ 70 (71) T PRK01450 6 LLRSKLSTETIEELYKNLNLLKKELFNLRFQQALGELKNTSRFSLVKKSIARIKTELTKRSNSEE 70 (71) T ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 88999872599999999999999999899998715541579999999999999999998650146 No 4 >COG0255 RpmC Ribosomal protein L29 [Translation, ribosomal structure and biogenesis] Probab=99.77 E-value=1.6e-18 Score=131.29 Aligned_cols=64 Identities=45% Similarity=0.734 Sum_probs=60.6 Q ss_pred CCHH--HHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8889--997299999999999999999999999953786546999999999999999999999853 Q gi|254780253|r 2 LKFK--DISVMSIDQLTEKLIQLKKDQMSLRFQKASGQIEKPFRMREVSRDIARIKTMMNSRVFKN 65 (67) Q Consensus 2 mk~~--elr~~s~~eL~~~l~~l~~el~~lrfq~~~gql~np~~ir~~Rk~IARi~Tvl~er~~~~ 65 (67) |++. |+|+||.+||.+.+.+|++|||+||||.++||++|||+|+.+||+||||+||++|+.... T Consensus 1 M~~~~~elR~~s~eeL~~~l~eLK~ELf~LR~q~a~g~l~n~~~ir~vRr~IARi~Tv~~E~~~~~ 66 (69) T COG0255 1 MKAKRKELREKSVEELEEELRELKKELFNLRFQLATGQLENPHRIREVRRDIARILTVLREKELEA 66 (69) T ss_pred CCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 975389898489999999999999999999999985788882889999999999999999999766 No 5 >TIGR00012 L29 ribosomal protein L29; InterPro: IPR001854 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . Ribosomal protein L29 is one of the proteins from the large ribosomal subunit. L29 belongs to a family of ribosomal proteins of 63 to 138 amino-acid residues which, on the basis of sequence similarities , groups: Red algal L29. Bacterial L29. Mammalian L35 Caenorhabditis elegans L35 (ZK652.4). Yeast L35.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome. Probab=99.75 E-value=1.3e-18 Score=131.87 Aligned_cols=56 Identities=50% Similarity=0.795 Sum_probs=54.4 Q ss_pred HHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCC-CHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9972999999999999999999999999537865-4699999999999999999999 Q gi|254780253|r 6 DISVMSIDQLTEKLIQLKKDQMSLRFQKASGQIE-KPFRMREVSRDIARIKTMMNSR 61 (67) Q Consensus 6 elr~~s~~eL~~~l~~l~~el~~lrfq~~~gql~-np~~ir~~Rk~IARi~Tvl~er 61 (67) |||+||.+||.+++.++++|||+||||+++||+. |||+|+.+||+||||+||++|+ T Consensus 1 elR~~~~~EL~~~l~eLK~ELf~LR~q~~~g~l~~kp~~i~~~rr~IAR~lTv~~e~ 57 (57) T TIGR00012 1 ELREKSKEELAKKLKELKKELFELRFQKATGQLEEKPHRIRQVRRDIARILTVLREK 57 (57) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCC T ss_conf 944689889999999999999989999985586123057878889999999874029 No 6 >cd00427 Ribosomal_L29_HIP Ribosomal L29 protein/HIP. L29 is a protein of the large ribosomal Subunit. A homolog, called heparin/heparan sulfate interacting protein (HIP), has also been identified in mammals. L29 is located on the surface of the large ribosomal subunit, where it participates in forming a protein ring that surrounds the polypeptide exit channel, providing structural support for the ribosome. L29 is involved in forming the translocon binding site, along with L19, L22, L23, L24, and L31e. In addition, L29 and L23 form the interaction site for trigger factor (TF) on the ribosomal surface, adjacent to the exit tunnel. L29 forms numerous interactions with L23 and with the 23S rRNA. In some eukaryotes, L29 is referred to as L35, which is distinct from L35 found in bacteria and some eukaryotes (primarily plastids and mitochondria). The mammalian homolog, HIP, is found on the surface of many tissues and cell lines. It is believed to play a role in cell adhesion and modulat Probab=99.71 E-value=1.4e-17 Score=125.60 Aligned_cols=57 Identities=53% Similarity=0.817 Sum_probs=55.4 Q ss_pred HHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999729999999999999999999999995378654699999999999999999999 Q gi|254780253|r 5 KDISVMSIDQLTEKLIQLKKDQMSLRFQKASGQIEKPFRMREVSRDIARIKTMMNSR 61 (67) Q Consensus 5 ~elr~~s~~eL~~~l~~l~~el~~lrfq~~~gql~np~~ir~~Rk~IARi~Tvl~er 61 (67) +|||+||++||.+++.++++|||+||||+++||++|||+|+.+||+||||+||++|+ T Consensus 1 kelr~~s~~eL~~~l~elk~el~~Lr~q~~~gql~~~~~ik~~Rr~IARi~Tvl~ek 57 (57) T cd00427 1 KELREKSDEELQEKLDELKKELFNLRFQKATGQLENPHRIRKVRKDIARIKTVLNEK 57 (57) T ss_pred CCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCC T ss_conf 976779999999999999999999999998779865289999999999999998529 No 7 >pfam00831 Ribosomal_L29 Ribosomal L29 protein. Probab=99.71 E-value=2.2e-17 Score=124.43 Aligned_cols=58 Identities=52% Similarity=0.799 Sum_probs=56.3 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8999729999999999999999999999995378654699999999999999999999 Q gi|254780253|r 4 FKDISVMSIDQLTEKLIQLKKDQMSLRFQKASGQIEKPFRMREVSRDIARIKTMMNSR 61 (67) Q Consensus 4 ~~elr~~s~~eL~~~l~~l~~el~~lrfq~~~gql~np~~ir~~Rk~IARi~Tvl~er 61 (67) ++|||+||.+||.+.+.+++++||+|||++++||++|||+|+.+||+||||+||++|+ T Consensus 1 ~~elr~~s~~eL~~~l~el~~el~~Lr~q~~~gql~n~~~ir~~Rk~IARi~Tvl~ek 58 (58) T pfam00831 1 AKELRELSDEELKEKLKELKKELFNLRFQKATGQLENPHRIKEVRRDIARILTVLRER 58 (58) T ss_pred CHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCC T ss_conf 9278879999999999999999999999998778766189999999999999998629 No 8 >CHL00154 rpl29 ribosomal protein L29; Validated Probab=99.49 E-value=1.3e-13 Score=101.91 Aligned_cols=63 Identities=29% Similarity=0.427 Sum_probs=60.3 Q ss_pred CCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 888999729999999999999999999999995378654699999999999999999999985 Q gi|254780253|r 2 LKFKDISVMSIDQLTEKLIQLKKDQMSLRFQKASGQIEKPFRMREVSRDIARIKTMMNSRVFK 64 (67) Q Consensus 2 mk~~elr~~s~~eL~~~l~~l~~el~~lrfq~~~gql~np~~ir~~Rk~IARi~Tvl~er~~~ 64 (67) -++.|+|++|++||.+++.++++|||+||||++|+|+.+||+|+..|+.||+++||.+||... T Consensus 4 pki~e~r~Lsd~~l~eeI~~~KkeLF~LR~qqaTRq~~k~H~Fkh~rhrLAQLLTv~~eR~~s 66 (67) T CHL00154 4 PKIKDVQDLTDSDLQEEILKLKKELFDLRLKKATRQDIKPHLFKHKKHRLAQLLTIKKSKSKS 66 (67) T ss_pred CCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 746988758999999999999999999998999860025258889999999999999998714 No 9 >KOG3436 consensus Probab=99.24 E-value=2.9e-11 Score=87.94 Aligned_cols=65 Identities=29% Similarity=0.396 Sum_probs=59.6 Q ss_pred CC--CHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 98--88999729999999999999999999999995378-6546999999999999999999999853 Q gi|254780253|r 1 ML--KFKDISVMSIDQLTEKLIQLKKDQMSLRFQKASGQ-IEKPFRMREVSRDIARIKTMMNSRVFKN 65 (67) Q Consensus 1 mm--k~~elr~~s~~eL~~~l~~l~~el~~lrfq~~~gq-l~np~~ir~~Rk~IARi~Tvl~er~~~~ 65 (67) || ++.|||..+.++|.+.+.|++.||+.||.++.+|. -.+.+.|+.+||+|||++||++|+..++ T Consensus 1 M~kik~~eLr~~~ke~L~~ql~dlK~ELa~LRv~K~tgg~~~klskik~vrKsiArvLTVine~~k~~ 68 (123) T KOG3436 1 MAKIKARELRGKSKEQLLKQLDDLKVELAQLRVAKVTGGAASKLSKIKVVRKSIARVLTVINEKQKEE 68 (123) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 95326888777579999999999999999998998616512345899999997889987642464899 No 10 >KOG3331 consensus Probab=98.03 E-value=3.3e-05 Score=51.86 Aligned_cols=64 Identities=19% Similarity=0.316 Sum_probs=55.4 Q ss_pred CHHHHHHCCHHHHHHHHHHHHHHHHHHH-----HHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 8899972999999999999999999999-----999537865469999999999999999999998532 Q gi|254780253|r 3 KFKDISVMSIDQLTEKLIQLKKDQMSLR-----FQKASGQIEKPFRMREVSRDIARIKTMMNSRVFKNN 66 (67) Q Consensus 3 k~~elr~~s~~eL~~~l~~l~~el~~lr-----fq~~~gql~np~~ir~~Rk~IARi~Tvl~er~~~~~ 66 (67) .+.|||-+|.+||.+.|..+.+|-..|- .+...|+..||.+|-.+|++.+||++|++||..+.. T Consensus 67 ~aeELR~KS~nDLH~LWYvcLkErNmL~T~~~~~k~~~~~~PnpERi~kV~~TM~~I~~Vl~ER~~Ay~ 135 (213) T KOG3331 67 SAEELRLKSFNDLHKLWYVCLKERNMLATMRHELKNIVGSFPNPERIDKVRTTMWRIEHVLNERNLAYS 135 (213) T ss_pred CHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 638876010357999999999988799999999998874389878889999999999999999888787 No 11 >pfam06984 MRP-L47 Mitochondrial 39-S ribosomal protein L47 (MRP-L47). This family represents the N-terminal region (approximately 8 residues) of the eukaryotic mitochondrial 39-S ribosomal protein L47 (MRP-L47). Mitochondrial ribosomal proteins (MRPs) are the counterparts of the cytoplasmic ribosomal proteins, in that they fulfil similar functions in protein biosynthesis. However, they are distinct in number, features and primary structure. Probab=97.86 E-value=7.2e-05 Score=49.86 Aligned_cols=59 Identities=20% Similarity=0.293 Sum_probs=49.6 Q ss_pred CHHHHHHCCHHHHHHHHHHHHHHHHHHHH-----HHHHCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 88999729999999999999999999999-----995378654699999999999999999999 Q gi|254780253|r 3 KFKDISVMSIDQLTEKLIQLKKDQMSLRF-----QKASGQIEKPFRMREVSRDIARIKTMMNSR 61 (67) Q Consensus 3 k~~elr~~s~~eL~~~l~~l~~el~~lrf-----q~~~gql~np~~ir~~Rk~IARi~Tvl~er 61 (67) .+.|||.+|.++|.+.+..+-+|-..|-. .......++|.++..+|++.++|++|++|| T Consensus 24 ~~~eLR~KS~eDLHkLWyv~lKErN~L~T~~~e~~~~~~~~~~peR~~kVr~SM~~Ik~Vl~ER 87 (87) T pfam06984 24 AAEELRHKSFNDLHKLWYVLLKEKNMLLTMEHELLRNQEVMPSPERLKKVKISMENIKTVLRER 87 (87) T ss_pred CHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCC T ss_conf 3899874459999999999999988999999999975336886688889999999999998159 No 12 >COG5493 Uncharacterized conserved protein containing a coiled-coil domain [Function unknown] Probab=60.80 E-value=15 Score=18.17 Aligned_cols=54 Identities=22% Similarity=0.301 Sum_probs=39.3 Q ss_pred HHCCHHHHHHHHHHHHHHHHH----HHHHHHHCCC------CCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 729999999999999999999----9999953786------54699999999999999999999 Q gi|254780253|r 8 SVMSIDQLTEKLIQLKKDQMS----LRFQKASGQI------EKPFRMREVSRDIARIKTMMNSR 61 (67) Q Consensus 8 r~~s~~eL~~~l~~l~~el~~----lrfq~~~gql------~np~~ir~~Rk~IARi~Tvl~er 61 (67) ..-..++|.+++...++++-+ .||+.++.+- .....++.++.+|-|+-|+++-- T Consensus 44 tk~dve~l~~e~E~~~k~l~de~~E~r~~~~tke~lk~l~~~~~~~f~a~~edi~rlE~~i~~l 107 (231) T COG5493 44 TKQDVEELRKETEQRQKELADEKLEVRKQKATKEDLKLLQRFQEEEFRATKEDIKRLETIITGL 107 (231) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8977999999999999999988999999774299999999888888887899999999999988 No 13 >pfam10655 DUF2482 Hypothetical protein of unknown function (DUF2482). All the members of this very small, very short family are derived from bacteriophages, of the SA bacteriophages 11, Mu50B, system, and from the Staphylococcal_phi-Mu50B-like_prophages subsystem. All members are hypothetical proteins. Probab=58.20 E-value=8.7 Score=19.57 Aligned_cols=27 Identities=19% Similarity=0.225 Sum_probs=24.2 Q ss_pred HHHHHCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 999729999999999999999999999 Q gi|254780253|r 5 KDISVMSIDQLTEKLIQLKKDQMSLRF 31 (67) Q Consensus 5 ~elr~~s~~eL~~~l~~l~~el~~lrf 31 (67) +.++.|+.+|+...|.+--.|||+|-- T Consensus 3 knyKdMtqdEirdLLseKngEL~el~~ 29 (100) T pfam10655 3 KNYKDMTQDEIRDLLSEKNGELYELAK 29 (100) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 407662399999999861037999999 No 14 >PRK10613 hypothetical protein; Provisional Probab=36.35 E-value=37 Score=15.82 Aligned_cols=49 Identities=20% Similarity=0.245 Sum_probs=28.0 Q ss_pred CHHHHHHHHHHHHHHHH--------------HHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999--------------99999953786546999999999999999999 Q gi|254780253|r 11 SIDQLTEKLIQLKKDQM--------------SLRFQKASGQIEKPFRMREVSRDIARIKTMMN 59 (67) Q Consensus 11 s~~eL~~~l~~l~~el~--------------~lrfq~~~gql~np~~ir~~Rk~IARi~Tvl~ 59 (67) +.++|..++.-|+.-+- -+.+.+...++++|.+--..+..|.+|+|..+ T Consensus 11 ~~e~LA~EV~CLK~~lTlmLkamGQADAGkVIi~mEr~ia~~eD~~QA~vf~nTi~QIK~ayR 73 (74) T PRK10613 11 NNQELANEVSCLKAMLTLMLQAMGQADAGRVILKMERQIAQIEDEAQAAVFSNTVKQIKQAYR 73 (74) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHC T ss_conf 799999999999999999999826323133778899999850588899999999999999863 No 15 >pfam10769 DUF2594 Protein of unknown function (DUF2594). This family of proteins with unknown function appear to be restricted to Enterobacteriaceae. Probab=35.47 E-value=38 Score=15.73 Aligned_cols=49 Identities=16% Similarity=0.207 Sum_probs=27.1 Q ss_pred CHHHHHHHHHHHHHHHH--------------HHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999--------------99999953786546999999999999999999 Q gi|254780253|r 11 SIDQLTEKLIQLKKDQM--------------SLRFQKASGQIEKPFRMREVSRDIARIKTMMN 59 (67) Q Consensus 11 s~~eL~~~l~~l~~el~--------------~lrfq~~~gql~np~~ir~~Rk~IARi~Tvl~ 59 (67) +.++|..++.-|+.-+- -+.+.+...++++|.+--..+..|.+|+|..+ T Consensus 11 ~~e~LA~EV~CLK~llTlmLka~GQADAGkvii~mEr~ia~~eD~~QAevf~ntv~QIK~aYR 73 (74) T pfam10769 11 NNETLATEVTCLKALLTLILKAIGQADAGRVILNMERQIAEIEDAEQAAVFTNTVKQIKQAYR 73 (74) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHC T ss_conf 889999999999999999999836323024789999999871388899999999999999862 No 16 >pfam06698 DUF1192 Protein of unknown function (DUF1192). This family consists of several short, hypothetical, bacterial proteins of around 60 residues in length. The function of this family is unknown. Probab=35.02 E-value=39 Score=15.69 Aligned_cols=27 Identities=26% Similarity=0.534 Sum_probs=19.0 Q ss_pred HHHHHCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 999729999999999999999999999 Q gi|254780253|r 5 KDISVMSIDQLTEKLIQLKKDQMSLRF 31 (67) Q Consensus 5 ~elr~~s~~eL~~~l~~l~~el~~lrf 31 (67) .+|..+|.+||...+..|+.|.-.+.- T Consensus 15 ~DLs~lSv~EL~~RIa~L~~EI~R~e~ 41 (58) T pfam06698 15 QDLSLLSVEELEERIALLQAEIARLEA 41 (58) T ss_pred CCCHHCCHHHHHHHHHHHHHHHHHHHH T ss_conf 880222799999999999999999999 No 17 >smart00540 LEM in nuclear membrane-associated proteins. LEM, domain in nuclear membrane-associated proteins, including lamino-associated polypeptide 2 and emerin. Probab=25.08 E-value=46 Score=15.29 Aligned_cols=16 Identities=31% Similarity=0.387 Sum_probs=12.2 Q ss_pred HHHHCCHHHHHHHHHH Q ss_conf 9972999999999999 Q gi|254780253|r 6 DISVMSIDQLTEKLIQ 21 (67) Q Consensus 6 elr~~s~~eL~~~l~~ 21 (67) ++..+|++||..++.. T Consensus 2 dv~~Lsd~eLr~~L~~ 17 (44) T smart00540 2 DVDRLSDAELRAELKQ 17 (44) T ss_pred CHHHHCHHHHHHHHHH T ss_conf 1878269999999998 No 18 >pfam08984 DUF1858 Domain of unknown function (DUF1858). This domain has no known function. It is found in various hypothetical bacterial proteins. Probab=24.51 E-value=59 Score=14.59 Aligned_cols=45 Identities=16% Similarity=0.302 Sum_probs=33.7 Q ss_pred HCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 29999999999999999999999995378654699999999999999999999 Q gi|254780253|r 9 VMSIDQLTEKLIQLKKDQMSLRFQKASGQIEKPFRMREVSRDIARIKTMMNSR 61 (67) Q Consensus 9 ~~s~~eL~~~l~~l~~el~~lrfq~~~gql~np~~ir~~Rk~IARi~Tvl~er 61 (67) +++.-||.+..-++..-|.++.| .++.||. .++.+||+.|+-..- T Consensus 4 ~~tV~~l~~~yPe~~~il~~~GF----~~l~nP~----m~~t~gr~~tl~~aa 48 (59) T pfam08984 4 DTTVGDLVERYPETIEILVELGF----KPLANPA----MRNTVGRVATLEQAA 48 (59) T ss_pred CCCHHHHHHHCHHHHHHHHHCCC----HHCCCHH----HHHHHHHHCCHHHHH T ss_conf 35299999879789999998695----1106969----998764455699999 No 19 >PRK11449 putative deoxyribonuclease YjjV; Provisional Probab=23.51 E-value=62 Score=14.48 Aligned_cols=25 Identities=12% Similarity=0.193 Sum_probs=12.5 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 8999729999999999999999999 Q gi|254780253|r 4 FKDISVMSIDQLTEKLIQLKKDQMS 28 (67) Q Consensus 4 ~~elr~~s~~eL~~~l~~l~~el~~ 28 (67) +.++++++.+|+.+.+.+--..+|+ T Consensus 233 iA~ik~~~~eev~~~~~~N~~~lF~ 257 (258) T PRK11449 233 LCELRPEPADEIAEVLLNNTYTLFR 257 (258) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHHC T ss_conf 9988697999999999999999859 No 20 >pfam03020 LEM LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin, and Man1. Probab=22.41 E-value=51 Score=15.00 Aligned_cols=16 Identities=25% Similarity=0.383 Sum_probs=11.6 Q ss_pred HHHHCCHHHHHHHHHH Q ss_conf 9972999999999999 Q gi|254780253|r 6 DISVMSIDQLTEKLIQ 21 (67) Q Consensus 6 elr~~s~~eL~~~l~~ 21 (67) ++..+|++||..+|.. T Consensus 2 dv~~LsD~eL~~~L~~ 17 (43) T pfam03020 2 DVAQLSDAELRSELRK 17 (43) T ss_pred CHHHHCHHHHHHHHHH T ss_conf 6777185999999998 No 21 >COG1154 Dxs Deoxyxylulose-5-phosphate synthase [Coenzyme metabolism / Lipid metabolism] Probab=21.73 E-value=68 Score=14.26 Aligned_cols=31 Identities=13% Similarity=0.142 Sum_probs=25.7 Q ss_pred CCHHHHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8889997299999999999999999999999 Q gi|254780253|r 2 LKFKDISVMSIDQLTEKLIQLKKDQMSLRFQ 32 (67) Q Consensus 2 mk~~elr~~s~~eL~~~l~~l~~el~~lrfq 32 (67) ....||+.+|.+||.+.-.|+|..+.+---+ T Consensus 9 ~~P~dLk~ls~~eL~~La~EiR~~li~~vS~ 39 (627) T COG1154 9 NSPADLKKLSIEELPQLADEIREFLLEVVSA 39 (627) T ss_pred CCHHHHHHCCHHHHHHHHHHHHHHHHHHHCC T ss_conf 8989986479999999999999999998622 No 22 >TIGR01597 PYST-B Plasmodium yoelii subtelomeric family PYST-B; InterPro: IPR006484 The sequences in this group represent a paralogous family of Plasmodium yoelii genes preferentially located in the subtelomeric regions of the chromosomes. There are no obvious homologs to these genes in any other organism. . Probab=20.65 E-value=59 Score=14.62 Aligned_cols=22 Identities=36% Similarity=0.401 Sum_probs=18.0 Q ss_pred HCCHHHHHHHHHHHHHHHHHHH Q ss_conf 2999999999999999999999 Q gi|254780253|r 9 VMSIDQLTEKLIQLKKDQMSLR 30 (67) Q Consensus 9 ~~s~~eL~~~l~~l~~el~~lr 30 (67) ++..++|+.+|.|++|||-+-+ T Consensus 112 KKlI~klqkElEE~kKELdN~~ 133 (274) T TIGR01597 112 KKLIDKLQKELEELKKELDNKR 133 (274) T ss_pred HHHHHHHHHHHHHHHHHHCCCC T ss_conf 7889998888887547652466 No 23 >PRK04351 hypothetical protein; Provisional Probab=20.29 E-value=43 Score=15.43 Aligned_cols=26 Identities=12% Similarity=0.249 Sum_probs=22.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999953 Q gi|254780253|r 10 MSIDQLTEKLIQLKKDQMSLRFQKAS 35 (67) Q Consensus 10 ~s~~eL~~~l~~l~~el~~lrfq~~~ 35 (67) |++.||+..+.++..+.|..-|.+.. T Consensus 1 m~~~eLq~lve~iS~~~F~~pF~H~a 26 (151) T PRK04351 1 MTNQELQRLVEEISLEYFGKPFRHKA 26 (151) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCEEE T ss_conf 98799999999999998388752257 Done!