Query psy4243
Match_columns 104
No_of_seqs 110 out of 845
Neff 6.0
Searched_HMMs 46136
Date Fri Aug 16 20:28:03 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4243.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4243hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 TIGR01029 rpsG_bact ribosomal 100.0 7E-41 1.5E-45 240.3 9.7 101 2-103 48-154 (154)
2 PRK05302 30S ribosomal protein 100.0 3.6E-40 7.8E-45 237.0 9.3 101 2-103 50-156 (156)
3 CHL00053 rps7 ribosomal protei 100.0 9.3E-39 2E-43 229.4 9.8 100 2-102 50-155 (155)
4 COG0049 RpsG Ribosomal protein 100.0 3.5E-36 7.6E-41 214.2 9.4 94 2-96 49-148 (148)
5 PRK04027 30S ribosomal protein 100.0 3E-35 6.4E-40 217.8 8.3 95 2-96 91-195 (195)
6 TIGR01028 S7_S5_E_A ribosomal 100.0 8.7E-35 1.9E-39 214.2 8.4 94 2-96 82-186 (186)
7 PTZ00091 40S ribosomal protein 100.0 9.6E-35 2.1E-39 214.8 8.4 95 1-96 88-193 (193)
8 KOG3291|consensus 100.0 2E-33 4.4E-38 209.2 8.8 95 2-96 106-208 (208)
9 PF00177 Ribosomal_S7: Ribosom 100.0 7.1E-32 1.5E-36 191.9 10.2 94 2-96 49-148 (148)
10 PF01524 Gemini_V1: Geminiviru 60.9 26 0.00057 22.7 4.5 48 42-101 17-64 (78)
11 PF15127 DUF4565: Protein of u 52.5 11 0.00025 25.0 1.9 26 62-99 56-81 (91)
12 PF00610 DEP: Domain found in 39.2 27 0.00059 21.0 2.0 30 45-74 22-51 (74)
13 PF06021 Gly_acyl_tr_N: Aralky 35.4 31 0.00068 26.1 2.1 32 1-33 1-34 (205)
14 PF12209 SAC3: Leucine permeas 33.8 1.2E+02 0.0027 19.3 4.5 33 60-100 35-67 (79)
15 COG5044 MRS6 RAB proteins gera 31.9 1.9E+02 0.0042 24.3 6.3 75 27-101 114-216 (434)
16 COG5573 Predicted nucleic-acid 28.6 28 0.00061 24.9 0.9 68 25-104 34-101 (142)
17 PF14160 FAM110_C: Centrosome- 28.3 23 0.0005 24.3 0.4 25 29-53 83-107 (111)
18 PF13314 DUF4083: Domain of un 26.7 1.4E+02 0.0031 18.3 3.6 33 44-77 24-57 (58)
19 PF14202 TnpW: Transposon-enco 24.8 40 0.00087 18.6 0.9 17 25-41 6-22 (37)
20 PF05164 ZapA: Cell division p 22.5 1.9E+02 0.004 17.7 3.8 33 24-56 4-36 (89)
21 PF05743 UEV: UEV domain; Int 20.8 49 0.0011 22.5 0.9 16 22-37 39-54 (121)
22 PHA02652 hypothetical protein; 20.6 47 0.001 20.5 0.7 21 3-23 18-38 (70)
23 PF06972 DUF1296: Protein of u 20.6 44 0.00094 20.7 0.5 12 4-15 42-53 (60)
24 PHA02100 hypothetical protein 20.2 60 0.0013 22.0 1.2 17 73-89 91-108 (112)
No 1
>TIGR01029 rpsG_bact ribosomal protein S7, bacterial/organelle. This model describes the bacterial and organellar branch of the ribosomal protein S7 family (includes prokaroytic S7 and eukaryotic S5). The eukaryotic and archaeal branch is described by model TIGR01028.
Probab=100.00 E-value=7e-41 Score=240.25 Aligned_cols=101 Identities=27% Similarity=0.454 Sum_probs=96.0
Q ss_pred ccccccChHHHHHHHhhhcccce------eecceeeeeccccChhHHHHHHHHHHHHHHhcCCCCCCHHHHHHHHHHHHh
Q psy4243 2 LHLVCQDLIWVCLVLGLCNILTI------KAHLASIKVPIPISDKEARFKCMKWHIEIAKDKPSEVRFSDNLAQILIDGA 75 (104)
Q Consensus 2 ~~~~~~~~~~~~L~~Ai~nv~P~------r~gG~~~qvP~~i~~~rr~~lAirWii~~a~~k~~~~~~~~~La~Eii~A~ 75 (104)
|+..++.+|+++|.+||+|++|. ++||++||||++|++.||+++||+||+++|++|+ +++|+++||+||+||+
T Consensus 48 i~~~~~~nPi~vl~~Ai~nv~P~~evk~~r~gG~~yqvP~~v~~~rr~~lAirWil~~a~~r~-~~~~~~~La~Eii~A~ 126 (154)
T TIGR01029 48 IAKKTGEDPLEVFEQALENVKPLVEVKSRRVGGATYQVPVEVRPSRRYALAIRWLIEAARKRG-GKSMAERLANEILDAA 126 (154)
T ss_pred HHHHcCCCHHHHHHHHHHhCCCCeeeEEeecCCEEEEEeeEcCHHHHHHHHHHHHHHHHHhcC-CCCHHHHHHHHHHHHH
Confidence 45567888999999999999996 7899999999999999999999999999999876 6899999999999999
Q ss_pred cCCChhHHhHHHHHHHHHHhhhhccccc
Q psy4243 76 DNKGKVVKKKTDLHRQCEANRAYAHYYF 103 (104)
Q Consensus 76 ~~~g~aikkK~elhK~A~aNRa~ahyrw 103 (104)
+++|.+++||||+||||++||||+||||
T Consensus 127 ~~~g~aikkKee~hk~AeaNRaf~h~Rw 154 (154)
T TIGR01029 127 NNTGAAIKKREDTHRMAEANKAFAHYRW 154 (154)
T ss_pred hCCCceeeHHHHHHHHHHHhHhHhcccC
Confidence 9999999999999999999999999998
No 2
>PRK05302 30S ribosomal protein S7; Validated
Probab=100.00 E-value=3.6e-40 Score=236.99 Aligned_cols=101 Identities=30% Similarity=0.493 Sum_probs=95.9
Q ss_pred ccccccChHHHHHHHhhhcccce------eecceeeeeccccChhHHHHHHHHHHHHHHhcCCCCCCHHHHHHHHHHHHh
Q psy4243 2 LHLVCQDLIWVCLVLGLCNILTI------KAHLASIKVPIPISDKEARFKCMKWHIEIAKDKPSEVRFSDNLAQILIDGA 75 (104)
Q Consensus 2 ~~~~~~~~~~~~L~~Ai~nv~P~------r~gG~~~qvP~~i~~~rr~~lAirWii~~a~~k~~~~~~~~~La~Eii~A~ 75 (104)
||..++.+|+++|.+||+|++|+ ++||++||||+||++.||+++||+||+++|++|+ +++|+++||+||+||+
T Consensus 50 I~~~~~~npl~il~~Ai~nv~P~~evk~~r~gG~~~~vP~~i~~~rr~~lAirwil~~a~~r~-~~~~~~~La~Eii~A~ 128 (156)
T PRK05302 50 IEEKTGKDPLEVFEKALENVKPVVEVKSRRVGGATYQVPVEVRPERRQALAMRWLVEAARKRG-EKTMAERLANELLDAA 128 (156)
T ss_pred HHHHcCCChHHHHHHHHHhCCCCeEEeeeccCCEEEEEeEEeCHHHHHHHHHHHHHHHHHHcC-CCCHHHHHHHHHHHHH
Confidence 45567788899999999999996 7899999999999999999999999999999876 5899999999999999
Q ss_pred cCCChhHHhHHHHHHHHHHhhhhccccc
Q psy4243 76 DNKGKVVKKKTDLHRQCEANRAYAHYYF 103 (104)
Q Consensus 76 ~~~g~aikkK~elhK~A~aNRa~ahyrw 103 (104)
+++|.+++||+|+||+|++||||+||||
T Consensus 129 ~~~g~aikkK~e~hk~A~aNRa~ah~R~ 156 (156)
T PRK05302 129 NNRGAAVKKREDTHRMAEANKAFAHYRW 156 (156)
T ss_pred hCCCCeeehHHHHHHHHHHhhhHhhcCC
Confidence 9999999999999999999999999998
No 3
>CHL00053 rps7 ribosomal protein S7
Probab=100.00 E-value=9.3e-39 Score=229.37 Aligned_cols=100 Identities=23% Similarity=0.421 Sum_probs=95.3
Q ss_pred ccccccChHHHHHHHhhhcccce------eecceeeeeccccChhHHHHHHHHHHHHHHhcCCCCCCHHHHHHHHHHHHh
Q psy4243 2 LHLVCQDLIWVCLVLGLCNILTI------KAHLASIKVPIPISDKEARFKCMKWHIEIAKDKPSEVRFSDNLAQILIDGA 75 (104)
Q Consensus 2 ~~~~~~~~~~~~L~~Ai~nv~P~------r~gG~~~qvP~~i~~~rr~~lAirWii~~a~~k~~~~~~~~~La~Eii~A~ 75 (104)
||..++.+|+++|.+||+|++|. ++||++||||+||+|.||+++||+||+++|++++ +.+|+++||+||++|+
T Consensus 50 i~~~~~~npi~vl~~Ai~N~~P~~evk~~r~gG~~~qvPv~v~~~rr~~lAirWil~~ar~r~-~~~~~~~La~Eii~A~ 128 (155)
T CHL00053 50 IQQKTEKNPLSVLRQAIRNVTPDVEVKARRVGGSTYQVPIEIGSTRGKALAIRWLLKASRKRS-GRNMAFKLSSELVDAA 128 (155)
T ss_pred HHHHcCCCHHHHHHHHHHhCCCcEEEEEEeeCCEEEEEeeEcCHHHHHHHHHHHHHHHHHhcC-CCCHHHHHHHHHHHHH
Confidence 56678888999999999999996 8899999999999999999999999999999876 5799999999999999
Q ss_pred cCCChhHHhHHHHHHHHHHhhhhcccc
Q psy4243 76 DNKGKVVKKKTDLHRQCEANRAYAHYY 102 (104)
Q Consensus 76 ~~~g~aikkK~elhK~A~aNRa~ahyr 102 (104)
+|+|.+++||+|+||+|++||||+|||
T Consensus 129 ~~~g~aikkK~e~hk~A~aNRa~~h~r 155 (155)
T CHL00053 129 KGSGNAIRKKEETHRMAEANRAFAHFR 155 (155)
T ss_pred hCCCchhhhHHHHHHHHHHhHhHhccC
Confidence 999999999999999999999999997
No 4
>COG0049 RpsG Ribosomal protein S7 [Translation, ribosomal structure and biogenesis]
Probab=100.00 E-value=3.5e-36 Score=214.22 Aligned_cols=94 Identities=28% Similarity=0.404 Sum_probs=89.5
Q ss_pred ccccccChHHHHHHHhhhcccce------eecceeeeeccccChhHHHHHHHHHHHHHHhcCCCCCCHHHHHHHHHHHHh
Q psy4243 2 LHLVCQDLIWVCLVLGLCNILTI------KAHLASIKVPIPISDKEARFKCMKWHIEIAKDKPSEVRFSDNLAQILIDGA 75 (104)
Q Consensus 2 ~~~~~~~~~~~~L~~Ai~nv~P~------r~gG~~~qvP~~i~~~rr~~lAirWii~~a~~k~~~~~~~~~La~Eii~A~ 75 (104)
|+--|+.+|++||++||+|++|. ++||.+|||||+|+|.||+++||+||+++|+++. +++|++|||+|||||+
T Consensus 49 i~~kTg~~Piqvf~~Ai~N~~P~~Ev~srrvGGa~yqvpVeV~p~RR~~lAlrwi~~~Ar~r~-~ktm~erLA~Eli~Aa 127 (148)
T COG0049 49 IEKKTGQNPLQVFEKAIENVKPREEVKSRRVGGATYQVPVEVRPRRRVALALRWIVEGARKRN-EKTMAERLANELIDAA 127 (148)
T ss_pred HHHHHCCChHHHHHHHHHhcCchhheeeeeeCCeeeEecCccCHHHHHHHHHHHHHHHHHhcC-CCCHHHHHHHHHHHHH
Confidence 45568899999999999999997 8899999999999999999999999999999874 7999999999999999
Q ss_pred cCCChhHHhHHHHHHHHHHhh
Q psy4243 76 DNKGKVVKKKTDLHRQCEANR 96 (104)
Q Consensus 76 ~~~g~aikkK~elhK~A~aNR 96 (104)
+|.|.|||||||+||||++||
T Consensus 128 n~~g~AikKked~hRmAeAnr 148 (148)
T COG0049 128 NNTGAAIKKKEDTHRMAEANR 148 (148)
T ss_pred cCCcchhhhHHHHHHHHHhcC
Confidence 999999999999999999997
No 5
>PRK04027 30S ribosomal protein S7P; Reviewed
Probab=100.00 E-value=3e-35 Score=217.85 Aligned_cols=95 Identities=20% Similarity=0.193 Sum_probs=90.2
Q ss_pred ccccccChHHHHHHHhhhcccce------eecceeeeeccccChhHHHHHHHHHHHHHHhcC--CCCCCHHHHHHHHHHH
Q psy4243 2 LHLVCQDLIWVCLVLGLCNILTI------KAHLASIKVPIPISDKEARFKCMKWHIEIAKDK--PSEVRFSDNLAQILID 73 (104)
Q Consensus 2 ~~~~~~~~~~~~L~~Ai~nv~P~------r~gG~~~qvP~~i~~~rr~~lAirWii~~a~~k--~~~~~~~~~La~Eii~ 73 (104)
||+.|+.+|+++|++||+|++|. ++||++||||++|+|.||+++|||||+++|+++ +++++|++|||+||+|
T Consensus 91 I~~kT~~nPiqVl~~AI~N~~P~~Ev~~~r~GG~tyqvpVdV~p~Rr~~lAirwI~~~ar~r~fr~~ksm~e~LA~Elid 170 (195)
T PRK04027 91 IEKRTKQNPIQVLVDAIENAAPREETTRIKYGGISYPKAVDVSPQRRVDLALRFIAEGARQASFKNKKSIEEALAEEIIA 170 (195)
T ss_pred HHHHHCCCHHHHHHHHHHcCCCCeEEEEEecCCEEEEeeeecCHHHHHHHHHHHHHHHHHhccccCCCCHHHHHHHHHHH
Confidence 78889999999999999999996 789999999999999999999999999999985 3579999999999999
Q ss_pred HhcC--CChhHHhHHHHHHHHHHhh
Q psy4243 74 GADN--KGKVVKKKTDLHRQCEANR 96 (104)
Q Consensus 74 A~~~--~g~aikkK~elhK~A~aNR 96 (104)
|+++ +|.|++||||+||||++||
T Consensus 171 Aa~~~~~g~AikKKeE~hRmAeANr 195 (195)
T PRK04027 171 AANNDPKSYAVRKKEEIERVAKSAR 195 (195)
T ss_pred HHcCCCCCceeecHHHHHHHHHhcC
Confidence 9999 5889999999999999998
No 6
>TIGR01028 S7_S5_E_A ribosomal protein S7(archaeal)/S5(eukaryotic). This model describes the members from the eukaryotic cytosol and the Archaea of the family that includes ribosomal protein S7 of bacteria and S5 of eukaryotes. A separate model describes bacterial and organellar S7.
Probab=100.00 E-value=8.7e-35 Score=214.23 Aligned_cols=94 Identities=17% Similarity=0.104 Sum_probs=88.5
Q ss_pred ccccccChHHHHHHHhhhcccce------eecceeeeeccccChhHHHHHHHHHHHHHHhc---CCCCCCHHHHHHHHHH
Q psy4243 2 LHLVCQDLIWVCLVLGLCNILTI------KAHLASIKVPIPISDKEARFKCMKWHIEIAKD---KPSEVRFSDNLAQILI 72 (104)
Q Consensus 2 ~~~~~~~~~~~~L~~Ai~nv~P~------r~gG~~~qvP~~i~~~rr~~lAirWii~~a~~---k~~~~~~~~~La~Eii 72 (104)
||..|+.+|+++|++||+|++|. ++||++||||+||+|.||+++|||||+++|++ |+ +++|+++||+||+
T Consensus 82 I~~~t~~nPi~vl~~AI~N~~P~~ev~~~r~GG~~yqVPv~v~p~Rr~~lAirWI~~~ar~~~~R~-~k~m~~~LA~Eii 160 (186)
T TIGR01028 82 IEKRTGENPIQVLVDAIENAGPREDTTRIGYGGIVYRQAVDISPLRRVDQALRNIALGAREAAFRN-KKSIAECLAEEII 160 (186)
T ss_pred HHHHhCCChHHHHHHHHHcCCCCeEEEEEecCCEEEEeccccCHHHHHHHHHHHHHHHHHHhcccC-CCcHHHHHHHHHH
Confidence 67788999999999999999996 88999999999999999999999999999997 44 6899999999999
Q ss_pred HHhcCC--ChhHHhHHHHHHHHHHhh
Q psy4243 73 DGADNK--GKVVKKKTDLHRQCEANR 96 (104)
Q Consensus 73 ~A~~~~--g~aikkK~elhK~A~aNR 96 (104)
||++|+ |.|++||||+||||++||
T Consensus 161 ~A~~~~g~g~AikKKeE~hr~AeaNR 186 (186)
T TIGR01028 161 LAANGDSKSYAIKKKEELERVAKSAR 186 (186)
T ss_pred HHHhCCCCCceeehHHHHHHHHHhcC
Confidence 999998 469999999999999997
No 7
>PTZ00091 40S ribosomal protein S5; Provisional
Probab=100.00 E-value=9.6e-35 Score=214.77 Aligned_cols=95 Identities=18% Similarity=0.176 Sum_probs=89.4
Q ss_pred CccccccChHHHHHHHhhhcccce------eecceeeeeccccChhHHHHHHHHHHHHHHhc---CCCCCCHHHHHHHHH
Q psy4243 1 MLHLVCQDLIWVCLVLGLCNILTI------KAHLASIKVPIPISDKEARFKCMKWHIEIAKD---KPSEVRFSDNLAQIL 71 (104)
Q Consensus 1 ~~~~~~~~~~~~~L~~Ai~nv~P~------r~gG~~~qvP~~i~~~rr~~lAirWii~~a~~---k~~~~~~~~~La~Ei 71 (104)
+||+.|+.+|++||++||+|++|. ++||++||||++|+|.||+++|||||+++|++ |+ +++|++|||+||
T Consensus 88 iI~~~t~~nPiqVl~~AI~N~~P~~e~~ri~~GG~~yqvpVdVsp~Rr~~lAirwI~~~ar~~~fR~-~ks~~e~LA~El 166 (193)
T PTZ00091 88 IIHLLTDKNPLQVLVDAVQNGGPREDSTRVGSGGVVRRQAVDVSPLRRVNQAIYLICKGAREAAFRN-IKTIAECLADEI 166 (193)
T ss_pred HHHHHHCCChHHHHHHHHHhCCCCeeeEEeecCCeEEEEeeecChHHHHHHHHHHHHHHHHhhcccC-CCCHHHHHHHHH
Confidence 378899999999999999999997 66899999999999999999999999999998 55 699999999999
Q ss_pred HHHhcCC--ChhHHhHHHHHHHHHHhh
Q psy4243 72 IDGADNK--GKVVKKKTDLHRQCEANR 96 (104)
Q Consensus 72 i~A~~~~--g~aikkK~elhK~A~aNR 96 (104)
|||++|+ |.|++||||+||||++||
T Consensus 167 i~Aa~~~~~s~AikKKeE~hR~AeANR 193 (193)
T PTZ00091 167 INASKESSNSYAIKKKDEVERVAKANR 193 (193)
T ss_pred HHHHhCCCCCcceehHHHHHHHHHhcC
Confidence 9999999 559999999999999998
No 8
>KOG3291|consensus
Probab=100.00 E-value=2e-33 Score=209.19 Aligned_cols=95 Identities=27% Similarity=0.389 Sum_probs=90.7
Q ss_pred ccccccChHHHHHHHhhhcccce------eecceeeeeccccChhHHHHHHHHHHHHHHhcCC--CCCCHHHHHHHHHHH
Q psy4243 2 LHLVCQDLIWVCLVLGLCNILTI------KAHLASIKVPIPISDKEARFKCMKWHIEIAKDKP--SEVRFSDNLAQILID 73 (104)
Q Consensus 2 ~~~~~~~~~~~~L~~Ai~nv~P~------r~gG~~~qvP~~i~~~rr~~lAirWii~~a~~k~--~~~~~~~~La~Eii~ 73 (104)
||++|++||++||++||.|+.|. .+||++|||||||+|.||+++||+||+++|++++ +.++|++|||+|||+
T Consensus 106 I~~~t~~NPiqvl~~AI~n~~P~~~~~~i~r~g~~~~vpv~v~~~rr~~~Ai~wi~~~ar~~~~~~~~~~ae~La~ELi~ 185 (208)
T KOG3291|consen 106 IKRLTERNPIQVLVQAIENCGPLEDVTRIGRAGTTYQVPVPVSPLRRVNLAIKWILTGARERSFRNIKTLAECLADELIA 185 (208)
T ss_pred HHHHhcCCHHHHHHHHHHhcCchhhceeecccceEEEecccCCcHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHH
Confidence 78999999999999999999997 5689999999999999999999999999999974 358999999999999
Q ss_pred HhcCCChhHHhHHHHHHHHHHhh
Q psy4243 74 GADNKGKVVKKKTDLHRQCEANR 96 (104)
Q Consensus 74 A~~~~g~aikkK~elhK~A~aNR 96 (104)
|++|+|+|++||||+||+|++||
T Consensus 186 Aakgs~~aikKKdeLhr~a~sNR 208 (208)
T KOG3291|consen 186 AAKGSGNAIKKKDELHRMAEANR 208 (208)
T ss_pred HHhCCchhhhhHHHHHHHHhccC
Confidence 99999999999999999999998
No 9
>PF00177 Ribosomal_S7: Ribosomal protein S7p/S5e; InterPro: IPR023798 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosomal protein S7 is one of the proteins from the small ribosomal subunit. In Escherichia coli, S7 is known to bind directly to part of the 3'end of 16S ribosomal RNA. It belongs to a family of ribosomal proteins which have been grouped on the basis of sequence similarities [, , ]. This entry represents the S7 structural domain, which consists of a bundle of six helices and an extended beta hairpin between helices 3 and 4, with two or more RNA-binding sites on its surface [].; PDB: 3IZ6_F 3V28_G 2WRN_G 2HHH_G 2V46_G 3KNL_G 2UUC_G 2Y16_G 2Y0W_G 2J02_G ....
Probab=99.97 E-value=7.1e-32 Score=191.87 Aligned_cols=94 Identities=35% Similarity=0.538 Sum_probs=87.4
Q ss_pred ccccccChHHHHHHHhhhcccce------eecceeeeeccccChhHHHHHHHHHHHHHHhcCCCCCCHHHHHHHHHHHHh
Q psy4243 2 LHLVCQDLIWVCLVLGLCNILTI------KAHLASIKVPIPISDKEARFKCMKWHIEIAKDKPSEVRFSDNLAQILIDGA 75 (104)
Q Consensus 2 ~~~~~~~~~~~~L~~Ai~nv~P~------r~gG~~~qvP~~i~~~rr~~lAirWii~~a~~k~~~~~~~~~La~Eii~A~ 75 (104)
|+..++.+|+++|.+||+|++|. ++||++||||++|+++||+++||+||++++++++ +++|+++||+||+||+
T Consensus 49 i~~~~~~npi~il~~Ai~n~~P~~~v~~~k~gg~~~~vP~~i~~~r~~~lAikWi~~~~~~r~-~~~~~~~La~Eii~a~ 127 (148)
T PF00177_consen 49 IKKKTNKNPIQILKQAIENVKPLVEVRSIKKGGKTYQVPVPISPKRRIFLAIKWILEAARKRK-EKSFSERLANEIIDAS 127 (148)
T ss_dssp HHHHHSSSHHHHHHHHHHHHS-SEEEEEEEETTEEEEEEEEETHHHHHHHHHHHHHHHHHTSS-SSSHHHHHHHHHHHHH
T ss_pred hhcccCCCHHHHHHHHHHHhccceeeeeeccCceeEEEEEeecHHHHHHHHHHHHHHHHhhcc-cchHHHHHHHHHHHHH
Confidence 45566778889999999999997 7899999999999999999999999999999766 6899999999999999
Q ss_pred cCCChhHHhHHHHHHHHHHhh
Q psy4243 76 DNKGKVVKKKTDLHRQCEANR 96 (104)
Q Consensus 76 ~~~g~aikkK~elhK~A~aNR 96 (104)
+++|.++++|+|+||+|++||
T Consensus 128 ~~~g~aikkK~~lhk~A~~NR 148 (148)
T PF00177_consen 128 NNKGSAIKKKEELHKMAEANR 148 (148)
T ss_dssp TTSSHHHHHHHHHHHHHHHTG
T ss_pred cCCcHHHHHHHHHHHHHHhcC
Confidence 999999999999999999998
No 10
>PF01524 Gemini_V1: Geminivirus V1 protein; InterPro: IPR002511 Disruption of the V1 gene in Tomato yellow leaf curl virus (TYLCV) stopped its ability to systemically infect Solanum lycopersicum (Tomato) (Lycopersicon esculentum) plants, suggesting that the V1 gene product is required for successful infection of the host [].; GO: 0019048 virus-host interaction, 0060967 negative regulation of gene silencing by RNA, 0030430 host cell cytoplasm
Probab=60.87 E-value=26 Score=22.72 Aligned_cols=48 Identities=21% Similarity=0.257 Sum_probs=36.2
Q ss_pred HHHHHHHHHHHHHhcCCCCCCHHHHHHHHHHHHhcCCChhHHhHHHHHHHHHHhhhhccc
Q psy4243 42 ARFKCMKWHIEIAKDKPSEVRFSDNLAQILIDGADNKGKVVKKKTDLHRQCEANRAYAHY 101 (104)
Q Consensus 42 r~~lAirWii~~a~~k~~~~~~~~~La~Eii~A~~~~g~aikkK~elhK~A~aNRa~ahy 101 (104)
|-.||+|++......=. ..++..-|-.++|... .-...+||++.|.||
T Consensus 17 RCMLAiKYlq~~~~~Y~-p~tlG~dlirdLI~vl-----------R~rnY~eA~~RY~~f 64 (78)
T PF01524_consen 17 RCMLAIKYLQLVEKTYS-PDTLGYDLIRDLISVL-----------RARNYVEASRRYSHF 64 (78)
T ss_pred hHHHHHHHHHHcccccC-CCCccHHHHHHHHHHH-----------hhhhHHHHHHHHHHh
Confidence 56789999887544323 5788888999999887 344578888888887
No 11
>PF15127 DUF4565: Protein of unknown function (DUF4565)
Probab=52.53 E-value=11 Score=25.04 Aligned_cols=26 Identities=23% Similarity=0.463 Sum_probs=17.8
Q ss_pred CHHHHHHHHHHHHhcCCChhHHhHHHHHHHHHHhhhhc
Q psy4243 62 RFSDNLAQILIDGADNKGKVVKKKTDLHRQCEANRAYA 99 (104)
Q Consensus 62 ~~~~~La~Eii~A~~~~g~aikkK~elhK~A~aNRa~a 99 (104)
.++++|++||+.- .+..-|..|-.|.
T Consensus 56 EyA~rLSqEIl~d------------AlqQWA~~n~kY~ 81 (91)
T PF15127_consen 56 EYAHRLSQEILSD------------ALQQWAENNIKYS 81 (91)
T ss_pred HHHHHHHHHHHHH------------HHHHHHHhCcccc
Confidence 4899999999843 2344666666554
No 12
>PF00610 DEP: Domain found in Dishevelled, Egl-10, and Pleckstrin (DEP); InterPro: IPR000591 This entry represents the DEP (Dishevelled, Egl-10 and Pleckstrin) domain, a globular domain of about 80 residues that is found in over 50 proteins involved in G-protein signalling pathways. It was named after the three proteins it was initially found in: Dishevelled (Dsh and Dvl), which play a key role in the transduction of the Wg/Wnt signal from the cell surface to the nucleus; it is a segment polarity protein required to establish coherent arrays of polarized cells and segments in embryos, and plays a role in wingless signalling. Egl-10, which regulates G-protein signalling in the central nervous system. Pleckstrin, the major substrate of protein kinase C in platelets; Pleckstrin contains two PH domains flanking the DEP domain. Mammalian regulators of G-protein signalling also contain these domains, and regulate signal transduction by increasing the GTPase activity of G-protein alpha subunits, thereby driving them into their inactive GDP-bound form. It has been proposed that the DEP domain could play a selective role in targeting DEP domain-containing proteins to specific subcellular membranous sites, perhaps even to specific G protein-coupled signaling pathways [, ]. Nuclear magnetic resonance spectroscopy has revealed that the DEP domain comprises a three-helix bundle, a beta-hairpin 'arm' composed of two beta-strands and two short beta-strands in the C-terminal region [].; GO: 0035556 intracellular signal transduction; PDB: 1UHW_A 1V3F_A 2YSR_A 2CSO_A 1W4M_A 2PBI_C 1O7F_A 2BYV_E 1FSH_A 3ML6_D ....
Probab=39.19 E-value=27 Score=20.98 Aligned_cols=30 Identities=17% Similarity=0.256 Sum_probs=21.2
Q ss_pred HHHHHHHHHHhcCCCCCCHHHHHHHHHHHH
Q psy4243 45 KCMKWHIEIAKDKPSEVRFSDNLAQILIDG 74 (104)
Q Consensus 45 lAirWii~~a~~k~~~~~~~~~La~Eii~A 74 (104)
-++.||++.+........-+..|++++++.
T Consensus 22 e~v~WL~~~~~~~~~~r~eA~~l~q~Ll~~ 51 (74)
T PF00610_consen 22 EAVDWLMDNFEGFVRDREEAVQLGQELLDH 51 (74)
T ss_dssp HHHHHHHHTSCTSTSSHHHHHHHHHHHHHC
T ss_pred HHHHHHHHhccccccCHHHHHHHHHHHHHC
Confidence 478999976643222457888899999864
No 13
>PF06021 Gly_acyl_tr_N: Aralkyl acyl-CoA:amino acid N-acyltransferase; InterPro: IPR015938 This entry represents mammalian-specific glycine N-acyltransferase (also called aralkyl acyl-CoA:amino acid N-acyltransferase; 2.3.1.13 from EC). Mitochondrial acyltransferases catalyse the transfer of an acyl group from acyl-CoA to the N terminus of glycine to produce N-acylglycine. These enzymes can conjugate a multitude of substrates to form a variety of N-acylglycines. The CoA derivatives of a number of aliphatic and aromatic acids, but not phenylacetyl-CoA or (indol-3-yl)acetyl-CoA, can act as donor [, ].; GO: 0047961 glycine N-acyltransferase activity, 0005739 mitochondrion
Probab=35.43 E-value=31 Score=26.09 Aligned_cols=32 Identities=25% Similarity=0.235 Sum_probs=25.8
Q ss_pred CccccccChHHHHHHHhhhcccce--eecceeeee
Q psy4243 1 MLHLVCQDLIWVCLVLGLCNILTI--KAHLASIKV 33 (104)
Q Consensus 1 ~~~~~~~~~~~~~L~~Ai~nv~P~--r~gG~~~qv 33 (104)
|++|.|.+-+ +.|.+.+...-|- ++=|+++.+
T Consensus 1 M~~L~~~~~L-q~Le~~L~k~~PeSLKVYG~V~~I 34 (205)
T PF06021_consen 1 MLVLNCSQKL-QILEKSLRKSFPESLKVYGAVFNI 34 (205)
T ss_pred CccccCHHHH-HHHHHHHHHhCchhheeeeEEEEe
Confidence 8889888887 7999999999996 665555543
No 14
>PF12209 SAC3: Leucine permease transcriptional regulator helical domain; InterPro: IPR024293 This domain is found in fungal proteins, including the nuclear mRNA export protein SAC3. It has been suggested this domain provides a scaffold within the yeast Sac3:Cdc31:Sus1:Thp1 (TREX-2) complex to integrate interactions between protein complexes to facilitate the coupling of transcription and mRNA export during gene expression [].; PDB: 3FWC_N 3FWB_B.
Probab=33.84 E-value=1.2e+02 Score=19.26 Aligned_cols=33 Identities=18% Similarity=0.160 Sum_probs=21.4
Q ss_pred CCCHHHHHHHHHHHHhcCCChhHHhHHHHHHHHHHhhhhcc
Q psy4243 60 EVRFSDNLAQILIDGADNKGKVVKKKTDLHRQCEANRAYAH 100 (104)
Q Consensus 60 ~~~~~~~La~Eii~A~~~~g~aikkK~elhK~A~aNRa~ah 100 (104)
.+.+-+.|++||.+|+ -.|-++-....-+|-..
T Consensus 35 R~~iI~sLs~ELy~AF--------i~E~~Y~~~leskAd~~ 67 (79)
T PF12209_consen 35 RKQIIDSLSEELYDAF--------IHEQLYQIYLESKADQF 67 (79)
T ss_dssp --HHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHH
Confidence 3578999999999998 34455555555554433
No 15
>COG5044 MRS6 RAB proteins geranylgeranyltransferase component A (RAB escort protein) [Posttranslational modification, protein turnover, chaperones]
Probab=31.91 E-value=1.9e+02 Score=24.30 Aligned_cols=75 Identities=19% Similarity=0.310 Sum_probs=49.8
Q ss_pred cceeeeecc-----------ccChhHHHHHHHHHHHHHHhcCCCCCCHHHH--HHHHHHHHhcCCCh-------h-----
Q psy4243 27 HLASIKVPI-----------PISDKEARFKCMKWHIEIAKDKPSEVRFSDN--LAQILIDGADNKGK-------V----- 81 (104)
Q Consensus 27 gG~~~qvP~-----------~i~~~rr~~lAirWii~~a~~k~~~~~~~~~--La~Eii~A~~~~g~-------a----- 81 (104)
-|..|.||. ++-++|++..-|+|+.+.+.+...-+..++. +.+++..-+...|. +
T Consensus 114 ~~k~~kVP~ne~ei~~s~~lsL~eKr~vmrFl~~V~n~~~~~~~~~~~~e~k~~~~~~~ekf~L~~~~~e~i~~~i~l~l 193 (434)
T COG5044 114 PGKIYKVPYNEAEIFTSPLLSLFEKRRVMRFLKWVSNYAEQKSTLQELYESKDTMEFLFEKFGLSGATEEFIGHGIALSL 193 (434)
T ss_pred CCcEEECCccHHhhhcCCCcchhhHHHHHHHHHHHHhHHhhhhhchhhhhcccHHHHHHHHHccCcchhhhhhhhhhhhc
Confidence 348888984 5678999999999999888865433334442 44455454444321 1
Q ss_pred ---HHhHHHHHHHHHHhhhhccc
Q psy4243 82 ---VKKKTDLHRQCEANRAYAHY 101 (104)
Q Consensus 82 ---ikkK~elhK~A~aNRa~ahy 101 (104)
+.-|+.+.|++.-=|+|.-|
T Consensus 194 dl~~p~re~~erIl~Y~~Sf~~y 216 (434)
T COG5044 194 DLDIPAREALERILRYMRSFGDY 216 (434)
T ss_pred cccCCchHHHHHHHHHHHhhccc
Confidence 23578888888777777765
No 16
>COG5573 Predicted nucleic-acid-binding protein, contains PIN domain [General function prediction only]
Probab=28.62 E-value=28 Score=24.91 Aligned_cols=68 Identities=18% Similarity=0.179 Sum_probs=47.3
Q ss_pred eecceeeeeccccChhHHHHHHHHHHHHHHhcCCCCCCHHHHHHHHHHHHhcCCChhHHhHHHHHHHHHHhhhhcccccC
Q psy4243 25 KAHLASIKVPIPISDKEARFKCMKWHIEIAKDKPSEVRFSDNLAQILIDGADNKGKVVKKKTDLHRQCEANRAYAHYYFG 104 (104)
Q Consensus 25 r~gG~~~qvP~~i~~~rr~~lAirWii~~a~~k~~~~~~~~~La~Eii~A~~~~g~aikkK~elhK~A~aNRa~ahyrw~ 104 (104)
.-+|.+|-||+.|=-+ ..-..+ || .++++.++.|++.++..+...+--=.|.+..|-.-+.-.||-+|
T Consensus 34 ~a~~~~~VVs~QVl~E---------t~~vl~-RK--~~~s~~~i~~lie~~~~~~~Iv~~t~~~~~~a~~l~~ry~fs~W 101 (142)
T COG5573 34 QALGHTYVVSVQVLNE---------TCYVLK-RK--YGASEQLIQTLIEAFRRQCRIVHLTHEEVVQASRLAPRYHFSFW 101 (142)
T ss_pred HhcCceEEEehHHHHH---------HHHHHH-Hh--cCCcHHHHHHHHHHHHhhceeecCCHHHHHHHhccccccCchHH
Confidence 3467678888865211 112222 43 47899999999999999998888888888887666655555555
No 17
>PF14160 FAM110_C: Centrosome-associated C terminus
Probab=28.26 E-value=23 Score=24.34 Aligned_cols=25 Identities=24% Similarity=0.348 Sum_probs=22.1
Q ss_pred eeeeeccccChhHHHHHHHHHHHHH
Q psy4243 29 ASIKVPIPISDKEARFKCMKWHIEI 53 (104)
Q Consensus 29 ~~~qvP~~i~~~rr~~lAirWii~~ 53 (104)
..+++|.+++--.|...-||||-..
T Consensus 83 ~~~r~p~~~SIIERNARIIKWLy~C 107 (111)
T PF14160_consen 83 LAERVPYGVSIIERNARIIKWLYGC 107 (111)
T ss_pred ccccCCCCCceeeehhHHHHHHHhc
Confidence 4789999999999999999999753
No 18
>PF13314 DUF4083: Domain of unknown function (DUF4083)
Probab=26.65 E-value=1.4e+02 Score=18.28 Aligned_cols=33 Identities=9% Similarity=0.067 Sum_probs=22.7
Q ss_pred HHHHHHHHHHHhcCCC-CCCHHHHHHHHHHHHhcC
Q psy4243 44 FKCMKWHIEIAKDKPS-EVRFSDNLAQILIDGADN 77 (104)
Q Consensus 44 ~lAirWii~~a~~k~~-~~~~~~~La~Eii~A~~~ 77 (104)
.+-||++...+..|+. ..++..+| ++||+-+++
T Consensus 24 tl~IRri~~~s~~kkq~~~~~eqKL-DrIIeLLEK 57 (58)
T PF13314_consen 24 TLFIRRILINSNAKKQDVDSMEQKL-DRIIELLEK 57 (58)
T ss_pred HHHHHHHHHhccccccchhHHHHHH-HHHHHHHcc
Confidence 3568999988776542 23688888 777776654
No 19
>PF14202 TnpW: Transposon-encoded protein TnpW
Probab=24.80 E-value=40 Score=18.55 Aligned_cols=17 Identities=18% Similarity=0.192 Sum_probs=13.6
Q ss_pred eecceeeeeccccChhH
Q psy4243 25 KAHLASIKVPIPISDKE 41 (104)
Q Consensus 25 r~gG~~~qvP~~i~~~r 41 (104)
++||++|.|-+-.++.-
T Consensus 6 kIG~Tty~V~~~F~~~s 22 (37)
T PF14202_consen 6 KIGKTTYVVEVHFSETS 22 (37)
T ss_pred EECCEEEEEEEEECCCc
Confidence 78999999988776643
No 20
>PF05164 ZapA: Cell division protein ZapA; InterPro: IPR007838 This entry a structural domain found in the cell division protein ZapA, as well as in related proteins. This domain has a core structure consisting of two layers alpha/beta, and has a long C-terminal helix that forms dimeric parallel and tetrameric antiparallel coiled coils []. ZapA interacts with FtsZ, where FtsZ is part of a mid-cell cytokinetic structure termed the Z-ring that recruits a hierarchy of fission related proteins early in the bacterial cell cycle. ZapA drives the polymerisation and filament bundling of FtsZ, thereby contributing to the spatio-temporal tuning of the Z-ring.; PDB: 1T3U_B 1W2E_B 3HNW_A.
Probab=22.46 E-value=1.9e+02 Score=17.67 Aligned_cols=33 Identities=15% Similarity=0.080 Sum_probs=27.8
Q ss_pred eeecceeeeeccccChhHHHHHHHHHHHHHHhc
Q psy4243 24 IKAHLASIKVPIPISDKEARFKCMKWHIEIAKD 56 (104)
Q Consensus 24 ~r~gG~~~qvP~~i~~~rr~~lAirWii~~a~~ 56 (104)
+.++|++|.+.++-.+......+.+.|-+-..+
T Consensus 4 v~I~G~~y~i~~~~~~ee~l~~~a~~i~~~i~~ 36 (89)
T PF05164_consen 4 VTILGREYRIKCPDEDEEYLRKAAELINEKINE 36 (89)
T ss_dssp EEETTEEEEECETGCGHHHHHHHHHHHHHHHHH
T ss_pred EEECCEEEEeecCCCCHHHHHHHHHHHHHHHHH
Confidence 368999999999999998898888888776554
No 21
>PF05743 UEV: UEV domain; InterPro: IPR008883 The N-terminal ubiquitin E2 variant (UEV) domain is ~145 amino acid residues in length and shows significant sequence similarity to E2 ubiquitin ligases but is unable to catalyze ubiquitin transfer as it lacks the active site cysteine that forms the transient thioester bond with the C terminus of ubiquitin (Ub). Nevertheless, at least some UEVs have retained the ability to bind Ub, and appear to act either as cofactors in ubiquitylation reactions, or as ubiquitin sensors. UEV domains also frequently contain other protein recognition motifs, and may generally serve to couple protein and Ub binding functions to facilitate the formation of multiprotein complexes [, , , ]. The UEV domain consists of a twisted four-stranded antiparallel beta-sheet having a meander topology, with four alpha-helices packed against one face of the sheet. The UEV fold is generally similar to canonical E2 ligases in the hydrophobic core and 'active site' regions, but differs significantly at both its N- and C-termini [, ]. The UEV domain is found in the eukaryotic tumour susceptibility gene 101 protein (TSG101). Altered transcripts of this gene have been detected in sporadic breast cancers and many other Homo sapiens malignancies. However, the involvement of this gene in neoplastic transformation and tumourigenesis is still elusive. TSG101 is required for normal cell function of embryonic and adult tissues but this gene is not a tumour suppressor for sporadic forms of breast cancer [].; GO: 0006464 protein modification process, 0015031 protein transport; PDB: 3R3Q_A 3R42_A 1UZX_A 3OBX_A 3OBS_A 3P9H_A 2F0R_A 3P9G_A 3OBQ_A 3OBU_A ....
Probab=20.84 E-value=49 Score=22.52 Aligned_cols=16 Identities=25% Similarity=0.403 Sum_probs=12.3
Q ss_pred cceeecceeeeecccc
Q psy4243 22 LTIKAHLASIKVPIPI 37 (104)
Q Consensus 22 ~P~r~gG~~~qvP~~i 37 (104)
-|+...|++|.||+.|
T Consensus 39 ipi~y~g~~y~iPi~I 54 (121)
T PF05743_consen 39 IPITYKGSTYNIPICI 54 (121)
T ss_dssp EEECCTTCCEEEEEEE
T ss_pred cccccCCcccceeEEE
Confidence 3456789999999865
No 22
>PHA02652 hypothetical protein; Provisional
Probab=20.61 E-value=47 Score=20.49 Aligned_cols=21 Identities=29% Similarity=0.373 Sum_probs=16.3
Q ss_pred cccccChHHHHHHHhhhcccc
Q psy4243 3 HLVCQDLIWVCLVLGLCNILT 23 (104)
Q Consensus 3 ~~~~~~~~~~~L~~Ai~nv~P 23 (104)
.|+|+|.++..=..|+.|+.+
T Consensus 18 kllcgddilnmeydavmniis 38 (70)
T PHA02652 18 KLLCGDDILNMEYDAVMNIIS 38 (70)
T ss_pred HHhcccHHhhhhHHHHHHHHH
Confidence 378999988877888887643
No 23
>PF06972 DUF1296: Protein of unknown function (DUF1296); InterPro: IPR009719 This family represents a conserved region approximately 60 residues long within a number of plant proteins of unknown function.
Probab=20.58 E-value=44 Score=20.66 Aligned_cols=12 Identities=25% Similarity=0.174 Sum_probs=9.5
Q ss_pred ccccChHHHHHH
Q psy4243 4 LVCQDLIWVCLV 15 (104)
Q Consensus 4 ~~~~~~~~~~L~ 15 (104)
||+||++++|-.
T Consensus 42 LL~qD~FheVk~ 53 (60)
T PF06972_consen 42 LLSQDPFHEVKS 53 (60)
T ss_pred HHhcCcHHHHHH
Confidence 789999987754
No 24
>PHA02100 hypothetical protein
Probab=20.23 E-value=60 Score=22.02 Aligned_cols=17 Identities=24% Similarity=0.409 Sum_probs=14.6
Q ss_pred HHhcC-CChhHHhHHHHH
Q psy4243 73 DGADN-KGKVVKKKTDLH 89 (104)
Q Consensus 73 ~A~~~-~g~aikkK~elh 89 (104)
|||++ +|++.|.|.|++
T Consensus 91 daykkyqgkarkirqe~y 108 (112)
T PHA02100 91 DAYKKYQGKARKIRQEIY 108 (112)
T ss_pred HHHHHhhhHHHHHHHHhh
Confidence 67777 899999999986
Done!