Query 029596
Match_columns 191
No_of_seqs 120 out of 242
Neff 3.8
Searched_HMMs 46136
Date Fri Mar 29 15:27:07 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/029596.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/029596hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF01251 Ribosomal_S7e: Riboso 100.0 1.3E-96 3E-101 617.6 14.0 185 6-190 1-189 (189)
2 KOG3320 40S ribosomal protein 100.0 1.5E-92 3.3E-97 590.1 19.4 190 1-190 1-191 (192)
3 PTZ00389 40S ribosomal protein 100.0 1.3E-89 2.7E-94 573.7 20.1 182 8-190 1-184 (184)
4 PRK06418 transcription elongat 99.8 8.8E-19 1.9E-23 145.1 10.9 138 15-186 27-165 (166)
5 TIGR01618 phage_P_loop phage n 82.1 8.1 0.00018 33.4 7.8 93 68-168 110-206 (220)
6 PF01883 DUF59: Domain of unkn 67.6 19 0.00042 24.8 5.3 66 20-90 1-66 (72)
7 TIGR02945 SUF_assoc FeS assemb 62.5 25 0.00054 25.8 5.4 69 19-92 2-71 (99)
8 PF01577 Peptidase_S30: Potyvi 53.8 17 0.00038 30.6 3.7 40 116-155 102-154 (245)
9 smart00174 RHO Rho (Ras homolo 53.2 44 0.00096 25.4 5.5 45 59-103 71-115 (174)
10 PF02154 FliM: Flagellar motor 46.6 1.6E+02 0.0035 24.0 8.4 62 12-74 104-189 (192)
11 cd00157 Rho Rho (Ras homology) 39.6 99 0.0022 23.1 5.5 45 59-103 73-117 (171)
12 PF01383 CpcD: CpcD/allophycoc 35.0 45 0.00098 23.1 2.8 21 58-78 24-45 (56)
13 PF02374 ArsA_ATPase: Anion-tr 33.9 70 0.0015 28.6 4.5 32 68-99 255-286 (305)
14 PF00071 Ras: Ras family; Int 31.1 1.6E+02 0.0035 21.9 5.5 42 59-101 73-115 (162)
15 cd01870 RhoA_like RhoA-like su 30.0 1.8E+02 0.0038 22.0 5.6 43 60-102 75-117 (175)
16 TIGR02110 PQQ_syn_pqqF coenzym 29.8 1E+02 0.0022 31.2 5.3 49 50-102 607-668 (696)
17 cd04135 Tc10 TC10 subfamily. 29.3 1.9E+02 0.0041 21.9 5.7 41 60-100 74-114 (174)
18 cd01893 Miro1 Miro1 subfamily. 28.9 1.9E+02 0.0042 21.9 5.7 41 60-100 73-113 (166)
19 cd04130 Wrch_1 Wrch-1 subfamil 28.8 1.8E+02 0.0039 22.3 5.5 43 60-102 74-116 (173)
20 PF08534 Redoxin: Redoxin; In 28.4 1.2E+02 0.0026 22.8 4.4 47 50-97 19-67 (146)
21 cd04134 Rho3 Rho3 subfamily. 27.4 2E+02 0.0043 22.7 5.7 44 60-103 74-117 (189)
22 COG1868 FliM Flagellar motor s 27.4 5.1E+02 0.011 24.1 10.8 148 8-170 135-322 (332)
23 cd08971 AcNei2_N N-terminal do 27.2 1.2E+02 0.0026 23.3 4.2 60 79-156 9-68 (114)
24 PF06858 NOG1: Nucleolar GTP-b 27.1 90 0.0019 22.3 3.2 25 74-98 31-55 (58)
25 COG0195 NusA Transcription elo 26.6 1.4E+02 0.003 25.6 4.9 104 60-187 76-180 (190)
26 cd01874 Cdc42 Cdc42 subfamily. 25.5 2.3E+02 0.0049 22.3 5.7 44 59-102 74-117 (175)
27 cd00877 Ran Ran (Ras-related n 25.1 2.3E+02 0.0049 21.8 5.5 43 59-102 74-116 (166)
28 TIGR01397 fliM_switch flagella 25.1 4.8E+02 0.01 23.0 9.2 62 12-73 138-222 (320)
29 COG4496 Uncharacterized protei 24.8 22 0.00048 27.9 -0.2 31 112-142 4-48 (100)
30 PF14085 DUF4265: Domain of un 24.3 1E+02 0.0022 23.9 3.4 64 127-190 28-97 (117)
31 PF13479 AAA_24: AAA domain 23.9 1.4E+02 0.003 24.8 4.3 62 70-137 105-169 (213)
32 cd02970 PRX_like2 Peroxiredoxi 23.4 1.6E+02 0.0034 21.8 4.2 42 56-98 22-63 (149)
33 cd04172 Rnd3_RhoE_Rho8 Rnd3/Rh 22.6 2.7E+02 0.0058 22.4 5.7 44 59-102 78-121 (182)
34 cd04173 Rnd2_Rho7 Rnd2/Rho7 su 22.5 2.5E+02 0.0054 23.8 5.7 44 59-102 74-117 (222)
35 cd01875 RhoG RhoG subfamily. 22.5 2.8E+02 0.006 22.1 5.7 45 59-103 76-120 (191)
36 smart00175 RAB Rab subfamily o 21.2 3.4E+02 0.0074 19.9 5.7 40 59-99 74-114 (164)
37 cd04129 Rho2 Rho2 subfamily. 20.9 3.2E+02 0.0069 21.5 5.7 41 60-100 75-115 (187)
No 1
>PF01251 Ribosomal_S7e: Ribosomal protein S7e; InterPro: IPR000554 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. A number of eukaryotic ribosomal proteins can be grouped on the basis of sequence similarities []. One of these families consists of Xenopus S8, and mammalian, insect and yeast S7. These proteins have about 200 amino acids.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 2XZN_3 2XZM_3 3U5G_H 3U5C_H.
Probab=100.00 E-value=1.3e-96 Score=617.55 Aligned_cols=185 Identities=62% Similarity=0.971 Sum_probs=154.6
Q ss_pred cccccCCCCCCCHHHHHHHHHhHhhhcCChHHHhcCcceeeeeeEEEEecCCeeEEEEEeehhhHHHHHHHHHHHHHHHH
Q 029596 6 KKIQKDKDAEPTEFEETVAQALFDLENTNQELKSDLKDLYINQAIQMDVSGNRKAIVVYVPYRLRKAYRKIHSRLVRELE 85 (191)
Q Consensus 6 ~Ki~k~~~~~p~e~E~~Vaqal~dLE~~~~dLK~~L~~l~i~~akei~v~~~kkAivIfVP~~~lk~f~Kiq~rLv~ELE 85 (191)
+||+|++|++|||||.+|||||+|||++++|||++|++|+|++||||||++|+||||||||||||++|||||.||++|||
T Consensus 1 ~Ki~K~~~~~p~e~E~~Vaqal~dle~~~~dLK~~Lr~L~i~~aKEi~v~~~kKAivIfVP~~~lk~f~KIq~rLv~ELE 80 (189)
T PF01251_consen 1 KKIVKPKGKKPDEFEESVAQALLDLEMNSSDLKAQLRELYITSAKEIEVGGGKKAIVIFVPVPQLKAFQKIQVRLVRELE 80 (189)
T ss_dssp ---SS---SS--CHHHHHHHHHHHHCHCHCHHCCCCCC--ECEEEEEEECTCEEEEEEEE-CCCCHHHHHHCHHHHHHHH
T ss_pred CCccccCCCCCCHHHHHHHHHHHHHHcCcHHHHhhccccEEEEEEEEEECCCcEEEEEEEcHHHHHHHHHHHHHHHHHHH
Confidence 48999999999999999999999999877799999999999999999999999999999999999999999999999999
Q ss_pred hhcCCCeEEEEeeecccCCCCCC----CccccCCCcchhhhhHhhhhcccccceeeceeEEEeeCCceEEEEEeCccccc
Q 029596 86 KKFSGKDVVLIATRRIVRPPKKG----SAVQRPRSRTLTAVHDAMLEDVVYPAEIVGKRVRYRLDGSKIIKIFLDPKERN 161 (191)
Q Consensus 86 KKfsg~~Vv~vAqRrIl~kp~~~----~~q~RPRSRTLTaVhdaiLeDlV~P~eIVGKR~r~r~DGsk~~KV~LD~kd~~ 161 (191)
|||||+||||||||||+|||+++ .+|+|||||||||||||||||||||+|||||||||++|||+++|||||++|||
T Consensus 81 KKfsgk~Vv~iAqRrIl~kp~r~~~~~~~qkrPRSRTLTaVhdaILeDLV~PseIVGKRir~rlDGskl~KV~LD~k~~~ 160 (189)
T PF01251_consen 81 KKFSGKHVVFIAQRRILPKPTRKSRQKQKQKRPRSRTLTAVHDAILEDLVYPSEIVGKRIRVRLDGSKLIKVHLDKKDQN 160 (189)
T ss_dssp HCTTTCEEEEEE------SS-SSS---TTS---CCCSHHHHHHHHHHHHTTTS-ECEEEEEE-TTS-EEEEEEEECCCCH
T ss_pred hhcCCCeEEEeccceEcCCCCcCccccccccCcCCcchHHHHHHHHHhhccHHHhheeeEEEecCCCEEEEEEEChHHcc
Confidence 99999999999999999999877 47999999999999999999999999999999999999999999999999999
Q ss_pred chhhhhhhHHHHHhhhcCCceEEECCCCC
Q 029596 162 NTEYKLESFSGVYRKLTGKDVVFDYPITD 190 (191)
Q Consensus 162 ~ve~Kldtfs~VYkkLTgk~v~FeFp~~~ 190 (191)
++|||||||++|||+||||||+||||+++
T Consensus 161 ~ve~Kl~tfs~VYkkLTgK~v~FeFp~~~ 189 (189)
T PF01251_consen 161 NVEHKLDTFSAVYKKLTGKDVVFEFPEQE 189 (189)
T ss_dssp HHHCCHHHHHHHHHHHCS-EEEEEEE---
T ss_pred cHHHHHHHHHHHHHHHcCCceEEEcCCCC
Confidence 99999999999999999999999999864
No 2
>KOG3320 consensus 40S ribosomal protein S7 [Translation, ribosomal structure and biogenesis]
Probab=100.00 E-value=1.5e-92 Score=590.12 Aligned_cols=190 Identities=59% Similarity=0.928 Sum_probs=187.2
Q ss_pred CCccccccccCCCCCCCHHHHHHHHHhHhhhcCChHHHhcCcceeeeeeEEEEecCCeeEEEEEeehhhHHHHHHHHHHH
Q 029596 1 MFTTKKKIQKDKDAEPTEFEETVAQALFDLENTNQELKSDLKDLYINQAIQMDVSGNRKAIVVYVPYRLRKAYRKIHSRL 80 (191)
Q Consensus 1 m~~~~~Ki~k~~~~~p~e~E~~Vaqal~dLE~~~~dLK~~L~~l~i~~akei~v~~~kkAivIfVP~~~lk~f~Kiq~rL 80 (191)
||++.+||+|++|+.|||||.+|||||+|||++|+|||++|++|||++|+||||+||+||||||||+|+|++|||||.||
T Consensus 1 m~s~~~Ki~k~~~~~ptE~E~~iaqal~~le~~n~~lk~~lr~L~I~~a~eiev~Gg~Kaivi~VP~p~lk~fqki~~~L 80 (192)
T KOG3320|consen 1 MFSSQAKIHKPSGSKPTEFEMQIAQALLDLEMDNSDLKAQLRELNITSAKEIEVGGGRKAIVIFVPVPQLKAFQKIQVRL 80 (192)
T ss_pred CccccccccCCCCCCchHHHHHHHHHHHHHHhcchhhHHHhhhheeeeeEEEEecCCcEEEEEEechHHHHHHHHHHHHH
Confidence 89999999999999999999999999999999899999999999999999999999999999999999999999999999
Q ss_pred HHHHHhhcCCCeEEEEeeecccCCCCCCCc-cccCCCcchhhhhHhhhhcccccceeeceeEEEeeCCceEEEEEeCccc
Q 029596 81 VRELEKKFSGKDVVLIATRRIVRPPKKGSA-VQRPRSRTLTAVHDAMLEDVVYPAEIVGKRVRYRLDGSKIIKIFLDPKE 159 (191)
Q Consensus 81 v~ELEKKfsg~~Vv~vAqRrIl~kp~~~~~-q~RPRSRTLTaVhdaiLeDlV~P~eIVGKR~r~r~DGsk~~KV~LD~kd 159 (191)
+|||||||||+||+|||+|||+|+|++++. |||||||||||||||||||+|||+|||||||||++||++++|||||++|
T Consensus 81 vreleKKF~gk~Vifia~Rrilpkp~rks~~qKRprsrtltaVhdaiLed~vfP~eIvGkR~rv~ldg~ki~kV~LD~~~ 160 (192)
T KOG3320|consen 81 VRELEKKFSGKHVIFIAQRRILPKPTRKSRTQKRPRSRTLTAVHDAILEDLVFPAEIVGKRTRVKLDGSKLVKVHLDKKQ 160 (192)
T ss_pred HHHHHHhcCCceEEEEEeeeeccCCCCCcccccCCccchHHHHHHHHHHhccchhhhcceeEEEEecCcEEEEEEechhh
Confidence 999999999999999999999999999987 8999999999999999999999999999999999999999999999999
Q ss_pred ccchhhhhhhHHHHHhhhcCCceEEECCCCC
Q 029596 160 RNNTEYKLESFSGVYRKLTGKDVVFDYPITD 190 (191)
Q Consensus 160 ~~~ve~Kldtfs~VYkkLTgk~v~FeFp~~~ 190 (191)
+|++|||+|+|++||++||||||+||||...
T Consensus 161 ~n~~e~K~e~f~~vy~kLtGKdv~fEfp~~~ 191 (192)
T KOG3320|consen 161 QNNVEHKVETFSAVYKKLTGKDVVFEFPEFT 191 (192)
T ss_pred ccchHHhHHHHHHHHHHhcCCceEEecCccc
Confidence 9999999999999999999999999999764
No 3
>PTZ00389 40S ribosomal protein S7; Provisional
Probab=100.00 E-value=1.3e-89 Score=573.66 Aligned_cols=182 Identities=57% Similarity=0.936 Sum_probs=177.6
Q ss_pred cccCCCCCCCHHHHHHHHHhHhhhcCChHHHhcCcceeeeeeEEEEecCC-eeEEEEEeehhhHHHHHHHHHHHHHHHHh
Q 029596 8 IQKDKDAEPTEFEETVAQALFDLENTNQELKSDLKDLYINQAIQMDVSGN-RKAIVVYVPYRLRKAYRKIHSRLVRELEK 86 (191)
Q Consensus 8 i~k~~~~~p~e~E~~Vaqal~dLE~~~~dLK~~L~~l~i~~akei~v~~~-kkAivIfVP~~~lk~f~Kiq~rLv~ELEK 86 (191)
+.|++|++|||||.+|||||+|||++|+|||++|++|||++||||||++| ||||||||||||+++|||||.||++||||
T Consensus 1 ~~k~~~~~p~e~E~~vaqal~~le~~~~dlK~~L~~L~i~~akei~v~~~~kkaivIfVP~~~lk~~~kiq~rLv~ELEK 80 (184)
T PTZ00389 1 IKKLKKAEPSELEKQVAKALFELEASSKDLKADLKNLKISSVKEVTVGKDKKKAVVVFVPYRMLMIYRKIQRKLIPELEK 80 (184)
T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHhCcHHHHhhhhccEEeeEEEEEecCCCcEEEEEEecHHHHHHHHHHHHHHHHHHHH
Confidence 46889999999999999999999999999999999999999999999877 99999999999999999999999999999
Q ss_pred hcCCCeEEEEeeecccCCCCCCCc-cccCCCcchhhhhHhhhhcccccceeeceeEEEeeCCceEEEEEeCcccccchhh
Q 029596 87 KFSGKDVVLIATRRIVRPPKKGSA-VQRPRSRTLTAVHDAMLEDVVYPAEIVGKRVRYRLDGSKIIKIFLDPKERNNTEY 165 (191)
Q Consensus 87 Kfsg~~Vv~vAqRrIl~kp~~~~~-q~RPRSRTLTaVhdaiLeDlV~P~eIVGKR~r~r~DGsk~~KV~LD~kd~~~ve~ 165 (191)
|| |+||+|||||||+|+|+++++ |+|||||||||||||||+|||||+|||||||||++|||+++||+||++|++++||
T Consensus 81 K~-g~~Vv~ia~RrIl~kp~r~~~~q~rPrSRTLTaVhdaiLeDLvyPaeIvGkRir~~~DGsk~~KV~Ld~~d~~~ve~ 159 (184)
T PTZ00389 81 KL-KKHVVIVAQRTILKKPVKNYKLKTRPRSRTLTAVHEAILEDLVYPSEIVGKRTRVRVDGSKLLKVFLDPKDRKNVEE 159 (184)
T ss_pred Hh-CCeEEEEEEEEEcCCCCcCccccCCCCccchHHHHHHHHHHhccchheeeeEEEEecCCcEEEEEEeCHHHhcccch
Confidence 99 999999999999999999886 9999999999999999999999999999999999999999999999999999999
Q ss_pred hhhhHHHHHhhhcCCceEEECCCCC
Q 029596 166 KLESFSGVYRKLTGKDVVFDYPITD 190 (191)
Q Consensus 166 Kldtfs~VYkkLTgk~v~FeFp~~~ 190 (191)
|+|+|++||++|||+||+||||+++
T Consensus 160 Kletf~~VykkLTgkdV~fefp~~~ 184 (184)
T PTZ00389 160 KLDAFSAVYKKLTGRDVVFEFPWDP 184 (184)
T ss_pred hHHHHHHHHHHHhCCCeEEEecCCC
Confidence 9999999999999999999999864
No 4
>PRK06418 transcription elongation factor NusA-like protein; Validated
Probab=99.79 E-value=8.8e-19 Score=145.12 Aligned_cols=138 Identities=23% Similarity=0.390 Sum_probs=108.5
Q ss_pred CCCHHHHHHHHHhHhhhcCChHHHhcCcceeeeeeEEEEecCCeeEEEEEe-ehhhHHHHHHHHHHHHHHHHhhcCCCeE
Q 029596 15 EPTEFEETVAQALFDLENTNQELKSDLKDLYINQAIQMDVSGNRKAIVVYV-PYRLRKAYRKIHSRLVRELEKKFSGKDV 93 (191)
Q Consensus 15 ~p~e~E~~Vaqal~dLE~~~~dLK~~L~~l~i~~akei~v~~~kkAivIfV-P~~~lk~f~Kiq~rLv~ELEKKfsg~~V 93 (191)
.-+++|-.|+.+|++||.+ ..|++..+-.|-++ ++ + |++.| |=. =.+..| -...++.|++++ ||+|
T Consensus 27 ~v~~~dv~i~~~l~~l~~~-----~~l~~~~~~k~~~~--dd-r--vIfvV~~gd-g~aIGk-~G~~ik~l~~~l-gk~V 93 (166)
T PRK06418 27 EVTELDVEVSKVLLKLEED-----KELKDVEYKKAYEV--DD-L--VILLVTSGP-RIPIGK-GGKIAKALSRKL-GKKV 93 (166)
T ss_pred ceEEeehHHHHHHHHhhcc-----ccccCceEEEEEEe--CC-E--EEEEEeCCC-cccccc-cchHHHHHHHHh-CCcE
Confidence 5789999999999999833 34566666555554 32 2 22222 222 223333 467888899999 8898
Q ss_pred EEEeeecccCCCCCCCccccCCCcchhhhhHhhhhcccccceeeceeEEEeeCCceEEEEEeCcccccchhhhhhhHHHH
Q 029596 94 VLIATRRIVRPPKKGSAVQRPRSRTLTAVHDAMLEDVVYPAEIVGKRVRYRLDGSKIIKIFLDPKERNNTEYKLESFSGV 173 (191)
Q Consensus 94 v~vAqRrIl~kp~~~~~q~RPRSRTLTaVhdaiLeDlV~P~eIVGKR~r~r~DGsk~~KV~LD~kd~~~ve~Kldtfs~V 173 (191)
=+|-- |-+ -+.+|+||+||++|+|+|++++.||+..+||++|++|++++++|+++|++|
T Consensus 94 evVE~-----------------s~d----~~~fl~Nl~~PA~V~gV~i~~~~dG~~~~kV~Vd~~Dk~~l~~k~e~~~~v 152 (166)
T PRK06418 94 RVVEK-----------------TND----IKKLAVQLLSPARVLGVNTVWLPDGTVQYVIRVSRRDRRRLPAKPELLESI 152 (166)
T ss_pred EEEEc-----------------CCC----HHHHHHhcCCCcEEEEEEEEEeCCCcEEEEEEECHHHhhcccccHHHHHHH
Confidence 77751 111 467999999999999999999999999999999999999999999999999
Q ss_pred HhhhcCCceEEEC
Q 029596 174 YRKLTGKDVVFDY 186 (191)
Q Consensus 174 YkkLTgk~v~FeF 186 (191)
|++|||++|.|+|
T Consensus 153 ~~kltgk~v~~~f 165 (166)
T PRK06418 153 LSKITGTEVKIRV 165 (166)
T ss_pred HHHHHCCcEEEEe
Confidence 9999999999998
No 5
>TIGR01618 phage_P_loop phage nucleotide-binding protein. This model represents an uncharacterized family of proteins from a number of phage of Gram-positive bacteria. This protein contains a P-loop motif, G/A-X-X-G-X-G-K-T near its amino end. The function of this protein is unknown.
Probab=82.12 E-value=8.1 Score=33.37 Aligned_cols=93 Identities=19% Similarity=0.286 Sum_probs=60.7
Q ss_pred hhHHHHHHHHHHHH---HHHHhhcCCCeEEEEeeecccCCCCCCCccccCCCcchhhhhHhhhhcccccceeeceeEEEe
Q 029596 68 RLRKAYRKIHSRLV---RELEKKFSGKDVVLIATRRIVRPPKKGSAVQRPRSRTLTAVHDAMLEDVVYPAEIVGKRVRYR 144 (191)
Q Consensus 68 ~~lk~f~Kiq~rLv---~ELEKKfsg~~Vv~vAqRrIl~kp~~~~~q~RPRSRTLTaVhdaiLeDlV~P~eIVGKR~r~r 144 (191)
+.+..|.+.+.+++ ..| +- .|++|+|+|.+.+-..+ - .--.+.+|=....++.+.+.+.--+++|| |+.+.
T Consensus 110 ~~~~~yg~~~~~fl~~l~~L-~~-~g~nII~tAhe~~~~~~-d--e~G~~~~r~~P~i~~K~~n~l~G~~DvV~-rl~i~ 183 (220)
T TIGR01618 110 PELQHYQKLDLWFLDLLTVL-KE-SNKNIYATAWELTNQSS-G--ESGQIYNRYQPDIREKVLNAFLGLTDVVG-RIVLN 183 (220)
T ss_pred cccccHHHHHHHHHHHHHHH-Hh-CCCcEEEEEeecccccc-C--CCCCCcceechhhhhhHHHhhcccccEEE-EEEEc
Confidence 35678888876654 555 23 59999999999642111 0 11224455666778888889999999999 55554
Q ss_pred e-CCceEEEEEeCcccccchhhhhh
Q 029596 145 L-DGSKIIKIFLDPKERNNTEYKLE 168 (191)
Q Consensus 145 ~-DGsk~~KV~LD~kd~~~ve~Kld 168 (191)
- +|.|.+ .+++.+...-.++||
T Consensus 184 ~~~g~R~~--~~~~~~~~~AKNrld 206 (220)
T TIGR01618 184 GETGERGF--ILDPSKGNYAKNRLD 206 (220)
T ss_pred cCCCceEE--EECCCCCcccccccc
Confidence 4 567655 567766655555554
No 6
>PF01883 DUF59: Domain of unknown function DUF59; InterPro: IPR002744 This family includes prokaryotic proteins of unknown function. The family also includes PhaH (O84984 from SWISSPROT) from Pseudomonas putida. PhaH forms a complex with PhaF (O84982 from SWISSPROT), PhaG (O84983 from SWISSPROT) and PhaI (O84985 from SWISSPROT), which hydroxylates phenylacetic acid to 2-hydroxyphenylacetic acid []. So members of this family may all be components of ring hydroxylating complexes.; PDB: 3LNO_C 3CQ3_A 3CQ2_D 2CU6_B 3CQ1_A 3UX3_B 3UX2_A 1WCJ_A 1UWD_A.
Probab=67.58 E-value=19 Score=24.80 Aligned_cols=66 Identities=18% Similarity=0.358 Sum_probs=46.8
Q ss_pred HHHHHHHhHhhhcCChHHHhcCcceeeeeeEEEEecCCeeEEEEEeehhhHHHHHHHHHHHHHHHHhhcCC
Q 029596 20 EETVAQALFDLENTNQELKSDLKDLYINQAIQMDVSGNRKAIVVYVPYRLRKAYRKIHSRLVRELEKKFSG 90 (191)
Q Consensus 20 E~~Vaqal~dLE~~~~dLK~~L~~l~i~~akei~v~~~kkAivIfVP~~~lk~f~Kiq~rLv~ELEKKfsg 90 (191)
|++|-+||-.+. .|++..+|-++.+ .+++++.+|+=.+.+.+|+|-...+..++..+..-|. .++|
T Consensus 1 k~~V~~aL~~v~--dP~~~~~iv~~g~--V~~i~i~~~~V~v~l~l~~~~~~~~~~l~~~i~~~l~-~l~g 66 (72)
T PF01883_consen 1 KQAVRDALKQVK--DPELGKDIVELGM--VRDISIEGGKVSVSLELPTPACPAAEPLREEIREALK-ALPG 66 (72)
T ss_dssp HHHHHHHHTT-B--ETTTSSBTTTTTS--EEEEEECTCEEEEEE--SSTTHTTHHHHHHHHHHHHH-TSTT
T ss_pred CHHHHHHHhCCC--CCCCCCCHHHcCC--eeEEEEECCEEEEEEEECCCCchHHHHHHHHHHHHHH-hCCC
Confidence 456667776666 2555555555544 6789999999999999999999888888888888777 5555
No 7
>TIGR02945 SUF_assoc FeS assembly SUF system protein. Members of this family belong to the broader Pfam family pfam01883, or Domain of Unknown Function DUF59. Many members of DUF59 are candidate ring hydroxylating complex subunits. However, members of the narrower family defined here all are found in genomes that carry the FeS assembly SUF system. For 70 % of these species, the member of this protein family is found as part of the SUF locus, usually immediately downstream of the sufS gene.
Probab=62.48 E-value=25 Score=25.78 Aligned_cols=69 Identities=13% Similarity=0.194 Sum_probs=51.8
Q ss_pred HHHHHHHHhHhhhcCChHHHhcCcceeeeeeEEEEecC-CeeEEEEEeehhhHHHHHHHHHHHHHHHHhhcCCCe
Q 029596 19 FEETVAQALFDLENTNQELKSDLKDLYINQAIQMDVSG-NRKAIVVYVPYRLRKAYRKIHSRLVRELEKKFSGKD 92 (191)
Q Consensus 19 ~E~~Vaqal~dLE~~~~dLK~~L~~l~i~~akei~v~~-~kkAivIfVP~~~lk~f~Kiq~rLv~ELEKKfsg~~ 92 (191)
.+.+|.++|-.+. .+++...|-++.. ++.+.+.+ ++--|.|..|+|.......++..+...|+. ++|-.
T Consensus 2 ~~~~I~~~L~~v~--dP~l~~~lv~~g~--V~~i~v~~~~~v~i~l~l~~p~~~~~~~l~~~i~~al~~-l~gv~ 71 (99)
T TIGR02945 2 LKDAVIEALKTVY--DPEIPVNIYELGL--IYDIDVDDDGHVDIQMTLTAPNCPVAGSMPGEVENAVRA-VPGVG 71 (99)
T ss_pred HHHHHHHHHcCCC--CCCCCCCeecCCC--eeEEEECCCCeEEEEEEECCCCCChHHHHHHHHHHHHHh-CCCCc
Confidence 4677888888877 3666666665544 47788885 888999999999888888888888888865 44544
No 8
>PF01577 Peptidase_S30: Potyvirus P1 protease; InterPro: IPR002540 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. The potyviridae are a family of positive strand RNA viruses, members of which include Zucchini yellow mosaic virus, and Turnip mosaic virus (strain Japanese) which cause considerable losses of crops worldwide. This entry represents a C-terminal region from various plant potyvirus P1 proteins (found at the N terminus of the polyprotein). The C terminus of P1 is a serine peptidase belonging to MEROPS peptidase family S30 (clan PA(S)). It is the protease responsible for autocatalytic cleavage between P1 and the helper component protease, which is a cysteine peptidase belonging to MEROPS peptidase family C6 IPR001456 from INTERPRO [, ]. The P1 protein may be involved in virus-host interactions [].; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis
Probab=53.78 E-value=17 Score=30.65 Aligned_cols=40 Identities=30% Similarity=0.547 Sum_probs=27.1
Q ss_pred CcchhhhhHhhhhccc-------ccceeeceeE---E---EeeCCceEEEEEe
Q 029596 116 SRTLTAVHDAMLEDVV-------YPAEIVGKRV---R---YRLDGSKIIKIFL 155 (191)
Q Consensus 116 SRTLTaVhdaiLeDlV-------~P~eIVGKR~---r---~r~DGsk~~KV~L 155 (191)
.-++++-.+.++.+++ .|-||||||- + ++.+|+...+|.|
T Consensus 102 ~~~~~~~~~~l~~~v~~i~~~~~~~vEiIgKrk~~~~~~~~~~~~~~~~kv~~ 154 (245)
T PF01577_consen 102 KVKMSDTFDNLIRQVLKIAKKKGKPVEIIGKRKKRTRARYKRRGGKRYLKVET 154 (245)
T ss_pred ccccchhHHHHHHHHHHHHHhcCCeEEEEecCCceEEEEEEEECCEEEEEEEC
Confidence 3344442455555544 4999999974 2 5678999999997
No 9
>smart00174 RHO Rho (Ras homology) subfamily of Ras-like small GTPases. Members of this subfamily of Ras-like small GTPases include Cdc42 and Rac, as well as Rho isoforms.
Probab=53.18 E-value=44 Score=25.36 Aligned_cols=45 Identities=9% Similarity=0.267 Sum_probs=35.9
Q ss_pred eEEEEEeehhhHHHHHHHHHHHHHHHHhhcCCCeEEEEeeecccC
Q 029596 59 KAIVVYVPYRLRKAYRKIHSRLVRELEKKFSGKDVVLIATRRIVR 103 (191)
Q Consensus 59 kAivIfVP~~~lk~f~Kiq~rLv~ELEKKfsg~~Vv~vAqRrIl~ 103 (191)
-++|+.+-..-..+|..+...+..++.+..++.++++|+.-.=++
T Consensus 71 d~~ilv~d~~~~~s~~~~~~~~~~~i~~~~~~~piilv~nK~Dl~ 115 (174)
T smart00174 71 DVFLICFSVDSPASFENVKEKWYPEVKHFCPNTPIILVGTKLDLR 115 (174)
T ss_pred CEEEEEEECCCHHHHHHHHHHHHHHHHhhCCCCCEEEEecChhhh
Confidence 466666777778899999877888888888899999999875443
No 10
>PF02154 FliM: Flagellar motor switch protein FliM; InterPro: IPR001689 The flagellar motor switch in Escherichia coli and Salmonella typhimurium regulates the direction of flagellar rotation and hence controls swimming behaviour []. The switch is a complex apparatus that responds to signals transduced by the chemotaxis sensory signalling system during chemotactic behaviour []. CheY, the chemotaxis response regulator, is believed to act directly on the switch to induce tumbles in the swimming pattern, but no physical interactions of CheY and switch proteins have yet been demonstrated. The switch complex comprises at least three proteins - FliG, FliM and FliN. It has been shown that FliG interacts with FliM, FliM interacts with itself, and FliM interacts with FliN []. Several residues within the middle third of FliG appear to be strongly involved in the FliG-FliM interaction, with residues near the N or C termini being less important []. Such clustering suggests that FliG-FliM interaction plays a central role in switching. Analysis of the FliG, FliM and FliN sequences shows that none are especially hydrophobic or appear to be integral membrane proteins []. This result is consistent with other evidence suggesting that the proteins may be peripheral to the membrane, possibly mounted on the basal body M ring [, ].; GO: 0003774 motor activity, 0001539 ciliary or flagellar motility, 0006935 chemotaxis, 0009425 bacterial-type flagellum basal body; PDB: 3SOH_C 2HP7_A.
Probab=46.59 E-value=1.6e+02 Score=24.03 Aligned_cols=62 Identities=18% Similarity=0.353 Sum_probs=42.0
Q ss_pred CCCCCCHHHHHHHHHhHhh----------------------hcCChHHHhcC--cceeeeeeEEEEecCCeeEEEEEeeh
Q 029596 12 KDAEPTEFEETVAQALFDL----------------------ENTNQELKSDL--KDLYINQAIQMDVSGNRKAIVVYVPY 67 (191)
Q Consensus 12 ~~~~p~e~E~~Vaqal~dL----------------------E~~~~dLK~~L--~~l~i~~akei~v~~~kkAivIfVP~ 67 (191)
.+.+.|++|..+.+-+++. | +|+++.+-. .+..+...=++++++..-.+-|.+|+
T Consensus 104 ~~R~~T~iE~~i~~~v~~~~~~~l~~aw~~v~~~~~~~~~~E-~np~~~~i~~~~e~vv~~~f~i~i~~~~g~~~i~~P~ 182 (192)
T PF02154_consen 104 EGREFTEIEQRILRRVVERILEALREAWQPVVPLEFELERIE-TNPQFVQIVPPNEPVVVITFEIKIGGREGMMNICIPY 182 (192)
T ss_dssp -SS---HHHHHHHHHHHHHHHHHHHHHCTTTH---EEEEEEE-SSGGGT-SS-TTSEEEEEEEEEEETTEEEEEEEEEEH
T ss_pred ccccCcHHHHHHHHHHHHHHHHHHHHHHhhceeeeeEeeeEe-cCHHHHcccCCCCeEEEEEEEEEECCcEEEEEEEecH
Confidence 4668999999988755542 3 355554433 56777777899999888899999999
Q ss_pred hhHHHHH
Q 029596 68 RLRKAYR 74 (191)
Q Consensus 68 ~~lk~f~ 74 (191)
..+...+
T Consensus 183 ~~lepi~ 189 (192)
T PF02154_consen 183 STLEPIR 189 (192)
T ss_dssp HHHHHHH
T ss_pred HHHHHHH
Confidence 8876654
No 11
>cd00157 Rho Rho (Ras homology) family. Members of the Rho family include RhoA, Cdc42, Rac, Rnd, Wrch1, RhoBTB, and Rop. There are 22 human Rho family members identified currently. These proteins are all involved in the reorganization of the actin cytoskeleton in response to external stimuli. They also have roles in cell transformation by Ras in cytokinesis, in focal adhesion formation and in the stimulation of stress-activated kinase. These various functions are controlled through distinct effector proteins and mediated through a GTP-binding/GTPase cycle involving three classes of regulating proteins: GAPs (GTPase-activating proteins), GEFs (guanine nucleotide exchange factors), and GDIs (guanine nucleotide dissociation inhibitors). Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Rho protein
Probab=39.58 E-value=99 Score=23.10 Aligned_cols=45 Identities=9% Similarity=0.307 Sum_probs=35.8
Q ss_pred eEEEEEeehhhHHHHHHHHHHHHHHHHhhcCCCeEEEEeeecccC
Q 029596 59 KAIVVYVPYRLRKAYRKIHSRLVRELEKKFSGKDVVLIATRRIVR 103 (191)
Q Consensus 59 kAivIfVP~~~lk~f~Kiq~rLv~ELEKKfsg~~Vv~vAqRrIl~ 103 (191)
.++++.+.......|......+..++....++.++++|+...=++
T Consensus 73 ~~~i~v~d~~~~~s~~~~~~~~~~~~~~~~~~~p~ivv~nK~Dl~ 117 (171)
T cd00157 73 DVFLICFSVDSPSSFENVKTKWIPEIRHYCPNVPIILVGTKIDLR 117 (171)
T ss_pred CEEEEEEECCCHHHHHHHHHHHHHHHHhhCCCCCEEEEEccHHhh
Confidence 477777777778888888877888888887889999999875443
No 12
>PF01383 CpcD: CpcD/allophycocyanin linker domain; InterPro: IPR008213 Ferredoxin-NADP(+) oxydoreductase (FNR) (EC=1.18.1.2) transfers electrons from ferredoxin (or flavodoxin) to NADP(+) to generate NADPH. In eucaryotes, the nuclear-encoded, chloroplast-targeted enzyme contains two domains: an FAD-binding domain (see PDOC51384 from PROSITEDOC) and an NADP(+)-binding domain. With the exception of Gloeobacter violaceus PCC 7421, the predicted sequences of all cyanobacterial petH genes, encoding FNR, correspond to a protein containing three domains. Two domains at the C terminus correspond to the FAD- and NADP(+)-binding domains of higher plants FNR protein, which compose the catalytic domains of the enzyme. The N-terminal domain is similar to phycobilisome (PBS)-associated linker proteins from numerous cyanobacteria [, , ] and is associated with: - CpcD, the phycocyanin (PC)-associated, rod-capping, linker polypeptide of PBS. The similarity spans nearly the entire sequence of this linker class. - CpcC, the PC-associated rod linker polypeptide. The similarity is confined only to the C terminus of this linker class. - ApcC, the allophycocyanin (APC)-associated, core linker polypeptide. The similarity only correspond to about half of the molecule. The CpcD-like domain has an elongated shape and consists of a three-stranded beta-sheet, two alpha-helices, one of which has only about one turn, and the connecting random coil segments [].; GO: 0030089 phycobilisome; PDB: 1B33_O.
Probab=35.04 E-value=45 Score=23.10 Aligned_cols=21 Identities=24% Similarity=0.409 Sum_probs=15.6
Q ss_pred eeEEEEEeehhhH-HHHHHHHH
Q 029596 58 RKAIVVYVPYRLR-KAYRKIHS 78 (191)
Q Consensus 58 kkAivIfVP~~~l-k~f~Kiq~ 78 (191)
+..-..+|||.+| ..+|.||.
T Consensus 24 rs~~~~~Vpy~~ls~~~q~I~r 45 (56)
T PF01383_consen 24 RSNQTYVVPYSQLSQEMQRINR 45 (56)
T ss_dssp HHEEEEEEEHHHHHHHHHHHHH
T ss_pred eeeEEEEEcHHHhHHHHHHHHH
Confidence 4556667999999 66688774
No 13
>PF02374 ArsA_ATPase: Anion-transporting ATPase; PDB: 2WOO_A 3IBG_B 3SJA_A 3H84_B 3SJD_A 3ZS9_A 3A37_A 2WOJ_A 3SJC_B 3A36_B ....
Probab=33.94 E-value=70 Score=28.55 Aligned_cols=32 Identities=25% Similarity=0.316 Sum_probs=27.2
Q ss_pred hhHHHHHHHHHHHHHHHHhhcCCCeEEEEeee
Q 029596 68 RLRKAYRKIHSRLVRELEKKFSGKDVVLIATR 99 (191)
Q Consensus 68 ~~lk~f~Kiq~rLv~ELEKKfsg~~Vv~vAqR 99 (191)
+.+.+.++.|.+.+.++++.|+|.+|+-+-..
T Consensus 255 ~~~~~r~~~Q~~~l~~i~~~f~~~~v~~vp~~ 286 (305)
T PF02374_consen 255 PFCAARRKEQQKYLAEIEESFPDLPVVKVPLL 286 (305)
T ss_dssp HHHHHHHHHHHHHHHHHHHHTTTSEEEEEE--
T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCEEEecCC
Confidence 56799999999999999999999998877544
No 14
>PF00071 Ras: Ras family; InterPro: IPR001806 Small GTPases form an independent superfamily within the larger class of regulatory GTP hydrolases. This superfamily contains proteins that control a vast number of important processes and possess a common, structurally preserved GTP-binding domain [, ]. Sequence comparisons of small G proteins from various species have revealed that they are conserved in primary structures at the level of 30-55% similarity []. Crystallographic analysis of various small G proteins revealed the presence of a 20 kDa catalytic domain that is unique for the whole superfamily [, ]. The domain is built of five alpha helices (A1-A5), six beta-strands (B1-B6) and five polypeptide loops (G1-G5). A structural comparison of the GTP- and GDP-bound form, allows one to distinguish two functional loop regions: switch I and switch II that surround the gamma-phosphate group of the nucleotide. The G1 loop (also called the P-loop) that connects the B1 strand and the A1 helix is responsible for the binding of the phosphate groups. The G3 loop provides residues for Mg(2+) and phosphate binding and is located at the N terminus of the A2 helix. The G1 and G3 loops are sequentially similar to Walker A and Walker B boxes that are found in other nucleotide binding motifs. The G2 loop connects the A1 helix and the B2 strand and contains a conserved Thr residue responsible for Mg(2+) binding. The guanine base is recognised by the G4 and G5 loops. The consensus sequence NKXD of the G4 loop contains Lys and Asp residues directly interacting with the nucleotide. Part of the G5 loop located between B6 and A5 acts as a recognition site for the guanine base []. The small GTPase superfamily can be divided into at least 8 different families, including: Arf small GTPases. GTP-binding proteins involved in protein trafficking by modulating vesicle budding and uncoating within the Golgi apparatus. Ran small GTPases. GTP-binding proteins involved in nucleocytoplasmic transport. Required for the import of proteins into the nucleus and also for RNA export. Rab small GTPases. GTP-binding proteins involved in vesicular traffic. Rho small GTPases. GTP-binding proteins that control cytoskeleton reorganisation. Ras small GTPases. GTP-binding proteins involved in signalling pathways. Sar1 small GTPases. Small GTPase component of the coat protein complex II (COPII) which promotes the formation of transport vesicles from the endoplasmic reticulum (ER). Mitochondrial Rho (Miro). Small GTPase domain found in mitochondrial proteins involved in mitochondrial trafficking. Roc small GTPases domain. Small GTPase domain always found associated with the COR domain. ; GO: 0005525 GTP binding, 0007264 small GTPase mediated signal transduction; PDB: 1M7B_A 2V55_B 3EG5_C 3LAW_A 1YHN_A 1T91_B 1HE8_B 3SEA_B 3T5G_A 1XTS_A ....
Probab=31.15 E-value=1.6e+02 Score=21.89 Aligned_cols=42 Identities=10% Similarity=0.296 Sum_probs=34.7
Q ss_pred eEEEEEeehhhHHHHHHHHHHHHHHHHhhcC-CCeEEEEeeecc
Q 029596 59 KAIVVYVPYRLRKAYRKIHSRLVRELEKKFS-GKDVVLIATRRI 101 (191)
Q Consensus 59 kAivIfVP~~~lk~f~Kiq~rLv~ELEKKfs-g~~Vv~vAqRrI 101 (191)
.+++|..-+....+|+.++ .+..++.+..+ +.++++++...=
T Consensus 73 ~~~ii~fd~~~~~S~~~~~-~~~~~i~~~~~~~~~iivvg~K~D 115 (162)
T PF00071_consen 73 DAIIIVFDVTDEESFENLK-KWLEEIQKYKPEDIPIIVVGNKSD 115 (162)
T ss_dssp SEEEEEEETTBHHHHHTHH-HHHHHHHHHSTTTSEEEEEEETTT
T ss_pred ccccccccccccccccccc-cccccccccccccccceeeecccc
Confidence 5778888888899999999 77788988887 688999987653
No 15
>cd01870 RhoA_like RhoA-like subfamily. The RhoA subfamily consists of RhoA, RhoB, and RhoC. RhoA promotes the formation of stress fibers and focal adhesions, regulating cell shape, attachment, and motility. RhoA can bind to multiple effector proteins, thereby triggering different downstream responses. In many cell types, RhoA mediates local assembly of the contractile ring, which is necessary for cytokinesis. RhoA is vital for muscle contraction; in vascular smooth muscle cells, RhoA plays a key role in cell contraction, differentiation, migration, and proliferation. RhoA activities appear to be elaborately regulated in a time- and space-dependent manner to control cytoskeletal changes. Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Rho proteins. RhoA and RhoC are observed only in geranyl
Probab=30.01 E-value=1.8e+02 Score=22.04 Aligned_cols=43 Identities=9% Similarity=0.232 Sum_probs=31.8
Q ss_pred EEEEEeehhhHHHHHHHHHHHHHHHHhhcCCCeEEEEeeeccc
Q 029596 60 AIVVYVPYRLRKAYRKIHSRLVRELEKKFSGKDVVLIATRRIV 102 (191)
Q Consensus 60 AivIfVP~~~lk~f~Kiq~rLv~ELEKKfsg~~Vv~vAqRrIl 102 (191)
++++.+.+....+|..+...+..++.+..++.++++++...=+
T Consensus 75 ~~i~v~~~~~~~s~~~~~~~~~~~~~~~~~~~piilv~nK~Dl 117 (175)
T cd01870 75 VILMCFSIDSPDSLENIPEKWTPEVKHFCPNVPIILVGNKKDL 117 (175)
T ss_pred EEEEEEECCCHHHHHHHHHHHHHHHHhhCCCCCEEEEeeChhc
Confidence 5555566666678888877778888777778899999886544
No 16
>TIGR02110 PQQ_syn_pqqF coenzyme PQQ biosynthesis probable peptidase PqqF. In a subset of species that make coenzyme PQQ (pyrrolo-quinoline-quinone), this probable peptidase is found in the PQQ biosynthesis region and is thought to act as a protease on PqqA (TIGR02107), a probable peptide precursor of the coenzyme. PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases.
Probab=29.81 E-value=1e+02 Score=31.22 Aligned_cols=49 Identities=29% Similarity=0.548 Sum_probs=37.5
Q ss_pred EEEEecCCeeEEEEEeehh--h---HHHHH--------HHHHHHHHHHHhhcCCCeEEEEeeeccc
Q 029596 50 IQMDVSGNRKAIVVYVPYR--L---RKAYR--------KIHSRLVRELEKKFSGKDVVLIATRRIV 102 (191)
Q Consensus 50 kei~v~~~kkAivIfVP~~--~---lk~f~--------Kiq~rLv~ELEKKfsg~~Vv~vAqRrIl 102 (191)
.++.+.|+-.|+++|.|.| . ..++| ...+||--|| .. | +|||.+-|++-
T Consensus 607 ~~~~~~~~e~alllf~p~~~~~~~~~aa~rlla~l~~~~f~qrlRve~--ql-G-Y~v~~~~~~~~ 668 (696)
T TIGR02110 607 VPLACDGGEQALLLFCPLPTADVASEAAWRLLAQLLEPPFFQRLRVEL--QL-G-YVVFCRYRRVA 668 (696)
T ss_pred EeccCCCCCcEEEEEecCCCCCHHHHHHHHHHHHHhchhHHHHHHHhh--cc-c-eEEEEeeEEcC
Confidence 5677788899999999999 3 45665 3357777666 44 5 99999999984
No 17
>cd04135 Tc10 TC10 subfamily. TC10 is a Rho family protein that has been shown to induce microspike formation and neurite outgrowth in vitro. Its expression changes dramatically after peripheral nerve injury, suggesting an important role in promoting axonal outgrowth and regeneration. TC10 regulates translocation of insulin-stimulated GLUT4 in adipocytes and has also been shown to bind directly to Golgi COPI coat proteins. GTP-bound TC10 in vitro can bind numerous potential effectors. Depending on its subcellular localization and distinct functional domains, TC10 can differentially regulate two types of filamentous actin in adipocytes. TC10 mRNAs are highly expressed in three types of mouse muscle tissues: leg skeletal muscle, cardiac muscle, and uterus; they were also present in brain, with higher levels in adults than in newborns. TC10 has also been shown to play a role in regulating the expression of cystic fibrosis transmembrane conductance regulator (CFTR) through interacti
Probab=29.30 E-value=1.9e+02 Score=21.86 Aligned_cols=41 Identities=12% Similarity=0.222 Sum_probs=30.7
Q ss_pred EEEEEeehhhHHHHHHHHHHHHHHHHhhcCCCeEEEEeeec
Q 029596 60 AIVVYVPYRLRKAYRKIHSRLVRELEKKFSGKDVVLIATRR 100 (191)
Q Consensus 60 AivIfVP~~~lk~f~Kiq~rLv~ELEKKfsg~~Vv~vAqRr 100 (191)
++++.+-..-..+|+.+...+.++|.+..++.++++++...
T Consensus 74 ~~ilv~~~~~~~s~~~~~~~~~~~l~~~~~~~piivv~nK~ 114 (174)
T cd04135 74 VFLICFSVVNPASFQNVKEEWVPELKEYAPNVPYLLVGTQI 114 (174)
T ss_pred EEEEEEECCCHHHHHHHHHHHHHHHHhhCCCCCEEEEeEch
Confidence 34444455556889999888888888777889999998764
No 18
>cd01893 Miro1 Miro1 subfamily. Miro (mitochondrial Rho) proteins have tandem GTP-binding domains separated by a linker region containing putative calcium-binding EF hand motifs. Genes encoding Miro-like proteins were found in several eukaryotic organisms. This CD represents the N-terminal GTPase domain of Miro proteins. These atypical Rho GTPases have roles in mitochondrial homeostasis and apoptosis. Most Rho proteins contain a lipid modification site at the C-terminus; however, Miro is one of few Rho subfamilies that lack this feature.
Probab=28.86 E-value=1.9e+02 Score=21.95 Aligned_cols=41 Identities=7% Similarity=0.169 Sum_probs=29.8
Q ss_pred EEEEEeehhhHHHHHHHHHHHHHHHHhhcCCCeEEEEeeec
Q 029596 60 AIVVYVPYRLRKAYRKIHSRLVRELEKKFSGKDVVLIATRR 100 (191)
Q Consensus 60 AivIfVP~~~lk~f~Kiq~rLv~ELEKKfsg~~Vv~vAqRr 100 (191)
++++.+......+|..+...+..+++....+.++++++...
T Consensus 73 ~~ilv~d~~~~~s~~~~~~~~~~~i~~~~~~~pviiv~nK~ 113 (166)
T cd01893 73 VICLVYSVDRPSTLERIRTKWLPLIRRLGVKVPIILVGNKS 113 (166)
T ss_pred EEEEEEECCCHHHHHHHHHHHHHHHHHhCCCCCEEEEEEch
Confidence 45555566667888887777777777655678999998764
No 19
>cd04130 Wrch_1 Wrch-1 subfamily. Wrch-1 (Wnt-1 responsive Cdc42 homolog) is a Rho family GTPase that shares significant sequence and functional similarity with Cdc42. Wrch-1 was first identified in mouse mammary epithelial cells, where its transcription is upregulated in Wnt-1 transformation. Wrch-1 contains N- and C-terminal extensions relative to cdc42, suggesting potential differences in cellular localization and function. The Wrch-1 N-terminal extension contains putative SH3 domain-binding motifs and has been shown to bind the SH3 domain-containing protein Grb2, which increases the level of active Wrch-1 in cells. Unlike Cdc42, which localizes to the cytosol and perinuclear membranes, Wrch-1 localizes extensively with the plasma membrane and endosomes. The membrane association, localization, and biological activity of Wrch-1 indicate an atypical model of regulation distinct from other Rho family GTPases. Most Rho proteins contain a lipid modification site at the C-terminus,
Probab=28.80 E-value=1.8e+02 Score=22.30 Aligned_cols=43 Identities=12% Similarity=0.276 Sum_probs=32.3
Q ss_pred EEEEEeehhhHHHHHHHHHHHHHHHHhhcCCCeEEEEeeeccc
Q 029596 60 AIVVYVPYRLRKAYRKIHSRLVRELEKKFSGKDVVLIATRRIV 102 (191)
Q Consensus 60 AivIfVP~~~lk~f~Kiq~rLv~ELEKKfsg~~Vv~vAqRrIl 102 (191)
++|+.+.+.-..+|+.+...+..++.+..++..+++++...=+
T Consensus 74 ~~i~v~d~~~~~sf~~~~~~~~~~~~~~~~~~piilv~nK~Dl 116 (173)
T cd04130 74 VFLLCFSVVNPSSFQNISEKWIPEIRKHNPKAPIILVGTQADL 116 (173)
T ss_pred EEEEEEECCCHHHHHHHHHHHHHHHHhhCCCCCEEEEeeChhh
Confidence 5555556666778988877777888877778899999987544
No 20
>PF08534 Redoxin: Redoxin; InterPro: IPR013740 This redoxin domain is found in peroxiredoxin, thioredoxin and glutaredoxin proteins. Peroxiredoxins (Prxs) constitute a family of thiol peroxidases that reduce hydrogen peroxide, peroxinitrite, and hydroperoxides using a strictly conserved cysteine []. Chloroplast thioredoxin systems in plants regulate the enzymes involved in photosynthetic carbon assimilation []. It is thought that redoxins have a large role to play in anti-oxidant defence. Cadmium-sensitive proteins are also regulated via thioredoxin and glutaredoxin thiol redox systems [].; GO: 0016491 oxidoreductase activity; PDB: 2H30_A 1TP9_A 1Y25_A 1XVQ_A 2B1K_A 2G0F_A 2B1L_B 3K8N_A 1Z5Y_E 3OR5_A ....
Probab=28.42 E-value=1.2e+02 Score=22.79 Aligned_cols=47 Identities=19% Similarity=0.295 Sum_probs=31.0
Q ss_pred EEEEecC--CeeEEEEEeehhhHHHHHHHHHHHHHHHHhhcCCCeEEEEe
Q 029596 50 IQMDVSG--NRKAIVVYVPYRLRKAYRKIHSRLVRELEKKFSGKDVVLIA 97 (191)
Q Consensus 50 kei~v~~--~kkAivIfVP~~~lk~f~Kiq~rLv~ELEKKfsg~~Vv~vA 97 (191)
+.+..+. ||..+|.|.+..-...+++-.. .+.+|-+++.++.|-+++
T Consensus 19 ~~~~l~~~~gk~~vv~f~~~~~Cp~C~~~~p-~l~~l~~~~~~~~v~~v~ 67 (146)
T PF08534_consen 19 KPVSLSDFKGKPVVVNFWASAWCPPCRKELP-YLNELQEKYKDKGVDVVG 67 (146)
T ss_dssp EEEEGGGGTTSEEEEEEESTTTSHHHHHHHH-HHHHHHHHHHTTTCEEEE
T ss_pred CEecHHHhCCCeEEEEEEccCCCCcchhhhh-hHHhhhhhhccCceEEEE
Confidence 4555544 7889999998755566666444 778887776666544443
No 21
>cd04134 Rho3 Rho3 subfamily. Rho3 is a member of the Rho family found only in fungi. Rho3 is believed to regulate cell polarity by interacting with the diaphanous/formin family protein For3 to control both the actin cytoskeleton and microtubules. Rho3 is also believed to have a direct role in exocytosis that is independent of its role in regulating actin polarity. The function in exocytosis may be two-pronged: first, in the transport of post-Golgi vesicles from the mother cell to the bud, mediated by myosin (Myo2); second, in the docking and fusion of vesicles to the plasma membrane, mediated by an exocyst (Exo70) protein. Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Rho proteins.
Probab=27.40 E-value=2e+02 Score=22.72 Aligned_cols=44 Identities=18% Similarity=0.339 Sum_probs=32.5
Q ss_pred EEEEEeehhhHHHHHHHHHHHHHHHHhhcCCCeEEEEeeecccC
Q 029596 60 AIVVYVPYRLRKAYRKIHSRLVRELEKKFSGKDVVLIATRRIVR 103 (191)
Q Consensus 60 AivIfVP~~~lk~f~Kiq~rLv~ELEKKfsg~~Vv~vAqRrIl~ 103 (191)
++|+.+-+....+|+.+....+.++....++.++++||...=|+
T Consensus 74 ~~ilv~dv~~~~sf~~~~~~~~~~i~~~~~~~piilvgNK~Dl~ 117 (189)
T cd04134 74 VIMLCFSVDSPDSLENVESKWLGEIREHCPGVKLVLVALKCDLR 117 (189)
T ss_pred EEEEEEECCCHHHHHHHHHHHHHHHHHhCCCCCEEEEEEChhhc
Confidence 55555556677899888765677887777788999999875554
No 22
>COG1868 FliM Flagellar motor switch protein [Cell motility and secretion]
Probab=27.35 E-value=5.1e+02 Score=24.10 Aligned_cols=148 Identities=18% Similarity=0.207 Sum_probs=95.6
Q ss_pred cccCCCCCCCHHHHHHHHHhHh----------------------hhcCChHHHhc--CcceeeeeeEEEEecCCeeEEEE
Q 029596 8 IQKDKDAEPTEFEETVAQALFD----------------------LENTNQELKSD--LKDLYINQAIQMDVSGNRKAIVV 63 (191)
Q Consensus 8 i~k~~~~~p~e~E~~Vaqal~d----------------------LE~~~~dLK~~--L~~l~i~~akei~v~~~kkAivI 63 (191)
..++.+-+.|++|..|++-++. .| +|+++... ..+..+...-+|++|+..--+=|
T Consensus 135 ~~~~~~R~lT~iE~~v~~~il~~i~~~l~eaw~~v~~~ep~~~~~e-~~p~~~~iv~pne~vv~i~~~i~ig~~~g~~ni 213 (332)
T COG1868 135 PAKPEGRELTDIEQRVITKLLERILEALKEAWNAVIELEPEIVRSE-TNPQFAQIVSPNEIVVLITLEVEIGNLSGMFNI 213 (332)
T ss_pred CCCcCCCCCcHHHHHHHHHHHHHHHHHHHHHhcccceeeecccccc-cChhhhcccCCCceEEEEEEEEEECCcceEEEE
Confidence 5566788999999999886553 23 45555543 35677778889999998889999
Q ss_pred EeehhhHHHHH------------HHHHHHHHHHHhhcCCCeEEEEeeecccCCCCCCCccccCCCcchhhhhHhhhhccc
Q 029596 64 YVPYRLRKAYR------------KIHSRLVRELEKKFSGKDVVLIATRRIVRPPKKGSAVQRPRSRTLTAVHDAMLEDVV 131 (191)
Q Consensus 64 fVP~~~lk~f~------------Kiq~rLv~ELEKKfsg~~Vv~vAqRrIl~kp~~~~~q~RPRSRTLTaVhdaiLeDlV 131 (191)
-+||..+..-+ ..+.+-..||++....-.|-++|.=. -.+=||....+-=.-|++
T Consensus 214 ciP~~~le~i~~kl~~~~~~~~~~~~~~w~~~L~~~v~~v~V~l~A~l~-------------~~~ltl~~il~L~vGDVI 280 (332)
T COG1868 214 CIPYSMLEPIREKLSSRMQENTREKDPEWRKELRQQVQRVEVELEARLG-------------EISLTLREILRLEVGDVI 280 (332)
T ss_pred EeeHHHHHHHHHHHhhhhhhcccccChHHHHHHHHHHhcCceEEEEEee-------------cceeeHHHHhCCCCCcEE
Confidence 99998875432 22355677888888777777776421 111222222211112322
Q ss_pred -ccceeeceeEEEeeCCceEEEEEeCcccccc---hhhhhhhH
Q 029596 132 -YPAEIVGKRVRYRLDGSKIIKIFLDPKERNN---TEYKLESF 170 (191)
Q Consensus 132 -~P~eIVGKR~r~r~DGsk~~KV~LD~kd~~~---ve~Kldtf 170 (191)
++.. +..+++++.+|..+.++.+-..-.+. +.+.+++=
T Consensus 281 ~l~~~-~~d~v~v~v~g~~~f~c~~G~~g~~~aVkI~~~i~~~ 322 (332)
T COG1868 281 PLEKP-ADDRVTVSVGGKPKFLCQYGKSGGQYAVKILELINSE 322 (332)
T ss_pred ECCCC-CCceEEEEECCEEEEEEeccccCCEEEEEEeeecCcc
Confidence 2333 68999999999999999987644442 55555443
No 23
>cd08971 AcNei2_N N-terminal domain of the actinomycetal Nei2 and related DNA glycosylases. This family contains the N-terminal domain of the actinomycetal Nei2 and related DNA glycosylases. It belongs to the FpgNei_N, [N-terminal domain of Fpg (formamidopyrimidine-DNA glycosylase, MutM)_Nei (endonuclease VIII)] domain superfamily. DNA glycosylases maintain genome integrity by recognizing base lesions created by ionizing radiation, alkylating or oxidizing agents, and endogenous reactive oxygen species. They initiate the base-excision repair process, which is completed with the help of enzymes such as phosphodiesterases, AP endonucleases, DNA polymerases and DNA ligases. DNA glycosylases cleave the N-glycosyl bond between the sugar and the damaged base, creating an AP (apurinic/apyrimidinic) site. Most FpgNei DNA glycosylases use their N-terminal proline residue as the key catalytic nucleophile, and the reaction proceeds via a Schiff base intermediate. This family contains mostly actinom
Probab=27.18 E-value=1.2e+02 Score=23.30 Aligned_cols=60 Identities=27% Similarity=0.398 Sum_probs=36.7
Q ss_pred HHHHHHHhhcCCCeEEEEeeecccCCCCCCCccccCCCcchhhhhHhhhhcccccceeeceeEEEeeCCceEEEEEeC
Q 029596 79 RLVRELEKKFSGKDVVLIATRRIVRPPKKGSAVQRPRSRTLTAVHDAMLEDVVYPAEIVGKRVRYRLDGSKIIKIFLD 156 (191)
Q Consensus 79 rLv~ELEKKfsg~~Vv~vAqRrIl~kp~~~~~q~RPRSRTLTaVhdaiLeDlV~P~eIVGKR~r~r~DGsk~~KV~LD 156 (191)
++.+.|++.+.|+.|.-+--++ ++ .....=..||+++| +=.||.+-+.+|+...+-+||-
T Consensus 9 ~v~~~L~~~~~G~~I~~v~~~~----~~--~~~~~l~G~~i~~v------------~R~GK~L~~~l~~~~~l~vHLg 68 (114)
T cd08971 9 RAARRLRRALAGRVLTRADLRV----PR--LATADLAGRTVEEV------------VARGKHLLIRFDGGLTLHTHLR 68 (114)
T ss_pred HHHHHHHHHhCCCEEEEEEecC----ch--hhhhhcCCCEEEEE------------EEeeeEEEEEcCCCCEEEEeCC
Confidence 4667899999999886554221 10 01222334444444 4489999999876557777764
No 24
>PF06858 NOG1: Nucleolar GTP-binding protein 1 (NOG1); InterPro: IPR010674 This domain represents a conserved region of approximately 60 residues in length within nucleolar GTP-binding protein 1 (NOG1). The NOG1 family includes eukaryotic, bacterial and archaeal proteins. In Saccharomyces cerevisiae, the NOG1 gene has been shown to be essential for cell viability, suggesting that NOG1 may play an important role in nucleolar functions. In particular, NOG1 is believed to be functionally linked to ribosome biogenesis, which occurs in the nucleolus. In eukaryotes, NOG1 mutants were found to disrupt the biogenesis of the 60S ribosomal subunit []. The DRG and OBG proteins as well as the prokaryotic NOG-like proteins are homologous throughout their length to the amino half of eukaryotic NOG1, which contains the GTP binding motifs (IPR006073 from INTERPRO); the N-terminal GTP-binding motif is required for function.; GO: 0005525 GTP binding; PDB: 2E87_A.
Probab=27.05 E-value=90 Score=22.28 Aligned_cols=25 Identities=20% Similarity=0.371 Sum_probs=20.1
Q ss_pred HHHHHHHHHHHHhhcCCCeEEEEee
Q 029596 74 RKIHSRLVRELEKKFSGKDVVLIAT 98 (191)
Q Consensus 74 ~Kiq~rLv~ELEKKfsg~~Vv~vAq 98 (191)
-.-|..|-.|+..-|++++++.|..
T Consensus 31 ie~Q~~L~~~ik~~F~~~P~i~V~n 55 (58)
T PF06858_consen 31 IEEQLSLFKEIKPLFPNKPVIVVLN 55 (58)
T ss_dssp HHHHHHHHHHHHHHTTTS-EEEEE-
T ss_pred HHHHHHHHHHHHHHcCCCCEEEEEe
Confidence 3558999999999999999998865
No 25
>COG0195 NusA Transcription elongation factor [Transcription]
Probab=26.57 E-value=1.4e+02 Score=25.57 Aligned_cols=104 Identities=15% Similarity=0.208 Sum_probs=70.6
Q ss_pred EEEEEeeh-hhHHHHHHHHHHHHHHHHhhcCCCeEEEEeeecccCCCCCCCccccCCCcchhhhhHhhhhcccccceeec
Q 029596 60 AIVVYVPY-RLRKAYRKIHSRLVRELEKKFSGKDVVLIATRRIVRPPKKGSAVQRPRSRTLTAVHDAMLEDVVYPAEIVG 138 (191)
Q Consensus 60 AivIfVP~-~~lk~f~Kiq~rLv~ELEKKfsg~~Vv~vAqRrIl~kp~~~~~q~RPRSRTLTaVhdaiLeDlV~P~eIVG 138 (191)
|+++|++- ...-+.----..-++.+.+++ |++|=+|-.- +- -...+-++++|+++++
T Consensus 76 av~~~~~~~d~vG~~iG~~G~rvk~i~~eL-gekIdVVe~s---~d------------------~~~fI~nal~Pa~v~~ 133 (190)
T COG0195 76 AVVSNVVKIDPVGACIGKRGSRVKAVSEEL-GEKIDVVEWS---ED------------------PAEFIKNALAPAEVLS 133 (190)
T ss_pred ceEEeecCcCchhhhccCCChHHHHHHHHh-CCceEEEEeC---CC------------------HHHHHHHhcCcceEeE
Confidence 68888763 222333333455566666666 5777776543 11 1234557788999998
Q ss_pred eeEEEeeCCceEEEEEeCcccccchhhhhhhHHHHHhhhcCCceEEECC
Q 029596 139 KRVRYRLDGSKIIKIFLDPKERNNTEYKLESFSGVYRKLTGKDVVFDYP 187 (191)
Q Consensus 139 KR~r~r~DGsk~~KV~LD~kd~~~ve~Kldtfs~VYkkLTgk~v~FeFp 187 (191)
=.+... ||. ...|...+.|...+=-|-..-...-++|||..+..++.
T Consensus 134 V~~~~~-d~~-~~~v~V~~~~~~~aIGk~G~Nvrla~~Ltg~~i~I~~~ 180 (190)
T COG0195 134 VNIKED-DGH-VAIVVVPPDQLSLAIGKGGQNVRLASQLTGWEIDIETI 180 (190)
T ss_pred EEEEeC-CCc-EEEEEECHHHHhhccCcccHHHHHHHHHhCCEEEEEeh
Confidence 777766 666 88888888888876667666777889999999887764
No 26
>cd01874 Cdc42 Cdc42 subfamily. Cdc42 is an essential GTPase that belongs to the Rho family of Ras-like GTPases. These proteins act as molecular switches by responding to exogenous and/or endogenous signals and relaying those signals to activate downstream components of a biological pathway. Cdc42 transduces signals to the actin cytoskeleton to initiate and maintain polarized growth and to mitogen-activated protein morphogenesis. In the budding yeast Saccharomyces cerevisiae, Cdc42 plays an important role in multiple actin-dependent morphogenetic events such as bud emergence, mating-projection formation, and pseudohyphal growth. In mammalian cells, Cdc42 regulates a variety of actin-dependent events and induces the JNK/SAPK protein kinase cascade, which leads to the activation of transcription factors within the nucleus. Cdc42 mediates these processes through interactions with a myriad of downstream effectors, whose number and regulation we are just starting to understand. In addi
Probab=25.54 E-value=2.3e+02 Score=22.26 Aligned_cols=44 Identities=11% Similarity=0.237 Sum_probs=31.8
Q ss_pred eEEEEEeehhhHHHHHHHHHHHHHHHHhhcCCCeEEEEeeeccc
Q 029596 59 KAIVVYVPYRLRKAYRKIHSRLVRELEKKFSGKDVVLIATRRIV 102 (191)
Q Consensus 59 kAivIfVP~~~lk~f~Kiq~rLv~ELEKKfsg~~Vv~vAqRrIl 102 (191)
.++|+.+=+.-..+|..+...+..++++.-++.++++|+...=+
T Consensus 74 ~~~ilv~d~~~~~s~~~~~~~w~~~i~~~~~~~piilvgnK~Dl 117 (175)
T cd01874 74 DVFLVCFSVVSPSSFENVKEKWVPEITHHCPKTPFLLVGTQIDL 117 (175)
T ss_pred CEEEEEEECCCHHHHHHHHHHHHHHHHHhCCCCCEEEEEECHhh
Confidence 35555555566788998876667888776678899999986433
No 27
>cd00877 Ran Ran (Ras-related nuclear proteins) /TC4 subfamily of small GTPases. Ran GTPase is involved in diverse biological functions, such as nuclear transport, spindle formation during mitosis, DNA replication, and cell division. Among the Ras superfamily, Ran is a unique small G protein. It does not have a lipid modification motif at the C-terminus to bind to the membrane, which is often observed within the Ras superfamily. Ran may therefore interact with a wide range of proteins in various intracellular locations. Like other GTPases, Ran exists in GTP- and GDP-bound conformations that interact differently with effectors. Conversion between these forms and the assembly or disassembly of effector complexes requires the interaction of regulator proteins. The intrinsic GTPase activity of Ran is very low, but it is greatly stimulated by a GTPase-activating protein (RanGAP1) located in the cytoplasm. By contrast, RCC1, a guanine nucleotide exchange factor that generates RanGTP, is
Probab=25.13 E-value=2.3e+02 Score=21.83 Aligned_cols=43 Identities=14% Similarity=0.284 Sum_probs=33.7
Q ss_pred eEEEEEeehhhHHHHHHHHHHHHHHHHhhcCCCeEEEEeeeccc
Q 029596 59 KAIVVYVPYRLRKAYRKIHSRLVRELEKKFSGKDVVLIATRRIV 102 (191)
Q Consensus 59 kAivIfVP~~~lk~f~Kiq~rLv~ELEKKfsg~~Vv~vAqRrIl 102 (191)
-|+|+.+-.....+|..++. .+.++.+...+.++++||...=+
T Consensus 74 d~~i~v~d~~~~~s~~~~~~-~~~~i~~~~~~~piiiv~nK~Dl 116 (166)
T cd00877 74 QCAIIMFDVTSRVTYKNVPN-WHRDLVRVCGNIPIVLCGNKVDI 116 (166)
T ss_pred CEEEEEEECCCHHHHHHHHH-HHHHHHHhCCCCcEEEEEEchhc
Confidence 46778888888889988864 67788877768999999987544
No 28
>TIGR01397 fliM_switch flagellar motor switch protein FliM. Members of this family are the flagellar motor switch protein FliM. The family excludes FliM homologs that lack an N-terminal region critical to interaction with phosphorylated CheY. One set lacking this N-terminal region is found in Rhizobium meliloti, in which the direction of flagellar rotation is not reversible (i.e. the FliM homolog does not act to reverse the motor direction), and in related species. Another is found in Buchnera, an obligate intracellular endosymbiont with genes for many of the components of the flagellar apparatus, but not, apparently, for flagellin iself.
Probab=25.10 E-value=4.8e+02 Score=23.00 Aligned_cols=62 Identities=19% Similarity=0.295 Sum_probs=42.3
Q ss_pred CCCCCCHHHHHHHHHhHhhhc---------------------CChHHHh--cCcceeeeeeEEEEecCCeeEEEEEeehh
Q 029596 12 KDAEPTEFEETVAQALFDLEN---------------------TNQELKS--DLKDLYINQAIQMDVSGNRKAIVVYVPYR 68 (191)
Q Consensus 12 ~~~~p~e~E~~Vaqal~dLE~---------------------~~~dLK~--~L~~l~i~~akei~v~~~kkAivIfVP~~ 68 (191)
.+.++|++|..+.+-+++.-. +|+.+-+ .-.+..+...=++++++..--+-+.+|+.
T Consensus 138 ~~R~lT~iE~~i~~~~~~~~~~~l~~aw~~~~~~~~~~~~~e~np~~~~i~~~~e~vv~~~f~v~i~~~~g~~~l~lP~~ 217 (320)
T TIGR01397 138 EGREFTEIERRVIDRILDRVLEDLKEAWSPVMPLEPELDRSETNPQFAQIVPPNEIVVLVSFSVEVGETEGMINICLPYS 217 (320)
T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHhhCceeeeeEEEEEEcCHHHHhccCCCcEEEEEEEEEEECCceEEEEEEeeHH
Confidence 456889999999886665320 2333322 22345666777889988778888999998
Q ss_pred hHHHH
Q 029596 69 LRKAY 73 (191)
Q Consensus 69 ~lk~f 73 (191)
.+...
T Consensus 218 ~lepi 222 (320)
T TIGR01397 218 TLEPI 222 (320)
T ss_pred HHHHH
Confidence 87654
No 29
>COG4496 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.77 E-value=22 Score=27.95 Aligned_cols=31 Identities=29% Similarity=0.506 Sum_probs=24.7
Q ss_pred ccCCCcchhhhhHhhh------------hcccccceee--ceeEE
Q 029596 112 QRPRSRTLTAVHDAML------------EDVVYPAEIV--GKRVR 142 (191)
Q Consensus 112 ~RPRSRTLTaVhdaiL------------eDlV~P~eIV--GKR~r 142 (191)
.+||+.+|.+..|||| +|||-|.||- ..|++
T Consensus 4 ~klr~~~Ld~l~dailtL~n~eecy~FfdDlcTinEiqslaqRlq 48 (100)
T COG4496 4 EKLRGAALDELFDAILTLENLEECYAFFDDLCTINEIQSLAQRLQ 48 (100)
T ss_pred cchhhHHHHHHHHHHHHhccHHHHHHHHHhhcCHHHHHHHHHHHH
Confidence 4688999999999985 8999999983 34444
No 30
>PF14085 DUF4265: Domain of unknown function (DUF4265)
Probab=24.34 E-value=1e+02 Score=23.92 Aligned_cols=64 Identities=19% Similarity=0.257 Sum_probs=45.6
Q ss_pred hhcccccceeeceeEEEee---CCceEEEEEeCcccc---cchhhhhhhHHHHHhhhcCCceEEECCCCC
Q 029596 127 LEDVVYPAEIVGKRVRYRL---DGSKIIKIFLDPKER---NNTEYKLESFSGVYRKLTGKDVVFDYPITD 190 (191)
Q Consensus 127 LeDlV~P~eIVGKR~r~r~---DGsk~~KV~LD~kd~---~~ve~Kldtfs~VYkkLTgk~v~FeFp~~~ 190 (191)
+.|+|.=..-=|..+-.++ .|...++|++++... +.+...|..+..-+....+.-+.|..|...
T Consensus 28 ~gDvV~~~~~~g~~~~~~~v~~sGnsTiRv~~~~~~~~~~~~v~~~l~~lG~~~E~~~~~~lav~VP~~~ 97 (117)
T PF14085_consen 28 LGDVVRAEPDDGELWFQKVVESSGNSTIRVIFDDPGPDDIEAVREELEALGCTVEGFSERMLAVDVPPSV 97 (117)
T ss_pred CCCEEEEEeCCCeEEEEEEEecCCCEEEEEEEcCCcchhHHHHHHHHHHcCCeEEccCCCEEEEEECCCC
Confidence 4566652221166666666 999999999977655 567777888887788888888888888654
No 31
>PF13479 AAA_24: AAA domain
Probab=23.95 E-value=1.4e+02 Score=24.81 Aligned_cols=62 Identities=18% Similarity=0.303 Sum_probs=34.8
Q ss_pred HHHHHHHHHHHHHHHHhh--cCCCeEEEEeeecccCCCCCC-CccccCCCcchhhhhHhhhhcccccceee
Q 029596 70 RKAYRKIHSRLVRELEKK--FSGKDVVLIATRRIVRPPKKG-SAVQRPRSRTLTAVHDAMLEDVVYPAEIV 137 (191)
Q Consensus 70 lk~f~Kiq~rLv~ELEKK--fsg~~Vv~vAqRrIl~kp~~~-~~q~RPRSRTLTaVhdaiLeDlV~P~eIV 137 (191)
++.|..++..+.+-+.+- ..|+||||+|.-..-.-+..+ ..+.+|. .+....+-+.+-+++|
T Consensus 105 ~~~yg~~~~~~~~~i~~l~~~~~~~VI~tah~~~~~~~~~~~~~~~~~~------l~~k~~~~l~~~~D~V 169 (213)
T PF13479_consen 105 GKGYGELQQEFMRFIDKLLNALGKNVIFTAHAKEEEDEDGGKYTRYKPK------LGKKVRNELPGWFDVV 169 (213)
T ss_pred cchHHHHHHHHHHHHHHHHHHCCCcEEEEEEEEEEEcCCCCceeEEeec------cChhHHhhhhecccEE
Confidence 567766665555554431 249999999977654332111 1222332 3334555666777777
No 32
>cd02970 PRX_like2 Peroxiredoxin (PRX)-like 2 family; hypothetical proteins that show sequence similarity to PRXs. Members of this group contain a CXXC motif, similar to TRX. The second cysteine in the motif corresponds to the peroxidatic cysteine of PRX, however, these proteins do not contain the other two residues of the catalytic triad of PRX. PRXs confer a protective antioxidant role in cells through their peroxidase activity in which hydrogen peroxide, peroxynitrate, and organic hydroperoxides are reduced and detoxified using reducing equivalents derived from either thioredoxin, glutathione, trypanothione and AhpF. TRXs alter the redox state of target proteins by catalyzing the reduction of their disulfide bonds via the CXXC motif using reducing equivalents derived from either NADPH or ferredoxins.
Probab=23.43 E-value=1.6e+02 Score=21.77 Aligned_cols=42 Identities=21% Similarity=0.188 Sum_probs=25.1
Q ss_pred CCeeEEEEEeehhhHHHHHHHHHHHHHHHHhhcCCCeEEEEee
Q 029596 56 GNRKAIVVYVPYRLRKAYRKIHSRLVRELEKKFSGKDVVLIAT 98 (191)
Q Consensus 56 ~~kkAivIfVP~~~lk~f~Kiq~rLv~ELEKKfsg~~Vv~vAq 98 (191)
++++.+|+|+|-.-....+. +..-+.++-.+|.++.|-+||-
T Consensus 22 ~~~~~vl~f~~~~~Cp~C~~-~~~~l~~~~~~~~~~~v~vv~V 63 (149)
T cd02970 22 GEGPVVVVFYRGFGCPFCRE-YLRALSKLLPELDALGVELVAV 63 (149)
T ss_pred cCCCEEEEEECCCCChhHHH-HHHHHHHHHHHHHhcCeEEEEE
Confidence 35788999998765555554 3334445656664444555543
No 33
>cd04172 Rnd3_RhoE_Rho8 Rnd3/RhoE/Rho8 subfamily. Rnd3/RhoE/Rho8 is a member of the novel Rho subfamily Rnd, together with Rnd1/Rho6 and Rnd2/Rho7. Rnd3/RhoE is known to bind the serine-threonine kinase ROCK I. Unphosphorylated Rnd3/RhoE associates primarily with membranes, but ROCK I-phosphorylated Rnd3/RhoE localizes in the cytosol. Phosphorylation of Rnd3/RhoE correlates with its activity in disrupting RhoA-induced stress fibers and inhibiting Ras-induced fibroblast transformation. In cells that lack stress fibers, such as macrophages and monocytes, Rnd3/RhoE induces a redistribution of actin, causing morphological changes in the cell. In addition, Rnd3/RhoE has been shown to inhibit cell cycle progression in G1 phase at a point upstream of the pRb family pocket protein checkpoint. Rnd3/RhoE has also been shown to inhibit Ras- and Raf-induced fibroblast transformation. In mammary epithelial tumor cells, Rnd3/RhoE regulates the assembly of the apical junction complex and tight
Probab=22.64 E-value=2.7e+02 Score=22.40 Aligned_cols=44 Identities=7% Similarity=0.196 Sum_probs=34.0
Q ss_pred eEEEEEeehhhHHHHHHHHHHHHHHHHhhcCCCeEEEEeeeccc
Q 029596 59 KAIVVYVPYRLRKAYRKIHSRLVRELEKKFSGKDVVLIATRRIV 102 (191)
Q Consensus 59 kAivIfVP~~~lk~f~Kiq~rLv~ELEKKfsg~~Vv~vAqRrIl 102 (191)
-++++.+-+.-..+|..+..+.+.++.+-.++.++++||...=|
T Consensus 78 d~~ilvyDit~~~Sf~~~~~~w~~~i~~~~~~~piilVgNK~DL 121 (182)
T cd04172 78 DAVLICFDISRPETLDSVLKKWKGEIQEFCPNTKMLLVGCKSDL 121 (182)
T ss_pred CEEEEEEECCCHHHHHHHHHHHHHHHHHHCCCCCEEEEeEChhh
Confidence 46666666777899999877777888776678899999987544
No 34
>cd04173 Rnd2_Rho7 Rnd2/Rho7 subfamily. Rnd2/Rho7 is a member of the novel Rho subfamily Rnd, together with Rnd1/Rho6 and Rnd3/RhoE/Rho8. Rnd2/Rho7 is transiently expressed in radially migrating cells in the brain while they are within the subventricular zone of the hippocampus and cerebral cortex. These migrating cells typically develop into pyramidal neurons. Cells that exogenously expressed Rnd2/Rho7 failed to migrate to upper layers of the brain, suggesting that Rnd2/Rho7 plays a role in the radial migration and morphological changes of developing pyramidal neurons, and that Rnd2/Rho7 degradation is necessary for proper cellular migration. The Rnd2/Rho7 GEF Rapostlin is found primarily in the brain and together with Rnd2/Rho7 induces dendrite branching. Unlike Rnd1/Rho6 and Rnd3/RhoE/Rho8, which are RhoA antagonists, Rnd2/Rho7 binds the GEF Pragmin and significantly stimulates RhoA activity and Rho-A mediated cell contraction. Rnd2/Rho7 is also found to be expressed in sperma
Probab=22.47 E-value=2.5e+02 Score=23.77 Aligned_cols=44 Identities=11% Similarity=0.172 Sum_probs=35.3
Q ss_pred eEEEEEeehhhHHHHHHHHHHHHHHHHhhcCCCeEEEEeeeccc
Q 029596 59 KAIVVYVPYRLRKAYRKIHSRLVRELEKKFSGKDVVLIATRRIV 102 (191)
Q Consensus 59 kAivIfVP~~~lk~f~Kiq~rLv~ELEKKfsg~~Vv~vAqRrIl 102 (191)
-++++.+-+.-..+|..+...+..|+....++-++++|+...=|
T Consensus 74 d~illvfdis~~~Sf~~i~~~w~~~~~~~~~~~piiLVgnK~DL 117 (222)
T cd04173 74 DAVLICFDISRPETLDSVLKKWQGETQEFCPNAKVVLVGCKLDM 117 (222)
T ss_pred CEEEEEEECCCHHHHHHHHHHHHHHHHhhCCCCCEEEEEECccc
Confidence 47777778888899999987777888777788899999976444
No 35
>cd01875 RhoG RhoG subfamily. RhoG is a GTPase with high sequence similarity to members of the Rac subfamily, including the regions involved in effector recognition and binding. However, RhoG does not bind to known Rac1 and Cdc42 effectors, including proteins containing a Cdc42/Rac interacting binding (CRIB) motif. Instead, RhoG interacts directly with Elmo, an upstream regulator of Rac1, in a GTP-dependent manner and forms a ternary complex with Dock180 to induce activation of Rac1. The RhoG-Elmo-Dock180 pathway is required for activation of Rac1 and cell spreading mediated by integrin, as well as for neurite outgrowth induced by nerve growth factor. Thus RhoG activates Rac1 through Elmo and Dock180 to control cell morphology. RhoG has also been shown to play a role in caveolar trafficking and has a novel role in signaling the neutrophil respiratory burst stimulated by G protein-coupled receptor (GPCR) agonists. Most Rho proteins contain a lipid modification site at the C-termin
Probab=22.46 E-value=2.8e+02 Score=22.07 Aligned_cols=45 Identities=11% Similarity=0.304 Sum_probs=33.5
Q ss_pred eEEEEEeehhhHHHHHHHHHHHHHHHHhhcCCCeEEEEeeecccC
Q 029596 59 KAIVVYVPYRLRKAYRKIHSRLVRELEKKFSGKDVVLIATRRIVR 103 (191)
Q Consensus 59 kAivIfVP~~~lk~f~Kiq~rLv~ELEKKfsg~~Vv~vAqRrIl~ 103 (191)
.++|+.+-+.-..+|..++..+..++....++-++++||...=|+
T Consensus 76 ~~~ilvydit~~~Sf~~~~~~w~~~i~~~~~~~piilvgNK~DL~ 120 (191)
T cd01875 76 NVFIICFSIASPSSYENVRHKWHPEVCHHCPNVPILLVGTKKDLR 120 (191)
T ss_pred CEEEEEEECCCHHHHHHHHHHHHHHHHhhCCCCCEEEEEeChhhh
Confidence 355665666667889998766667777666788999999987664
No 36
>smart00175 RAB Rab subfamily of small GTPases. Rab GTPases are implicated in vesicle trafficking.
Probab=21.18 E-value=3.4e+02 Score=19.91 Aligned_cols=40 Identities=10% Similarity=0.274 Sum_probs=29.5
Q ss_pred eEEEEEeehhhHHHHHHHHHHHHHHHHhhc-CCCeEEEEeee
Q 029596 59 KAIVVYVPYRLRKAYRKIHSRLVRELEKKF-SGKDVVLIATR 99 (191)
Q Consensus 59 kAivIfVP~~~lk~f~Kiq~rLv~ELEKKf-sg~~Vv~vAqR 99 (191)
-++++.+......+|..+.. +..++++-- ++.++++++..
T Consensus 74 d~~ilv~d~~~~~s~~~~~~-~l~~~~~~~~~~~pivvv~nK 114 (164)
T smart00175 74 VGALLVYDITNRESFENLKN-WLKELREYADPNVVIMLVGNK 114 (164)
T ss_pred CEEEEEEECCCHHHHHHHHH-HHHHHHHhCCCCCeEEEEEEc
Confidence 36777777788888988876 666666544 57899999874
No 37
>cd04129 Rho2 Rho2 subfamily. Rho2 is a fungal GTPase that plays a role in cell morphogenesis, control of cell wall integrity, control of growth polarity, and maintenance of growth direction. Rho2 activates the protein kinase C homolog Pck2, and Pck2 controls Mok1, the major (1-3) alpha-D-glucan synthase. Together with Rho1 (RhoA), Rho2 regulates the construction of the cell wall. Unlike Rho1, Rho2 is not an essential protein, but its overexpression is lethal. Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for proper intracellular localization via membrane attachment. As with other Rho family GTPases, the GDP/GTP cycling is regulated by GEFs (guanine nucleotide exchange factors), GAPs (GTPase-activating proteins) and GDIs (guanine nucleotide dissociation inhibitors).
Probab=20.87 E-value=3.2e+02 Score=21.48 Aligned_cols=41 Identities=10% Similarity=0.306 Sum_probs=31.0
Q ss_pred EEEEEeehhhHHHHHHHHHHHHHHHHhhcCCCeEEEEeeec
Q 029596 60 AIVVYVPYRLRKAYRKIHSRLVRELEKKFSGKDVVLIATRR 100 (191)
Q Consensus 60 AivIfVP~~~lk~f~Kiq~rLv~ELEKKfsg~~Vv~vAqRr 100 (191)
++++..-+....+|+.+...+..++.+..+..++++|+...
T Consensus 75 ~~llv~~i~~~~s~~~~~~~~~~~i~~~~~~~piilvgnK~ 115 (187)
T cd04129 75 VILIGFAVDTPDSLENVRTKWIEEVRRYCPNVPVILVGLKK 115 (187)
T ss_pred EEEEEEECCCHHHHHHHHHHHHHHHHHhCCCCCEEEEeeCh
Confidence 45554555677899999877778887777788899998763
Done!