Query psy16521
Match_columns 201
No_of_seqs 201 out of 1204
Neff 8.1
Searched_HMMs 46136
Date Fri Aug 16 23:25:48 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16521.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16521hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF03184 DDE_1: DDE superfamil 100.0 3.6E-30 7.8E-35 206.8 5.4 135 1-136 48-185 (217)
2 PF13358 DDE_3: DDE superfamil 98.7 5.8E-09 1.3E-13 77.8 2.4 81 39-123 54-136 (146)
3 KOG3105|consensus 98.1 5.1E-07 1.1E-11 81.4 0.5 129 2-135 280-411 (488)
4 KOG3105|consensus 97.3 0.00021 4.6E-09 64.5 3.7 118 1-125 195-319 (488)
5 COG3335 Transposase and inacti 93.4 0.52 1.1E-05 33.5 7.4 78 36-118 14-92 (132)
6 PF07592 DDE_Tnp_ISAZ013: Rhod 90.6 0.78 1.7E-05 39.1 6.1 93 31-124 137-247 (311)
7 PF00665 rve: Integrase core d 89.8 0.35 7.5E-06 34.4 2.9 37 68-104 68-104 (120)
8 PHA02517 putative transposase 85.8 4.3 9.3E-05 33.6 7.6 77 42-121 146-222 (277)
9 COG3889 Predicted solute bindi 78.6 1.1 2.4E-05 42.5 1.5 19 182-200 834-852 (872)
10 COG3889 Predicted solute bindi 46.4 15 0.00031 35.3 2.2 15 186-200 834-848 (872)
11 PF07905 PucR: Purine cataboli 46.4 51 0.0011 23.8 4.8 66 33-103 42-110 (123)
12 COG4907 Predicted membrane pro 37.9 26 0.00057 31.7 2.4 33 90-122 461-496 (595)
13 PRK14702 insertion element IS2 37.3 1.9E+02 0.0041 23.9 7.4 54 66-120 151-204 (262)
14 PRK09409 IS2 transposase TnpB; 32.0 3.1E+02 0.0068 23.1 8.6 75 45-120 167-243 (301)
15 PF12068 DUF3548: Domain of un 29.9 1.2E+02 0.0025 24.6 4.7 34 48-81 157-190 (213)
16 PLN02941 inositol-tetrakisphos 28.3 73 0.0016 27.5 3.5 56 81-136 38-94 (328)
17 PF05770 Ins134_P3_kin: Inosit 27.6 43 0.00093 28.7 2.0 55 82-136 25-80 (307)
18 PRK08857 para-aminobenzoate sy 24.8 76 0.0016 24.8 2.8 31 70-101 2-32 (193)
19 PRK13600 putative ribosomal pr 24.0 1.2E+02 0.0026 20.7 3.3 26 75-100 38-63 (84)
20 COG5019 CDC3 Septin family pro 21.7 1.4E+02 0.003 26.4 4.0 55 48-117 103-159 (373)
21 COG3545 Predicted esterase of 21.2 3E+02 0.0064 21.8 5.4 74 12-89 2-81 (181)
22 PHA02732 hypothetical protein; 21.1 48 0.001 32.1 1.1 27 167-193 1235-1263(1467)
No 1
>PF03184 DDE_1: DDE superfamily endonuclease; InterPro: IPR004875 These proteins are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which contain three carboxylate residues that are believed to be responsible for coordinating metal ions needed for catalysis. The catalytic activity of this enzyme involves DNA cleavage at a specific site followed by a strand transfer reaction. Interestingly this family also includes the CENP-B protein. This domain in that protein appears to have lost the metal binding residues and is unlikely to have endonuclease activity. Centromere Protein B (CENP-B) is a DNA-binding protein localised to the centromere []. ; GO: 0003676 nucleic acid binding
Probab=99.96 E-value=3.6e-30 Score=206.80 Aligned_cols=135 Identities=33% Similarity=0.594 Sum_probs=122.8
Q ss_pred CEEEEeecCcccCcEEEeccccCccccc--cCCCCCceEEecCCCCCchHHHHHHH-HHHHHhhhhcCCCCeEEEeecCc
Q psy16521 1 MIGIIRASGEAIPPVFVFPRVFMKDEFL--LGAPEGSLGFANKSGWSSTEIFFETI-KHIQNQIKASKENPALILMDNHE 77 (201)
Q Consensus 1 Vi~~~nA~G~~lpP~iI~~~~~~~~~~~--~~~~~~~~~~~s~~Gw~t~~i~~~wl-~~f~~~~~~~~~~~~lLilD~~~ 77 (201)
|++|++|+|..+||++|++++.....+. ...+.++.|..+++||||.++|++|| +.|.+.++ ..++++|||+|+++
T Consensus 48 ~l~~~~a~g~~lpP~vI~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~t~~~~~~wl~~~f~~~~~-~~~~~~lLllD~~~ 126 (217)
T PF03184_consen 48 VLCCNSAGGEKLPPLVIFKGKRPRCFWNDVDDKPLGVYYASSENGWMTTELFLEWLKKVFDPRVK-SPGRPVLLLLDSHS 126 (217)
T ss_pred ceeeecccccccccccccccccccccccccccccCCcccccccccccccccchhhhhhhhccccc-cccccccccccccc
Confidence 4566778889999999999998777776 45678999999999999999999999 57999887 56799999999999
Q ss_pred ccchhhHHhhhhccceEEeecCCCCCCCCCcccchhhhhhHHHHHHhHHHHhhcCCCCc
Q psy16521 78 THQSMEAIEYCRASGITLLTFPPHKTDELQPLDVAVFSPFKNRLKAGFKKWTSDNPGSI 136 (201)
Q Consensus 78 sh~s~~~~~~~~~~~I~l~~lP~~~t~~lQPLD~~~f~~lK~~~~~~~~~~~~~~~~~~ 136 (201)
+|.+.++.++|++++|.++++||+++|.+||||+++|++||..|++.+.+|+.+.....
T Consensus 127 ~h~~~~~~~~~~~~~i~l~~lP~~~t~~lQPld~~v~~~~K~~~r~~~~~~~~~~~~~~ 185 (217)
T PF03184_consen 127 SHISPEFKQLCKSNNIILVFLPPNTTHLLQPLDVGVFGPFKAHYRKKLLQWIIENGNNP 185 (217)
T ss_pred ccccccchhhhhccccccceecccccccccccchhhhhhhhHHHHHHHHHHHHhccccc
Confidence 99999999999999999999999999999999999999999999999999998876554
No 2
>PF13358 DDE_3: DDE superfamily endonuclease
Probab=98.72 E-value=5.8e-09 Score=77.75 Aligned_cols=81 Identities=27% Similarity=0.482 Sum_probs=69.5
Q ss_pred ecCCCCCchHHHHHHHHHHHHhhhhcCCCCeEEEeecCcccchhhHHhhhhc--cceEEeecCCCCCCCCCcccchhhhh
Q psy16521 39 ANKSGWSSTEIFFETIKHIQNQIKASKENPALILMDNHETHQSMEAIEYCRA--SGITLLTFPPHKTDELQPLDVAVFSP 116 (201)
Q Consensus 39 ~s~~Gw~t~~i~~~wl~~f~~~~~~~~~~~~lLilD~~~sh~s~~~~~~~~~--~~I~l~~lP~~~t~~lQPLD~~~f~~ 116 (201)
....|.++.+.+.+||+.+....... .+++|++|+.+.|.+..+.+.+++ .++.++++||++.. |+|+| .+|+.
T Consensus 54 ~~~~~~~~~~~~~~~l~~~~~~~~~~--~~~~li~DNa~~H~~~~~~~~l~~~~~~~~~~~~P~~sPd-LNpiE-~~w~~ 129 (146)
T PF13358_consen 54 FVVEGTMNSEDFIEFLEQLLRPYPRK--GRIVLIMDNASIHKSKKVQEWLEEHERGIELLFLPPYSPD-LNPIE-NVWGY 129 (146)
T ss_pred eeeeeeeccccccccccccccccccc--eEEEEecccccccccccccceeeccccccccccccCcCCc-cCHHH-HHHHH
Confidence 55677899999999999765433311 189999999999999999999998 99999999999998 89999 69999
Q ss_pred hHHHHHH
Q psy16521 117 FKNRLKA 123 (201)
Q Consensus 117 lK~~~~~ 123 (201)
+|..+++
T Consensus 130 lk~~~~~ 136 (146)
T PF13358_consen 130 LKRRIRR 136 (146)
T ss_pred HHHHHHh
Confidence 9998876
No 3
>KOG3105|consensus
Probab=98.15 E-value=5.1e-07 Score=81.41 Aligned_cols=129 Identities=20% Similarity=0.178 Sum_probs=91.8
Q ss_pred EEEEeecCcccCcEEEeccccCccccccCCC--CCceEEec-CCCCCchHHHHHHHHHHHHhhhhcCCCCeEEEeecCcc
Q psy16521 2 IGIIRASGEAIPPVFVFPRVFMKDEFLLGAP--EGSLGFAN-KSGWSSTEIFFETIKHIQNQIKASKENPALILMDNHET 78 (201)
Q Consensus 2 i~~~nA~G~~lpP~iI~~~~~~~~~~~~~~~--~~~~~~~s-~~Gw~t~~i~~~wl~~f~~~~~~~~~~~~lLilD~~~s 78 (201)
+.++.++|..+||+++++++..+..- +.. ....|+.. ...-|..+.+.+|.+...- ...-...++.+|++.+
T Consensus 280 ~~~i~~~g~~~p~~~~l~~~~~p~~~--~i~~~~k~~~r~~~~~~l~~~~~~~d~~~~~~~---~~~~~~~~~~~d~~~~ 354 (488)
T KOG3105|consen 280 LPMIRADGSKGPAYIVLPEKKFPPDQ--GIINRFKLEYRAGWKEHLMEKELMRDWLESCVG---LPAVPKKLLAWDSWPE 354 (488)
T ss_pred HHHHhhccCcCcceeeccccCCCCcC--CcchHHHHHHHhhhhhhhhhhhhhhhhhhhccc---ccccccceeeecccHH
Confidence 56788999999999999888332211 111 01112111 4444555555555553211 1123445666699999
Q ss_pred cchhhHHhhhhccceEEeecCCCCCCCCCcccchhhhhhHHHHHHhHHHHhhcCCCC
Q psy16521 79 HQSMEAIEYCRASGITLLTFPPHKTDELQPLDVAVFSPFKNRLKAGFKKWTSDNPGS 135 (201)
Q Consensus 79 h~s~~~~~~~~~~~I~l~~lP~~~t~~lQPLD~~~f~~lK~~~~~~~~~~~~~~~~~ 135 (201)
|....+...+...++....+|++++..+||+|+.++.++|...++....++....+.
T Consensus 355 ~~~~~~~~s~~~~~~~~~~ip~~~ts~iq~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 411 (488)
T KOG3105|consen 355 HKEDATKQSLKLLDVDIRVIPGGTTSFIQPLDVYWNSPWKNLIKKFTADELRAQIDD 411 (488)
T ss_pred HHHHHHHHhhhhccccccccCCccceeecccceeecccchhhhhhhHHHHHHhcccc
Confidence 999999888887799999999999999999999999999999999988888877665
No 4
>KOG3105|consensus
Probab=97.29 E-value=0.00021 Score=64.50 Aligned_cols=118 Identities=14% Similarity=0.082 Sum_probs=82.9
Q ss_pred CEEEEeecCcc-cCcEEEeccccCcccccc---CCCCCceEEecCCCCCchHHHHHHHHHHHHhhhhcCCCCe--EEEee
Q psy16521 1 MIGIIRASGEA-IPPVFVFPRVFMKDEFLL---GAPEGSLGFANKSGWSSTEIFFETIKHIQNQIKASKENPA--LILMD 74 (201)
Q Consensus 1 Vi~~~nA~G~~-lpP~iI~~~~~~~~~~~~---~~~~~~~~~~s~~Gw~t~~i~~~wl~~f~~~~~~~~~~~~--lLilD 74 (201)
++.|.+++|.. +|+.++.+.+...+.... ..+..+.+......|++...+..|...+..... .+.+ ++.+|
T Consensus 195 L~~~~~a~g~~kl~~~~~~k~~~p~~~~~~~~~~~~~~~~~~~~~~a~~t~~~~~~~~~~~~~~~~---~~~v~~V~~~~ 271 (488)
T KOG3105|consen 195 LVLRRNADGVTKLPLDLKGKKVLPASFFEANNRPELSNYHPSMIFNADETGFQKEFWLRRTLAEMG---EKEVELVQSLD 271 (488)
T ss_pred eeeeeccCccccCcHHHHHHhhchHHHHhhccccccccCChhhccccccchhhhhhhhhccccccc---ceEEEEEEecC
Confidence 57899999998 777777555544333222 234456666777888888888888775433322 3344 88999
Q ss_pred cCcccchhhHHhhhhccceE-EeecCCCCCCCCCcccchhhhhhHHHHHHhH
Q psy16521 75 NHETHQSMEAIEYCRASGIT-LLTFPPHKTDELQPLDVAVFSPFKNRLKAGF 125 (201)
Q Consensus 75 ~~~sh~s~~~~~~~~~~~I~-l~~lP~~~t~~lQPLD~~~f~~lK~~~~~~~ 125 (201)
+...|.+...........+. .+++|.. .||++.++...+|..|+..+
T Consensus 272 s~~~h~~~~~~i~~~g~~~p~~~~l~~~----~~p~~~~i~~~~k~~~r~~~ 319 (488)
T KOG3105|consen 272 SLTHHPTVLPMIRADGSKGPAYIVLPEK----KFPPDQGIINRFKLEYRAGW 319 (488)
T ss_pred CCCcchHHHHHHhhccCcCcceeecccc----CCCCcCCcchHHHHHHHhhh
Confidence 99999887555555444444 4666666 89999999999999999986
No 5
>COG3335 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=93.45 E-value=0.52 Score=33.54 Aligned_cols=78 Identities=22% Similarity=0.380 Sum_probs=57.8
Q ss_pred eEEecCCCCCchHHHHHHHHHHHHhhhhcCCCCeEEEeec-CcccchhhHHhhhhccceEEeecCCCCCCCCCcccchhh
Q psy16521 36 LGFANKSGWSSTEIFFETIKHIQNQIKASKENPALILMDN-HETHQSMEAIEYCRASGITLLTFPPHKTDELQPLDVAVF 114 (201)
Q Consensus 36 ~~~~s~~Gw~t~~i~~~wl~~f~~~~~~~~~~~~lLilD~-~~sh~s~~~~~~~~~~~I~l~~lP~~~t~~lQPLD~~~f 114 (201)
.......+-++.+.+..|++.+..... . .+..+++|+ ...|......+.++..+..++++|+++.. +.|.+. ++
T Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~flp~y~p~-~n~ie~-~~ 88 (132)
T COG3335 14 IERMGLITEANKKFFIGFLRIYLWLVL--K-KPIVLILDNVASFHRKKALKEDAEENGIHLLFLPPYSPD-LNPIEY-AW 88 (132)
T ss_pred eeEeeeccccCHHHHHHHHHHHhhhhc--c-CceEEEecCCcccchHHHHHHHHHhCCccEEEcCCCCCC-ccHHHH-HH
Confidence 344555666888899999987543322 3 677888998 77777778888999999999999999987 556655 45
Q ss_pred hhhH
Q psy16521 115 SPFK 118 (201)
Q Consensus 115 ~~lK 118 (201)
+.+|
T Consensus 89 ~~~k 92 (132)
T COG3335 89 KKLK 92 (132)
T ss_pred HHHH
Confidence 5555
No 6
>PF07592 DDE_Tnp_ISAZ013: Rhodopirellula transposase DDE domain; InterPro: IPR011518 These transposases are found in the planctomycete Rhodopirellula baltica, the cyanobacterium Nostoc, and the Gram-positive bacterium Streptomyces. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=90.58 E-value=0.78 Score=39.14 Aligned_cols=93 Identities=23% Similarity=0.415 Sum_probs=62.3
Q ss_pred CCCCceEEecCCCCCc-------hHHHHHHHHHHHHhh-hhc--CCCCeEEEeecCcccch------hhHHhhhhcc--c
Q psy16521 31 APEGSLGFANKSGWSS-------TEIFFETIKHIQNQI-KAS--KENPALILMDNHETHQS------MEAIEYCRAS--G 92 (201)
Q Consensus 31 ~~~~~~~~~s~~Gw~t-------~~i~~~wl~~f~~~~-~~~--~~~~~lLilD~~~sh~s------~~~~~~~~~~--~ 92 (201)
.|-|++-.....||++ .+...+=|+.++..+ +.+ .-+..|+..|+-++.-+ .++++++.+. .
T Consensus 137 ~PyGiyd~~~n~g~v~vg~s~dTa~Fav~~i~~WW~~~g~~~yp~a~~lli~~D~GgsN~~r~r~wk~~L~~la~~~gl~ 216 (311)
T PF07592_consen 137 IPYGIYDPAANEGWVSVGTSHDTADFAVDSIRRWWEEMGKARYPHAKRLLITADNGGSNGSRRRLWKKRLQELADETGLS 216 (311)
T ss_pred ccceeEeccCCeEEEEEecCcccHHHHHHHHHHHHHHhChhhcCchheEEEeccCCCCccchhHHHHHHHHHHHHHhCCE
Confidence 4566766777788875 444444445444444 222 24567778886554432 4567777765 5
Q ss_pred eEEeecCCCCCCCCCcccchhhhhhHHHHHHh
Q psy16521 93 ITLLTFPPHKTDELQPLDVAVFSPFKNRLKAG 124 (201)
Q Consensus 93 I~l~~lP~~~t~~lQPLD~~~f~~lK~~~~~~ 124 (201)
|.+.++||+++- ..|++..+|+.+-..|+-.
T Consensus 217 I~v~hyPP~tSK-wN~IEHRlfs~is~~w~G~ 247 (311)
T PF07592_consen 217 IRVCHYPPGTSK-WNPIEHRLFSHISRNWRGR 247 (311)
T ss_pred EEEEEcCCCccc-ccchhhhHhHhhhhhcCCC
Confidence 889999999876 6899988999988888654
No 7
>PF00665 rve: Integrase core domain; InterPro: IPR001584 Integrase comprises three domains capable of folding independently and whose three-dimensional structures are known. However, the manner in which the N-terminal, catalytic, and C-terminal domains interact in the holoenzyme remains obscure. Numerous studies indicate that the enzyme functions as a multimer, minimally a dimer. The integrase proteins from Human immunodeficiency virus 1 (HIV-1) and Avian sarcoma virus (ASV) have been studied most carefully with respect to the structural basis of catalysis. Although the active site of ASV integrase does not undergo significant conformational changes on binding the required metal cofactor, that of HIV-1 does. This active site-mediated conformational change in HIV-1 reorganises the catalytic core and C-terminal domains and appears to promote an interaction that is favourable for catalysis []. Retroviral integrase is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. The presence of retrovirus integrase-related gene sequences in eukaryotes is known. Bacterial transposases involved in the transposition of the insertion sequence also belong to this group. HIV integrase catalyses the incorporation of virally derived DNA into the human genome. This unique step in the virus life cycle provides a variety of points for intervention and hence is an attractive target for the development of new therapeutics for the treatment of AIDS []. Substrate recognition by the retroviral integrase enzyme is critical for retroviral integration. To catalyse this recombination event, integrase must recognise and act on two types of substrates, viral DNA and host DNA, yet the necessary interactions exhibit markedly different degrees of specificity [].; GO: 0015074 DNA integration; PDB: 3AO3_A 3OVN_A 3AO5_A 3AO4_A 3AO1_A 1C6V_D 3HPG_A 3HPH_A 3OYD_A 3OYF_B ....
Probab=89.76 E-value=0.35 Score=34.35 Aligned_cols=37 Identities=27% Similarity=0.440 Sum_probs=32.9
Q ss_pred CeEEEeecCcccchhhHHhhhhccceEEeecCCCCCC
Q psy16521 68 PALILMDNHETHQSMEAIEYCRASGITLLTFPPHKTD 104 (201)
Q Consensus 68 ~~lLilD~~~sh~s~~~~~~~~~~~I~l~~lP~~~t~ 104 (201)
|..+..|+.+...+..+.++|++.+|.+...||+..+
T Consensus 68 p~~i~tD~g~~f~~~~~~~~~~~~~i~~~~~~~~~p~ 104 (120)
T PF00665_consen 68 PRVIRTDNGSEFTSHAFEAWCKHLGIKHVFTPPYTPQ 104 (120)
T ss_dssp -SEEEEESCHHHHSHHHHHHHHHHT-EEEESSTSSTH
T ss_pred ceecccccccccccchhhhHHHHcCceEeeCCCCChh
Confidence 8999999999999999999999999999999998865
No 8
>PHA02517 putative transposase OrfB; Reviewed
Probab=85.84 E-value=4.3 Score=33.59 Aligned_cols=77 Identities=12% Similarity=0.111 Sum_probs=52.5
Q ss_pred CCCCchHHHHHHHHHHHHhhhhcCCCCeEEEeecCcccchhhHHhhhhccceEEeecCCCCCCCCCcccchhhhhhHHHH
Q psy16521 42 SGWSSTEIFFETIKHIQNQIKASKENPALILMDNHETHQSMEAIEYCRASGITLLTFPPHKTDELQPLDVAVFSPFKNRL 121 (201)
Q Consensus 42 ~Gw~t~~i~~~wl~~f~~~~~~~~~~~~lLilD~~~sh~s~~~~~~~~~~~I~l~~lP~~~t~~lQPLD~~~f~~lK~~~ 121 (201)
..-++.+...+.|+....... .....++..|+-+...+..+.++|++.+|...+.++++..---+++. +|+.||..+
T Consensus 146 ~~~~~~~~~~~~l~~a~~~~~--~~~~~i~~sD~G~~y~s~~~~~~~~~~gi~~~~~~~~~P~~N~~iEr-~~~tlK~e~ 222 (277)
T PHA02517 146 SSSMDTDFVLDALEQALWARG--RPGGLIHHSDKGSQYVSLAYTQRLKEAGIRASTGSRGDSYDNAPAES-INGLYKAEV 222 (277)
T ss_pred CCCCChHHHHHHHHHHHHhcC--CCcCcEeecccccccchHHHHHHHHHcCcccccCCCCCCCCccccch-hHhhhhhhh
Confidence 334666777777764322111 12334666788888888899999999999999999998665445554 677777654
No 9
>COG3889 Predicted solute binding protein [General function prediction only]
Probab=78.58 E-value=1.1 Score=42.48 Aligned_cols=19 Identities=47% Similarity=0.699 Sum_probs=12.3
Q ss_pred eeeeeeeeeeeccccCcCC
Q psy16521 182 TTTTTTTTGVGGVGGGVGG 200 (201)
Q Consensus 182 ~t~~~~~~~~~~~~~~~~~ 200 (201)
+|+||++|.+..++||+-|
T Consensus 834 ~TtTtTsps~ttt~~~i~g 852 (872)
T COG3889 834 STTTTTSPSQTTTGGGICG 852 (872)
T ss_pred cceeeecccccccccccch
Confidence 4566666667777777644
No 10
>COG3889 Predicted solute binding protein [General function prediction only]
Probab=46.45 E-value=15 Score=35.34 Aligned_cols=15 Identities=33% Similarity=0.350 Sum_probs=7.5
Q ss_pred eeeeeeeccccCcCC
Q psy16521 186 TTTTGVGGVGGGVGG 200 (201)
Q Consensus 186 ~~~~~~~~~~~~~~~ 200 (201)
.++|+-+-.-+++||
T Consensus 834 ~TtTtTsps~ttt~~ 848 (872)
T COG3889 834 STTTTTSPSQTTTGG 848 (872)
T ss_pred cceeeeccccccccc
Confidence 344555555555554
No 11
>PF07905 PucR: Purine catabolism regulatory protein-like family; InterPro: IPR012914 This domain is found in the purine catabolism regulatory protein expressed by Bacillus subtilis (PucR, O32138 from SWISSPROT). PucR is thought to be a transcriptional regulator of genes involved in the purine degradation pathway, and may contain a LysR-like DNA-binding domain. It is similar to LysR-type regulators in that it represses its own expression []. The other members of this family are also putative regulatory proteins.
Probab=46.37 E-value=51 Score=23.83 Aligned_cols=66 Identities=20% Similarity=0.217 Sum_probs=42.6
Q ss_pred CCceEEecCCCCCc-h-HHHHHHHHHHHHhhhhcCCCCeEEE-eecCcccchhhHHhhhhccceEEeecCCCCC
Q psy16521 33 EGSLGFANKSGWSS-T-EIFFETIKHIQNQIKASKENPALIL-MDNHETHQSMEAIEYCRASGITLLTFPPHKT 103 (201)
Q Consensus 33 ~~~~~~~s~~Gw~t-~-~i~~~wl~~f~~~~~~~~~~~~lLi-lD~~~sh~s~~~~~~~~~~~I~l~~lP~~~t 103 (201)
.|-.+..+--.+-+ + +.+.+|++.+.. .+---|.+ .+.+-.++..+++++|.++++-++.+|.+..
T Consensus 42 ~gElvlttg~~~~~~~~~~~~~~i~~L~~-----~~~agL~i~~~~~~~~iP~~~i~~A~~~~lPli~ip~~~~ 110 (123)
T PF07905_consen 42 GGELVLTTGYALRDDDEEELREFIRELAE-----KGAAGLGIKTGRYLDEIPEEIIELADELGLPLIEIPWEVP 110 (123)
T ss_pred CCeEEEECCcccCCCCHHHHHHHHHHHHH-----CCCeEEEEeccCccccCCHHHHHHHHHcCCCEEEeCCCCC
Confidence 34555556666666 3 335666665432 12223333 3446668899999999999999999998553
No 12
>COG4907 Predicted membrane protein [Function unknown]
Probab=37.88 E-value=26 Score=31.72 Aligned_cols=33 Identities=24% Similarity=0.338 Sum_probs=16.3
Q ss_pred ccceEEeecCCCCCCCCCc-c--cchhhhhhHHHHH
Q psy16521 90 ASGITLLTFPPHKTDELQP-L--DVAVFSPFKNRLK 122 (201)
Q Consensus 90 ~~~I~l~~lP~~~t~~lQP-L--D~~~f~~lK~~~~ 122 (201)
..||++++.|...-....| + ...-+..||+.+.
T Consensus 461 ~~gIIl~~~~g~~~grw~~~~~~~~~~W~aFKnfLs 496 (595)
T COG4907 461 PLGIILFFSAGSTAGRWSKKLAETISKWQAFKNFLS 496 (595)
T ss_pred hhceeEEEeccccccccCHhHHHHHHHHHHHHHHHH
Confidence 3567777666533333333 1 2234456666554
No 13
>PRK14702 insertion element IS2 transposase InsD; Provisional
Probab=37.28 E-value=1.9e+02 Score=23.87 Aligned_cols=54 Identities=15% Similarity=0.200 Sum_probs=38.8
Q ss_pred CCCeEEEeecCcccchhhHHhhhhccceEEeecCCCCCCCCCcccchhhhhhHHH
Q psy16521 66 ENPALILMDNHETHQSMEAIEYCRASGITLLTFPPHKTDELQPLDVAVFSPFKNR 120 (201)
Q Consensus 66 ~~~~lLilD~~~sh~s~~~~~~~~~~~I~l~~lP~~~t~~lQPLD~~~f~~lK~~ 120 (201)
..+.++.-|.-+...+.++.++|++.+|...+..++... -+..--.+|+.||..
T Consensus 151 ~~~~iihSD~Gsqy~s~~~~~~l~~~gI~~Smsr~g~p~-dNa~~Erf~~tlK~E 204 (262)
T PRK14702 151 SSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPE-SNGIAESFVKTIKRD 204 (262)
T ss_pred CCCeEEEcCCCcccchHHHHHHHHHcCCeeccCCCCCCC-cchHHHHHHHHHHHH
Confidence 457788899988889999999999999999888777743 122222345555543
No 14
>PRK09409 IS2 transposase TnpB; Reviewed
Probab=32.00 E-value=3.1e+02 Score=23.10 Aligned_cols=75 Identities=12% Similarity=0.168 Sum_probs=49.1
Q ss_pred CchHHHHHHHHH-HHHhhhh-cCCCCeEEEeecCcccchhhHHhhhhccceEEeecCCCCCCCCCcccchhhhhhHHH
Q psy16521 45 SSTEIFFETIKH-IQNQIKA-SKENPALILMDNHETHQSMEAIEYCRASGITLLTFPPHKTDELQPLDVAVFSPFKNR 120 (201)
Q Consensus 45 ~t~~i~~~wl~~-f~~~~~~-~~~~~~lLilD~~~sh~s~~~~~~~~~~~I~l~~lP~~~t~~lQPLD~~~f~~lK~~ 120 (201)
++.++..+-|+. +...... ....+.++.-|.-+.+.+.++.++|++++|..-+..++... -+..--.+|+.||..
T Consensus 167 ~~~~~v~~~l~~a~~~~~~~~~~~~~~iihSDrGsqy~s~~~~~~l~~~gI~~Smsr~g~p~-dNa~~Erf~~tlK~E 243 (301)
T PRK09409 167 FNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPE-SNGIAESFVKTIKRD 243 (301)
T ss_pred CCHHHHHHHHHHHHHHHhccCCCCCCcEEecCCCcccccHHHHHHHHHcCCeEeeCCCCCCC-ccchhHHHHHHHHHH
Confidence 455555555542 3222111 13456788889988999999999999999999988887743 233333466666654
No 15
>PF12068 DUF3548: Domain of unknown function (DUF3548); InterPro: IPR021935 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes and is typically between 184 to 216 amino acids in length. The domain is found associated with PF00566 from PFAM and at the N terminus of GYP7 proteins.
Probab=29.85 E-value=1.2e+02 Score=24.62 Aligned_cols=34 Identities=9% Similarity=0.284 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHHhhhhcCCCCeEEEeecCcccch
Q psy16521 48 EIFFETIKHIQNQIKASKENPALILMDNHETHQS 81 (201)
Q Consensus 48 ~i~~~wl~~f~~~~~~~~~~~~lLilD~~~sh~s 81 (201)
+.|++.|+.++...+...+...+|+.|..+.-++
T Consensus 157 ~~fl~~L~~~v~l~~S~~D~~~~lv~~~~s~aL~ 190 (213)
T PF12068_consen 157 KEFLKSLQRYVTLARSPRDSNLYLVNDHNSEALS 190 (213)
T ss_pred HHHHHHHHhhEEEeecCCCCcEEEEECCCchHHH
Confidence 4566666665554454566777777776543333
No 16
>PLN02941 inositol-tetrakisphosphate 1-kinase
Probab=28.34 E-value=73 Score=27.53 Aligned_cols=56 Identities=18% Similarity=0.380 Sum_probs=40.4
Q ss_pred hhhHHhhhhccceEEeecCCCCCC-CCCcccchhhhhhHHHHHHhHHHHhhcCCCCc
Q psy16521 81 SMEAIEYCRASGITLLTFPPHKTD-ELQPLDVAVFSPFKNRLKAGFKKWTSDNPGSI 136 (201)
Q Consensus 81 s~~~~~~~~~~~I~l~~lP~~~t~-~lQPLD~~~f~~lK~~~~~~~~~~~~~~~~~~ 136 (201)
...+++.|++++|.++.+-....- -..|.|+-+-+.....|++.+..|+..+++-.
T Consensus 38 ~~~l~~~~~~~Gi~~v~Id~~~pl~~qgpfDvilhK~~~~~~~~~~~~~~~e~pgv~ 94 (328)
T PLN02941 38 QPSLEALARSKGIDLVAIDPSRPLSEQGPFDVILHKLYGKEWRQQLEEYREKHPDVT 94 (328)
T ss_pred hHHHHHHHHHCCCeEEEecCCCCccccCCcCEEEEecCCHHHHHHHHHHHHHCCCcE
Confidence 345677788888888777555321 12388888778788888999999998887653
No 17
>PF05770 Ins134_P3_kin: Inositol 1, 3, 4-trisphosphate 5/6-kinase; InterPro: IPR008656 This entry represents inositol-tetrakisphosphate 1-kinase which is also called inositol 1,3,4-trisphosphate 5/6-kinase. Inositol-tetrakisphosphate 1-kinase can phosphorylate various inositol polyphosphate such as Ins(3,4,5,6)P4 or Ins(1,3,4)P3. This enzyme phosphorylates Ins(3,4,5,6)P4 at position 1 to form Ins(1,3,4,5,6)P5. This reaction is thought to have regulatory importance, since Ins(3,4,5,6)P4 is an inhibitor of plasma membrane Ca(2+)-activated Cl(-) channels, while Ins(1,3,4,5,6)P5 is not. It also phosphorylates Ins(1,3,4)P3 on O-5 and O-6 to form Ins(1,3,4,6)P4, an essential molecule in the hexakisphosphate (InsP6) pathway [, , , , ].; GO: 0000287 magnesium ion binding, 0005524 ATP binding, 0047325 inositol tetrakisphosphate 1-kinase activity, 0052725 inositol-1,3,4-trisphosphate 6-kinase activity, 0052726 inositol-1,3,4-trisphosphate 5-kinase activity, 0032957 inositol trisphosphate metabolic process, 0005622 intracellular; PDB: 1Z2P_X 1Z2O_X 1Z2N_X 2Q7D_A 2QB5_B 2ODT_X.
Probab=27.60 E-value=43 Score=28.70 Aligned_cols=55 Identities=18% Similarity=0.312 Sum_probs=39.0
Q ss_pred hhHHhhhhccceEEeecCCCCCC-CCCcccchhhhhhHHHHHHhHHHHhhcCCCCc
Q psy16521 82 MEAIEYCRASGITLLTFPPHKTD-ELQPLDVAVFSPFKNRLKAGFKKWTSDNPGSI 136 (201)
Q Consensus 82 ~~~~~~~~~~~I~l~~lP~~~t~-~lQPLD~~~f~~lK~~~~~~~~~~~~~~~~~~ 136 (201)
..+..+|+++||.++.+=....- -.+|+|+-+=+--...|.+.+.+|+..+|+-.
T Consensus 25 ~~~~~~~~~~gi~~v~id~~~pl~~QgpfDvIlHKltd~~~~~~l~~y~~~hP~v~ 80 (307)
T PF05770_consen 25 PSFIDLARSRGIDFVPIDLSKPLEEQGPFDVILHKLTDEDWVQQLEEYIKKHPEVV 80 (307)
T ss_dssp CHHCCCCCCCTTEEEEEECCSSSGCC--SCEEEE--CHCHHHHHHHHHHHH-TTSE
T ss_pred HHHHHHHHhcCCEEEEcCCCCCcccCCCcEEEEEeCCCHHHHHHHHHHHHHCCCeE
Confidence 46778899999998887765442 23599997777777889999999999988764
No 18
>PRK08857 para-aminobenzoate synthase component II; Provisional
Probab=24.83 E-value=76 Score=24.78 Aligned_cols=31 Identities=16% Similarity=0.339 Sum_probs=19.0
Q ss_pred EEEeecCcccchhhHHhhhhccceEEeecCCC
Q psy16521 70 LILMDNHETHQSMEAIEYCRASGITLLTFPPH 101 (201)
Q Consensus 70 lLilD~~~sh~s~~~~~~~~~~~I~l~~lP~~ 101 (201)
+|++|++.+. +..+.+++++.++.+..+|..
T Consensus 2 il~id~~dsf-t~~~~~~l~~~g~~~~~~~~~ 32 (193)
T PRK08857 2 LLMIDNYDSF-TYNLYQYFCELGAQVKVVRND 32 (193)
T ss_pred EEEEECCCCc-HHHHHHHHHHCCCcEEEEECC
Confidence 5666777766 344555666666666666654
No 19
>PRK13600 putative ribosomal protein L7Ae-like; Provisional
Probab=24.02 E-value=1.2e+02 Score=20.71 Aligned_cols=26 Identities=12% Similarity=0.101 Sum_probs=21.2
Q ss_pred cCcccchhhHHhhhhccceEEeecCC
Q psy16521 75 NHETHQSMEAIEYCRASGITLLTFPP 100 (201)
Q Consensus 75 ~~~sh~s~~~~~~~~~~~I~l~~lP~ 100 (201)
....++...+..+|++++|-+.++|.
T Consensus 38 Da~~~vv~~l~~lceek~Ip~v~V~s 63 (84)
T PRK13600 38 DVEVYLMTRVLSQINQKNIPVSFFKS 63 (84)
T ss_pred CCCHHHHHHHHHHHHHcCCCEEEECC
Confidence 33456677899999999999999875
No 20
>COG5019 CDC3 Septin family protein [Cell division and chromosome partitioning / Cytoskeleton]
Probab=21.73 E-value=1.4e+02 Score=26.38 Aligned_cols=55 Identities=16% Similarity=0.266 Sum_probs=29.5
Q ss_pred HHHHHHHH-HHHHhhhhcCCCCeE-EEeecCcccchhhHHhhhhccceEEeecCCCCCCCCCcccchhhhhh
Q psy16521 48 EIFFETIK-HIQNQIKASKENPAL-ILMDNHETHQSMEAIEYCRASGITLLTFPPHKTDELQPLDVAVFSPF 117 (201)
Q Consensus 48 ~i~~~wl~-~f~~~~~~~~~~~~l-LilD~~~sh~s~~~~~~~~~~~I~l~~lP~~~t~~lQPLD~~~f~~l 117 (201)
+...+|++ .|..++.....-++- .+.|. .-.+.++||+|.. |-+.|+|+...+.+
T Consensus 103 e~I~~yI~~q~d~yl~~E~~~~R~~~~~D~--------------RVH~cLYFI~Ptg-h~l~~~DIe~Mk~l 159 (373)
T COG5019 103 EPIVDYIDDQFDQYLDEEQKIKRNPKFKDT--------------RVHACLYFIRPTG-HGLKPLDIEAMKRL 159 (373)
T ss_pred HHHHHHHHHHHHHHHHHhhccccccccccC--------------ceEEEEEEecCCC-CCCCHHHHHHHHHH
Confidence 46678887 477777542111111 12222 1345666777654 44778887765544
No 21
>COG3545 Predicted esterase of the alpha/beta hydrolase fold [General function prediction only]
Probab=21.20 E-value=3e+02 Score=21.76 Aligned_cols=74 Identities=11% Similarity=0.197 Sum_probs=44.7
Q ss_pred cCcEEEeccccC------ccccccCCCCCceEEecCCCCCchHHHHHHHHHHHHhhhhcCCCCeEEEeecCcccchhhHH
Q psy16521 12 IPPVFVFPRVFM------KDEFLLGAPEGSLGFANKSGWSSTEIFFETIKHIQNQIKASKENPALILMDNHETHQSMEAI 85 (201)
Q Consensus 12 lpP~iI~~~~~~------~~~~~~~~~~~~~~~~s~~Gw~t~~i~~~wl~~f~~~~~~~~~~~~lLilD~~~sh~s~~~~ 85 (201)
.+.++|++|-.. ++.|...+|. +++. ...- -......+|+..+...+... +++++|+--+.++-.-....
T Consensus 2 ~~~~lIVpG~~~Sg~~HWq~~we~~l~~-a~rv-eq~~-w~~P~~~dWi~~l~~~v~a~-~~~~vlVAHSLGc~~v~h~~ 77 (181)
T COG3545 2 MTDVLIVPGYGGSGPNHWQSRWESALPN-ARRV-EQDD-WEAPVLDDWIARLEKEVNAA-EGPVVLVAHSLGCATVAHWA 77 (181)
T ss_pred CceEEEecCCCCCChhHHHHHHHhhCcc-chhc-ccCC-CCCCCHHHHHHHHHHHHhcc-CCCeEEEEecccHHHHHHHH
Confidence 356788877432 2333344553 4433 3333 45567899999888877654 77889887777766544444
Q ss_pred hhhh
Q psy16521 86 EYCR 89 (201)
Q Consensus 86 ~~~~ 89 (201)
..+.
T Consensus 78 ~~~~ 81 (181)
T COG3545 78 EHIQ 81 (181)
T ss_pred Hhhh
Confidence 4433
No 22
>PHA02732 hypothetical protein; Provisional
Probab=21.08 E-value=48 Score=32.11 Aligned_cols=27 Identities=33% Similarity=0.645 Sum_probs=17.1
Q ss_pred CcccCCCCCCCcceee--eeeeeeeeeec
Q psy16521 167 DTATTPIPPPTTTTTT--TTTTTTTGVGG 193 (201)
Q Consensus 167 ~~~~tp~~~~~~~~~~--t~~~~~~~~~~ 193 (201)
....|||||+++++++ ++.++++|++.
T Consensus 1235 ~aatt~tpp~sssssssa~sistsp~~~q 1263 (1467)
T PHA02732 1235 PAATTPTPPPSSSSSSSAQSISTSPGQIQ 1263 (1467)
T ss_pred CcccCCCCCCCcccccccccccCCCCeEE
Confidence 4456888888765444 45555677664
Done!