Query psy9993
Match_columns 605
No_of_seqs 314 out of 334
Neff 4.9
Searched_HMMs 46136
Date Sat Aug 17 00:53:52 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9993.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9993hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG5651 PPE-repeat proteins [C 99.2 1E-09 2.3E-14 121.5 20.2 47 113-159 126-175 (490)
2 COG5651 PPE-repeat proteins [C 96.5 0.087 1.9E-06 59.1 16.9 11 123-133 96-106 (490)
3 PF01744 GLTT: GLTT repeat (6 85.3 0.77 1.7E-05 32.8 2.4 20 431-450 4-23 (29)
4 PF01744 GLTT: GLTT repeat (6 84.0 0.95 2.1E-05 32.3 2.3 25 439-463 2-26 (29)
5 PF00823 PPE: PPE family; Int 78.1 0.42 9.1E-06 45.7 -1.5 34 112-145 121-157 (159)
6 PTZ00395 Sec24-related protein 52.3 28 0.0006 44.2 6.3 79 29-107 394-472 (1560)
7 PTZ00395 Sec24-related protein 33.3 55 0.0012 41.7 4.9 48 50-97 405-452 (1560)
8 PF10221 DUF2151: Cell cycle a 22.0 15 0.00031 43.7 -2.5 30 569-598 38-67 (695)
9 PF10572 UPF0556: Uncharacteri 11.2 1.2E+02 0.0027 29.8 1.2 14 587-600 75-88 (158)
10 KOG3110|consensus 7.7 1.7E+02 0.0037 28.4 0.6 16 589-604 71-86 (153)
No 1
>COG5651 PPE-repeat proteins [Cell motility and secretion]
Probab=99.19 E-value=1e-09 Score=121.52 Aligned_cols=47 Identities=15% Similarity=0.229 Sum_probs=41.8
Q ss_pred cCCChHHHH---HHHhhhcccCCCccccccccchhhhhcccccccccccc
Q psy9993 113 LGYSASEID---SELNYSWNYEPNGMAHYSRSDLLTDRGLTKLGLTDRGL 159 (605)
Q Consensus 113 ~G~~~~~i~---~~~~~~w~~dv~~m~gy~~~asaaAaaL~~~G~t~~GL 159 (605)
+|++.++|+ +.|+|||+||+++|..||..+++++++|.+|..+.+-.
T Consensus 126 fGqNt~aIaa~Ea~Y~~MW~qda~aM~~yy~~~saaa~~l~p~~~p~~~~ 175 (490)
T COG5651 126 FGQNTPAIAATEAQYAEMWAQDAAAMYGYYAAASAAASALTPFNEPPPTT 175 (490)
T ss_pred ccchhHHHhhhhcchHHHHHHHHhhhhhhhHHHHHHHHhcCCCCCCcccc
Confidence 899999985 56899999999999999999999999999998665543
No 2
>COG5651 PPE-repeat proteins [Cell motility and secretion]
Probab=96.53 E-value=0.087 Score=59.12 Aligned_cols=11 Identities=0% Similarity=-0.185 Sum_probs=5.4
Q ss_pred HHhhhcccCCC
Q psy9993 123 ELNYSWNYEPN 133 (605)
Q Consensus 123 ~~~~~w~~dv~ 133 (605)
+||.-.+..|.
T Consensus 96 aye~A~a~tv~ 106 (490)
T COG5651 96 AYEAALAATVP 106 (490)
T ss_pred HHHHHHHcCCC
Confidence 45555554444
No 3
>PF01744 GLTT: GLTT repeat (6 copies); InterPro: IPR008164 This short repeat of unknown function is found in multiple copies in several Caenorhabditis elegans proteins. The repeat is five residues long and consists of XGLTT where X can be any amino acid.
Probab=85.32 E-value=0.77 Score=32.80 Aligned_cols=20 Identities=50% Similarity=0.826 Sum_probs=7.2
Q ss_pred CCCCCCCCCCCCCCCCCCCC
Q psy9993 431 NLGLTDRGLTNLGLTDRGLT 450 (605)
Q Consensus 431 n~G~~~~Gl~~~G~~~~Gl~ 450 (605)
.+|+...||+.+|+...|++
T Consensus 4 ~~GL~t~GL~t~GL~t~Gl~ 23 (29)
T PF01744_consen 4 TMGLTTLGLVTMGLTTLGLV 23 (29)
T ss_pred cccccccCcccCCccccccc
Confidence 33333333333333333333
No 4
>PF01744 GLTT: GLTT repeat (6 copies); InterPro: IPR008164 This short repeat of unknown function is found in multiple copies in several Caenorhabditis elegans proteins. The repeat is five residues long and consists of XGLTT where X can be any amino acid.
Probab=84.02 E-value=0.95 Score=32.35 Aligned_cols=25 Identities=48% Similarity=0.846 Sum_probs=10.4
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCC
Q psy9993 439 LTNLGLTDRGLTNLGLTDRGLTNLG 463 (605)
Q Consensus 439 l~~~G~~~~Gl~n~G~~n~Gl~n~G 463 (605)
|+.+|+...|++.+|+...|++.+|
T Consensus 2 L~~~GL~t~GL~t~GL~t~Gl~t~G 26 (29)
T PF01744_consen 2 LVTMGLTTLGLVTMGLTTLGLVTMG 26 (29)
T ss_pred cccccccccCcccCCccccccccCC
Confidence 3344444444444444444444433
No 5
>PF00823 PPE: PPE family; InterPro: IPR000030 The human pathogen Mycobacterium tuberculosis harbours a large number of genes that encode proteins whose N-termini contain the characteristic motifs Pro-Glu (PE) or Pro-Pro-Glu (PPE). A subgroup of the PE proteins contains polymorphic GC-rich sequences (PGRS), while a subgroup of the PPE proteins contains major polymorphic tandem repeats (MPTR). The function of most of these proteins remains unknown []. However, the PE_PGRS proteins from Mycobacterium marinum are secreted by components of the ESX-5 system that belongs to the recently defined type VII secretion systems []. It has also been reported that the PE_PGRS family of proteins contains multiple calcium-binding and glycine-rich sequence motifs GGXGXD/NXUX. This sequence repeat constitutes a calcium-binding parallel beta-roll or parallel beta-helix structure and is found in RTX toxins secreted by many Gram-negative bacteria []. This mycobacterial family is named after a conserved amino-terminal region of about 180 amino acids, the PPE motif. The carboxy termini of proteins belonging to the PPE family are variable, and on the basis of this region at least three groups can be distinguished. The MPTR subgroup is characterised by tandem copies of a motif NXGXGNXG. The second subgroup contains a conserved motif at about position 350. The third group shares only similarity in the amino terminal region. The function of these proteins is uncertain but it has been suggested that they may be related to antigenic variation of Mycobacterium tuberculosis [].; PDB: 2G38_B.
Probab=78.11 E-value=0.42 Score=45.67 Aligned_cols=34 Identities=18% Similarity=0.313 Sum_probs=28.0
Q ss_pred ccCCChHHHH---HHHhhhcccCCCccccccccchhh
Q psy9993 112 ALGYSASEID---SELNYSWNYEPNGMAHYSRSDLLT 145 (605)
Q Consensus 112 ~~G~~~~~i~---~~~~~~w~~dv~~m~gy~~~asaa 145 (605)
-+|++.+.|+ ++|++||.+++.+|..|+..+..+
T Consensus 121 ~~G~~t~~ia~~ea~y~~~~~qaa~vM~~Y~aa~~~a 157 (159)
T PF00823_consen 121 FFGQNTPAIAALEAEYEEMWAQAAAVMTAYEAASTAA 157 (159)
T ss_dssp TTSTTHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3789988874 589999999999999998664433
No 6
>PTZ00395 Sec24-related protein; Provisional
Probab=52.29 E-value=28 Score=44.20 Aligned_cols=79 Identities=14% Similarity=0.222 Sum_probs=48.0
Q ss_pred ccccccccccccccCCcccccccccccccccccccCcccCccccccccccccccccccccccccCcccccccccccccC
Q psy9993 29 LTDRGLTNLGLTDRGLTNLGLTDRGLTNLGLTDRGLTNLGLTDRVLTNLGLTDRGLTNLGLTDRGLTNLGLTDRDVTDD 107 (605)
Q Consensus 29 ~~~~g~~n~G~~~~g~~n~G~~~~g~~n~G~~n~g~~n~G~~~~~l~n~Gf~n~gl~n~Gf~~~g~~n~Gf~~~~~~~~ 107 (605)
|.+.-..++-|+|++..|+|++|.+..|.+|.|....|+.|++....|+=|+|.-..|.=|.+.-..|.=|.++-+.+.
T Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~n~~~~n~~~~n~p~~n~p~~n~p~~n~p~~n~p~~n~~~~n~ 472 (1560)
T PTZ00395 394 YSNAAQSNAAQSNAGFSNAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNA 472 (1560)
T ss_pred hcccccchhhhcccccccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcCCCC
Confidence 3334444555666666666666666666666666666666666666666666666666666665555555555555544
No 7
>PTZ00395 Sec24-related protein; Provisional
Probab=33.34 E-value=55 Score=41.70 Aligned_cols=48 Identities=19% Similarity=0.272 Sum_probs=18.6
Q ss_pred ccccccccccccccCcccCccccccccccccccccccccccccCcccc
Q psy9993 50 TDRGLTNLGLTDRGLTNLGLTDRVLTNLGLTDRGLTNLGLTDRGLTNL 97 (605)
Q Consensus 50 ~~~g~~n~G~~n~g~~n~G~~~~~l~n~Gf~n~gl~n~Gf~~~g~~n~ 97 (605)
+|.+..++|+.|.+..|.+|++....|+.|+|+-..|.=|.+.-..|.
T Consensus 405 ~~~~~~~~~~~~~~~~~~~~~n~~~~n~~~~n~~~~n~p~~n~p~~n~ 452 (1560)
T PTZ00395 405 SNAGFSNAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNP 452 (1560)
T ss_pred cccccccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
Confidence 333333333333333333444333334444444444333333333333
No 8
>PF10221 DUF2151: Cell cycle and development regulator; InterPro: IPR019355 This entry represents the cell cycle regulator Mat89b, which plays an evolutionarily conserved role as a crucial regulator of both cell cycle and development []. Mat89Bb is a PNG kinase substrate that is essential for S-M cycles of early Drosophila embryogenesis, Xenopus embryonic cell cycles and morphogenesis, and cell division in cultured mammalian cells.
Probab=21.97 E-value=15 Score=43.68 Aligned_cols=30 Identities=17% Similarity=0.398 Sum_probs=25.2
Q ss_pred chhhhhhcceeeeeccceeeeeeccCCccc
Q psy9993 569 NIDTLRLLYTSLDRSKLEYATVIWAPKSKS 598 (605)
Q Consensus 569 ~~stlk~LY~slVRp~LEY~s~vW~P~~~~ 598 (605)
....-|.|+..-|-..+|||+||||=|...
T Consensus 38 ~~~i~KSLWTc~vEa~~EYcRIV~DlFP~~ 67 (695)
T PF10221_consen 38 KAPISKSLWTCAVEASIEYCRIVWDLFPDG 67 (695)
T ss_pred cCcccchHHHHHHHHHHHHHHHHhhccCCC
Confidence 445567999999999999999999987654
No 9
>PF10572 UPF0556: Uncharacterised protein family UPF0556; InterPro: IPR018887 This family of proteins has no known function.
Probab=11.25 E-value=1.2e+02 Score=29.81 Aligned_cols=14 Identities=50% Similarity=1.274 Sum_probs=0.0
Q ss_pred eeeeeccCCccccc
Q psy9993 587 YATVIWAPKSKSYV 600 (605)
Q Consensus 587 Y~s~vW~P~~~~~~ 600 (605)
|.|+||||..+.|+
T Consensus 75 ~sC~I~RPqgkSYL 88 (158)
T PF10572_consen 75 YSCIIWRPQGKSYL 88 (158)
T ss_pred eEEEEECCCCCcEE
No 10
>KOG3110|consensus
Probab=7.70 E-value=1.7e+02 Score=28.40 Aligned_cols=16 Identities=25% Similarity=0.482 Sum_probs=0.0
Q ss_pred eeeccCCccccccccc
Q psy9993 589 TVIWAPKSKSYVSLLE 604 (605)
Q Consensus 589 s~vW~P~~~~~~~~lE 604 (605)
|+=|+||++.+.+.+|
T Consensus 71 SIGwNP~Y~N~~Kt~E 86 (153)
T KOG3110|consen 71 SIGWNPYYKNKKKTME 86 (153)
T ss_pred EcccCcccCCccccee
Done!