Query psy4031
Match_columns 115
No_of_seqs 113 out of 202
Neff 5.1
Searched_HMMs 46136
Date Fri Aug 16 23:38:03 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4031.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4031hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF04597 Ribophorin_I: Ribopho 100.0 3.7E-46 8.1E-51 312.5 7.8 111 4-114 289-418 (432)
2 KOG2291|consensus 100.0 1.3E-44 2.7E-49 309.6 6.0 111 4-114 315-443 (602)
3 PRK13986 urease subunit alpha; 65.4 26 0.00057 28.1 6.2 57 33-89 76-137 (225)
4 PF05986 ADAM_spacer1: ADAM-TS 62.1 6.2 0.00013 27.6 1.9 14 41-54 19-32 (114)
5 PRK13192 bifunctional urease s 59.7 39 0.00085 26.8 6.2 57 33-89 75-141 (208)
6 cd06495 p23_NUDCD3_like p23-li 57.5 13 0.00028 25.9 2.9 23 32-54 10-35 (102)
7 cd06493 p23_NUDCD1_like p23_NU 48.5 20 0.00043 23.3 2.6 22 33-54 5-28 (85)
8 PF14903 WG_beta_rep: WG conta 46.4 31 0.00068 18.1 2.8 24 69-92 2-25 (35)
9 PF09972 DUF2207: Predicted me 40.9 77 0.0017 26.0 5.5 62 32-102 128-190 (511)
10 PHA02090 hypothetical protein 40.6 14 0.0003 24.7 0.8 13 12-24 48-61 (79)
11 PF08968 DUF1885: Domain of un 39.5 20 0.00043 26.5 1.6 15 36-50 82-96 (130)
12 cd06467 p23_NUDC_like p23_like 37.4 46 0.00099 21.0 2.9 21 34-54 6-28 (85)
13 cd06492 p23_mNUDC_like p23-lik 34.7 54 0.0012 21.7 3.1 23 32-54 4-30 (87)
14 KOG2566|consensus 33.7 80 0.0017 27.9 4.6 51 33-88 437-488 (518)
15 PF10976 DUF2790: Protein of u 30.9 68 0.0015 21.5 3.0 31 48-79 36-66 (78)
16 TIGR03000 plancto_dom_1 Planct 30.7 37 0.00081 22.8 1.7 14 37-50 2-15 (75)
17 PF13860 FlgD_ig: FlgD Ig-like 29.6 80 0.0017 20.3 3.2 17 37-53 13-29 (81)
18 PF12503 CMV_1a_C: Cucumber mo 27.5 32 0.0007 23.7 1.0 14 12-25 38-55 (85)
19 PF07617 DUF1579: Protein of u 26.3 2E+02 0.0044 21.4 5.2 35 15-50 64-98 (159)
20 KOG3179|consensus 25.0 28 0.0006 28.1 0.3 35 42-78 159-193 (245)
21 PF04525 Tub_2: Tubby C 2; In 24.9 71 0.0015 23.6 2.5 42 43-87 33-74 (187)
22 cd01777 SNX27_RA Ubiquitin dom 24.9 55 0.0012 22.6 1.8 14 37-50 1-14 (87)
23 COG4824 Phage-related holin (L 24.5 18 0.0004 26.8 -0.7 10 10-19 19-28 (133)
24 PF13716 CRAL_TRIO_2: Divergen 23.2 54 0.0012 22.7 1.6 16 72-87 10-25 (149)
25 KOG3264|consensus 23.1 55 0.0012 26.1 1.7 17 38-54 171-187 (221)
26 PF08487 VIT: Vault protein in 23.1 1.2E+02 0.0026 20.9 3.3 37 63-102 7-43 (118)
27 COG4856 Uncharacterized protei 22.2 65 0.0014 27.9 2.0 17 36-52 288-304 (403)
28 PF10515 APP_amyloid: beta-amy 22.0 41 0.00088 21.2 0.6 11 20-30 41-51 (52)
29 PF08089 Toxin_20: Huwentoxin- 22.0 25 0.00054 20.8 -0.3 10 11-20 26-35 (39)
30 PF13349 DUF4097: Domain of un 21.7 1.4E+02 0.0029 20.9 3.4 21 36-56 118-140 (166)
31 cd06494 p23_NUDCD2_like p23-li 21.5 1.1E+02 0.0025 20.6 2.8 22 32-53 11-32 (93)
32 PF08813 Phage_tail_3: Phage t 21.5 1.1E+02 0.0023 22.9 2.9 35 16-50 90-128 (165)
33 PF14879 DUF4489: Domain of un 20.6 2.2E+02 0.0048 21.2 4.4 53 41-103 12-67 (141)
34 PRK11615 hypothetical protein; 20.4 1.8E+02 0.004 22.7 4.0 42 39-85 49-91 (185)
No 1
>PF04597 Ribophorin_I: Ribophorin I; InterPro: IPR007676 Ribophorin I is an essential subunit of oligosaccharyltransferase (OST), which is also known as dolichyl-diphosphooligosaccharide--protein glycosyltransferase, (2.4.1.119 from EC). OST catalyses the transfer of an oligosaccharide from dolichol pyrophosphate to selected asparagine residues of nascent polypeptides as they are translocated into the lumen of the rough endoplasmic reticulum. Ribophorin I and OST48 are thought to be responsible for OST catalytic activity []. Both yeast and mammalian proteins are glycosylated but the sites are not conserved. Glycosylation may contribute towards general solubility but is unlikely to be involved in a specific biochemical function []. Most family members are predicted to have a transmembrane helix at the C terminus of this region.; GO: 0004579 dolichyl-diphosphooligosaccharide-protein glycotransferase activity, 0006486 protein glycosylation, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=100.00 E-value=3.7e-46 Score=312.50 Aligned_cols=111 Identities=32% Similarity=0.517 Sum_probs=108.7
Q ss_pred ccccceeccCcccceEeeecCChhhhcc------------------ccceeeEEEEEEcCCCeeeeeEeCCcceeecccc
Q psy4031 4 ARVTWARSRYWHALYIPGHRRSPYDYAA------------------RAQPRNAKVNIILPEHSLVRQIHTPYPVSSQSDG 65 (115)
Q Consensus 4 ~~~~~plfGGWky~ftiGyn~pl~~~l~------------------d~~~d~~~l~ViLPEGA~~i~v~~P~~v~~~~~~ 65 (115)
.||||||||||||+|++|||+|+++||+ |++||+++++|+|||||+||+|.+|+++++.+++
T Consensus 289 l~PRfPLfGGWk~~FtiGyn~p~~~~l~~~~~~~y~L~vp~~~~~~d~~~d~~~l~i~LPEGA~~i~v~~P~~~~~~~~~ 368 (432)
T PF04597_consen 289 LKPRFPLFGGWKYNFTIGYNLPLSNFLRKSGDGRYVLKVPFLPGIKDIVYDNVELRIILPEGAKNIKVSSPFPVDSVSVS 368 (432)
T ss_pred EEcCCcccCCcceeEEEEccCChHHhEEECCCCcEEEEEECcCCcCceEEEEEEEEEECCCCceeeeEeCCccceeeecc
Confidence 6899999999999999999999999999 9999999999999999999999999999999999
Q ss_pred ceEEeccCCCCcEEEEEeccCcccCc-ceEEEEEEeCchhhcccceeeee
Q psy4031 66 THSTYLDTAGRPMITLRAHNLVENHI-EDIEIEYEFSKSYLGKEPLLRAF 114 (115)
Q Consensus 66 ~~~TYLDt~GRp~v~l~~~Nlvd~h~-~~i~V~Y~~~~~~~l~kPl~i~~ 114 (115)
.++||||+.|||+|+++++||+|+|+ ++|+|+|+|+..+||+|||+|++
T Consensus 369 ~~~tyLDt~GR~vv~l~~~nlvd~~~~~~~~v~Y~~~~~~~~~kPl~i~~ 418 (432)
T PF04597_consen 369 THKTYLDTTGRPVVVLEKKNLVDEHNDQDFQVTYTYPSSAMLRKPLLIAG 418 (432)
T ss_pred ceeeeeeccCceEEEEEeccCCHhHCCeeEEEEEEcCHHHHHHHHHHHHH
Confidence 99999999999999999999999988 99999999999999999999875
No 2
>KOG2291|consensus
Probab=100.00 E-value=1.3e-44 Score=309.64 Aligned_cols=111 Identities=31% Similarity=0.477 Sum_probs=108.6
Q ss_pred ccccceeccCcccceEeeecCChhhhcc------------------ccceeeEEEEEEcCCCeeeeeEeCCcceeecccc
Q psy4031 4 ARVTWARSRYWHALYIPGHRRSPYDYAA------------------RAQPRNAKVNIILPEHSLVRQIHTPYPVSSQSDG 65 (115)
Q Consensus 4 ~~~~~plfGGWky~ftiGyn~pl~~~l~------------------d~~~d~~~l~ViLPEGA~~i~v~~P~~v~~~~~~ 65 (115)
.||||||||||||+|+||||+|+++||+ |++||+++++|+|||||+||++.+||++.+.+.+
T Consensus 315 irPRfPlFGGWkt~ftiGy~lP~~eyl~~~g~ry~L~~~~~~~~~d~~V~dkl~ikvvLPEGak~i~i~tP~~is~~p~e 394 (602)
T KOG2291|consen 315 IRPRFPLFGGWKTNFTIGYNLPLEEYLFSKGRRYALKIILIDHIFDDTVYDKLTIKVVLPEGAKDIEIDTPYEISRSPIE 394 (602)
T ss_pred eccCCccccCceeeEEEecCCcHHHHhhccCceeEEccccccCCCccceeeeEEEEEEccCCCcccccccceeeccCchh
Confidence 5999999999999999999999999999 6799999999999999999999999999999999
Q ss_pred ceEEeccCCCCcEEEEEeccCcccCcceEEEEEEeCchhhcccceeeee
Q psy4031 66 THSTYLDTAGRPMITLRAHNLVENHIEDIEIEYEFSKSYLGKEPLLRAF 114 (115)
Q Consensus 66 ~~~TYLDt~GRp~v~l~~~Nlvd~h~~~i~V~Y~~~~~~~l~kPl~i~~ 114 (115)
+++|||||.|||+++++|+|+++.|+++|+|+|+|++..||+|||+|++
T Consensus 395 ~~~syLDt~GR~Vvv~ek~Nvv~~h~~~i~v~Y~f~~~sml~ePL~i~a 443 (602)
T KOG2291|consen 395 LKYSYLDTNGRPVVVLEKNNVVPDHNQDIVVHYTFSKSSMLQEPLLIIA 443 (602)
T ss_pred hhhhhhhccCcEEEEEEccccCCCCCccEEEEEEechhHhhhccHHHHH
Confidence 9999999999999999999999999999999999999999999999975
No 3
>PRK13986 urease subunit alpha; Provisional
Probab=65.40 E-value=26 Score=28.09 Aligned_cols=57 Identities=14% Similarity=0.304 Sum_probs=41.3
Q ss_pred cceeeEEEEEEcCCCeeeeeEeCCcceeeccccceEEecc-----CCCCcEEEEEeccCccc
Q psy4031 33 AQPRNAKVNIILPEHSLVRQIHTPYPVSSQSDGTHSTYLD-----TAGRPMITLRAHNLVEN 89 (115)
Q Consensus 33 ~~~d~~~l~ViLPEGA~~i~v~~P~~v~~~~~~~~~TYLD-----t~GRp~v~l~~~Nlvd~ 89 (115)
-.++++.+.-.+|.|.+-|.|+-|+.-+..-.--.+-+-| ..||+.++++-.|.-|.
T Consensus 76 ~ml~~vqVEatFpDGTkLVtvh~PI~~~~~~~PGe~~~~~~~I~lN~gr~~~~l~V~NtGDR 137 (225)
T PRK13986 76 SMIHEVGIEAMFPDGTKLVTVHTPIEANGKLVPGELFLKDEDITINAGKKAVSVKVKNVGDR 137 (225)
T ss_pred HhcceeeEEEEcCCCCEEEEeCCCcCCCCCCCCceEecCCCCeecCCCCcEEEEEEEeCCCC
Confidence 3578899999999999999999999522111111222222 46999999999998775
No 4
>PF05986 ADAM_spacer1: ADAM-TS Spacer 1; InterPro: IPR010294 This domain represents the Spacer-1 domain from the ADAM-TS family of metalloproteinases []. A cellular disintegrin and metalloproteinase (ADAM) is a family of genes with structural homology to the snake venom metalloproteinases and disintegrins []. There is variation amongst members of the family, however, all have a similar domain organisation comprising a preproregion, a reprolysin-type catalytic domain, a disintegrin-like domain, a thrombospondin type-1 (TS) module, a cysteine-rich domain, a spacer domain without cysteine residues, and a COOH-terminal TS module [, ]. They are involved in embryogenesis and have been implicated in some cancers and inflammatory diseases [].; GO: 0004222 metalloendopeptidase activity, 0031012 extracellular matrix
Probab=62.10 E-value=6.2 Score=27.56 Aligned_cols=14 Identities=14% Similarity=0.211 Sum_probs=12.1
Q ss_pred EEEcCCCeeeeeEe
Q psy4031 41 NIILPEHSLVRQIH 54 (115)
Q Consensus 41 ~ViLPEGA~~i~v~ 54 (115)
-+.+|.||++|+|.
T Consensus 19 v~~IP~GA~nI~I~ 32 (114)
T PF05986_consen 19 VVTIPAGARNIRIT 32 (114)
T ss_pred EEECCCCceEEEEE
Confidence 47899999999884
No 5
>PRK13192 bifunctional urease subunit gamma/beta; Reviewed
Probab=59.69 E-value=39 Score=26.81 Aligned_cols=57 Identities=21% Similarity=0.282 Sum_probs=42.5
Q ss_pred cceeeEEEEEEcCCCeeeeeEeCCcceeecccc-----ceEEecc-----CCCCcEEEEEeccCccc
Q psy4031 33 AQPRNAKVNIILPEHSLVRQIHTPYPVSSQSDG-----THSTYLD-----TAGRPMITLRAHNLVEN 89 (115)
Q Consensus 33 ~~~d~~~l~ViLPEGA~~i~v~~P~~v~~~~~~-----~~~TYLD-----t~GRp~v~l~~~Nlvd~ 89 (115)
-.++++.+.-.+|.|-+-|.|+-|+.-+..+.+ -..-.-| ..||+.++++-.|.-|.
T Consensus 75 ~ml~~vqVEatFpDGTkLVtvh~PI~~~~~~~~~al~PGei~~~~~~I~lN~gr~~~~l~V~NtGDR 141 (208)
T PRK13192 75 DMVPEVQVEATFPDGTKLVTVHDPIRPAEGDLADALYPGEILPGDGEIELNAGRPAVTLDVTNTGDR 141 (208)
T ss_pred hhcceeeEEEEcCCCCEEEEecCCcCCCCCchhhccCCCEEEcCCCCeeeCCCCCEEEEEEEeCCCC
Confidence 357889999999999999999999975433321 1222222 47999999999998775
No 6
>cd06495 p23_NUDCD3_like p23-like NUD (nuclear distribution) C-like domain found in human NUDC domain-containing protein 3 (NUDCD3) and similar proteins. Little is known about the function of the proteins in this subgroup.
Probab=57.53 E-value=13 Score=25.85 Aligned_cols=23 Identities=9% Similarity=0.209 Sum_probs=19.2
Q ss_pred ccceeeEEEEEEcCCC---eeeeeEe
Q psy4031 32 RAQPRNAKVNIILPEH---SLVRQIH 54 (115)
Q Consensus 32 d~~~d~~~l~ViLPEG---A~~i~v~ 54 (115)
.+..++++++|=||+| |+++++.
T Consensus 10 tQTl~eV~V~i~lp~~~~~~kdv~v~ 35 (102)
T cd06495 10 SQDYTDVEVRVPVPKDVVKGRQVSVD 35 (102)
T ss_pred EeECCeEEEEEECCCCCccceEEEEE
Confidence 7788999999999998 5676653
No 7
>cd06493 p23_NUDCD1_like p23_NUDCD1: p23-like NUD (nuclear distribution) C-like domain found in human NUD (nuclear distribution) C domain-containing protein 1, NUDCD1 (also known as CML66), and similar proteins. NUDCD1/CML66 is a broadly immunogenic tumor associated antigen, which is highly expressed in a variety of solid tumors and in leukemias. In normal tissues high expression of NUDCD1/CML66 is limited to testis and heart.
Probab=48.47 E-value=20 Score=23.33 Aligned_cols=22 Identities=23% Similarity=0.301 Sum_probs=17.4
Q ss_pred cceeeEEEEEEcCCCe--eeeeEe
Q psy4031 33 AQPRNAKVNIILPEHS--LVRQIH 54 (115)
Q Consensus 33 ~~~d~~~l~ViLPEGA--~~i~v~ 54 (115)
+.-+++++.|.+|+|+ +++++.
T Consensus 5 Qt~~~V~v~i~~p~~~~~~dv~v~ 28 (85)
T cd06493 5 QTEEDLTLTIRLPEDTTKEDIRIK 28 (85)
T ss_pred EeCCEEEEEEECCCCCChhhEEEE
Confidence 4568999999999996 666653
No 8
>PF14903 WG_beta_rep: WG containing repeat
Probab=46.40 E-value=31 Score=18.13 Aligned_cols=24 Identities=17% Similarity=0.282 Sum_probs=17.8
Q ss_pred EeccCCCCcEEEEEeccCcccCcc
Q psy4031 69 TYLDTAGRPMITLRAHNLVENHIE 92 (115)
Q Consensus 69 TYLDt~GRp~v~l~~~Nlvd~h~~ 92 (115)
.++|+.|+.++..+.+.+-.-.+.
T Consensus 2 G~id~~G~~vi~~~yd~i~~~~~g 25 (35)
T PF14903_consen 2 GYIDKNGKIVIPPKYDEIYPFSNG 25 (35)
T ss_pred EEEeCCCCEEEEccccCccccCCC
Confidence 578899999988888776554443
No 9
>PF09972 DUF2207: Predicted membrane protein (DUF2207); InterPro: IPR018702 This domain has no known function.
Probab=40.95 E-value=77 Score=26.05 Aligned_cols=62 Identities=23% Similarity=0.270 Sum_probs=36.2
Q ss_pred ccceeeEEEEEEcCCCeeeeeEeCC-cceeeccccceEEeccCCCCcEEEEEeccCcccCcceEEEEEEeCc
Q psy4031 32 RAQPRNAKVNIILPEHSLVRQIHTP-YPVSSQSDGTHSTYLDTAGRPMITLRAHNLVENHIEDIEIEYEFSK 102 (115)
Q Consensus 32 d~~~d~~~l~ViLPEGA~~i~v~~P-~~v~~~~~~~~~TYLDt~GRp~v~l~~~Nlvd~h~~~i~V~Y~~~~ 102 (115)
++-+++++++|.||++...+++..= -+.. ...+ ++ ...-.++++.+|+- .++.+.|...||+
T Consensus 128 ~~~i~~v~v~i~~P~~~~~~~~~~~~g~~~-----~~~~-~~-~~~~~v~~~~~~l~--~~~~~~v~~~fP~ 190 (511)
T PF09972_consen 128 DVPIENVTVTITLPKPVDNSKAWGHPGPYG-----GTVE-ID-DDDGTVTFTTDNLP--PNEGVEVRVSFPK 190 (511)
T ss_pred CCccceEEEEEECCCCCcceEEEEeccCCC-----ccce-ee-ecCCEEEEEEeccC--CCCeEEEEEEccc
Confidence 7889999999999987766655321 0111 1111 22 23345667777743 4455666666676
No 10
>PHA02090 hypothetical protein
Probab=40.57 E-value=14 Score=24.68 Aligned_cols=13 Identities=15% Similarity=0.043 Sum_probs=10.3
Q ss_pred cCcccceE-eeecC
Q psy4031 12 RYWHALYI-PGHRR 24 (115)
Q Consensus 12 GGWky~ft-iGyn~ 24 (115)
||||++-+ +||+.
T Consensus 48 g~~ktna~flgy~i 61 (79)
T PHA02090 48 GGWKTNAEFLGYAI 61 (79)
T ss_pred CCccccceeeeeee
Confidence 89999976 57764
No 11
>PF08968 DUF1885: Domain of unknown function (DUF1885); InterPro: IPR015062 This family consists of hypothetical proteins produced by bacteria of the Bacillus genus. ; PDB: 1T6A_A.
Probab=39.53 E-value=20 Score=26.48 Aligned_cols=15 Identities=13% Similarity=0.463 Sum_probs=8.8
Q ss_pred eeEEEEEEcCCCeee
Q psy4031 36 RNAKVNIILPEHSLV 50 (115)
Q Consensus 36 d~~~l~ViLPEGA~~ 50 (115)
+...+.|+||+||+.
T Consensus 82 e~~~IQv~LP~~ATh 96 (130)
T PF08968_consen 82 EQSYIQVVLPDGATH 96 (130)
T ss_dssp TEEEEEEE--TT--H
T ss_pred cceEEEEECCCCCcc
Confidence 467899999999963
No 12
>cd06467 p23_NUDC_like p23_like domain of NUD (nuclear distribution) C and similar proteins. Aspergillus nidulas (An) NUDC is needed for nuclear movement. AnNUDC is localized at the hyphal cortex, and binds NUDF at spindle pole bodies (SPBs) and in the cytoplasm at different stages in the cell cycle. At the SPBs it is part of the dynein molecular motor/NUDF complex that regulates microtubule dynamics. Mammalian(m) NUDC associates both with the dynein complex and also with an anti-inflammatory enzyme, platelet activating factor acetylhydrolase I, PAF-AH(I) complex, through binding mNUDF, the regulatory beta subunit of PAF-AH(I). mNUDC is important for cell proliferation both in normal and tumor tissues. Its expression is elevated in various cell types undergoing mitosis or stimulated to proliferate, with high expression levels observed in leukemic cells and tumors. For a leukemic cell line, human NUDC was shown to activate the thrombopoietin (TPO) receptor (Mpl) by binding to its ext
Probab=37.39 E-value=46 Score=20.99 Aligned_cols=21 Identities=24% Similarity=0.273 Sum_probs=16.1
Q ss_pred ceeeEEEEEEcCCCe--eeeeEe
Q psy4031 34 QPRNAKVNIILPEHS--LVRQIH 54 (115)
Q Consensus 34 ~~d~~~l~ViLPEGA--~~i~v~ 54 (115)
.-+++.+.|-+|+|. +++++.
T Consensus 6 t~~~V~i~i~~~~~~~~~dv~v~ 28 (85)
T cd06467 6 TLDEVTVTIPLPEGTKSKDVKVE 28 (85)
T ss_pred eCCEEEEEEECCCCCcceeEEEE
Confidence 457899999999984 666653
No 13
>cd06492 p23_mNUDC_like p23-like NUD (nuclear distribution) C-like domain of mammalian(m) NUDC and similar proteins. Mammalian(m) NUDC associates both with the dynein complex and also with an anti-inflammatory enzyme, platelet activating factor acetylhydrolase I, PAF-AH(I) complex, through binding mNUDF, the regulatory beta subunit of PAF-AH(I). mNUDC is important for cell proliferation both in normal and tumor tissues. Its expression is elevated in various cell types undergoing mitosis or stimulated to proliferate, with high expression levels observed in leukemic cells and tumors. For a leukemic cell line, human NUDC was shown to activate the thrombopoietin (TPO) receptor (Mpl) by binding to its extracellular domain, and promoting cell proliferation and differentiation.
Probab=34.71 E-value=54 Score=21.72 Aligned_cols=23 Identities=0% Similarity=-0.039 Sum_probs=17.5
Q ss_pred ccceeeEEEEEEcCCC----eeeeeEe
Q psy4031 32 RAQPRNAKVNIILPEH----SLVRQIH 54 (115)
Q Consensus 32 d~~~d~~~l~ViLPEG----A~~i~v~ 54 (115)
.+..++++++|-||+| |+++++.
T Consensus 4 ~QT~~ev~v~v~l~~~~~~~~kdv~v~ 30 (87)
T cd06492 4 TQTLSEVELKVPFKVSFRLKGKDVVVD 30 (87)
T ss_pred EeecCEEEEEEECCCCCCccceEEEEE
Confidence 3567899999999887 4677664
No 14
>KOG2566|consensus
Probab=33.72 E-value=80 Score=27.93 Aligned_cols=51 Identities=14% Similarity=0.156 Sum_probs=34.0
Q ss_pred cceeeEEEEEEcCCCeeeeeEeCCcceeeccccce-EEeccCCCCcEEEEEeccCcc
Q psy4031 33 AQPRNAKVNIILPEHSLVRQIHTPYPVSSQSDGTH-STYLDTAGRPMITLRAHNLVE 88 (115)
Q Consensus 33 ~~~d~~~l~ViLPEGA~~i~v~~P~~v~~~~~~~~-~TYLDt~GRp~v~l~~~Nlvd 88 (115)
|.|-=...+=.||+||..|....-. +.+.+ -++|--.|+.+|++--+|-.+
T Consensus 437 mfya~~hFSkFl~pGs~Rv~~~i~~-----~~~ve~~aflnpdGskvvVllnk~s~~ 488 (518)
T KOG2566|consen 437 MFYALGHFSKFLPPGSVRVGHSINQ-----NLDVEATAFLNPDGSKVVVLLNKNSLD 488 (518)
T ss_pred HHHHHHHHhhcCCCCceEeeeeecc-----ccccceeEEEcCCCcEEEEEeccCCCC
Confidence 4444455677899999988764322 33333 478889999988777666544
No 15
>PF10976 DUF2790: Protein of unknown function (DUF2790); InterPro: IPR021245 This family of proteins with unknown function appear to be restricted to Pseudomonadaceae.
Probab=30.89 E-value=68 Score=21.48 Aligned_cols=31 Identities=19% Similarity=0.167 Sum_probs=19.9
Q ss_pred eeeeeEeCCcceeeccccceEEeccCCCCcEE
Q psy4031 48 SLVRQIHTPYPVSSQSDGTHSTYLDTAGRPMI 79 (115)
Q Consensus 48 A~~i~v~~P~~v~~~~~~~~~TYLDt~GRp~v 79 (115)
|+.|.+..+-+ .-.-+..+-+|+|..|+.-.
T Consensus 36 akVis~t~~~~-~C~Vvpa~MtY~DS~G~~h~ 66 (78)
T PF10976_consen 36 AKVISVTPPPN-VCGVVPARMTYEDSQGELHT 66 (78)
T ss_pred heeEEecCCCC-CCcEEccEEEEECCCCCEEE
Confidence 35566654432 23445567799999999863
No 16
>TIGR03000 plancto_dom_1 Planctomycetes uncharacterized domain TIGR03000. Domains described by this model are found, so far, only in the Planctomycetes (Pirellula sp. strain 1 and Gemmata obscuriglobus), in up to six proteins per genome, and may be duplicated within a protein. The function is unknown.
Probab=30.75 E-value=37 Score=22.79 Aligned_cols=14 Identities=21% Similarity=0.503 Sum_probs=11.5
Q ss_pred eEEEEEEcCCCeee
Q psy4031 37 NAKVNIILPEHSLV 50 (115)
Q Consensus 37 ~~~l~ViLPEGA~~ 50 (115)
.+++.|.||++|+.
T Consensus 2 ~a~itv~vPadAkl 15 (75)
T TIGR03000 2 AATITVTLPADAKL 15 (75)
T ss_pred ceEEEEEeCCCCEE
Confidence 46899999999863
No 17
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=29.60 E-value=80 Score=20.33 Aligned_cols=17 Identities=18% Similarity=0.309 Sum_probs=12.4
Q ss_pred eEEEEEEcCCCeeeeeE
Q psy4031 37 NAKVNIILPEHSLVRQI 53 (115)
Q Consensus 37 ~~~l~ViLPEGA~~i~v 53 (115)
.+.+++.||++|.++++
T Consensus 13 ~~~~~~~l~~~a~~v~v 29 (81)
T PF13860_consen 13 KGSIEYTLPEDADNVTV 29 (81)
T ss_dssp EEEEEEEECSSCEEEEE
T ss_pred EEEEEEeCCCcccEEEE
Confidence 45677778888887755
No 18
>PF12503 CMV_1a_C: Cucumber mosaic virus 1a protein C terminal ; InterPro: IPR022184 This domain family is found in viruses, and is approximately 90 amino acids in length. The family is found in association with PF01443 from PFAM, PF01660 from PFAM. There is a conserved GLG sequence motif. 1a protein is the major virulence factor of the (Cucumber mosaic virus. The Ns strain of CMV causes necrotic lesions to Nicotiana spp. while other strains cause systemic mosaic. The determinant of the pathogenesis of these different strains is the specific amino acid residue at the 461 residue of the 1a protein. ; GO: 0008168 methyltransferase activity, 0016817 hydrolase activity, acting on acid anhydrides
Probab=27.51 E-value=32 Score=23.67 Aligned_cols=14 Identities=29% Similarity=0.496 Sum_probs=11.2
Q ss_pred cCc----ccceEeeecCC
Q psy4031 12 RYW----HALYIPGHRRS 25 (115)
Q Consensus 12 GGW----ky~ftiGyn~p 25 (115)
|+| ++.|++|||..
T Consensus 38 GrWm~P~~~~Y~VGyNe~ 55 (85)
T PF12503_consen 38 GRWMYPEGYEYMVGYNES 55 (85)
T ss_pred CceecCCCeEEEeeecCC
Confidence 677 67899999963
No 19
>PF07617 DUF1579: Protein of unknown function (DUF1579); InterPro: IPR011473 This is a family of paralogous hypothetical proteins identified in Rhodopirellula baltica that also has members in Gloeobacter violaceus, Rhizobium meliloti and Agrobacterium tumefaciens, amongst others.
Probab=26.29 E-value=2e+02 Score=21.38 Aligned_cols=35 Identities=14% Similarity=0.059 Sum_probs=30.6
Q ss_pred ccceEeeecCChhhhccccceeeEEEEEEcCCCeee
Q psy4031 15 HALYIPGHRRSPYDYAARAQPRNAKVNIILPEHSLV 50 (115)
Q Consensus 15 ky~ftiGyn~pl~~~l~d~~~d~~~l~ViLPEGA~~ 50 (115)
...-++||+.....|+ ...+|+.-..+-..||.-+
T Consensus 64 ~~~~~lGYD~~~~~yv-gtWidSM~t~~~~~~G~~d 98 (159)
T PF07617_consen 64 EGIGTLGYDPAKKKYV-GTWIDSMGTGIWVYEGTLD 98 (159)
T ss_pred EEEEEEEECCccCeEE-EEEeccCCCceeeEccccC
Confidence 3457899999999999 8999999999999999844
No 20
>KOG3179|consensus
Probab=24.96 E-value=28 Score=28.11 Aligned_cols=35 Identities=20% Similarity=0.166 Sum_probs=23.4
Q ss_pred EEcCCCeeeeeEeCCcceeeccccceEEeccCCCCcE
Q psy4031 42 IILPEHSLVRQIHTPYPVSSQSDGTHSTYLDTAGRPM 78 (115)
Q Consensus 42 ViLPEGA~~i~v~~P~~v~~~~~~~~~TYLDt~GRp~ 78 (115)
+-|||||+.+-.+.-..++- .+.....|...|||-
T Consensus 159 le~PE~a~llasSe~ceve~--fs~~~~~l~fQGHPE 193 (245)
T KOG3179|consen 159 LELPEGAELLASSEKCEVEM--FSIEDHLLCFQGHPE 193 (245)
T ss_pred ecCCchhhhhccccccceEE--EEecceEEEecCCch
Confidence 35899999997766655532 223345677789985
No 21
>PF04525 Tub_2: Tubby C 2; InterPro: IPR007612 This is a family of plant and bacterial uncharacterised proteins.; PDB: 1ZXU_A 2Q4M_A.
Probab=24.90 E-value=71 Score=23.58 Aligned_cols=42 Identities=21% Similarity=0.287 Sum_probs=22.5
Q ss_pred EcCCCeeeeeEeCCcceeeccccceEEeccCCCCcEEEEEeccCc
Q psy4031 43 ILPEHSLVRQIHTPYPVSSQSDGTHSTYLDTAGRPMITLRAHNLV 87 (115)
Q Consensus 43 iLPEGA~~i~v~~P~~v~~~~~~~~~TYLDt~GRp~v~l~~~Nlv 87 (115)
.=..|...-+|.. -.-.+.....+.+|..|+|.++|+++.+.
T Consensus 33 ~D~~G~~vf~V~g---~~~~s~~~~~~l~D~~G~~L~~i~~k~~~ 74 (187)
T PF04525_consen 33 YDENGNVVFRVDG---GKFFSIGKKRTLMDASGNPLFTIRRKLFS 74 (187)
T ss_dssp EETTS-EEEEEE-----SCTTBTTEEEEE-TTS-EEEEEE-----
T ss_pred EcCCCCEEEEEEE---ecccCCCCEEEEECCCCCEEEEEEeeecc
Confidence 3445555555544 01146678889999999999999997443
No 22
>cd01777 SNX27_RA Ubiquitin domain of SNX27 (sorting nexin protein 27). SNX27_RA SNX27 (sorting nexin protein 27) belongs to a large family of endosome-localized proteins related to sorting nexin1 which is implicated in regulating membrane traffic. The domain architecture of SNX27 includes an amino-terminal PDZ domain, a PX (PhoX homologous) domain, and a carboxy-terminal RA (RAS-associated) domain.
Probab=24.89 E-value=55 Score=22.57 Aligned_cols=14 Identities=29% Similarity=0.641 Sum_probs=11.7
Q ss_pred eEEEEEEcCCCeee
Q psy4031 37 NAKVNIILPEHSLV 50 (115)
Q Consensus 37 ~~~l~ViLPEGA~~ 50 (115)
+++++|.||-|++.
T Consensus 1 ~V~L~V~Lpdg~~i 14 (87)
T cd01777 1 DVELRIALPDKATV 14 (87)
T ss_pred CeEEEEEccCCCEE
Confidence 47899999999854
No 23
>COG4824 Phage-related holin (Lysis protein) [General function prediction only]
Probab=24.53 E-value=18 Score=26.79 Aligned_cols=10 Identities=10% Similarity=-0.250 Sum_probs=8.1
Q ss_pred eccCcccceE
Q psy4031 10 RSRYWHALYI 19 (115)
Q Consensus 10 lfGGWky~ft 19 (115)
+||||+|.+.
T Consensus 19 lFG~~~~l~~ 28 (133)
T COG4824 19 LFGGVDYLLL 28 (133)
T ss_pred HhcchHHHHH
Confidence 7999998763
No 24
>PF13716 CRAL_TRIO_2: Divergent CRAL/TRIO domain; PDB: 2D4Q_B 3PG7_B 2E2X_B 3P7Z_B 3PEG_A.
Probab=23.23 E-value=54 Score=22.75 Aligned_cols=16 Identities=38% Similarity=0.675 Sum_probs=12.5
Q ss_pred cCCCCcEEEEEeccCc
Q psy4031 72 DTAGRPMITLRAHNLV 87 (115)
Q Consensus 72 Dt~GRp~v~l~~~Nlv 87 (115)
|..|||++++-.+++.
T Consensus 10 d~~g~pV~~~~~~~~~ 25 (149)
T PF13716_consen 10 DREGRPVVVFIASRLP 25 (149)
T ss_dssp BTTS-EEEEEEGGG-C
T ss_pred CCCcCEEEEEECCcCc
Confidence 7789999999999994
No 25
>KOG3264|consensus
Probab=23.10 E-value=55 Score=26.06 Aligned_cols=17 Identities=6% Similarity=0.249 Sum_probs=14.0
Q ss_pred EEEEEEcCCCeeeeeEe
Q psy4031 38 AKVNIILPEHSLVRQIH 54 (115)
Q Consensus 38 ~~l~ViLPEGA~~i~v~ 54 (115)
++++|++|+|++|-.+.
T Consensus 171 velsV~aPa~~E~~~va 187 (221)
T KOG3264|consen 171 VELSVVAPAGQENYEVA 187 (221)
T ss_pred EEEEEecCCccchhhHH
Confidence 78999999999976543
No 26
>PF08487 VIT: Vault protein inter-alpha-trypsin domain; InterPro: IPR013694 Inter-alpha-trypsin inhibitors (ITIs) consist of one light chain and a variable set of heavy chains. ITIs play a role in extracellular matrix (ECM) stabilisation and tumour metastasis as well as in plasma protease inhibition []. The vault protein inter-alpha-trypsin (VIT) domain described here is found to the N terminus of a von Willebrand factor type A domain (IPR002035 from INTERPRO) in ITI heavy chains (ITIHs) and their precursors.
Probab=23.07 E-value=1.2e+02 Score=20.86 Aligned_cols=37 Identities=27% Similarity=0.267 Sum_probs=27.5
Q ss_pred cccceEEeccCCCCcEEEEEeccCcccCcceEEEEEEeCc
Q psy4031 63 SDGTHSTYLDTAGRPMITLRAHNLVENHIEDIEIEYEFSK 102 (115)
Q Consensus 63 ~~~~~~TYLDt~GRp~v~l~~~Nlvd~h~~~i~V~Y~~~~ 102 (115)
+...+-+..|...|++++..+.|-. .....+.|.|+-
T Consensus 7 s~~v~~~I~~~~a~t~v~q~f~N~~---~~~~E~~y~fpL 43 (118)
T PF08487_consen 7 SVHVKVTIIDRFARTTVTQTFENPS---SEPLEAVYSFPL 43 (118)
T ss_pred EEEEEEEEEccEEEEEEEEEEECCC---CCcEEEEEEeEC
Confidence 3445567788899999999999854 445677888764
No 27
>COG4856 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.17 E-value=65 Score=27.95 Aligned_cols=17 Identities=18% Similarity=0.200 Sum_probs=13.7
Q ss_pred eeEEEEEEcCCCeeeee
Q psy4031 36 RNAKVNIILPEHSLVRQ 52 (115)
Q Consensus 36 d~~~l~ViLPEGA~~i~ 52 (115)
+..++++.||||++-++
T Consensus 288 t~~tv~lpvPegv~sv~ 304 (403)
T COG4856 288 TTKTVKLPVPEGVKSVS 304 (403)
T ss_pred ceEEEEeeCCCcceecC
Confidence 35789999999997653
No 28
>PF10515 APP_amyloid: beta-amyloid precursor protein C-terminus; InterPro: IPR019543 This is the amyloid, C-terminal, protein of the beta-Amyloid precursor protein (APP) which is a conserved and ubiquitous transmembrane glycoprotein strongly implicated in the pathogenesis of Alzheimer's disease but whose normal biological function is unknown. The C-terminal 100 residues are released and aggregate into amyloid deposits which are strongly implicated in the pathology of Alzheimer's disease plaque-formation. The domain is associated with IPR008154 from INTERPRO, further towards the N terminus. ; PDB: 2ROZ_A 3DXD_D 1X11_D 2LP1_A 2LOH_A 3MXC_L 3MXY_L 3DXC_B 3DXE_D 3L81_B ....
Probab=22.02 E-value=41 Score=21.22 Aligned_cols=11 Identities=27% Similarity=0.561 Sum_probs=9.4
Q ss_pred eeecCChhhhc
Q psy4031 20 PGHRRSPYDYA 30 (115)
Q Consensus 20 iGyn~pl~~~l 30 (115)
-||.+|++.|+
T Consensus 41 nGYENPTYkyf 51 (52)
T PF10515_consen 41 NGYENPTYKYF 51 (52)
T ss_dssp TEEESCTCHHC
T ss_pred cCCcCCceecc
Confidence 39999999886
No 29
>PF08089 Toxin_20: Huwentoxin-II family; InterPro: IPR012625 This family consists of the huwentoxin-II (HWTX-II) family of toxins secreted by spiders. These toxins are found in venom that secreted from the bird spider Selenocosmia huwena Wang. The HWTX-II adopts a novel scaffold different from the ICK motif that is found in other huwentoxins. HWTX-II consists of 37 amino acids residues including six cysteines involved in three disulphide bridges [].; GO: 0005576 extracellular region; PDB: 2KGH_A 1I25_A.
Probab=21.98 E-value=25 Score=20.83 Aligned_cols=10 Identities=10% Similarity=0.275 Sum_probs=6.9
Q ss_pred ccCcccceEe
Q psy4031 11 SRYWHALYIP 20 (115)
Q Consensus 11 fGGWky~fti 20 (115)
-||||..|.+
T Consensus 26 ~GGwkcK~n~ 35 (39)
T PF08089_consen 26 KGGWKCKFNM 35 (39)
T ss_dssp -TTEEEETTE
T ss_pred cCCceEEEEE
Confidence 4899987653
No 30
>PF13349 DUF4097: Domain of unknown function (DUF4097)
Probab=21.73 E-value=1.4e+02 Score=20.87 Aligned_cols=21 Identities=29% Similarity=0.381 Sum_probs=16.4
Q ss_pred eeEEEEEEcCCCe--eeeeEeCC
Q psy4031 36 RNAKVNIILPEHS--LVRQIHTP 56 (115)
Q Consensus 36 d~~~l~ViLPEGA--~~i~v~~P 56 (115)
....+.|.||++. .++++.+-
T Consensus 118 ~~~~i~I~lP~~~~l~~i~i~~~ 140 (166)
T PF13349_consen 118 NKSKITIYLPKDYKLDKIDIKTS 140 (166)
T ss_pred CCcEEEEEECCCCceeEEEEEec
Confidence 4678999999997 57877664
No 31
>cd06494 p23_NUDCD2_like p23-like NUD (nuclear distribution) C-like found in human NUDC domain-containing protein 2 (NUDCD2) and similar proteins. Little is known about the function of the proteins in this subgroup.
Probab=21.54 E-value=1.1e+02 Score=20.59 Aligned_cols=22 Identities=5% Similarity=0.167 Sum_probs=17.4
Q ss_pred ccceeeEEEEEEcCCCeeeeeE
Q psy4031 32 RAQPRNAKVNIILPEHSLVRQI 53 (115)
Q Consensus 32 d~~~d~~~l~ViLPEGA~~i~v 53 (115)
.+..++++++|=||+|++-=++
T Consensus 11 ~QT~~eV~v~i~lp~~~~~kdv 32 (93)
T cd06494 11 YQTMDEVFIEVNVPPGTRAKDV 32 (93)
T ss_pred EeEcCEEEEEEECCCCCceeeE
Confidence 6778999999999998744433
No 32
>PF08813 Phage_tail_3: Phage tail protein; InterPro: IPR014918 This entry is represented by Bacteriophage T1, Orf41. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. Proteins of this entry include phage tail proteins. They probably include bacterial Ig-like domains related to IPR003343 from INTERPRO. Which also includes a number of phage tail invasin proteins.
Probab=21.53 E-value=1.1e+02 Score=22.87 Aligned_cols=35 Identities=11% Similarity=0.128 Sum_probs=24.4
Q ss_pred cceEeeec--CChhhhcc--ccceeeEEEEEEcCCCeee
Q psy4031 16 ALYIPGHR--RSPYDYAA--RAQPRNAKVNIILPEHSLV 50 (115)
Q Consensus 16 y~ftiGyn--~pl~~~l~--d~~~d~~~l~ViLPEGA~~ 50 (115)
.+|++.|. +|-+.-|+ +-.=+-..+|+.||.|++.
T Consensus 90 lt~t~a~dp~~~~y~aL~~a~~~~~~~a~r~~~p~G~~~ 128 (165)
T PF08813_consen 90 LTFTFAYDPDDPGYKALRAADDSKEVRAFRVTLPNGSTI 128 (165)
T ss_pred EEEEEEeCCCCHHHHHHHHHhhcCCeEEEEEEcCCCCEE
Confidence 46888888 55555555 3333445689999999977
No 33
>PF14879 DUF4489: Domain of unknown function (DUF4489)
Probab=20.65 E-value=2.2e+02 Score=21.23 Aligned_cols=53 Identities=26% Similarity=0.367 Sum_probs=35.2
Q ss_pred EEEcCCCeeeeeEeCCcceeeccccceEEeccCC--CCcEEEEEe-ccCcccCcceEEEEEEeCch
Q psy4031 41 NIILPEHSLVRQIHTPYPVSSQSDGTHSTYLDTA--GRPMITLRA-HNLVENHIEDIEIEYEFSKS 103 (115)
Q Consensus 41 ~ViLPEGA~~i~v~~P~~v~~~~~~~~~TYLDt~--GRp~v~l~~-~Nlvd~h~~~i~V~Y~~~~~ 103 (115)
.+.||+++. ..+||++.+.. |||. -.|.|.|++ .|++-.-.....+.++.-+.
T Consensus 12 ~~~lP~~t~---~~~~~~vAslt-------lDTs~l~~p~ikLeFtsnI~~~~~~~~tLnFQifK~ 67 (141)
T PF14879_consen 12 SVPLPVITT---AGTTFTVASLT-------LDTSCLCNPCIKLEFTSNIVTPAAVSGTLNFQIFKQ 67 (141)
T ss_pred ceEcccccc---CCceEEEEEEE-------EccccCcCcEEEEEEEEeEEeeeeeEEEEEEEEeeh
Confidence 368999888 77888876544 6764 578899998 56655444445555554443
No 34
>PRK11615 hypothetical protein; Provisional
Probab=20.36 E-value=1.8e+02 Score=22.68 Aligned_cols=42 Identities=14% Similarity=0.288 Sum_probs=28.4
Q ss_pred EEEEEcCCCeeeeeEeCCcceeeccccceEEeccCCCCcEE-EEEecc
Q psy4031 39 KVNIILPEHSLVRQIHTPYPVSSQSDGTHSTYLDTAGRPMI-TLRAHN 85 (115)
Q Consensus 39 ~l~ViLPEGA~~i~v~~P~~v~~~~~~~~~TYLDt~GRp~v-~l~~~N 85 (115)
.+.+.||.|-.+..- + . -..--..+=|.|.+||.+| ++..+|
T Consensus 49 Kl~FtLPag~sdqsg--k--~-Gtq~nn~~vYad~tg~kavIVi~gD~ 91 (185)
T PRK11615 49 KLSFTLPADMSDQSG--K--L-GTQANNMHVYADATGQKAVIVILGDD 91 (185)
T ss_pred EEEEEcCCccccccc--c--c-cccccceEEEEcCCCCEEEEEEeCCC
Confidence 578899999887632 1 1 1233456789998888886 776554
Done!