Query psy6233
Match_columns 271
No_of_seqs 159 out of 301
Neff 4.8
Searched_HMMs 46136
Date Fri Aug 16 21:11:16 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6233.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6233hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3878|consensus 100.0 4E-101 8E-106 719.2 19.3 271 1-271 101-469 (469)
2 PF13897 GOLD_2: Golgi-dynamic 100.0 9.8E-56 2.1E-60 369.0 13.9 132 135-270 1-136 (136)
3 KOG3287|consensus 97.8 0.00021 4.5E-09 65.1 10.5 87 129-267 34-126 (236)
4 KOG1692|consensus 95.3 0.093 2E-06 47.3 8.2 33 238-270 81-113 (201)
5 KOG1693|consensus 93.8 0.48 1E-05 43.0 9.1 30 238-267 85-114 (209)
6 PF01105 EMP24_GP25L: emp24/gp 84.0 0.32 7E-06 40.3 0.0 32 238-269 66-98 (183)
7 KOG0163|consensus 80.8 9.5 0.00021 41.3 9.2 30 57-86 965-994 (1259)
8 KOG1029|consensus 78.1 4.5 9.7E-05 43.6 5.8 8 73-80 418-425 (1118)
9 smart00110 C1Q Complement comp 74.2 2.7 5.8E-05 35.3 2.5 24 229-252 35-58 (135)
10 KOG0817|consensus 67.6 3.6 7.7E-05 35.4 1.9 22 1-22 68-89 (142)
11 PF00386 C1q: C1q domain; Int 65.9 7.5 0.00016 31.2 3.4 24 229-252 29-52 (127)
12 PTZ00266 NIMA-related protein 58.5 22 0.00048 39.5 6.3 10 18-27 423-432 (1021)
13 PF09394 Inhibitor_I42: Chagas 53.7 93 0.002 23.4 10.4 34 133-167 1-34 (92)
14 PF15236 CCDC66: Coiled-coil d 51.6 79 0.0017 27.8 7.4 27 63-89 102-129 (157)
15 KOG1029|consensus 50.6 53 0.0012 35.9 7.3 18 26-43 338-355 (1118)
16 PF01117 Aerolysin: Aerolysin 41.6 33 0.0007 33.8 3.9 44 131-174 185-229 (360)
17 KOG2891|consensus 41.0 65 0.0014 31.3 5.7 17 53-69 391-407 (445)
18 KOG2412|consensus 39.8 64 0.0014 33.7 5.8 12 9-20 200-211 (591)
19 KOG0163|consensus 39.3 1.3E+02 0.0029 33.0 8.1 8 245-252 1170-1177(1259)
20 cd05840 SPBC215_ISWI_like The 34.3 22 0.00048 28.1 1.3 23 242-264 38-62 (93)
21 KOG3006|consensus 33.8 27 0.00058 29.7 1.7 20 243-263 76-95 (132)
22 TIGR03769 P_ac_wall_RPT actino 33.2 31 0.00068 23.5 1.7 16 238-253 6-21 (41)
23 PF06868 DUF1257: Protein of u 32.7 61 0.0013 26.3 3.6 30 131-170 15-44 (105)
24 CHL00193 ycf35 Ycf35; Provisio 31.2 83 0.0018 26.5 4.3 29 131-169 33-61 (128)
25 COG3081 Nucleoid-associated pr 29.3 1.1E+02 0.0024 29.5 5.2 69 71-163 246-314 (335)
26 TIGR02568 LcrE type III secret 29.1 1.8E+02 0.004 26.5 6.5 36 68-104 62-99 (240)
27 PLN03086 PRLI-interacting fact 27.8 1.6E+02 0.0035 30.8 6.5 20 138-157 171-190 (567)
28 PF13540 RCC1_2: Regulator of 27.7 86 0.0019 19.4 2.9 24 132-157 1-24 (30)
29 PF07946 DUF1682: Protein of u 27.6 77 0.0017 30.2 3.9 11 7-17 237-247 (321)
30 PF00855 PWWP: PWWP domain; I 26.8 50 0.0011 24.5 2.0 25 241-265 31-57 (86)
31 smart00293 PWWP domain with co 26.6 55 0.0012 23.7 2.1 22 243-264 37-60 (63)
32 KOG3054|consensus 25.8 1.5E+02 0.0033 28.2 5.4 16 54-69 139-154 (299)
33 cd05838 WHSC1_related The PWWP 25.1 44 0.00095 26.4 1.5 23 242-264 36-60 (95)
34 KOG1144|consensus 25.1 1.3E+02 0.0028 33.2 5.2 7 74-80 290-296 (1064)
35 cd05162 PWWP The PWWP domain, 24.3 58 0.0013 24.6 2.0 22 243-264 36-59 (87)
36 PRK09510 tolA cell envelope in 24.0 2.4E+02 0.0052 28.2 6.7 18 137-154 324-341 (387)
37 PHA03051 Hypothetical protein; 23.9 38 0.00081 26.4 0.9 14 15-28 33-46 (88)
38 PF09805 Nop25: Nucleolar prot 23.6 1.8E+02 0.0038 24.6 5.0 9 22-30 21-30 (137)
39 PF02337 Gag_p10: Retroviral G 23.6 70 0.0015 25.6 2.4 18 5-22 28-45 (90)
40 KOG2357|consensus 23.4 2.2E+02 0.0047 28.9 6.2 17 14-30 365-381 (440)
41 PLN03086 PRLI-interacting fact 22.4 1.9E+02 0.004 30.4 5.8 13 26-38 8-20 (567)
42 PF03531 SSrecog: Structure-sp 21.6 1.3E+02 0.0029 27.6 4.1 36 235-270 139-174 (222)
43 KOG2701|consensus 21.2 2.4E+02 0.0052 29.9 6.2 27 4-30 501-527 (608)
44 KOG2880|consensus 21.2 2.1E+02 0.0045 28.7 5.5 34 9-43 60-93 (424)
No 1
>KOG3878|consensus
Probab=100.00 E-value=3.7e-101 Score=719.20 Aligned_cols=271 Identities=58% Similarity=0.991 Sum_probs=244.7
Q ss_pred CChHHHHHHHHHHHhhhCCCchhHHHHHHhhHHHHHHH--HHHHHHHHHHhhhhhhcccccchhhhHHHHHHHHHHHHHh
Q psy6233 1 MSSKDAMSNFIDLLDLKCILFKPYIDAHVAQKEEENRK--AREQIEREEKVKSEQVNGNEVSQETSIEEKKRRLIQDALN 78 (271)
Q Consensus 1 ~s~~~am~~fi~ll~~~cp~f~~~v~a~~~~~ee~~~~--~~~~~~~~~~~~~~~~~~~~~~~e~~~~e~~~~qi~~aln 78 (271)
||+++||.+||+|||+|||+|+|||++||++|+|..+. .+..+++.++.++++..+++.++|+++.|+||+|||+|||
T Consensus 101 ~sre~AM~~FV~Lldr~C~~F~~yia~~k~~kde~lkE~e~r~~ee~~e~~~lQe~~qr~l~ee~~~~E~Qr~Qiq~ALN 180 (469)
T KOG3878|consen 101 ISREQAMEGFVDLLDRMCSAFRPYIAAVKQDKDETLKEKELRLMEEKKEARELQENAQRELLEEGYKEELQRRQIQDALN 180 (469)
T ss_pred ccHHHHHHHHHHHHHhcchhhhhHHHHhhhhhhhHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhhHHHHHHHHHHHHh
Confidence 79999999999999999999999999999999998763 4444555555556666678888999999999999999999
Q ss_pred hhhHHHHHHHHHhcCCCChhhhhhhh-h----------------------------------------------------
Q psy6233 79 QQTYAQFRAYAEDQFPGNPEQQAVLV-Q---------------------------------------------------- 105 (271)
Q Consensus 79 ~qt~~qf~~ya~~q~P~~~eqq~~li-~---------------------------------------------------- 105 (271)
.|||+||++||++||||||+||++|| |
T Consensus 181 ~QT~~QF~~YA~~QfPGnpEQQ~vLIrQLQeqHYqQYMqQly~~~~aQ~q~~~Q~~~~v~~s~s~lPn~~e~n~~~p~~~ 260 (469)
T KOG3878|consen 181 KQTYQQFKLYAEKQFPGNPEQQAVLIRQLQEQHYQQYMQQLYLQNQAQNQNGHQEAEHVPGSSSDLPNAMEGNKLGPVET 260 (469)
T ss_pred HHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccCcccCCCCchhhhcccCchhH
Confidence 99999999999999999999999999 1
Q ss_pred ------------------------------------ccCCCcccccchhhhHHhhhccCCceeEEEccCceEEEEEEeec
Q psy6233 106 ------------------------------------IGPANMWTLTNIKSFKEHIRKEGGDAIIKIGHGETVTVRVPTHE 149 (271)
Q Consensus 106 ------------------------------------~vP~s~wt~~~i~~fk~~i~~~~~~~~i~V~~Get~tV~V~v~~ 149 (271)
+.|+||||||+|.+||.+|..++++++|+||||+++|||||||.
T Consensus 261 ~shq~qa~~~dds~e~~le~v~~eeagE~~p~s~p~I~pas~WtRpdI~~FK~~i~~d~~~gvItvGhGetVTVRVPThe 340 (469)
T KOG3878|consen 261 QSHQQQAEQQDDSVEQTLEGVGQEEAGEEEPDSYPMICPASIWTRPDIEQFKTEISADDGDGVITVGHGETVTVRVPTHE 340 (469)
T ss_pred HHHHhhhhccCCchhhhccccchhhhcccCCcccceecchhhcCcccHHHHHHHhccCCCCCeEEecCCceEEEeccccC
Confidence 89999999999999999999998889999999999999999999
Q ss_pred CCCEEEEEEEeeCccEEEEEEEEecCCCCCcceeccccCccccccC-----cCCcCCCCCCCCCccc--cccCCCCceee
Q psy6233 150 GGSCLYWEFCTDHYDLGFGIYFEWNKSPTNQVSVHVSETDDEMISD-----KDTESGRTGSQPRKTT--KIVNRSPFSII 222 (271)
Q Consensus 150 ~Gs~L~WeF~Td~yDIGFGV~fe~~~~~s~~vsv~v~eS~d~~~~~-----~~~e~g~~~~~~~~~~--~~~~~~~~~eI 222 (271)
.|+||||+|+||+|||||||||+|+++.+|+||||||+|+||+|.. |+++.|..+.+.++.| ..++++.+++|
T Consensus 341 nGsclFWEFATD~YDIGFGvYFEWt~~~~n~VsVHVSeSddded~~~~~e~E~~e~G~~~~E~gA~~n~~~anKp~~deI 420 (469)
T KOG3878|consen 341 NGSCLFWEFATDSYDIGFGVYFEWTKPVTNEVSVHVSESDDDEDCVYLSETEDLESGSLSQERGAVNNPTAANKPPIDEI 420 (469)
T ss_pred CCceEEEEeccccccccceEEEEeecCCCceeEEEecccccchhhhhhhhhhhhhcCCchhhhhhhcCCCCCCCCCcccc
Confidence 9999999999999999999999999999999999999999988644 6666663211122211 24789999999
Q ss_pred eeCeeeccccceecceEEeccceEEEEEEecCccceeeceeEEEEEEeC
Q psy6233 223 SPIMRRDCHQEVYAGSHCYPGEGVYLLKFDNSYSLWRSKTLYYRVYYTR 271 (271)
Q Consensus 223 vP~~R~~sh~~~~~Gs~~~~~~GvY~L~FDNSYS~~RsK~L~Y~V~~t~ 271 (271)
||+||++||.++|+|||.|||.|+|+|+||||||+||||+|||||||||
T Consensus 421 vPvYRRdCheEVYaGSH~YPGrGvYLLKFDNSYSlWRsKtlYYRVYY~r 469 (469)
T KOG3878|consen 421 VPVYRRDCHEEVYAGSHSYPGRGVYLLKFDNSYSLWRSKTLYYRVYYER 469 (469)
T ss_pred cchhhhhhhHHhhcccccCCCCceEEEEecchhhhhcccceEEEEEecC
Confidence 9999999999999999999999999999999999999999999999997
No 2
>PF13897 GOLD_2: Golgi-dynamics membrane-trafficking
Probab=100.00 E-value=9.8e-56 Score=369.04 Aligned_cols=132 Identities=60% Similarity=1.067 Sum_probs=118.0
Q ss_pred EccCceEEEEEEeecCCCEEEEEEEeeCccEEEEEEEEecCCCCCcceeccccCccccccCcCCc----CCCCCCCCCcc
Q psy6233 135 IGHGETVTVRVPTHEGGSCLYWEFCTDHYDLGFGIYFEWNKSPTNQVSVHVSETDDEMISDKDTE----SGRTGSQPRKT 210 (271)
Q Consensus 135 V~~Get~tV~V~v~~~Gs~L~WeF~Td~yDIGFGV~fe~~~~~s~~vsv~v~eS~d~~~~~~~~e----~g~~~~~~~~~ 210 (271)
|+||+++|||||+|++|++|+|||+||+|||||||||+|+++++++|||||++|+|+||++++.+ .|. .+. |
T Consensus 1 v~~Ge~~tvrVpt~~~G~~l~WeFaTd~yDIgFG~~few~~~~s~~vsv~vses~de~~~~~~~~~~~~~~~--ve~--g 76 (136)
T PF13897_consen 1 VGRGETVTVRVPTHPEGKCLFWEFATDSYDIGFGVYFEWTPPTSNQVSVHVSESSDEEDEEEEEEEDSESGD--VEK--G 76 (136)
T ss_pred CCcCcEEEEEcccCCCCCEEEEEEeeCCCCceEEEEEEecCCCCCceeEecccCccccccccccccccCccc--hhc--c
Confidence 78999999999999999999999999999999999999999999999999999999886664333 121 111 1
Q ss_pred ccccCCCCceeeeeCeeeccccceecceEEeccceEEEEEEecCccceeeceeEEEEEEe
Q psy6233 211 TKIVNRSPFSIISPIMRRDCHQEVYAGSHCYPGEGVYLLKFDNSYSLWRSKTLYYRVYYT 270 (271)
Q Consensus 211 ~~~~~~~~~~eIvP~~R~~sh~~~~~Gs~~~~~~GvY~L~FDNSYS~~RsK~L~Y~V~~t 270 (271)
.....++.+.+|||++|++||++|++|+|.|++||+|+|+|||||||||||+|+|+|+||
T Consensus 77 s~~~~~~~~~eviPv~R~dsH~~~~~Gs~~c~~~GvYvLkFDNSYS~~rsK~l~Y~V~yt 136 (136)
T PF13897_consen 77 SERSSRPEMDEVIPVYRRDSHLEVEAGSHTCPGPGVYVLKFDNSYSWFRSKKLYYRVYYT 136 (136)
T ss_pred ccccCCCCeeEEeEeeeeecCcceeceEEECCCCeEEEEEeeCcceeEEeeEEEEEEEeC
Confidence 134567889999999999999999999999999999999999999999999999999998
No 3
>KOG3287|consensus
Probab=97.79 E-value=0.00021 Score=65.12 Aligned_cols=87 Identities=26% Similarity=0.439 Sum_probs=64.7
Q ss_pred CceeEEEccCceEEEEEEeecCCCEEEEEEE-eeC---ccEEEEEEEEecCCCCCcceeccccCccccccCcCCcCCCCC
Q psy6233 129 GDAIIKIGHGETVTVRVPTHEGGSCLYWEFC-TDH---YDLGFGIYFEWNKSPTNQVSVHVSETDDEMISDKDTESGRTG 204 (271)
Q Consensus 129 ~~~~i~V~~Get~tV~V~v~~~Gs~L~WeF~-Td~---yDIGFGV~fe~~~~~s~~vsv~v~eS~d~~~~~~~~e~g~~~ 204 (271)
++-.+.|.+|++--.-=|++ .|..|.-||. +++ -||+|-+.- | . |
T Consensus 34 ~dftv~ipAGk~eCf~Q~v~-~~~tle~eyQVi~G~GDl~i~Ftl~~----P----~-------------------G--- 82 (236)
T KOG3287|consen 34 YDFTVMIPAGKTECFYQPVP-QGATLEVEYQVIDGAGDLDIDFTLLN----P----A-------------------G--- 82 (236)
T ss_pred cceEEEecCCCceeeeeecc-CCeEEEEEEEEEecCCccceeeEEeC----C----C-------------------c---
Confidence 55688899999888777776 6688888887 454 477777761 1 0 1
Q ss_pred CCCCccccccCCCCceeeeeCeeeccccceecceEEe--ccceEEEEEEecCccceeeceeEEEE
Q psy6233 205 SQPRKTTKIVNRSPFSIISPIMRRDCHQEVYAGSHCY--PGEGVYLLKFDNSYSLWRSKTLYYRV 267 (271)
Q Consensus 205 ~~~~~~~~~~~~~~~~eIvP~~R~~sh~~~~~Gs~~~--~~~GvY~L~FDNSYS~~RsK~L~Y~V 267 (271)
..+++-+|- ..|.|.. .++|-|.|.||||||.+.+|.|++-+
T Consensus 83 ---------------~~lv~~q~k------~dg~ht~e~~e~GdY~~CfDNsFS~fs~K~Vffel 126 (236)
T KOG3287|consen 83 ---------------EVLVSDQRK------VDGVHTVEVTETGDYQVCFDNSFSTFSRKLVFFEL 126 (236)
T ss_pred ---------------cEEeecccc------cCceeEeeccCCcceEEEEcCccccccceEEEEEE
Confidence 255555543 2466664 58999999999999999999999987
No 4
>KOG1692|consensus
Probab=95.32 E-value=0.093 Score=47.31 Aligned_cols=33 Identities=24% Similarity=0.414 Sum_probs=30.2
Q ss_pred eEEeccceEEEEEEecCccceeeceeEEEEEEe
Q psy6233 238 SHCYPGEGVYLLKFDNSYSLWRSKTLYYRVYYT 270 (271)
Q Consensus 238 s~~~~~~GvY~L~FDNSYS~~RsK~L~Y~V~~t 270 (271)
++.+|.+|+|.+.|+|.+|=+.+|+|-+.|.+.
T Consensus 81 tF~a~~~G~Y~fCF~N~~s~mtpk~V~F~ihvg 113 (201)
T KOG1692|consen 81 TFTAPKKGTYTFCFSNKMSTMTPKTVMFTIHVG 113 (201)
T ss_pred EEEecCCceEEEEecCCCCCCCceEEEEEEEEe
Confidence 456899999999999999999999999999864
No 5
>KOG1693|consensus
Probab=93.79 E-value=0.48 Score=43.05 Aligned_cols=30 Identities=27% Similarity=0.479 Sum_probs=26.0
Q ss_pred eEEeccceEEEEEEecCccceeeceeEEEE
Q psy6233 238 SHCYPGEGVYLLKFDNSYSLWRSKTLYYRV 267 (271)
Q Consensus 238 s~~~~~~GvY~L~FDNSYS~~RsK~L~Y~V 267 (271)
.++..+.|+|.+.|+|.||-+..|.++...
T Consensus 85 ~f~ae~~G~Y~fCFsN~fstf~~Kiv~~~~ 114 (209)
T KOG1693|consen 85 LFKAEGKGEYTFCFSNEFSTFSHKIVYMDF 114 (209)
T ss_pred EEEEecceEEEEEecCccccccceEeeehh
Confidence 467788999999999999999999887654
No 6
>PF01105 EMP24_GP25L: emp24/gp25L/p24 family/GOLD; InterPro: IPR009038 The GOLD (for Golgi dynamics) domain is a protein module found in several eukaryotic Golgi and lipid-traffic proteins. It is typically between 90 and 150 amino acids long. Most of the size difference observed in the GOLD-domain superfamily is traceable to a single large low-complexity insert that is seen in some versions of the domain. With the exception of the p24 proteins, which have a simple architecture with the GOLD domain as their only globular domain, all other GOLD-domain proteins contain additional conserved globular domains. In these proteins, the GOLD domain co-occurs with lipid-, sterol- or fatty acid-binding domains such as PH, CRAL-TRIO, FYVE oxysterol binding- and acyl CoA-binding domains, suggesting that these proteins may interact with membranes. The GOLD domain can also be found associated with a RUN domain, which may have a role in the interaction of various proteins with cytoskeletal filaments. The GOLD domain is predicted to mediate diverse protein-protein interactions []. A secondary structure prediction for the GOLD domain reveals that it is likely to adopt a compact all-beta-fold structure with six to seven strands. Most of the sequence conservation is centred on the hydrophobic cores that support these predicted strands. The predicted secondary-structure elements and the size of the conserved core of the domain suggests that it may form a beta- sandwich fold with the strands arranged in two beta sheets stacked on each other []. Some proteins known to contain a GOLD domain are listed below: Eukaryotic proteins of the p24 family. Animal Sec14-like proteins. They are involved in secretion. Human Golgi resident protein GCP60. It interacts with the Golgi integral membrane protein Giantin. Yeast oxysterol-binding protein homologue 3 (OSH3). ; GO: 0006810 transport, 0016021 integral to membrane; PDB: 1P23_A 1M23_A.
Probab=83.96 E-value=0.32 Score=40.27 Aligned_cols=32 Identities=28% Similarity=0.373 Sum_probs=0.0
Q ss_pred eEEeccceEEEEEEecCccceee-ceeEEEEEE
Q psy6233 238 SHCYPGEGVYLLKFDNSYSLWRS-KTLYYRVYY 269 (271)
Q Consensus 238 s~~~~~~GvY~L~FDNSYS~~Rs-K~L~Y~V~~ 269 (271)
++..+++|.|.+.|+|+.+-+.. |++.+.+..
T Consensus 66 ~f~~~~~G~y~iCf~n~~~~~~~~~~v~~~~~~ 98 (183)
T PF01105_consen 66 SFTAKESGEYQICFDNSSSSFSPSKRVSFDIDV 98 (183)
T ss_dssp ---------------------------------
T ss_pred EEEeccCCCEEEEEEcCCCCccccEEEEEEEEE
Confidence 35567999999999999888877 888877653
No 7
>KOG0163|consensus
Probab=80.82 E-value=9.5 Score=41.30 Aligned_cols=30 Identities=13% Similarity=0.178 Sum_probs=16.3
Q ss_pred cccchhhhHHHHHHHHHHHHHhhhhHHHHH
Q psy6233 57 NEVSQETSIEEKKRRLIQDALNQQTYAQFR 86 (271)
Q Consensus 57 ~~~~~e~~~~e~~~~qi~~aln~qt~~qf~ 86 (271)
+.+|++++++|.-.++|..++-.|...+=.
T Consensus 965 K~eEeqr~~qee~e~~l~~e~q~qla~e~e 994 (1259)
T KOG0163|consen 965 KAEEEQRKAQEEEERRLALELQEQLAKEAE 994 (1259)
T ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHH
Confidence 444455566555556666666655544433
No 8
>KOG1029|consensus
Probab=78.05 E-value=4.5 Score=43.63 Aligned_cols=8 Identities=50% Similarity=0.596 Sum_probs=4.0
Q ss_pred HHHHHhhh
Q psy6233 73 IQDALNQQ 80 (271)
Q Consensus 73 i~~aln~q 80 (271)
+++-|||-
T Consensus 418 ~qem~~Qk 425 (1118)
T KOG1029|consen 418 RQEMLNQK 425 (1118)
T ss_pred HHHHHhhh
Confidence 44555544
No 9
>smart00110 C1Q Complement component C1q domain. Globular domain found in many collagens and eponymously in complement C1q. When part of full length proteins these domains form a 'bouquet' due to the multimerization of heterotrimers. The C1q fold is similar to that of tumour necrosis factor.
Probab=74.22 E-value=2.7 Score=35.27 Aligned_cols=24 Identities=25% Similarity=0.524 Sum_probs=21.6
Q ss_pred ccccceecceEEeccceEEEEEEe
Q psy6233 229 DCHQEVYAGSHCYPGEGVYLLKFD 252 (271)
Q Consensus 229 ~sh~~~~~Gs~~~~~~GvY~L~FD 252 (271)
..|..+..|.++||-||+|.+.|.
T Consensus 35 g~~yd~~TG~Ftcpv~GvY~F~f~ 58 (135)
T smart00110 35 QGHYDPRTGKFTCPVPGVYYFSYH 58 (135)
T ss_pred CCCccCCCCEEECeeceEEEEEEE
Confidence 456788899999999999999997
No 10
>KOG0817|consensus
Probab=67.59 E-value=3.6 Score=35.45 Aligned_cols=22 Identities=18% Similarity=0.361 Sum_probs=19.6
Q ss_pred CChHHHHHHHHHHHhhhCCCch
Q psy6233 1 MSSKDAMSNFIDLLDLKCILFK 22 (271)
Q Consensus 1 ~s~~~am~~fi~ll~~~cp~f~ 22 (271)
||+++||..||+++..++|-..
T Consensus 68 ~s~~eA~~~Yv~~~~~l~~~~~ 89 (142)
T KOG0817|consen 68 MSKEEAMEAYVEKVEELIPKYG 89 (142)
T ss_pred CCHHHHHHHHHHHHHHHHHHhh
Confidence 7999999999999999998543
No 11
>PF00386 C1q: C1q domain; InterPro: IPR001073 This entry represents the C-terminal domain of C1q. C1q is a subunit of the C1 enzyme complex that activates the serum complement system. C1q comprises 6 A, 6 B and 6 C chains. These share the same topology, each possessing a small, globular N-terminal domain, a collagen-like Gly/Pro-rich central region, and a conserved C-terminal region, the C1q domain []. The C1q protein is produced in collagen-producing cells and shows sequence and structural similarity to collagens VIII and X [, ]. This domain is also found in multimerin and EMILIN proteins.; PDB: 1O91_C 2JG8_D 2JG9_A 2WNV_A 2WNU_A 1PK6_A 4DOU_A 1C3H_C 1C28_C 2OII_A ....
Probab=65.93 E-value=7.5 Score=31.25 Aligned_cols=24 Identities=25% Similarity=0.394 Sum_probs=18.0
Q ss_pred ccccceecceEEeccceEEEEEEe
Q psy6233 229 DCHQEVYAGSHCYPGEGVYLLKFD 252 (271)
Q Consensus 229 ~sh~~~~~Gs~~~~~~GvY~L~FD 252 (271)
..+..+..|.++||.+|+|.+.|.
T Consensus 29 g~~yn~~tG~Ftap~~G~Y~F~~~ 52 (127)
T PF00386_consen 29 GNAYNPSTGIFTAPVPGVYFFSFT 52 (127)
T ss_dssp TS-EETTTTEEE-SS-EEEEEEEE
T ss_pred CCccEeecCEEecCCCCEEEEEEE
Confidence 446778899999999999999874
No 12
>PTZ00266 NIMA-related protein kinase; Provisional
Probab=58.48 E-value=22 Score=39.46 Aligned_cols=10 Identities=10% Similarity=-0.004 Sum_probs=4.5
Q ss_pred CCCchhHHHH
Q psy6233 18 CILFKPYIDA 27 (271)
Q Consensus 18 cp~f~~~v~a 27 (271)
|+.+..-++.
T Consensus 423 ~g~~g~r~ek 432 (1021)
T PTZ00266 423 NGHYGGRVDK 432 (1021)
T ss_pred cCccccccch
Confidence 4444444433
No 13
>PF09394 Inhibitor_I42: Chagasin family peptidase inhibitor I42; InterPro: IPR018990 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. Chagasin reversible inhibitor of papain-like cysteine proteases []. Chagasin has a beta-barrel structure, which is a unique variant of the immunoglobulin fold with homology to human CD8alpha [, ].; PDB: 2NQD_A 2NNR_A 2H7W_B 3E1Z_A 3CBK_B 3CBJ_B 2OUL_B 2FO8_A 2WGN_B 2C34_A ....
Probab=53.74 E-value=93 Score=23.35 Aligned_cols=34 Identities=21% Similarity=0.502 Sum_probs=26.8
Q ss_pred EEEccCceEEEEEEeecCCCEEEEEEEeeCccEEE
Q psy6233 133 IKIGHGETVTVRVPTHEGGSCLYWEFCTDHYDLGF 167 (271)
Q Consensus 133 i~V~~Get~tV~V~v~~~Gs~L~WeF~Td~yDIGF 167 (271)
|+|..|+++.|.++-. +++-..|.+..+..-+.+
T Consensus 1 I~v~~g~~~~I~L~~n-pstGY~W~~~~~~~~l~l 34 (92)
T PF09394_consen 1 ITVKVGDTFEIELPEN-PSTGYSWSLSSDSDGLQL 34 (92)
T ss_dssp -EEETTSEEEEEEEEB-CCGTBEEEECTSTTTEEE
T ss_pred CeecCCCEEEEEECCC-CCCCeEEEEecCCCeEEE
Confidence 6799999999999976 777899999985544554
No 14
>PF15236 CCDC66: Coiled-coil domain-containing protein 66
Probab=51.57 E-value=79 Score=27.78 Aligned_cols=27 Identities=15% Similarity=0.224 Sum_probs=11.4
Q ss_pred hhHHHHHHHHHHH-HHhhhhHHHHHHHH
Q psy6233 63 TSIEEKKRRLIQD-ALNQQTYAQFRAYA 89 (271)
Q Consensus 63 ~~~~e~~~~qi~~-aln~qt~~qf~~ya 89 (271)
++..|..++..++ +..+.|..-|...+
T Consensus 102 ~~E~E~~~~~~KEe~~~~k~~~l~e~~q 129 (157)
T PF15236_consen 102 QFEEEQRKQREKEEEQTRKTQELYEAMQ 129 (157)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444433332 44444444454433
No 15
>KOG1029|consensus
Probab=50.57 E-value=53 Score=35.89 Aligned_cols=18 Identities=28% Similarity=0.449 Sum_probs=7.4
Q ss_pred HHHHhhHHHHHHHHHHHH
Q psy6233 26 DAHVAQKEEENRKAREQI 43 (271)
Q Consensus 26 ~a~~~~~ee~~~~~~~~~ 43 (271)
+..++++|+.+++.++++
T Consensus 338 eqqqreree~eqkEreE~ 355 (1118)
T KOG1029|consen 338 EQQQREREEVEQKEREEE 355 (1118)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 333444444444433333
No 16
>PF01117 Aerolysin: Aerolysin toxin; InterPro: IPR005830 This family represents the pore forming lobe of aerolysin, and the related toxins haemolysin and the leukocidin S subunit. Aerolysin [] is a cytolytic toxin exported by Aeromonas hydrophila, a Gram-negative bacterium associated with diarrhoeal diseases and deep wound infections []. The mature toxin binds to eukaryotic cells and aggregates to form holes (approximately 3 nm in diameter) leading to the destruction of the membrane permeability barrier and osmotic lysis. The structure of proaerolysin has been determined to 2.8A resolution and shows the protoxin to adopt a novel fold []. Images of an aerolysin oligomer derived from electron microscopy have helped to construct a model of the protein and to outline a mechanism by which it might insert into lipid bilayers to form ion channels [].; GO: 0009405 pathogenesis, 0005576 extracellular region; PDB: 3G4N_B 3C0N_B 3C0O_B 1PRE_A 1Z52_B 3C0M_A 3G4O_A.
Probab=41.56 E-value=33 Score=33.82 Aligned_cols=44 Identities=18% Similarity=0.335 Sum_probs=36.4
Q ss_pred eeEEEccCceEEEEEEeecCCCEEEEEEEee-CccEEEEEEEEec
Q psy6233 131 AIIKIGHGETVTVRVPTHEGGSCLYWEFCTD-HYDLGFGIYFEWN 174 (271)
Q Consensus 131 ~~i~V~~Get~tV~V~v~~~Gs~L~WeF~Td-~yDIGFGV~fe~~ 174 (271)
..++|+++..++|+|.+...-.-+-+.|.++ +|||.|-=|+.|.
T Consensus 185 ~~vtVPP~s~~~V~i~~~rs~i~iPYt~~~~i~Ydv~f~GFlRwg 229 (360)
T PF01117_consen 185 YRVTVPPHSKVPVRIELYRSTIDIPYTAKADISYDVEFTGFLRWG 229 (360)
T ss_dssp E--EB-TTEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEB-TT
T ss_pred EeeccCCCceEEEEEEEEeeEEEeeEEEEEEEEEEEEEeeeEeec
Confidence 4689999999999999988788899999987 9999999999986
No 17
>KOG2891|consensus
Probab=41.03 E-value=65 Score=31.33 Aligned_cols=17 Identities=12% Similarity=0.151 Sum_probs=10.0
Q ss_pred hhcccccchhhhHHHHH
Q psy6233 53 QVNGNEVSQETSIEEKK 69 (271)
Q Consensus 53 ~~~~~~~~~e~~~~e~~ 69 (271)
++++++.++++|..|.|
T Consensus 391 eerrkqkeeeklk~e~q 407 (445)
T KOG2891|consen 391 EERRKQKEEEKLKAEEQ 407 (445)
T ss_pred HHHHhhhHHHHHHHHHH
Confidence 34455556677766555
No 18
>KOG2412|consensus
Probab=39.81 E-value=64 Score=33.67 Aligned_cols=12 Identities=8% Similarity=-0.197 Sum_probs=5.2
Q ss_pred HHHHHHhhhCCC
Q psy6233 9 NFIDLLDLKCIL 20 (271)
Q Consensus 9 ~fi~ll~~~cp~ 20 (271)
.||.-+.+.|.+
T Consensus 200 ~~i~~~~q~~eq 211 (591)
T KOG2412|consen 200 QAIQREKQRKEQ 211 (591)
T ss_pred HHHHHHHHHHHH
Confidence 344444444443
No 19
>KOG0163|consensus
Probab=39.33 E-value=1.3e+02 Score=33.03 Aligned_cols=8 Identities=38% Similarity=0.974 Sum_probs=3.3
Q ss_pred eEEEEEEe
Q psy6233 245 GVYLLKFD 252 (271)
Q Consensus 245 GvY~L~FD 252 (271)
|.+...||
T Consensus 1170 gmWyaHFd 1177 (1259)
T KOG0163|consen 1170 GMWYAHFD 1177 (1259)
T ss_pred ceEEEecC
Confidence 44444444
No 20
>cd05840 SPBC215_ISWI_like The PWWP domain is a component of the S. pombe hypothetical protein SPBC215, as well as ISWI complex protein 4. The ISWI (imitation switch) proteins are ATPases responsible for chromatin remodeling in eukaryotes, and SPBC215 is proposed to also bind chromatin. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=34.33 E-value=22 Score=28.09 Aligned_cols=23 Identities=26% Similarity=0.381 Sum_probs=19.0
Q ss_pred ccceEEEEEE--ecCccceeeceeE
Q psy6233 242 PGEGVYLLKF--DNSYSLWRSKTLY 264 (271)
Q Consensus 242 ~~~GvY~L~F--DNSYS~~RsK~L~ 264 (271)
.++|.|.+.| |++|+|...+.|.
T Consensus 38 ~~~~~~~V~FFg~~~~~Wv~~~~l~ 62 (93)
T cd05840 38 KNKRTYPVMFFPDGDYYWVPNKDLK 62 (93)
T ss_pred CCCCeEEEEEeCCCcEEEEChhhcc
Confidence 4688899998 8999999887763
No 21
>KOG3006|consensus
Probab=33.78 E-value=27 Score=29.73 Aligned_cols=20 Identities=25% Similarity=0.481 Sum_probs=16.5
Q ss_pred cceEEEEEEecCccceeecee
Q psy6233 243 GEGVYLLKFDNSYSLWRSKTL 263 (271)
Q Consensus 243 ~~GvY~L~FDNSYS~~RsK~L 263 (271)
.||+|.|.||-. ++|+++-|
T Consensus 76 ~~GtYr~~~dT~-~Y~~a~gv 95 (132)
T KOG3006|consen 76 IPGTYRLVFDTE-PYYKALGV 95 (132)
T ss_pred ccceEEEEEecc-cccccCCc
Confidence 489999999976 77888764
No 22
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=33.22 E-value=31 Score=23.48 Aligned_cols=16 Identities=19% Similarity=0.451 Sum_probs=12.5
Q ss_pred eEEeccceEEEEEEec
Q psy6233 238 SHCYPGEGVYLLKFDN 253 (271)
Q Consensus 238 s~~~~~~GvY~L~FDN 253 (271)
......||+|.|.|.=
T Consensus 6 nW~FT~PG~Y~l~~~a 21 (41)
T TIGR03769 6 NWVFTKPGTYTLTVQA 21 (41)
T ss_pred ceeeCCCeEEEEEEEE
Confidence 3456789999999953
No 23
>PF06868 DUF1257: Protein of unknown function (DUF1257); InterPro: IPR009666 This family represents Ycf35, which is encoded in algal chloroplast and in cyanobacteria. The function of these proteins are unknown. As the family is exclusively found in phototrophic organisms it may play a role in photosynthesis.
Probab=32.72 E-value=61 Score=26.27 Aligned_cols=30 Identities=20% Similarity=0.344 Sum_probs=20.6
Q ss_pred eeEEEccCceEEEEEEeecCCCEEEEEEEeeCccEEEEEE
Q psy6233 131 AIIKIGHGETVTVRVPTHEGGSCLYWEFCTDHYDLGFGIY 170 (271)
Q Consensus 131 ~~i~V~~Get~tV~V~v~~~Gs~L~WeF~Td~yDIGFGV~ 170 (271)
..|.-.+|.++.+.+-+..++ +|||||-.-
T Consensus 15 ~~vrgy~gq~~~aelvi~~~~----------~~DIGF~wn 44 (105)
T PF06868_consen 15 QPVRGYRGQTQKAELVIKQNN----------GYDIGFRWN 44 (105)
T ss_pred eeccccCCCeEEEEEEEEcCC----------CccEEEEEC
Confidence 455566788888877776443 788887543
No 24
>CHL00193 ycf35 Ycf35; Provisional
Probab=31.20 E-value=83 Score=26.52 Aligned_cols=29 Identities=28% Similarity=0.475 Sum_probs=21.2
Q ss_pred eeEEEccCceEEEEEEeecCCCEEEEEEEeeCccEEEEE
Q psy6233 131 AIIKIGHGETVTVRVPTHEGGSCLYWEFCTDHYDLGFGI 169 (271)
Q Consensus 131 ~~i~V~~Get~tV~V~v~~~Gs~L~WeF~Td~yDIGFGV 169 (271)
..+...+|.++.+.+-+..+ .+|||||..
T Consensus 33 ~~vrgy~gq~~~a~lvi~~~----------~~~diGf~~ 61 (128)
T CHL00193 33 QVIKGYNGQTHNADLVIKQS----------NNYDIGFVW 61 (128)
T ss_pred ceeeccCCCeEEEEEEEEcC----------CCCceeEee
Confidence 46777888888877777655 377887765
No 25
>COG3081 Nucleoid-associated protein [General function prediction only]
Probab=29.31 E-value=1.1e+02 Score=29.52 Aligned_cols=69 Identities=25% Similarity=0.366 Sum_probs=47.1
Q ss_pred HHHHHHHhhhhHHHHHHHHHhcCCCChhhhhhhhhccCCCcccccchhhhHHhhhccCCceeEEEccCceEEEEEEeecC
Q psy6233 71 RLIQDALNQQTYAQFRAYAEDQFPGNPEQQAVLVQIGPANMWTLTNIKSFKEHIRKEGGDAIIKIGHGETVTVRVPTHEG 150 (271)
Q Consensus 71 ~qi~~aln~qt~~qf~~ya~~q~P~~~eqq~~li~~vP~s~wt~~~i~~fk~~i~~~~~~~~i~V~~Get~tV~V~v~~~ 150 (271)
.+-|+-||.+.+..|- |++-|-=-+ .+||---|-.....| +|+|+-++|.+..+-=
T Consensus 246 S~~lp~~~e~~F~~ft--~eq~YgLee--------s~PadrsTl~qltkf--------------~Gsg~GltisFdA~LL 301 (335)
T COG3081 246 SKELPGLNEVAFYDFT--AEKGYGLEE--------SFPADRSTLRQLTKF--------------AGSGGGLTISFDAMLL 301 (335)
T ss_pred HhhCCCcCcchHHHHH--HhcccCCCc--------cCCccHHHHHHHHHh--------------cCCCCceEEeehhhhh
Confidence 4567778887665553 444554333 367654443333222 7899999999999989
Q ss_pred CCEEEEEEEeeCc
Q psy6233 151 GSCLYWEFCTDHY 163 (271)
Q Consensus 151 Gs~L~WeF~Td~y 163 (271)
|+.++|.=+||.-
T Consensus 302 g~rvfwD~~tDtL 314 (335)
T COG3081 302 GERVFWDPATDTL 314 (335)
T ss_pred CCeeeecccccee
Confidence 9999999999864
No 26
>TIGR02568 LcrE type III secretion regulator YopN/LcrE/InvE/MxiC. This protein is found in type III secretion operons and, in Yersinia is localized to the cell surface and is involved in the Low-Calicium Response (LCR), possibly by sensing the calcium concentration. In Salmonella, the gene is known as InvE and is believed to perform an essential role in the secretion process and interacts with the proteins SipBCD and SicA.//Altered name to reflect regulatory role. Added GO and role IDs. Negative regulation of type III secretion in Y pestis is mediated in part by a multiprotein complex that has been proposed to act as a physical impediment to type III secretion by blocking the entrance to the secretion apparatus prior to contact with mammalian cells. This complex is composed of YopN, its heterodimeric secretion chaperone SycN-YscB, and TyeA. PubMed: 15701523
Probab=29.05 E-value=1.8e+02 Score=26.52 Aligned_cols=36 Identities=28% Similarity=0.441 Sum_probs=28.7
Q ss_pred HHHHHHHHHHhh--hhHHHHHHHHHhcCCCChhhhhhhh
Q psy6233 68 KKRRLIQDALNQ--QTYAQFRAYAEDQFPGNPEQQAVLV 104 (271)
Q Consensus 68 ~~~~qi~~aln~--qt~~qf~~ya~~q~P~~~eqq~~li 104 (271)
++...++..|+. ++..|+..|+.+.||+ |.+|-.++
T Consensus 62 ~k~~~l~~~l~~~~~~~~~ll~~l~~~f~D-~s~~~laL 99 (240)
T TIGR02568 62 QKLKELRAELSGQAAGLEQLLALARGAFPD-PSDQALAL 99 (240)
T ss_pred HHHHHHHHHHcccCCCHHHHHHHHHhhCCC-hHHHHHHH
Confidence 455677777786 8999999999999995 77676666
No 27
>PLN03086 PRLI-interacting factor K; Provisional
Probab=27.81 E-value=1.6e+02 Score=30.83 Aligned_cols=20 Identities=20% Similarity=0.018 Sum_probs=13.1
Q ss_pred CceEEEEEEeecCCCEEEEE
Q psy6233 138 GETVTVRVPTHEGGSCLYWE 157 (271)
Q Consensus 138 Get~tV~V~v~~~Gs~L~We 157 (271)
|..++|..-.-+.|+.+.-.
T Consensus 171 ~~~v~v~~~~Lpkgt~vklq 190 (567)
T PLN03086 171 VPLVEVRYIWLPKGTYAKLQ 190 (567)
T ss_pred CCeEEEEEeecCCCCEEEEe
Confidence 55677776666777776544
No 28
>PF13540 RCC1_2: Regulator of chromosome condensation (RCC1) repeat; PDB: 3QI0_D 1JTD_B 3QHY_B.
Probab=27.71 E-value=86 Score=19.42 Aligned_cols=24 Identities=21% Similarity=0.509 Sum_probs=18.8
Q ss_pred eEEEccCceEEEEEEeecCCCEEEEE
Q psy6233 132 IIKIGHGETVTVRVPTHEGGSCLYWE 157 (271)
Q Consensus 132 ~i~V~~Get~tV~V~v~~~Gs~L~We 157 (271)
++.|..|..|++-+. ..|.+..|-
T Consensus 1 V~~ia~G~~ht~al~--~~g~v~~wG 24 (30)
T PF13540_consen 1 VVQIACGGYHTCALT--SDGEVYCWG 24 (30)
T ss_dssp EEEEEEESSEEEEEE---TTEEEEEE
T ss_pred CEEEEecCCEEEEEE--cCCCEEEEc
Confidence 467889999998774 579999996
No 29
>PF07946 DUF1682: Protein of unknown function (DUF1682); InterPro: IPR012879 The members of this family are all hypothetical eukaryotic proteins of unknown function. One member (Q920S6 from SWISSPROT) is described as being an adipocyte-specific protein, but no evidence of this was found.
Probab=27.58 E-value=77 Score=30.19 Aligned_cols=11 Identities=18% Similarity=0.169 Sum_probs=4.8
Q ss_pred HHHHHHHHhhh
Q psy6233 7 MSNFIDLLDLK 17 (271)
Q Consensus 7 m~~fi~ll~~~ 17 (271)
|.-++.|.|.+
T Consensus 237 ~~~v~~l~D~~ 247 (321)
T PF07946_consen 237 LKLVFYLIDKL 247 (321)
T ss_pred HHHHHHHHHHh
Confidence 33444444444
No 30
>PF00855 PWWP: PWWP domain; InterPro: IPR000313 Upon characterisation of WHSC1, a gene mapping to the Wolf-Hirschhornsyndrome critical region and at its C terminus similar to the Drosophila melanogaster ASH1/trithorax group proteins, a novel protein domain designated PWWP domain was identified []. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. It is present in proteins of nuclear origin and plays a role in cell growth and differentiation. Due to its position, the composition of amino acids close to the PWWP motif and the pattern of other domains present it has been suggested that the domain is involved in protein-protein interactions [].; PDB: 3LYI_B 2L89_A 2NLU_A 1RI0_A 1KHC_A 3QKJ_C 2DAQ_A 1N27_A 3PFS_B 3QJ6_A ....
Probab=26.80 E-value=50 Score=24.48 Aligned_cols=25 Identities=24% Similarity=0.448 Sum_probs=19.1
Q ss_pred eccceEEEEEE--ecCccceeeceeEE
Q psy6233 241 YPGEGVYLLKF--DNSYSLWRSKTLYY 265 (271)
Q Consensus 241 ~~~~GvY~L~F--DNSYS~~RsK~L~Y 265 (271)
-..++.|++.| ||+|+|...+.|..
T Consensus 31 ~~~~~~~~V~Ffg~~~~~wv~~~~i~~ 57 (86)
T PF00855_consen 31 KRKDGHVLVRFFGDNDYAWVKPSNIKP 57 (86)
T ss_dssp SSSSTEEEEEETTTTEEEEEEGGGEEE
T ss_pred CCCCCEEEEEecCCCCEEEECHHHhhC
Confidence 45577778887 67899999988763
No 31
>smart00293 PWWP domain with conserved PWWP motif. conservation of Pro-Trp-Trp-Pro residues
Probab=26.57 E-value=55 Score=23.65 Aligned_cols=22 Identities=27% Similarity=0.407 Sum_probs=17.3
Q ss_pred cceEEEEEE--ecCccceeeceeE
Q psy6233 243 GEGVYLLKF--DNSYSLWRSKTLY 264 (271)
Q Consensus 243 ~~GvY~L~F--DNSYS~~RsK~L~ 264 (271)
.++.|.+.| ||+|+|...+.|.
T Consensus 37 ~~~~~~V~Ffg~~~~awv~~~~l~ 60 (63)
T smart00293 37 FENLYPVLFFGDKDTAWISSSKLF 60 (63)
T ss_pred CCCEEEEEEeCCCCEEEECcccee
Confidence 467788887 7789999888774
No 32
>KOG3054|consensus
Probab=25.76 E-value=1.5e+02 Score=28.21 Aligned_cols=16 Identities=19% Similarity=0.158 Sum_probs=7.6
Q ss_pred hcccccchhhhHHHHH
Q psy6233 54 VNGNEVSQETSIEEKK 69 (271)
Q Consensus 54 ~~~~~~~~e~~~~e~~ 69 (271)
+..++.+++|+++|+.
T Consensus 139 E~erKkdEeR~~eEae 154 (299)
T KOG3054|consen 139 EAERKKDEERLAEEAE 154 (299)
T ss_pred HHHHhhhHHHHHHHHH
Confidence 3334444555555443
No 33
>cd05838 WHSC1_related The PWWP domain was first identified in the WHSC1 (Wolf-Hirschhorn syndrome candidate 1) protein, a protein implicated in Wolf-Hirschhorn syndrome (WHS). When translocated, WHSC1 plays a role in lymphoid multiple myeloma (MM) disease, also known as plasmacytoma. WHCS1 proteins typically contain two copies of the PWWP domain. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=25.15 E-value=44 Score=26.38 Aligned_cols=23 Identities=13% Similarity=0.294 Sum_probs=18.0
Q ss_pred ccceEEEEEE--ecCccceeeceeE
Q psy6233 242 PGEGVYLLKF--DNSYSLWRSKTLY 264 (271)
Q Consensus 242 ~~~GvY~L~F--DNSYS~~RsK~L~ 264 (271)
..+|.|.+.| ||+|.|+..+.|.
T Consensus 36 ~~~~~~~V~Ffgs~~y~Wv~~~~l~ 60 (95)
T cd05838 36 HCIGEFCVMFFGTHDYYWVHRGRVF 60 (95)
T ss_pred CCCCeEEEEEeCCCCEEEecccccc
Confidence 3457888888 7779999988774
No 34
>KOG1144|consensus
Probab=25.09 E-value=1.3e+02 Score=33.16 Aligned_cols=7 Identities=29% Similarity=0.430 Sum_probs=2.8
Q ss_pred HHHHhhh
Q psy6233 74 QDALNQQ 80 (271)
Q Consensus 74 ~~aln~q 80 (271)
+.||+.-
T Consensus 290 ~~a~aea 296 (1064)
T KOG1144|consen 290 EAALAEA 296 (1064)
T ss_pred HHHHHHH
Confidence 3444433
No 35
>cd05162 PWWP The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes. The function of the PWWP domain is still not known precisely; however, based on the fact that other regions of PWWP-domain proteins are responsible for nuclear localization and DNA-binding, is likely that the PWWP domain acts as a site for protein-protein binding interactions, influencing chromatin remodeling and thereby regulating transcriptional processes. Some PWWP-domain proteins have been linked to cancer or other diseases; some are known to function as growth factors.
Probab=24.32 E-value=58 Score=24.63 Aligned_cols=22 Identities=27% Similarity=0.454 Sum_probs=16.5
Q ss_pred cceEEEEEE--ecCccceeeceeE
Q psy6233 243 GEGVYLLKF--DNSYSLWRSKTLY 264 (271)
Q Consensus 243 ~~GvY~L~F--DNSYS~~RsK~L~ 264 (271)
.++.|.|.| +|+|+|...+.|.
T Consensus 36 ~~~~~~V~Ffg~~~~~wv~~~~l~ 59 (87)
T cd05162 36 KEGKVLVLFFGDKTFAWVGAERLK 59 (87)
T ss_pred CCCEEEEEEeCCCcEEEeCcccee
Confidence 356677777 6779999988874
No 36
>PRK09510 tolA cell envelope integrity inner membrane protein TolA; Provisional
Probab=24.05 E-value=2.4e+02 Score=28.16 Aligned_cols=18 Identities=33% Similarity=0.540 Sum_probs=10.8
Q ss_pred cCceEEEEEEeecCCCEE
Q psy6233 137 HGETVTVRVPTHEGGSCL 154 (271)
Q Consensus 137 ~Get~tV~V~v~~~Gs~L 154 (271)
.|.+.+|+|.+.+.|.++
T Consensus 324 ~gktc~VrI~LapDG~V~ 341 (387)
T PRK09510 324 AGKTCTLRIKLAPDGTLL 341 (387)
T ss_pred cCceEEEEEEEcCCCcEE
Confidence 355666666666666544
No 37
>PHA03051 Hypothetical protein; Provisional
Probab=23.90 E-value=38 Score=26.44 Aligned_cols=14 Identities=36% Similarity=0.705 Sum_probs=10.4
Q ss_pred hhhCCCchhHHHHH
Q psy6233 15 DLKCILFKPYIDAH 28 (271)
Q Consensus 15 ~~~cp~f~~~v~a~ 28 (271)
|+.||||+..+..-
T Consensus 33 nsrcplfksliska 46 (88)
T PHA03051 33 NSRCPLFKSLISKA 46 (88)
T ss_pred CccChhHHHHHHHH
Confidence 57899998876543
No 38
>PF09805 Nop25: Nucleolar protein 12 (25kDa); InterPro: IPR019186 Nop12 is a novel nucleolar protein required for pre-large subunit rRNA processing and in yeast normal rates of cell growth at low temperatures [].
Probab=23.63 E-value=1.8e+02 Score=24.61 Aligned_cols=9 Identities=22% Similarity=0.527 Sum_probs=4.5
Q ss_pred hhHHH-HHHh
Q psy6233 22 KPYID-AHVA 30 (271)
Q Consensus 22 ~~~v~-a~~~ 30 (271)
+.||+ =||+
T Consensus 21 ~eyLTGFhKR 30 (137)
T PF09805_consen 21 REYLTGFHKR 30 (137)
T ss_pred HHHHHHHHHH
Confidence 34665 3444
No 39
>PF02337 Gag_p10: Retroviral GAG p10 protein; InterPro: IPR003322 Retroviral matrix proteins (or major core proteins) are components of envelope-associated capsids, which line the inner surface of virus envelopes and are associated with viral membranes []. Matrix proteins are produced as part of Gag precursor polyproteins. During viral maturation, the Gag polyprotein is cleaved into major structural proteins by the viral protease, yielding the matrix (MA), capsid (CA), nucleocapsid (NC), and some smaller peptides. Gag-derived proteins govern the entire assembly and release of the virus particles, with matrix proteins playing key roles in Gag stability, capsid assembly, transport and budding. Although matrix proteins from different retroviruses appear to perform similar functions and can have similar structural folds, their primary sequences can be very different. This entry represents matrix proteins from beta-retroviruses such as Mason-Pfizer monkey virus (MPMV) (Simian Mason-Pfizer virus) and Mouse mammary tumor virus (MMTV) [, ]. This entry also identifies matrix proteins from several eukaryotic endogenous retroviruses, which arise when one or more copies of the retroviral genome becomes integrated into the host genome [].; GO: 0005198 structural molecule activity, 0019028 viral capsid; PDB: 2F77_X 2F76_X.
Probab=23.55 E-value=70 Score=25.57 Aligned_cols=18 Identities=28% Similarity=0.660 Sum_probs=14.3
Q ss_pred HHHHHHHHHHhhhCCCch
Q psy6233 5 DAMSNFIDLLDLKCILFK 22 (271)
Q Consensus 5 ~am~~fi~ll~~~cp~f~ 22 (271)
.-...|+.-++++||-|.
T Consensus 28 ~~L~~f~~~i~~~~PWF~ 45 (90)
T PF02337_consen 28 KDLINFLSFIDKVCPWFP 45 (90)
T ss_dssp HHHHHHHHHHHHHTT-SS
T ss_pred HHHHHHHHHHHHhCCCCC
Confidence 345789999999999995
No 40
>KOG2357|consensus
Probab=23.36 E-value=2.2e+02 Score=28.92 Aligned_cols=17 Identities=0% Similarity=-0.293 Sum_probs=9.0
Q ss_pred HhhhCCCchhHHHHHHh
Q psy6233 14 LDLKCILFKPYIDAHVA 30 (271)
Q Consensus 14 l~~~cp~f~~~v~a~~~ 30 (271)
+.+.|+.++-.-++.++
T Consensus 365 k~kt~~~RQ~~~e~~~K 381 (440)
T KOG2357|consen 365 KAKTDKNRQRVEEEFLK 381 (440)
T ss_pred HhhhhhhHHHHHHHHHH
Confidence 34555555555555554
No 41
>PLN03086 PRLI-interacting factor K; Provisional
Probab=22.37 E-value=1.9e+02 Score=30.40 Aligned_cols=13 Identities=23% Similarity=0.399 Sum_probs=6.7
Q ss_pred HHHHhhHHHHHHH
Q psy6233 26 DAHVAQKEEENRK 38 (271)
Q Consensus 26 ~a~~~~~ee~~~~ 38 (271)
.+.|.++|.++|+
T Consensus 8 ~~~~~~~~~~~~~ 20 (567)
T PLN03086 8 AREKLEREQRERK 20 (567)
T ss_pred HHHHHHHHHHHHH
Confidence 4445555555554
No 42
>PF03531 SSrecog: Structure-specific recognition protein (SSRP1); InterPro: IPR000969 Human structure-specific recognition protein, SSRP1, [] binds specifically to DNA modified with the anti-cancer drug cisplatin. An 81kDa protein is predicted, containing several highly-charged domains and a stretch of 75 residues that share 47% identity with a portion of the high mobility group (HMG) protein HMG1. This HMG box probably constitutes the structure recognition element for cisplatin-modified DNA, the probable recognition motif being the local duplex unwinding and bending that occurs on formation of intra-strand cross-links []. SSRP1 is the human homologue of a recently identified mouse protein that binds to recombination signal sequences []. These sequences have been postulated to form stem-loop structures, further implicating local bends and unwinding in DNA as a recognition target for HMG-box proteins. A Drosophila melanogaster cDNA encoding an HMG-box-containing protein has also been isolated [, ]. This protein shares 50% sequence identity with human SSRP1. In vitro binding studies using Drosophila SSRP showed that the protein binds to single-stranded DNA and RNA, with highest affinity for nucleotides G and U. Comparison of the predicted amino acid sequences among SSRP family members reveals 48% identity, with structural conservation in the C terminus of the HMG box, as well as domains of highly charged residues. The most highly conserved regions lie in the poorly understood N terminus, suggesting that this portion of the protein is critical for its function []. This entry contains Pob3 Q04636 from SWISSPROT, which is a subunit of the heterodimeric yeast FACT complex (Spt16p (IPR013953 from INTERPRO)-Pob3p) []. The FACT complex facilitates RNA Polymerase II transcription elongation through nucleosomes by destabilising and then reassembling nucleosome structure [, ].; GO: 0003677 DNA binding, 0005634 nucleus; PDB: 3F5R_A 2GCL_B 2GCJ_D.
Probab=21.57 E-value=1.3e+02 Score=27.64 Aligned_cols=36 Identities=19% Similarity=0.454 Sum_probs=29.9
Q ss_pred ecceEEeccceEEEEEEecCccceeeceeEEEEEEe
Q psy6233 235 YAGSHCYPGEGVYLLKFDNSYSLWRSKTLYYRVYYT 270 (271)
Q Consensus 235 ~~Gs~~~~~~GvY~L~FDNSYS~~RsK~L~Y~V~~t 270 (271)
..+-......|.|-+.+=-+|=.+|.|+.-|+|.|+
T Consensus 139 f~~i~~ltPRGRydi~~y~~~lrl~GktyDykI~y~ 174 (222)
T PF03531_consen 139 FEDILCLTPRGRYDIEMYPTFLRLHGKTYDYKIQYS 174 (222)
T ss_dssp EEEEEEEETTEEEEEEE-SSEEEEEESSBEEEEEGG
T ss_pred cccccccccccccccccccccccccccccccccccc
Confidence 344455677899999999999999999999999986
No 43
>KOG2701|consensus
Probab=21.19 E-value=2.4e+02 Score=29.86 Aligned_cols=27 Identities=19% Similarity=0.171 Sum_probs=22.9
Q ss_pred HHHHHHHHHHHhhhCCCchhHHHHHHh
Q psy6233 4 KDAMSNFIDLLDLKCILFKPYIDAHVA 30 (271)
Q Consensus 4 ~~am~~fi~ll~~~cp~f~~~v~a~~~ 30 (271)
.+=|..||.+||..|..|..|+-.-+-
T Consensus 501 K~Y~~keiSylndi~t~lsd~~~ka~~ 527 (608)
T KOG2701|consen 501 KDYREKEISYLNDICTQLSDWMWKAKL 527 (608)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhccchh
Confidence 467999999999999999999875443
No 44
>KOG2880|consensus
Probab=21.16 E-value=2.1e+02 Score=28.72 Aligned_cols=34 Identities=24% Similarity=0.314 Sum_probs=23.7
Q ss_pred HHHHHHhhhCCCchhHHHHHHhhHHHHHHHHHHHH
Q psy6233 9 NFIDLLDLKCILFKPYIDAHVAQKEEENRKAREQI 43 (271)
Q Consensus 9 ~fi~ll~~~cp~f~~~v~a~~~~~ee~~~~~~~~~ 43 (271)
.||.|.-..||--.+|= |.|.+++.=-++.++.+
T Consensus 60 ry~tLfiEkipkHrDy~-s~k~ek~d~~~klk~~~ 93 (424)
T KOG2880|consen 60 RYITLFIEKIPKHRDYR-SVKPEKEDIRKKLKEEA 93 (424)
T ss_pred HHHHHHHHhcccCcchh-hhchhHHHHHHHHHHHh
Confidence 46666667777778887 77777777666665554
Done!