Query psy16714
Match_columns 68
No_of_seqs 105 out of 549
Neff 7.1
Searched_HMMs 46136
Date Fri Aug 16 20:08:11 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16714.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16714hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3589|consensus 99.8 8.7E-20 1.9E-24 119.3 7.8 61 8-68 84-146 (221)
2 smart00315 RGS Regulator of G 99.8 4.5E-19 9.7E-24 103.7 7.7 63 5-68 3-65 (118)
3 PF00615 RGS: Regulator of G p 99.7 2.4E-17 5.3E-22 95.4 8.8 64 4-68 2-65 (118)
4 PF09128 RGS-like: Regulator o 98.3 3.3E-06 7.1E-11 54.9 6.6 62 4-67 8-69 (188)
5 KOG3590|consensus 97.6 0.00014 3E-09 52.8 5.2 39 4-43 115-153 (602)
6 KOG3590|consensus 97.1 0.0012 2.6E-08 48.0 4.9 58 9-66 322-386 (602)
7 KOG0986|consensus 97.0 0.0019 4.2E-08 47.5 5.6 57 9-66 58-115 (591)
8 PF15182 OTOS: Otospiralin 86.1 1.6 3.5E-05 24.0 3.4 30 31-60 25-54 (69)
9 PF02184 HAT: HAT (Half-A-TPR) 80.7 1.3 2.9E-05 21.0 1.5 15 50-64 3-17 (32)
10 PF11467 LEDGF: Lens epitheliu 62.0 14 0.00031 21.9 3.2 18 44-61 59-76 (106)
11 COG4790 EscR Type III secretor 60.8 13 0.00027 24.8 3.0 32 10-41 91-122 (214)
12 PF05528 Coronavirus_5: Corona 58.6 6.7 0.00014 22.2 1.3 24 23-46 54-77 (82)
13 PRK10803 tol-pal system protei 52.2 71 0.0015 21.5 6.4 50 12-66 160-214 (263)
14 PF10774 DUF4226: Domain of un 47.9 16 0.00036 21.9 1.9 18 9-26 67-84 (112)
15 COG4133 CcmA ABC-type transpor 45.8 13 0.00028 24.8 1.3 18 23-40 87-105 (209)
16 PF09279 EF-hand_like: Phospho 41.1 39 0.00084 18.2 2.6 18 14-31 17-34 (83)
17 PF14894 Lsm_C: Lsm C-terminal 40.4 28 0.00061 19.1 1.9 16 48-63 46-61 (64)
18 PF11163 DUF2947: Protein of u 34.6 44 0.00095 21.3 2.4 18 50-67 33-50 (153)
19 PF09764 Nt_Gln_amidase: N-ter 31.9 35 0.00076 22.2 1.7 17 24-42 4-20 (184)
20 PF02671 PAH: Paired amphipath 30.7 47 0.001 16.1 1.7 19 4-22 28-46 (47)
21 PF14328 DUF4385: Domain of un 29.2 47 0.001 21.0 1.8 17 46-62 47-63 (145)
22 PF10312 Cactin_mid: Conserved 28.4 88 0.0019 20.3 3.1 24 16-39 82-105 (191)
23 PF12725 DUF3810: Protein of u 28.3 2.1E+02 0.0045 19.9 5.3 49 10-66 251-302 (318)
24 PF01193 RNA_pol_L: RNA polyme 27.9 32 0.0007 18.1 0.8 9 3-11 15-23 (66)
25 PF10875 DUF2670: Protein of u 27.9 40 0.00087 20.9 1.3 18 2-19 3-20 (139)
26 KOG2005|consensus 27.4 2.1E+02 0.0046 22.9 5.3 49 12-60 204-252 (878)
27 PHA02901 virus redox protein; 27.0 1.1E+02 0.0024 17.2 2.9 21 3-23 34-54 (75)
28 PF03096 Ndr: Ndr family; Int 26.9 1.6E+02 0.0034 20.5 4.2 29 10-38 177-206 (283)
29 PF07631 PSD4: Protein of unkn 26.8 78 0.0017 19.1 2.5 53 4-66 42-106 (128)
30 PF05288 Pox_A3L: Poxvirus A3L 26.6 1.2E+02 0.0027 16.8 3.0 22 3-24 29-50 (70)
31 PHA02683 ORF078 thioredoxin-li 26.6 1.1E+02 0.0023 17.3 2.8 21 3-23 34-54 (75)
32 PF10118 Metal_hydrol: Predict 26.6 2E+02 0.0044 19.2 5.9 63 2-66 45-112 (253)
33 PF05843 Suf: Suppressor of fo 25.9 2.1E+02 0.0045 19.1 5.1 49 10-63 50-99 (280)
34 KOG0063|consensus 25.9 93 0.002 23.7 3.1 31 13-43 495-525 (592)
35 PHA03056 putative myristoylate 24.8 1.2E+02 0.0026 19.2 3.0 30 36-65 70-99 (165)
36 KOG2348|consensus 24.2 45 0.00098 25.4 1.3 17 50-66 375-391 (667)
37 PRK13722 lytic transglycosylas 24.1 1.2E+02 0.0026 19.5 3.0 54 8-62 85-144 (169)
38 PF03134 TB2_DP1_HVA22: TB2/DP 23.9 56 0.0012 18.2 1.4 14 50-63 79-92 (94)
39 PF07956 DUF1690: Protein of U 23.6 41 0.0009 20.9 0.9 20 24-43 121-140 (142)
40 PF11709 Mit_ribos_Mrp51: Mito 23.5 1.1E+02 0.0024 21.3 3.0 26 4-29 148-173 (312)
41 PHA02335 hypothetical protein 22.7 1.9E+02 0.0041 17.5 4.0 43 24-67 4-54 (118)
42 PF12122 DUF3582: Protein of u 22.7 81 0.0018 18.4 1.9 18 9-26 7-24 (101)
43 PF13981 SopA: SopA-like centr 22.4 99 0.0021 19.0 2.4 35 30-65 92-126 (135)
44 PF12335 SBF2: Myotubularin pr 21.9 1.1E+02 0.0025 20.4 2.7 24 2-26 47-70 (225)
45 PF08708 PriCT_1: Primase C te 21.8 1.4E+02 0.003 15.6 3.6 11 48-58 57-67 (71)
46 PF08361 TetR_C_2: MAATS-type 21.7 1.8E+02 0.0039 16.8 4.4 34 8-41 17-51 (121)
47 COG5562 Phage envelope protein 21.2 57 0.0012 20.4 1.1 23 2-24 75-97 (137)
48 PF14159 CAAD: CAAD domains of 21.2 1.7E+02 0.0036 16.7 3.0 29 33-61 61-89 (90)
49 COG4748 Uncharacterized conser 20.6 2.9E+02 0.0064 19.9 4.6 51 13-63 172-227 (365)
50 PF03909 BSD: BSD domain ; I 20.4 1.1E+02 0.0023 15.9 2.0 31 4-36 17-47 (62)
51 PRK09718 hypothetical protein; 20.1 1.8E+02 0.004 22.0 3.7 19 47-65 385-403 (512)
No 1
>KOG3589|consensus
Probab=99.81 E-value=8.7e-20 Score=119.33 Aligned_cols=61 Identities=48% Similarity=0.822 Sum_probs=57.4
Q ss_pred HhhChHHHHHHHHHHHhhcchhHHHHHHHHHHHhccCCH--HHHHHHHHHHHHHhcCCCCCCC
Q psy16714 8 RNTFKAGRKLFREFLRCEYSEENILFWLACEDLKKESNP--DVIEEKARFIYEDYISILSPKE 68 (68)
Q Consensus 8 ll~dp~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~~~~--~~~~~~a~~Iy~~fi~~~sp~E 68 (68)
|++|+.|+..|..||++|||+|||.||++||+||+.... ..+..+|+.||.+||+++||+|
T Consensus 84 L~~~~~G~~~F~~fLk~e~Seeni~FW~ace~~k~~~~~~~~~~~~~a~~i~~~f~~~~ap~~ 146 (221)
T KOG3589|consen 84 LLADEAGRAVFAEFLKSEFSEENLEFWLACEEFKKRRSEKSTKMSEKAKEIYEEFSAPAAPKE 146 (221)
T ss_pred HhhChhhHHHHHHHHHHHhhHhHHHHHHHHHHHHhCcCchhhhhhHHHHHHHHHHhccCCCCC
Confidence 459999999999999999999999999999999998765 5799999999999999999986
No 2
>smart00315 RGS Regulator of G protein signalling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
Probab=99.79 E-value=4.5e-19 Score=103.67 Aligned_cols=63 Identities=46% Similarity=0.720 Sum_probs=58.5
Q ss_pred HHHHhhChHHHHHHHHHHHhhcchhHHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcCCCCCCC
Q psy16714 5 KRTRNTFKAGRKLFREFLRCEYSEENILFWLACEDLKKESNPDVIEEKARFIYEDYISILSPKE 68 (68)
Q Consensus 5 ~~~ll~dp~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~~~sp~E 68 (68)
...| .||.|+..|+.||++++|.|||.||.+|++|+.......+.+.|+.||++||.++||+|
T Consensus 3 ~~il-~d~~~~~~F~~fl~~~~~~e~l~F~~~v~~~~~~~~~~~~~~~a~~I~~~yi~~~s~~~ 65 (118)
T smart00315 3 ESLL-SDPIGRLLFREFLESEFSEENLEFWLAVEEFKKAEDDEERIAKAREIYDKFLSPNAPKE 65 (118)
T ss_pred HHHh-cChHHHHHHHHHHHHhcchHhHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHhCCCCCCc
Confidence 4444 99999999999999999999999999999999998767889999999999999999975
No 3
>PF00615 RGS: Regulator of G protein signaling domain; InterPro: IPR000342 RGS (Regulator of G Protein Signalling) proteins are multi-functional, GTPase-accelerating proteins that promote GTP hydrolysis by the alpha subunit of heterotrimeric G proteins, thereby inactivating the G protein and rapidly switching off G protein-coupled receptor signalling pathways []. Upon activation by GPCRs, heterotrimeric G proteins exchange GDP for GTP, are released from the receptor, and dissociate into free, active GTP-bound alpha subunit and beta-gamma dimer, both of which activate downstream effectors. The response is terminated upon GTP hydrolysis by the alpha subunit (IPR001019 from INTERPRO), which can then bind the beta-gamma dimer (IPR001632 from INTERPRO, IPR001770 from INTERPRO) and the receptor. RGS proteins markedly reduce the lifespan of GTP-bound alpha subunits by stabilising the G protein transition state. All RGS proteins contain an 'RGS-box' (or RGS domain), which is required for activity. Some small RGS proteins such as RGS1 and RGS4 are comprised of little more than an RGS domain, while others also contain additional domains that confer further functionality []. RGS domains can be found in conjunction with a variety of domains, including: DEP for membrane targeting (IPR000591 from INTERPRO), PDZ for binding to GPCRs (IPR001478 from INTERPRO), PTB for phosphotyrosine-binding (IPR006020 from INTERPRO), RBD for Ras-binding (IPR003116 from INTERPRO), GoLoco for guanine nucleotide inhibitor activity (IPR003109 from INTERPRO), PX for phosphatidylinositol-binding (IPR001683 from INTERPRO), PXA that is associated with PX (IPR003114 from INTERPRO), PH for stimulating guanine nucleotide exchange (IPR001849 from INTERPRO), and GGL (G protein gamma subunit-like) for binding G protein beta subunits (IPR001770 from INTERPRO) []. Those RGS proteins that contain GGL domains can interact with G protein beta subunits to form novel dimers that prevent G protein gamma subunit binding and G protein alpha subunit association, thereby preventing heterotrimer formation.; GO: 0004871 signal transducer activity; PDB: 2BCJ_A 3PSC_A 3PVU_A 1YM7_B 3PVW_A 1OMW_A 2EBZ_A 2BV1_B 2GTP_D 1CMZ_A ....
Probab=99.74 E-value=2.4e-17 Score=95.44 Aligned_cols=64 Identities=36% Similarity=0.587 Sum_probs=59.1
Q ss_pred HHHHHhhChHHHHHHHHHHHhhcchhHHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcCCCCCCC
Q psy16714 4 IKRTRNTFKAGRKLFREFLRCEYSEENILFWLACEDLKKESNPDVIEEKARFIYEDYISILSPKE 68 (68)
Q Consensus 4 ~~~~ll~dp~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~~~sp~E 68 (68)
++.+| .||.|+..|..||+++++.+||.||.+|+.|+...+...+...|..||++||+++||.+
T Consensus 2 ~~~il-~~~~~~~~F~~Fl~~~~~~~~l~F~~~v~~~~~~~~~~~~~~~a~~I~~~fi~~~s~~~ 65 (118)
T PF00615_consen 2 FEKIL-EDPEGLELFKEFLEKENCEENLQFWLEVEEFKSSESEEQRKKLAQQIYNKFISPGSPNE 65 (118)
T ss_dssp HHHHH-HSHHHHHHHHHHHHHTTTTHHHHHHHHHHHHHTSCSHHHHHHHHHHHHHHHTSTTSTTC
T ss_pred HHHHH-CChHHHHHHHHHHhHCCCHHHHHHHHHHHHHHhhcchhhHHHHHHHHHHHHhccccccc
Confidence 44555 99999999999999999999999999999999997777889999999999999999975
No 4
>PF09128 RGS-like: Regulator of G protein signalling-like domain; InterPro: IPR015212 This entry represents a domain consisting of twelve helices that fold into a compact structure that contains the overall structural scaffold observed in other regulator of G protein signalling (RGS) proteins and three additional helical elements that pack closely to it. Helices 1-9 comprise the RGS fold, in which helices 4-7 form a classic antiparallel bundle adjacent to the other helices. Like other RGS structures, helices 7 and 8 span the length of the folded domain and form essentially one continuous helix with a kink in the middle. Helices 10-12 form an apparently stable C-terminal extension of the structural domain, and although other RGS proteins lack this structure, these elements are intimately associated with the rest of the structural framework by hydrophobic interactions. This domain binds to active G-alpha proteins, promoting GTP hydrolysis by the alpha subunit of heterotrimeric G proteins, thereby inactivating the G protein and rapidly switching off G protein-coupled receptor signalling pathways []. ; GO: 0005089 Rho guanyl-nucleotide exchange factor activity, 0005737 cytoplasm; PDB: 3CX6_B 3CX8_B 3CX7_B 1HTJ_F 1SHZ_C 3AB3_D 1IAP_A.
Probab=98.30 E-value=3.3e-06 Score=54.88 Aligned_cols=62 Identities=15% Similarity=0.246 Sum_probs=51.9
Q ss_pred HHHHHhhChHHHHHHHHHHHhhcchhHHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcCCCCCC
Q psy16714 4 IKRTRNTFKAGRKLFREFLRCEYSEENILFWLACEDLKKESNPDVIEEKARFIYEDYISILSPK 67 (68)
Q Consensus 4 ~~~~ll~dp~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~~~sp~ 67 (68)
++.+ .+.|.-...|.+|+-..+.-.-|+|++-.+-|+..+.+ ++.+.|.+||.+|+.++||.
T Consensus 8 l~~L-k~rPaHlaVFl~yv~sq~dP~~LLfyL~~d~yk~~~~K-e~rk~~~ei~stFL~~~ApL 69 (188)
T PF09128_consen 8 LELL-KSRPAHLAVFLNYVLSQFDPSPLLFYLIADLYKQGNAK-EMRKWAYEIHSTFLDPGAPL 69 (188)
T ss_dssp HHHH-HC-HHHHHHHHHHHHHHS-SHHHHHHHHHHHHTTS-TT-CHHHHHHHHHHHHTSTT-TT
T ss_pred HHHH-hcCcHHHHHHHHHHHhcCCcchHHHHHHHHHHhccCHH-HHHHHHHHHHHHHcCCCCCc
Confidence 4444 48999999999999999988889999999999997654 78999999999999999996
No 5
>KOG3590|consensus
Probab=97.61 E-value=0.00014 Score=52.78 Aligned_cols=39 Identities=21% Similarity=0.298 Sum_probs=34.6
Q ss_pred HHHHHhhChHHHHHHHHHHHhhcchhHHHHHHHHHHHhcc
Q psy16714 4 IKRTRNTFKAGRKLFREFLRCEYSEENILFWLACEDLKKE 43 (68)
Q Consensus 4 ~~~~ll~dp~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~ 43 (68)
+.++| .||..+.+|-+||...-.-..|.||+++|.||+.
T Consensus 115 L~~il-~D~~~LSYFiQYLD~r~~L~LIKFwl~~E~FKs~ 153 (602)
T KOG3590|consen 115 LEQVL-HDTIVLSYFIQYLDLRRMLHLIKFWLEAESFKST 153 (602)
T ss_pred HHHHh-ccchHHHHHHHHHHHHhHHHHHHHHHhhhhhhHH
Confidence 34444 9999999999999999999999999999999975
No 6
>KOG3590|consensus
Probab=97.08 E-value=0.0012 Score=48.05 Aligned_cols=58 Identities=19% Similarity=0.423 Sum_probs=47.3
Q ss_pred hhChHHHHHHHHHHHhhcchhHHHHHHHHHHHhccC-------CHHHHHHHHHHHHHHhcCCCCC
Q psy16714 9 NTFKAGRKLFREFLRCEYSEENILFWLACEDLKKES-------NPDVIEEKARFIYEDYISILSP 66 (68)
Q Consensus 9 l~dp~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~~-------~~~~~~~~a~~Iy~~fi~~~sp 66 (68)
|...-.+-+|..|++++-..+-|.||++|.+|++.- +.++...-|-.||++|++-.|.
T Consensus 322 L~~e~aLfyF~Ey~e~~~~~~~lqFwl~adnF~~~la~~E~~~d~~eAq~DAMilYdKYFSlQat 386 (602)
T KOG3590|consen 322 LFCESALFYFSEYMEKEDAVNILQFWLAADNFQSQLAAKEGQYDGQEAQNDAMILYDKYFSLQAT 386 (602)
T ss_pred HhhhHHHHHHHHHHHHHhHHHHHHHHHHhhhHHHhhhhcccccchhhhhcchhHHHHHHhhhhcc
Confidence 355557789999999999999999999999999852 2335677788999999987664
No 7
>KOG0986|consensus
Probab=97.03 E-value=0.0019 Score=47.53 Aligned_cols=57 Identities=26% Similarity=0.335 Sum_probs=46.6
Q ss_pred hhChHHHHHHHHHHHh-hcchhHHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcCCCCC
Q psy16714 9 NTFKAGRKLFREFLRC-EYSEENILFWLACEDLKKESNPDVIEEKARFIYEDYISILSP 66 (68)
Q Consensus 9 l~dp~g~~~F~~FL~~-e~s~Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~~~sp 66 (68)
..-|.|+.+|++||.. +--.--+.||.+|++|-...+ +....+++.|+..||....|
T Consensus 58 ~~QpIGrlLFr~Fc~~~~~~~~~i~Fl~~v~eYE~~~~-~~r~~~~~~i~~~~l~~~~~ 115 (591)
T KOG0986|consen 58 EKQPIGRLLFRDFCENRPELRPAIEFLEEVKEYEKADD-DERKDKARQILDAYLMKELL 115 (591)
T ss_pred ccCcHHHHHHHHHHhcCchhhhHHHHHHHHHHHhcCCc-ccchhhHHHHHHHHhhcccc
Confidence 3579999999999996 223345899999999999884 46789999999999986544
No 8
>PF15182 OTOS: Otospiralin
Probab=86.08 E-value=1.6 Score=24.03 Aligned_cols=30 Identities=20% Similarity=0.372 Sum_probs=25.5
Q ss_pred HHHHHHHHHHhccCCHHHHHHHHHHHHHHh
Q psy16714 31 ILFWLACEDLKKESNPDVIEEKARFIYEDY 60 (68)
Q Consensus 31 l~Fw~ave~~k~~~~~~~~~~~a~~Iy~~f 60 (68)
-+||-=||.|+....-.++..+|+..|.-|
T Consensus 25 sDFW~YveyFrtlGAY~~indmARtfFAh~ 54 (69)
T PF15182_consen 25 SDFWNYVEYFRTLGAYNQINDMARTFFAHF 54 (69)
T ss_pred hHHHHHHHHHHHhccHHHHHHHHHHHHhhC
Confidence 379999999999877778899999887665
No 9
>PF02184 HAT: HAT (Half-A-TPR) repeat; InterPro: IPR003107 The HAT (Half A TPR) repeat has a repetitive pattern characterised by three aromatic residues with a conserved spacing. They are structurally and sequentially similar to TPRs (tetratricopeptide repeats), though they lack the highly conserved alanine and glycine residues found in TPRs. The number of HAT repeats found in different proteins varies between 9 and 12. HAT-repeat-containing proteins appear to be components of macromolecular complexes that are required for RNA processing []. The repeats may be involved in protein-protein interactions. The HAT motif has striking structural similarities to HEAT repeats (IPR000357 from INTERPRO), being of a similar length and consisting of two short helices connected by a loop domain, as in HEAT repeats.; GO: 0006396 RNA processing, 0005622 intracellular
Probab=80.72 E-value=1.3 Score=21.05 Aligned_cols=15 Identities=33% Similarity=0.627 Sum_probs=12.2
Q ss_pred HHHHHHHHHHhcCCC
Q psy16714 50 EEKARFIYEDYISIL 64 (68)
Q Consensus 50 ~~~a~~Iy~~fi~~~ 64 (68)
...|+.||++||.-.
T Consensus 3 ~dRAR~IyeR~v~~h 17 (32)
T PF02184_consen 3 FDRARSIYERFVLVH 17 (32)
T ss_pred HHHHHHHHHHHHHhC
Confidence 368999999998753
No 10
>PF11467 LEDGF: Lens epithelium-derived growth factor (LEDGF) ; InterPro: IPR021567 LEDGF is a chromatin-associated protein that protects cells from stress-induced apoptosis. It is the binding partner of HIV-1 integrase in human cells. The integrase binding domain (IBD) of LEDGF is a compact right-handed bundle composed of five alpha-helices. The residues essential for the interaction with the integrase are present in the inter-helical loop regions of the bundle structure. ; PDB: 3F9K_K 3HPG_G 3U88_C 3HPH_H 2B4J_D 1Z9E_A.
Probab=62.00 E-value=14 Score=21.90 Aligned_cols=18 Identities=22% Similarity=0.392 Sum_probs=14.3
Q ss_pred CCHHHHHHHHHHHHHHhc
Q psy16714 44 SNPDVIEEKARFIYEDYI 61 (68)
Q Consensus 44 ~~~~~~~~~a~~Iy~~fi 61 (68)
.....+.++|..||++|-
T Consensus 59 ~gn~~Ir~KA~~lYnkfK 76 (106)
T PF11467_consen 59 KGNQQIRKKATELYNKFK 76 (106)
T ss_dssp TT-HHHHHHHHHHHHHHH
T ss_pred hccHHHHHHHHHHHHHHH
Confidence 334689999999999984
No 11
>COG4790 EscR Type III secretory pathway, component EscR [Intracellular trafficking and secretion]
Probab=60.78 E-value=13 Score=24.82 Aligned_cols=32 Identities=25% Similarity=0.382 Sum_probs=27.2
Q ss_pred hChHHHHHHHHHHHhhcchhHHHHHHHHHHHh
Q psy16714 10 TFKAGRKLFREFLRCEYSEENILFWLACEDLK 41 (68)
Q Consensus 10 ~dp~g~~~F~~FL~~e~s~Enl~Fw~ave~~k 41 (68)
-|..|+.-+|+||+|...+|+..|+....+=+
T Consensus 91 ~~~~~l~pyR~fL~k~t~~~~~~FF~~~~~~~ 122 (214)
T COG4790 91 FDDKGLSPYRDFLKKNTEEEEVSFFERSAQKK 122 (214)
T ss_pred HHHhhhhHHHHHHHHhCcHHHHHHHHHHHHHh
Confidence 45679999999999999999999999876433
No 12
>PF05528 Coronavirus_5: Coronavirus gene 5 protein; InterPro: IPR008458 Infectious bronchitis virus, a member of Coronaviridae family, has a single-stranded positive-sense RNA genome, which is 27 kb in length. Gene 5 contains two (5a and 5b) open reading frames. The function of the 5a and 5b proteins is unknown [].
Probab=58.59 E-value=6.7 Score=22.16 Aligned_cols=24 Identities=25% Similarity=0.487 Sum_probs=18.7
Q ss_pred HhhcchhHHHHHHHHHHHhccCCH
Q psy16714 23 RCEYSEENILFWLACEDLKKESNP 46 (68)
Q Consensus 23 ~~e~s~Enl~Fw~ave~~k~~~~~ 46 (68)
.+..+..||+-|.+|..+.+.+..
T Consensus 54 ~eH~~nNnllSw~av~~le~qt~~ 77 (82)
T PF05528_consen 54 EEHQSNNNLLSWQAVKQLERQTPQ 77 (82)
T ss_pred HHHhcccchHhHHHHHHHHhhCcc
Confidence 445577889999999999887643
No 13
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=52.21 E-value=71 Score=21.53 Aligned_cols=50 Identities=18% Similarity=0.371 Sum_probs=33.6
Q ss_pred hHHHHHHHHHHHhhcch---hHHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcC--CCCC
Q psy16714 12 KAGRKLFREFLRCEYSE---ENILFWLACEDLKKESNPDVIEEKARFIYEDYIS--ILSP 66 (68)
Q Consensus 12 p~g~~~F~~FL~~e~s~---Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~--~~sp 66 (68)
......|..|++..=.+ .|..||++--.|.... ...|...|.+++. |++|
T Consensus 160 ~~Ai~af~~fl~~yP~s~~a~~A~y~LG~~y~~~g~-----~~~A~~~f~~vv~~yP~s~ 214 (263)
T PRK10803 160 DDAIVAFQNFVKKYPDSTYQPNANYWLGQLNYNKGK-----KDDAAYYFASVVKNYPKSP 214 (263)
T ss_pred HHHHHHHHHHHHHCcCCcchHHHHHHHHHHHHHcCC-----HHHHHHHHHHHHHHCCCCc
Confidence 34577899999865444 5899999998887653 2345555666662 5554
No 14
>PF10774 DUF4226: Domain of unknown function (DUF4226); InterPro: IPR019710 This entry represents an uncharacterised family of proteins belonging to Mycobacteria. It was previously incorrectly annotated as Biofilm regulator BssS (also known as YliH).
Probab=47.87 E-value=16 Score=21.89 Aligned_cols=18 Identities=33% Similarity=0.488 Sum_probs=15.4
Q ss_pred hhChHHHHHHHHHHHhhc
Q psy16714 9 NTFKAGRKLFREFLRCEY 26 (68)
Q Consensus 9 l~dp~g~~~F~~FL~~e~ 26 (68)
+..|.|...|..||....
T Consensus 67 ldTPaG~r~f~~fL~~kl 84 (112)
T PF10774_consen 67 LDTPAGAREFQRFLRGKL 84 (112)
T ss_pred cCCHHHHHHHHHHHHHHH
Confidence 588999999999998543
No 15
>COG4133 CcmA ABC-type transport system involved in cytochrome c biogenesis, ATPase component [Posttranslational modification, protein turnover, chaperones]
Probab=45.82 E-value=13 Score=24.78 Aligned_cols=18 Identities=28% Similarity=0.545 Sum_probs=13.1
Q ss_pred Hhhcc-hhHHHHHHHHHHH
Q psy16714 23 RCEYS-EENILFWLACEDL 40 (68)
Q Consensus 23 ~~e~s-~Enl~Fw~ave~~ 40 (68)
+.+.+ .|||.||.++-+.
T Consensus 87 K~eLTa~ENL~F~~~~~~~ 105 (209)
T COG4133 87 KTELTALENLHFWQRFHGS 105 (209)
T ss_pred cchhhHHHHHHHHHHHhCC
Confidence 34443 4999999997765
No 16
>PF09279 EF-hand_like: Phosphoinositide-specific phospholipase C, efhand-like; InterPro: IPR015359 This domain is predominantly found in the enzyme phosphoinositol-specific phospholipase C. It adopts a structure consisting of a core of four alpha helices, in an EF like fold, and is required for functioning of the enzyme []. ; PDB: 3OHM_B 3QR0_A 2ZKM_X 2FJU_B 3QR1_D 1DJW_B 1DJI_B 1DJG_B 1QAS_B 2ISD_B ....
Probab=41.13 E-value=39 Score=18.20 Aligned_cols=18 Identities=39% Similarity=0.556 Sum_probs=11.1
Q ss_pred HHHHHHHHHHhhcchhHH
Q psy16714 14 GRKLFREFLRCEYSEENI 31 (68)
Q Consensus 14 g~~~F~~FL~~e~s~Enl 31 (68)
....|+.||.++-.+.++
T Consensus 17 t~~~f~~FL~~eQ~~~~~ 34 (83)
T PF09279_consen 17 TAEEFRRFLREEQGEPRL 34 (83)
T ss_dssp EHHHHHHHHHHTSS-TTS
T ss_pred CHHHHHHHHHHHhccccC
Confidence 356777788766666444
No 17
>PF14894 Lsm_C: Lsm C-terminal; PDB: 1M5Q_1.
Probab=40.41 E-value=28 Score=19.05 Aligned_cols=16 Identities=25% Similarity=0.526 Sum_probs=12.8
Q ss_pred HHHHHHHHHHHHhcCC
Q psy16714 48 VIEEKARFIYEDYISI 63 (68)
Q Consensus 48 ~~~~~a~~Iy~~fi~~ 63 (68)
-+-+++..||++||..
T Consensus 46 PlAerv~~ly~eyi~~ 61 (64)
T PF14894_consen 46 PLAERVYDLYNEYIEK 61 (64)
T ss_dssp HHHHHHHHHHHHHHH-
T ss_pred hHHHHHHHHHHHHHHh
Confidence 3678999999999864
No 18
>PF11163 DUF2947: Protein of unknown function (DUF2947); InterPro: IPR021334 This family of proteins with unknown function appears to be restricted to Gammaproteobacteria.
Probab=34.55 E-value=44 Score=21.28 Aligned_cols=18 Identities=28% Similarity=0.473 Sum_probs=16.4
Q ss_pred HHHHHHHHHHhcCCCCCC
Q psy16714 50 EEKARFIYEDYISILSPK 67 (68)
Q Consensus 50 ~~~a~~Iy~~fi~~~sp~ 67 (68)
...|..|++++|+..+|.
T Consensus 33 ~~~a~qvW~~~IS~~~~~ 50 (153)
T PF11163_consen 33 QARAAQVWKENISNQSPH 50 (153)
T ss_pred HHHHHHHHHHHhccCCCC
Confidence 688999999999999985
No 19
>PF09764 Nt_Gln_amidase: N-terminal glutamine amidase; InterPro: IPR019161 This entry represents the Protein N-terminal glutamine amidohydrolase (NtQ-amidase) family of proteins. They contain a region of approximately 200 residues carrying several distinctive motifs including a WDYHV motif and one of three cysteines. Protein N-terminal glutamine amidohydrolase is responsible for degradation of N-terminal glutamine [].; GO: 0016811 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in linear amides; PDB: 3C9Q_A.
Probab=31.91 E-value=35 Score=22.25 Aligned_cols=17 Identities=41% Similarity=0.786 Sum_probs=11.8
Q ss_pred hhcchhHHHHHHHHHHHhc
Q psy16714 24 CEYSEENILFWLACEDLKK 42 (68)
Q Consensus 24 ~e~s~Enl~Fw~ave~~k~ 42 (68)
.-+||||+ |.-|+.++.
T Consensus 4 scYCEENV--ykLce~~~~ 20 (184)
T PF09764_consen 4 SCYCEENV--YKLCEYLRE 20 (184)
T ss_dssp TT-HHHHH--HHHHHHHHH
T ss_pred ccccHHHH--HHHHHHHHh
Confidence 35899998 777777743
No 20
>PF02671 PAH: Paired amphipathic helix repeat; InterPro: IPR003822 This family contains the paired amphipathic helix (PAH) repeat. The family contains the eukaryotic Sin3 proteins, which have at least three PAH domains (PAH1, PAH2, and PAH3). Sin3 proteins are components of a co-repressor complex that silences transcription, playing important roles in the transition between proliferation and differentiation. Sin3 proteins are recruited to the DNA by various DNA-binding transcription factors such as the Mad family of repressors, Mnt/Rox, PLZF, MeCP2, p53, REST/NRSF, MNFbeta, Sp1, TGIF and Ume6 []. Sin3 acts as a scaffold protein that in turn recruits histone-binding proteins RbAp46/RbAp48 and histone deacetylases HDAC1/HDAC2, which deacetylate the core histones resulting in a repressed state of the chromatin []. The PAH domains are protein-protein interaction domains through which Sin3 fulfils its role as a scaffold. The PAH2 domain of Sin3 can interact with a wide range of unrelated and structurally diverse transcription factors that bind using different interaction motifs. For example, the Sin3 PAH2 domain can interact with the unrelated Mad and HBP1 factors using alternative interaction motifs that involve binding in opposite helical orientations [].; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1S5Q_B 2L9S_B 1G1E_B 1S5R_B 2CR7_A 2CZY_A 2LD7_B 2RMR_A 2RMS_A 1PD7_A ....
Probab=30.71 E-value=47 Score=16.15 Aligned_cols=19 Identities=16% Similarity=0.156 Sum_probs=12.5
Q ss_pred HHHHHhhChHHHHHHHHHH
Q psy16714 4 IKRTRNTFKAGRKLFREFL 22 (68)
Q Consensus 4 ~~~~ll~dp~g~~~F~~FL 22 (68)
+...+-.+|.....|..||
T Consensus 28 v~~Ll~~hpdLl~~F~~Fl 46 (47)
T PF02671_consen 28 VSELLRGHPDLLEEFNRFL 46 (47)
T ss_dssp HHHHTTT-HHHHHHHHHHS
T ss_pred HHHHHccCHHHHHHHHhhC
Confidence 3344456888888888875
No 21
>PF14328 DUF4385: Domain of unknown function (DUF4385)
Probab=29.16 E-value=47 Score=20.98 Aligned_cols=17 Identities=29% Similarity=0.700 Sum_probs=12.7
Q ss_pred HHHHHHHHHHHHHHhcC
Q psy16714 46 PDVIEEKARFIYEDYIS 62 (68)
Q Consensus 46 ~~~~~~~a~~Iy~~fi~ 62 (68)
+....+.|..||..|+.
T Consensus 47 p~iA~~Ss~kIy~~Fl~ 63 (145)
T PF14328_consen 47 PEIARESSEKIYEMFLD 63 (145)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 45667888888888763
No 22
>PF10312 Cactin_mid: Conserved mid region of cactin; InterPro: IPR018816 This entry represents the conserved central domain of a family of proteins referred to as cactins. It contains two of three predicted coiled-coil motifs. Most proteins containing this domain also have IPR019134 from INTERPRO at the C-terminal end. Upstream of this domain in Drosophila proteins are a serine-rich region, some non-typical RD motifs and three predicted bipartite nuclear localisation signals, none of which are well-conserved. Cactin associates with IkappaB-cactus as one of the intracellular members of the Rel (NF-kappaB) pathway which is conserved in invertebrates and vertebrates. In mammals, this pathway controls the activities of the immune and inflammatory response genes as well as viral genes, and is critical for cell growth and survival. In Drosophila, the Rel pathway functions in the innate cellular and humoral immune response, in muscle development, and in the establishment of dorsal-ventral polarity in the early embryo [].
Probab=28.38 E-value=88 Score=20.27 Aligned_cols=24 Identities=25% Similarity=0.337 Sum_probs=17.1
Q ss_pred HHHHHHHHhhcchhHHHHHHHHHH
Q psy16714 16 KLFREFLRCEYSEENILFWLACED 39 (68)
Q Consensus 16 ~~F~~FL~~e~s~Enl~Fw~ave~ 39 (68)
.-...|+.-+....|..||.++--
T Consensus 82 ~dIk~y~~Le~~~~n~~fW~~~~~ 105 (191)
T PF10312_consen 82 EDIKVYLELERDNKNREFWEAMLV 105 (191)
T ss_pred HHHHHHHHHcccccHHHHHHHHHH
Confidence 344556776777779999998753
No 23
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=28.34 E-value=2.1e+02 Score=19.92 Aligned_cols=49 Identities=27% Similarity=0.364 Sum_probs=28.9
Q ss_pred hChHHHHHHHHHHH---hhcchhHHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcCCCCC
Q psy16714 10 TFKAGRKLFREFLR---CEYSEENILFWLACEDLKKESNPDVIEEKARFIYEDYISILSP 66 (68)
Q Consensus 10 ~dp~g~~~F~~FL~---~e~s~Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~~~sp 66 (68)
.||.....+..-+. +.--.++-.||..-++ .+...+..+|++|++.|.-
T Consensus 251 ~d~e~~~~l~~~l~~~v~~d~~~~~~fW~~y~~--------~i~~~~~~~yd~yLKaN~q 302 (318)
T PF12725_consen 251 KDPEAYKELYSQLSPGVKKDLKENRAFWQKYEG--------PIEEVSDFVYDTYLKANNQ 302 (318)
T ss_pred cCHHHHHHHHHhCCHHHHHHHHHHHHHHHHhcc--------hHHHHHHHHHHHHHHhcCc
Confidence 45655555543333 1112355566655442 3567789999999998864
No 24
>PF01193 RNA_pol_L: RNA polymerase Rpb3/Rpb11 dimerisation domain; InterPro: IPR011261 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. RNA polymerase (RNAP) II, which is responsible for all mRNA synthesis in eukaryotes, consists of 12 subunits. Subunits Rpb3 and Rpb11 form a heterodimer that is functionally analogous to the archaeal RNAP D/L heterodimer, and to the prokaryotic RNAP alpha subunit (RpoA) homodimer. In each case, they play a key role in RNAP assembly by forming a platform on which the catalytic subunits (eukaryotic Rpb1/Rpb2, and prokaryotic beta/beta') can interact []. These different subunits share regions of homology required for dimerisation. In eukaryotic Rpb11 and archaeal L subunits, the dimerisation domain consists of a contiguous Rpb11-like domain, whereas in eukaryotic Rpb3, archaeal D and bacterial RpoA subunits (IPR011263 from INTERPRO), the dimerisation domain consists of the Rpb11-like domain interrupted by an insert domain. In the prokaryotic alpha subunit, this dimerisation domain is the N-terminal domain [].; GO: 0003899 DNA-directed RNA polymerase activity, 0046983 protein dimerization activity, 0006351 transcription, DNA-dependent; PDB: 1HQM_B 1YNJ_A 1YNN_A 1I6V_A 2GHO_A 3HKZ_V 2PMZ_X 2PA8_L 3GTK_C 1TWH_C ....
Probab=27.91 E-value=32 Score=18.07 Aligned_cols=9 Identities=11% Similarity=0.095 Sum_probs=6.5
Q ss_pred HHHHHHhhC
Q psy16714 3 CIKRTRNTF 11 (68)
Q Consensus 3 ~~~~~ll~d 11 (68)
++||+|+++
T Consensus 15 ~LRr~ll~~ 23 (66)
T PF01193_consen 15 ALRRILLSE 23 (66)
T ss_dssp HHHHHHHSS
T ss_pred HHHHHHHhc
Confidence 577777776
No 25
>PF10875 DUF2670: Protein of unknown function (DUF2670); InterPro: IPR022714 This bacterial family of proteins has no known function and appears to be restricted to Rickettsiaceae.
Probab=27.88 E-value=40 Score=20.89 Aligned_cols=18 Identities=22% Similarity=0.244 Sum_probs=14.3
Q ss_pred hHHHHHHhhChHHHHHHH
Q psy16714 2 QCIKRTRNTFKAGRKLFR 19 (68)
Q Consensus 2 ~~~~~~ll~dp~g~~~F~ 19 (68)
||+||++..||-|.-...
T Consensus 3 qalrrlIaaNPMg~flw~ 20 (139)
T PF10875_consen 3 QALRRLIAANPMGFFLWS 20 (139)
T ss_pred HHHHHHHhhCCchhhHHH
Confidence 799999999998865443
No 26
>KOG2005|consensus
Probab=27.37 E-value=2.1e+02 Score=22.92 Aligned_cols=49 Identities=16% Similarity=0.258 Sum_probs=28.2
Q ss_pred hHHHHHHHHHHHhhcchhHHHHHHHHHHHhccCCHHHHHHHHHHHHHHh
Q psy16714 12 KAGRKLFREFLRCEYSEENILFWLACEDLKKESNPDVIEEKARFIYEDY 60 (68)
Q Consensus 12 p~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~f 60 (68)
-.+......|+...--..-.++..+|-.|--.++...+.+-|..||-+|
T Consensus 204 ve~id~l~~~Vd~~n~~RvclYl~sc~~~lP~Pdd~~ll~~a~~IYlKf 252 (878)
T KOG2005|consen 204 VEGIDLLLDYVDEHNYQRVCLYLTSCVPLLPGPDDVALLRTALKIYLKF 252 (878)
T ss_pred hhhHhHHHHHhhhhhHHHHHHHHHHHhhcCCCchhhHHHHHHHHHHHHH
Confidence 3344444444443333333455555555555554446788999999887
No 27
>PHA02901 virus redox protein; Provisional
Probab=27.00 E-value=1.1e+02 Score=17.24 Aligned_cols=21 Identities=19% Similarity=0.385 Sum_probs=16.1
Q ss_pred HHHHHHhhChHHHHHHHHHHH
Q psy16714 3 CIKRTRNTFKAGRKLFREFLR 23 (68)
Q Consensus 3 ~~~~~ll~dp~g~~~F~~FL~ 23 (68)
.++-+|+++|.-......||.
T Consensus 34 ~ik~~L~sqP~k~~iLk~FL~ 54 (75)
T PHA02901 34 TIRAILESQPYKKKILKQFLA 54 (75)
T ss_pred HHHHHHHcCchHHHHHHHHHH
Confidence 456678888888888887776
No 28
>PF03096 Ndr: Ndr family; InterPro: IPR004142 This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted []. The precise molecular and cellular function of members of this family is still unknown, yet they are known to be involved in cellular differentiation events. The Ndr1 group was the first to be discovered. Their expression is repressed by the proto-oncogenes N-myc and c-myc, and in line with this observation, Ndr1 protein expression is down-regulated in neoplastic cells, and is reactivated when differentiation is induced by chemicals such as retinoic acid. Ndr2 and Ndr3 expression is not under the control of N-myc or c-myc. Ndr1 expression is also activated by several chemicals: tunicamycin and homocysteine induce Ndr1 in human umbilical endothelial cells; nickel induces Ndr1 in several cell types. Members of this family are found in wide variety of multicellular eukaryotes, including an Ndr1 type protein in Helianthus annuus (Common sunflower), known as Sf21. Interestingly, the highest scoring matches in the noise are all alpha/beta hydrolases (IPR000073 from INTERPRO), suggesting that this family may have an enzymatic function.; PDB: 2QMQ_A 2XMR_B 2XMQ_B 2XMS_A.
Probab=26.89 E-value=1.6e+02 Score=20.50 Aligned_cols=29 Identities=17% Similarity=0.229 Sum_probs=22.8
Q ss_pred hChHHHHHHHHHHHhhcchhHH-HHHHHHH
Q psy16714 10 TFKAGRKLFREFLRCEYSEENI-LFWLACE 38 (68)
Q Consensus 10 ~dp~g~~~F~~FL~~e~s~Enl-~Fw~ave 38 (68)
+++...+.|++.+.+.-+..|+ .||.+..
T Consensus 177 ~n~Dlv~~yr~~l~~~~Np~Nl~~f~~sy~ 206 (283)
T PF03096_consen 177 NNSDLVQTYRQHLDERINPKNLALFLNSYN 206 (283)
T ss_dssp CT-HHHHHHHHHHHT-TTHHHHHHHHHHHH
T ss_pred ccHHHHHHHHHHHhcCCCHHHHHHHHHHHh
Confidence 6788899999999999998887 7877764
No 29
>PF07631 PSD4: Protein of unknown function (DUF1592); InterPro: IPR013042 A region of similarity shared by several Rhodopirellula baltica cytochrome-like proteins that are predicted to be secreted. These proteins also contain IPR011478 from INTERPRO, IPR013036 from INTERPRO, IPR013039 from INTERPRO and IPR013043 from INTERPRO.
Probab=26.82 E-value=78 Score=19.15 Aligned_cols=53 Identities=17% Similarity=0.334 Sum_probs=29.7
Q ss_pred HHHHHhhChHHHHHHHHHHHhhcchhHHHHHHHHHHHhccCC------------HHHHHHHHHHHHHHhcCCCCC
Q psy16714 4 IKRTRNTFKAGRKLFREFLRCEYSEENILFWLACEDLKKESN------------PDVIEEKARFIYEDYISILSP 66 (68)
Q Consensus 4 ~~~~ll~dp~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~~~------------~~~~~~~a~~Iy~~fi~~~sp 66 (68)
++|.| +||..+.....|..+ |+..+.+..... ...+...+...+..-+..|.|
T Consensus 42 ~~RML-~dpr~~~~~~~F~~q---------WL~l~~~~~~~~d~~~~p~~~~~l~~~m~~E~~~f~~~vl~~n~~ 106 (128)
T PF07631_consen 42 AERML-ADPRARRFVERFFRQ---------WLDLDRLDSIVKDPEKFPEFSPDLREAMREETDEFFEHVLEENGS 106 (128)
T ss_pred HHHHH-cCccHHHHHHHHHHH---------HhCCCcccccCCChhhccccCHHHHHHHHHHHHHHHHHHHHcCCC
Confidence 34444 999888888877765 555554444321 123445555555555555543
No 30
>PF05288 Pox_A3L: Poxvirus A3L Protein; InterPro: IPR007952 This family consists of several poxvirus A3L or A2_5L proteins. The entry of vaccinia virus (VV) into the host cell results in the delivery of the double-stranded DNA genome-containing core into the cytoplasm. The core is disassembled, releasing the viral DNA in order to initiate VV cytoplasmic transcription and DNA replication. A3L protein is a part of that core []. The A2.5L gene product is an all-alpha-helical protein with a conserved Cxx(x)C motif in the N-terminal alpha-helix. It appears to be an integral component of intracellular virions [].
Probab=26.64 E-value=1.2e+02 Score=16.81 Aligned_cols=22 Identities=23% Similarity=0.288 Sum_probs=17.0
Q ss_pred HHHHHHhhChHHHHHHHHHHHh
Q psy16714 3 CIKRTRNTFKAGRKLFREFLRC 24 (68)
Q Consensus 3 ~~~~~ll~dp~g~~~F~~FL~~ 24 (68)
.++-+|+++|.-..-.++||..
T Consensus 29 ~ik~~L~sqp~k~~iLk~FL~~ 50 (70)
T PF05288_consen 29 TIKMMLESQPNKLKILKRFLSF 50 (70)
T ss_pred HHHHHHHcChhHHHHHHHHHHH
Confidence 4566788999888888888873
No 31
>PHA02683 ORF078 thioredoxin-like protein; Provisional
Probab=26.57 E-value=1.1e+02 Score=17.27 Aligned_cols=21 Identities=19% Similarity=0.332 Sum_probs=16.2
Q ss_pred HHHHHHhhChHHHHHHHHHHH
Q psy16714 3 CIKRTRNTFKAGRKLFREFLR 23 (68)
Q Consensus 3 ~~~~~ll~dp~g~~~F~~FL~ 23 (68)
.++-+|+++|.-......||.
T Consensus 34 ~ikm~L~sqP~k~~iLk~FL~ 54 (75)
T PHA02683 34 NIRMVLESQPNKLRILKEFLA 54 (75)
T ss_pred HHHHHHHcCccHHHHHHHHHH
Confidence 456678888988888887776
No 32
>PF10118 Metal_hydrol: Predicted metal-dependent hydrolase; InterPro: IPR016516 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function
Probab=26.57 E-value=2e+02 Score=19.22 Aligned_cols=63 Identities=22% Similarity=0.217 Sum_probs=43.1
Q ss_pred hHHHHHH--hhChHHHHHHHHHHHhh--cchhHHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcCC-CCC
Q psy16714 2 QCIKRTR--NTFKAGRKLFREFLRCE--YSEENILFWLACEDLKKESNPDVIEEKARFIYEDYISI-LSP 66 (68)
Q Consensus 2 ~~~~~~l--l~dp~g~~~F~~FL~~e--~s~Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~~-~sp 66 (68)
.++|.++ +.||..+..=+.|..+| |+-+--.|-...+. ..-+.+...+....++.+++.. .+|
T Consensus 45 ~svr~~~~~i~D~~L~~~i~~FIgQEA~H~r~H~~~n~~l~~--~G~~~~~~~~~~~~~~~~~l~~~~~~ 112 (253)
T PF10118_consen 45 RSVRRARPQIKDPELREEIKGFIGQEAMHSREHRKFNEALEA--QGYDVRPFLEKMEKLFLKFLEKRLSL 112 (253)
T ss_pred HHHHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--cCCCCcHHHHHHHHHHHHHHhhhcCH
Confidence 5667764 68999999999999976 56666677666632 2222334567777777777776 444
No 33
>PF05843 Suf: Suppressor of forked protein (Suf); InterPro: IPR008847 This domain consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster suppressor of forked [Su(f)] protein shares homology with the Saccharomyces cerevisiae RNA14 protein and the 77 kDa subunit of Homo sapiens cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and the GU-rich sequence is important for this regulation to occur [].; GO: 0006397 mRNA processing, 0005634 nucleus; PDB: 2L9B_B 2OND_B 2OOE_A 4E85_B 4EBA_C 4E6H_A 2UY1_B.
Probab=25.90 E-value=2.1e+02 Score=19.12 Aligned_cols=49 Identities=29% Similarity=0.317 Sum_probs=26.8
Q ss_pred hChH-HHHHHHHHHHhhcchhHHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcCC
Q psy16714 10 TFKA-GRKLFREFLRCEYSEENILFWLACEDLKKESNPDVIEEKARFIYEDYISI 63 (68)
Q Consensus 10 ~dp~-g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~~ 63 (68)
.|+. ++.-|..=|+. -..+..||+.--+|-.... + ...|+.+|++.|..
T Consensus 50 ~d~~~A~~Ife~glk~--f~~~~~~~~~Y~~~l~~~~--d-~~~aR~lfer~i~~ 99 (280)
T PF05843_consen 50 KDPKRARKIFERGLKK--FPSDPDFWLEYLDFLIKLN--D-INNARALFERAISS 99 (280)
T ss_dssp S-HHHHHHHHHHHHHH--HTT-HHHHHHHHHHHHHTT----HHHHHHHHHHHCCT
T ss_pred CCHHHHHHHHHHHHHH--CCCCHHHHHHHHHHHHHhC--c-HHHHHHHHHHHHHh
Confidence 5555 66666655543 3335566666555544332 1 35677788777765
No 34
>KOG0063|consensus
Probab=25.87 E-value=93 Score=23.65 Aligned_cols=31 Identities=10% Similarity=0.005 Sum_probs=27.6
Q ss_pred HHHHHHHHHHHhhcchhHHHHHHHHHHHhcc
Q psy16714 13 AGRKLFREFLRCEYSEENILFWLACEDLKKE 43 (68)
Q Consensus 13 ~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~ 43 (68)
.++..++-+|....++.++.+|+.++.|...
T Consensus 495 AskvikRfilhakktafvVEhdfImaTYlad 525 (592)
T KOG0063|consen 495 ASKVIKRFILHAKKTAFVVEHDFIMATYLAD 525 (592)
T ss_pred HHHHHHHHHHhccchhhhhhhHHHHHHhhcc
Confidence 4677888899999999999999999999864
No 35
>PHA03056 putative myristoylated protein; Provisional
Probab=24.85 E-value=1.2e+02 Score=19.16 Aligned_cols=30 Identities=30% Similarity=0.597 Sum_probs=19.7
Q ss_pred HHHHHhccCCHHHHHHHHHHHHHHhcCCCC
Q psy16714 36 ACEDLKKESNPDVIEEKARFIYEDYISILS 65 (68)
Q Consensus 36 ave~~k~~~~~~~~~~~a~~Iy~~fi~~~s 65 (68)
-|.+|-....+.......+.||..|.-|.|
T Consensus 70 LCKd~l~~~~p~T~~~~IK~Il~qy~IP~S 99 (165)
T PHA03056 70 LCKDHAEKSSPETQQMIIKHIYEQYLIPVS 99 (165)
T ss_pred HHHHHHHcCCchHHHHHHHHHHHHhcCChh
Confidence 355554444444445668999999988876
No 36
>KOG2348|consensus
Probab=24.23 E-value=45 Score=25.42 Aligned_cols=17 Identities=24% Similarity=0.237 Sum_probs=13.9
Q ss_pred HHHHHHHHHHhcCCCCC
Q psy16714 50 EEKARFIYEDYISILSP 66 (68)
Q Consensus 50 ~~~a~~Iy~~fi~~~sp 66 (68)
.-...+||++||+|+|.
T Consensus 375 SvfTYDIYr~Yi~PrAs 391 (667)
T KOG2348|consen 375 SVFTYDIYRNYIRPRAS 391 (667)
T ss_pred hhhehHHHHhhcCCccC
Confidence 44568999999999874
No 37
>PRK13722 lytic transglycosylase; Provisional
Probab=24.09 E-value=1.2e+02 Score=19.46 Aligned_cols=54 Identities=13% Similarity=0.067 Sum_probs=29.9
Q ss_pred HhhChHHHHHHHH-HHHhhcchhHHHHHHHHHHHhccCC-----HHHHHHHHHHHHHHhcC
Q psy16714 8 RNTFKAGRKLFRE-FLRCEYSEENILFWLACEDLKKESN-----PDVIEEKARFIYEDYIS 62 (68)
Q Consensus 8 ll~dp~g~~~F~~-FL~~e~s~Enl~Fw~ave~~k~~~~-----~~~~~~~a~~Iy~~fi~ 62 (68)
|+.||..--..-. +|...+..-. ..|.+|-.|...+. ......-|.+||..|-.
T Consensus 85 ll~DPc~NI~~Gaw~L~~~~~~~g-~~w~AvaaYNaG~~~~~~~~~~r~~Ya~kV~~~y~~ 144 (169)
T PRK13722 85 LTTDPCMNIYTGAYYLAIAFKKWG-VSWEAVGAYNAGFRKTERQNQRRLAYASEVYRIYTG 144 (169)
T ss_pred HccCHHHHHHHHHHHHHHHHHHcC-CchheeeecCCCCCccccchHHHHHHHHHHHHHHHH
Confidence 4578865433322 2222211111 36899999998642 24556667667666644
No 38
>PF03134 TB2_DP1_HVA22: TB2/DP1, HVA22 family; InterPro: IPR004345 This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein which in human is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological inherited disease. The family also includes the plant protein of known similarity to TB2/DP1, the HVA22 abscisic acid-induced protein (e.g. Q07764 from SWISSPROT), which is thought to be a regulatory protein.
Probab=23.85 E-value=56 Score=18.23 Aligned_cols=14 Identities=29% Similarity=0.617 Sum_probs=11.2
Q ss_pred HHHHHHHHHHhcCC
Q psy16714 50 EEKARFIYEDYISI 63 (68)
Q Consensus 50 ~~~a~~Iy~~fi~~ 63 (68)
.+-|..||+++|+|
T Consensus 79 ~~Ga~~iy~~~i~P 92 (94)
T PF03134_consen 79 FQGAEYIYDKFIRP 92 (94)
T ss_pred CCcHHHHHHHHccc
Confidence 46788899998876
No 39
>PF07956 DUF1690: Protein of Unknown function (DUF1690) ; InterPro: IPR012471 Family of uncharacterised fungal proteins.
Probab=23.65 E-value=41 Score=20.85 Aligned_cols=20 Identities=30% Similarity=0.416 Sum_probs=16.6
Q ss_pred hhcchhHHHHHHHHHHHhcc
Q psy16714 24 CEYSEENILFWLACEDLKKE 43 (68)
Q Consensus 24 ~e~s~Enl~Fw~ave~~k~~ 43 (68)
++...--|+.|-.|+.||+.
T Consensus 121 ~~N~~rPLnCw~EVe~FKk~ 140 (142)
T PF07956_consen 121 RENDGRPLNCWEEVEAFKKE 140 (142)
T ss_pred HHCCCCCCchHHHHHHHHHH
Confidence 55666779999999999974
No 40
>PF11709 Mit_ribos_Mrp51: Mitochondrial ribosomal protein subunit ; InterPro: IPR016712 The function of mitochondrial ribosomal small-subunit protein MRP51 is not entirely clear, but deletion of the MRP51 gene completely blocks mitochondrial gene expression [].
Probab=23.49 E-value=1.1e+02 Score=21.26 Aligned_cols=26 Identities=23% Similarity=0.349 Sum_probs=20.9
Q ss_pred HHHHHhhChHHHHHHHHHHHhhcchh
Q psy16714 4 IKRTRNTFKAGRKLFREFLRCEYSEE 29 (68)
Q Consensus 4 ~~~~ll~dp~g~~~F~~FL~~e~s~E 29 (68)
|++-|-..+..+..|++||.+.+.+.
T Consensus 148 f~~yL~kvr~~R~eF~~~L~~~~~e~ 173 (312)
T PF11709_consen 148 FERYLKKVRPLRPEFKKWLREKHPES 173 (312)
T ss_pred HHHHHHHhHHHHHHHHHHHHHhChhh
Confidence 55566677889999999999888776
No 41
>PHA02335 hypothetical protein
Probab=22.67 E-value=1.9e+02 Score=17.53 Aligned_cols=43 Identities=16% Similarity=0.224 Sum_probs=28.0
Q ss_pred hhcchhHHHHHHHHHHHhcc--CCHH------HHHHHHHHHHHHhcCCCCCC
Q psy16714 24 CEYSEENILFWLACEDLKKE--SNPD------VIEEKARFIYEDYISILSPK 67 (68)
Q Consensus 24 ~e~s~Enl~Fw~ave~~k~~--~~~~------~~~~~a~~Iy~~fi~~~sp~ 67 (68)
+|..++|+.++ |+..|..- .+.+ ++.+...+.+.+|+..++|+
T Consensus 4 ~el~~~n~m~f-Ai~~Y~np~sVt~ddf~~DlkRi~yIkrllKRy~~~~~~k 54 (118)
T PHA02335 4 DELNEGNYMLF-AIKNYNNPQSVTYDDFEEDLKRFKYIKRLFKRYLNTGELK 54 (118)
T ss_pred ccCCchhHHHH-HHHhcCCcccccHHHHHHHHHHHHHHHHHHHhhcCCCChh
Confidence 45667777664 67777632 1121 34567778899999999884
No 42
>PF12122 DUF3582: Protein of unknown function (DUF3582); InterPro: IPR022732 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the N-terminal domain of membrane-bound serine endopeptidases belonging to MEROPS peptidase family S54 (rhomboid-1, clan ST). This domain contains a conserved ASW sequence motif and a single completely conserved residue F that may be functionally important. The tertiary structure of the GlpG protein from Escherichia coli has been determined []. The GlpG protein has six transmembrane domains (other members of the family are predicted to have seven), with the N- and C-terminal ends anchored in the cytoplasm. One transmembrane domain is shorter than the rest, creating an internal, aqueous cavity just below the membrane surface and it is here were proteolysis occurs. There is also a membrane-embedded loop between the first and second transmembrane domains which is postulated to act as a gate controlling substrate access to the active site. No other family of serine peptidases is known to have active site residues within transmembrane domains (although transmembrane active sites are known for aspartic peptidase and metallopeptidases), and the GlpG protein has the type structure for clan ST.; GO: 0004252 serine-type endopeptidase activity, 0016021 integral to membrane; PDB: 3UBB_A 3B45_A 3B44_A 2NRF_A 3TXT_A 2O7L_A 2XTU_A 2IRV_A 2XOW_A 2XTV_A ....
Probab=22.67 E-value=81 Score=18.40 Aligned_cols=18 Identities=11% Similarity=0.132 Sum_probs=15.4
Q ss_pred hhChHHHHHHHHHHHhhc
Q psy16714 9 NTFKAGRKLFREFLRCEY 26 (68)
Q Consensus 9 l~dp~g~~~F~~FL~~e~ 26 (68)
++||.....|.+||....
T Consensus 7 ~~n~r~AqaF~DYl~sqg 24 (101)
T PF12122_consen 7 LNNPRAAQAFIDYLASQG 24 (101)
T ss_dssp ESSHHHHHHHHHHHHHTT
T ss_pred cCCHHHHHHHHHHHHHCC
Confidence 478999999999999765
No 43
>PF13981 SopA: SopA-like central domain; PDB: 3NB2_B 3NAW_B 3SQV_B 2QZA_B 3SY2_B 2QYU_A.
Probab=22.36 E-value=99 Score=19.01 Aligned_cols=35 Identities=26% Similarity=0.379 Sum_probs=21.6
Q ss_pred HHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcCCCC
Q psy16714 30 NILFWLACEDLKKESNPDVIEEKARFIYEDYISILS 65 (68)
Q Consensus 30 nl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~~~s 65 (68)
|=.|-+-+..=....+ ..+.+.|..+|++|++...
T Consensus 92 N~~FIQ~i~~~~~~~~-~~~k~~A~~LY~~YL~~~~ 126 (135)
T PF13981_consen 92 NGAFIQLIAQAMTHGD-DEIKQKARDLYKKYLQLPR 126 (135)
T ss_dssp HHHHHHHHHHHCC-TS-CCCHHHHHHHHHHHCCSTT
T ss_pred ccHHHHHHHHHHHhcc-HHHHHHHHHHHHHHhcChh
Confidence 4444444443333332 3568999999999998653
No 44
>PF12335 SBF2: Myotubularin protein ; InterPro: IPR022096 This domain family is found in eukaryotes, and is approximately 220 amino acids in length. The family is found in association with PF02141 from PFAM, PF03456 from PFAM, PF03455 from PFAM. This family is the middle region of SBF2, a member of the myotubularin family. Myotubularin-related proteins have been suggested to work in phosphoinositide-mediated signalling events that may also convey control of myelination. Mutations of SBF2 are implicated in Charcot-Marie-Tooth disease.
Probab=21.94 E-value=1.1e+02 Score=20.40 Aligned_cols=24 Identities=17% Similarity=0.177 Sum_probs=19.5
Q ss_pred hHHHHHHhhChHHHHHHHHHHHhhc
Q psy16714 2 QCIKRTRNTFKAGRKLFREFLRCEY 26 (68)
Q Consensus 2 ~~~~~~ll~dp~g~~~F~~FL~~e~ 26 (68)
.+|.++| ....||..|...|.+..
T Consensus 47 ~av~~~l-k~~~aR~~~~~~L~~~~ 70 (225)
T PF12335_consen 47 PAVLRAL-KSRSARQAFCRELSKHV 70 (225)
T ss_pred HHHHHHH-ccchHHHHHHHHHHHHH
Confidence 5677777 88889999999998543
No 45
>PF08708 PriCT_1: Primase C terminal 1 (PriCT-1); InterPro: IPR014820 This alpha helical domain is found at the C-terminal of primases.
Probab=21.84 E-value=1.4e+02 Score=15.60 Aligned_cols=11 Identities=18% Similarity=0.172 Sum_probs=5.3
Q ss_pred HHHHHHHHHHH
Q psy16714 48 VIEEKARFIYE 58 (68)
Q Consensus 48 ~~~~~a~~Iy~ 58 (68)
++...++.|++
T Consensus 57 Ev~~i~kSi~k 67 (71)
T PF08708_consen 57 EVKAIAKSIAK 67 (71)
T ss_pred HHHHHHHHHHH
Confidence 44455555443
No 46
>PF08361 TetR_C_2: MAATS-type transcriptional repressor, C-terminal region; InterPro: IPR013572 This entry is named after the various transcriptional regulatory proteins that it contains, including MtrR (Q6RV06 from SWISSPROT), AcrR (P34000 from SWISSPROT), ArpR (Q9KJC4 from SWISSPROT), TtgR (Q9AIU0 from SWISSPROT) and SmeT (Q8KLP4 from SWISSPROT). These are members of the TetR (tetracycline resistance) family of transcriptional repressors, that are involved in the control of expression of multidrug resistance proteins [, , ]. ; GO: 0003677 DNA binding; PDB: 3BCG_B 2QOP_A 2UXP_B 2XDN_C 2UXH_A 2UXI_B 2UXO_A 2UXU_B 2WUI_A 2W53_A ....
Probab=21.69 E-value=1.8e+02 Score=16.84 Aligned_cols=34 Identities=21% Similarity=0.327 Sum_probs=25.5
Q ss_pred HhhChHHHHHHHHHH-HhhcchhHHHHHHHHHHHh
Q psy16714 8 RNTFKAGRKLFREFL-RCEYSEENILFWLACEDLK 41 (68)
Q Consensus 8 ll~dp~g~~~F~~FL-~~e~s~Enl~Fw~ave~~k 41 (68)
+.+||..+..|.-.+ +-|+.+|+--.........
T Consensus 17 l~~d~~~Rrv~~I~~~kcE~~~e~~~~~~r~~~~~ 51 (121)
T PF08361_consen 17 LAEDERQRRVFEILFHKCEYVEEMAPVRERRREAQ 51 (121)
T ss_dssp HHHSHHHHHHHHHHHHS--SSTTTHHHHHHHHHHH
T ss_pred HhhCHHHHHHHHHHHHhcccCCcccHHHHHHHHHH
Confidence 349999999999855 5899999998887766544
No 47
>COG5562 Phage envelope protein [General function prediction only]
Probab=21.18 E-value=57 Score=20.42 Aligned_cols=23 Identities=26% Similarity=0.439 Sum_probs=18.9
Q ss_pred hHHHHHHhhChHHHHHHHHHHHh
Q psy16714 2 QCIKRTRNTFKAGRKLFREFLRC 24 (68)
Q Consensus 2 ~~~~~~ll~dp~g~~~F~~FL~~ 24 (68)
.+++-++..+..|...|+.|+..
T Consensus 75 ~~i~~al~~~qsGqttF~ef~~~ 97 (137)
T COG5562 75 TLIKTALRRHQSGQTTFEEFCSA 97 (137)
T ss_pred HHHHHHHHHHhcCCccHHHHHHH
Confidence 35677888999999999999874
No 48
>PF14159 CAAD: CAAD domains of cyanobacterial aminoacyl-tRNA synthetase
Probab=21.15 E-value=1.7e+02 Score=16.69 Aligned_cols=29 Identities=14% Similarity=0.213 Sum_probs=21.3
Q ss_pred HHHHHHHHhccCCHHHHHHHHHHHHHHhc
Q psy16714 33 FWLACEDLKKESNPDVIEEKARFIYEDYI 61 (68)
Q Consensus 33 Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi 61 (68)
=|....++-..+++++..++...+.++++
T Consensus 61 ~wF~~ryLL~~~~R~el~~~i~~~k~~i~ 89 (90)
T PF14159_consen 61 GWFVYRYLLFAENRQELLQKIQSLKKEIL 89 (90)
T ss_pred hHHHHHHHcChHhHHHHHHHHHHHHHHhc
Confidence 37777777777777778888888777654
No 49
>COG4748 Uncharacterized conserved protein [Function unknown]
Probab=20.65 E-value=2.9e+02 Score=19.89 Aligned_cols=51 Identities=20% Similarity=0.225 Sum_probs=35.5
Q ss_pred HHHHHHHHHHHhhc---chhHHHHHHHHH--HHhccCCHHHHHHHHHHHHHHhcCC
Q psy16714 13 AGRKLFREFLRCEY---SEENILFWLACE--DLKKESNPDVIEEKARFIYEDYISI 63 (68)
Q Consensus 13 ~g~~~F~~FL~~e~---s~Enl~Fw~ave--~~k~~~~~~~~~~~a~~Iy~~fi~~ 63 (68)
.....+..||.+|. |.|-+.|..+-- +++.......-....+.++..||..
T Consensus 172 kYt~eik~~la~e~~~Ps~Efvkffaa~v~~g~~t~~~~~kFt~ivk~~~~qFi~D 227 (365)
T COG4748 172 KYTKEIKAYLAKETNNPSVEFVKFFAAKVYTGFKTTSVDEKFTDIVKNAFSQFIND 227 (365)
T ss_pred hHHHHHHHHHHHhccCCCHHHHHHHHHHHhccceeeeHHHHHHHHHHHHHHHHHHH
Confidence 45667788888666 678888888766 5555544445566777888888754
No 50
>PF03909 BSD: BSD domain ; InterPro: IPR005607 The BSD domain is an about 60-residue long domain named after the BTF2-like transcription factors, Synapse-associated proteins and DOS2-like proteins in which it is found. Additionally, it is also found in several hypothetical proteins. The BSD domain occurs in one or two copies in a variety of species ranging from primal protozoan to human. It can be found associated with other domains such as the BTB domain (see PDOC50097 from PROSITEDOC) or the U-box in multidomain proteins. The function of the BSD domain is yet unknown []. Secondary structure prediction indicates the presence of three predicted alpha helices, which probably form a three-helical bundle in small domains. The third predicted helix contains neighbouring phenylalanine and tryptophan residues - less common amino acids that are invariant in all the BSD domains identified and that are the most striking sequence features of the domain []. Some proteins known to contain one or two BSD domains are listed below: Mammalian TFIIH basal transcription factor complex p62 subunit (GTF2H1). Yeast RNA polymerase II transcription factor B 73 kDa subunit (TFB1), the homologue of BTF2. Yeast DOS2 protein. It is involved in single-copy DNA replication and ubiquitination. Drosophila synapse-associated protein SAP47. Mammalian SYAP1. Various Arabidopsis thaliana (Mouse-ear cress) hypothetical proteins.; PDB: 1X3A_A 1PFJ_A 2RNR_B 2DII_A.
Probab=20.35 E-value=1.1e+02 Score=15.89 Aligned_cols=31 Identities=35% Similarity=0.510 Sum_probs=20.8
Q ss_pred HHHHHhhChHHHHHHHHHHHhhcchhHHHHHHH
Q psy16714 4 IKRTRNTFKAGRKLFREFLRCEYSEENILFWLA 36 (68)
Q Consensus 4 ~~~~ll~dp~g~~~F~~FL~~e~s~Enl~Fw~a 36 (68)
...+|..||..+..+...+-...+++ .||..
T Consensus 17 i~~lL~~~p~l~~~~~~lVP~~~~e~--~FW~r 47 (62)
T PF03909_consen 17 IKKLLEEDPNLRKLYNELVPSKMSEE--EFWKR 47 (62)
T ss_dssp HHHHHHH-HHHHHHHHHCCTTTS-HH--HHHHH
T ss_pred HHHHHHhCHHHHHHHHHhCCCCCCHH--HHHHH
Confidence 34567799999999988885555544 67864
No 51
>PRK09718 hypothetical protein; Validated
Probab=20.08 E-value=1.8e+02 Score=22.00 Aligned_cols=19 Identities=26% Similarity=0.501 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHhcCCCC
Q psy16714 47 DVIEEKARFIYEDYISILS 65 (68)
Q Consensus 47 ~~~~~~a~~Iy~~fi~~~s 65 (68)
..+.++|..+|++||+..+
T Consensus 385 ~~~k~~a~~ly~~~l~~~~ 403 (512)
T PRK09718 385 DTIKANAWHLYKEWIRSDD 403 (512)
T ss_pred HHHHHHHHHHHHHHhcCcc
Confidence 5789999999999998764
Done!