Query 015516
Match_columns 405
No_of_seqs 79 out of 81
Neff 2.9
Searched_HMMs 46136
Date Fri Mar 29 07:02:43 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/015516.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/015516hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF12937 F-box-like: F-box-lik 98.9 1.8E-09 3.9E-14 77.0 4.2 44 94-137 2-45 (47)
2 PF00646 F-box: F-box domain; 98.7 9.8E-09 2.1E-13 72.5 2.2 47 91-137 1-47 (48)
3 smart00256 FBOX A Receptor for 98.5 1.4E-07 3E-12 63.5 3.9 40 96-135 1-40 (41)
4 KOG2997 F-box protein FBX9 [Ge 95.4 0.013 2.7E-07 59.8 3.2 52 92-143 106-162 (366)
5 KOG2120 SCF ubiquitin ligase, 95.2 0.019 4E-07 58.9 3.9 50 87-136 92-141 (419)
6 PF06881 Elongin_A: RNA polyme 94.8 0.036 7.8E-07 46.7 3.9 60 93-154 4-66 (109)
7 PF13013 F-box-like_2: F-box-l 93.0 0.12 2.7E-06 44.8 4.0 44 91-136 20-63 (109)
8 PLN03215 ascorbic acid mannose 92.8 0.098 2.1E-06 53.6 3.5 39 91-129 2-41 (373)
9 KOG3926 F-box proteins [Amino 84.1 1.4 2.9E-05 44.8 4.3 50 89-138 198-248 (332)
10 KOG0281 Beta-TrCP (transducin 84.1 1.3 2.8E-05 46.5 4.3 68 93-160 75-149 (499)
11 KOG0274 Cdc4 and related F-box 82.9 0.64 1.4E-05 49.5 1.6 54 91-144 106-159 (537)
12 KOG1708 Mitochondrial/chloropl 80.9 2 4.4E-05 41.9 4.1 82 268-386 47-129 (236)
13 PF04970 LRAT: Lecithin retino 74.0 1.8 3.8E-05 36.7 1.4 16 296-311 3-18 (125)
14 smart00743 Agenet Tudor-like d 73.6 3.8 8.2E-05 30.6 3.0 23 299-325 2-24 (61)
15 PF00018 SH3_1: SH3 domain; I 73.3 2.8 6.1E-05 30.0 2.1 24 299-326 15-38 (48)
16 PF14250 AbrB-like: AbrB-like 73.0 2.3 5E-05 35.2 1.7 21 292-312 45-65 (71)
17 PF07653 SH3_2: Variant SH3 do 70.7 3 6.5E-05 30.7 1.8 24 299-325 17-40 (55)
18 KOG4408 Putative Mg2+ and Co2+ 66.2 1.4 3.1E-05 45.6 -1.0 52 90-141 5-56 (386)
19 PF00855 PWWP: PWWP domain; I 63.3 10 0.00022 29.5 3.6 52 301-367 2-53 (86)
20 COG1064 AdhP Zn-dependent alco 59.7 5.8 0.00013 40.5 2.0 47 283-340 57-110 (339)
21 cd05829 Sortase_E Sortase E (S 56.7 5.4 0.00012 35.2 1.1 23 295-317 68-90 (144)
22 PF12969 DUF3857: Domain of Un 51.4 13 0.00028 32.0 2.5 23 295-317 85-107 (177)
23 PF07886 BA14K: BA14K-like pro 49.9 6.1 0.00013 27.9 0.3 23 242-267 6-28 (31)
24 PF05641 Agenet: Agenet domain 44.7 34 0.00073 26.7 3.7 44 300-356 1-44 (68)
25 PF09372 PRANC: PRANC domain; 41.9 24 0.00052 28.9 2.6 27 90-116 69-95 (97)
26 PF03489 SapB_2: Saposin-like 41.3 27 0.00059 23.7 2.4 23 98-120 13-35 (35)
27 cd05835 Dnmt3b_related The PWW 40.9 42 0.00091 27.5 3.9 40 301-350 2-41 (87)
28 PF00970 FAD_binding_6: Oxidor 40.4 9.4 0.0002 30.2 0.0 18 292-309 74-91 (99)
29 TIGR03784 marine_sortase sorta 39.3 14 0.00029 34.3 0.9 22 295-316 104-125 (174)
30 PF14604 SH3_9: Variant SH3 do 39.3 25 0.00055 25.8 2.2 23 299-325 14-36 (49)
31 smart00326 SH3 Src homology 3 39.1 34 0.00074 23.3 2.7 23 299-325 20-42 (58)
32 smart00306 HintN Hint (Hedgeho 38.2 16 0.00035 28.3 1.0 25 285-309 68-99 (100)
33 cd04459 Rho_CSD Rho_CSD: Rho p 37.7 35 0.00075 27.6 2.9 36 279-314 15-55 (68)
34 PF07497 Rho_RNA_bind: Rho ter 36.9 33 0.00072 28.6 2.7 38 278-315 16-58 (78)
35 COG1018 Hmp Flavodoxin reducta 33.3 25 0.00055 34.3 1.7 28 292-320 79-107 (266)
36 PF11270 DUF3070: Protein of u 32.4 30 0.00065 24.9 1.5 24 282-305 2-26 (32)
37 cd05162 PWWP The PWWP domain, 32.2 68 0.0015 25.6 3.8 29 300-333 1-29 (87)
38 cd00174 SH3 Src homology 3 dom 30.7 57 0.0012 22.1 2.7 23 299-325 17-39 (54)
39 TIGR03635 S17_bact 30S ribosom 27.5 48 0.001 27.0 2.1 13 297-309 46-58 (71)
40 cd03470 Rieske_cytochrome_bc1 27.0 43 0.00092 29.4 1.9 19 297-315 6-24 (126)
41 COG1545 Predicted nucleic-acid 26.6 1.3E+02 0.0028 27.0 4.9 96 219-322 35-132 (140)
42 PF11604 CusF_Ec: Copper bindi 24.0 59 0.0013 25.8 2.0 30 286-315 29-58 (70)
43 PF04225 OapA: Opacity-associa 23.5 41 0.0009 27.7 1.1 16 295-310 38-53 (85)
44 cd04453 S1_RNase_E S1_RNase_E: 23.0 40 0.00086 27.6 0.9 15 298-312 55-69 (88)
45 PF13550 Phage-tail_3: Putativ 22.8 56 0.0012 27.7 1.8 14 298-311 138-151 (164)
46 TIGR02988 YaaA_near_RecF S4 do 21.9 54 0.0012 24.6 1.4 10 299-308 49-58 (59)
47 cd06166 Sortase_D_5 Sortase D 21.8 36 0.00078 29.1 0.4 18 294-311 59-76 (126)
48 cd00081 Hint Hedgehog/Intein d 21.7 45 0.00098 27.4 1.0 25 285-309 68-99 (136)
49 PRK06944 sulfur carrier protei 21.5 56 0.0012 24.5 1.4 9 300-308 50-58 (65)
50 PF13621 Cupin_8: Cupin-like d 21.4 49 0.0011 29.5 1.2 21 299-327 212-232 (251)
51 PF03658 Ub-RnfH: RnfH family 21.4 47 0.001 28.0 1.1 10 299-308 61-70 (84)
52 PRK05863 sulfur carrier protei 20.9 58 0.0013 25.2 1.4 9 300-308 50-58 (65)
53 KOG0162 Myosin class I heavy c 20.9 1.2E+02 0.0025 35.3 4.2 27 299-329 1069-1095(1106)
54 PRK05886 yajC preprotein trans 20.6 69 0.0015 28.2 1.9 48 296-366 35-82 (109)
55 PF06945 DUF1289: Protein of u 20.5 60 0.0013 24.6 1.4 16 386-401 21-36 (51)
No 1
>PF12937 F-box-like: F-box-like; PDB: 1P22_A 2OVP_B 2OVR_B 2OVQ_B 1FS1_A 1FS2_C 1FQV_I 1LDK_E 2AST_B 2ASS_B.
Probab=98.89 E-value=1.8e-09 Score=77.03 Aligned_cols=44 Identities=27% Similarity=0.506 Sum_probs=40.0
Q ss_pred CCCcHHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHHhh
Q 015516 94 LDLPELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEKHM 137 (405)
Q Consensus 94 LDLPeelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWeklV 137 (405)
.+||+|++..|+++|+|.||++++.||+.+|+.+.++.||.++.
T Consensus 2 ~~LP~Eil~~If~~L~~~dl~~~~~vcr~w~~~~~~~~lW~~~~ 45 (47)
T PF12937_consen 2 SSLPDEILLEIFSYLDPRDLLRLSLVCRRWRRIANDNSLWRRLC 45 (47)
T ss_dssp CCS-HHHHHHHHTTS-HHHHHHHTTSSHHHHHHHTCCCHHHHHC
T ss_pred hHhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHCChhhhhhhc
Confidence 58999999999999999999999999999999999999999864
No 2
>PF00646 F-box: F-box domain; InterPro: IPR001810 The F-box domain was first described as a sequence motif found in cyclin-F that interacts with the protein SKP1 [, ]. This relatively conserved structural motif is present in numerous proteins and serves as a link between a target protein and a ubiquitin-conjugating enzyme. The SCF complex (e.g., Skp1-Cullin-F-box) plays a similar role as an E3 ligase in the ubiquitin protein degradation pathway [, ]. Different F-box proteins as a part of SCF complex recruit particular substrates for ubiquitination through specific protein-protein interaction domains. Many mammalian F-box domains contain leucine-rich or WD-40 repeats (IPR001680 from INTERPRO). However, several F-box proteins either have other previously described domains such as Sec7 domain found in FBS protein or do not contain defined protein-protein interaction domains or motifs.; GO: 0005515 protein binding; PDB: 2E32_A 2E31_A 3V7D_B 1NEX_B 3MKS_D 3L2O_B.
Probab=98.67 E-value=9.8e-09 Score=72.47 Aligned_cols=47 Identities=32% Similarity=0.526 Sum_probs=40.9
Q ss_pred CCCCCCcHHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHHhh
Q 015516 91 MSVLDLPELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEKHM 137 (405)
Q Consensus 91 ~sLLDLPeelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWeklV 137 (405)
+++.+||++++..|+++|++.|+++++.||+.+|+.+.++.+|.+..
T Consensus 1 ~~~~~LP~~il~~Il~~l~~~~~~~l~~vsk~~~~~~~~~~~~~~~~ 47 (48)
T PF00646_consen 1 FPLSDLPDEILQEILSYLDPKDLLRLSLVSKRWRSLVDSPRLWKKII 47 (48)
T ss_dssp -HHHHS-HHHHHHHHHTS-HHHHHHHCTT-HHHHHHHTTHHHHHHHH
T ss_pred CCHHHCCHHHHHHHHHHCcHHHHHHHHHHhhHHHHHHcCCCccHHHh
Confidence 36789999999999999999999999999999999999999998864
No 3
>smart00256 FBOX A Receptor for Ubiquitination Targets.
Probab=98.49 E-value=1.4e-07 Score=63.54 Aligned_cols=40 Identities=40% Similarity=0.620 Sum_probs=38.7
Q ss_pred CcHHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHH
Q 015516 96 LPELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEK 135 (405)
Q Consensus 96 LPeelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWek 135 (405)
||++++..|+++|++.|++++++||+.+|..+.++.+|.+
T Consensus 1 lP~~ll~~I~~~l~~~d~~~~~~vc~~~~~~~~~~~~~~~ 40 (41)
T smart00256 1 LPDEILEEILSKLPPKDLLRLRKVSRRWRSLIDSHDFWFK 40 (41)
T ss_pred CCHHHHHHHHHcCCHHHHHHHHHHHHHHHHHhcChhhhhc
Confidence 7999999999999999999999999999999999999975
No 4
>KOG2997 consensus F-box protein FBX9 [General function prediction only]
Probab=95.36 E-value=0.013 Score=59.77 Aligned_cols=52 Identities=25% Similarity=0.406 Sum_probs=44.4
Q ss_pred CCCCCcHHHHHHHHccCC-----hhhHHHHHhhcHHHHHhccChhHHHHhhhhhhcc
Q 015516 92 SVLDLPELVLECILEKLP-----PSGLCNMAAVCSSLRDRCMSDHLWEKHMKQKWGR 143 (405)
Q Consensus 92 sLLDLPeelLd~ILs~LS-----P~DLcrmAaVCrsLRerc~SD~LWeklV~~kwp~ 143 (405)
.+..||+|+|=.|+...= -.+|-++|+||+.|+..|+.|.||...--.-|++
T Consensus 106 ~~~~LPdEvLm~I~~~vv~~~~d~rsL~~~s~vCr~F~~~~R~~~lwR~aC~KvW~~ 162 (366)
T KOG2997|consen 106 SISVLPDEVLMRIFRWVVSSLLDLRSLEQLSLVCRGFYKCARDPELWRLACLKVWQR 162 (366)
T ss_pred hhhhCCHHHHHHHHHHHHhhhcchhhHHHhHhhHHHHHHHHcChHHHHHHHHHHHHH
Confidence 368999999999887654 5999999999999999999999999876655553
No 5
>KOG2120 consensus SCF ubiquitin ligase, Skp2 component [Posttranslational modification, protein turnover, chaperones]
Probab=95.21 E-value=0.019 Score=58.87 Aligned_cols=50 Identities=22% Similarity=0.361 Sum_probs=45.5
Q ss_pred CCCCCCCCCCcHHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHHh
Q 015516 87 GGAEMSVLDLPELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEKH 136 (405)
Q Consensus 87 ~~~~~sLLDLPeelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWekl 136 (405)
--.+.+.-.||+|++.+|.+.|.-.||.+|+.||+.+..-+....+|..+
T Consensus 92 ~npgv~~~slpDEill~IFs~L~kk~LL~~~~VC~Rfyr~~~de~lW~~l 141 (419)
T KOG2120|consen 92 NNPGVSWDSLPDEILLGIFSCLCKKELLKVSGVCKRFYRLASDESLWQTL 141 (419)
T ss_pred cCCCCCcccCCHHHHHHHHHhccHHHHHHHHHHHHHHhhccccccceeee
Confidence 34556788999999999999999999999999999999999999999764
No 6
>PF06881 Elongin_A: RNA polymerase II transcription factor SIII (Elongin) subunit A; InterPro: IPR010684 This family represents a conserved region within RNA polymerase II transcription factor SIII (Elongin) subunit A. In mammals, the Elongin complex activates elongation by RNA polymerase II by suppressing transient pausing of the polymerase at many sites within transcription units. Elongin is a heterotrimer composed of A, B, and C subunits of 110, 18, and 15 kilodaltons, respectively. Subunit A has been shown to function as the transcriptionally active component of Elongin [].; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus, 0016021 integral to membrane
Probab=94.80 E-value=0.036 Score=46.73 Aligned_cols=60 Identities=30% Similarity=0.496 Sum_probs=50.9
Q ss_pred CCCCcHHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHHhhhhhhccc---cCccccchhh
Q 015516 93 VLDLPELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEKHMKQKWGRI---VGPAAYREWQ 154 (405)
Q Consensus 93 LLDLPeelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWeklV~~kwp~v---Lgpaa~~eWq 154 (405)
+.++|-.+|.-||...+|..|.++-.-|..|... +|.||.+|+++..|.- -.|....+|.
T Consensus 4 vG~~py~ll~piL~~~~~~QL~~iE~~np~l~~~--tdeLW~~~i~rdFp~~~~~~~~~~~~~Wr 66 (109)
T PF06881_consen 4 VGDVPYHLLRPILEKCSPEQLRRIEDNNPHLIED--TDELWKKLIKRDFPEESKRQKPKEPESWR 66 (109)
T ss_pred cCCCCHHHHHHHHccCCHHHHHHHHHhCCCcchh--hHHHHHHHHHhHCcChhhcccccccchHH
Confidence 6789999999999999999999999999888776 7999999999999952 2444445773
No 7
>PF13013 F-box-like_2: F-box-like domain
Probab=93.02 E-value=0.12 Score=44.77 Aligned_cols=44 Identities=27% Similarity=0.493 Sum_probs=37.7
Q ss_pred CCCCCCcHHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHHh
Q 015516 91 MSVLDLPELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEKH 136 (405)
Q Consensus 91 ~sLLDLPeelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWekl 136 (405)
..++|||+|++..|+.+-.+.++..+..+|+.+|.+. +++|..+
T Consensus 20 ltl~DLP~ELl~~I~~~C~~~~l~~l~~~~~~~r~~r--~~~~~~L 63 (109)
T PF13013_consen 20 LTLLDLPWELLQLIFDYCNDPILLALSRTCRAYRSWR--DHIWYLL 63 (109)
T ss_pred cchhhChHHHHHHHHhhcCcHHHHHHHHHHHHHHHHH--HHHHHHh
Confidence 4699999999999999999999999999999998765 3455544
No 8
>PLN03215 ascorbic acid mannose pathway regulator 1; Provisional
Probab=92.76 E-value=0.098 Score=53.60 Aligned_cols=39 Identities=26% Similarity=0.366 Sum_probs=35.6
Q ss_pred CCCCCCcHHHHHHHHccC-ChhhHHHHHhhcHHHHHhccC
Q 015516 91 MSVLDLPELVLECILEKL-PPSGLCNMAAVCSSLRDRCMS 129 (405)
Q Consensus 91 ~sLLDLPeelLd~ILs~L-SP~DLcrmAaVCrsLRerc~S 129 (405)
++.-+||+++|+.|..+| +..|+.++++||++-|.+...
T Consensus 2 ~~Ws~Lp~dll~~i~~~l~~~~d~~~~~~vC~sWr~a~~~ 41 (373)
T PLN03215 2 ADWSTLPEELLHMIAGRLFSNVELKRFRSICRSWRSSVSG 41 (373)
T ss_pred CChhhCCHHHHHHHHhhCCcHHHHHHHHhhhhhHHHhccc
Confidence 356799999999999999 889999999999999998775
No 9
>KOG3926 consensus F-box proteins [Amino acid transport and metabolism]
Probab=84.11 E-value=1.4 Score=44.84 Aligned_cols=50 Identities=24% Similarity=0.357 Sum_probs=45.6
Q ss_pred CCCCCCCCcHHHHHHHHccCCh-hhHHHHHhhcHHHHHhccChhHHHHhhh
Q 015516 89 AEMSVLDLPELVLECILEKLPP-SGLCNMAAVCSSLRDRCMSDHLWEKHMK 138 (405)
Q Consensus 89 ~~~sLLDLPeelLd~ILs~LSP-~DLcrmAaVCrsLRerc~SD~LWeklV~ 138 (405)
.+.-|.|||++++-.||-+||- .||..+|+|=.+|.+-...+.+|.++-+
T Consensus 198 ~~ltl~dLP~e~vl~Il~rlsDh~dL~s~aqa~etl~~l~~e~~iWkkLcq 248 (332)
T KOG3926|consen 198 AGLTLHDLPLECVLNILLRLSDHRDLESLAQAWETLAKLSEERRIWKKLCQ 248 (332)
T ss_pred CCCCcccchHHHHHHHHHHccCcchHHHHHHhhHHHHHHHHHHHHHHHHHH
Confidence 4577999999999999999984 8999999999999999999999998754
No 10
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=84.08 E-value=1.3 Score=46.46 Aligned_cols=68 Identities=25% Similarity=0.447 Sum_probs=58.2
Q ss_pred CCCCc----HHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHHhhhhhhccc-c--Cccccchhhhhhhcc
Q 015516 93 VLDLP----ELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEKHMKQKWGRI-V--GPAAYREWQWHLASR 160 (405)
Q Consensus 93 LLDLP----eelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWeklV~~kwp~v-L--gpaa~~eWq~h~~~~ 160 (405)
...|| +++-+.||+||...|||+--.||+.-+......-+|.+++.+++... + |-+-++.|..++-+.
T Consensus 75 i~~lP~~gl~hi~e~ilsyld~~sLc~celv~k~W~r~l~dg~~WKkLie~~vr~dslWrgl~e~rqw~~~lf~~ 149 (499)
T KOG0281|consen 75 ITALPEQGLDHIAENILSYLDALSLCACELVCKEWKRVLSDGMLWKKLIERMVRTDSLWRGLSERRQWDQYLFKN 149 (499)
T ss_pred HHhcccccHHHHHHHHHHhcchhhhhHHHHHHHHHHHHhccchHHHHHHHHhcchHHHHhhhhhccCcchhhccC
Confidence 34688 99999999999999999999999999999999999999999998733 2 777777887766543
No 11
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=82.91 E-value=0.64 Score=49.46 Aligned_cols=54 Identities=22% Similarity=0.283 Sum_probs=49.0
Q ss_pred CCCCCCcHHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHHhhhhhhccc
Q 015516 91 MSVLDLPELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEKHMKQKWGRI 144 (405)
Q Consensus 91 ~sLLDLPeelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWeklV~~kwp~v 144 (405)
--+..||.|+.-.||.+|++.+|+++++||+..+..+..|..|-+...+.-+..
T Consensus 106 dfi~~lp~el~~~il~~Ld~~~l~~~~~v~~~w~~~~~~~~~~~~~~~~~~~~~ 159 (537)
T KOG0274|consen 106 DFLSLLPSELSLHILSFLDGRDLLAVRQVCRNWNKLLDDDKVWWRMCRELIGRL 159 (537)
T ss_pred chhhcccchhcccccccCCHHHhhhhhhhcchhhhhhhccchhhhhhhhhcccC
Confidence 358899999999999999999999999999999999999999998877766653
No 12
>KOG1708 consensus Mitochondrial/chloroplast ribosomal protein L24 [Translation, ribosomal structure and biogenesis]
Probab=80.94 E-value=2 Score=41.88 Aligned_cols=82 Identities=32% Similarity=0.487 Sum_probs=56.3
Q ss_pred eeeecCCCccccccCCCCCC-CcccccccccCCCCCCCeeEEEeeccCCCCccceeeeeccccccCCCCCceecccCCeE
Q 015516 268 IAIECGVPWERLRAAPVDTS-PHDLHISDCLNDLHPGDHIEIQWRRNKEFPYGWWYGVVGHLESCDGNENFCHCHNSETL 346 (405)
Q Consensus 268 ~~ie~~i~WdRlRapPvdT~-~~dlh~Sdcl~dLkPGDHiEIQWRrnkefPYgWWYgvVGHlesC~gn~~~C~Ch~sdtv 346 (405)
.+...+-+|+|.|.+|++-. -||.| +.+||.|||- ||---.=.|-.-.-.-|+|++|
T Consensus 47 ~~~qk~~~w~rrr~~~~e~i~d~dw~-------ff~GDtVeVl---------------vGkDkGkqG~Vtqv~r~~s~Vv 104 (236)
T KOG1708|consen 47 LVLQKNKPWERRRCVPVEPIIDEDWH-------FFFGDTVEVL---------------VGKDKGKQGEVTQVIRHRSWVV 104 (236)
T ss_pred hHHhhcCccccccCCCCCCcccccee-------EecCCEEEEE---------------ecccCCccceEEEEeecCceEE
Confidence 34456789999999999877 47887 7899999994 4443333344334444555555
Q ss_pred EEEEeecCCCCcceeeecccccccccccccCccccceecc
Q 015516 347 ILEFNQYTPGSRWRKTTINRKDHREEGNEADGFYGGIRKV 386 (405)
Q Consensus 347 vlEF~qY~~gSrWR~t~inRk~hrEeG~e~dGfyGGIRKL 386 (405)
|=.. |=| .+-.|.|.+||=|-|+|.
T Consensus 105 V~gl--------------n~k-~r~~gsekeg~pgtivk~ 129 (236)
T KOG1708|consen 105 VKGL--------------NTK-YRHMGSEKEGEPGTIVKS 129 (236)
T ss_pred Eccc--------------chh-hhhhcccccCCCceEEee
Confidence 5322 222 267899999999999986
No 13
>PF04970 LRAT: Lecithin retinol acyltransferase; InterPro: IPR007053 This entry represents a conserved sequence region found in proteins from viruses, bacteria and eukaryotes. It contains a well-conserved NCEHF motif, though its function in these proteins is unknown.; PDB: 2KYT_A 4DOT_A 4FA0_A.
Probab=73.98 E-value=1.8 Score=36.67 Aligned_cols=16 Identities=44% Similarity=0.680 Sum_probs=8.6
Q ss_pred ccCCCCCCCeeEEEee
Q 015516 296 CLNDLHPGDHIEIQWR 311 (405)
Q Consensus 296 cl~dLkPGDHiEIQWR 311 (405)
+...|+||||||++-.
T Consensus 3 ~~~~~~~GD~I~~~r~ 18 (125)
T PF04970_consen 3 DKKRLKPGDHIEVPRG 18 (125)
T ss_dssp ---S--TT-EEEEEET
T ss_pred cccCCCCCCEEEEecC
Confidence 4578999999999754
No 14
>smart00743 Agenet Tudor-like domain present in plant sequences. Domain in plant sequences with possible chromatin-associated functions.
Probab=73.64 E-value=3.8 Score=30.60 Aligned_cols=23 Identities=35% Similarity=0.807 Sum_probs=19.9
Q ss_pred CCCCCCeeEEEeeccCCCCccceeeee
Q 015516 299 DLHPGDHIEIQWRRNKEFPYGWWYGVV 325 (405)
Q Consensus 299 dLkPGDHiEIQWRrnkefPYgWWYgvV 325 (405)
+++.||.||++|+- --+||-|+|
T Consensus 2 ~~~~G~~Ve~~~~~----~~~W~~a~V 24 (61)
T smart00743 2 DFKKGDRVEVFSKE----EDSWWEAVV 24 (61)
T ss_pred CcCCCCEEEEEECC----CCEEEEEEE
Confidence 57999999999975 468999977
No 15
>PF00018 SH3_1: SH3 domain; InterPro: IPR001452 SH3 (src Homology-3) domains are small protein modules containing approximately 50 amino acid residues [, ]. They are found in a great variety of intracellular or membrane-associated proteins [, , ] for example, in a variety of proteins with enzymatic activity, in adaptor proteins that lack catalytic sequences and in cytoskeletal proteins, such as fodrin and yeast actin binding protein ABP-1. The SH3 domain has a characteristic fold which consists of five or six beta-strands arranged as two tightly packed anti-parallel beta sheets. The linker regions may contain short helices []. The surface of the SH3-domain bears a flat, hydrophobic ligand-binding pocket which consists of three shallow grooves defined by conservative aromatic residues in which the ligand adopts an extended left-handed helical arrangement. The ligand binds with low affinity but this may be enhanced by multiple interactions. The region bound by the SH3 domain is in all cases proline-rich and contains PXXP as a core-conserved binding motif. The function of the SH3 domain is not well understood but they may mediate many diverse processes such as increasing local concentration of proteins, altering their subcellular location and mediating the assembly of large multiprotein complexes []. The crystal structure of the SH3 domain of the cytoskeletal protein spectrin, and the solution structures of SH3 domains of phospholipase C (PLC-y) and phosphatidylinositol 3-kinase p85 alpha-subunit, have been determined [, , ]. In spite of relatively limited sequence similarity, their overall structures are similar. The domains belong to the alpha+beta structural class, with 5 to 8 beta-strands forming 2 tightly-packed, anti-parallel beta-sheets arranged in a barrel-like structure, and intervening loops sometimes forming helices. Conserved aliphatic and aromatic residues form a hydrophobic core (A11, L23, A29, V34, W42, L52 and V59 in PLC-y []) and a hydrophobic pocket on the molecular surface (L12, F13, W53 and P55 in PLC-y). The conserved core is believed to stabilise the fold, while the pocket is thought to serve as a binding site for target proteins. Conserved carboxylic amino acids located in the loops, on the periphery of the pocket (D14 and E22), may be involved in protein-protein interactions via proline-rich regions. The N- and C-termini are packed in close proximity, indicating that they are independent structural modules.; GO: 0005515 protein binding; PDB: 1UHF_A 1W1F_A 1WA7_A 1SEM_A 1KFZ_A 2SEM_B 1K76_A 3SEM_B 1X2Q_A 2J06_B ....
Probab=73.28 E-value=2.8 Score=30.03 Aligned_cols=24 Identities=29% Similarity=0.642 Sum_probs=18.9
Q ss_pred CCCCCCeeEEEeeccCCCCccceeeeec
Q 015516 299 DLHPGDHIEIQWRRNKEFPYGWWYGVVG 326 (405)
Q Consensus 299 dLkPGDHiEIQWRrnkefPYgWWYgvVG 326 (405)
.++.||.|+|-=+.+. |||+|-..
T Consensus 15 s~~~Gd~i~v~~~~~~----~Ww~~~~~ 38 (48)
T PF00018_consen 15 SFKKGDIIEVLEKSDD----GWWKVRNE 38 (48)
T ss_dssp EB-TTEEEEEEEESSS----SEEEEEET
T ss_pred eEECCCEEEEEEecCC----CEEEEEEC
Confidence 3799999999876666 99999763
No 16
>PF14250 AbrB-like: AbrB-like transcriptional regulator
Probab=72.98 E-value=2.3 Score=35.21 Aligned_cols=21 Identities=33% Similarity=0.439 Sum_probs=17.0
Q ss_pred ccccccCCCCCCCeeEEEeec
Q 015516 292 HISDCLNDLHPGDHIEIQWRR 312 (405)
Q Consensus 292 h~Sdcl~dLkPGDHiEIQWRr 312 (405)
|+..=.-+|+|||.+||+=.|
T Consensus 45 ~AYT~~m~L~PGdEFeI~Lgr 65 (71)
T PF14250_consen 45 SAYTKQMGLKPGDEFEIKLGR 65 (71)
T ss_pred HHHHHHhCCCCCCEEEEEeCc
Confidence 456667799999999999655
No 17
>PF07653 SH3_2: Variant SH3 domain; InterPro: IPR011511 SH3 (src Homology-3) domains are small protein modules containing approximately 50 amino acid residues [, ]. They are found in a great variety of intracellular or membrane-associated proteins [, , ] for example, in a variety of proteins with enzymatic activity, in adaptor proteins that lack catalytic sequences and in cytoskeletal proteins, such as fodrin and yeast actin binding protein ABP-1. The SH3 domain has a characteristic fold which consists of five or six beta-strands arranged as two tightly packed anti-parallel beta sheets. The linker regions may contain short helices []. The surface of the SH3-domain bears a flat, hydrophobic ligand-binding pocket which consists of three shallow grooves defined by conservative aromatic residues in which the ligand adopts an extended left-handed helical arrangement. The ligand binds with low affinity but this may be enhanced by multiple interactions. The region bound by the SH3 domain is in all cases proline-rich and contains PXXP as a core-conserved binding motif. The function of the SH3 domain is not well understood but they may mediate many diverse processes such as increasing local concentration of proteins, altering their subcellular location and mediating the assembly of large multiprotein complexes []. This entry represents a variant of the SH3 domain.; PDB: 1I1J_B 1K0X_A 1HJD_A 2KEA_A 1KJW_A 1JXM_A 1JXO_B 2EBP_A 2DL3_A 2EYX_A ....
Probab=70.72 E-value=3 Score=30.72 Aligned_cols=24 Identities=38% Similarity=0.819 Sum_probs=17.3
Q ss_pred CCCCCCeeEEEeeccCCCCccceeeee
Q 015516 299 DLHPGDHIEIQWRRNKEFPYGWWYGVV 325 (405)
Q Consensus 299 dLkPGDHiEIQWRrnkefPYgWWYgvV 325 (405)
.|+.||.|+|. +.++-+ |||+|..
T Consensus 17 s~~~Gd~i~v~--~~~~~~-~ww~~~~ 40 (55)
T PF07653_consen 17 SFKKGDVIEVL--GEKDDD-GWWLGEN 40 (55)
T ss_dssp EB-TTEEEEEE--EEECST-SEEEEEE
T ss_pred EEecCCEEEEE--EeecCC-CEEEEEE
Confidence 47899999998 233333 9999977
No 18
>KOG4408 consensus Putative Mg2+ and Co2+ transporter CorD [Inorganic ion transport and metabolism]
Probab=66.17 E-value=1.4 Score=45.65 Aligned_cols=52 Identities=25% Similarity=0.241 Sum_probs=46.9
Q ss_pred CCCCCCCcHHHHHHHHccCChhhHHHHHhhcHHHHHhccChhHHHHhhhhhh
Q 015516 90 EMSVLDLPELVLECILEKLPPSGLCNMAAVCSSLRDRCMSDHLWEKHMKQKW 141 (405)
Q Consensus 90 ~~sLLDLPeelLd~ILs~LSP~DLcrmAaVCrsLRerc~SD~LWeklV~~kw 141 (405)
.+.+-+||..+++.||+++-+.+|.+.|.|.+.|.+-++-+.+|.++-+..|
T Consensus 5 ~~~le~~~~~~l~~vls~~~~~~~~~~a~vs~rLk~~~s~~~lw~r~c~k~l 56 (386)
T KOG4408|consen 5 PLGLEWLPRDPLHLVLSFLLYRDLINCAYVSRRLKELGSHLPLWNRPCKKYL 56 (386)
T ss_pred ccchhhcccccceeeecccchhhhhcceeechHHhhhhhccccccccccccc
Confidence 3678889999999999999999999999999999999999999998874433
No 19
>PF00855 PWWP: PWWP domain; InterPro: IPR000313 Upon characterisation of WHSC1, a gene mapping to the Wolf-Hirschhornsyndrome critical region and at its C terminus similar to the Drosophila melanogaster ASH1/trithorax group proteins, a novel protein domain designated PWWP domain was identified []. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. It is present in proteins of nuclear origin and plays a role in cell growth and differentiation. Due to its position, the composition of amino acids close to the PWWP motif and the pattern of other domains present it has been suggested that the domain is involved in protein-protein interactions [].; PDB: 3LYI_B 2L89_A 2NLU_A 1RI0_A 1KHC_A 3QKJ_C 2DAQ_A 1N27_A 3PFS_B 3QJ6_A ....
Probab=63.32 E-value=10 Score=29.54 Aligned_cols=52 Identities=21% Similarity=0.551 Sum_probs=34.6
Q ss_pred CCCCeeEEEeeccCCCCccceeeeeccccccCCCCCceecccCCeEEEEEeecCCCCcceeeecccc
Q 015516 301 HPGDHIEIQWRRNKEFPYGWWYGVVGHLESCDGNENFCHCHNSETLILEFNQYTPGSRWRKTTINRK 367 (405)
Q Consensus 301 kPGDHiEIQWRrnkefPYgWWYgvVGHlesC~gn~~~C~Ch~sdtvvlEF~qY~~gSrWR~t~inRk 367 (405)
+|||.| |=+-+.|| ||=|+|=+.+.+.++ -...+.+.|.| .|+. ..+-|+.+
T Consensus 2 ~~GdlV---WaK~~g~p--wWPa~V~~~~~~~~~-----~~~~~~~~V~F----fg~~-~~~wv~~~ 53 (86)
T PF00855_consen 2 RPGDLV---WAKLKGYP--WWPARVCDPDEKSKK-----KRKDGHVLVRF----FGDN-DYAWVKPS 53 (86)
T ss_dssp STTEEE---EEEETTSE--EEEEEEEECCHCTSC-----SSSSTEEEEEE----TTTT-EEEEEEGG
T ss_pred CCCCEE---EEEeCCCC--CCceEEeeccccccc-----CCCCCEEEEEe----cCCC-CEEEECHH
Confidence 678766 66778887 999999888766553 34566777777 2344 44445443
No 20
>COG1064 AdhP Zn-dependent alcohol dehydrogenases [General function prediction only]
Probab=59.69 E-value=5.8 Score=40.55 Aligned_cols=47 Identities=30% Similarity=0.543 Sum_probs=36.3
Q ss_pred CCCCCCcccc------cccccCCCCCCCeeEEEeeccCCCCccceeeeeccccccC-CCCCceec
Q 015516 283 PVDTSPHDLH------ISDCLNDLHPGDHIEIQWRRNKEFPYGWWYGVVGHLESCD-GNENFCHC 340 (405)
Q Consensus 283 PvdT~~~dlh------~Sdcl~dLkPGDHiEIQWRrnkefPYgWWYgvVGHlesC~-gn~~~C~C 340 (405)
++-+|-|..= -++. ..+|+||.+=| ||++.-.||=|.|. |+||+|.=
T Consensus 57 ~P~ipGHEivG~V~~vG~~V-~~~k~GDrVgV----------~~~~~~Cg~C~~C~~G~E~~C~~ 110 (339)
T COG1064 57 LPLIPGHEIVGTVVEVGEGV-TGLKVGDRVGV----------GWLVISCGECEYCRSGNENLCPN 110 (339)
T ss_pred CCccCCcceEEEEEEecCCC-ccCCCCCEEEe----------cCccCCCCCCccccCcccccCCC
Confidence 4556666654 1333 37999999998 78999999999997 89999764
No 21
>cd05829 Sortase_E Sortase E (SrtE) is a membrane transpeptidase found in gram-positive bacteria that cleaves surface proteins at a cell sorting motif and catalyzes a transpeptidation reaction in which the surface protein substrate is covalently linked to peptidoglycan for display on the bacterial surface. Sortases are grouped into different classes and subfamilies based on sequence, membrane topology, genomic positioning, and cleavage site preference. The function of Sortase E is unknown. In two different sortase families, the N-terminus either functions as both a signal peptide for secretion and a stop-transfer signal for membrane anchoring, or it contains a signal peptide only and the C-terminus serves as a membrane anchor. Most gram-positive bacteria contain more than one sortase and it is thought that the different sortases anchor different surface protein classes. The sortase domain is a modified beta-barrel flanked by two (SrtA) or three (SrtB) short alpha-helices.
Probab=56.74 E-value=5.4 Score=35.16 Aligned_cols=23 Identities=26% Similarity=0.367 Sum_probs=17.5
Q ss_pred cccCCCCCCCeeEEEeeccCCCC
Q 015516 295 DCLNDLHPGDHIEIQWRRNKEFP 317 (405)
Q Consensus 295 dcl~dLkPGDHiEIQWRrnkefP 317 (405)
.-|++|++||.|.|+....+.+=
T Consensus 68 ~~L~~l~~GD~I~v~~~~g~~~~ 90 (144)
T cd05829 68 FRLGDLRKGDKVEVTRADGQTAT 90 (144)
T ss_pred cchhcCCCCCEEEEEECCCCEEE
Confidence 34679999999999996554443
No 22
>PF12969 DUF3857: Domain of Unknown Function with PDB structure (DUF3857); InterPro: IPR024618 This domain is based on the first domain of the PDB structure 3KD4 (residues 1-228). It is structurally similar to domains in other hydrolases, eg. M1 family aminopeptidase, despite lack of any significant sequence similarity. The domain is N-terminal to a transglutaminase domain, which is found in many proteins known to have transglutaminase activity. The function of this domain is unknown. ; PDB: 3KD4_A.
Probab=51.41 E-value=13 Score=31.99 Aligned_cols=23 Identities=26% Similarity=0.388 Sum_probs=14.0
Q ss_pred cccCCCCCCCeeEEEeeccCCCC
Q 015516 295 DCLNDLHPGDHIEIQWRRNKEFP 317 (405)
Q Consensus 295 dcl~dLkPGDHiEIQWRrnkefP 317 (405)
--+.+|+|||-||++++....-|
T Consensus 85 ~~~p~v~~GdiIe~~y~~~~~~p 107 (177)
T PF12969_consen 85 FAFPDVRVGDIIEYSYTIKSKNP 107 (177)
T ss_dssp EE--S--TT-EEEEEEEEEE-TT
T ss_pred EEcCCCCCCcEEEEEEEEEecCC
Confidence 56889999999999999865544
No 23
>PF07886 BA14K: BA14K-like protein; InterPro: IPR012413 The sequences found in this family are similar to the BA14K proteins expressed by Brucella abortus (Q44701 from SWISSPROT) and by Brucella suis (Q8FVU0 from SWISSPROT). BA14K was found to be strongly immunoreactive; it induces both humoral and cellular responses in hosts throughout the infective process [].
Probab=49.95 E-value=6.1 Score=27.91 Aligned_cols=23 Identities=35% Similarity=0.562 Sum_probs=18.3
Q ss_pred ecceeeeeecCCcceeecCCCCCCCc
Q 015516 242 CYDAELSYDQSTDTFQARYPPHGRRP 267 (405)
Q Consensus 242 CYDA~lsYd~~TDTF~ARYpphGrr~ 267 (405)
|.+--=|||++|+||++. .|+|.
T Consensus 6 C~~rYRSy~p~~~Ty~~~---~G~r~ 28 (31)
T PF07886_consen 6 CARRYRSYDPRDNTYQPY---DGPRR 28 (31)
T ss_pred HHHHhcCCCCCCCcEeCC---CCccc
Confidence 566667999999999987 67664
No 24
>PF05641 Agenet: Agenet domain; InterPro: IPR008395 This domain is related to the TUDOR domain IPR008191 from INTERPRO []. The function of the agenet domain is unknown. This signature matches one of the two Agenet domains in the FMR proteins [].; GO: 0003723 RNA binding; PDB: 2BKD_N 3O8V_A 3KUF_A 3H8Z_A.
Probab=44.75 E-value=34 Score=26.68 Aligned_cols=44 Identities=18% Similarity=0.517 Sum_probs=27.5
Q ss_pred CCCCCeeEEEeeccCCCCccceeeeeccccccCCCCCceecccCCeEEEEEeecCCC
Q 015516 300 LHPGDHIEIQWRRNKEFPYGWWYGVVGHLESCDGNENFCHCHNSETLILEFNQYTPG 356 (405)
Q Consensus 300 LkPGDHiEIQWRrnkefPYgWWYgvVGHlesC~gn~~~C~Ch~sdtvvlEF~qY~~g 356 (405)
.++||.|||- .....|.=.|+-|+| .+..++. ++.+||.-+...
T Consensus 1 F~~G~~VEV~-s~e~g~~gaWf~a~V---~~~~~~~---------~~~V~Y~~~~~~ 44 (68)
T PF05641_consen 1 FKKGDEVEVS-SDEDGFRGAWFPATV---LKENGDD---------KYLVEYDDLPDE 44 (68)
T ss_dssp --TT-EEEEE-E-SBTT--EEEEEEE---EEEETT----------EEEEEETT-SS-
T ss_pred CCCCCEEEEE-EcCCCCCcEEEEEEE---EEeCCCc---------EEEEEECCcccc
Confidence 4799999996 455677777888999 6776664 788888766643
No 25
>PF09372 PRANC: PRANC domain; InterPro: IPR018272 This presumed domain is found at the C terminus of a variety of Pox virus proteins. The PRANC (Pox proteins Repeats of ANkyrin, C-terminal) domain is also found on its own in some proteins []. The function of this domain is unknown, but it appears to be related to the F-box domain and may play a similar role.
Probab=41.90 E-value=24 Score=28.90 Aligned_cols=27 Identities=26% Similarity=0.089 Sum_probs=24.8
Q ss_pred CCCCCCCcHHHHHHHHccCChhhHHHH
Q 015516 90 EMSVLDLPELVLECILEKLPPSGLCNM 116 (405)
Q Consensus 90 ~~sLLDLPeelLd~ILs~LSP~DLcrm 116 (405)
...+-.||.|+-..||++|+-.||..+
T Consensus 69 ~~~w~~LP~EIk~~Il~~L~~~dL~~l 95 (97)
T PF09372_consen 69 NNYWNILPIEIKYKILEYLSNKDLKKL 95 (97)
T ss_pred CCchhhCCHHHHHHHHHcCCHHHHHHH
Confidence 467999999999999999999999876
No 26
>PF03489 SapB_2: Saposin-like type B, region 2; InterPro: IPR008138 Saposins are small lysosomal proteins that serve as activators of various lysosomal lipid-degrading enzymes []. They probably act by isolating the lipid substrate from the membrane surroundings, thus making it more accessible to the soluble degradative enzymes. All mammalian saposins are synthesized as a single precursor molecule (prosaposin) which contains four Saposin-B domains, yielding the active saposins after proteolytic cleavage, and two Saposin-A domains that are removed in the activation reaction. The Saposin-B domains also occur in other proteins, many of them active in the lysis of membranes [, ].; PDB: 3BQQ_A 2RB3_B 2R0R_A 3BQP_A 2R1Q_A 1NKL_A 1L9L_A 1QDM_C 3RFI_A 4DDJ_A ....
Probab=41.31 E-value=27 Score=23.66 Aligned_cols=23 Identities=17% Similarity=0.626 Sum_probs=20.3
Q ss_pred HHHHHHHHccCChhhHHHHHhhc
Q 015516 98 ELVLECILEKLPPSGLCNMAAVC 120 (405)
Q Consensus 98 eelLd~ILs~LSP~DLcrmAaVC 120 (405)
+.+++.+.+.++|.++|.+-..|
T Consensus 13 ~~ii~~l~~~~~p~~iC~~i~~C 35 (35)
T PF03489_consen 13 PQIIQLLEKQLDPQQICTKIGLC 35 (35)
T ss_dssp HHHHHHHHTTSTHHHHHHHTTSS
T ss_pred HHHHHHHHhcCChHHHHHHcCCC
Confidence 47899999999999999987776
No 27
>cd05835 Dnmt3b_related The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis. In tumorigenesis, DNA methylation by Dnmt3b is known to play a role in the inactivation of tumor suppressor genes. In addition, a point mutation in the PWWP domain of Dnmt3b has been identified in patients with ICF syndrome (immunodeficiency, centromeric instability, and facial anomalies), a rare autosomal recessive disorder characterized by hypomethylation of classical satellite DNA. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=40.94 E-value=42 Score=27.55 Aligned_cols=40 Identities=25% Similarity=0.506 Sum_probs=25.3
Q ss_pred CCCCeeEEEeeccCCCCccceeeeeccccccCCCCCceecccCCeEEEEE
Q 015516 301 HPGDHIEIQWRRNKEFPYGWWYGVVGHLESCDGNENFCHCHNSETLILEF 350 (405)
Q Consensus 301 kPGDHiEIQWRrnkefPYgWWYgvVGHlesC~gn~~~C~Ch~sdtvvlEF 350 (405)
++||.| |=+-|.|| ||=|.|=+.. ....+.+...++.+.|
T Consensus 2 ~vGDlV---WaK~kg~p--wWP~~V~~~~-----~~~~~~~~~~~~~V~f 41 (87)
T cd05835 2 NVGDLV---WGKIKGFP--WWPGRVVSIT-----VTSKRPPVVGMRWVTW 41 (87)
T ss_pred CCCCEE---EEecCCCC--CCCeEEechh-----hcccccCCCCeEEEEE
Confidence 688877 77778887 9999994432 1111134455666666
No 28
>PF00970 FAD_binding_6: Oxidoreductase FAD-binding domain; InterPro: IPR008333 These sequences contain an oxidoreductase FAD-binding domain. To date, the 3D-structures of the flavoprotein domain of Zea mays (Maize) nitrate reductase [] and of pig NADH:cytochrome b5 reductase [] have been solved. The overall fold is similar to that of ferredoxin:NADP+ reductase []: the FAD-binding domain (N-terminal) has the topology of an anti-parallel beta-barrel, while the NAD(P)-binding domain (C-terminal) has the topology of a classical pyridine dinucleotide-binding fold (i.e. a central parallel beta-sheet flanked by 2 helices on each side).; PDB: 1JB9_A 3LVB_A 3LO8_A 1FRN_A 1FND_A 1BX1_A 1FNC_A 1FNB_A 1BX0_A 1FRQ_A ....
Probab=40.43 E-value=9.4 Score=30.17 Aligned_cols=18 Identities=44% Similarity=0.875 Sum_probs=15.3
Q ss_pred ccccccCCCCCCCeeEEE
Q 015516 292 HISDCLNDLHPGDHIEIQ 309 (405)
Q Consensus 292 h~Sdcl~dLkPGDHiEIQ 309 (405)
.+|..|++|++||.|+|-
T Consensus 74 ~~S~~L~~l~~Gd~v~i~ 91 (99)
T PF00970_consen 74 RVSRYLHQLKPGDEVEIR 91 (99)
T ss_dssp HHHHHHHTSCTTSEEEEE
T ss_pred HHHHHHHhCCCCCEEEEE
Confidence 357889999999999984
No 29
>TIGR03784 marine_sortase sortase, marine proteobacterial type. Members of this protein family are sortase enzymes, cysteine transpeptidases involved in protein sorting activities. Members of this family tend to be found in proteobacteria, rather than in Gram-positive bacteria where sortases attach proteins to the Gram-positive cell wall or participate in pilin cross-linking. Many species with this sortase appear to contain a signal target sequence, a protein with a Vault protein inter-alpha-trypsin domain (pfam08487) and a von Willebrand factor type A domain (pfam00092), encoded by an adjacent gene. These sortases are designated subfamily 6 according to Comfort and Clubb (2004).
Probab=39.26 E-value=14 Score=34.25 Aligned_cols=22 Identities=32% Similarity=0.426 Sum_probs=17.9
Q ss_pred cccCCCCCCCeeEEEeeccCCC
Q 015516 295 DCLNDLHPGDHIEIQWRRNKEF 316 (405)
Q Consensus 295 dcl~dLkPGDHiEIQWRrnkef 316 (405)
.-|++|++||.|+||-..++.+
T Consensus 104 ~~L~~L~~GD~I~v~~~~g~~~ 125 (174)
T TIGR03784 104 AFLQELRPGDVIRLQTPDGQWQ 125 (174)
T ss_pred CChhhCCCCCEEEEEECCCeEE
Confidence 3489999999999998766543
No 30
>PF14604 SH3_9: Variant SH3 domain; PDB: 2CRE_A 2E5K_A 2CT3_A 2DE0_X 2D8H_A 2DA9_A 2X3X_E 2X3W_D 2KRN_A 2ED0_A ....
Probab=39.25 E-value=25 Score=25.75 Aligned_cols=23 Identities=35% Similarity=0.790 Sum_probs=17.1
Q ss_pred CCCCCCeeEEEeeccCCCCccceeeee
Q 015516 299 DLHPGDHIEIQWRRNKEFPYGWWYGVV 325 (405)
Q Consensus 299 dLkPGDHiEIQWRrnkefPYgWWYgvV 325 (405)
.|+.||.|+|--+ -.=|||+|..
T Consensus 14 s~~~Gd~i~v~~~----~~~~W~~g~~ 36 (49)
T PF14604_consen 14 SFKKGDVITVLEK----SDDGWWYGRN 36 (49)
T ss_dssp EB-TTEEEEEEEE----SSTSEEEEEE
T ss_pred eEcCCCEEEEEEe----CCCCEEEEEE
Confidence 4799999999844 3469999963
No 31
>smart00326 SH3 Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Probab=39.12 E-value=34 Score=23.33 Aligned_cols=23 Identities=30% Similarity=0.739 Sum_probs=18.4
Q ss_pred CCCCCCeeEEEeeccCCCCccceeeee
Q 015516 299 DLHPGDHIEIQWRRNKEFPYGWWYGVV 325 (405)
Q Consensus 299 dLkPGDHiEIQWRrnkefPYgWWYgvV 325 (405)
.|++||.|+|-=+. .-|||+|..
T Consensus 20 ~~~~Gd~v~v~~~~----~~~w~~~~~ 42 (58)
T smart00326 20 SFKKGDIITVLEKS----DDGWWKGRL 42 (58)
T ss_pred CCCCCCEEEEEEcC----CCCeEEEEe
Confidence 58999999998554 469999864
No 32
>smart00306 HintN Hint (Hedgehog/Intein) domain N-terminal region. Hedgehog/Intein domain, N-terminal region. Domain has been split to accommodate large insertions of endonucleases.
Probab=38.20 E-value=16 Score=28.31 Aligned_cols=25 Identities=32% Similarity=0.465 Sum_probs=20.1
Q ss_pred CCCCcccccccc-------cCCCCCCCeeEEE
Q 015516 285 DTSPHDLHISDC-------LNDLHPGDHIEIQ 309 (405)
Q Consensus 285 dT~~~dlh~Sdc-------l~dLkPGDHiEIQ 309 (405)
-|+-|-+.+.+- ..+|++||+|-+|
T Consensus 68 ~T~~H~~~~~~~~~~~w~~a~~l~~gd~v~~~ 99 (100)
T smart00306 68 LTPDHLLLVRDGGKLVWVFASELKPGDYVLVP 99 (100)
T ss_pred ECCCCEEEEecCCcEEEEEHHHCCCCCEEEec
Confidence 477788887766 7899999999875
No 33
>cd04459 Rho_CSD Rho_CSD: Rho protein cold-shock domain (CSD). Rho protein is a transcription termination factor in most bacteria. In bacteria, there are two distinct mechanisms for mRNA transcription termination. In intrinsic termination, RNA polymerase and nascent mRNA are released from DNA template by an mRNA stem loop structure, which resembles the transcription termination mechanism used by eukaryotic pol III. The second mechanism is mediated by Rho factor. Rho factor terminates transcription by using energy from ATP hydrolysis to forcibly dissociate the transcripts from RNA polymerase. Rho protein contains an N-terminal S1-like domain, which binds single-stranded RNA. Rho has a C-terminal ATPase domain which hydrolyzes ATP to provide energy to strip RNA polymerase and mRNA from the DNA template. Rho functions as a homohexamer.
Probab=37.73 E-value=35 Score=27.63 Aligned_cols=36 Identities=31% Similarity=0.368 Sum_probs=27.8
Q ss_pred cccCCC--CCCCccccccccc---CCCCCCCeeEEEeeccC
Q 015516 279 LRAAPV--DTSPHDLHISDCL---NDLHPGDHIEIQWRRNK 314 (405)
Q Consensus 279 lRapPv--dT~~~dlh~Sdcl---~dLkPGDHiEIQWRrnk 314 (405)
||.+-. ...+-|.++|--+ ..||+||.|+=+=|..+
T Consensus 15 LR~~~~~y~~~~~DvyVs~~~Irr~~LR~GD~V~G~vr~p~ 55 (68)
T cd04459 15 LRSSGYNYLPGPDDIYVSPSQIRRFNLRTGDTVVGQIRPPK 55 (68)
T ss_pred EecCCcCCCCCCCCEEECHHHHHHhCCCCCCEEEEEEeCCC
Confidence 555533 5678888888876 68999999999988644
No 34
>PF07497 Rho_RNA_bind: Rho termination factor, RNA-binding domain; InterPro: IPR011113 The Rho termination factor disengages newly transcribed RNA from its DNA template at certain, specific transcripts. It is thought that two copies of Rho bind to RNA and that Rho functions as a hexamer of protomers [].; GO: 0003723 RNA binding, 0006353 transcription termination, DNA-dependent; PDB: 1A8V_B 1PVO_A 1PV4_D 3ICE_A 1XPU_C 1XPO_D 1XPR_F 2A8V_B 2HT1_B 1A63_A ....
Probab=36.88 E-value=33 Score=28.56 Aligned_cols=38 Identities=34% Similarity=0.363 Sum_probs=24.2
Q ss_pred ccccC--CCCCCCccccccccc---CCCCCCCeeEEEeeccCC
Q 015516 278 RLRAA--PVDTSPHDLHISDCL---NDLHPGDHIEIQWRRNKE 315 (405)
Q Consensus 278 RlRap--PvdT~~~dlh~Sdcl---~dLkPGDHiEIQWRrnke 315 (405)
-||.+ --.+.+.|.|+|-.+ ..||.||.|+=+=|..|+
T Consensus 16 FLR~~~~~y~~~~~DvYVs~~qIrrf~LR~GD~V~G~vr~p~~ 58 (78)
T PF07497_consen 16 FLRSPDNNYLPSPDDVYVSPSQIRRFGLRTGDLVEGQVRPPRE 58 (78)
T ss_dssp EEE-GGGTTS-STTSEEE-CCCCCCTT--TTEEEEEEEE--ST
T ss_pred EeECCCcCCCCCCCCEEECHHHHHHcCCCCCCEEEEEEeCCCC
Confidence 36666 556678899998876 589999999988887654
No 35
>COG1018 Hmp Flavodoxin reductases (ferredoxin-NADPH reductases) family 1 [Energy production and conversion]
Probab=33.29 E-value=25 Score=34.27 Aligned_cols=28 Identities=25% Similarity=0.275 Sum_probs=23.4
Q ss_pred ccccccC-CCCCCCeeEEEeeccCCCCccc
Q 015516 292 HISDCLN-DLHPGDHIEIQWRRNKEFPYGW 320 (405)
Q Consensus 292 h~Sdcl~-dLkPGDHiEIQWRrnkefPYgW 320 (405)
-.|..|+ +|||||.|+| =.-.-+|++.-
T Consensus 79 ~~S~~Lh~~lk~Gd~l~v-~~P~G~F~l~~ 107 (266)
T COG1018 79 GGSNWLHDHLKVGDTLEV-SAPAGDFVLDD 107 (266)
T ss_pred cccHHHHhcCCCCCEEEE-ecCCCCccCCC
Confidence 3688898 9999999999 66777888765
No 36
>PF11270 DUF3070: Protein of unknown function (DUF3070); InterPro: IPR021423 This eukaryotic family of proteins has no known function.
Probab=32.44 E-value=30 Score=24.95 Aligned_cols=24 Identities=33% Similarity=0.553 Sum_probs=18.5
Q ss_pred CCCCCC-CcccccccccCCCCCCCe
Q 015516 282 APVDTS-PHDLHISDCLNDLHPGDH 305 (405)
Q Consensus 282 pPvdT~-~~dlh~Sdcl~dLkPGDH 305 (405)
|||+.| +-|-|..--++++.||||
T Consensus 2 aPa~~Pp~Eeaheap~~s~~~~~d~ 26 (32)
T PF11270_consen 2 APAEEPPAEEAHEAPHDSDLYPSDH 26 (32)
T ss_pred CcccCCChhhhccCcccccCCCCCc
Confidence 444444 678888888899999998
No 37
>cd05162 PWWP The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes. The function of the PWWP domain is still not known precisely; however, based on the fact that other regions of PWWP-domain proteins are responsible for nuclear localization and DNA-binding, is likely that the PWWP domain acts as a site for protein-protein binding interactions, influencing chromatin remodeling and thereby regulating transcriptional processes. Some PWWP-domain proteins have been linked to cancer or other diseases; some are known to function as growth factors.
Probab=32.24 E-value=68 Score=25.58 Aligned_cols=29 Identities=31% Similarity=0.845 Sum_probs=21.4
Q ss_pred CCCCCeeEEEeeccCCCCccceeeeeccccccCC
Q 015516 300 LHPGDHIEIQWRRNKEFPYGWWYGVVGHLESCDG 333 (405)
Q Consensus 300 LkPGDHiEIQWRrnkefPYgWWYgvVGHlesC~g 333 (405)
++|||.| |=+-+.|| ||=|+|-+......
T Consensus 1 f~~GdlV---waK~~g~p--wWPa~V~~~~~~~~ 29 (87)
T cd05162 1 FRPGDLV---WAKMKGYP--WWPALVVDPPKDSK 29 (87)
T ss_pred CCCCCEE---EEeCCCCC--CCCEEEccccccch
Confidence 4789877 55566666 99999987776643
No 38
>cd00174 SH3 Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Probab=30.74 E-value=57 Score=22.06 Aligned_cols=23 Identities=35% Similarity=0.826 Sum_probs=17.7
Q ss_pred CCCCCCeeEEEeeccCCCCccceeeee
Q 015516 299 DLHPGDHIEIQWRRNKEFPYGWWYGVV 325 (405)
Q Consensus 299 dLkPGDHiEIQWRrnkefPYgWWYgvV 325 (405)
.|++||-|.|-=+.+ =|||+|-.
T Consensus 17 ~~~~Gd~v~v~~~~~----~~w~~~~~ 39 (54)
T cd00174 17 SFKKGDIIEVLEKSD----DGWWEGRL 39 (54)
T ss_pred CCCCCCEEEEEEcCC----CCeEEEEE
Confidence 589999999986633 28998864
No 39
>TIGR03635 S17_bact 30S ribosomal protein S17. This model describes the bacterial ribosomal small subunit protein S17, while excluding cytosolic eukaryotic homologs and archaeal homologs. The model finds many, but not, chloroplast and mitochondrial counterparts to bacterial S17.
Probab=27.45 E-value=48 Score=26.97 Aligned_cols=13 Identities=31% Similarity=0.404 Sum_probs=11.3
Q ss_pred cCCCCCCCeeEEE
Q 015516 297 LNDLHPGDHIEIQ 309 (405)
Q Consensus 297 l~dLkPGDHiEIQ 309 (405)
-|+.+.||-|+|+
T Consensus 46 ~~~~k~GD~V~I~ 58 (71)
T TIGR03635 46 NNECKVGDVVRII 58 (71)
T ss_pred CCCCCCCCEEEEE
Confidence 3589999999997
No 40
>cd03470 Rieske_cytochrome_bc1 Iron-sulfur protein (ISP) component of the bc(1) complex family, Rieske domain; The Rieske domain is a [2Fe-2S] cluster binding domain involved in electron transfer. The bc(1) complex is a multisubunit enzyme found in many different organisms including uni- and multi-cellular eukaryotes, plants (in their mitochondria) and bacteria. The cytochrome bc(1) and b6f complexes are central components of the respiratory and photosynthetic electron transport chains, respectively, which carry out similar core electron and proton transfer steps. The bc(1) and b6f complexes share a common core structure of three catalytic subunits: cyt b, the Rieske ISP, and either a cyt c1 in the bc(1) complex or cyt f in the b6f complex, which are arranged in an integral membrane-bound dimeric complex. While the core of the b6f complex is similar to that of the bc(1) complex, the domain arrangement outside the core and the complement of prosthetic groups are strikingly different.
Probab=26.96 E-value=43 Score=29.40 Aligned_cols=19 Identities=26% Similarity=0.678 Sum_probs=16.0
Q ss_pred cCCCCCCCeeEEEeeccCC
Q 015516 297 LNDLHPGDHIEIQWRRNKE 315 (405)
Q Consensus 297 l~dLkPGDHiEIQWRrnke 315 (405)
+.+|.||.-..+.||+..-
T Consensus 6 l~~l~~G~~~~v~w~Gkpv 24 (126)
T cd03470 6 LSKIEEGQLITVEWRGKPV 24 (126)
T ss_pred hhhCCCCCEEEEEECCeEE
Confidence 6789999999999997643
No 41
>COG1545 Predicted nucleic-acid-binding protein containing a Zn-ribbon [General function prediction only]
Probab=26.64 E-value=1.3e+02 Score=26.96 Aligned_cols=96 Identities=21% Similarity=0.199 Sum_probs=51.4
Q ss_pred cCcccccceeeeccCCeeeeeeeecceeeeeecCCcceeecCCCC-CCCceeeecCCCccccccCCCCCCCccccccc-c
Q 015516 219 TGRFWFPAQVYNRENGHVGFMLSCYDAELSYDQSTDTFQARYPPH-GRRPIAIECGVPWERLRAAPVDTSPHDLHISD-C 296 (405)
Q Consensus 219 sG~FWFPAQVYnrenghvGfmLSCYDA~lsYd~~TDTF~ARYpph-Grr~~~ie~~i~WdRlRapPvdT~~~dlh~Sd-c 296 (405)
||+--||.|.|....|--+ -| -..+||=.-.=.||+.=|.+- |-+. +....=.-++..- ...--.|+.| -
T Consensus 35 CG~v~~PPr~~Cp~C~~~~-~~--E~vels~~G~V~t~Tv~~~~~~~~~~---~~P~viaiV~l~~--~~~i~~~i~~~~ 106 (140)
T COG1545 35 CGRVYFPPRAYCPKCGSET-EL--EWVELSGEGKVETYTVVYVKPPGFSL---EEPYVIAIVELEE--GGRILGQLVDVD 106 (140)
T ss_pred CCeEEcCCcccCCCCCCCC-ce--EEEEeCCCeEEEEEEEEeeCCCCccc---CCCEEEEEEEeCC--CCceEEEEEecC
Confidence 6899999999987655443 11 223333333333454443222 2110 0111111111111 1134456666 4
Q ss_pred cCCCCCCCeeEEEeeccCCCCcccee
Q 015516 297 LNDLHPGDHIEIQWRRNKEFPYGWWY 322 (405)
Q Consensus 297 l~dLkPGDHiEIQWRrnkefPYgWWY 322 (405)
-.+++.|+-||..||+..+-.+..+.
T Consensus 107 p~~v~iGm~V~~v~~~~~~~~~~~~~ 132 (140)
T COG1545 107 PDDVEIGMKVEAVFRKREEDGGRGYI 132 (140)
T ss_pred cccccCCCEEEEEEEEccccCCceEE
Confidence 67999999999999998887766543
No 42
>PF11604 CusF_Ec: Copper binding periplasmic protein CusF; InterPro: IPR021647 CusF is a periplasmic protein involved in copper and silver resistance in Escherichia coil. CusF forms a five-stranded beta-barrel OB fold. Cu(I) binds to H36, M47 and M49 which are conserved residues in the protein []. ; PDB: 2L55_A 2VB3_X 1ZEQ_X 2QCP_X 3E6Z_X 2VB2_X.
Probab=24.01 E-value=59 Score=25.79 Aligned_cols=30 Identities=23% Similarity=0.400 Sum_probs=19.1
Q ss_pred CCCcccccccccCCCCCCCeeEEEeeccCC
Q 015516 286 TSPHDLHISDCLNDLHPGDHIEIQWRRNKE 315 (405)
Q Consensus 286 T~~~dlh~Sdcl~dLkPGDHiEIQWRrnke 315 (405)
|-...+--+--+.+|+|||.|+++-.+..+
T Consensus 29 TM~F~v~~~~~l~~l~~Gd~V~F~~~~~~~ 58 (70)
T PF11604_consen 29 TMDFPVADPVDLAGLKPGDKVRFTFERTDD 58 (70)
T ss_dssp EEEEE--TTSEESS-STT-EEEEEEEEETT
T ss_pred EEEEEcCChhhhhcCCCCCEEEEEEEECCC
Confidence 334444445578999999999999887654
No 43
>PF04225 OapA: Opacity-associated protein A LysM-like domain; InterPro: IPR007340 This entry includes the Haemophilus influenzae opacity-associated protein. This protein is required for efficient nasopharyngeal mucosal colonization, and its expression is associated with a distinctive transparent colony phenotype. OapA is thought to be a secreted protein, and its expression exhibits high-frequency phase variation [].; PDB: 2GU1_A.
Probab=23.49 E-value=41 Score=27.70 Aligned_cols=16 Identities=38% Similarity=0.634 Sum_probs=9.2
Q ss_pred cccCCCCCCCeeEEEe
Q 015516 295 DCLNDLHPGDHIEIQW 310 (405)
Q Consensus 295 dcl~dLkPGDHiEIQW 310 (405)
.-|..|||||.|+++=
T Consensus 38 k~L~~L~pGq~l~f~~ 53 (85)
T PF04225_consen 38 KPLTRLKPGQTLEFQL 53 (85)
T ss_dssp --GGG--TT-EEEEEE
T ss_pred chHhhCCCCCEEEEEE
Confidence 4788999999998763
No 44
>cd04453 S1_RNase_E S1_RNase_E: RNase E and RNase G, S1-like RNA-binding domain. RNase E is an essential endoribonuclease in the processing and degradation of RNA. In addition to its role in mRNA degradation, RNase E has also been implicated in the processing of rRNA, and the maturation of tRNA, 10Sa RNA and the M1 precursor of RNase P. RNase E associates with PNPase (3' to 5' exonuclease), Rhl B (DEAD-box RNA helicase) and enolase (glycolytic enzyme) to form the RNA degradosome. RNase E tends to cut mRNA within single-stranded regions that are rich in A/U nucleotides. The N-terminal region of RNase E contains the catalytic site. Within the conserved N-terminal domain of RNAse E and RNase G, there is an S1-like subdomain, which is an ancient single-stranded RNA-binding domain. S1 domain is an RNA-binding module originally identified in the ribosomal protein S1. The S1 domain is required for RNA cleavage by RNase E. RNase G is paralogous to RNase E with an N-terminal catalytic domain th
Probab=23.05 E-value=40 Score=27.61 Aligned_cols=15 Identities=27% Similarity=0.310 Sum_probs=12.7
Q ss_pred CCCCCCCeeEEEeec
Q 015516 298 NDLHPGDHIEIQWRR 312 (405)
Q Consensus 298 ~dLkPGDHiEIQWRr 312 (405)
+.|+.||.|++|=-+
T Consensus 55 ~~~~~Gd~v~VqV~~ 69 (88)
T cd04453 55 KLLKEGQEILVQVVK 69 (88)
T ss_pred HcCCCCCEEEEEEEE
Confidence 459999999999765
No 45
>PF13550 Phage-tail_3: Putative phage tail protein
Probab=22.77 E-value=56 Score=27.71 Aligned_cols=14 Identities=36% Similarity=0.465 Sum_probs=12.4
Q ss_pred CCCCCCCeeEEEee
Q 015516 298 NDLHPGDHIEIQWR 311 (405)
Q Consensus 298 ~dLkPGDHiEIQWR 311 (405)
-.|.|||.|+|++-
T Consensus 138 ~~l~pGDvi~l~~~ 151 (164)
T PF13550_consen 138 LALEPGDVIALSDD 151 (164)
T ss_pred ccCCCCCEEEEEeC
Confidence 45999999999986
No 46
>TIGR02988 YaaA_near_RecF S4 domain protein YaaA. This small protein has a single S4 domain (pfam01479), as do bacterial ribosomal protein S4, some pseudouridine synthases, tyrosyl-tRNA synthetases. The S4 domain may bind RNA. Members of this protein family are found almost exclusively in the Firmicutes, and almost invariably just a few nucleotides upstream of the gene for the DNA replication and repair protein RecF. The few members of this family that are not near recF are found instead near dnaA and/or dnaN, the usual neighbors of recF, near the origin of replication. The conserved location suggests a possible role in replication in the Firmicutes lineage.
Probab=21.87 E-value=54 Score=24.60 Aligned_cols=10 Identities=70% Similarity=1.026 Sum_probs=7.0
Q ss_pred CCCCCCeeEE
Q 015516 299 DLHPGDHIEI 308 (405)
Q Consensus 299 dLkPGDHiEI 308 (405)
-|++||+|+|
T Consensus 49 ~l~~Gd~v~i 58 (59)
T TIGR02988 49 KLYPGDVIEI 58 (59)
T ss_pred CCCCCCEEEe
Confidence 4677777776
No 47
>cd06166 Sortase_D_5 Sortase D (SrtD) is a membrane transpeptidase found in gram-positive bacteria that anchors surface proteins to peptidoglycans of the bacterial cell wall envelope. This involves a transpeptidation reaction in which the surface protein substrate is cleaved at the cell wall sorting signal and covalently linked to peptidoglycan for display on the bacterial surface. Sortases are grouped into different classes and subfamilies based on sequence, membrane topology, genomic positioning, and cleavage site preference. Class D sortases are further classified into subfamilies 4 and 5. This group contains a subset of Class D sortases belonging to subfamily-5, represented by Clostridium perfringens CPE2315. Subfamily-5 sortases recognize a nonstandard sorting signal (LAXTG) and have replaced Sortase A in some gram-postive bacteria. They may play a housekeeping role in the cell.
Probab=21.75 E-value=36 Score=29.13 Aligned_cols=18 Identities=22% Similarity=0.499 Sum_probs=15.2
Q ss_pred ccccCCCCCCCeeEEEee
Q 015516 294 SDCLNDLHPGDHIEIQWR 311 (405)
Q Consensus 294 Sdcl~dLkPGDHiEIQWR 311 (405)
-.-|++|++||.|.|+..
T Consensus 59 F~~L~~l~~Gd~v~v~~~ 76 (126)
T cd06166 59 FNRLDEVEKGDEIKVTTK 76 (126)
T ss_pred cCChHHCCCCCEEEEEEC
Confidence 345889999999999885
No 48
>cd00081 Hint Hedgehog/Intein domain, found in Hedgehog proteins as well as proteins which contain inteins and undergo protein splicing (e.g. DnaB, RIR1-2, GyrA and Pol). In protein splicing an intervening polypeptide sequence - the intein - is excised from a protein, and the flanking polypeptide sequences - the exteins - are joined by a peptide bond. In addition to the autocatalytic splicing domain, many inteins contain an inserted endonuclease domain, which plays a role in spreading inteins. Hedgehog proteins are a major class of intercellular signaling molecules, which control inductive interactions during animal development. The mature signaling forms of hedgehog proteins are the N-terminal fragments, which are covalently linked to cholesterol at their C-termini. This modification is the result of an autoprocessing step catalyzed by the C-terminal fragments, which are aligned here.
Probab=21.69 E-value=45 Score=27.37 Aligned_cols=25 Identities=32% Similarity=0.448 Sum_probs=22.3
Q ss_pred CCCCcccccccc-------cCCCCCCCeeEEE
Q 015516 285 DTSPHDLHISDC-------LNDLHPGDHIEIQ 309 (405)
Q Consensus 285 dT~~~dlh~Sdc-------l~dLkPGDHiEIQ 309 (405)
-|+-|-+.+.+. ..+|++||.|-+|
T Consensus 68 ~T~~H~~~~~~~~~~~~~~a~~l~~gd~l~~~ 99 (136)
T cd00081 68 LTPDHLLFVLEDGELKWVFASDLKPGDYVLVP 99 (136)
T ss_pred EcCCCEEEEEeCCeEEEEEHHHCCCCCEEEEc
Confidence 588899998885 7999999999999
No 49
>PRK06944 sulfur carrier protein ThiS; Provisional
Probab=21.54 E-value=56 Score=24.52 Aligned_cols=9 Identities=33% Similarity=0.877 Sum_probs=8.2
Q ss_pred CCCCCeeEE
Q 015516 300 LHPGDHIEI 308 (405)
Q Consensus 300 LkPGDHiEI 308 (405)
|+.||.|||
T Consensus 50 L~~gD~vei 58 (65)
T PRK06944 50 LAAGDRLDL 58 (65)
T ss_pred CCCCCEEEE
Confidence 888999998
No 50
>PF13621 Cupin_8: Cupin-like domain; PDB: 3AL6_C 3AL5_C 2XUM_A 2Y0I_A 1MZE_A 3KCY_A 1MZF_A 1YCI_A 2ILM_A 1H2L_A ....
Probab=21.43 E-value=49 Score=29.54 Aligned_cols=21 Identities=48% Similarity=0.914 Sum_probs=15.3
Q ss_pred CCCCCCeeEEEeeccCCCCccceeeeecc
Q 015516 299 DLHPGDHIEIQWRRNKEFPYGWWYGVVGH 327 (405)
Q Consensus 299 dLkPGDHiEIQWRrnkefPYgWWYgvVGH 327 (405)
.|.|||-+=| |=|||+.|.-.
T Consensus 212 ~l~pGD~Lfi--------P~gWwH~V~~~ 232 (251)
T PF13621_consen 212 VLEPGDVLFI--------PPGWWHQVENL 232 (251)
T ss_dssp EEETT-EEEE---------TT-EEEEEES
T ss_pred EECCCeEEEE--------CCCCeEEEEEc
Confidence 6899999976 99999999743
No 51
>PF03658 Ub-RnfH: RnfH family Ubiquitin; InterPro: IPR005346 This is a small family of proteins of unknown function.; PDB: 2HJ1_B.
Probab=21.43 E-value=47 Score=28.05 Aligned_cols=10 Identities=50% Similarity=0.787 Sum_probs=6.1
Q ss_pred CCCCCCeeEE
Q 015516 299 DLHPGDHIEI 308 (405)
Q Consensus 299 dLkPGDHiEI 308 (405)
-|+.||.|||
T Consensus 61 ~L~~GDRVEI 70 (84)
T PF03658_consen 61 VLRDGDRVEI 70 (84)
T ss_dssp B--TT-EEEE
T ss_pred cCCCCCEEEE
Confidence 4889999998
No 52
>PRK05863 sulfur carrier protein ThiS; Provisional
Probab=20.92 E-value=58 Score=25.16 Aligned_cols=9 Identities=33% Similarity=0.752 Sum_probs=8.4
Q ss_pred CCCCCeeEE
Q 015516 300 LHPGDHIEI 308 (405)
Q Consensus 300 LkPGDHiEI 308 (405)
|+.||.|||
T Consensus 50 L~~gD~ieI 58 (65)
T PRK05863 50 LRDGARLEV 58 (65)
T ss_pred cCCCCEEEE
Confidence 899999998
No 53
>KOG0162 consensus Myosin class I heavy chain [Cytoskeleton]
Probab=20.90 E-value=1.2e+02 Score=35.30 Aligned_cols=27 Identities=44% Similarity=0.887 Sum_probs=22.1
Q ss_pred CCCCCCeeEEEeeccCCCCccceeeeecccc
Q 015516 299 DLHPGDHIEIQWRRNKEFPYGWWYGVVGHLE 329 (405)
Q Consensus 299 dLkPGDHiEIQWRrnkefPYgWWYgvVGHle 329 (405)
-++-||.|||- ++=|=|||-|-+|-.|
T Consensus 1069 s~~~~diIei~----~edpSGWw~gk~~~ke 1095 (1106)
T KOG0162|consen 1069 SFKKGDIIEIM----REDPSGWWLGKLNGKE 1095 (1106)
T ss_pred cccCCCEEEEe----ccCCCcchhhccCCcc
Confidence 36899999995 6779999999876544
No 54
>PRK05886 yajC preprotein translocase subunit YajC; Validated
Probab=20.57 E-value=69 Score=28.21 Aligned_cols=48 Identities=21% Similarity=0.387 Sum_probs=29.9
Q ss_pred ccCCCCCCCeeEEEeeccCCCCccceeeeeccccccCCCCCceecccCCeEEEEEeecCCCCcceeeeccc
Q 015516 296 CLNDLHPGDHIEIQWRRNKEFPYGWWYGVVGHLESCDGNENFCHCHNSETLILEFNQYTPGSRWRKTTINR 366 (405)
Q Consensus 296 cl~dLkPGDHiEIQWRrnkefPYgWWYgvVGHlesC~gn~~~C~Ch~sdtvvlEF~qY~~gSrWR~t~inR 366 (405)
=+++|+|||.|--- -|++|--.+-+. |+|+||... ..--+|.|..|.+
T Consensus 35 m~~~Lk~GD~VvT~------------gGi~G~V~~I~d----------~~v~leia~-gv~i~~~r~AI~~ 82 (109)
T PRK05886 35 LHESLQPGDRVHTT------------SGLQATIVGITD----------DTVDLEIAP-GVVTTWMKLAVRD 82 (109)
T ss_pred HHHhcCCCCEEEEC------------CCeEEEEEEEeC----------CEEEEEECC-CeEEEEEhhheee
Confidence 35789999998642 355555565543 489999752 1223455666664
No 55
>PF06945 DUF1289: Protein of unknown function (DUF1289); InterPro: IPR010710 This family consists of a number of hypothetical bacterial proteins. The aligned region spans around 56 residues and contains 4 highly conserved cysteine residues towards the N terminus. The function of this family is unknown.
Probab=20.54 E-value=60 Score=24.56 Aligned_cols=16 Identities=25% Similarity=0.372 Sum_probs=12.3
Q ss_pred cCCHHHHHHHhhcCCc
Q 015516 386 VNNEEEIKMWKRLWPV 401 (405)
Q Consensus 386 L~~~eEI~~Wk~lwp~ 401 (405)
.+|.|||..|+.+=|.
T Consensus 21 ~RT~dEI~~W~~~s~~ 36 (51)
T PF06945_consen 21 GRTLDEIRDWKSMSDD 36 (51)
T ss_pred CCcHHHHHHHhhCCHH
Confidence 4678999999987443
Done!