Query psy516
Match_columns 80
No_of_seqs 94 out of 96
Neff 4.2
Searched_HMMs 46136
Date Fri Aug 16 18:53:32 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy516.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/516hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF02937 COX6C: Cytochrome c o 100.0 2.3E-46 4.9E-51 238.4 7.6 73 7-79 1-73 (73)
2 cd00927 Cyt_c_Oxidase_VIc Cyto 100.0 9.2E-46 2E-50 235.0 8.5 70 9-78 1-70 (70)
3 PLN03155 cytochrome c oxidase 87.9 0.95 2.1E-05 28.5 3.5 48 10-61 6-53 (63)
4 PF04584 Pox_A28: Poxvirus A28 68.0 9.6 0.00021 27.3 3.8 17 55-71 25-41 (140)
5 PF12108 SF3a60_bindingd: Spli 60.9 4.7 0.0001 21.5 0.9 18 51-68 7-24 (28)
6 PF02699 YajC: Preprotein tran 60.2 20 0.00044 22.6 3.9 33 31-63 5-37 (82)
7 TIGR00756 PPR pentatricopeptid 59.6 5.9 0.00013 18.9 1.1 15 61-75 18-32 (35)
8 PF01535 PPR: PPR repeat; Int 56.6 6.2 0.00014 18.7 0.9 14 61-74 18-31 (31)
9 COG1422 Predicted membrane pro 56.2 41 0.00089 25.3 5.5 47 26-74 50-96 (201)
10 PF05026 DCP2: Dcp2, box A dom 56.1 4.6 0.0001 26.2 0.4 23 53-79 37-65 (85)
11 PF13812 PPR_3: Pentatricopept 54.2 7.8 0.00017 18.9 1.0 13 62-74 20-32 (34)
12 PF05545 FixQ: Cbb3-type cytoc 54.0 39 0.00084 19.1 4.6 11 45-55 29-40 (49)
13 PF05311 Baculo_PP31: Baculovi 47.8 12 0.00027 29.2 1.7 14 45-58 146-159 (271)
14 PF13041 PPR_2: PPR repeat fam 47.4 12 0.00025 20.4 1.1 15 61-75 21-35 (50)
15 PF06143 Baculo_11_kDa: Baculo 44.9 88 0.0019 20.5 5.4 14 48-61 69-82 (84)
16 PF12273 RCR: Chitin synthesis 44.6 31 0.00067 23.0 3.1 7 45-51 22-28 (130)
17 PTZ00056 glutathione peroxidas 41.7 15 0.00032 26.1 1.2 14 49-62 179-192 (199)
18 PF07006 DUF1310: Protein of u 41.1 15 0.00033 25.2 1.2 33 43-79 35-67 (122)
19 PRK13823 conjugal transfer pro 39.7 58 0.0013 21.5 3.7 24 34-57 54-84 (94)
20 PRK05886 yajC preprotein trans 39.5 70 0.0015 21.7 4.2 19 38-56 14-32 (109)
21 PF03672 UPF0154: Uncharacteri 39.3 76 0.0016 19.8 4.0 29 27-62 3-31 (64)
22 PF12615 TraD_N: F sex factor 38.9 64 0.0014 20.8 3.8 33 8-41 66-98 (101)
23 PF15061 DUF4538: Domain of un 38.6 47 0.001 20.5 2.9 45 23-76 5-49 (58)
24 PF06305 DUF1049: Protein of u 38.3 81 0.0018 18.2 4.4 27 27-53 24-50 (68)
25 PF01102 Glycophorin_A: Glycop 38.0 55 0.0012 22.6 3.5 15 26-40 68-82 (122)
26 COG4499 Predicted membrane pro 37.6 91 0.002 26.0 5.2 32 34-65 233-279 (434)
27 cd03394 PAP2_like_5 PAP2_like_ 37.1 64 0.0014 20.3 3.6 29 23-51 4-32 (106)
28 COG5228 POP2 mRNA deadenylase 35.5 25 0.00055 27.6 1.7 27 37-63 170-204 (299)
29 PF06916 DUF1279: Protein of u 35.2 97 0.0021 19.6 4.2 33 15-47 3-35 (91)
30 PF07423 DUF1510: Protein of u 35.2 36 0.00079 25.5 2.5 31 25-55 16-47 (217)
31 COG1766 fliF Flagellar basal b 33.7 32 0.00069 29.1 2.1 22 43-64 40-61 (545)
32 PF06295 DUF1043: Protein of u 32.8 63 0.0014 21.8 3.1 21 27-47 2-22 (128)
33 PF14584 DUF4446: Protein of u 32.8 1.6E+02 0.0035 20.7 5.3 19 45-63 24-42 (151)
34 COG3944 Capsular polysaccharid 32.7 1.1E+02 0.0024 23.5 4.7 21 23-43 172-192 (226)
35 PF08311 Mad3_BUB1_I: Mad3/BUB 32.1 12 0.00026 25.0 -0.5 26 52-79 74-99 (126)
36 PHA03165 hypothetical protein; 31.2 75 0.0016 19.3 2.9 11 53-63 36-46 (57)
37 PRK07021 fliL flagellar basal 31.1 88 0.0019 21.7 3.7 13 33-45 29-41 (162)
38 COG1862 YajC Preprotein transl 30.1 81 0.0018 20.9 3.2 33 30-62 11-43 (97)
39 cd03393 PAP2_like_3 PAP2_like_ 30.0 1.1E+02 0.0023 19.8 3.8 32 22-53 13-44 (125)
40 COG5346 Predicted membrane pro 29.6 1.5E+02 0.0033 21.1 4.7 33 11-43 77-109 (136)
41 PF14316 DUF4381: Domain of un 29.5 1E+02 0.0022 20.8 3.7 16 39-54 36-51 (146)
42 PF07465 PsaM: Photosystem I p 29.4 82 0.0018 16.9 2.6 15 26-40 5-19 (29)
43 PF09813 Coiled-coil_56: Coile 29.1 1.1E+02 0.0024 20.8 3.8 27 21-47 46-72 (100)
44 PF08019 DUF1705: Domain of un 29.1 1.9E+02 0.004 19.7 5.0 15 49-63 118-132 (156)
45 PF12301 CD99L2: CD99 antigen 29.1 78 0.0017 22.9 3.2 25 23-47 116-140 (169)
46 COG5102 SFT2 Membrane protein 28.9 58 0.0013 24.5 2.6 23 28-50 76-98 (201)
47 TIGR00739 yajC preprotein tran 28.2 1.1E+02 0.0024 19.4 3.5 20 36-55 11-30 (84)
48 PF12783 Sec7_N: Guanine nucle 27.7 28 0.00061 23.6 0.7 12 52-63 150-161 (168)
49 PF08066 PMC2NT: PMC2NT (NUC01 27.5 25 0.00054 22.3 0.4 10 54-63 16-25 (91)
50 PF14004 DUF4227: Protein of u 27.0 1.3E+02 0.0029 19.0 3.7 29 30-63 9-37 (71)
51 PTZ00382 Variant-specific surf 26.9 17 0.00036 23.8 -0.5 14 33-46 79-92 (96)
52 PF01578 Cytochrom_C_asm: Cyto 26.8 1.4E+02 0.0031 20.8 4.2 42 30-71 85-126 (214)
53 KOG4671|consensus 26.5 55 0.0012 24.7 2.1 28 30-62 167-194 (201)
54 PF06679 DUF1180: Protein of u 26.2 79 0.0017 22.8 2.9 28 23-50 94-121 (163)
55 PRK05585 yajC preprotein trans 25.8 1.6E+02 0.0034 19.6 4.0 19 37-55 27-45 (106)
56 TIGR03053 PS_I_psaM photosyste 25.6 1E+02 0.0022 16.5 2.5 15 26-40 5-19 (29)
57 PF10883 DUF2681: Protein of u 25.1 2E+02 0.0044 18.8 6.2 23 27-49 9-31 (87)
58 TIGR03518 ABC_perm_GldF glidin 25.1 2.7E+02 0.0059 20.2 6.7 58 17-75 154-214 (240)
59 COG3621 Patatin [General funct 24.5 1.2E+02 0.0027 25.0 3.9 52 9-63 256-312 (394)
60 CHL00190 psaM photosystem I su 24.5 1.1E+02 0.0023 16.6 2.5 14 26-39 6-19 (30)
61 PF01956 DUF106: Integral memb 23.9 2.4E+02 0.0052 19.3 4.8 18 28-45 21-38 (168)
62 COG2156 KdpC K+-transporting A 23.8 1.5E+02 0.0033 22.1 4.1 26 21-46 3-28 (190)
63 PF05084 GRA6: Granule antigen 23.2 1.4E+02 0.0029 22.6 3.6 25 21-45 146-171 (215)
64 PRK07193 fliF flagellar MS-rin 23.2 82 0.0018 26.5 2.8 17 49-65 49-65 (552)
65 PF02532 PsbI: Photosystem II 23.2 1.6E+02 0.0034 16.7 3.2 24 28-51 8-31 (36)
66 KOG4606|consensus 23.0 76 0.0017 22.3 2.2 28 24-52 31-58 (126)
67 COG1959 Predicted transcriptio 22.9 57 0.0012 22.5 1.6 17 63-79 43-59 (150)
68 PF06522 B12D: NADH-ubiquinone 22.7 1.2E+02 0.0027 18.6 2.9 21 28-48 12-32 (73)
69 PF12669 P12: Virus attachment 22.7 1.3E+02 0.0029 17.9 3.0 7 27-33 2-8 (58)
70 PF15176 LRR19-TM: Leucine-ric 22.6 2.1E+02 0.0046 19.5 4.2 41 6-46 3-43 (102)
71 PF08475 Baculo_VP91_N: Viral 22.4 1.4E+02 0.003 22.1 3.6 35 27-61 2-43 (183)
72 cd04742 NPD_FabD 2-Nitropropan 22.2 63 0.0014 26.4 1.9 24 47-70 300-323 (418)
73 PF13219 DUF4027: Protein of u 21.9 1E+02 0.0022 17.5 2.1 28 17-44 6-34 (36)
74 PF03597 CcoS: Cytochrome oxid 21.7 1.7E+02 0.0036 16.8 3.1 19 27-45 7-25 (45)
75 cd03388 PAP2_SPPase1 PAP2_like 21.5 1.9E+02 0.004 19.4 3.8 27 25-51 36-62 (151)
76 TIGR03356 BGL beta-galactosida 21.4 48 0.001 26.4 1.1 20 53-74 49-68 (427)
77 PF14283 DUF4366: Domain of un 21.3 9 0.00019 28.6 -2.9 29 26-54 163-191 (218)
78 PF04719 TAFII28: hTAFII28-lik 21.1 1E+02 0.0022 20.1 2.4 38 8-47 22-60 (90)
79 PF14086 DUF4266: Domain of un 21.0 75 0.0016 18.9 1.6 21 8-28 7-30 (50)
80 PF12411 Choline_sulf_C: Choli 21.0 71 0.0015 19.3 1.5 50 18-71 4-53 (54)
81 PTZ00045 apical membrane antig 20.6 95 0.0021 26.9 2.7 28 44-71 539-570 (595)
82 PRK11878 psaM photosystem I re 20.6 1.4E+02 0.003 16.6 2.5 14 26-39 9-22 (34)
83 PRK09966 putative inner membra 20.2 3.8E+02 0.0083 20.0 5.7 25 11-35 3-29 (407)
No 1
>PF02937 COX6C: Cytochrome c oxidase subunit VIc; InterPro: IPR004204 Cytochrome c oxidase, a 13 subunit complex, 1.9.3.1 from EC is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc.; GO: 0004129 cytochrome-c oxidase activity; PDB: 3AG4_I 2DYS_V 3ASO_I 2EIK_V 2EIM_I 1OCC_V 1V54_V 1OCO_V 3ASN_V 2EIL_I ....
Probab=100.00 E-value=2.3e-46 Score=238.43 Aligned_cols=73 Identities=44% Similarity=0.752 Sum_probs=58.0
Q ss_pred cCCCCCchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhcCCchHHHHHHHhcCcccccc
Q psy516 7 TGKLPKPQLRNLLQSSIKVALVQGAVIAFGTAVAWKVLIMDPHKKAISEFYKTYDGEKDFERMKKAGLFEEYT 79 (80)
Q Consensus 7 ~~~l~KPqmRgLl~~~lr~~i~~a~~lsl~~~~~~kf~v~~PRKkaYadFYknYD~~k~Fe~Mk~aG~fqS~~ 79 (80)
++.|+||||||||++|+||||++|+++|++++++|||++++|||++|+|||||||||||||||+++|+|||||
T Consensus 1 ~~~l~KP~mrgll~~~l~~~i~~a~~ls~~~~~~~kf~v~~pRKk~YadFYknYD~~k~fe~M~~~G~fqS~~ 73 (73)
T PF02937_consen 1 PSALAKPQMRGLLAKRLKRHIVVAFVLSLGVAAAYKFGVAEPRKKAYADFYKNYDPMKDFEEMRKAGIFQSVK 73 (73)
T ss_dssp ---------BSHHHHHHHHHHHHHHHHHHHHHHHHHHHTHHHHHHHHHHHHHT--HHHHHHHHHHTT--SS--
T ss_pred CCcccCchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHccChHHHHHHHHhcCCcccCC
Confidence 3679999999999999999999999999999999999999999999999999999999999999999999997
No 2
>cd00927 Cyt_c_Oxidase_VIc Cytochrome c oxidase subunit VIc. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. The VIc subunit is found only in eukaryotes and its specific function remains unclear. It has been reported that the relative concentrations of some nuclear encoded CcO subunits, including subunit VIc, compared to those of the mitochondrial encoded subunits, are altered significantly during the progression of prostate cancer.
Probab=100.00 E-value=9.2e-46 Score=235.04 Aligned_cols=70 Identities=54% Similarity=0.886 Sum_probs=68.9
Q ss_pred CCCCchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhcCCchHHHHHHHhcCccccc
Q psy516 9 KLPKPQLRNLLQSSIKVALVQGAVIAFGTAVAWKVLIMDPHKKAISEFYKTYDGEKDFERMKKAGLFEEY 78 (80)
Q Consensus 9 ~l~KPqmRgLl~~~lr~~i~~a~~lsl~~~~~~kf~v~~PRKkaYadFYknYD~~k~Fe~Mk~aG~fqS~ 78 (80)
.|+||||||||++++||||++||++|++++++||++++||||++|+||||||||||||||||++|+||||
T Consensus 1 ~l~kp~mrgLl~~~l~~~l~~a~~lsl~~~~~~k~~~~~pRK~aYadFYknYD~~kdFerM~~~G~f~S~ 70 (70)
T cd00927 1 ALAKPQMRGLLASRIKRHLIVAFVLSLGAAAAYKFLVNEPRKKAYADFYKTYDAMKDFERMRKAGLFQSC 70 (70)
T ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHccChHHHHHHHHHcCCcccC
Confidence 3899999999999999999999999999999999999999999999999999999999999999999998
No 3
>PLN03155 cytochrome c oxidase subunit 5C; Provisional
Probab=87.94 E-value=0.95 Score=28.52 Aligned_cols=48 Identities=17% Similarity=0.389 Sum_probs=34.3
Q ss_pred CCCchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhcCC
Q psy516 10 LPKPQLRNLLQSSIKVALVQGAVIAFGTAVAWKVLIMDPHKKAISEFYKTYD 61 (80)
Q Consensus 10 l~KPqmRgLl~~~lr~~i~~a~~lsl~~~~~~kf~v~~PRKkaYadFYknYD 61 (80)
.+..+++|- .+-+-|+++++|.++++.+||.-+-+-|+ +-.+||.-.|
T Consensus 6 vah~~~~gP---svvKEI~iG~~LGL~AG~~WKmhHWn~qr-kt~~fY~~Le 53 (63)
T PLN03155 6 IAHATLKGP---SVVKELCIGLTLGLAAGGLWKMHHWNEQR-KTRSFYDLLE 53 (63)
T ss_pred eeeeeccCC---chhhhHHHHhHHHHhhhhHHHHhhhhhHH-HHHHHHHHHh
Confidence 444444442 45567899999999999999998877554 4578886543
No 4
>PF04584 Pox_A28: Poxvirus A28 family; InterPro: IPR007664 The poxvirus A28 protein is expressed at late times during the virus replication cycle and is a membrane component of the intracellular mature virion. Repression of A28 inhibits cell-to-cell spread, suggesting that all poxviruses use a common A28-dependent mechanism of cell penetration []. An N-terminal hydrophobic sequence, present in all poxvirus A28 orthologues, anchors the protein in the virion surface membrane so that most of it is exposed to the cytoplasm [].; GO: 0016032 viral reproduction, 0019031 viral envelope
Probab=68.00 E-value=9.6 Score=27.30 Aligned_cols=17 Identities=29% Similarity=0.618 Sum_probs=14.6
Q ss_pred HHHhcCCchHHHHHHHh
Q psy516 55 EFYKTYDGEKDFERMKK 71 (80)
Q Consensus 55 dFYknYD~~k~Fe~Mk~ 71 (80)
.+|.|||--++|+.+-.
T Consensus 25 ~iYeNYdnI~EFN~~~~ 41 (140)
T PF04584_consen 25 YIYENYDNIKEFNDAHS 41 (140)
T ss_pred HHHHhhhHHHHHHhhcc
Confidence 58999999999998753
No 5
>PF12108 SF3a60_bindingd: Splicing factor SF3a60 binding domain; InterPro: IPR021966 This domain is found in eukaryotes. This domain is about 30 amino acids in length. This domain has a single completely conserved residue Y that may be functionally important. SF3a60 makes up the SF3a complex with SF3a66 and SF3a120. This domain is the binding site of SF3a60 for SF3a120. The SF3a complex is part of the spliceosome, a protein complex involved in splicing mRNA after transcription. ; PDB: 2DT7_A.
Probab=60.91 E-value=4.7 Score=21.55 Aligned_cols=18 Identities=33% Similarity=0.676 Sum_probs=13.7
Q ss_pred HHHHHHHhcCCchHHHHH
Q psy516 51 KAISEFYKTYDGEKDFER 68 (80)
Q Consensus 51 kaYadFYknYD~~k~Fe~ 68 (80)
.-++|||+.....|||.+
T Consensus 7 d~f~eFY~rlk~Ike~Hr 24 (28)
T PF12108_consen 7 DPFSEFYERLKEIKEYHR 24 (28)
T ss_dssp -HHHHHHHHHHHHHHHHH
T ss_pred ChHHHHHHHHHHHHHHHH
Confidence 347889999888888864
No 6
>PF02699 YajC: Preprotein translocase subunit; InterPro: IPR003849 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component []. From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins []. The translocase protein subunits are encoded on the bacterial chromosome. The translocase itself comprises 7 proteins, including a chaperone (SecB), ATPase (SecA), an integral membrane complex (SecY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) []. Other cytoplasmic/periplasmic proteins play a part in preprotein translocase activity, namely YidC and YajC []. The latter is bound in a complex to SecD and SecF, and plays a part in stabilising and regulating secretion through the SecYEG integral membrane component via SecA []. Homologues of the YajC gene have been found in a range of pathogenic and commensal microbes. Brucella abortis YajC- and SecD-like proteins were shown to stimulate a Th1 cell-mediated immune response in mice, and conferred protection when challenged with B.abortis []. Therefore, these proteins may have an antigenic role as well as a secretory one in virulent bacteria []. A number of previously uncharacterised "hypothetical" proteins also show similarity to E.coli YajC, suggesting that this family is wider than first thought []. More recently, the precise interactions between the E.coli SecYEG complex, SecD, SecF, YajC and YidC have been studied []. Rather than acting individually, the four proteins form a heterotetrameric complex and associate with the SecYEG heterotrimeric complex []. The SecF and YajC subunits link the complex to the integral membrane translocase. ; PDB: 2RDD_B.
Probab=60.15 E-value=20 Score=22.57 Aligned_cols=33 Identities=18% Similarity=0.170 Sum_probs=21.4
Q ss_pred HHHHHHHHHHHHHHHhhhhhHHHHHHHhcCCch
Q psy516 31 AVIAFGTAVAWKVLIMDPHKKAISEFYKTYDGE 63 (80)
Q Consensus 31 ~~lsl~~~~~~kf~v~~PRKkaYadFYknYD~~ 63 (80)
++.-++..+.++|+...|+||+-++.=.-.|+.
T Consensus 5 li~lv~~~~i~yf~~~rpqkk~~k~~~~m~~~L 37 (82)
T PF02699_consen 5 LIPLVIIFVIFYFLMIRPQKKQQKEHQEMLASL 37 (82)
T ss_dssp HHHHHHHHHHHHHHTHHHHHHHHHHHTTGGG--
T ss_pred HHHHHHHHHHHhhheecHHHHHHHHHHHHHHcC
Confidence 444445666777888889998887665544444
No 7
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=59.56 E-value=5.9 Score=18.93 Aligned_cols=15 Identities=20% Similarity=0.324 Sum_probs=12.4
Q ss_pred CchHHHHHHHhcCcc
Q psy516 61 DGEKDFERMKKAGLF 75 (80)
Q Consensus 61 D~~k~Fe~Mk~aG~f 75 (80)
++.+-|++|++.|+-
T Consensus 18 ~a~~~~~~M~~~g~~ 32 (35)
T TIGR00756 18 EALELFKEMLERGIE 32 (35)
T ss_pred HHHHHHHHHHHcCCC
Confidence 577889999999874
No 8
>PF01535 PPR: PPR repeat; InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) []. The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=56.60 E-value=6.2 Score=18.74 Aligned_cols=14 Identities=21% Similarity=0.615 Sum_probs=10.9
Q ss_pred CchHHHHHHHhcCc
Q psy516 61 DGEKDFERMKKAGL 74 (80)
Q Consensus 61 D~~k~Fe~Mk~aG~ 74 (80)
++.+-|++|++.|+
T Consensus 18 ~a~~~~~~M~~~g~ 31 (31)
T PF01535_consen 18 EALEVFDEMRERGI 31 (31)
T ss_pred HHHHHHHHHhHCcC
Confidence 46677899998885
No 9
>COG1422 Predicted membrane protein [Function unknown]
Probab=56.21 E-value=41 Score=25.31 Aligned_cols=47 Identities=21% Similarity=0.391 Sum_probs=35.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhcCCchHHHHHHHhcCc
Q psy516 26 ALVQGAVIAFGTAVAWKVLIMDPHKKAISEFYKTYDGEKDFERMKKAGL 74 (80)
Q Consensus 26 ~i~~a~~lsl~~~~~~kf~v~~PRKkaYadFYknYD~~k~Fe~Mk~aG~ 74 (80)
-++.|.++++....+.|+++-.-|=+++.+=.+-+ .|+|+++++.|-
T Consensus 50 ilV~avi~gl~~~i~~~~liD~ekm~~~qk~m~ef--q~e~~eA~~~~d 96 (201)
T COG1422 50 ILVAAVITGLYITILQKLLIDQEKMKELQKMMKEF--QKEFREAQESGD 96 (201)
T ss_pred HHHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHH--HHHHHHHHHhCC
Confidence 35778888999999999999888877777665543 455677776653
No 10
>PF05026 DCP2: Dcp2, box A domain; InterPro: IPR007722 This presumed domain is always found to the N-terminal side of the NUDIX hydrolase domain IPR000086 from INTERPRO. This domain appears to be specific to mRNA decapping protein 2 P53550 from SWISSPROT and its close homologues. This region has been termed Box A [].; GO: 0003723 RNA binding, 0016787 hydrolase activity, 0030145 manganese ion binding; PDB: 2QKM_H 2A6T_B 2QKL_B.
Probab=56.13 E-value=4.6 Score=26.21 Aligned_cols=23 Identities=22% Similarity=0.606 Sum_probs=16.1
Q ss_pred HHHHHhcCCc------hHHHHHHHhcCcccccc
Q psy516 53 ISEFYKTYDG------EKDFERMKKAGLFEEYT 79 (80)
Q Consensus 53 YadFYknYD~------~k~Fe~Mk~aG~fqS~~ 79 (80)
|.|||+.-|| +++|-.+ +|++||
T Consensus 37 Y~Df~~~~~p~Lp~~~lk~F~~~----if~~cp 65 (85)
T PF05026_consen 37 YEDFYREQNPSLPSMSLKEFAKQ----IFQHCP 65 (85)
T ss_dssp HHHTTTTTTTTS----HHHHHHH----HHTT-G
T ss_pred HHHHhhhhcCCCCCCCHHHHHHH----HHHHCH
Confidence 8999987776 6667644 677776
No 11
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=54.18 E-value=7.8 Score=18.86 Aligned_cols=13 Identities=31% Similarity=0.570 Sum_probs=11.0
Q ss_pred chHHHHHHHhcCc
Q psy516 62 GEKDFERMKKAGL 74 (80)
Q Consensus 62 ~~k~Fe~Mk~aG~ 74 (80)
+..-|++|++.|+
T Consensus 20 a~~~~~~M~~~gv 32 (34)
T PF13812_consen 20 ALQLFDEMKEQGV 32 (34)
T ss_pred HHHHHHHHHHhCC
Confidence 6778999999986
No 12
>PF05545 FixQ: Cbb3-type cytochrome oxidase component FixQ; InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=53.98 E-value=39 Score=19.08 Aligned_cols=11 Identities=36% Similarity=0.531 Sum_probs=5.9
Q ss_pred Hhhhh-hHHHHH
Q psy516 45 IMDPH-KKAISE 55 (80)
Q Consensus 45 v~~PR-KkaYad 55 (80)
+..|| |+.|+|
T Consensus 29 ~~~~~~k~~~e~ 40 (49)
T PF05545_consen 29 AYRPRNKKRFEE 40 (49)
T ss_pred HHcccchhhHHH
Confidence 44566 555554
No 13
>PF05311 Baculo_PP31: Baculovirus 33KDa late protein (PP31); InterPro: IPR007975 Autographa californica nuclear polyhedrosis virus (AcMNPV) p31 is a nuclear phosphoprotein that accumulates in the virogenic stroma, which is the viral replication centre in the infected-cell nucleus. The protein binds to DNA, and serves as a late expression factor [].
Probab=47.77 E-value=12 Score=29.17 Aligned_cols=14 Identities=29% Similarity=0.527 Sum_probs=12.7
Q ss_pred HhhhhhHHHHHHHh
Q psy516 45 IMDPHKKAISEFYK 58 (80)
Q Consensus 45 v~~PRKkaYadFYk 58 (80)
.++.|.+.|.|||+
T Consensus 146 sn~kR~klY~EFy~ 159 (271)
T PF05311_consen 146 SNEKRSKLYNEFYR 159 (271)
T ss_pred HHHHHHHHHHHHHH
Confidence 47889999999998
No 14
>PF13041 PPR_2: PPR repeat family
Probab=47.38 E-value=12 Score=20.41 Aligned_cols=15 Identities=40% Similarity=0.543 Sum_probs=12.1
Q ss_pred CchHHHHHHHhcCcc
Q psy516 61 DGEKDFERMKKAGLF 75 (80)
Q Consensus 61 D~~k~Fe~Mk~aG~f 75 (80)
++.+-|++|++.|+-
T Consensus 21 ~a~~l~~~M~~~g~~ 35 (50)
T PF13041_consen 21 EALKLFKEMKKRGIK 35 (50)
T ss_pred HHHHHHHHHHHcCCC
Confidence 467789999999973
No 15
>PF06143 Baculo_11_kDa: Baculovirus 11 kDa family; InterPro: IPR009313 This is a family of uncharacterised Baculovirus proteins that are all about 11 kDa in size.
Probab=44.95 E-value=88 Score=20.52 Aligned_cols=14 Identities=7% Similarity=0.233 Sum_probs=10.8
Q ss_pred hhhHHHHHHHhcCC
Q psy516 48 PHKKAISEFYKTYD 61 (80)
Q Consensus 48 PRKkaYadFYknYD 61 (80)
+|-+.+..|-+|||
T Consensus 69 ~~~~~~~~~l~Nld 82 (84)
T PF06143_consen 69 ERQQREKTYLANLD 82 (84)
T ss_pred HHHHHHHHHHHhcC
Confidence 56666778888988
No 16
>PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 [].
Probab=44.65 E-value=31 Score=23.00 Aligned_cols=7 Identities=0% Similarity=0.022 Sum_probs=3.0
Q ss_pred HhhhhhH
Q psy516 45 IMDPHKK 51 (80)
Q Consensus 45 v~~PRKk 51 (80)
++..|++
T Consensus 22 ~~rRR~r 28 (130)
T PF12273_consen 22 HNRRRRR 28 (130)
T ss_pred HHHHHhh
Confidence 4444443
No 17
>PTZ00056 glutathione peroxidase; Provisional
Probab=41.73 E-value=15 Score=26.14 Aligned_cols=14 Identities=36% Similarity=0.598 Sum_probs=12.3
Q ss_pred hhHHHHHHHhcCCc
Q psy516 49 HKKAISEFYKTYDG 62 (80)
Q Consensus 49 RKkaYadFYknYD~ 62 (80)
.++.|.|-|||||-
T Consensus 179 ~~~~~~~~~~~~~~ 192 (199)
T PTZ00056 179 GVKDYQELFKNYDK 192 (199)
T ss_pred HHHHHHHHHHhhhh
Confidence 67789999999995
No 18
>PF07006 DUF1310: Protein of unknown function (DUF1310); InterPro: IPR010738 This family consists of several hypothetical proteins of around 125 residues in length. Members of this family seem to be specific to Listeria and Streptococcus species. The function of this family is unknown.
Probab=41.14 E-value=15 Score=25.19 Aligned_cols=33 Identities=21% Similarity=0.361 Sum_probs=25.4
Q ss_pred HHHhhhhhHHHHHHHhcCCchHHHHHHHhcCcccccc
Q psy516 43 VLIMDPHKKAISEFYKTYDGEKDFERMKKAGLFEEYT 79 (80)
Q Consensus 43 f~v~~PRKkaYadFYknYD~~k~Fe~Mk~aG~fqS~~ 79 (80)
...-+-=|+.|++..|+-||. +|. +.|+.||+.
T Consensus 35 Iv~S~E~k~~~E~~Lk~lD~~-AlT---~~G~IksY~ 67 (122)
T PF07006_consen 35 IVKSEEAKKVIEKDLKNLDPK-ALT---EEGKIKSYE 67 (122)
T ss_pred HhcCHHHHHHHHHHHHHhChh-hhc---cCCeeEEEE
Confidence 444555688999999999993 444 799999874
No 19
>PRK13823 conjugal transfer protein TrbD; Provisional
Probab=39.68 E-value=58 Score=21.49 Aligned_cols=24 Identities=13% Similarity=0.135 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHhhh-------hhHHHHHHH
Q psy516 34 AFGTAVAWKVLIMDP-------HKKAISEFY 57 (80)
Q Consensus 34 sl~~~~~~kf~v~~P-------RKkaYadFY 57 (80)
.++..++-+..-.|| |+.+|.+||
T Consensus 54 ~v~h~~l~~mAK~DP~~~~V~~Rh~ry~~~Y 84 (94)
T PRK13823 54 FGALFALRLMAKADPKMRHVYLRHRRYKPYY 84 (94)
T ss_pred HHHHHHHHHHHhcChHHHHHHHHHhhccccC
Confidence 333444444455555 677777777
No 20
>PRK05886 yajC preprotein translocase subunit YajC; Validated
Probab=39.50 E-value=70 Score=21.67 Aligned_cols=19 Identities=11% Similarity=0.111 Sum_probs=12.8
Q ss_pred HHHHHHHHhhhhhHHHHHH
Q psy516 38 AVAWKVLIMDPHKKAISEF 56 (80)
Q Consensus 38 ~~~~kf~v~~PRKkaYadF 56 (80)
-+.++|+.-.|+||+-++.
T Consensus 14 ~~i~yF~~iRPQkKr~K~~ 32 (109)
T PRK05886 14 MGGFMYFASRRQRKAMQAT 32 (109)
T ss_pred HHHHHHHHccHHHHHHHHH
Confidence 3455677788888876543
No 21
>PF03672 UPF0154: Uncharacterised protein family (UPF0154); InterPro: IPR005359 The proteins in this entry are functionally uncharacterised.
Probab=39.25 E-value=76 Score=19.83 Aligned_cols=29 Identities=7% Similarity=0.052 Sum_probs=14.9
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhcCCc
Q psy516 27 LVQGAVIAFGTAVAWKVLIMDPHKKAISEFYKTYDG 62 (80)
Q Consensus 27 i~~a~~lsl~~~~~~kf~v~~PRKkaYadFYknYD~ 62 (80)
+++++++++++++++ + ++-++.+.+..+|
T Consensus 3 iilali~G~~~Gff~----a---r~~~~k~l~~NPp 31 (64)
T PF03672_consen 3 IILALIVGAVIGFFI----A---RKYMEKQLKENPP 31 (64)
T ss_pred HHHHHHHHHHHHHHH----H---HHHHHHHHHHCCC
Confidence 445555554444332 2 2445666676676
No 22
>PF12615 TraD_N: F sex factor protein N terminal; InterPro: IPR022585 This domain of unknown function is found in bacteria, and is typically between 96 and 107 amino acids in length. It is probably periplasmic and is found N-terminal to the DNA binding domain PF10412 from PFAM. TraD is a hexameric ring ATPase that is part of the bacterial F sex factor complex and involved in bacterial conjugation [], [], [].
Probab=38.86 E-value=64 Score=20.76 Aligned_cols=33 Identities=21% Similarity=0.074 Sum_probs=24.1
Q ss_pred CCCCCchhhhHHHHHHHHHHHHHHHHHHHHHHHH
Q psy516 8 GKLPKPQLRNLLQSSIKVALVQGAVIAFGTAVAW 41 (80)
Q Consensus 8 ~~l~KPqmRgLl~~~lr~~i~~a~~lsl~~~~~~ 41 (80)
+-|.+|.|..+. .++.+.+..++.++++++++.
T Consensus 66 q~L~~~~~~~~~-~~~~~~l~~~al~a~~i~~~~ 98 (101)
T PF12615_consen 66 QQLHDPYMVDCG-DQFWQELQISALIAGVISFAV 98 (101)
T ss_pred HHhhCHHHHHHH-HHHHHHHHHHHHHHHHHHHHh
Confidence 347888888776 777777777777777766654
No 23
>PF15061 DUF4538: Domain of unknown function (DUF4538)
Probab=38.61 E-value=47 Score=20.50 Aligned_cols=45 Identities=18% Similarity=0.313 Sum_probs=29.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhcCCchHHHHHHHhcCccc
Q psy516 23 IKVALVQGAVIAFGTAVAWKVLIMDPHKKAISEFYKTYDGEKDFERMKKAGLFE 76 (80)
Q Consensus 23 lr~~i~~a~~lsl~~~~~~kf~v~~PRKkaYadFYknYD~~k~Fe~Mk~aG~fq 76 (80)
.|..++++..++++.+++| =.+..| +=|-|.+|..++..++|+-|
T Consensus 5 ~r~~~~~ggfVg~iG~a~Y-pi~~~P--------mm~~eeYk~~Q~~nR~gI~q 49 (58)
T PF15061_consen 5 WRYALFVGGFVGLIGAALY-PIYFRP--------MMNPEEYKKEQKINRAGIKQ 49 (58)
T ss_pred ccchhhHHHHHHHHHHHHh-hhhccc--------ccChHHHHHHHHHHHhcccH
Confidence 4566778888888666666 444444 23446666778888888765
No 24
>PF06305 DUF1049: Protein of unknown function (DUF1049); InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=38.32 E-value=81 Score=18.21 Aligned_cols=27 Identities=4% Similarity=-0.095 Sum_probs=13.1
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhHHH
Q psy516 27 LVQGAVIAFGTAVAWKVLIMDPHKKAI 53 (80)
Q Consensus 27 i~~a~~lsl~~~~~~kf~v~~PRKkaY 53 (80)
|.++++++++++.++-.......++..
T Consensus 24 il~~f~~G~llg~l~~~~~~~~~r~~~ 50 (68)
T PF06305_consen 24 ILIAFLLGALLGWLLSLPSRLRLRRRI 50 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445555555555555444444334333
No 25
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=37.97 E-value=55 Score=22.57 Aligned_cols=15 Identities=7% Similarity=0.166 Sum_probs=6.3
Q ss_pred HHHHHHHHHHHHHHH
Q psy516 26 ALVQGAVIAFGTAVA 40 (80)
Q Consensus 26 ~i~~a~~lsl~~~~~ 40 (80)
-|+++.+..++..++
T Consensus 68 ~Ii~gv~aGvIg~Il 82 (122)
T PF01102_consen 68 GIIFGVMAGVIGIIL 82 (122)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred ehhHHHHHHHHHHHH
Confidence 344444444433333
No 26
>COG4499 Predicted membrane protein [Function unknown]
Probab=37.61 E-value=91 Score=26.04 Aligned_cols=32 Identities=25% Similarity=0.319 Sum_probs=22.0
Q ss_pred HHHHHHHHHHHHhhhhhHH----HHHHHh-----------cCCchHH
Q psy516 34 AFGTAVAWKVLIMDPHKKA----ISEFYK-----------TYDGEKD 65 (80)
Q Consensus 34 sl~~~~~~kf~v~~PRKka----YadFYk-----------nYD~~k~ 65 (80)
-+++.+.|..+.+.||.++ +++|-+ ||||.|.
T Consensus 233 l~li~~~Y~~f~~~p~qeai~~a~~aFL~~nY~qVittLe~ydp~kl 279 (434)
T COG4499 233 LLLIYFTYYYFSNQPKQEAIITANTAFLKNNYDQVITTLENYDPEKL 279 (434)
T ss_pred HHHHHHHHHHHHcChhHHHHHHHHHHHHhccHHHHhhhcccCChhhC
Confidence 3446667778889999986 455643 7888653
No 27
>cd03394 PAP2_like_5 PAP2_like_5 proteins. PAP2 is a super-family of phosphatases and haloperoxidases. This subgroup, which is specific to bacteria, lacks functional characterization and may act as a membrane-associated lipid phosphatase.
Probab=37.08 E-value=64 Score=20.30 Aligned_cols=29 Identities=10% Similarity=0.066 Sum_probs=22.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhH
Q psy516 23 IKVALVQGAVIAFGTAVAWKVLIMDPHKK 51 (80)
Q Consensus 23 lr~~i~~a~~lsl~~~~~~kf~v~~PRKk 51 (80)
.-..+..+++++.++....|..+..||=.
T Consensus 4 ~~~~~~~~~~~~~~~~~~lK~~~~r~RP~ 32 (106)
T cd03394 4 GLLILAEAAALTAAVTEGLKFAVGRARPD 32 (106)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHCCCCCC
Confidence 34566777888888888899999888863
No 28
>COG5228 POP2 mRNA deadenylase subunit [RNA processing and modification]
Probab=35.47 E-value=25 Score=27.62 Aligned_cols=27 Identities=30% Similarity=0.385 Sum_probs=23.9
Q ss_pred HHHHHHHHHhhhhhHHHHHHHh--------cCCch
Q psy516 37 TAVAWKVLIMDPHKKAISEFYK--------TYDGE 63 (80)
Q Consensus 37 ~~~~~kf~v~~PRKkaYadFYk--------nYD~~ 63 (80)
.+.+.|.+.++|=-+.|+|||+ +||..
T Consensus 170 fgyLikilt~~plP~~~EdFy~~l~~yfP~fYDik 204 (299)
T COG5228 170 FGYLIKILTNDPLPNNKEDFYWWLHQYFPNFYDIK 204 (299)
T ss_pred HHHHHHHHhcCCCCccHHHHHHHHHHHCccccchH
Confidence 4678899999999999999998 88875
No 29
>PF06916 DUF1279: Protein of unknown function (DUF1279); InterPro: IPR009688 This entry represents the C terminus (approx. 120 residues) of a number of eukaryotic proteins of unknown function.
Probab=35.23 E-value=97 Score=19.59 Aligned_cols=33 Identities=18% Similarity=0.075 Sum_probs=27.8
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy516 15 LRNLLQSSIKVALVQGAVIAFGTAVAWKVLIMD 47 (80)
Q Consensus 15 mRgLl~~~lr~~i~~a~~lsl~~~~~~kf~v~~ 47 (80)
++.++.++.+--+++-+++|++.-.++++++..
T Consensus 3 ~K~l~k~YG~~~l~vy~~~s~~~~~~~y~~v~~ 35 (91)
T PF06916_consen 3 LKQLFKKYGYVALGVYLGLSFISLGSCYLAVSS 35 (91)
T ss_pred HHHHHHHhCHhHHHHHHHHHHHHHHHHHHHHHh
Confidence 788999999999999999998888777776543
No 30
>PF07423 DUF1510: Protein of unknown function (DUF1510); InterPro: IPR009988 This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown.
Probab=35.22 E-value=36 Score=25.53 Aligned_cols=31 Identities=19% Similarity=0.267 Sum_probs=21.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHH-hhhhhHHHHH
Q psy516 25 VALVQGAVIAFGTAVAWKVLI-MDPHKKAISE 55 (80)
Q Consensus 25 ~~i~~a~~lsl~~~~~~kf~v-~~PRKkaYad 55 (80)
.||.+++++-||+.++|.++. .++..++-++
T Consensus 16 LNiaI~IV~lLIiiva~~lf~~~~~~~~~~a~ 47 (217)
T PF07423_consen 16 LNIAIGIVSLLIIIVAYQLFFGGDDSPAASAD 47 (217)
T ss_pred HHHHHHHHHHHHHHHhhhheecCCCchhhhhc
Confidence 467778888788888887777 5666555444
No 31
>COG1766 fliF Flagellar basal body M-ring protein [Cell motility and secretion]
Probab=33.70 E-value=32 Score=29.06 Aligned_cols=22 Identities=14% Similarity=0.292 Sum_probs=16.9
Q ss_pred HHHhhhhhHHHHHHHhcCCchH
Q psy516 43 VLIMDPHKKAISEFYKTYDGEK 64 (80)
Q Consensus 43 f~v~~PRKkaYadFYknYD~~k 64 (80)
++..+.++..|.--|.|.|..+
T Consensus 40 ~l~l~~~~p~y~~Ly~~l~~~d 61 (545)
T COG1766 40 ALLLWSRQPKYRTLYSNLSEED 61 (545)
T ss_pred HHheecCCCceeeecCCCCHHH
Confidence 3355566999999999988764
No 32
>PF06295 DUF1043: Protein of unknown function (DUF1043); InterPro: IPR009386 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=32.85 E-value=63 Score=21.83 Aligned_cols=21 Identities=14% Similarity=0.102 Sum_probs=14.2
Q ss_pred HHHHHHHHHHHHHHHHHHHhh
Q psy516 27 LVQGAVIAFGTAVAWKVLIMD 47 (80)
Q Consensus 27 i~~a~~lsl~~~~~~kf~v~~ 47 (80)
++++|++++++|++...+...
T Consensus 2 ~~i~lvvG~iiG~~~~r~~~~ 22 (128)
T PF06295_consen 2 AIIGLVVGLIIGFLIGRLTSS 22 (128)
T ss_pred hHHHHHHHHHHHHHHHHHhcc
Confidence 467788888777776555443
No 33
>PF14584 DUF4446: Protein of unknown function (DUF4446)
Probab=32.80 E-value=1.6e+02 Score=20.67 Aligned_cols=19 Identities=16% Similarity=0.296 Sum_probs=15.2
Q ss_pred HhhhhhHHHHHHHhcCCch
Q psy516 45 IMDPHKKAISEFYKTYDGE 63 (80)
Q Consensus 45 v~~PRKkaYadFYknYD~~ 63 (80)
-...-+++|..|.+..|..
T Consensus 24 kl~kl~r~Y~~lm~g~~~~ 42 (151)
T PF14584_consen 24 KLRKLKRRYDALMRGKDGK 42 (151)
T ss_pred HHHHHHHHHHHHhCCCCcc
Confidence 3445689999999999885
No 34
>COG3944 Capsular polysaccharide biosynthesis protein [Cell envelope biogenesis, outer membrane]
Probab=32.67 E-value=1.1e+02 Score=23.50 Aligned_cols=21 Identities=10% Similarity=0.140 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy516 23 IKVALVQGAVIAFGTAVAWKV 43 (80)
Q Consensus 23 lr~~i~~a~~lsl~~~~~~kf 43 (80)
..+|+++||++++..+..+=|
T Consensus 172 ~~~Ni~iaf~~Gl~~~igiaf 192 (226)
T COG3944 172 VMRNIVIAFLAGLAGAIGIAF 192 (226)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 356788888887766655543
No 35
>PF08311 Mad3_BUB1_I: Mad3/BUB1 homology region 1; InterPro: IPR013212 Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of BUB1 and MAD3 to CDC20p [].; PDB: 3ESL_B 4AEZ_I 4A1G_B 2LAH_A 2WVI_A 3SI5_B.
Probab=32.09 E-value=12 Score=25.01 Aligned_cols=26 Identities=12% Similarity=0.195 Sum_probs=21.7
Q ss_pred HHHHHHhcCCchHHHHHHHhcCcccccc
Q psy516 52 AISEFYKTYDGEKDFERMKKAGLFEEYT 79 (80)
Q Consensus 52 aYadFYknYD~~k~Fe~Mk~aG~fqS~~ 79 (80)
.|+++.. ||.+-|+.|.+.|++..+.
T Consensus 74 ~ya~~~~--~~~~if~~l~~~~IG~~~A 99 (126)
T PF08311_consen 74 KYADLSS--DPREIFKFLYSKGIGTKLA 99 (126)
T ss_dssp HHHTTBS--HHHHHHHHHHHHTTSTTBH
T ss_pred HHHHHcc--CHHHHHHHHHHcCccHHHH
Confidence 5777666 9999999999999987653
No 36
>PHA03165 hypothetical protein; Provisional
Probab=31.22 E-value=75 Score=19.32 Aligned_cols=11 Identities=18% Similarity=0.208 Sum_probs=9.3
Q ss_pred HHHHHhcCCch
Q psy516 53 ISEFYKTYDGE 63 (80)
Q Consensus 53 YadFYknYD~~ 63 (80)
|.||-.|..|-
T Consensus 36 ysdflsnlspf 46 (57)
T PHA03165 36 YSDFLSNLSPF 46 (57)
T ss_pred HHHHHhccCch
Confidence 78999998883
No 37
>PRK07021 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=31.08 E-value=88 Score=21.66 Aligned_cols=13 Identities=23% Similarity=-0.294 Sum_probs=6.7
Q ss_pred HHHHHHHHHHHHH
Q psy516 33 IAFGTAVAWKVLI 45 (80)
Q Consensus 33 lsl~~~~~~kf~v 45 (80)
+.++.+++|+|+.
T Consensus 29 ~~~g~gg~~~~~~ 41 (162)
T PRK07021 29 AAAAGAGYSWWLS 41 (162)
T ss_pred HHHHHHHHHHHhh
Confidence 3445555565544
No 38
>COG1862 YajC Preprotein translocase subunit YajC [Intracellular trafficking and secretion]
Probab=30.10 E-value=81 Score=20.93 Aligned_cols=33 Identities=15% Similarity=0.177 Sum_probs=21.3
Q ss_pred HHHHHHHHHHHHHHHHhhhhhHHHHHHHhcCCc
Q psy516 30 GAVIAFGTAVAWKVLIMDPHKKAISEFYKTYDG 62 (80)
Q Consensus 30 a~~lsl~~~~~~kf~v~~PRKkaYadFYknYD~ 62 (80)
.++.-+...+.|||+...|.+|+-.+.=+-.|+
T Consensus 11 ~ll~~vl~~~ifyFli~RPQrKr~K~~~~ml~s 43 (97)
T COG1862 11 LLLPLVLIFAIFYFLIIRPQRKRMKEHQELLNS 43 (97)
T ss_pred HHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHHh
Confidence 344445556677898999988886665444333
No 39
>cd03393 PAP2_like_3 PAP2_like_3 proteins. PAP2 is a super-family of phosphatases and haloperoxidases. This subgroup, which is specific to bacteria and archaea, lacks functional characterization and may act as a membrane-associated lipid phosphatase.
Probab=30.01 E-value=1.1e+02 Score=19.82 Aligned_cols=32 Identities=13% Similarity=-0.100 Sum_probs=24.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHH
Q psy516 22 SIKVALVQGAVIAFGTAVAWKVLIMDPHKKAI 53 (80)
Q Consensus 22 ~lr~~i~~a~~lsl~~~~~~kf~v~~PRKkaY 53 (80)
+.=+.+..+++++.......|..+..||=..+
T Consensus 13 ~~~~~~~~~~~~~~~~~~~lK~~~~r~RP~~~ 44 (125)
T cd03393 13 RLGRYLGLALCASGYLNAALKEVFKIPRPFTY 44 (125)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCcCCC
Confidence 34455677888888888999999998887543
No 40
>COG5346 Predicted membrane protein [Function unknown]
Probab=29.62 E-value=1.5e+02 Score=21.09 Aligned_cols=33 Identities=21% Similarity=0.008 Sum_probs=26.0
Q ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy516 11 PKPQLRNLLQSSIKVALVQGAVIAFGTAVAWKV 43 (80)
Q Consensus 11 ~KPqmRgLl~~~lr~~i~~a~~lsl~~~~~~kf 43 (80)
-|-|-||-+..++-|.++.-|.++|+++...+.
T Consensus 77 ~~~q~r~~~~~~~tril~liFgi~LVvsi~~~t 109 (136)
T COG5346 77 LKIQRRGQLYAKLTRILLLIFGIFLVVSIFPKT 109 (136)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456789999999999998888888877665543
No 41
>PF14316 DUF4381: Domain of unknown function (DUF4381)
Probab=29.49 E-value=1e+02 Score=20.83 Aligned_cols=16 Identities=6% Similarity=-0.159 Sum_probs=7.9
Q ss_pred HHHHHHHhhhhhHHHH
Q psy516 39 VAWKVLIMDPHKKAIS 54 (80)
Q Consensus 39 ~~~kf~v~~PRKkaYa 54 (80)
.+..+.+..-|+++|.
T Consensus 36 ~~~~~~~r~~~~~~yr 51 (146)
T PF14316_consen 36 LLLWRLWRRWRRNRYR 51 (146)
T ss_pred HHHHHHHHHHHccHHH
Confidence 3333444444556665
No 42
>PF07465 PsaM: Photosystem I protein M (PsaM); InterPro: IPR010010 Members of this protein family are PsaM, which is subunit XII of the photosystem I reaction centre. PsaM forms part of the photosystem I complex and its binding is stabilised by PsaI []. This protein is found in both the Cyanobacteria and the chloroplasts of plants, but is absent from non-oxygenic photosynthetic bacteria such as Rhodobacter sphaeroides. Species that contain photosystem I also contain photosystem II, which splits water and releases molecular oxygen.; GO: 0015979 photosynthesis, 0009522 photosystem I, 0030094 plasma membrane-derived photosystem I; PDB: 3PCQ_M 1JB0_M.
Probab=29.43 E-value=82 Score=16.92 Aligned_cols=15 Identities=27% Similarity=0.275 Sum_probs=10.0
Q ss_pred HHHHHHHHHHHHHHH
Q psy516 26 ALVQGAVIAFGTAVA 40 (80)
Q Consensus 26 ~i~~a~~lsl~~~~~ 40 (80)
++.+|++++++.+++
T Consensus 5 Qi~iAL~~Al~~~iL 19 (29)
T PF07465_consen 5 QIFIALVIALITGIL 19 (29)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHH
Confidence 567777777766543
No 43
>PF09813 Coiled-coil_56: Coiled-coil domain-containing protein 56; InterPro: IPR018628 Members of this family of proteins have no known function.
Probab=29.10 E-value=1.1e+02 Score=20.75 Aligned_cols=27 Identities=11% Similarity=0.080 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy516 21 SSIKVALVQGAVIAFGTAVAWKVLIMD 47 (80)
Q Consensus 21 ~~lr~~i~~a~~lsl~~~~~~kf~v~~ 47 (80)
++=++|++.+++++..+...|-+.+.-
T Consensus 46 ~~R~rN~~Tgl~L~~~v~gIY~YTi~s 72 (100)
T PF09813_consen 46 RRRRRNLLTGLALGAFVVGIYAYTIYS 72 (100)
T ss_pred HHhhhhHHHHHHHHHHHHHHHhheeee
Confidence 334478889999877777766554443
No 44
>PF08019 DUF1705: Domain of unknown function (DUF1705); InterPro: IPR012549 Some members of this family are putative bacterial membrane proteins. This domain is found immediately N-terminal to the sulphatase domain in many sulphatases.; GO: 0016021 integral to membrane
Probab=29.08 E-value=1.9e+02 Score=19.70 Aligned_cols=15 Identities=13% Similarity=0.253 Sum_probs=11.7
Q ss_pred hhHHHHHHHhcCCch
Q psy516 49 HKKAISEFYKTYDGE 63 (80)
Q Consensus 49 RKkaYadFYknYD~~ 63 (80)
--+.|+.|+||+...
T Consensus 118 ~~~~~~~~~Rn~~~l 132 (156)
T PF08019_consen 118 FYKDYASFFRNHKEL 132 (156)
T ss_pred HHHhHHHHHhchHHH
Confidence 347899999998765
No 45
>PF12301 CD99L2: CD99 antigen like protein 2; InterPro: IPR022078 This family of proteins is found in eukaryotes. Proteins in this family are typically between 165 and 237 amino acids in length. CD99L2 and CD99 are involved in trans-endothelial migration of neutrophils in vitro and in the recruitment of neutrophils into inflamed peritoneum.
Probab=29.08 E-value=78 Score=22.93 Aligned_cols=25 Identities=20% Similarity=0.183 Sum_probs=17.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Q psy516 23 IKVALVQGAVIAFGTAVAWKVLIMD 47 (80)
Q Consensus 23 lr~~i~~a~~lsl~~~~~~kf~v~~ 47 (80)
+-.-||.|++++|+.++.=+|.+..
T Consensus 116 ~IaGIvsav~valvGAvsSyiaYqk 140 (169)
T PF12301_consen 116 TIAGIVSAVVVALVGAVSSYIAYQK 140 (169)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHh
Confidence 4456888999988777776665543
No 46
>COG5102 SFT2 Membrane protein involved in ER to Golgi transport [Intracellular trafficking and secretion]
Probab=28.88 E-value=58 Score=24.49 Aligned_cols=23 Identities=17% Similarity=0.190 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhh
Q psy516 28 VQGAVIAFGTAVAWKVLIMDPHK 50 (80)
Q Consensus 28 ~~a~~lsl~~~~~~kf~v~~PRK 50 (80)
.+|.+++..+...+++++..|||
T Consensus 76 LGa~ac~a~~~fmfpVl~lkPrk 98 (201)
T COG5102 76 LGAGACSAFLYFMFPVLRLKPRK 98 (201)
T ss_pred hhhHHHHHHHHHHHHHHhcCccc
Confidence 34445555666777888999998
No 47
>TIGR00739 yajC preprotein translocase, YajC subunit. While this protein is part of the preprotein translocase in Escherichia coli, it is not essential for viability or protein secretion. The N-terminus region contains a predicted membrane-spanning region followed by a region consisting almost entirely of residues with charged (acidic, basic, or zwitterionic) side chains. This small protein is about 100 residues in length, and is restricted to bacteria; however, this protein is absent from some lineages, including spirochetes and Mycoplasmas.
Probab=28.21 E-value=1.1e+02 Score=19.42 Aligned_cols=20 Identities=20% Similarity=0.290 Sum_probs=12.6
Q ss_pred HHHHHHHHHHhhhhhHHHHH
Q psy516 36 GTAVAWKVLIMDPHKKAISE 55 (80)
Q Consensus 36 ~~~~~~kf~v~~PRKkaYad 55 (80)
+..+.|+|+...|.||+-++
T Consensus 11 v~~~i~yf~~~rpqkK~~k~ 30 (84)
T TIGR00739 11 LIFLIFYFLIIRPQRKRRKA 30 (84)
T ss_pred HHHHHHHHheechHHHHHHH
Confidence 34555667777777776543
No 48
>PF12783 Sec7_N: Guanine nucleotide exchange factor in Golgi transport N-terminal
Probab=27.70 E-value=28 Score=23.59 Aligned_cols=12 Identities=17% Similarity=0.592 Sum_probs=9.7
Q ss_pred HHHHHHhcCCch
Q psy516 52 AISEFYKTYDGE 63 (80)
Q Consensus 52 aYadFYknYD~~ 63 (80)
--.|+|.|||-.
T Consensus 150 ~l~~lf~NYDCd 161 (168)
T PF12783_consen 150 FLVDLFVNYDCD 161 (168)
T ss_pred HHHHHHHHcCCC
Confidence 457899999975
No 49
>PF08066 PMC2NT: PMC2NT (NUC016) domain; InterPro: IPR012588 Exosomes are nano-compartments that function in the degradation or processing of RNA (including mRNA, rRNA, snRNA and snoRNA) [, ]. Exosomes occur in both archaea and eukaryotes, and have a similar overall structure to each other and to bacterial/organelle PNPases (polynucleotide phosphorylases; 2.7.7.8 from EC) [], consisting of a barrel structure composed of a hexameric ring of PH domains that act as a degradation chamber, and an S1-domain/KH-domain containing cap that binds the RNA substrate (and sometimes accessory proteins) in order to regulate and restrict entry into the degradation chamber []. There are two types of exosomes in eukaryotes, cytoplasmic exosomes that are responsible for 3'-5' exoribonuclease degradation of mRNAs, and nuclear exosomes that degrade pre-mRNAs (such as nonsense transcripts) and degrade rRNAs, snRNAs and snoRNAs. Unstructured RNA substrates feed in through the pore made by the S1 domains, are degraded by the PH domain ring, and exit as nucleotides via the PH pore at the opposite end of the barrel [, ]. There are several accessory proteins that help degrade, unwind or polyadenylate RNA substrate before they enter the exosome. This entry represents the N-terminal domain of Rrp6 (exosome component 10 in humans), a nuclear exosome accessory factor that interacts with the bottom of the hexameric PH-ring opposite the cap. Rrp6 functions as a hydrolytic exonuclease, and is homologous to RNase-D in Escherichia coli. More information about these proteins can be found at Protein of the Month: RNA Exosomes [].; GO: 0006396 RNA processing, 0000176 nuclear exosome (RNase complex)
Probab=27.51 E-value=25 Score=22.30 Aligned_cols=10 Identities=40% Similarity=1.052 Sum_probs=8.4
Q ss_pred HHHHhcCCch
Q psy516 54 SEFYKTYDGE 63 (80)
Q Consensus 54 adFYknYD~~ 63 (80)
-+||+++||.
T Consensus 16 ~~Fy~s~dp~ 25 (91)
T PF08066_consen 16 YDFYRSFDPE 25 (91)
T ss_pred HHHHHHhhHH
Confidence 4799999985
No 50
>PF14004 DUF4227: Protein of unknown function (DUF4227)
Probab=27.00 E-value=1.3e+02 Score=18.96 Aligned_cols=29 Identities=17% Similarity=0.213 Sum_probs=18.6
Q ss_pred HHHHHHHHHHHHHHHHhhhhhHHHHHHHhcCCch
Q psy516 30 GAVIAFGTAVAWKVLIMDPHKKAISEFYKTYDGE 63 (80)
Q Consensus 30 a~~lsl~~~~~~kf~v~~PRKkaYadFYknYD~~ 63 (80)
-+++.++++.+++++... -.|-|.+||++
T Consensus 9 ~~~LF~~~T~lfYy~~~w-----~~~~~~~~hrY 37 (71)
T PF14004_consen 9 FFLLFTGCTLLFYYAILW-----VSDEYEPYHRY 37 (71)
T ss_pred HHHHHHHHHHHHHHHHHH-----HHHHhccCCCC
Confidence 455666677777766655 34667777765
No 51
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=26.88 E-value=17 Score=23.78 Aligned_cols=14 Identities=0% Similarity=-0.223 Sum_probs=8.0
Q ss_pred HHHHHHHHHHHHHh
Q psy516 33 IAFGTAVAWKVLIM 46 (80)
Q Consensus 33 lsl~~~~~~kf~v~ 46 (80)
+..+++++++|++.
T Consensus 79 v~~lv~~l~w~f~~ 92 (96)
T PTZ00382 79 VGGLVGFLCWWFVC 92 (96)
T ss_pred HHHHHHHHhheeEE
Confidence 44555666666555
No 52
>PF01578 Cytochrom_C_asm: Cytochrome C assembly protein; InterPro: IPR002541 This entry consists of various proteins involved in cytochrome c assembly from mitochondria and bacteria; CycK from Rhizobium leguminosarum [], CcmC from Escherichia coli and Paracoccus denitrificans [, ] and orf240 from Triticum aestivum (Wheat) mitochondria []. The members of this family are probably integral membrane proteins with six predicted transmembrane helices that may comprise the membrane component of an ABC (ATP binding cassette) transporter complex. This transporter may be necessary for transport of some component needed for cytochrome c assembly. One member, R. leguminosarum CycK, contains a putative haem-binding motif []. Wheat orf240 also contains a putative haem-binding motif and is a proposed ABC transporter with c-type haem as its proposed substrate []. However it seems unlikely that all members of this family transport haem or c-type apocytochromes because P. denitrificans CcmC transports neither [].; GO: 0006461 protein complex assembly, 0008535 respiratory chain complex IV assembly, 0016020 membrane
Probab=26.84 E-value=1.4e+02 Score=20.78 Aligned_cols=42 Identities=24% Similarity=0.268 Sum_probs=27.4
Q ss_pred HHHHHHHHHHHHHHHHhhhhhHHHHHHHhcCCchHHHHHHHh
Q psy516 30 GAVIAFGTAVAWKVLIMDPHKKAISEFYKTYDGEKDFERMKK 71 (80)
Q Consensus 30 a~~lsl~~~~~~kf~v~~PRKkaYadFYknYD~~k~Fe~Mk~ 71 (80)
+++++.+.++.|-+...+=+||+-..++++--+.+..|++..
T Consensus 85 ~~~ia~~~al~~l~~~~~Lk~~~~~~~~~~lp~l~~le~~~~ 126 (214)
T PF01578_consen 85 AFAIAALAALLYLIQERRLKKKKFSRFYQRLPSLETLERLSY 126 (214)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhhccccccccchHHHHHHHHH
Confidence 344455555555444444456778888888888888888764
No 53
>KOG4671|consensus
Probab=26.47 E-value=55 Score=24.72 Aligned_cols=28 Identities=21% Similarity=0.355 Sum_probs=17.5
Q ss_pred HHHHHHHHHHHHHHHHhhhhhHHHHHHHhcCCc
Q psy516 30 GAVIAFGTAVAWKVLIMDPHKKAISEFYKTYDG 62 (80)
Q Consensus 30 a~~lsl~~~~~~kf~v~~PRKkaYadFYknYD~ 62 (80)
|-+...|+++++ .+| |+.++|+|||--+
T Consensus 167 Atifl~G~a~l~---~c~--~~~eedyy~~~k~ 194 (201)
T KOG4671|consen 167 ATIFLIGAAFLL---CCD--KENEEDYYKEKKI 194 (201)
T ss_pred HHHHHhhhhHhh---eeC--chhHHHHhhcccc
Confidence 444455555544 666 4568999998543
No 54
>PF06679 DUF1180: Protein of unknown function (DUF1180); InterPro: IPR009565 This entry consists of several hypothetical eukaryotic proteins thought to be membrane proteins. Their function is unknown.
Probab=26.21 E-value=79 Score=22.79 Aligned_cols=28 Identities=11% Similarity=0.018 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy516 23 IKVALVQGAVIAFGTAVAWKVLIMDPHK 50 (80)
Q Consensus 23 lr~~i~~a~~lsl~~~~~~kf~v~~PRK 50 (80)
++|.+++.++++..+.+.|-+-+..=||
T Consensus 94 l~R~~~Vl~g~s~l~i~yfvir~~R~r~ 121 (163)
T PF06679_consen 94 LKRALYVLVGLSALAILYFVIRTFRLRR 121 (163)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 6788877777766555555444443343
No 55
>PRK05585 yajC preprotein translocase subunit YajC; Validated
Probab=25.76 E-value=1.6e+02 Score=19.58 Aligned_cols=19 Identities=26% Similarity=0.375 Sum_probs=12.2
Q ss_pred HHHHHHHHHhhhhhHHHHH
Q psy516 37 TAVAWKVLIMDPHKKAISE 55 (80)
Q Consensus 37 ~~~~~kf~v~~PRKkaYad 55 (80)
..+.++|+...|.||+-++
T Consensus 27 i~~i~yf~~~RpqkK~~k~ 45 (106)
T PRK05585 27 FFAIFYFLIIRPQQKRQKE 45 (106)
T ss_pred HHHHHHHHhccHHHHHHHH
Confidence 4445667777777777653
No 56
>TIGR03053 PS_I_psaM photosystem I reaction center subunit XII. Members of this protein family are PsaM, which is subunit XII of the photosystem I reaction center. This protein is found in both the Cyanobacteria and the chloroplasts of plants, but is absent from non-oxygenic photosynthetic bacteria such as Rhodobacter sphaeroides. Species that contain photosystem I also contain photosystem II, which splits water and releases molecular oxygen. The seed alignment for this model includes sequences from Pfam model pfam07465 and additional sequences, as from Prochlorococcus.
Probab=25.57 E-value=1e+02 Score=16.48 Aligned_cols=15 Identities=20% Similarity=0.233 Sum_probs=9.8
Q ss_pred HHHHHHHHHHHHHHH
Q psy516 26 ALVQGAVIAFGTAVA 40 (80)
Q Consensus 26 ~i~~a~~lsl~~~~~ 40 (80)
++.+|++++++.+++
T Consensus 5 Qi~iaL~~Al~~~iL 19 (29)
T TIGR03053 5 QIFIALVIALIAGIL 19 (29)
T ss_pred HHHHHHHHHHHHHHH
Confidence 566777777766543
No 57
>PF10883 DUF2681: Protein of unknown function (DUF2681); InterPro: IPR020274 This entry contains membrane proteins with no known function.
Probab=25.12 E-value=2e+02 Score=18.76 Aligned_cols=23 Identities=22% Similarity=0.150 Sum_probs=13.5
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhh
Q psy516 27 LVQGAVIAFGTAVAWKVLIMDPH 49 (80)
Q Consensus 27 i~~a~~lsl~~~~~~kf~v~~PR 49 (80)
.++++++++++.+.|+.--...+
T Consensus 9 ~~~~v~~~i~~y~~~k~~ka~~~ 31 (87)
T PF10883_consen 9 GVGAVVALILAYLWWKVKKAKKQ 31 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 45555666666666666555544
No 58
>TIGR03518 ABC_perm_GldF gliding motility-associated ABC transporter permease protein GldF. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldF is believed to be a ABC transporter permease protein (along with ATP-binding subunit, GldA and a sunstrate-binding subunit, GldG) and is linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockouts of GldF abolish the gliding phenotype. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility.
Probab=25.06 E-value=2.7e+02 Score=20.16 Aligned_cols=58 Identities=12% Similarity=0.087 Sum_probs=35.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh---HHHHHHHhcCCchHHHHHHHhcCcc
Q psy516 17 NLLQSSIKVALVQGAVIAFGTAVAWKVLIMDPHK---KAISEFYKTYDGEKDFERMKKAGLF 75 (80)
Q Consensus 17 gLl~~~lr~~i~~a~~lsl~~~~~~kf~v~~PRK---kaYadFYknYD~~k~Fe~Mk~aG~f 75 (80)
|++.+.+.++-..|.+++++......++...... ....++.+.-.+...|++.-+ |++
T Consensus 154 g~~iSsl~~~q~~a~~~~~~~~~~l~~~~~~l~~~~~~~~~~~l~~~sp~~~~~~~~~-g~i 214 (240)
T TIGR03518 154 GLFASSLTENQIVAFIIAVFLCFLFYFGFDGLASLLWGGSAYTISELGLSYHYESISR-GVI 214 (240)
T ss_pred HHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHhhhcchhHHHHHHHcCHHHHHHHHHc-Ccc
Confidence 5667777777777776766665444332111110 134578888888888888876 554
No 59
>COG3621 Patatin [General function prediction only]
Probab=24.54 E-value=1.2e+02 Score=24.99 Aligned_cols=52 Identities=13% Similarity=0.145 Sum_probs=31.7
Q ss_pred CCCCchhhh-HHHHHHHHHHH-H--HHHHHHHHHHHHHHHHhhhhhHHHH-HHHhcCCch
Q psy516 9 KLPKPQLRN-LLQSSIKVALV-Q--GAVIAFGTAVAWKVLIMDPHKKAIS-EFYKTYDGE 63 (80)
Q Consensus 9 ~l~KPqmRg-Ll~~~lr~~i~-~--a~~lsl~~~~~~kf~v~~PRKkaYa-dFYknYD~~ 63 (80)
-+++||-+| ++..++--... . +..-+++...+++=+ -.|..|. .|+||||+.
T Consensus 256 l~~~~~~~g~i~g~~liei~~~pa~~~~ss~~~~q~l~eF---as~~~lg~~ffrnld~n 312 (394)
T COG3621 256 LLAATQQAGLIWGNKLIEIPTLPAEMSNSSTENLQALKEF---ASKSILGSSFFRNLDAN 312 (394)
T ss_pred HhccccchhhhHHhhcchhhcchHHHHhhHHHHHHHHHHH---HhhhhHHhhhhhhccCC
Confidence 357899999 56655543332 2 233355555555433 3677887 899999864
No 60
>CHL00190 psaM photosystem I subunit XII; Provisional
Probab=24.49 E-value=1.1e+02 Score=16.59 Aligned_cols=14 Identities=14% Similarity=0.297 Sum_probs=9.4
Q ss_pred HHHHHHHHHHHHHH
Q psy516 26 ALVQGAVIAFGTAV 39 (80)
Q Consensus 26 ~i~~a~~lsl~~~~ 39 (80)
++.+|++++++.++
T Consensus 6 Qi~iAL~~Al~~~i 19 (30)
T CHL00190 6 QIFIALFLALTTGI 19 (30)
T ss_pred HHHHHHHHHHHHHH
Confidence 56777777776654
No 61
>PF01956 DUF106: Integral membrane protein DUF106; InterPro: IPR002809 This entry represents a group of eukaryotic and archaeal proteins that have no known function. Members are predicted to be integral membrane proteins.; GO: 0016020 membrane
Probab=23.92 E-value=2.4e+02 Score=19.25 Aligned_cols=18 Identities=17% Similarity=0.145 Sum_probs=9.6
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy516 28 VQGAVIAFGTAVAWKVLI 45 (80)
Q Consensus 28 ~~a~~lsl~~~~~~kf~v 45 (80)
++|++.++.+...-++.+
T Consensus 21 ~~~~~~~~~s~l~~~~~i 38 (168)
T PF01956_consen 21 LIAILRGLISELLQKFLI 38 (168)
T ss_pred HHHHHHHHHHHHHhcccc
Confidence 445555555555555553
No 62
>COG2156 KdpC K+-transporting ATPase, c chain [Inorganic ion transport and metabolism]
Probab=23.82 E-value=1.5e+02 Score=22.14 Aligned_cols=26 Identities=23% Similarity=0.359 Sum_probs=21.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy516 21 SSIKVALVQGAVIAFGTAVAWKVLIM 46 (80)
Q Consensus 21 ~~lr~~i~~a~~lsl~~~~~~kf~v~ 46 (80)
+++|..|+..+++.++++++|=.++.
T Consensus 3 ~~lr~Al~~~~~l~li~G~iYPl~~t 28 (190)
T COG2156 3 RQLRPALVLTLVLLLITGLIYPLLVT 28 (190)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 67888999999999999999876653
No 63
>PF05084 GRA6: Granule antigen protein (GRA6); InterPro: IPR008119 Toxoplasma gondii is an obligate intracellular apicomplexan protozoan parasite, with a complex lifestyle involving varied hosts []. It has two phases of growth: an intestinal phase in feline hosts, and an extra-intestinal phase in other mammals. Oocysts from infected cats develop into tachyzoites, and eventually, bradyzoites and zoitocysts in the extraintestinal host []. Transmission of the parasite occurs through contact with infected cats or raw/undercooked meat; in immunocompromised individuals, it can cause severe and often lethal toxoplasmosis. Acute infection in healthy humans can sometimes also cause tissue damage []. The protozoan utilises a variety of secretory and antigenic proteins to invade a host and gain access to the intracellular environment []. These originate from distinct organelles in the T. gondii cell termed micronemes, rhoptries, and dense granules. They are released at specific times during invasion to ensure the proteins are allocated to their correct target destinations []. Dense granule antigens (GRAs) are released from the T. gondii tachyzoite while still encapsulated in a host vacuole. Gra6, one of these moieties, is associated with the parasitophorous vacuole []. It possesses a hydrophobic central region flanked by two hydrophilic domains, and is present as a single copy gene in the Toxoplasma gondii genome []. Gra6 shares a similar function with Gra2, in that it is rapidly targeted to a network of membranous tubules that connect with the vacuolar membrane []. Indeed, these two proteins, together with Gra4, form a multimeric complex that stabilises the parasite within the vacuole.
Probab=23.18 E-value=1.4e+02 Score=22.59 Aligned_cols=25 Identities=20% Similarity=0.219 Sum_probs=15.3
Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHH
Q psy516 21 SSIKVA-LVQGAVIAFGTAVAWKVLI 45 (80)
Q Consensus 21 ~~lr~~-i~~a~~lsl~~~~~~kf~v 45 (80)
++-+|| +|++.++++.++++.+|+.
T Consensus 146 R~Q~RHR~IG~~VlA~~VA~L~~~F~ 171 (215)
T PF05084_consen 146 RTQKRHRLIGAVVLAVSVAMLTWFFL 171 (215)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344454 5777777777766655544
No 64
>PRK07193 fliF flagellar MS-ring protein; Reviewed
Probab=23.17 E-value=82 Score=26.54 Aligned_cols=17 Identities=6% Similarity=0.190 Sum_probs=13.6
Q ss_pred hhHHHHHHHhcCCchHH
Q psy516 49 HKKAISEFYKTYDGEKD 65 (80)
Q Consensus 49 RKkaYadFYknYD~~k~ 65 (80)
.+..|.--|.|.|+.+.
T Consensus 49 ~~p~Y~~Lys~L~~~da 65 (552)
T PRK07193 49 SSQGYRPLYGSQENVPT 65 (552)
T ss_pred cCCCeeecccCCCHHHH
Confidence 46679999999988654
No 65
>PF02532 PsbI: Photosystem II reaction centre I protein (PSII 4.8 kDa protein); InterPro: IPR003686 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbI, which is tightly associated with the D1/D2 heterodimer in PSII. The function of PsbI is unknown, but it may be involved in the assembly, dimerisation or stabilisation of PSII dimers [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_i 3ARC_I 3A0B_i 3BZ2_I 3PRQ_I 3KZI_I 3PRR_I 2AXT_i 4FBY_I 1S5L_i ....
Probab=23.16 E-value=1.6e+02 Score=16.69 Aligned_cols=24 Identities=29% Similarity=0.364 Sum_probs=14.3
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhH
Q psy516 28 VQGAVIAFGTAVAWKVLIMDPHKK 51 (80)
Q Consensus 28 ~~a~~lsl~~~~~~kf~v~~PRKk 51 (80)
+-.+++.+++-+.+-|+-+||-++
T Consensus 8 Vy~vV~ffv~LFifGflsnDp~Rn 31 (36)
T PF02532_consen 8 VYTVVIFFVSLFIFGFLSNDPGRN 31 (36)
T ss_dssp HHHHHHHHHHHHHHHHHTTCTTSS
T ss_pred ehhhHHHHHHHHhccccCCCCCCC
Confidence 334444444556677888888643
No 66
>KOG4606|consensus
Probab=23.03 E-value=76 Score=22.28 Aligned_cols=28 Identities=25% Similarity=0.279 Sum_probs=21.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhHH
Q psy516 24 KVALVQGAVIAFGTAVAWKVLIMDPHKKA 52 (80)
Q Consensus 24 r~~i~~a~~lsl~~~~~~kf~v~~PRKka 52 (80)
+|..++.+.+|+++..+|+++. ||+-.+
T Consensus 31 WRi~lvi~svc~gaigawywl~-dpet~~ 58 (126)
T KOG4606|consen 31 WRIALVIFSVCTGAIGAWYWLI-DPETQK 58 (126)
T ss_pred hhhHHHHHHHHHHhhhhhhhhc-Ccceee
Confidence 5667888899999999997764 576543
No 67
>COG1959 Predicted transcriptional regulator [Transcription]
Probab=22.91 E-value=57 Score=22.47 Aligned_cols=17 Identities=35% Similarity=0.585 Sum_probs=14.5
Q ss_pred hHHHHHHHhcCcccccc
Q psy516 63 EKDFERMKKAGLFEEYT 79 (80)
Q Consensus 63 ~k~Fe~Mk~aG~fqS~~ 79 (80)
.|-|..++++|+.+|++
T Consensus 43 ~kil~~L~kaGlV~S~r 59 (150)
T COG1959 43 EKILSKLRKAGLVKSVR 59 (150)
T ss_pred HHHHHHHHHcCCEEeec
Confidence 46788899999999986
No 68
>PF06522 B12D: NADH-ubiquinone reductase complex 1 MLRQ subunit; InterPro: IPR010530 The MLRQ subunit of mitochondrial NADH-ubiquinone reductase complex I is nuclear [] and is found in plants [], insects, fungi and higher metazoans []. It appears to act within the membrane and, in mammals, is highly expressed in muscle and neural tissue, indicative of a role in ATP generation [].
Probab=22.67 E-value=1.2e+02 Score=18.61 Aligned_cols=21 Identities=19% Similarity=0.474 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHHHHHHhhh
Q psy516 28 VQGAVIAFGTAVAWKVLIMDP 48 (80)
Q Consensus 28 ~~a~~lsl~~~~~~kf~v~~P 48 (80)
++++++++.+..++..+..+|
T Consensus 12 ~vg~a~~~a~~~~~r~l~~~P 32 (73)
T PF06522_consen 12 IVGVAVGGATFYLYRLLLTNP 32 (73)
T ss_pred HHHHHHHHHHHHHHHHHhcCC
Confidence 445555555555666565555
No 69
>PF12669 P12: Virus attachment protein p12 family
Probab=22.66 E-value=1.3e+02 Score=17.90 Aligned_cols=7 Identities=29% Similarity=0.804 Sum_probs=2.8
Q ss_pred HHHHHHH
Q psy516 27 LVQGAVI 33 (80)
Q Consensus 27 i~~a~~l 33 (80)
|++++++
T Consensus 2 iII~~Ii 8 (58)
T PF12669_consen 2 IIIGIII 8 (58)
T ss_pred eeHHHHH
Confidence 3444443
No 70
>PF15176 LRR19-TM: Leucine-rich repeat family 19 TM domain
Probab=22.65 E-value=2.1e+02 Score=19.46 Aligned_cols=41 Identities=17% Similarity=0.102 Sum_probs=18.8
Q ss_pred ccCCCCCchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy516 6 ATGKLPKPQLRNLLQSSIKVALVQGAVIAFGTAVAWKVLIM 46 (80)
Q Consensus 6 ~~~~l~KPqmRgLl~~~lr~~i~~a~~lsl~~~~~~kf~v~ 46 (80)
++...++|--.|===..|=--+++++++|+.++++-|.=++
T Consensus 3 ~~~~~~~~~~~g~sW~~LVGVv~~al~~SlLIalaaKC~~~ 43 (102)
T PF15176_consen 3 SSANAPGPGEGGRSWPFLVGVVVTALVTSLLIALAAKCPVW 43 (102)
T ss_pred ccccCCCCCCCCcccHhHHHHHHHHHHHHHHHHHHHHhHHH
Confidence 34445555433211122222335555566666666655443
No 71
>PF08475 Baculo_VP91_N: Viral capsid protein 91 N-terminal; InterPro: IPR013682 This domain is found in Baculoviridae including the nucleopolyhedrovirus at the N terminus of the viral capsid protein 91 (VP91) [].
Probab=22.42 E-value=1.4e+02 Score=22.08 Aligned_cols=35 Identities=11% Similarity=0.259 Sum_probs=26.3
Q ss_pred HHHHHHHHHHHHHHHHHHHhh-------hhhHHHHHHHhcCC
Q psy516 27 LVQGAVIAFGTAVAWKVLIMD-------PHKKAISEFYKTYD 61 (80)
Q Consensus 27 i~~a~~lsl~~~~~~kf~v~~-------PRKkaYadFYknYD 61 (80)
|.+++++.++..+.|.+.+.| -|.+-..||.|+..
T Consensus 2 Llvai~l~iif~i~y~~I~~dFde~~F~~rL~Vl~EYlkrtn 43 (183)
T PF08475_consen 2 LLVAILLIIIFLIYYLIIYNDFDENEFDNRLQVLTEYLKRTN 43 (183)
T ss_pred chhHHHHHHHHHHHHHhhccccchHHHHHHHHHHHHHHHhcC
Confidence 456777777777777777654 78889999999864
No 72
>cd04742 NPD_FabD 2-Nitropropane dioxygenase (NPD)-like domain, associated with the (acyl-carrier-protein) S-malonyltransferase FabD. NPD is part of the nitroalkaneoxidizing enzyme family, that catalyzes oxidative denitrification of nitroalkanes to their corresponding carbonyl compounds and nitrites. NDPs are members of the NAD(P)H-dependent flavin oxidoreductase family that reduce a range of alternative electron acceptors. Most use FAD/FMN as a cofactor and NAD(P)H as electron donor. Some contain 4Fe-4S cluster to transfer electron from FAD to FMN.
Probab=22.17 E-value=63 Score=26.42 Aligned_cols=24 Identities=17% Similarity=0.412 Sum_probs=19.3
Q ss_pred hhhhHHHHHHHhcCCchHHHHHHH
Q psy516 47 DPHKKAISEFYKTYDGEKDFERMK 70 (80)
Q Consensus 47 ~PRKkaYadFYknYD~~k~Fe~Mk 70 (80)
-.|=+++-|.|++||+.++.+..-
T Consensus 300 ~~ra~kl~~ly~~~~s~~~i~~~~ 323 (418)
T cd04742 300 PARANKLYELYRRYDSLEEIPAKT 323 (418)
T ss_pred hHHHHHHHHHHHhcCChhhhhHHH
Confidence 347888999999999988776553
No 73
>PF13219 DUF4027: Protein of unknown function (DUF4027)
Probab=21.94 E-value=1e+02 Score=17.45 Aligned_cols=28 Identities=21% Similarity=0.190 Sum_probs=18.8
Q ss_pred hHHHHHHHHHH-HHHHHHHHHHHHHHHHH
Q psy516 17 NLLQSSIKVAL-VQGAVIAFGTAVAWKVL 44 (80)
Q Consensus 17 gLl~~~lr~~i-~~a~~lsl~~~~~~kf~ 44 (80)
+|+-+|+---+ .++|+++.++++..|..
T Consensus 6 nL~ysq~Vsl~CLGGf~~~~~lA~~iKi~ 34 (36)
T PF13219_consen 6 NLSYSQGVSLICLGGFALSVTLAAIIKIF 34 (36)
T ss_pred hhhhhhCeeEEecchHHHHHHHHHHHHHh
Confidence 44444443322 57899999999998875
No 74
>PF03597 CcoS: Cytochrome oxidase maturation protein cbb3-type; InterPro: IPR004714 Cytochrome cbb3 oxidases are found almost exclusively in Proteobacteria, and represent a distinctive class of proton-pumping respiratory haem-copper oxidases (HCO) that lack many of the key structural features that contribute to the reaction cycle of the intensely studied mitochondrial cytochrome c oxidase (CcO). Expression of cytochrome cbb3 oxidase allows human pathogens to colonise anoxic tissues and agronomically important diazotrophs to sustain nitrogen fixation []. Genes encoding a cytochrome cbb3 oxidase were initially designated fixNOQP (ccoNOQP), the ccoNOQP operon is always found close to a second gene cluster, known as fixGHIS (ccoGHIS) whose expression is necessary for the assembly of a functional cbb3 oxidase. On the basis of their derived amino acid sequences each of the four proteins encoded by the ccoGHIS operon are thought to be membrane-bound. It has been suggested that they may function in concert as a multi-subunit complex, possibly playing a role in the uptake and metabolism of copper required for the assembly of the binuclear centre of cytochrome cbb3 oxidase.
Probab=21.71 E-value=1.7e+02 Score=16.84 Aligned_cols=19 Identities=5% Similarity=0.095 Sum_probs=10.5
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy516 27 LVQGAVIAFGTAVAWKVLI 45 (80)
Q Consensus 27 i~~a~~lsl~~~~~~kf~v 45 (80)
|++++++++++..+|.+.+
T Consensus 7 ip~sl~l~~~~l~~f~Wav 25 (45)
T PF03597_consen 7 IPVSLILGLIALAAFLWAV 25 (45)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 5666666655555554444
No 75
>cd03388 PAP2_SPPase1 PAP2_like proteins, sphingosine-1-phosphatase subfamily. Sphingosine-1-phosphatase is an intracellular enzyme located in the endoplasmic reticulum, which regulates the level of sphingosine-1-phosphate (S1P), a bioactive lipid. S1P acts as a second messenger in the cell, and extracellularly by binding to G-protein coupled receptors of the endothelial differentiation gene family.
Probab=21.50 E-value=1.9e+02 Score=19.38 Aligned_cols=27 Identities=19% Similarity=0.033 Sum_probs=18.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhH
Q psy516 25 VALVQGAVIAFGTAVAWKVLIMDPHKK 51 (80)
Q Consensus 25 ~~i~~a~~lsl~~~~~~kf~v~~PRKk 51 (80)
+.++.+++++.......|..+..||=-
T Consensus 36 ~~~~~~~~~~~~~~~~lK~~~~r~RP~ 62 (151)
T cd03388 36 RDLVVVLALGMYIGQFIKDLFCLPRPS 62 (151)
T ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCcC
Confidence 345556666666667789999988853
No 76
>TIGR03356 BGL beta-galactosidase.
Probab=21.45 E-value=48 Score=26.44 Aligned_cols=20 Identities=30% Similarity=0.672 Sum_probs=17.2
Q ss_pred HHHHHhcCCchHHHHHHHhcCc
Q psy516 53 ISEFYKTYDGEKDFERMKKAGL 74 (80)
Q Consensus 53 YadFYknYD~~k~Fe~Mk~aG~ 74 (80)
=.|||.+|+ +|++.|++.|+
T Consensus 49 a~d~y~~y~--eDi~l~~~~G~ 68 (427)
T TIGR03356 49 ACDHYHRYE--EDVALMKELGV 68 (427)
T ss_pred cccHHHhHH--HHHHHHHHcCC
Confidence 457999987 78999999996
No 77
>PF14283 DUF4366: Domain of unknown function (DUF4366)
Probab=21.27 E-value=9 Score=28.62 Aligned_cols=29 Identities=14% Similarity=0.173 Sum_probs=17.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhHHHH
Q psy516 26 ALVQGAVIAFGTAVAWKVLIMDPHKKAIS 54 (80)
Q Consensus 26 ~i~~a~~lsl~~~~~~kf~v~~PRKkaYa 54 (80)
-|++.+++.++.|++|+|-+..|++....
T Consensus 163 ll~lllv~l~gGGa~yYfK~~K~K~~~~~ 191 (218)
T PF14283_consen 163 LLLLLLVALIGGGAYYYFKFYKPKQEEKA 191 (218)
T ss_pred HHHHHHHHHhhcceEEEEEEecccccccc
Confidence 34444444455577777778777766554
No 78
>PF04719 TAFII28: hTAFII28-like protein conserved region; InterPro: IPR006809 The general transcription factor, TFIID, consists of the TATA-binding protein (TBP) associated with a series of TBP-associated factors (TAFs) that together participate in the assembly of the transcription preinitiation complex. The conserved region is found at the C terminus of most member proteins. The crystal structure of hTAFII28 with hTAFII18 shows that this region is involved in the binding of these two subunits. The conserved region contains four alpha helices and three loops arranged as in histone H3 [, ].; GO: 0006367 transcription initiation from RNA polymerase II promoter, 0005634 nucleus; PDB: 1BH9_B 1BH8_B.
Probab=21.11 E-value=1e+02 Score=20.14 Aligned_cols=38 Identities=13% Similarity=0.259 Sum_probs=21.1
Q ss_pred CCCCCchhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhh
Q psy516 8 GKLPKPQLRNLLQSSIK-VALVQGAVIAFGTAVAWKVLIMD 47 (80)
Q Consensus 8 ~~l~KPqmRgLl~~~lr-~~i~~a~~lsl~~~~~~kf~v~~ 47 (80)
+.++||.+|-|...-+- -++ .=-+.++++.+=|.+|.|
T Consensus 22 s~~~k~~ikkli~~~~~~qsv--~~~v~i~v~g~aKvFVGE 60 (90)
T PF04719_consen 22 SSFNKAAIKKLINQVLGNQSV--SQNVVIAVAGIAKVFVGE 60 (90)
T ss_dssp ----HHHHHHHHHHHHS-S-----HHHHHHHHHHHHHHHHH
T ss_pred ccCCHHHHHHHHHHHcCCCCC--ChhHHHHHHHHHHHHHHH
Confidence 56899999998888664 222 223334556666788776
No 79
>PF14086 DUF4266: Domain of unknown function (DUF4266)
Probab=21.02 E-value=75 Score=18.90 Aligned_cols=21 Identities=24% Similarity=0.582 Sum_probs=15.6
Q ss_pred CCCCCchh---hhHHHHHHHHHHH
Q psy516 8 GKLPKPQL---RNLLQSSIKVALV 28 (80)
Q Consensus 8 ~~l~KPqm---RgLl~~~lr~~i~ 28 (80)
..|+||+| +..+...+..|+.
T Consensus 7 ~~La~~~M~l~~~~~~~~~~~h~y 30 (50)
T PF14086_consen 7 GNLADPEMALDRDPLEFELNFHIY 30 (50)
T ss_pred ccccCcccCCCccHHHHHHHhhhe
Confidence 46899998 6677777777764
No 80
>PF12411 Choline_sulf_C: Choline sulfatase enzyme C terminal
Probab=20.99 E-value=71 Score=19.28 Aligned_cols=50 Identities=20% Similarity=0.342 Sum_probs=35.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhcCCchHHHHHHHh
Q psy516 18 LLQSSIKVALVQGAVIAFGTAVAWKVLIMDPHKKAISEFYKTYDGEKDFERMKK 71 (80)
Q Consensus 18 Ll~~~lr~~i~~a~~lsl~~~~~~kf~v~~PRKkaYadFYknYD~~k~Fe~Mk~ 71 (80)
.+.+|=+|.++-+ ++.-|....| -.+|.-.+=..|.+|+-+.++.|+|..
T Consensus 4 Vl~SQrrRr~V~~-AL~~G~~~~W---DyqP~~das~~YvRnh~~Ld~lE~~aR 53 (54)
T PF12411_consen 4 VLASQRRRRFVYS-ALKQGRFTSW---DYQPPQDASQRYVRNHMDLDDLERRAR 53 (54)
T ss_pred HHHHHHHHHHHHH-HHHcCCCCCC---CCCCCcCcHHHHHHcCCCHHHHHHHhc
Confidence 4677777777654 2222333333 678999999999999999999998753
No 81
>PTZ00045 apical membrane antigen 1; Provisional
Probab=20.61 E-value=95 Score=26.92 Aligned_cols=28 Identities=18% Similarity=0.152 Sum_probs=16.7
Q ss_pred HHhhhhhHHHHHHHhcCCchH----HHHHHHh
Q psy516 44 LIMDPHKKAISEFYKTYDGEK----DFERMKK 71 (80)
Q Consensus 44 ~v~~PRKkaYadFYknYD~~k----~Fe~Mk~ 71 (80)
.-..+.+..|.+++..||=.| +.|+|.+
T Consensus 539 ~k~~~~~~~ydk~~~~~~y~~~~~~~~d~m~d 570 (595)
T PTZ00045 539 YRKKGNKDKYDKMDEAEDYGKTYNTDNDEMQD 570 (595)
T ss_pred hhccCCcchhhhhhhhhhcccccCCCchhhhC
Confidence 344555667777777776444 4555543
No 82
>PRK11878 psaM photosystem I reaction center subunit XII; Reviewed
Probab=20.55 E-value=1.4e+02 Score=16.56 Aligned_cols=14 Identities=21% Similarity=0.323 Sum_probs=9.2
Q ss_pred HHHHHHHHHHHHHH
Q psy516 26 ALVQGAVIAFGTAV 39 (80)
Q Consensus 26 ~i~~a~~lsl~~~~ 39 (80)
++.+|++++++.++
T Consensus 9 Qi~iaL~~Al~~gi 22 (34)
T PRK11878 9 QVFVALVVALHAGV 22 (34)
T ss_pred HHHHHHHHHHHHHH
Confidence 46677777776654
No 83
>PRK09966 putative inner membrane diguanylate cyclase; Provisional
Probab=20.17 E-value=3.8e+02 Score=20.04 Aligned_cols=25 Identities=8% Similarity=0.202 Sum_probs=11.3
Q ss_pred CCchhhh--HHHHHHHHHHHHHHHHHH
Q psy516 11 PKPQLRN--LLQSSIKVALVQGAVIAF 35 (80)
Q Consensus 11 ~KPqmRg--Ll~~~lr~~i~~a~~lsl 35 (80)
.||.|.. -+.+++.+.+...+++++
T Consensus 3 ~~~~m~~~~~l~~~l~~~~~~~~~~~~ 29 (407)
T PRK09966 3 NDNSLNKRPTFKRALRNISMTSIFITM 29 (407)
T ss_pred CCCCCCCCccHHHHHHHHHHHHHHHHH
Confidence 3555542 244555555444444433
Done!