Query psy5120
Match_columns 138
No_of_seqs 37 out of 39
Neff 4.1
Searched_HMMs 46136
Date Fri Aug 16 17:51:32 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5120.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5120hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1473|consensus 99.9 1.9E-28 4.1E-33 232.6 3.6 134 3-138 572-707 (1414)
2 KOG1473|consensus 98.1 1.8E-06 3.8E-11 84.4 2.8 112 4-115 835-958 (1414)
3 PF10929 DUF2811: Protein of u 55.2 18 0.00038 24.6 3.1 11 87-97 20-30 (57)
4 PF09906 DUF2135: Uncharacteri 54.6 6.8 0.00015 25.2 1.0 17 10-26 3-19 (50)
5 cd00923 Cyt_c_Oxidase_Va Cytoc 50.3 11 0.00024 28.3 1.7 30 96-125 43-72 (103)
6 cd07562 Peptidase_S41_TRI Tric 43.5 57 0.0012 26.5 5.0 37 86-122 22-58 (266)
7 PF02346 Vac_Fusion: Chordopox 40.3 33 0.00072 23.1 2.7 19 65-83 35-53 (57)
8 cd02412 30S_S3_KH K homology R 39.9 78 0.0017 22.9 4.8 72 2-78 6-93 (109)
9 PF14222 MOR2-PAG1_N: Cell mor 38.6 68 0.0015 29.7 5.3 69 67-135 151-224 (552)
10 PF02284 COX5A: Cytochrome c o 37.9 15 0.00032 27.8 0.8 28 96-123 46-73 (108)
11 PF14733 ACDC: AP2-coincident 37.4 99 0.0021 21.5 4.9 62 70-131 11-79 (91)
12 PF14684 Tricorn_C1: Tricorn p 37.3 35 0.00077 22.8 2.5 34 90-123 28-61 (70)
13 TIGR01009 rpsC_bact ribosomal 36.6 67 0.0015 26.2 4.4 72 2-78 7-94 (211)
14 KOG4077|consensus 34.7 25 0.00055 27.9 1.6 27 96-122 85-111 (149)
15 cd03565 VHS_Tom1 VHS domain fa 31.6 47 0.001 25.1 2.6 24 102-125 47-70 (141)
16 PF09326 DUF1982: Domain of un 30.9 18 0.0004 23.2 0.2 13 17-29 31-43 (49)
17 PF11491 DUF3213: Protein of u 29.4 19 0.00041 26.4 0.1 24 3-26 16-40 (88)
18 PF11230 DUF3029: Protein of u 25.8 1E+02 0.0023 28.7 4.2 30 63-92 114-143 (487)
19 PHA02675 ORF104 fusion protein 25.5 74 0.0016 23.4 2.6 20 64-83 63-82 (90)
20 PF14910 MMS22L_N: S-phase gen 24.8 69 0.0015 31.1 3.0 45 53-97 50-103 (704)
21 PF04510 DUF577: Family of unk 24.7 1.8E+02 0.0039 23.6 4.9 55 62-122 16-70 (174)
22 PF02064 MAS20: MAS20 protein 24.4 83 0.0018 23.8 2.8 38 90-130 78-115 (121)
23 KOG4061|consensus 23.5 62 0.0013 27.0 2.1 36 92-132 81-116 (217)
24 PF13976 gag_pre-integrs: GAG- 22.9 41 0.00089 21.3 0.8 14 125-138 21-34 (67)
25 PF11013 DUF2851: Protein of u 22.3 37 0.0008 30.2 0.6 68 35-116 240-307 (377)
26 PF13960 DUF4218: Domain of un 21.8 82 0.0018 24.2 2.4 27 66-92 25-51 (128)
27 PF09674 DUF2400: Protein of u 21.4 95 0.0021 25.9 2.8 64 61-124 27-90 (232)
No 1
>KOG1473|consensus
Probab=99.94 E-value=1.9e-28 Score=232.60 Aligned_cols=134 Identities=22% Similarity=0.226 Sum_probs=131.6
Q ss_pred ccccccccCCceeeeeccchhhhhhChhhhhHHHhHhhhhcccccCCCCCceeecc--cccCchHHHHHHHHHHHHHHHH
Q psy5120 3 LQSAFLMTTGFKSYLNQYSTNQIALNKTQRNEERDKKRHMSHKFSLTPTGEFKWAG--PVHGPRNVLVNTLRATLLQLES 80 (138)
Q Consensus 3 ~~frlG~e~~fk~Y~NqYs~N~~Alnk~qr~eerDkkr~ls~KFslt~~~eFkW~g--~~~Gs~~~~~~tlr~Ti~~le~ 80 (138)
+.|.+|+|++|+.| ||++|++++++.+++|++|+|+||++||++++.+.+.|.| .++|+..++.+++++|+.+.++
T Consensus 572 ~~~e~~~dqtf~~y--~ys~n~vse~~~~d~e~~dkk~~~~tkf~l~~nsd~~~~g~~~t~gt~~~~~~~~~~t~~~~lS 649 (1414)
T KOG1473|consen 572 RLREEGNDQTFMKY--YYSGNEVSEIFLTDSENADKKSHMQTKFALITNSDGVTAGNVTTYGTGSQHKKLIARTLQQGLS 649 (1414)
T ss_pred hhhhcccccchhhh--cccCCchhhccCCchhhhcccccccceecccccccceeccccccccchhhcchHHHhhhhhhhc
Confidence 46889999999999 9999999999999999999999999999999999999999 8999999999999999999999
Q ss_pred hcCcccCCCCchhhhhhHHHHhhcccCHHHHHHHHHHHhhhhcchhhhhhhhhhcCCC
Q psy5120 81 SIIPAFMHVNWPSLRKPWISAVNTSVKARDFARAMIVLQACIKPVVFASVWHEQLVHA 138 (138)
Q Consensus 81 ~Ip~~fmH~nW~~~rk~W~~~V~~~~~~~~fA~aL~~le~~ir~vvf~~vW~e~LGht 138 (138)
+||.++||++|+.+.|.|..||+.+.++.+|+-+++.++|++||++|+++|||+||||
T Consensus 650 niP~s~~n~~w~~~tkg~~lavs~A~~~~el~s~t~~~d~s~~~~~~~~~ssn~L~qt 707 (1414)
T KOG1473|consen 650 NIPISYNNRKWPVYTKGFELAVSAAADLAELSSETLEPDLSKRSNAFKAASSNILGQT 707 (1414)
T ss_pred cCchHhhhccchhhccchhhhhhccchHHHHHHhhcccchhhhhhhhccchhhhhcch
Confidence 9999999999999999999999999999999999999999999999999999999997
No 2
>KOG1473|consensus
Probab=98.09 E-value=1.8e-06 Score=84.42 Aligned_cols=112 Identities=13% Similarity=0.048 Sum_probs=97.8
Q ss_pred cccccccCCceeeeeccchhhhhhChhhhhHHHhHhhh-----hcccccCCCCCceeecc--cccCchHHHHHHHH-HHH
Q psy5120 4 QSAFLMTTGFKSYLNQYSTNQIALNKTQRNEERDKKRH-----MSHKFSLTPTGEFKWAG--PVHGPRNVLVNTLR-ATL 75 (138)
Q Consensus 4 ~frlG~e~~fk~Y~NqYs~N~~Alnk~qr~eerDkkr~-----ls~KFslt~~~eFkW~g--~~~Gs~~~~~~tlr-~Ti 75 (138)
||..++|+-++.|+++|.+|+.+++++|+++++++|++ +++|+++++.+.++|.+ .+.+.+++...+++ ++.
T Consensus 835 y~~~~~~n~~rie~s~~~~ng~~v~akQ~r~pgr~~~s~~~ek~A~~s~ld~f~~~Rggks~vvl~kavL~~~~mk~~v~ 914 (1414)
T KOG1473|consen 835 YWPKQMDNWLRIEHSIFQSNGVTVGAKQARDPGRTKQSLQAEKTAPKSDLDSFTWWRGGKSKVVLQKAVLSQSIMKKLVW 914 (1414)
T ss_pred cchhhccCceeeeechhccCceeechhhhcCCcchhhhcchhhccccccccchhhhhcCcceeeehhhhcchHHHHHHhh
Confidence 68889999999999999999999999999999999999 88999999999999988 67899999999999 788
Q ss_pred HHHHHhcCcccCCC----CchhhhhhHHHHhhcccCHHHHHHHH
Q psy5120 76 LQLESSIIPAFMHV----NWPSLRKPWISAVNTSVKARDFARAM 115 (138)
Q Consensus 76 ~~le~~Ip~~fmH~----nW~~~rk~W~~~V~~~~~~~~fA~aL 115 (138)
.+.+..+|..=+=+ -=++.|.-|-.+|..+.+...||.-|
T Consensus 915 ~~g~ta~~k~nfl~~~y~p~~s~~s~wk~av~n~enlh~LAlQ~ 958 (1414)
T KOG1473|consen 915 QQGFTAGPKSNFLDWSYIPRRSRRSCWKAAVENSENLHQLALQL 958 (1414)
T ss_pred ccccccCCcccccccccccchhhhhhhhhhhcChhhHHHHHHHH
Confidence 88887776432211 11468999999999999999999877
No 3
>PF10929 DUF2811: Protein of unknown function (DUF2811); InterPro: IPR021231 This is a bacterial family of uncharacterised proteins.
Probab=55.20 E-value=18 Score=24.55 Aligned_cols=11 Identities=36% Similarity=0.604 Sum_probs=9.1
Q ss_pred CCCCchhhhhh
Q psy5120 87 MHVNWPSLRKP 97 (138)
Q Consensus 87 mH~nW~~~rk~ 97 (138)
.||+|.++|-.
T Consensus 20 ~hP~WDQ~Rl~ 30 (57)
T PF10929_consen 20 THPNWDQYRLF 30 (57)
T ss_pred cCCCchHHHHH
Confidence 49999999854
No 4
>PF09906 DUF2135: Uncharacterized protein conserved in bacteria (DUF2135); InterPro: IPR019220 This entry, found in various hypothetical prokaryotic proteins, has no known function.
Probab=54.61 E-value=6.8 Score=25.24 Aligned_cols=17 Identities=18% Similarity=0.393 Sum_probs=14.0
Q ss_pred cCCceeeeeccchhhhh
Q psy5120 10 TTGFKSYLNQYSTNQIA 26 (138)
Q Consensus 10 e~~fk~Y~NqYs~N~~A 26 (138)
.|.|+.|||+|-.+...
T Consensus 3 ~G~Y~VyVnYyg~~~~~ 19 (50)
T PF09906_consen 3 PGTYLVYVNYYGNRSQS 19 (50)
T ss_pred CceEEEEEEeccCCCcC
Confidence 58899999999876554
No 5
>cd00923 Cyt_c_Oxidase_Va Cytochrome c oxidase subunit Va. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit Va is one of three mammalian subunits that lacks a transmembrane region. Subunit Va is located on the matrix side of the membrane and binds thyroid hormone T2, releasing allosteric inhibition caused by the binding of ATP to subunit IV and allowing high turnover at elevated intramitochondrial ATP/ADP ratios.
Probab=50.27 E-value=11 Score=28.27 Aligned_cols=30 Identities=27% Similarity=0.287 Sum_probs=24.5
Q ss_pred hhHHHHhhcccCHHHHHHHHHHHhhhhcch
Q psy5120 96 KPWISAVNTSVKARDFARAMIVLQACIKPV 125 (138)
Q Consensus 96 k~W~~~V~~~~~~~~fA~aL~~le~~ir~v 125 (138)
+.=+.|+..|+...|||.|.-+||++=-+|
T Consensus 43 ~ii~aaLrAcRRvND~alAVR~lE~vK~K~ 72 (103)
T cd00923 43 KVIEAALRACRRVNDFALAVRILEAIKDKC 72 (103)
T ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHHHc
Confidence 445788999999999999999999754333
No 6
>cd07562 Peptidase_S41_TRI Tricorn protease; serine protease family S41. The tricorn protease (TRI), a member of the S41 peptidase family and named for its tricorn-like shape, exists only in some archaea and eubacteria. It has been shown to act as a carboxypeptidase, involved in the degradation of proteasomal products to preferentially yield di- and tripeptides, with subsequent and final degradations to free amino acid residues by tricorn interacting factors, F1, F2 and F3. Tricorn is a hexameric D3-symmetric protease of 720kD, and can self-associate further into a giant icosahedral capsid structure containing twenty copies of the complex. Each tricorn peptidase monomer consists of five structural domains: a six-bladed beta-propeller and a seven-bladed beta-propeller that limit access to the active site, the two domains (C1 and C2) that carry the active site residues, and a PDZ-like domain (proposed to be important for substrate recognition) between the C1 and C2 domains. The active sit
Probab=43.48 E-value=57 Score=26.49 Aligned_cols=37 Identities=11% Similarity=0.298 Sum_probs=32.1
Q ss_pred cCCCCchhhhhhHHHHhhcccCHHHHHHHHHHHhhhh
Q psy5120 86 FMHVNWPSLRKPWISAVNTSVKARDFARAMIVLQACI 122 (138)
Q Consensus 86 fmH~nW~~~rk~W~~~V~~~~~~~~fA~aL~~le~~i 122 (138)
+-..+|...|+.....|..+.+..||+.+|.-+-+.+
T Consensus 22 ~~g~dW~~~~~~y~~~~~~~~~~~e~~~~l~~ml~~L 58 (266)
T cd07562 22 MHGVDWDAVRAEYRPLLPRAATRAELADVLNEMLGEL 58 (266)
T ss_pred CCCCCHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHH
Confidence 4456999999999999999999999999998776554
No 7
>PF02346 Vac_Fusion: Chordopoxvirus fusion protein; InterPro: IPR003436 This is a family of viral fusion proteins from the Chordopoxvirinae. A 14kDa Vaccinia virus protein has been demonstrated to function as a viral fusion protein mediating cell fusion at endosmomal (low) pH []. The protein, found in the envelope fraction of the virions, is required for fusing the outermost of the two golgi-derived membranes enveloping the virus with the plasma membrane, and its subsequent release extracellularly. The N-terminal proximal region is essential for its fusion ability.; GO: 0019064 viral envelope fusion with host membrane, 0019031 viral envelope
Probab=40.34 E-value=33 Score=23.08 Aligned_cols=19 Identities=26% Similarity=0.300 Sum_probs=16.9
Q ss_pred HHHHHHHHHHHHHHHHhcC
Q psy5120 65 NVLVNTLRATLLQLESSII 83 (138)
Q Consensus 65 ~~~~~tlr~Ti~~le~~Ip 83 (138)
..++.|||+.++-|+++|-
T Consensus 35 E~H~ETlRk~mv~L~kKiD 53 (57)
T PF02346_consen 35 EHHIETLRKYMVILAKKID 53 (57)
T ss_pred HHHHHHHHHHHHHHHHhcc
Confidence 4789999999999999984
No 8
>cd02412 30S_S3_KH K homology RNA-binding (KH) domain of the prokaryotic 30S small ribosomal subunit protein S3. S3 is part of the head region of the 30S ribosomal subunit and is believed to interact with mRNA as it threads its way from the latch into the channel. The KH motif is a beta-alpha-alpha-beta-beta unit that folds into an alpha-beta structure with a three stranded beta-sheet interupted by two contiguous helices. In general, KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA.
Probab=39.87 E-value=78 Score=22.87 Aligned_cols=72 Identities=19% Similarity=0.284 Sum_probs=43.6
Q ss_pred CccccccccCCceeeeeccch-hhhhhChhhhhHHHhHhhhhcccccCCCCCceeec---------------ccccCchH
Q psy5120 2 PLQSAFLMTTGFKSYLNQYST-NQIALNKTQRNEERDKKRHMSHKFSLTPTGEFKWA---------------GPVHGPRN 65 (138)
Q Consensus 2 ~~~frlG~e~~fk~Y~NqYs~-N~~Alnk~qr~eerDkkr~ls~KFslt~~~eFkW~---------------g~~~Gs~~ 65 (138)
|++||||--+.+.+- .|+. +.++.- -.|+-.=+.||..+|.-..-+..+=. |-+.|.+.
T Consensus 6 P~g~Rlg~~~~~~s~--W~~~~~~y~~~---l~ed~~IR~yL~k~~~~agis~I~I~R~~~~i~I~I~t~rPg~vIG~~G 80 (109)
T cd02412 6 PLGFRLGITKDWDSR--WYADKKDYAEL---LHEDLKIRKFIKKKLKKAGISRIEIERKADRVEVTIHTARPGIIIGKKG 80 (109)
T ss_pred CcccccCccCCCcce--EcCCchhhHHH---HHhHHHHHHHHHHHHhhCCccEEEEEEcCCCEEEEEEeCCCCcccCCch
Confidence 999999987766654 6654 244432 33444445555555543333322111 45678888
Q ss_pred HHHHHHHHHHHHH
Q psy5120 66 VLVNTLRATLLQL 78 (138)
Q Consensus 66 ~~~~tlr~Ti~~l 78 (138)
..++.|+..|.++
T Consensus 81 ~~i~~L~~~l~~~ 93 (109)
T cd02412 81 AGIEKLRKELQKL 93 (109)
T ss_pred HHHHHHHHHHHHH
Confidence 8889998887775
No 9
>PF14222 MOR2-PAG1_N: Cell morphogenesis N-terminal
Probab=38.61 E-value=68 Score=29.72 Aligned_cols=69 Identities=17% Similarity=0.275 Sum_probs=53.5
Q ss_pred HHHHHHHHHHHHHHhcCcccCCCCchhh-hhhHHHHhhcccCHHHHHHHH---HHHhhhhcchhhhhhhh-hhc
Q psy5120 67 LVNTLRATLLQLESSIIPAFMHVNWPSL-RKPWISAVNTSVKARDFARAM---IVLQACIKPVVFASVWH-EQL 135 (138)
Q Consensus 67 ~~~tlr~Ti~~le~~Ip~~fmH~nW~~~-rk~W~~~V~~~~~~~~fA~aL---~~le~~ir~vvf~~vW~-e~L 135 (138)
....+..-|.-++..+-..+.||.|.+. -+-.-++..++.+++.-+.+. ..+=|+=.|-.|...|. +.|
T Consensus 151 ~~~~l~~lLlPvA~~~~~evn~P~W~~~v~~i~~~~~~~~~K~khw~~afPL~t~lLCvS~~e~F~~~W~~~~i 224 (552)
T PF14222_consen 151 YCEVLVELLLPVAATATAEVNHPKWKEAVETIYPRAAKMMSKPKHWNVAFPLVTTLLCVSPKEFFLSNWLPSLI 224 (552)
T ss_pred HHHHHHHHHHHHhhcccccccccHHHHHHHHHHHHHHHHHhCcchhhhHHHHHHHHHhcCcHHHHHHHHHHHHH
Confidence 4456666777789999999999999975 566777888888888776653 34557778999999999 555
No 10
>PF02284 COX5A: Cytochrome c oxidase subunit Va; InterPro: IPR003204 Cytochrome c oxidase (1.9.3.1 from EC) is an oligomeric enzymatic complex which is a component of the respiratory chain complex and is involved in the transfer of electrons from cytochrome c to oxygen []. In eukaryotes this enzyme complex is located in the mitochondrial inner membrane; in aerobic prokaryotes it is found in the plasma membrane. In eukaryotes, in addition to the three large subunits, I, II and III, that form the catalytic centre of the enzyme complex, there are a variable number of small polypeptidic subunits. One of these subunits is known as Va.; GO: 0004129 cytochrome-c oxidase activity; PDB: 2DYR_R 3AG1_E 3ABL_E 1V54_R 2EIJ_R 1OCR_E 2DYS_E 2EIM_E 2OCC_E 3ASN_R ....
Probab=37.94 E-value=15 Score=27.81 Aligned_cols=28 Identities=25% Similarity=0.283 Sum_probs=22.4
Q ss_pred hhHHHHhhcccCHHHHHHHHHHHhhhhc
Q psy5120 96 KPWISAVNTSVKARDFARAMIVLQACIK 123 (138)
Q Consensus 96 k~W~~~V~~~~~~~~fA~aL~~le~~ir 123 (138)
+.=+.|+..|+...|||.|.-+||+.=-
T Consensus 46 ~ii~aALrAcRRvND~a~AVR~lE~iK~ 73 (108)
T PF02284_consen 46 KIIEAALRACRRVNDFALAVRILEGIKD 73 (108)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHH
Confidence 3446789999999999999999998643
No 11
>PF14733 ACDC: AP2-coincident C-terminal
Probab=37.40 E-value=99 Score=21.51 Aligned_cols=62 Identities=18% Similarity=0.328 Sum_probs=42.8
Q ss_pred HHHHHHHHHHHhcCcccCCC--C-----chhhhhhHHHHhhcccCHHHHHHHHHHHhhhhcchhhhhhh
Q psy5120 70 TLRATLLQLESSIIPAFMHV--N-----WPSLRKPWISAVNTSVKARDFARAMIVLQACIKPVVFASVW 131 (138)
Q Consensus 70 tlr~Ti~~le~~Ip~~fmH~--n-----W~~~rk~W~~~V~~~~~~~~fA~aL~~le~~ir~vvf~~vW 131 (138)
+|.--|.-|..+..+.+... + +...=+.=++.|..|.+..++..-|.+|+.||+.-+.-+.+
T Consensus 11 Ai~~iL~DL~~~c~~~~~~~~~~~~~~~~~~~i~~H~~~V~~a~~~~~L~pyl~lF~~cI~~~~LPS~l 79 (91)
T PF14733_consen 11 AILLILSDLKDNCLPNLFSKLLNASRKIYKRVIDYHIRYVNSAKNIEELQPYLALFSNCIKNNILPSQL 79 (91)
T ss_pred HHHHHHHHHHHhhchhhhhhhhhhhHHHHHHHHHHHHHHHHHccCHHHHHHHHHHHHHHHHcCCCCCcC
Confidence 44555666665555555332 2 33344455678999999999999999999999987655443
No 12
>PF14684 Tricorn_C1: Tricorn protease C1 domain; PDB: 1N6F_D 1N6D_C 1N6E_C 1K32_A.
Probab=37.33 E-value=35 Score=22.76 Aligned_cols=34 Identities=6% Similarity=0.215 Sum_probs=27.6
Q ss_pred CchhhhhhHHHHhhcccCHHHHHHHHHHHhhhhc
Q psy5120 90 NWPSLRKPWISAVNTSVKARDFARAMIVLQACIK 123 (138)
Q Consensus 90 nW~~~rk~W~~~V~~~~~~~~fA~aL~~le~~ir 123 (138)
+|...+......|..|.+..+|..+|..+-+-++
T Consensus 28 DW~~~~~~Y~p~v~~~~~~~el~~vl~eMl~eL~ 61 (70)
T PF14684_consen 28 DWDAVYDRYRPLVPAAKTRDELYDVLNEMLGELN 61 (70)
T ss_dssp HHHHHHHHHHGGGGG--SHHHHHHHHHHHHHTT-
T ss_pred ChHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHC
Confidence 7999999999999999999999999987766543
No 13
>TIGR01009 rpsC_bact ribosomal protein S3, bacterial type. TIGRFAMs model TIGR01008 describes S3 of the eukaryotic cytosol and of the archaea.
Probab=36.59 E-value=67 Score=26.19 Aligned_cols=72 Identities=21% Similarity=0.293 Sum_probs=45.1
Q ss_pred CccccccccCCceeeeeccch-hhhhhChhhhhHHHhHhhhhcccccCCCCCceeec---------------ccccCchH
Q psy5120 2 PLQSAFLMTTGFKSYLNQYST-NQIALNKTQRNEERDKKRHMSHKFSLTPTGEFKWA---------------GPVHGPRN 65 (138)
Q Consensus 2 ~~~frlG~e~~fk~Y~NqYs~-N~~Alnk~qr~eerDkkr~ls~KFslt~~~eFkW~---------------g~~~Gs~~ 65 (138)
|++||||--+++.+. .|+. +.++. --.|+.-=+.++..+|.-..-+..+.. |-+.|...
T Consensus 7 P~gfRLGi~~~w~S~--Wfa~~k~Y~~---~l~eD~~IR~~i~k~~~~agis~IeI~rt~~~i~I~I~~~~pg~vIG~~g 81 (211)
T TIGR01009 7 PIGFRLGITKDWKSR--WYANPKEYAK---LLHEDLKIRNYIKKELSNAGISDVEIERPADKIRVTIHTARPGIVIGKKG 81 (211)
T ss_pred CcccccccccCCceE--EccCcchhHH---HHHHHHHHHHHHHHHhhhCCcceEEEEEcCCceEEEEEeCCCcceeCCCc
Confidence 999999998888876 6665 22332 234555555666665654444444443 24457777
Q ss_pred HHHHHHHHHHHHH
Q psy5120 66 VLVNTLRATLLQL 78 (138)
Q Consensus 66 ~~~~tlr~Ti~~l 78 (138)
.-++.|+..|.+.
T Consensus 82 ~~i~~l~~~l~~~ 94 (211)
T TIGR01009 82 SEIEKLRKDLQKL 94 (211)
T ss_pred hHHHHHHHHHHHH
Confidence 7777787777664
No 14
>KOG4077|consensus
Probab=34.65 E-value=25 Score=27.90 Aligned_cols=27 Identities=30% Similarity=0.345 Sum_probs=23.1
Q ss_pred hhHHHHhhcccCHHHHHHHHHHHhhhh
Q psy5120 96 KPWISAVNTSVKARDFARAMIVLQACI 122 (138)
Q Consensus 96 k~W~~~V~~~~~~~~fA~aL~~le~~i 122 (138)
|.=.+++-.|+.+.|||.|.-+||++=
T Consensus 85 kvIEaaLRA~RRvNDfa~aVRilE~iK 111 (149)
T KOG4077|consen 85 KVIEAALRACRRVNDFATAVRILEAIK 111 (149)
T ss_pred HHHHHHHHHHHHhccHHHHHHHHHHHH
Confidence 445678999999999999999999863
No 15
>cd03565 VHS_Tom1 VHS domain family, Tom1 subfamily; The VHS domain is an essential part of Tom1 (Target of myb1 - retroviral oncogene) protein. The VHS domain has a superhelical structure similar to the structure of the ARM repeats and is present at the very N-termini of proteins. It is a right-handed superhelix of eight alpha helices. The VHS domain has been found in a number of proteins, some of which have been implicated in intracellular trafficking and sorting. The VHS domain of the Tom1 protein is essential for the negative regulation of Interleukin-1 and Tumor Necrosis Factor-induced signaling pathways.
Probab=31.57 E-value=47 Score=25.15 Aligned_cols=24 Identities=21% Similarity=0.497 Sum_probs=20.2
Q ss_pred hhcccCHHHHHHHHHHHhhhhcch
Q psy5120 102 VNTSVKARDFARAMIVLQACIKPV 125 (138)
Q Consensus 102 V~~~~~~~~fA~aL~~le~~ir~v 125 (138)
++...+|+..-.||.+||+|++.+
T Consensus 47 l~~~~n~~v~l~aL~LLe~~vkNC 70 (141)
T cd03565 47 LNGNKNHKEVMLTLTVLETCVKNC 70 (141)
T ss_pred HccCCCHHHHHHHHHHHHHHHHHc
Confidence 655678888888999999999876
No 16
>PF09326 DUF1982: Domain of unknown function (DUF1982); InterPro: IPR015405 This C-terminal domain is functionally uncharacterised and is found in various prokaryotic NADH dehydrogenases including NADH-quinone oxidoreductase, chain G. ; GO: 0016651 oxidoreductase activity, acting on NADH or NADPH, 0051536 iron-sulfur cluster binding, 0055114 oxidation-reduction process
Probab=30.88 E-value=18 Score=23.20 Aligned_cols=13 Identities=23% Similarity=0.286 Sum_probs=10.6
Q ss_pred eeccchhhhhhCh
Q psy5120 17 LNQYSTNQIALNK 29 (138)
Q Consensus 17 ~NqYs~N~~Alnk 29 (138)
.|+|.||||+.+-
T Consensus 31 ~dfY~Td~IsRAS 43 (49)
T PF09326_consen 31 KDFYMTDPISRAS 43 (49)
T ss_pred ccccccchhhhhh
Confidence 4799999998754
No 17
>PF11491 DUF3213: Protein of unknown function (DUF3213) ; InterPro: IPR021583 The backbone structure of this family of proteins has been determined however the function remains unknown. The protein has an alpha and beta structure with a ferredoxin-like fold []. ; PDB: 2F40_A.
Probab=29.40 E-value=19 Score=26.39 Aligned_cols=24 Identities=17% Similarity=0.331 Sum_probs=12.2
Q ss_pred cccccc-ccCCceeeeeccchhhhh
Q psy5120 3 LQSAFL-MTTGFKSYLNQYSTNQIA 26 (138)
Q Consensus 3 ~~frlG-~e~~fk~Y~NqYs~N~~A 26 (138)
.+|+|. +|+.|+.+.|-|+.|-..
T Consensus 16 ~QYeLsk~~~vyRvFiNgYar~g~V 40 (88)
T PF11491_consen 16 KQYELSKNEAVYRVFINGYARNGFV 40 (88)
T ss_dssp HHHTTTTTTTB------TTSS--EE
T ss_pred HHHHhhcccceeeeeecccccceEE
Confidence 467777 589999999999998643
No 18
>PF11230 DUF3029: Protein of unknown function (DUF3029); InterPro: IPR016905 Members of this family are homologs to enzymes known to undergo activation by a radical SAM protein to create an active site glycyl radical. This family appears to be activated by the YjjW radical SAM protein, usually encoded by an adjacent gene.
Probab=25.82 E-value=1e+02 Score=28.71 Aligned_cols=30 Identities=23% Similarity=0.346 Sum_probs=27.3
Q ss_pred chHHHHHHHHHHHHHHHHhcCcccCCCCch
Q psy5120 63 PRNVLVNTLRATLLQLESSIIPAFMHVNWP 92 (138)
Q Consensus 63 s~~~~~~tlr~Ti~~le~~Ip~~fmH~nW~ 92 (138)
+...+-+-||.-+++|.+-||-+|||-|=-
T Consensus 114 ~eee~~~klk~Fl~~lDRTi~DsF~HaNIG 143 (487)
T PF11230_consen 114 DEEEAYKKLKLFLIHLDRTIPDSFCHANIG 143 (487)
T ss_pred CHHHHHHHHHHHHHHhhccCcchhhccCCC
Confidence 777889999999999999999999998853
No 19
>PHA02675 ORF104 fusion protein; Provisional
Probab=25.51 E-value=74 Score=23.37 Aligned_cols=20 Identities=35% Similarity=0.426 Sum_probs=17.3
Q ss_pred hHHHHHHHHHHHHHHHHhcC
Q psy5120 64 RNVLVNTLRATLLQLESSII 83 (138)
Q Consensus 64 ~~~~~~tlr~Ti~~le~~Ip 83 (138)
-..+..|||.+|+.|..+|-
T Consensus 63 LE~H~ETLRk~Ml~L~KKID 82 (90)
T PHA02675 63 LERHLETLREALLKLNTKID 82 (90)
T ss_pred HHHHHHHHHHHHHHHHhhcc
Confidence 34789999999999999884
No 20
>PF14910 MMS22L_N: S-phase genomic integrity recombination mediator, N-terminal
Probab=24.76 E-value=69 Score=31.12 Aligned_cols=45 Identities=18% Similarity=0.378 Sum_probs=38.1
Q ss_pred ceeecc--cccCchHHHHHHHHHHHHHHHHhcC-------cccCCCCchhhhhh
Q psy5120 53 EFKWAG--PVHGPRNVLVNTLRATLLQLESSII-------PAFMHVNWPSLRKP 97 (138)
Q Consensus 53 eFkW~g--~~~Gs~~~~~~tlr~Ti~~le~~Ip-------~~fmH~nW~~~rk~ 97 (138)
.|.|+. .+.-+...+...+|+.|.+||+.|. +.=+|-.|...|..
T Consensus 50 g~~wVte~al~es~~~LF~l~rqq~~~le~llq~~~~~~~~~~lh~~~~~iRqq 103 (704)
T PF14910_consen 50 GFQWVTETALVESCQKLFGLFRQQIYKLEQLLQSSCDFGQASSLHSEAENIRQQ 103 (704)
T ss_pred ceeeehHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCHHHHHHH
Confidence 488886 5566778999999999999999999 57799999988864
No 21
>PF04510 DUF577: Family of unknown function (DUF577); InterPro: IPR007598 This is a family of Arabidopsis thaliana (Mouse-ear cress) proteins. Many of these members contain a repeated region.
Probab=24.74 E-value=1.8e+02 Score=23.64 Aligned_cols=55 Identities=11% Similarity=0.154 Sum_probs=41.5
Q ss_pred CchHHHHHHHHHHHHHHHHhcCcccCCCCchhhhhhHHHHhhcccCHHHHHHHHHHHhhhh
Q psy5120 62 GPRNVLVNTLRATLLQLESSIIPAFMHVNWPSLRKPWISAVNTSVKARDFARAMIVLQACI 122 (138)
Q Consensus 62 Gs~~~~~~tlr~Ti~~le~~Ip~~fmH~nW~~~rk~W~~~V~~~~~~~~fA~aL~~le~~i 122 (138)
|++.-..+.||.+..-...+| ...---.|++++. .| ..- +-.+|.+|..+|.|.-
T Consensus 16 ~~k~s~~KiL~~iVs~Va~~v-~~~~~~~W~eL~d-~I---ls~-~~~e~~kA~~IF~~L~ 70 (174)
T PF04510_consen 16 ETKESDFKILRRIVSHVAYEV-FDLQEGGWDELSD-CI---LSL-SENEPVKAFHIFICLP 70 (174)
T ss_pred cccHhHHHHHHHHHHHHHHHH-HhcCCCCchhHHH-HH---HHh-hccchHHHHHHHHhCC
Confidence 667778999999999999998 2234567999993 23 222 2678899999998864
No 22
>PF02064 MAS20: MAS20 protein import receptor; InterPro: IPR002056 Virtually all mitochondrial precursors are imported via the same mechanism []: precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 []. MAS20 is also known as TOM20.; GO: 0006605 protein targeting, 0006886 intracellular protein transport, 0005742 mitochondrial outer membrane translocase complex; PDB: 3AX3_A 3AWR_B 2V1S_A 3AX5_C 3AX2_C 1OM2_A 2V1T_B.
Probab=24.38 E-value=83 Score=23.81 Aligned_cols=38 Identities=13% Similarity=0.341 Sum_probs=29.8
Q ss_pred CchhhhhhHHHHhhcccCHHHHHHHHHHHhhhhcchhhhhh
Q psy5120 90 NWPSLRKPWISAVNTSVKARDFARAMIVLQACIKPVVFASV 130 (138)
Q Consensus 90 nW~~~rk~W~~~V~~~~~~~~fA~aL~~le~~ir~vvf~~v 130 (138)
+...=-..+.+|+..|.+|.+ .|.+++-.+-|-||..+
T Consensus 78 ~~~~aa~hf~nAl~V~~qP~~---LL~i~q~tlP~~vf~~l 115 (121)
T PF02064_consen 78 DYEEAAEHFYNALKVCPQPAE---LLQIYQKTLPPPVFEML 115 (121)
T ss_dssp -HHHHHHHHHHHHHTSSSHHH---HHHHHHHHS-HHHHHHH
T ss_pred CHHHHHHHHHHHHHhCCCHHH---HHHHHHhhCCHHHHHHH
Confidence 345566779999999999986 58899999999988764
No 23
>KOG4061|consensus
Probab=23.52 E-value=62 Score=27.02 Aligned_cols=36 Identities=17% Similarity=0.538 Sum_probs=23.4
Q ss_pred hhhhhhHHHHhhcccCHHHHHHHHHHHhhhhcchhhhhhhh
Q psy5120 92 PSLRKPWISAVNTSVKARDFARAMIVLQACIKPVVFASVWH 132 (138)
Q Consensus 92 ~~~rk~W~~~V~~~~~~~~fA~aL~~le~~ir~vvf~~vW~ 132 (138)
|...++|+++-.--.+...+ ++.--+||+|+.|+|.
T Consensus 81 pvi~hmWdqEk~Hl~tf~~l-----~~k~rVrpT~l~P~w~ 116 (217)
T KOG4061|consen 81 PVIKHMWDQEKEHLKTFENL-----ALKHRVRPTVLTPLWN 116 (217)
T ss_pred hHHHHHHHHHHHHHHHHHHH-----HHHccCCchhhhhHHH
Confidence 34566777766544444443 3455689999999995
No 24
>PF13976 gag_pre-integrs: GAG-pre-integrase domain
Probab=22.93 E-value=41 Score=21.35 Aligned_cols=14 Identities=29% Similarity=0.631 Sum_probs=11.0
Q ss_pred hhhhhhhhhhcCCC
Q psy5120 125 VVFASVWHEQLVHA 138 (138)
Q Consensus 125 vvf~~vW~e~LGht 138 (138)
.-=...||.-|||.
T Consensus 21 ~~~~~lWH~RLGH~ 34 (67)
T PF13976_consen 21 KSSTDLWHRRLGHP 34 (67)
T ss_pred CCCHHHHHHhcCCC
Confidence 34467899999994
No 25
>PF11013 DUF2851: Protein of unknown function (DUF2851); InterPro: IPR021272 This bacterial family of proteins has no known function.
Probab=22.29 E-value=37 Score=30.24 Aligned_cols=68 Identities=21% Similarity=0.345 Sum_probs=48.6
Q ss_pred HHhHhhhhcccccCCCCCceeecccccCchHHHHHHHHHHHHHHHHhcCcccCCCCchhhhhhHHHHhhcccCHHHHHHH
Q psy5120 35 ERDKKRHMSHKFSLTPTGEFKWAGPVHGPRNVLVNTLRATLLQLESSIIPAFMHVNWPSLRKPWISAVNTSVKARDFARA 114 (138)
Q Consensus 35 erDkkr~ls~KFslt~~~eFkW~g~~~Gs~~~~~~tlr~Ti~~le~~Ip~~fmH~nW~~~rk~W~~~V~~~~~~~~fA~a 114 (138)
=++.=.||.+||.+++...+.|. +++.++.=.=|+|.. ||. .++|.. .+.-..+..|++.+++...
T Consensus 240 L~~ey~~L~~K~~L~~~~~~~w~--f~rlRP~NFPtiRLA--qLA-----~L~~~~-----~~Lfs~l~~~~~~~~l~~l 305 (377)
T PF11013_consen 240 LKKEYQFLQHKFQLQPMDRSEWK--FFRLRPANFPTIRLA--QLA-----ALYHKS-----QNLFSKLLEAKSLEELYKL 305 (377)
T ss_pred HHHHHHHHHHhcCCCCCCcccce--ECCCCCCCCCHHHHH--HHH-----HHHHcC-----chHHHHHHccCCHHHHHHH
Confidence 34445789999999998889994 889888777777754 443 344544 3333446688999988887
Q ss_pred HH
Q psy5120 115 MI 116 (138)
Q Consensus 115 L~ 116 (138)
+.
T Consensus 306 f~ 307 (377)
T PF11013_consen 306 FS 307 (377)
T ss_pred hC
Confidence 75
No 26
>PF13960 DUF4218: Domain of unknown function (DUF4218)
Probab=21.83 E-value=82 Score=24.16 Aligned_cols=27 Identities=30% Similarity=0.337 Sum_probs=22.7
Q ss_pred HHHHHHHHHHHHHHHhcCcccCCCCch
Q psy5120 66 VLVNTLRATLLQLESSIIPAFMHVNWP 92 (138)
Q Consensus 66 ~~~~tlr~Ti~~le~~Ip~~fmH~nW~ 92 (138)
.+-+.|..|+++||.-.|++|+.++=-
T Consensus 25 ~L~~~I~~~lc~lE~ifppsffdim~H 51 (128)
T PF13960_consen 25 ELEEEIVETLCQLEMIFPPSFFDIMVH 51 (128)
T ss_pred HHHHHHHHHHHHHHHHcchhHhhhhHH
Confidence 366788899999999999999987643
No 27
>PF09674 DUF2400: Protein of unknown function (DUF2400); InterPro: IPR014127 Members of this uncharacterised protein family are found sporadically, so far only among spirochetes, epsilon and delta proteobacteria, and Bacteroides. The function is unknown and its gene neighbourhoods show little conservation.
Probab=21.45 E-value=95 Score=25.89 Aligned_cols=64 Identities=16% Similarity=0.083 Sum_probs=48.4
Q ss_pred cCchHHHHHHHHHHHHHHHHhcCcccCCCCchhhhhhHHHHhhcccCHHHHHHHHHHHhhhhcc
Q psy5120 61 HGPRNVLVNTLRATLLQLESSIIPAFMHVNWPSLRKPWISAVNTSVKARDFARAMIVLQACIKP 124 (138)
Q Consensus 61 ~Gs~~~~~~tlr~Ti~~le~~Ip~~fmH~nW~~~rk~W~~~V~~~~~~~~fA~aL~~le~~ir~ 124 (138)
||++.+|++.+..-+.-+..+--..+++-++....+....-...=.+.+|++..+..|...++.
T Consensus 27 yG~~~~I~~~~~~ll~~~~~~P~~~v~~~~~~~~~~~~~~~~yRf~~~~D~~~~~~~l~~i~~~ 90 (232)
T PF09674_consen 27 YGNRKQIIKKLERLLDLMGPSPYDFVLSGDEKDDRKDLEGFFYRFQNGEDMYAFFIALKRIYQE 90 (232)
T ss_pred ccCHHHHHHHHHHHHHHhCCCHHHHHHcCCHhhhHHHccCCCcCCCCHHHHHHHHHHHHHHHHc
Confidence 7999999999999988777765555667777766666555556667888888888777776654
Done!