Query         psy5120
Match_columns 138
No_of_seqs    37 out of 39
Neff          4.1 
Searched_HMMs 46136
Date          Fri Aug 16 17:51:32 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy5120.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5120hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1473|consensus               99.9 1.9E-28 4.1E-33  232.6   3.6  134    3-138   572-707 (1414)
  2 KOG1473|consensus               98.1 1.8E-06 3.8E-11   84.4   2.8  112    4-115   835-958 (1414)
  3 PF10929 DUF2811:  Protein of u  55.2      18 0.00038   24.6   3.1   11   87-97     20-30  (57)
  4 PF09906 DUF2135:  Uncharacteri  54.6     6.8 0.00015   25.2   1.0   17   10-26      3-19  (50)
  5 cd00923 Cyt_c_Oxidase_Va Cytoc  50.3      11 0.00024   28.3   1.7   30   96-125    43-72  (103)
  6 cd07562 Peptidase_S41_TRI Tric  43.5      57  0.0012   26.5   5.0   37   86-122    22-58  (266)
  7 PF02346 Vac_Fusion:  Chordopox  40.3      33 0.00072   23.1   2.7   19   65-83     35-53  (57)
  8 cd02412 30S_S3_KH K homology R  39.9      78  0.0017   22.9   4.8   72    2-78      6-93  (109)
  9 PF14222 MOR2-PAG1_N:  Cell mor  38.6      68  0.0015   29.7   5.3   69   67-135   151-224 (552)
 10 PF02284 COX5A:  Cytochrome c o  37.9      15 0.00032   27.8   0.8   28   96-123    46-73  (108)
 11 PF14733 ACDC:  AP2-coincident   37.4      99  0.0021   21.5   4.9   62   70-131    11-79  (91)
 12 PF14684 Tricorn_C1:  Tricorn p  37.3      35 0.00077   22.8   2.5   34   90-123    28-61  (70)
 13 TIGR01009 rpsC_bact ribosomal   36.6      67  0.0015   26.2   4.4   72    2-78      7-94  (211)
 14 KOG4077|consensus               34.7      25 0.00055   27.9   1.6   27   96-122    85-111 (149)
 15 cd03565 VHS_Tom1 VHS domain fa  31.6      47   0.001   25.1   2.6   24  102-125    47-70  (141)
 16 PF09326 DUF1982:  Domain of un  30.9      18  0.0004   23.2   0.2   13   17-29     31-43  (49)
 17 PF11491 DUF3213:  Protein of u  29.4      19 0.00041   26.4   0.1   24    3-26     16-40  (88)
 18 PF11230 DUF3029:  Protein of u  25.8   1E+02  0.0023   28.7   4.2   30   63-92    114-143 (487)
 19 PHA02675 ORF104 fusion protein  25.5      74  0.0016   23.4   2.6   20   64-83     63-82  (90)
 20 PF14910 MMS22L_N:  S-phase gen  24.8      69  0.0015   31.1   3.0   45   53-97     50-103 (704)
 21 PF04510 DUF577:  Family of unk  24.7 1.8E+02  0.0039   23.6   4.9   55   62-122    16-70  (174)
 22 PF02064 MAS20:  MAS20 protein   24.4      83  0.0018   23.8   2.8   38   90-130    78-115 (121)
 23 KOG4061|consensus               23.5      62  0.0013   27.0   2.1   36   92-132    81-116 (217)
 24 PF13976 gag_pre-integrs:  GAG-  22.9      41 0.00089   21.3   0.8   14  125-138    21-34  (67)
 25 PF11013 DUF2851:  Protein of u  22.3      37  0.0008   30.2   0.6   68   35-116   240-307 (377)
 26 PF13960 DUF4218:  Domain of un  21.8      82  0.0018   24.2   2.4   27   66-92     25-51  (128)
 27 PF09674 DUF2400:  Protein of u  21.4      95  0.0021   25.9   2.8   64   61-124    27-90  (232)

No 1  
>KOG1473|consensus
Probab=99.94  E-value=1.9e-28  Score=232.60  Aligned_cols=134  Identities=22%  Similarity=0.226  Sum_probs=131.6

Q ss_pred             ccccccccCCceeeeeccchhhhhhChhhhhHHHhHhhhhcccccCCCCCceeecc--cccCchHHHHHHHHHHHHHHHH
Q psy5120           3 LQSAFLMTTGFKSYLNQYSTNQIALNKTQRNEERDKKRHMSHKFSLTPTGEFKWAG--PVHGPRNVLVNTLRATLLQLES   80 (138)
Q Consensus         3 ~~frlG~e~~fk~Y~NqYs~N~~Alnk~qr~eerDkkr~ls~KFslt~~~eFkW~g--~~~Gs~~~~~~tlr~Ti~~le~   80 (138)
                      +.|.+|+|++|+.|  ||++|++++++.+++|++|+|+||++||++++.+.+.|.|  .++|+..++.+++++|+.+.++
T Consensus       572 ~~~e~~~dqtf~~y--~ys~n~vse~~~~d~e~~dkk~~~~tkf~l~~nsd~~~~g~~~t~gt~~~~~~~~~~t~~~~lS  649 (1414)
T KOG1473|consen  572 RLREEGNDQTFMKY--YYSGNEVSEIFLTDSENADKKSHMQTKFALITNSDGVTAGNVTTYGTGSQHKKLIARTLQQGLS  649 (1414)
T ss_pred             hhhhcccccchhhh--cccCCchhhccCCchhhhcccccccceecccccccceeccccccccchhhcchHHHhhhhhhhc
Confidence            46889999999999  9999999999999999999999999999999999999999  8999999999999999999999


Q ss_pred             hcCcccCCCCchhhhhhHHHHhhcccCHHHHHHHHHHHhhhhcchhhhhhhhhhcCCC
Q psy5120          81 SIIPAFMHVNWPSLRKPWISAVNTSVKARDFARAMIVLQACIKPVVFASVWHEQLVHA  138 (138)
Q Consensus        81 ~Ip~~fmH~nW~~~rk~W~~~V~~~~~~~~fA~aL~~le~~ir~vvf~~vW~e~LGht  138 (138)
                      +||.++||++|+.+.|.|..||+.+.++.+|+-+++.++|++||++|+++|||+||||
T Consensus       650 niP~s~~n~~w~~~tkg~~lavs~A~~~~el~s~t~~~d~s~~~~~~~~~ssn~L~qt  707 (1414)
T KOG1473|consen  650 NIPISYNNRKWPVYTKGFELAVSAAADLAELSSETLEPDLSKRSNAFKAASSNILGQT  707 (1414)
T ss_pred             cCchHhhhccchhhccchhhhhhccchHHHHHHhhcccchhhhhhhhccchhhhhcch
Confidence            9999999999999999999999999999999999999999999999999999999997


No 2  
>KOG1473|consensus
Probab=98.09  E-value=1.8e-06  Score=84.42  Aligned_cols=112  Identities=13%  Similarity=0.048  Sum_probs=97.8

Q ss_pred             cccccccCCceeeeeccchhhhhhChhhhhHHHhHhhh-----hcccccCCCCCceeecc--cccCchHHHHHHHH-HHH
Q psy5120           4 QSAFLMTTGFKSYLNQYSTNQIALNKTQRNEERDKKRH-----MSHKFSLTPTGEFKWAG--PVHGPRNVLVNTLR-ATL   75 (138)
Q Consensus         4 ~frlG~e~~fk~Y~NqYs~N~~Alnk~qr~eerDkkr~-----ls~KFslt~~~eFkW~g--~~~Gs~~~~~~tlr-~Ti   75 (138)
                      ||..++|+-++.|+++|.+|+.+++++|+++++++|++     +++|+++++.+.++|.+  .+.+.+++...+++ ++.
T Consensus       835 y~~~~~~n~~rie~s~~~~ng~~v~akQ~r~pgr~~~s~~~ek~A~~s~ld~f~~~Rggks~vvl~kavL~~~~mk~~v~  914 (1414)
T KOG1473|consen  835 YWPKQMDNWLRIEHSIFQSNGVTVGAKQARDPGRTKQSLQAEKTAPKSDLDSFTWWRGGKSKVVLQKAVLSQSIMKKLVW  914 (1414)
T ss_pred             cchhhccCceeeeechhccCceeechhhhcCCcchhhhcchhhccccccccchhhhhcCcceeeehhhhcchHHHHHHhh
Confidence            68889999999999999999999999999999999999     88999999999999988  67899999999999 788


Q ss_pred             HHHHHhcCcccCCC----CchhhhhhHHHHhhcccCHHHHHHHH
Q psy5120          76 LQLESSIIPAFMHV----NWPSLRKPWISAVNTSVKARDFARAM  115 (138)
Q Consensus        76 ~~le~~Ip~~fmH~----nW~~~rk~W~~~V~~~~~~~~fA~aL  115 (138)
                      .+.+..+|..=+=+    -=++.|.-|-.+|..+.+...||.-|
T Consensus       915 ~~g~ta~~k~nfl~~~y~p~~s~~s~wk~av~n~enlh~LAlQ~  958 (1414)
T KOG1473|consen  915 QQGFTAGPKSNFLDWSYIPRRSRRSCWKAAVENSENLHQLALQL  958 (1414)
T ss_pred             ccccccCCcccccccccccchhhhhhhhhhhcChhhHHHHHHHH
Confidence            88887776432211    11468999999999999999999877


No 3  
>PF10929 DUF2811:  Protein of unknown function (DUF2811);  InterPro: IPR021231  This is a bacterial family of uncharacterised proteins. 
Probab=55.20  E-value=18  Score=24.55  Aligned_cols=11  Identities=36%  Similarity=0.604  Sum_probs=9.1

Q ss_pred             CCCCchhhhhh
Q psy5120          87 MHVNWPSLRKP   97 (138)
Q Consensus        87 mH~nW~~~rk~   97 (138)
                      .||+|.++|-.
T Consensus        20 ~hP~WDQ~Rl~   30 (57)
T PF10929_consen   20 THPNWDQYRLF   30 (57)
T ss_pred             cCCCchHHHHH
Confidence            49999999854


No 4  
>PF09906 DUF2135:  Uncharacterized protein conserved in bacteria (DUF2135);  InterPro: IPR019220  This entry, found in various hypothetical prokaryotic proteins, has no known function. 
Probab=54.61  E-value=6.8  Score=25.24  Aligned_cols=17  Identities=18%  Similarity=0.393  Sum_probs=14.0

Q ss_pred             cCCceeeeeccchhhhh
Q psy5120          10 TTGFKSYLNQYSTNQIA   26 (138)
Q Consensus        10 e~~fk~Y~NqYs~N~~A   26 (138)
                      .|.|+.|||+|-.+...
T Consensus         3 ~G~Y~VyVnYyg~~~~~   19 (50)
T PF09906_consen    3 PGTYLVYVNYYGNRSQS   19 (50)
T ss_pred             CceEEEEEEeccCCCcC
Confidence            58899999999876554


No 5  
>cd00923 Cyt_c_Oxidase_Va Cytochrome c oxidase subunit Va. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit Va is one of three mammalian subunits that lacks a transmembrane region. Subunit Va is located on the matrix side of the membrane and binds thyroid hormone T2, releasing allosteric inhibition caused by the binding of ATP to subunit IV and allowing high turnover at elevated intramitochondrial ATP/ADP ratios.
Probab=50.27  E-value=11  Score=28.27  Aligned_cols=30  Identities=27%  Similarity=0.287  Sum_probs=24.5

Q ss_pred             hhHHHHhhcccCHHHHHHHHHHHhhhhcch
Q psy5120          96 KPWISAVNTSVKARDFARAMIVLQACIKPV  125 (138)
Q Consensus        96 k~W~~~V~~~~~~~~fA~aL~~le~~ir~v  125 (138)
                      +.=+.|+..|+...|||.|.-+||++=-+|
T Consensus        43 ~ii~aaLrAcRRvND~alAVR~lE~vK~K~   72 (103)
T cd00923          43 KVIEAALRACRRVNDFALAVRILEAIKDKC   72 (103)
T ss_pred             HHHHHHHHHHHHhhhHHHHHHHHHHHHHHc
Confidence            445788999999999999999999754333


No 6  
>cd07562 Peptidase_S41_TRI Tricorn protease; serine protease family S41. The tricorn protease (TRI), a member of the S41 peptidase family and named for its tricorn-like shape, exists only in some archaea and eubacteria. It has been shown to act as a carboxypeptidase, involved in the degradation of proteasomal products to preferentially yield di- and tripeptides, with subsequent and final degradations to free amino acid residues by tricorn interacting factors, F1, F2 and F3. Tricorn is a hexameric D3-symmetric protease of 720kD, and can self-associate further into a giant icosahedral capsid structure containing twenty copies of the complex. Each tricorn peptidase monomer consists of five structural domains: a six-bladed beta-propeller and a seven-bladed beta-propeller that limit access to the active site, the two domains (C1 and C2) that carry the active site residues, and a PDZ-like domain (proposed to be important for substrate recognition) between the C1 and C2 domains. The active sit
Probab=43.48  E-value=57  Score=26.49  Aligned_cols=37  Identities=11%  Similarity=0.298  Sum_probs=32.1

Q ss_pred             cCCCCchhhhhhHHHHhhcccCHHHHHHHHHHHhhhh
Q psy5120          86 FMHVNWPSLRKPWISAVNTSVKARDFARAMIVLQACI  122 (138)
Q Consensus        86 fmH~nW~~~rk~W~~~V~~~~~~~~fA~aL~~le~~i  122 (138)
                      +-..+|...|+.....|..+.+..||+.+|.-+-+.+
T Consensus        22 ~~g~dW~~~~~~y~~~~~~~~~~~e~~~~l~~ml~~L   58 (266)
T cd07562          22 MHGVDWDAVRAEYRPLLPRAATRAELADVLNEMLGEL   58 (266)
T ss_pred             CCCCCHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHH
Confidence            4456999999999999999999999999998776554


No 7  
>PF02346 Vac_Fusion:  Chordopoxvirus fusion protein;  InterPro: IPR003436 This is a family of viral fusion proteins from the Chordopoxvirinae. A 14kDa Vaccinia virus protein has been demonstrated to function as a viral fusion protein mediating cell fusion at endosmomal (low) pH []. The protein, found in the envelope fraction of the virions, is required for fusing the outermost of the two golgi-derived membranes enveloping the virus with the plasma membrane, and its subsequent release extracellularly. The N-terminal proximal region is essential for its fusion ability.; GO: 0019064 viral envelope fusion with host membrane, 0019031 viral envelope
Probab=40.34  E-value=33  Score=23.08  Aligned_cols=19  Identities=26%  Similarity=0.300  Sum_probs=16.9

Q ss_pred             HHHHHHHHHHHHHHHHhcC
Q psy5120          65 NVLVNTLRATLLQLESSII   83 (138)
Q Consensus        65 ~~~~~tlr~Ti~~le~~Ip   83 (138)
                      ..++.|||+.++-|+++|-
T Consensus        35 E~H~ETlRk~mv~L~kKiD   53 (57)
T PF02346_consen   35 EHHIETLRKYMVILAKKID   53 (57)
T ss_pred             HHHHHHHHHHHHHHHHhcc
Confidence            4789999999999999984


No 8  
>cd02412 30S_S3_KH K homology RNA-binding (KH) domain of the prokaryotic 30S small ribosomal subunit protein S3. S3  is part of the head region of the 30S ribosomal subunit and is believed to interact with mRNA as it threads its way from the latch into the channel.  The KH motif is a beta-alpha-alpha-beta-beta unit that folds into an alpha-beta structure with a three stranded beta-sheet interupted by two contiguous helices.  In general, KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA.
Probab=39.87  E-value=78  Score=22.87  Aligned_cols=72  Identities=19%  Similarity=0.284  Sum_probs=43.6

Q ss_pred             CccccccccCCceeeeeccch-hhhhhChhhhhHHHhHhhhhcccccCCCCCceeec---------------ccccCchH
Q psy5120           2 PLQSAFLMTTGFKSYLNQYST-NQIALNKTQRNEERDKKRHMSHKFSLTPTGEFKWA---------------GPVHGPRN   65 (138)
Q Consensus         2 ~~~frlG~e~~fk~Y~NqYs~-N~~Alnk~qr~eerDkkr~ls~KFslt~~~eFkW~---------------g~~~Gs~~   65 (138)
                      |++||||--+.+.+-  .|+. +.++.-   -.|+-.=+.||..+|.-..-+..+=.               |-+.|.+.
T Consensus         6 P~g~Rlg~~~~~~s~--W~~~~~~y~~~---l~ed~~IR~yL~k~~~~agis~I~I~R~~~~i~I~I~t~rPg~vIG~~G   80 (109)
T cd02412           6 PLGFRLGITKDWDSR--WYADKKDYAEL---LHEDLKIRKFIKKKLKKAGISRIEIERKADRVEVTIHTARPGIIIGKKG   80 (109)
T ss_pred             CcccccCccCCCcce--EcCCchhhHHH---HHhHHHHHHHHHHHHhhCCccEEEEEEcCCCEEEEEEeCCCCcccCCch
Confidence            999999987766654  6654 244432   33444445555555543333322111               45678888


Q ss_pred             HHHHHHHHHHHHH
Q psy5120          66 VLVNTLRATLLQL   78 (138)
Q Consensus        66 ~~~~tlr~Ti~~l   78 (138)
                      ..++.|+..|.++
T Consensus        81 ~~i~~L~~~l~~~   93 (109)
T cd02412          81 AGIEKLRKELQKL   93 (109)
T ss_pred             HHHHHHHHHHHHH
Confidence            8889998887775


No 9  
>PF14222 MOR2-PAG1_N:  Cell morphogenesis N-terminal
Probab=38.61  E-value=68  Score=29.72  Aligned_cols=69  Identities=17%  Similarity=0.275  Sum_probs=53.5

Q ss_pred             HHHHHHHHHHHHHHhcCcccCCCCchhh-hhhHHHHhhcccCHHHHHHHH---HHHhhhhcchhhhhhhh-hhc
Q psy5120          67 LVNTLRATLLQLESSIIPAFMHVNWPSL-RKPWISAVNTSVKARDFARAM---IVLQACIKPVVFASVWH-EQL  135 (138)
Q Consensus        67 ~~~tlr~Ti~~le~~Ip~~fmH~nW~~~-rk~W~~~V~~~~~~~~fA~aL---~~le~~ir~vvf~~vW~-e~L  135 (138)
                      ....+..-|.-++..+-..+.||.|.+. -+-.-++..++.+++.-+.+.   ..+=|+=.|-.|...|. +.|
T Consensus       151 ~~~~l~~lLlPvA~~~~~evn~P~W~~~v~~i~~~~~~~~~K~khw~~afPL~t~lLCvS~~e~F~~~W~~~~i  224 (552)
T PF14222_consen  151 YCEVLVELLLPVAATATAEVNHPKWKEAVETIYPRAAKMMSKPKHWNVAFPLVTTLLCVSPKEFFLSNWLPSLI  224 (552)
T ss_pred             HHHHHHHHHHHHhhcccccccccHHHHHHHHHHHHHHHHHhCcchhhhHHHHHHHHHhcCcHHHHHHHHHHHHH
Confidence            4456666777789999999999999975 566777888888888776653   34557778999999999 555


No 10 
>PF02284 COX5A:  Cytochrome c oxidase subunit Va;  InterPro: IPR003204 Cytochrome c oxidase (1.9.3.1 from EC) is an oligomeric enzymatic complex which is a component of the respiratory chain complex and is involved in the transfer of electrons from cytochrome c to oxygen []. In eukaryotes this enzyme complex is located in the mitochondrial inner membrane; in aerobic prokaryotes it is found in the plasma membrane.  In eukaryotes, in addition to the three large subunits, I, II and III, that form the catalytic centre of the enzyme complex, there are a variable number of small polypeptidic subunits. One of these subunits is known as Va.; GO: 0004129 cytochrome-c oxidase activity; PDB: 2DYR_R 3AG1_E 3ABL_E 1V54_R 2EIJ_R 1OCR_E 2DYS_E 2EIM_E 2OCC_E 3ASN_R ....
Probab=37.94  E-value=15  Score=27.81  Aligned_cols=28  Identities=25%  Similarity=0.283  Sum_probs=22.4

Q ss_pred             hhHHHHhhcccCHHHHHHHHHHHhhhhc
Q psy5120          96 KPWISAVNTSVKARDFARAMIVLQACIK  123 (138)
Q Consensus        96 k~W~~~V~~~~~~~~fA~aL~~le~~ir  123 (138)
                      +.=+.|+..|+...|||.|.-+||+.=-
T Consensus        46 ~ii~aALrAcRRvND~a~AVR~lE~iK~   73 (108)
T PF02284_consen   46 KIIEAALRACRRVNDFALAVRILEGIKD   73 (108)
T ss_dssp             HHHHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHhhhHHHHHHHHHHHHH
Confidence            3446789999999999999999998643


No 11 
>PF14733 ACDC:  AP2-coincident C-terminal
Probab=37.40  E-value=99  Score=21.51  Aligned_cols=62  Identities=18%  Similarity=0.328  Sum_probs=42.8

Q ss_pred             HHHHHHHHHHHhcCcccCCC--C-----chhhhhhHHHHhhcccCHHHHHHHHHHHhhhhcchhhhhhh
Q psy5120          70 TLRATLLQLESSIIPAFMHV--N-----WPSLRKPWISAVNTSVKARDFARAMIVLQACIKPVVFASVW  131 (138)
Q Consensus        70 tlr~Ti~~le~~Ip~~fmH~--n-----W~~~rk~W~~~V~~~~~~~~fA~aL~~le~~ir~vvf~~vW  131 (138)
                      +|.--|.-|..+..+.+...  +     +...=+.=++.|..|.+..++..-|.+|+.||+.-+.-+.+
T Consensus        11 Ai~~iL~DL~~~c~~~~~~~~~~~~~~~~~~~i~~H~~~V~~a~~~~~L~pyl~lF~~cI~~~~LPS~l   79 (91)
T PF14733_consen   11 AILLILSDLKDNCLPNLFSKLLNASRKIYKRVIDYHIRYVNSAKNIEELQPYLALFSNCIKNNILPSQL   79 (91)
T ss_pred             HHHHHHHHHHHhhchhhhhhhhhhhHHHHHHHHHHHHHHHHHccCHHHHHHHHHHHHHHHHcCCCCCcC
Confidence            44555666665555555332  2     33344455678999999999999999999999987655443


No 12 
>PF14684 Tricorn_C1:  Tricorn protease C1 domain; PDB: 1N6F_D 1N6D_C 1N6E_C 1K32_A.
Probab=37.33  E-value=35  Score=22.76  Aligned_cols=34  Identities=6%  Similarity=0.215  Sum_probs=27.6

Q ss_pred             CchhhhhhHHHHhhcccCHHHHHHHHHHHhhhhc
Q psy5120          90 NWPSLRKPWISAVNTSVKARDFARAMIVLQACIK  123 (138)
Q Consensus        90 nW~~~rk~W~~~V~~~~~~~~fA~aL~~le~~ir  123 (138)
                      +|...+......|..|.+..+|..+|..+-+-++
T Consensus        28 DW~~~~~~Y~p~v~~~~~~~el~~vl~eMl~eL~   61 (70)
T PF14684_consen   28 DWDAVYDRYRPLVPAAKTRDELYDVLNEMLGELN   61 (70)
T ss_dssp             HHHHHHHHHHGGGGG--SHHHHHHHHHHHHHTT-
T ss_pred             ChHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHC
Confidence            7999999999999999999999999987766543


No 13 
>TIGR01009 rpsC_bact ribosomal protein S3, bacterial type. TIGRFAMs model TIGR01008 describes S3 of the eukaryotic cytosol and of the archaea.
Probab=36.59  E-value=67  Score=26.19  Aligned_cols=72  Identities=21%  Similarity=0.293  Sum_probs=45.1

Q ss_pred             CccccccccCCceeeeeccch-hhhhhChhhhhHHHhHhhhhcccccCCCCCceeec---------------ccccCchH
Q psy5120           2 PLQSAFLMTTGFKSYLNQYST-NQIALNKTQRNEERDKKRHMSHKFSLTPTGEFKWA---------------GPVHGPRN   65 (138)
Q Consensus         2 ~~~frlG~e~~fk~Y~NqYs~-N~~Alnk~qr~eerDkkr~ls~KFslt~~~eFkW~---------------g~~~Gs~~   65 (138)
                      |++||||--+++.+.  .|+. +.++.   --.|+.-=+.++..+|.-..-+..+..               |-+.|...
T Consensus         7 P~gfRLGi~~~w~S~--Wfa~~k~Y~~---~l~eD~~IR~~i~k~~~~agis~IeI~rt~~~i~I~I~~~~pg~vIG~~g   81 (211)
T TIGR01009         7 PIGFRLGITKDWKSR--WYANPKEYAK---LLHEDLKIRNYIKKELSNAGISDVEIERPADKIRVTIHTARPGIVIGKKG   81 (211)
T ss_pred             CcccccccccCCceE--EccCcchhHH---HHHHHHHHHHHHHHHhhhCCcceEEEEEcCCceEEEEEeCCCcceeCCCc
Confidence            999999998888876  6665 22332   234555555666665654444444443               24457777


Q ss_pred             HHHHHHHHHHHHH
Q psy5120          66 VLVNTLRATLLQL   78 (138)
Q Consensus        66 ~~~~tlr~Ti~~l   78 (138)
                      .-++.|+..|.+.
T Consensus        82 ~~i~~l~~~l~~~   94 (211)
T TIGR01009        82 SEIEKLRKDLQKL   94 (211)
T ss_pred             hHHHHHHHHHHHH
Confidence            7777787777664


No 14 
>KOG4077|consensus
Probab=34.65  E-value=25  Score=27.90  Aligned_cols=27  Identities=30%  Similarity=0.345  Sum_probs=23.1

Q ss_pred             hhHHHHhhcccCHHHHHHHHHHHhhhh
Q psy5120          96 KPWISAVNTSVKARDFARAMIVLQACI  122 (138)
Q Consensus        96 k~W~~~V~~~~~~~~fA~aL~~le~~i  122 (138)
                      |.=.+++-.|+.+.|||.|.-+||++=
T Consensus        85 kvIEaaLRA~RRvNDfa~aVRilE~iK  111 (149)
T KOG4077|consen   85 KVIEAALRACRRVNDFATAVRILEAIK  111 (149)
T ss_pred             HHHHHHHHHHHHhccHHHHHHHHHHHH
Confidence            445678999999999999999999863


No 15 
>cd03565 VHS_Tom1 VHS domain family, Tom1 subfamily; The VHS domain is an essential part of Tom1 (Target of myb1 - retroviral oncogene) protein. The VHS domain has a superhelical structure similar to the structure of the ARM repeats and is present at the very N-termini of proteins. It is a right-handed superhelix of eight alpha helices. The VHS domain has been found in a number of proteins, some of which have been implicated in intracellular trafficking and sorting. The VHS domain of the Tom1 protein is essential for the negative regulation of Interleukin-1 and Tumor Necrosis Factor-induced signaling pathways.
Probab=31.57  E-value=47  Score=25.15  Aligned_cols=24  Identities=21%  Similarity=0.497  Sum_probs=20.2

Q ss_pred             hhcccCHHHHHHHHHHHhhhhcch
Q psy5120         102 VNTSVKARDFARAMIVLQACIKPV  125 (138)
Q Consensus       102 V~~~~~~~~fA~aL~~le~~ir~v  125 (138)
                      ++...+|+..-.||.+||+|++.+
T Consensus        47 l~~~~n~~v~l~aL~LLe~~vkNC   70 (141)
T cd03565          47 LNGNKNHKEVMLTLTVLETCVKNC   70 (141)
T ss_pred             HccCCCHHHHHHHHHHHHHHHHHc
Confidence            655678888888999999999876


No 16 
>PF09326 DUF1982:  Domain of unknown function (DUF1982);  InterPro: IPR015405 This C-terminal domain is functionally uncharacterised and is found in various prokaryotic NADH dehydrogenases including NADH-quinone oxidoreductase, chain G. ; GO: 0016651 oxidoreductase activity, acting on NADH or NADPH, 0051536 iron-sulfur cluster binding, 0055114 oxidation-reduction process
Probab=30.88  E-value=18  Score=23.20  Aligned_cols=13  Identities=23%  Similarity=0.286  Sum_probs=10.6

Q ss_pred             eeccchhhhhhCh
Q psy5120          17 LNQYSTNQIALNK   29 (138)
Q Consensus        17 ~NqYs~N~~Alnk   29 (138)
                      .|+|.||||+.+-
T Consensus        31 ~dfY~Td~IsRAS   43 (49)
T PF09326_consen   31 KDFYMTDPISRAS   43 (49)
T ss_pred             ccccccchhhhhh
Confidence            4799999998754


No 17 
>PF11491 DUF3213:  Protein of unknown function (DUF3213)   ;  InterPro: IPR021583  The backbone structure of this family of proteins has been determined however the function remains unknown. The protein has an alpha and beta structure with a ferredoxin-like fold []. ; PDB: 2F40_A.
Probab=29.40  E-value=19  Score=26.39  Aligned_cols=24  Identities=17%  Similarity=0.331  Sum_probs=12.2

Q ss_pred             cccccc-ccCCceeeeeccchhhhh
Q psy5120           3 LQSAFL-MTTGFKSYLNQYSTNQIA   26 (138)
Q Consensus         3 ~~frlG-~e~~fk~Y~NqYs~N~~A   26 (138)
                      .+|+|. +|+.|+.+.|-|+.|-..
T Consensus        16 ~QYeLsk~~~vyRvFiNgYar~g~V   40 (88)
T PF11491_consen   16 KQYELSKNEAVYRVFINGYARNGFV   40 (88)
T ss_dssp             HHHTTTTTTTB------TTSS--EE
T ss_pred             HHHHhhcccceeeeeecccccceEE
Confidence            467777 589999999999998643


No 18 
>PF11230 DUF3029:  Protein of unknown function (DUF3029);  InterPro: IPR016905  Members of this family are homologs to enzymes known to undergo activation by a radical SAM protein to create an active site glycyl radical. This family appears to be activated by the YjjW radical SAM protein, usually encoded by an adjacent gene. 
Probab=25.82  E-value=1e+02  Score=28.71  Aligned_cols=30  Identities=23%  Similarity=0.346  Sum_probs=27.3

Q ss_pred             chHHHHHHHHHHHHHHHHhcCcccCCCCch
Q psy5120          63 PRNVLVNTLRATLLQLESSIIPAFMHVNWP   92 (138)
Q Consensus        63 s~~~~~~tlr~Ti~~le~~Ip~~fmH~nW~   92 (138)
                      +...+-+-||.-+++|.+-||-+|||-|=-
T Consensus       114 ~eee~~~klk~Fl~~lDRTi~DsF~HaNIG  143 (487)
T PF11230_consen  114 DEEEAYKKLKLFLIHLDRTIPDSFCHANIG  143 (487)
T ss_pred             CHHHHHHHHHHHHHHhhccCcchhhccCCC
Confidence            777889999999999999999999998853


No 19 
>PHA02675 ORF104 fusion protein; Provisional
Probab=25.51  E-value=74  Score=23.37  Aligned_cols=20  Identities=35%  Similarity=0.426  Sum_probs=17.3

Q ss_pred             hHHHHHHHHHHHHHHHHhcC
Q psy5120          64 RNVLVNTLRATLLQLESSII   83 (138)
Q Consensus        64 ~~~~~~tlr~Ti~~le~~Ip   83 (138)
                      -..+..|||.+|+.|..+|-
T Consensus        63 LE~H~ETLRk~Ml~L~KKID   82 (90)
T PHA02675         63 LERHLETLREALLKLNTKID   82 (90)
T ss_pred             HHHHHHHHHHHHHHHHhhcc
Confidence            34789999999999999884


No 20 
>PF14910 MMS22L_N:  S-phase genomic integrity recombination mediator, N-terminal
Probab=24.76  E-value=69  Score=31.12  Aligned_cols=45  Identities=18%  Similarity=0.378  Sum_probs=38.1

Q ss_pred             ceeecc--cccCchHHHHHHHHHHHHHHHHhcC-------cccCCCCchhhhhh
Q psy5120          53 EFKWAG--PVHGPRNVLVNTLRATLLQLESSII-------PAFMHVNWPSLRKP   97 (138)
Q Consensus        53 eFkW~g--~~~Gs~~~~~~tlr~Ti~~le~~Ip-------~~fmH~nW~~~rk~   97 (138)
                      .|.|+.  .+.-+...+...+|+.|.+||+.|.       +.=+|-.|...|..
T Consensus        50 g~~wVte~al~es~~~LF~l~rqq~~~le~llq~~~~~~~~~~lh~~~~~iRqq  103 (704)
T PF14910_consen   50 GFQWVTETALVESCQKLFGLFRQQIYKLEQLLQSSCDFGQASSLHSEAENIRQQ  103 (704)
T ss_pred             ceeeehHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCHHHHHHH
Confidence            488886  5566778999999999999999999       57799999988864


No 21 
>PF04510 DUF577:  Family of unknown function (DUF577);  InterPro: IPR007598 This is a family of Arabidopsis thaliana (Mouse-ear cress) proteins. Many of these members contain a repeated region.
Probab=24.74  E-value=1.8e+02  Score=23.64  Aligned_cols=55  Identities=11%  Similarity=0.154  Sum_probs=41.5

Q ss_pred             CchHHHHHHHHHHHHHHHHhcCcccCCCCchhhhhhHHHHhhcccCHHHHHHHHHHHhhhh
Q psy5120          62 GPRNVLVNTLRATLLQLESSIIPAFMHVNWPSLRKPWISAVNTSVKARDFARAMIVLQACI  122 (138)
Q Consensus        62 Gs~~~~~~tlr~Ti~~le~~Ip~~fmH~nW~~~rk~W~~~V~~~~~~~~fA~aL~~le~~i  122 (138)
                      |++.-..+.||.+..-...+| ...---.|++++. .|   ..- +-.+|.+|..+|.|.-
T Consensus        16 ~~k~s~~KiL~~iVs~Va~~v-~~~~~~~W~eL~d-~I---ls~-~~~e~~kA~~IF~~L~   70 (174)
T PF04510_consen   16 ETKESDFKILRRIVSHVAYEV-FDLQEGGWDELSD-CI---LSL-SENEPVKAFHIFICLP   70 (174)
T ss_pred             cccHhHHHHHHHHHHHHHHHH-HhcCCCCchhHHH-HH---HHh-hccchHHHHHHHHhCC
Confidence            667778999999999999998 2234567999993 23   222 2678899999998864


No 22 
>PF02064 MAS20:  MAS20 protein import receptor;  InterPro: IPR002056 Virtually all mitochondrial precursors are imported via the same mechanism []: precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 []. MAS20 is also known as TOM20.; GO: 0006605 protein targeting, 0006886 intracellular protein transport, 0005742 mitochondrial outer membrane translocase complex; PDB: 3AX3_A 3AWR_B 2V1S_A 3AX5_C 3AX2_C 1OM2_A 2V1T_B.
Probab=24.38  E-value=83  Score=23.81  Aligned_cols=38  Identities=13%  Similarity=0.341  Sum_probs=29.8

Q ss_pred             CchhhhhhHHHHhhcccCHHHHHHHHHHHhhhhcchhhhhh
Q psy5120          90 NWPSLRKPWISAVNTSVKARDFARAMIVLQACIKPVVFASV  130 (138)
Q Consensus        90 nW~~~rk~W~~~V~~~~~~~~fA~aL~~le~~ir~vvf~~v  130 (138)
                      +...=-..+.+|+..|.+|.+   .|.+++-.+-|-||..+
T Consensus        78 ~~~~aa~hf~nAl~V~~qP~~---LL~i~q~tlP~~vf~~l  115 (121)
T PF02064_consen   78 DYEEAAEHFYNALKVCPQPAE---LLQIYQKTLPPPVFEML  115 (121)
T ss_dssp             -HHHHHHHHHHHHHTSSSHHH---HHHHHHHHS-HHHHHHH
T ss_pred             CHHHHHHHHHHHHHhCCCHHH---HHHHHHhhCCHHHHHHH
Confidence            345566779999999999986   58899999999988764


No 23 
>KOG4061|consensus
Probab=23.52  E-value=62  Score=27.02  Aligned_cols=36  Identities=17%  Similarity=0.538  Sum_probs=23.4

Q ss_pred             hhhhhhHHHHhhcccCHHHHHHHHHHHhhhhcchhhhhhhh
Q psy5120          92 PSLRKPWISAVNTSVKARDFARAMIVLQACIKPVVFASVWH  132 (138)
Q Consensus        92 ~~~rk~W~~~V~~~~~~~~fA~aL~~le~~ir~vvf~~vW~  132 (138)
                      |...++|+++-.--.+...+     ++.--+||+|+.|+|.
T Consensus        81 pvi~hmWdqEk~Hl~tf~~l-----~~k~rVrpT~l~P~w~  116 (217)
T KOG4061|consen   81 PVIKHMWDQEKEHLKTFENL-----ALKHRVRPTVLTPLWN  116 (217)
T ss_pred             hHHHHHHHHHHHHHHHHHHH-----HHHccCCchhhhhHHH
Confidence            34566777766544444443     3455689999999995


No 24 
>PF13976 gag_pre-integrs:  GAG-pre-integrase domain
Probab=22.93  E-value=41  Score=21.35  Aligned_cols=14  Identities=29%  Similarity=0.631  Sum_probs=11.0

Q ss_pred             hhhhhhhhhhcCCC
Q psy5120         125 VVFASVWHEQLVHA  138 (138)
Q Consensus       125 vvf~~vW~e~LGht  138 (138)
                      .-=...||.-|||.
T Consensus        21 ~~~~~lWH~RLGH~   34 (67)
T PF13976_consen   21 KSSTDLWHRRLGHP   34 (67)
T ss_pred             CCCHHHHHHhcCCC
Confidence            34467899999994


No 25 
>PF11013 DUF2851:  Protein of unknown function (DUF2851);  InterPro: IPR021272  This bacterial family of proteins has no known function. 
Probab=22.29  E-value=37  Score=30.24  Aligned_cols=68  Identities=21%  Similarity=0.345  Sum_probs=48.6

Q ss_pred             HHhHhhhhcccccCCCCCceeecccccCchHHHHHHHHHHHHHHHHhcCcccCCCCchhhhhhHHHHhhcccCHHHHHHH
Q psy5120          35 ERDKKRHMSHKFSLTPTGEFKWAGPVHGPRNVLVNTLRATLLQLESSIIPAFMHVNWPSLRKPWISAVNTSVKARDFARA  114 (138)
Q Consensus        35 erDkkr~ls~KFslt~~~eFkW~g~~~Gs~~~~~~tlr~Ti~~le~~Ip~~fmH~nW~~~rk~W~~~V~~~~~~~~fA~a  114 (138)
                      =++.=.||.+||.+++...+.|.  +++.++.=.=|+|..  ||.     .++|..     .+.-..+..|++.+++...
T Consensus       240 L~~ey~~L~~K~~L~~~~~~~w~--f~rlRP~NFPtiRLA--qLA-----~L~~~~-----~~Lfs~l~~~~~~~~l~~l  305 (377)
T PF11013_consen  240 LKKEYQFLQHKFQLQPMDRSEWK--FFRLRPANFPTIRLA--QLA-----ALYHKS-----QNLFSKLLEAKSLEELYKL  305 (377)
T ss_pred             HHHHHHHHHHhcCCCCCCcccce--ECCCCCCCCCHHHHH--HHH-----HHHHcC-----chHHHHHHccCCHHHHHHH
Confidence            34445789999999998889994  889888777777754  443     344544     3333446688999988887


Q ss_pred             HH
Q psy5120         115 MI  116 (138)
Q Consensus       115 L~  116 (138)
                      +.
T Consensus       306 f~  307 (377)
T PF11013_consen  306 FS  307 (377)
T ss_pred             hC
Confidence            75


No 26 
>PF13960 DUF4218:  Domain of unknown function (DUF4218)
Probab=21.83  E-value=82  Score=24.16  Aligned_cols=27  Identities=30%  Similarity=0.337  Sum_probs=22.7

Q ss_pred             HHHHHHHHHHHHHHHhcCcccCCCCch
Q psy5120          66 VLVNTLRATLLQLESSIIPAFMHVNWP   92 (138)
Q Consensus        66 ~~~~tlr~Ti~~le~~Ip~~fmH~nW~   92 (138)
                      .+-+.|..|+++||.-.|++|+.++=-
T Consensus        25 ~L~~~I~~~lc~lE~ifppsffdim~H   51 (128)
T PF13960_consen   25 ELEEEIVETLCQLEMIFPPSFFDIMVH   51 (128)
T ss_pred             HHHHHHHHHHHHHHHHcchhHhhhhHH
Confidence            366788899999999999999987643


No 27 
>PF09674 DUF2400:  Protein of unknown function (DUF2400);  InterPro: IPR014127 Members of this uncharacterised protein family are found sporadically, so far only among spirochetes, epsilon and delta proteobacteria, and Bacteroides. The function is unknown and its gene neighbourhoods show little conservation.
Probab=21.45  E-value=95  Score=25.89  Aligned_cols=64  Identities=16%  Similarity=0.083  Sum_probs=48.4

Q ss_pred             cCchHHHHHHHHHHHHHHHHhcCcccCCCCchhhhhhHHHHhhcccCHHHHHHHHHHHhhhhcc
Q psy5120          61 HGPRNVLVNTLRATLLQLESSIIPAFMHVNWPSLRKPWISAVNTSVKARDFARAMIVLQACIKP  124 (138)
Q Consensus        61 ~Gs~~~~~~tlr~Ti~~le~~Ip~~fmH~nW~~~rk~W~~~V~~~~~~~~fA~aL~~le~~ir~  124 (138)
                      ||++.+|++.+..-+.-+..+--..+++-++....+....-...=.+.+|++..+..|...++.
T Consensus        27 yG~~~~I~~~~~~ll~~~~~~P~~~v~~~~~~~~~~~~~~~~yRf~~~~D~~~~~~~l~~i~~~   90 (232)
T PF09674_consen   27 YGNRKQIIKKLERLLDLMGPSPYDFVLSGDEKDDRKDLEGFFYRFQNGEDMYAFFIALKRIYQE   90 (232)
T ss_pred             ccCHHHHHHHHHHHHHHhCCCHHHHHHcCCHhhhHHHccCCCcCCCCHHHHHHHHHHHHHHHHc
Confidence            7999999999999988777765555667777766666555556667888888888777776654


Done!