Query         T0629 long tail fibre protein gp37, bacteriophage T4, 216 residues
Match_columns 216
No_of_seqs    122 out of 393
Neff          9.6 
Searched_HMMs 11830
Date          Thu Jul 22 15:01:14 2010
Command       /home/syshi_2/2008/ferredoxin/manualcheck/update/HHsearch/bin/hhsearch -i /home/syshi_3/CASP9/HHsearch4Targetseq/seq/T0629.hhm -d /home/syshi_2/2008/ferredoxin/manualcheck/update/HHsearch/database/pfamA_24_hhmdb -o /home/syshi_3/CASP9/HHsearch4Targetseq/pfamAsearch/T0629.hhr 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF07484 Collar:  Phage Tail Co  99.7 2.5E-18 2.1E-22  121.0   2.8   48    6-53      1-57  (57)
  2 PF03235 DUF262:  Protein of un  35.8     9.1 0.00077   15.6   2.4   38    1-38     41-80  (224)
  3 PF02741 FTR_C:  FTR, proximal   19.7      10 0.00087   15.2   0.3   40    5-51     46-85  (150)
  4 PF02285 COX8:  Cytochrome oxid  17.8     2.3 0.00019   19.0  -3.4   24    5-28     17-40  (44)
  5 PF06836 DUF1240:  Protein of u  10.2      38  0.0032   12.0   1.0   18   17-34     70-88  (95)
  6 PF02343 TRA-1_regulated:  TRA-   8.9      31  0.0026   12.5   0.2   12   29-40      6-17  (129)
  7 PF00706 Toxin_4:  Anenome neur   8.8      39  0.0033   11.9   0.7   12   14-25     23-34  (43)
  8 PF00844 Gemini_coat:  Geminivi   8.2      46  0.0039   11.6   1.2   23   32-54    132-159 (244)
  9 PF09979 DUF2213:  Uncharacteri   8.1      46  0.0039   11.5   0.9   15   17-31      7-21  (170)
 10 PF08677 GP11:  GP11 baseplate    7.8      40  0.0034   11.9   0.3   25    1-25     63-92  (218)

No 1  
>PF07484 Collar:  Phage Tail Collar Domain;  InterPro: IPR011083   This region is occasionally found in conjunction with IPR005003 from INTERPRO. Most of the proteins appear to be phage tail proteins; however some appear to be involved in other processes. For instance the RhiB protein (Q03314 from SWISSPROT) from Rhizobium leguminosarum may be involved in plant-microbe interactions . A related protein, microcystin related protein (MrpB, Q9L3N1 from SWISSPROT) is involved in the pathogenicity of Microcystis aeruginosa. The finding of this family in a structural component of the phage tail fibre baseplate (P10930 from SWISSPROT) suggests that its function is structural rather than enzymatic. Structural studies show this region consists of a helix and a loop  and three beta-strands. This alignment does not catch the third strand as it is separated from the rest of the structure by around 100 residues. This strand is conserved in homologues but the intervening sequence is not. Much of the function of P10930 from SWISSPROT appears to reside in this intervening region. In the tertiary structure of the phage baseplate this domain forms part of the collar and may bind SO_4. The long unconserved region maybe due to domain swapping in and out of a loop or due to rapid evolution.; PDB: 1ocy_A 1h6w_A.
Probab=99.69  E-value=2.5e-18  Score=121.00  Aligned_cols=48  Identities=33%  Similarity=0.614  Sum_probs=45.8

Q ss_pred             CCEEECCCCCCCCCEEEECCCEECHHHHHHHHHHCC---------CCCCCCCCCCCC
Q ss_conf             613507778788871863781226332057886607---------777775456510
Q T0629             6 GAPIPWPSDSVPAGFALMEGQTFDKSAYPKLAVAYP---------SGVIPDMRGQTI   53 (216)
Q Consensus         6 G~iv~f~~~~~P~Gwl~cdG~~~s~~~Yp~L~~~~g---------s~~lPdlrg~f~   53 (216)
                      |+|++|+.+.+|+|||+||||++++++||+||++||         +|+||||||||+
T Consensus         1 G~I~~f~~~~~P~gwl~cdG~~~~~~~yp~Lf~~iGt~yGGdg~~tF~lPDlrG~~~   57 (57)
T PF07484_consen    1 GSIVPFAGNFAPEGWLLCDGQTLSISQYPALFSLIGTTYGGDGSTTFNLPDLRGRFP   57 (57)
T ss_dssp             --EEEESSSS-SSTEEE-------TTT-HHHHHH----------S------------
T ss_pred             CCEEEECCCCCCCHHHHHCCCCCCHHHCHHHHHHHCCCCCCCCCCCCCCCCCCCCCC
T ss_conf             947993388896378871886768778999999867013787888677965046788


No 2  
>PF03235 DUF262:  Protein of unknown function DUF262;  InterPro: IPR004919 This entry is found in prokaryotic proteins of unknown function.
Probab=35.75  E-value=9.1  Score=15.55  Aligned_cols=38  Identities=21%  Similarity=0.431  Sum_probs=26.5

Q ss_pred             CCCCCCCEEECC--CCCCCCCEEEECCCEECHHHHHHHHH
Q ss_conf             988876135077--78788871863781226332057886
Q T0629             1 SSYPIGAPIPWP--SDSVPAGFALMEGQTFDKSAYPKLAV   38 (216)
Q Consensus         1 ~~~PvG~iv~f~--~~~~P~Gwl~cdG~~~s~~~Yp~L~~   38 (216)
                      ..||+|+|+.|.  ......-+...|||.=-.+.|--|.+
T Consensus        41 ~~y~iG~iv~~~~~~~~~~~~~~vIDGQQRLTTl~l~l~~   80 (224)
T PF03235_consen   41 RGYFIGSIVLWEKEKNDSNNKYEVIDGQQRLTTLYLLLKA   80 (224)
T ss_pred             CCCCCCEEEEECCCCCCCCCEEEEECCHHHHHHHHHHHHH
T ss_conf             6821071788614567898838984670899999999984


No 3  
>PF02741 FTR_C:  FTR, proximal lobe;  InterPro: IPR002770   Formylmethanofuran:tetrahyromethanopterin formyltransferase (Ftr) is involved in C1 metabolism in methanogenic archaea, sulphate-reducing archaea and methylotrophic bacteria. It catalyses the following reversible reaction:  N-formylmethanofuran + 5,6,7,8-tetrahydromethanopterin = methanofuran + 5-formyl-5,6,7,8-tetrahydromethanopterin   Ftr from the thermophilic methanogen Methanopyrus kandleri (optimum growth temperature 98 degrees C) is a hyperthermophilic enzyme that is absolutely dependent on the presence of lyotropic salts for activity and thermostability. The crystal structure of Ftr, determined to a reveals a homotetramer composed essentially of two dimers. Each subunit is subdivided into two tightly associated lobes both consisting of a predominantly antiparallel beta sheet flanked by alpha helices forming an alpha/beta sandwich structure. The approximate location of the active site was detected in a region close to the dimer interface . Ftr from the mesophilic methanogen Methanosarcina barkeri and the sulphate-reducing archaeon Archaeoglobus fulgidus have a similar structure    In the methylotrophic bacterium Methylobacterium extorquens, Ftr interacts with three other polypeptides to form an Ftr/cyclohydrolase complex which catalyses the hydrolysis of formyl-tetrahydromethanopterin to formate during growth on C1 substrates . ; GO: 0016740 transferase activity, 0006730 one-carbon compound metabolic process; PDB: 1ftr_C 2fhj_A 2fhk_D 1m5s_A 1m5h_H.
Probab=19.73  E-value=10  Score=15.22  Aligned_cols=40  Identities=25%  Similarity=0.485  Sum_probs=23.9

Q ss_pred             CCCEEECCCCCCCCCEEEECCCEECHHHHHHHHHHCCCCCCCCCCCC
Q ss_conf             76135077787888718637812263320578866077777754565
Q T0629             5 IGAPIPWPSDSVPAGFALMEGQTFDKSAYPKLAVAYPSGVIPDMRGQ   51 (216)
Q Consensus         5 vG~iv~f~~~~~P~Gwl~cdG~~~s~~~Yp~L~~~~gs~~lPdlrg~   51 (216)
                      .|.|.|||+-.+..|      +.+... |+.|.+.--.-.-|-||++
T Consensus        46 ~gvI~PFPGGivrSG------SKVGsk-Yk~l~ASTNdayCPTLr~~   85 (150)
T PF02741_consen   46 PGVITPFPGGIVRSG------SKVGSK-YKFLGASTNDAYCPTLRGK   85 (150)
T ss_dssp             ---B-------BSS-------EB------TT-S-BE-GGG-GGGGCC
T ss_pred             CCEEECCCCCEECCC------CCCCCC-CCCCCCCCCCCCCCCHHCC
T ss_conf             995735899842457------746546-6878686676748442202


No 4  
>PF02285 COX8:  Cytochrome oxidase c subunit VIII;  InterPro: IPR003205   Cytochrome c oxidase (1.9.3.1 from EC) is an oligomeric enzymatic complex which is a component of the respiratory chain complex and is involved in the transfer of electrons from cytochrome c to oxygen . In eukaryotes this enzyme complex is located in the mitochondrial inner membrane; in aerobic prokaryotes it is found in the plasma membrane.    In eukaryotes, in addition to the three large subunits, I, II and III, that form the catalytic centre of the enzyme complex, there are a variable number of small polypeptidic subunits.This family is composed of cytochrome c oxidase subunit VIII. ; GO: 0004129 cytochrome-c oxidase activity, 0006118 electron transport; PDB: 1ocr_Z 1occ_Z 2dyr_Z 1oco_M 2dys_Z 2ein_Z 2eik_Z 2eim_Z 2eij_M 1v55_M ....
Probab=17.84  E-value=2.3  Score=18.95  Aligned_cols=24  Identities=25%  Similarity=0.418  Sum_probs=15.1

Q ss_pred             CCCEEECCCCCCCCCEEEECCCEE
Q ss_conf             761350777878887186378122
Q T0629             5 IGAPIPWPSDSVPAGFALMEGQTF   28 (216)
Q Consensus         5 vG~iv~f~~~~~P~Gwl~cdG~~~   28 (216)
                      ||..+.|..=-.|+||.+|+=..|
T Consensus        17 igl~~~f~~~l~PagWVL~hL~~Y   40 (44)
T PF02285_consen   17 IGLTVFFATFLGPAGWVLSHLEDY   40 (44)
T ss_dssp             ---HHHHHHHHTT---TTTTHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHH
T ss_conf             889999999971189999999871


No 5  
>PF06836 DUF1240:  Protein of unknown function (DUF1240);  InterPro: IPR010665   This family consists of a number of hypothetical putative membrane proteins which seem to be specific to Yersinia pestis. The function of this family is unknown.
Probab=10.20  E-value=38  Score=12.03  Aligned_cols=18  Identities=11%  Similarity=0.469  Sum_probs=11.7

Q ss_pred             CCCEEEECCCEE-CHHHHH
Q ss_conf             887186378122-633205
Q T0629            17 PAGFALMEGQTF-DKSAYP   34 (216)
Q Consensus        17 P~Gwl~cdG~~~-s~~~Yp   34 (216)
                      -+||+.|+...| +-++|-
T Consensus        70 s~gY~~C~k~SwmSPN~YV   88 (95)
T PF06836_consen   70 SEGYVVCDKISWMSPNTYV   88 (95)
T ss_pred             CCCEEECCCCEEECCCCEE
T ss_conf             3890880531022741276


No 6  
>PF02343 TRA-1_regulated:  TRA-1 regulated protein R03H10.4;  InterPro: IPR003326 This domain is found in a family of proteins from Caenorhabditis elegans. The domain has no known function, but has 4 conserved cysteine residues and is a maximum of 175 residues long.
Probab=8.95  E-value=31  Score=12.51  Aligned_cols=12  Identities=17%  Similarity=0.426  Sum_probs=5.7

Q ss_pred             CHHHHHHHHHHC
Q ss_conf             633205788660
Q T0629            29 DKSAYPKLAVAY   40 (216)
Q Consensus        29 s~~~Yp~L~~~~   40 (216)
                      +..+.|+|...+
T Consensus         6 ~~~~Cpdl~~~~   17 (129)
T PF02343_consen    6 STCECPDLSSYF   17 (129)
T ss_pred             CCCCCCCHHHHH
T ss_conf             877895978950


No 7  
>PF00706 Toxin_4:  Anenome neurotoxin;  InterPro: IPR000693 Sea anemones produce many different neurotoxins with related structure and function. Proteins belonging to this family include the neurotoxins, of which there are several, including calitoxin and anthopleurin. The neurotoxins bind specifically to the sodium channel, thereby delaying its inactivation during signal transduction, resulting in strong stimulation of mammalian cardiac muscle contraction. Calitoxin 1 has been found in neuromuscular prearations of crustaceans, where it increases transmitter release, causing firing of the axons. Three disulphide bonds are present in this protein , , .; GO: 0009966 regulation of signal transduction, 0005576 extracellular region; PDB: 1atx_A 2h9x_A 2sh1_A 1shi_A 1sh1_A 1apf_A 1ahl_A.
Probab=8.84  E-value=39  Score=11.94  Aligned_cols=12  Identities=17%  Similarity=0.351  Sum_probs=8.6

Q ss_pred             CCCCCCEEEECC
Q ss_conf             878887186378
Q T0629            14 DSVPAGFALMEG   25 (216)
Q Consensus        14 ~~~P~Gwl~cdG   25 (216)
                      ..-|.||-+|.+
T Consensus        23 ~~CpsGWhkC~~   34 (43)
T PF00706_consen   23 FGCPSGWHKCKS   34 (43)
T ss_dssp             -------EECCC
T ss_pred             CCCCCHHHHHHC
T ss_conf             578831555424


No 8  
>PF00844 Gemini_coat:  Geminivirus coat protein/nuclear export factor BR1 family;  InterPro: IPR000263 Geminiviruses are characterised by a genome of circular single-stranded DNA encapsidated in twinned (geminate) quasi-isometric particles, from which the group derives its name . Most geminiviruses can be divided into 2 subgroups on the basis of host range and/or insect vector: i.e. those that infect dicotyledenous plants and are transmitted by the same whitefly species, and those that infect monocotyledenous plants and are transmitted by different leafhopper vectors. It has been shown that the 104 N-terminal amino acids of the Maize streak virus coat protein bind DNA non-specifically .; GO: 0005198 structural molecule activity, 0019028 viral capsid
Probab=8.19  E-value=46  Score=11.56  Aligned_cols=23  Identities=17%  Similarity=0.241  Sum_probs=9.6

Q ss_pred             HHHHHHHHCCC-----CCCCCCCCCCCC
Q ss_conf             20578866077-----777754565100
Q T0629            32 AYPKLAVAYPS-----GVIPDMRGQTIK   54 (216)
Q Consensus        32 ~Yp~L~~~~gs-----~~lPdlrg~f~r   54 (216)
                      ++.+||-++-.     -..+||++||.+
T Consensus       132 ~f~elFg~~~~~~~t~~v~~~~~~RF~V  159 (244)
T PF00844_consen  132 TFDELFGMYINSPGTLTVKRDLKDRFYV  159 (244)
T ss_pred             CHHHHCCCCCCCCCCEEECHHHCCCEEE
T ss_conf             7777347765676776986572875899


No 9  
>PF09979 DUF2213:  Uncharacterized protein conserved in bacteria (DUF2213)
Probab=8.09  E-value=46  Score=11.53  Aligned_cols=15  Identities=13%  Similarity=0.479  Sum_probs=0.0

Q ss_pred             CCCEEEECCCEECHH
Q ss_conf             887186378122633
Q T0629            17 PAGFALMEGQTFDKS   31 (216)
Q Consensus        17 P~Gwl~cdG~~~s~~   31 (216)
                      +.|||.|....++|+
T Consensus         7 ~~G~L~v~~~piar~   21 (170)
T PF09979_consen    7 ADGYLVVEAVPIART   21 (170)
T ss_pred             CCEEEEEEEEEEEEE
T ss_conf             980699652477343


No 10 
>PF08677 GP11:  GP11 baseplate wedge protein;  InterPro: IPR014791    The bacteriophage baseplate controls host cell recognition, attachment, tail sheath contraction and viral DNA ejection. The baseplate is a multi-subunit assembly at the distal end of the tail, which is composed of long and short tail fibres . The tail region is responsible for attachment to the host bacteria during infection: long tail fibres enable host receptor recognition, while irreversible attachment is via short tail fibres. Recognition and attachment induce a conformational transition of the baseplate from a hexagonal to a star-shaped structure. In viruses such as Bacteriophage T4, GP11 acts as a structural protein to connect the short tail fibres to the baseplate, while GP9 connects the baseplate with the long tail fibres. Both GP9 and GP11 are trimers. Each GP11 monomer consists of three domains, which are entwined together in the trimer: the N-terminal domains of the three monomers form a central, trimeric, parallel coiled coil surrounded by the entwined middle finger domains; the C-terminal domains appear to be responsible for trimerisation .; PDB: 1el6_C.
Probab=7.75  E-value=40  Score=11.88  Aligned_cols=25  Identities=24%  Similarity=0.432  Sum_probs=0.0

Q ss_pred             CCCCCCCEEECCCCCCCCC-----EEEECC
Q ss_conf             9888761350777878887-----186378
Q T0629             1 SSYPIGAPIPWPSDSVPAG-----FALMEG   25 (216)
Q Consensus         1 ~~~PvG~iv~f~~~~~P~G-----wl~cdG   25 (216)
                      +.+|||.|+.--..++|.|     +|-++|
T Consensus        63 ~~~pVg~Vii~t~~~~P~g~~Q~e~ltFsG   92 (218)
T PF08677_consen   63 AMLPVGTVIISTENVSPEGVQQIETLTFSG   92 (218)
T ss_dssp             H------EEEE-----------EEEEEEE-
T ss_pred             HHCCCCCEEEECCCCCCCCCCCEEEEEEEE
T ss_conf             507513199844778976523136999876


Done!