Psyllid ID: psy6915


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------
MYNHLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGRIQDIQNPLDTDRISSIGLSLVDQNNGPFQLELDYIGIEYDPNHTEEFAYEMYLFENEVRGIVNW
cccEEEEEEEccccEEEEEEEccccccccccccEEEEEEEcccccEEEEEEEccccEEEcccEEcccccccccccccEEEEEEcccccccEEEEEEEEEEEEccccccHHccEEEccccEEEEEEcc
cccEEEEEEEccccEEEEEEEccccEEcccccEEEEEEEccccccEEEEEEEHHHcEEccccEEccccccccHHHHEEEEEEEEccccccEEEEEEEEEEEccccccHHHHHHHHcccccEEEcccc
MYNHLVIkcrgdgrsymlnlhtpgdfdvtwgdMFTYALytrggpywqitkipfskfflqskgriqdiqnpldtdrissiglslvdqnngpfqleldyigieydpnhteEFAYEMYLFENEVRGIVNW
MYNHLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGRIQDIQNPLDTDRISSIGLSLVDQNNGPFQLELDYIGIEYDPNHTEEFAYEMYLFENEVRGIVNW
MYNHLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGRIQDIQNPLDTDRISSIGLSLVDQNNGPFQLELDYIGIEYDPNHTEEFAYEMYLFENEVRGIVNW
***HLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGRIQDIQNPLDTDRISSIGLSLVDQNNGPFQLELDYIGIEYDPNHTEEFAYEMYLFENEVRGIV**
MYNHLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGRIQDIQNPLDTDRISSIGLSLVDQNNGPFQLELDYIGIEYDPNHTEEFAYEMYLFENE*RGIVNW
MYNHLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGRIQDIQNPLDTDRISSIGLSLVDQNNGPFQLELDYIGIEYDPNHTEEFAYEMYLFENEVRGIVNW
MYNHLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGRIQDIQNPLDTDRISSIGLSLVDQNNGPFQLELDYIGIEYDPNHTEEFAYEMYLFENEVRGIVNW
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MYNHLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGRIQDIQNPLDTDRISSIGLSLVDQNNGPFQLELDYIGIEYDPNHTEEFAYEMYLFENEVRGIVNW
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query127 2.2.26 [Sep-21-2011]
Q9VAI1296 Probable complex I interm yes N/A 0.897 0.385 0.620 1e-39
Q18726340 Probable complex I interm yes N/A 0.897 0.335 0.552 2e-32
Q61FQ3340 Probable complex I interm N/A N/A 0.897 0.335 0.552 2e-32
Q0MQ83327 Complex I intermediate-as N/A N/A 0.881 0.342 0.544 8e-32
Q0MQ82327 Complex I intermediate-as N/A N/A 0.881 0.342 0.544 9e-32
Q9Y375327 Complex I intermediate-as yes N/A 0.881 0.342 0.544 9e-32
Q0MQ84327 Complex I intermediate-as yes N/A 0.881 0.342 0.544 1e-31
Q9CWX2328 Complex I intermediate-as yes N/A 0.881 0.341 0.535 2e-31
O42636278 Complex I intermediate-as N/A N/A 0.740 0.338 0.237 3e-05
Q9LQI7227 Probable complex I interm yes N/A 0.732 0.409 0.234 0.0005
>sp|Q9VAI1|CIA30_DROME Probable complex I intermediate-associated protein 30, mitochondrial OS=Drosophila melanogaster GN=CG7598 PE=2 SV=1 Back     alignment and function desciption
 Score =  161 bits (407), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 72/116 (62%), Positives = 91/116 (78%), Gaps = 2/116 (1%)

Query: 2   YNHLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSK 61
           YN LV+K RGDGRSY++NLHT G FD+ W D++ Y LYTRGGP+WQI KIPFSKFFL SK
Sbjct: 172 YNMLVMKVRGDGRSYLINLHTEGYFDLMWNDIYHYVLYTRGGPHWQIAKIPFSKFFLSSK 231

Query: 62  GRIQDIQNPLDTDRISSIGLSLVDQN--NGPFQLELDYIGIEYDPNHTEEFAYEMY 115
           GR+QD Q  +  +R++  G S+  +   +GPF LE+DY+G+EYDP+H EEFAYEMY
Sbjct: 232 GRVQDRQGAIPLNRVTHFGFSVAAKKGMDGPFGLEIDYVGLEYDPSHREEFAYEMY 287




Chaperone protein involved in the assembly of the mitochondrial NADH:ubiquinone oxidoreductase complex (complex I).
Drosophila melanogaster (taxid: 7227)
>sp|Q18726|CIA30_CAEEL Probable complex I intermediate-associated protein 30, mitochondrial OS=Caenorhabditis elegans GN=C50B8.3 PE=3 SV=1 Back     alignment and function description
>sp|Q61FQ3|CIA30_CAEBR Probable complex I intermediate-associated protein 30, mitochondrial OS=Caenorhabditis briggsae GN=CBG11538 PE=3 SV=1 Back     alignment and function description
>sp|Q0MQ83|CIA30_GORGO Complex I intermediate-associated protein 30, mitochondrial OS=Gorilla gorilla gorilla GN=NDUFAF1 PE=2 SV=1 Back     alignment and function description
>sp|Q0MQ82|CIA30_PONPY Complex I intermediate-associated protein 30, mitochondrial OS=Pongo pygmaeus GN=NDUFAF1 PE=2 SV=1 Back     alignment and function description
>sp|Q9Y375|CIA30_HUMAN Complex I intermediate-associated protein 30, mitochondrial OS=Homo sapiens GN=NDUFAF1 PE=1 SV=2 Back     alignment and function description
>sp|Q0MQ84|CIA30_PANTR Complex I intermediate-associated protein 30, mitochondrial OS=Pan troglodytes GN=NDUFAF1 PE=2 SV=1 Back     alignment and function description
>sp|Q9CWX2|CIA30_MOUSE Complex I intermediate-associated protein 30, mitochondrial OS=Mus musculus GN=Ndufaf1 PE=2 SV=2 Back     alignment and function description
>sp|O42636|CIA30_NEUCR Complex I intermediate-associated protein 30, mitochondrial OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=cia30 PE=1 SV=1 Back     alignment and function description
>sp|Q9LQI7|CIA30_ARATH Probable complex I intermediate-associated protein 30 OS=Arabidopsis thaliana GN=At1g17350 PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query127
91087275 307 PREDICTED: similar to CG7598 CG7598-PA [ 0.897 0.371 0.657 9e-42
270011209 291 NADH dehydrogenase (ubiquinone) 1 alpha 0.897 0.391 0.657 1e-41
332375991 290 unknown [Dendroctonus ponderosae] 0.897 0.393 0.649 1e-38
193676275 309 PREDICTED: probable complex I intermedia 0.897 0.368 0.622 2e-38
195112449 294 GI10419 [Drosophila mojavensis] gi|19391 0.897 0.387 0.620 6e-38
21358497 296 CG7598 [Drosophila melanogaster] gi|2154 0.897 0.385 0.620 1e-37
195445129 298 GK11918 [Drosophila willistoni] gi|19416 0.897 0.382 0.603 1e-37
195062071 295 GH14323 [Drosophila grimshawi] gi|193891 0.897 0.386 0.620 1e-37
195158867 295 GL13912 [Drosophila persimilis] gi|19411 0.897 0.386 0.612 1e-37
125773031 295 GA20469 [Drosophila pseudoobscura pseudo 0.897 0.386 0.612 1e-37
>gi|91087275|ref|XP_975544.1| PREDICTED: similar to CG7598 CG7598-PA [Tribolium castaneum] Back     alignment and taxonomy information
 Score =  174 bits (442), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 75/114 (65%), Positives = 97/114 (85%)

Query: 2   YNHLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSK 61
           YN LV++ RGDGRSYMLN+ T G +D+ W DM+ Y LYTRGGPYWQ+++IPFSKFF+ SK
Sbjct: 171 YNMLVMRVRGDGRSYMLNISTRGYYDILWNDMYHYVLYTRGGPYWQVSRIPFSKFFMASK 230

Query: 62  GRIQDIQNPLDTDRISSIGLSLVDQNNGPFQLELDYIGIEYDPNHTEEFAYEMY 115
           GR+QD Q+P+  +R+++ G+S+ D+ NGPF LE+DYIGIEYDPNHTEEFAYE+Y
Sbjct: 231 GRVQDKQHPIALNRVTNFGISIGDKVNGPFSLEIDYIGIEYDPNHTEEFAYELY 284




Source: Tribolium castaneum

Species: Tribolium castaneum

Genus: Tribolium

Family: Tenebrionidae

Order: Coleoptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|270011209|gb|EFA07657.1| NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, assembly factor 1 [Tribolium castaneum] Back     alignment and taxonomy information
>gi|332375991|gb|AEE63136.1| unknown [Dendroctonus ponderosae] Back     alignment and taxonomy information
>gi|193676275|ref|XP_001948791.1| PREDICTED: probable complex I intermediate-associated protein 30, mitochondrial-like [Acyrthosiphon pisum] Back     alignment and taxonomy information
>gi|195112449|ref|XP_002000785.1| GI10419 [Drosophila mojavensis] gi|193917379|gb|EDW16246.1| GI10419 [Drosophila mojavensis] Back     alignment and taxonomy information
>gi|21358497|ref|NP_651718.1| CG7598 [Drosophila melanogaster] gi|21542021|sp|Q9VAI1.1|CIA30_DROME RecName: Full=Probable complex I intermediate-associated protein 30, mitochondrial; Flags: Precursor gi|7301819|gb|AAF56928.1| CG7598 [Drosophila melanogaster] gi|16768482|gb|AAL28460.1| GM05702p [Drosophila melanogaster] Back     alignment and taxonomy information
>gi|195445129|ref|XP_002070186.1| GK11918 [Drosophila willistoni] gi|194166271|gb|EDW81172.1| GK11918 [Drosophila willistoni] Back     alignment and taxonomy information
>gi|195062071|ref|XP_001996127.1| GH14323 [Drosophila grimshawi] gi|193891919|gb|EDV90785.1| GH14323 [Drosophila grimshawi] Back     alignment and taxonomy information
>gi|195158867|ref|XP_002020306.1| GL13912 [Drosophila persimilis] gi|194117075|gb|EDW39118.1| GL13912 [Drosophila persimilis] Back     alignment and taxonomy information
>gi|125773031|ref|XP_001357774.1| GA20469 [Drosophila pseudoobscura pseudoobscura] gi|54637507|gb|EAL26909.1| GA20469 [Drosophila pseudoobscura pseudoobscura] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query127
FB|FBgn0039689296 CG7598 [Drosophila melanogaste 0.897 0.385 0.620 1.6e-38
ZFIN|ZDB-GENE-050306-9304 ndufaf1 "NADH dehydrogenase (u 0.897 0.375 0.570 4.7e-32
UNIPROTKB|E1BZH1303 NDUFAF1 "Uncharacterized prote 0.897 0.376 0.535 3.3e-31
UNIPROTKB|Q9Y375327 NDUFAF1 "Complex I intermediat 0.881 0.342 0.544 5.4e-31
MGI|MGI:1916952328 Ndufaf1 "NADH dehydrogenase (u 0.881 0.341 0.535 1.1e-30
RGD|1306202330 Ndufaf1 "NADH dehydrogenase (u 0.881 0.339 0.544 1.1e-30
UNIPROTKB|E1BAZ2328 NDUFAF1 "Uncharacterized prote 0.881 0.341 0.535 4.4e-29
UNIPROTKB|E2RPE7328 NDUFAF1 "Uncharacterized prote 0.881 0.341 0.535 4.4e-29
CGD|CAL0004155227 orf19.3353 [Candida albicans ( 0.748 0.418 0.313 9.4e-11
UNIPROTKB|Q5A9Q4227 CIA30 "Potential mitochondrial 0.748 0.418 0.313 9.4e-11
FB|FBgn0039689 CG7598 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
 Score = 412 (150.1 bits), Expect = 1.6e-38, P = 1.6e-38
 Identities = 72/116 (62%), Positives = 91/116 (78%)

Query:     2 YNHLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSK 61
             YN LV+K RGDGRSY++NLHT G FD+ W D++ Y LYTRGGP+WQI KIPFSKFFL SK
Sbjct:   172 YNMLVMKVRGDGRSYLINLHTEGYFDLMWNDIYHYVLYTRGGPHWQIAKIPFSKFFLSSK 231

Query:    62 GRIQDIQNPLDTDRISSIGLSLVDQN--NGPFQLELDYIGIEYDPNHTEEFAYEMY 115
             GR+QD Q  +  +R++  G S+  +   +GPF LE+DY+G+EYDP+H EEFAYEMY
Sbjct:   232 GRVQDRQGAIPLNRVTHFGFSVAAKKGMDGPFGLEIDYVGLEYDPSHREEFAYEMY 287




GO:0005747 "mitochondrial respiratory chain complex I" evidence=ISS
GO:0017004 "cytochrome complex assembly" evidence=NAS
GO:0006119 "oxidative phosphorylation" evidence=ISS
ZFIN|ZDB-GENE-050306-9 ndufaf1 "NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, assembly factor 1" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
UNIPROTKB|E1BZH1 NDUFAF1 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|Q9Y375 NDUFAF1 "Complex I intermediate-associated protein 30, mitochondrial" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
MGI|MGI:1916952 Ndufaf1 "NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, assembly factor 1" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
RGD|1306202 Ndufaf1 "NADH dehydrogenase (ubiquinone) complex I, assembly factor 1" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|E1BAZ2 NDUFAF1 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|E2RPE7 NDUFAF1 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
CGD|CAL0004155 orf19.3353 [Candida albicans (taxid:5476)] Back     alignment and assigned GO terms
UNIPROTKB|Q5A9Q4 CIA30 "Potential mitochondrial Complex I assembly chaperone" [Candida albicans SC5314 (taxid:237561)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q9CWX2CIA30_MOUSENo assigned EC number0.53570.88180.3414yesN/A
Q18726CIA30_CAEELNo assigned EC number0.55260.89760.3352yesN/A
Q9VAI1CIA30_DROMENo assigned EC number0.62060.89760.3851yesN/A
Q9Y375CIA30_HUMANNo assigned EC number0.54460.88180.3425yesN/A
Q0MQ84CIA30_PANTRNo assigned EC number0.54460.88180.3425yesN/A

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query127
pfam08547156 pfam08547, CIA30, Complex I intermediate-associate 6e-28
>gnl|CDD|219896 pfam08547, CIA30, Complex I intermediate-associated protein 30 (CIA30) Back     alignment and domain information
 Score = 99.6 bits (249), Expect = 6e-28
 Identities = 39/94 (41%), Positives = 52/94 (55%), Gaps = 4/94 (4%)

Query: 2   YNHLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSK 61
           Y  L ++ +GDGR+Y  NL T  +F    G  +     T+ G  WQ  KIPFS F    +
Sbjct: 67  YTGLELRVKGDGRTYKFNLKTEDEF---DGLSYQADFDTKPGE-WQTVKIPFSDFKPTFR 122

Query: 62  GRIQDIQNPLDTDRISSIGLSLVDQNNGPFQLEL 95
           GR+ D   PLD  RI  IGL + D+  GPF+LE+
Sbjct: 123 GRVVDDAPPLDLSRIKQIGLLISDKQPGPFELEI 156


This protein is associated with mitochondrial Complex I intermediate-associated protein 30 (CIA30) in human and mouse. The family is also present in Schizosaccharomyces pombe which does not contain the NADH dehydrogenase component of complex I, or many of the other essential subunits. This means it is possible that this family of protein may not be directly involved in oxidative phosphorylation. Length = 156

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 127
KOG2435|consensus323 99.97
PF08547157 CIA30: Complex I intermediate-associated protein 3 99.96
PF03425178 CBM_11: Carbohydrate binding domain (family 11); I 99.02
PF04620217 FlaA: Flagellar filament outer layer protein Flaa; 94.26
>KOG2435|consensus Back     alignment and domain information
Probab=99.97  E-value=3.7e-32  Score=213.28  Aligned_cols=119  Identities=60%  Similarity=1.094  Sum_probs=116.2

Q ss_pred             cceEEEEEeeCCceEEEEEEcCCCCCCcccccEEEEEEeCCCCceEEEEEeCCCceeeecceecCCCCCCCccCccEEEE
Q psy6915           2 YNHLVIKCRGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGRIQDIQNPLDTDRISSIGL   81 (127)
Q Consensus         2 y~gl~lrvrGDGr~Y~l~L~~~~~~~~~~~~~y~~~f~t~~~~~W~tv~iPfs~F~~t~rG~~~~~~~~ld~~~I~~iG~   81 (127)
                      |+.|.||||||||.|.++|+++..++..|+++|+|.+.|++||-||.++|||+.|.+|++||+++.+-.|++.+|.+|||
T Consensus       201 fn~L~LrvRGDGRsy~inihte~~~dq~wndsys~flft~gGp~wq~~KIPfSKff~t~kGriqDrq~e~nl~~vssig~  280 (323)
T KOG2435|consen  201 FNTLYLRVRGDGRSYMINIHTETDFDQRWNDSYSYFLFTRGGPYWQEVKIPFSKFFFTNKGRIQDRQHELNLDKVSSIGF  280 (323)
T ss_pred             cceEEEEEecCCceEEEEecCccchhhhcccceeeEEecCCCCceeEEecchhhheeccccceeecccccCccceeeEeE
Confidence            78999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             EeecCCCeeEEEEEcEEEEEeCCCchhhhhhhhccccce
Q psy6915          82 SLVDQNNGPFQLELDYIGIEYDPNHTEEFAYEMYLFENE  120 (127)
Q Consensus        82 ~i~~kq~G~F~LeI~~I~~~~~~~~~~~~~y~~~~~~~~  120 (127)
                      .++++..|||.||||.|+++.+++|+|+|+||+|..+++
T Consensus       281 sl~dk~dGpF~LEIDfIGv~~d~~H~EdFayE~y~~p~~  319 (323)
T KOG2435|consen  281 SLADKVDGPFFLEIDFIGVFTDPAHTEDFAYENYPEPNP  319 (323)
T ss_pred             EEeeccCCcceeeEEEEEEecCCCcccceeeeccccccc
Confidence            999999999999999999999999999999999998875



>PF08547 CIA30: Complex I intermediate-associated protein 30 (CIA30); InterPro: IPR013857 Mitochondrial complex I intermediate-associated protein 30 (CIA30) is present in human and mouse, and also in Schizosaccharomyces pombe (Fission yeast) which does not contain the NADH dehydrogenase component of complex I, or many of the other essential subunits Back     alignment and domain information
>PF03425 CBM_11: Carbohydrate binding domain (family 11); InterPro: IPR005087 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity Back     alignment and domain information
>PF04620 FlaA: Flagellar filament outer layer protein Flaa; InterPro: IPR006714 Periplasmic flagella are the organelles of spirochete mobility, and are structurally different from the flagella of other motile bacteria Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query127
1v0a_A178 Endoglucanase H; carbohydrate binding module, cell 5e-07
>1v0a_A Endoglucanase H; carbohydrate binding module, cellulosome, cellulose degradation, hydrolase, glycosidase; 1.98A {Clostridium thermocellum} SCOP: b.18.1.30 Length = 178 Back     alignment and structure
 Score = 45.2 bits (106), Expect = 5e-07
 Identities = 20/86 (23%), Positives = 33/86 (38%), Gaps = 4/86 (4%)

Query: 10  RGDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGR--IQDI 67
            G        +       V  G+ + Y++       W+  +IPFS F  +   +   QD+
Sbjct: 79  DGSANEIRFMIAEKSINGVGDGEHWVYSITPDSS--WKTIEIPFSSFRRRLDYQPPGQDM 136

Query: 68  QNPLDTDRISSIGLSLVDQNNGPFQL 93
              LD D I SI     +  +G F +
Sbjct: 137 SGTLDLDNIDSIHFMYANNKSGKFVV 162


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query127
1v0a_A178 Endoglucanase H; carbohydrate binding module, cell 99.78
1wmx_A205 COG3291: FOG: PKD repeat; CBM30, carbohydrate bind 96.11
>1v0a_A Endoglucanase H; carbohydrate binding module, cellulosome, cellulose degradation, hydrolase, glycosidase; 1.98A {Clostridium thermocellum} SCOP: b.18.1.30 Back     alignment and structure
Probab=99.78  E-value=1.5e-18  Score=129.55  Aligned_cols=99  Identities=18%  Similarity=0.281  Sum_probs=78.1

Q ss_pred             CcceEEEEEee---CCceEEEEEEcCCCCC-CcccccEEEEEEeCCCCceEEEEEeCCCceeeecceecCCC----CCCC
Q psy6915           1 MYNHLVIKCRG---DGRSYMLNLHTPGDFD-VTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGRIQDIQ----NPLD   72 (127)
Q Consensus         1 ~y~gl~lrvrG---DGr~Y~l~L~~~~~~~-~~~~~~y~~~f~t~~~~~W~tv~iPfs~F~~t~rG~~~~~~----~~ld   72 (127)
                      .|+||+|+|+|   ||++|+|+||+..... .+| ..|++.|.+.  +.|++|+|||++|.  +|...++..    ++||
T Consensus        67 ~~~GI~l~vkG~~~nG~~y~~~LR~~~~~~~~~~-~~y~~sF~t~--~~W~~IeIPFs~F~--~r~~~~P~~~~~~~~~d  141 (178)
T 1v0a_A           67 KWLKISFDIKSVDGSANEIRFMIAEKSINGVGDG-EHWVYSITPD--SSWKTIEIPFSSFR--RRLDYQPPGQDMSGTLD  141 (178)
T ss_dssp             TCCEEEEEEEEC---CCCEEEEEEEECTTSSSEE-EEEEEEECCC--SSCEEEEEEGGGCE--ECCSCCCTTCCCCSSCC
T ss_pred             HCCcEEEEEEcCCCCCCEEEEEEeeCCCCCCCCC-eeEEEEecCC--CcCEEEEEEHHHhc--cccccCCCCcccCCCcC
Confidence            38999999999   6999999999976542 133 3999999986  47999999999999  444444332    3899


Q ss_pred             ccCccEEEEEeecCCCeeEEEEEcEEEEEeCCCc
Q psy6915          73 TDRISSIGLSLVDQNNGPFQLELDYIGIEYDPNH  106 (127)
Q Consensus        73 ~~~I~~iG~~i~~kq~G~F~LeI~~I~~~~~~~~  106 (127)
                      +++|++|+||++...+|.  |.|++|++|....+
T Consensus       142 ~~~i~si~~m~G~~~~g~--~~id~I~l~~~~~~  173 (178)
T 1v0a_A          142 LDNIDSIHFMYANNKSGK--FVVDNIKLIGALEH  173 (178)
T ss_dssp             TTSEEEEEEEESSSCCEE--EEEEEEEEECBCCC
T ss_pred             hhHeEEEEEEEcCCCcce--EEEEEEEEEeeecc
Confidence            999999999987644554  99999999965443



>1wmx_A COG3291: FOG: PKD repeat; CBM30, carbohydrate binding module family30, CELJ, sugar BIN protein; 2.00A {Clostridium thermocellum} SCOP: b.18.1.24 PDB: 1wzx_A 2c24_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 127
d1v0aa1167 b.18.1.30 (A:4-170) Endoglucanase H {Clostridium t 3e-15
>d1v0aa1 b.18.1.30 (A:4-170) Endoglucanase H {Clostridium thermocellum [TaxId: 1515]} Length = 167 Back     information, alignment and structure

class: All beta proteins
fold: Galactose-binding domain-like
superfamily: Galactose-binding domain-like
family: CBM11
domain: Endoglucanase H
species: Clostridium thermocellum [TaxId: 1515]
 Score = 65.8 bits (160), Expect = 3e-15
 Identities = 22/104 (21%), Positives = 40/104 (38%), Gaps = 8/104 (7%)

Query: 2   YNHLVIKCR---GDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFL 58
           +  +    +   G        +       V  G+ + Y++       W+  +IPFS F  
Sbjct: 65  WLKISFDIKSVDGSANEIRFMIAEKSINGVGDGEHWVYSITPDSS--WKTIEIPFSSFRR 122

Query: 59  QS--KGRIQDIQNPLDTDRISSIGLSLVDQNNGPFQLE-LDYIG 99
           +   +   QD+   LD D I SI     +  +G F ++ +  IG
Sbjct: 123 RLDYQPPGQDMSGTLDLDNIDSIHFMYANNKSGKFVVDNIKLIG 166


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query127
d1v0aa1167 Endoglucanase H {Clostridium thermocellum [TaxId: 99.79
d1wmxa_173 Endoglucanase CelJ {Clostridium thermocellum [TaxI 97.78
d1gu3a_142 Cellulose-binding domain of cellulase C {Cellulomo 89.2
d2e74g127 PetG subunit of the cytochrome b6f complex {Mastig 82.26
>d1v0aa1 b.18.1.30 (A:4-170) Endoglucanase H {Clostridium thermocellum [TaxId: 1515]} Back     information, alignment and structure
class: All beta proteins
fold: Galactose-binding domain-like
superfamily: Galactose-binding domain-like
family: CBM11
domain: Endoglucanase H
species: Clostridium thermocellum [TaxId: 1515]
Probab=99.79  E-value=3.3e-19  Score=128.67  Aligned_cols=98  Identities=21%  Similarity=0.362  Sum_probs=78.2

Q ss_pred             CcceEEEEEe---eCCceEEEEEEcCCCCCCcccccEEEEEEeCCCCceEEEEEeCCCceeeecceec--CCCCCCCccC
Q psy6915           1 MYNHLVIKCR---GDGRSYMLNLHTPGDFDVTWGDMFTYALYTRGGPYWQITKIPFSKFFLQSKGRIQ--DIQNPLDTDR   75 (127)
Q Consensus         1 ~y~gl~lrvr---GDGr~Y~l~L~~~~~~~~~~~~~y~~~f~t~~~~~W~tv~iPfs~F~~t~rG~~~--~~~~~ld~~~   75 (127)
                      .|+||+|+||   |++++|.|+|++...........|.+.|.+ + ++|++|+|||++|+++.+++++  +..++||+++
T Consensus        64 ~~~gi~~~vkg~~G~~~~~~~~l~d~~~~~~~~~~~~~~~f~~-~-~~W~~V~IPF~~F~~~~~~~~~~~~~~~~ldls~  141 (167)
T d1v0aa1          64 KWLKISFDIKSVDGSANEIRFMIAEKSINGVGDGEHWVYSITP-D-SSWKTIEIPFSSFRRRLDYQPPGQDMSGTLDLDN  141 (167)
T ss_dssp             TCCEEEEEEEEC---CCCEEEEEEEECTTSSSEEEEEEEEECC-C-SSCEEEEEEGGGCEECCSCCCTTCCCCSSCCTTS
T ss_pred             cCCeEEEEEECCCCCceEEEEEEEcCCcCCCCcccEEEEEEec-C-CccEEEEEEHHHcccccccCcccCCCccccChHH
Confidence            4899999999   788999999998765432112367777754 3 3799999999999999877765  3467999999


Q ss_pred             ccEEEEEeecCCCeeEEEEEcEEEEEe
Q psy6915          76 ISSIGLSLVDQNNGPFQLELDYIGIEY  102 (127)
Q Consensus        76 I~~iG~~i~~kq~G~F~LeI~~I~~~~  102 (127)
                      |.+|+|++.++++|+|+  ||+|++|.
T Consensus       142 i~~~~~~~~~g~~G~f~--iD~i~l~g  166 (167)
T d1v0aa1         142 IDSIHFMYANNKSGKFV--VDNIKLIG  166 (167)
T ss_dssp             EEEEEEEESSSCCEEEE--EEEEEEEC
T ss_pred             eEEEEEEECCCCCccEE--EEEEEEec
Confidence            99999999999999885  99999983



>d1wmxa_ b.18.1.24 (A:) Endoglucanase CelJ {Clostridium thermocellum [TaxId: 1515]} Back     information, alignment and structure
>d1gu3a_ b.18.1.14 (A:) Cellulose-binding domain of cellulase C {Cellulomonas fimi [TaxId: 1708]} Back     information, alignment and structure
>d2e74g1 f.23.26.1 (G:9-35) PetG subunit of the cytochrome b6f complex {Mastigocladus laminosus [TaxId: 83541]} Back     information, alignment and structure