Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST
Original result of RPS-BLAST against CDD database part I
ID Alignment Graph Length
Definition
E-value
Query 111
PLN03097
846
PLN03097, FHY3, Protein FAR-RED ELONGATED HYPOCOTY
1e-13
pfam10551 88
pfam10551, MULE, MULE transposase domain
3e-09
>gnl|CDD|178645 PLN03097, FHY3, Protein FAR-RED ELONGATED HYPOCOTYL 3; Provisional
Back Hide alignment and domain information
Score = 65.0 bits (158), Expect = 1e-13
Identities = 29/88 (32%), Positives = 51/88 (57%)
Query: 23 FGDIPTFDTTYQTNAYSKPFVVIVGINQHIETISFLFGLLTSETKYTNTYLLETFLRAMN 82
F D+ +FDTTY N Y P + VG+NQH + + L++ E+ T ++L++T+LRAM
Sbjct: 285 FSDVVSFDTTYVRNKYKMPLALFVGVNQHYQFMLLGCALISDESAATYSWLMQTWLRAMG 344
Query: 83 RKNSLRVIINGDQIMSKAMATTFLDSTH 110
+ +I + D+ M ++ F ++ H
Sbjct: 345 GQAPKVIITDQDKAMKSVISEVFPNAHH 372
>gnl|CDD|220806 pfam10551, MULE, MULE transposase domain
Back Show alignment and domain information
Score = 49.7 bits (119), Expect = 3e-09
Identities = 23/80 (28%), Positives = 42/80 (52%), Gaps = 4/80 (5%)
Query: 31 TTYQTNAYSKPFVVIVGINQHIETISFLFGLLTSETKYTNTYLLETFLRAMNRKNSLRVI 90
TY TN Y VG++ H + F L++ E++ + T+ LE +A+ + L +I
Sbjct: 1 GTYLTNKY----KGAVGVDHHGQGFPLAFALVSDESEESWTWFLEGLKKALGGRPPLTII 56
Query: 91 INGDQIMSKAMATTFLDSTH 110
+GD+ + KA+ F ++ H
Sbjct: 57 SDGDKGLKKAIKEVFPNARH 76
This domain was identified by Babu and colleagues. Length = 88
Conserved Domains Detected by HHsearch
Original result of HHsearch against CDD database
ID Alignment Graph Length
Definition
Probability
Query
111
PLN03097
846
FHY3 Protein FAR-RED ELONGATED HYPOCOTYL 3; Provis
100.0
PF10551 93
MULE: MULE transposase domain; InterPro: IPR018289
99.91
PF00872
381
Transposase_mut: Transposase, Mutator family; Inte
98.7
COG3328
379
Transposase and inactivated derivatives [DNA repli
97.4
PF13610 140
DDE_Tnp_IS240: DDE domain
96.86
PF00665 120
rve: Integrase core domain; InterPro: IPR001584 In
96.43
COG3316 215
Transposase and inactivated derivatives [DNA repli
93.37
PF01610
249
DDE_Tnp_ISL3: Transposase; InterPro: IPR002560 Aut
90.95
>PLN03097 FHY3 Protein FAR-RED ELONGATED HYPOCOTYL 3; Provisional
Back Hide alignment and domain information
Probab=100.00 E-value=6.4e-44 Score=298.14 Aligned_cols=110 Identities=29% Similarity=0.490 Sum_probs=108.5
Q ss_pred ccceeeEEEEechhchhhhhhCCCeEEEecccccCCCCCceeEEEEEccCCceEEEEeeeccccccchhhHHHHHHHHHh
Q 038964 2 KRIDWETYFLYDRGSLVNDEVFGDIPTFDTTYQTNAYSKPFVVIVGINQHIETISFLFGLLTSETKYTNTYLLETFLRAM 81 (111)
Q Consensus 2 ~~~~l~~ifw~~~~~~~~~~~f~dvv~~D~Ty~tn~y~~pl~~~~g~n~~~~~~~~~~~ll~~E~~es~~W~l~~f~~~~ 81 (111)
.+++|++|||+|++|+.+|.+|||||+||+||+||+|+|||+.|+|+|||+|+++||||||.||+.|||.|||++|+++|
T Consensus 264 e~~~l~niFWaD~~sr~~Y~~FGDvV~fDTTY~tN~y~~Pfa~FvGvNhH~qtvlfGcaLl~dEt~eSf~WLf~tfl~aM 343 (846)
T PLN03097 264 EDQRLKNLFWVDAKSRHDYGNFSDVVSFDTTYVRNKYKMPLALFVGVNQHYQFMLLGCALISDESAATYSWLMQTWLRAM 343 (846)
T ss_pred cCCCeeeEEeccHHHHHHHHhcCCEEEEeceeeccccCcEEEEEEEecCCCCeEEEEEEEcccCchhhHHHHHHHHHHHh
Confidence 46899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cCCCceEEEeeCcHHHHHHHHHhCCCCCCC
Q 038964 82 NRKNSLRVIINGDQIMSKAMATTFLDSTHK 111 (111)
Q Consensus 82 ~~~~p~~iiTD~d~a~~~Ai~~vfP~~~Hr 111 (111)
+|+.|++||||+|.+|++||++|||+|.||
T Consensus 344 ~gk~P~tIiTDqd~am~~AI~~VfP~t~Hr 373 (846)
T PLN03097 344 GGQAPKVIITDQDKAMKSVISEVFPNAHHC 373 (846)
T ss_pred CCCCCceEEecCCHHHHHHHHHHCCCceeh
Confidence 999999999999999999999999999998
>PF10551 MULE: MULE transposase domain; InterPro: IPR018289 This entry represents a domain found in Mutator-like elements (MULE)-encoded tranposases, some of which also contain a zinc-finger motif [, ]
Back Show alignment and domain information
Probab=99.91 E-value=1.1e-24 Score=138.69 Aligned_cols=79 Identities=30% Similarity=0.497 Sum_probs=75.8
Q ss_pred cccccCCCCCceeE---EEEEccCCceEEEEeeeccccccchhhHHHHHHHHHhcCCCceEEEeeCcHHHHHHHHHhCCC
Q 038964 31 TTYQTNAYSKPFVV---IVGINQHIETISFLFGLLTSETKYTNTYLLETFLRAMNRKNSLRVIINGDQIMSKAMATTFLD 107 (111)
Q Consensus 31 ~Ty~tn~y~~pl~~---~~g~n~~~~~~~~~~~ll~~E~~es~~W~l~~f~~~~~~~~p~~iiTD~d~a~~~Ai~~vfP~ 107 (111)
+||+||+| +|++. ++|+|++++.+++|++++.+|+.++|.|+|+.+++.+..+ |.+|+||+|.++.+||+++||+
T Consensus 1 ~T~~tn~~-~~l~~~~~~~~~d~~~~~~~v~~~l~~~e~~~~~~~~l~~~~~~~~~~-p~~ii~D~~~~~~~Ai~~vfP~ 78 (93)
T PF10551_consen 1 GTYKTNKY-GPLLYLMIAVGIDGNGRGFPVAFALVSSESEESYEWFLEKLKEAMPQK-PKVIISDFDKALINAIKEVFPD 78 (93)
T ss_pred Cccccccc-cccceeceEEEEcCCCCEEEEEEEEEcCCChhhhHHHHHHhhhccccC-ceeeeccccHHHHHHHHHHCCC
Confidence 69999999 88885 9999999999999999999999999999999999999887 9999999999999999999999
Q ss_pred CCCC
Q 038964 108 STHK 111 (111)
Q Consensus 108 ~~Hr 111 (111)
+.|+
T Consensus 79 ~~~~ 82 (93)
T PF10551_consen 79 ARHQ 82 (93)
T ss_pred ceEe
Confidence 9985
This domain is also found in a transposase for the insertion sequence element IS256 in transposon Tn4001 [].
>PF00872 Transposase_mut: Transposase, Mutator family; InterPro: IPR001207 Autonomous mobile genetic elements such as transposon or insertion sequences (IS) encode an enzyme, transposase, that is required for excising and inserting the mobile element
Back Show alignment and domain information
Probab=98.70 E-value=3.5e-09 Score=83.06 Aligned_cols=85 Identities=19% Similarity=0.181 Sum_probs=75.0
Q ss_pred CeEEEecccccCCC-----CCceeEEEEEccCCceEEEEeeeccccccchhhHHHHHHHHHhcCCCceEEEeeCcHHHHH
Q 038964 25 DIPTFDTTYQTNAY-----SKPFVVIVGINQHIETISFLFGLLTSETKYTNTYLLETFLRAMNRKNSLRVIINGDQIMSK 99 (111)
Q Consensus 25 dvv~~D~Ty~tn~y-----~~pl~~~~g~n~~~~~~~~~~~ll~~E~~es~~W~l~~f~~~~~~~~p~~iiTD~d~a~~~ 99 (111)
++|.+|++|.+-+. +.+++.++|+|..|+-.++|+.+-..|+.++|.=+|+.+++. |-+.|..|++|..+++.+
T Consensus 163 ~~l~iD~~~~kvr~~~~~~~~~~~v~iGi~~dG~r~vLg~~~~~~Es~~~W~~~l~~L~~R-Gl~~~~lvv~Dg~~gl~~ 241 (381)
T PF00872_consen 163 PYLWIDGTYFKVREDGRVVKKAVYVAIGIDEDGRREVLGFWVGDRESAASWREFLQDLKER-GLKDILLVVSDGHKGLKE 241 (381)
T ss_pred cceeeeeeecccccccccccchhhhhhhhhcccccceeeeecccCCccCEeeecchhhhhc-cccccceeeccccccccc
Confidence 57889999987553 467899999999999999999999999999999888888765 335699999999999999
Q ss_pred HHHHhCCCCCC
Q 038964 100 AMATTFLDSTH 110 (111)
Q Consensus 100 Ai~~vfP~~~H 110 (111)
||+++||++.|
T Consensus 242 ai~~~fp~a~~ 252 (381)
T PF00872_consen 242 AIREVFPGAKW 252 (381)
T ss_pred cccccccchhh
Confidence 99999999875
Transposases have been grouped into various families [, , ]. The mutator family of transposases consists of a number of elements that include, mutator from maize, IsT2 from Thiobacillus ferrooxidans, Is256 from Staphylococcus aureus, Is1201 from Lactobacillus helveticus, Is1081 from Mycobacterium bovis, IsRm3 from Rhizobium meliloti and others. More information about these proteins can be found at Protein of the Month: Transposase [].; GO: 0003677 DNA binding, 0004803 transposase activity, 0006313 transposition, DNA-mediated
>COG3328 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Back Show alignment and domain information
Probab=97.40 E-value=0.00031 Score=55.55 Aligned_cols=84 Identities=13% Similarity=0.137 Sum_probs=65.6
Q ss_pred CeEEEecccccCC--CCCceeEEEEEccCCceEEEEeeeccccccchhhHHHHHHHHHhcCCCceEEEeeCcHHHHHHHH
Q 038964 25 DIPTFDTTYQTNA--YSKPFVVIVGINQHIETISFLFGLLTSETKYTNTYLLETFLRAMNRKNSLRVIINGDQIMSKAMA 102 (111)
Q Consensus 25 dvv~~D~Ty~tn~--y~~pl~~~~g~n~~~~~~~~~~~ll~~E~~es~~W~l~~f~~~~~~~~p~~iiTD~d~a~~~Ai~ 102 (111)
.++.+|++|.+=+ -+..++..+|++.+|+-.+.|.-+=..|. ..|.-++..|... +-..-..+++|..+++.+||.
T Consensus 146 ~~v~~D~~~~k~r~v~~~~~~ia~Gv~~eG~reilg~~~~~~e~-~~w~~~l~~l~~r-gl~~v~l~v~Dg~~gl~~aI~ 223 (379)
T COG3328 146 PYVYLDAKYVKVRSVRNKAVYIAIGVTEEGRREILGIWVGVRES-KFWLSFLLDLKNR-GLSDVLLVVVDGLKGLPEAIS 223 (379)
T ss_pred eEEEEecceeehhhhhhheeeeeeccCcccchhhhceeeecccc-hhHHHHHHHHHhc-cccceeEEecchhhhhHHHHH
Confidence 4678999999988 35688899999999999999988888888 6665333334332 223445667799999999999
Q ss_pred HhCCCCCC
Q 038964 103 TTFLDSTH 110 (111)
Q Consensus 103 ~vfP~~~H 110 (111)
.+||.+.|
T Consensus 224 ~v~p~a~~ 231 (379)
T COG3328 224 AVFPQAAV 231 (379)
T ss_pred HhccHhhh
Confidence 99998765
>PF13610 DDE_Tnp_IS240: DDE domain
Back Show alignment and domain information
Probab=96.86 E-value=0.00036 Score=47.53 Aligned_cols=80 Identities=15% Similarity=0.079 Sum_probs=61.7
Q ss_pred CCeEEEecccccCCCCCceeEEEEEccCCceEEEEeeeccccccchhhHHHHHHHHHhcCCCceEEEeeCcHHHHHHHHH
Q 038964 24 GDIPTFDTTYQTNAYSKPFVVIVGINQHIETISFLFGLLTSETKYTNTYLLETFLRAMNRKNSLRVIINGDQIMSKAMAT 103 (111)
Q Consensus 24 ~dvv~~D~Ty~tn~y~~pl~~~~g~n~~~~~~~~~~~ll~~E~~es~~W~l~~f~~~~~~~~p~~iiTD~d~a~~~Ai~~ 103 (111)
|+.+.+|=||-+=+ |--...+-.+|.+++ ++.+=|-...+...=..+|+..++... ..|.+|+||+.++...|+++
T Consensus 1 ~~~w~~DEt~iki~-G~~~yl~~aiD~~~~--~l~~~ls~~Rd~~aA~~Fl~~~l~~~~-~~p~~ivtDk~~aY~~A~~~ 76 (140)
T PF13610_consen 1 GDSWHVDETYIKIK-GKWHYLWRAIDAEGN--ILDFYLSKRRDTAAAKRFLKRALKRHR-GEPRVIVTDKLPAYPAAIKE 76 (140)
T ss_pred CCEEEEeeEEEEEC-CEEEEEEEeeccccc--chhhhhhhhcccccceeeccccceeec-cccceeecccCCccchhhhh
Confidence 68899999997643 345667888899988 555556666666666666666666544 78999999999999999999
Q ss_pred hCCC
Q 038964 104 TFLD 107 (111)
Q Consensus 104 vfP~ 107 (111)
+.|+
T Consensus 77 l~~~ 80 (140)
T PF13610_consen 77 LNPE 80 (140)
T ss_pred cccc
Confidence 9997
>PF00665 rve: Integrase core domain; InterPro: IPR001584 Integrase comprises three domains capable of folding independently and whose three-dimensional structures are known
Back Show alignment and domain information
Probab=96.43 E-value=0.04 Score=35.23 Aligned_cols=74 Identities=14% Similarity=0.062 Sum_probs=51.0
Q ss_pred CCeEEEeccccc-CCCCCceeEEEEEccCCceEEEEeeeccccccchhhHHHHHHHHHhcCCCceEEEeeCcHHHH
Q 038964 24 GDIPTFDTTYQT-NAYSKPFVVIVGINQHIETISFLFGLLTSETKYTNTYLLETFLRAMNRKNSLRVIINGDQIMS 98 (111)
Q Consensus 24 ~dvv~~D~Ty~t-n~y~~pl~~~~g~n~~~~~~~~~~~ll~~E~~es~~W~l~~f~~~~~~~~p~~iiTD~d~a~~ 98 (111)
++.+.+|.+... ...+.....++.+|...+-.. +..+-..++.+.+.-+++...+..++..|++|+||+..+..
T Consensus 6 ~~~~~~D~~~~~~~~~~~~~~~~~~iD~~S~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~p~~i~tD~g~~f~ 80 (120)
T PF00665_consen 6 GERWQIDFTPMPIPDKGGRVYLLVFIDDYSRFIY-AFPVSSKETAEAALRALKRAIEKRGGRPPRVIRTDNGSEFT 80 (120)
T ss_dssp TTEEEEEEEEETGGCTT-CEEEEEEEETTTTEEE-EEEESSSSHHHHHHHHHHHHHHHHS-SE-SEEEEESCHHHH
T ss_pred CCEEEEeeEEEecCCCCccEEEEEEEECCCCcEE-EEEeecccccccccccccccccccccccceecccccccccc
Confidence 578889998444 445557888888888877544 55555555766776666666665555559999999999886
However, the manner in which the N-terminal, catalytic, and C-terminal domains interact in the holoenzyme remains obscure. Numerous studies indicate that the enzyme functions as a multimer, minimally a dimer. The integrase proteins from Human immunodeficiency virus 1 (HIV-1) and Avian sarcoma virus (ASV) have been studied most carefully with respect to the structural basis of catalysis. Although the active site of ASV integrase does not undergo significant conformational changes on binding the required metal cofactor, that of HIV-1 does. This active site-mediated conformational change in HIV-1 reorganises the catalytic core and C-terminal domains and appears to promote an interaction that is favourable for catalysis []. Retroviral integrase is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. The presence of retrovirus integrase-related gene sequences in eukaryotes is known. Bacterial transposases involved in the transposition of the insertion sequence also belong to this group. HIV integrase catalyses the incorporation of virally derived DNA into the human genome. This unique step in the virus life cycle provides a variety of points for intervention and hence is an attractive target for the development of new therapeutics for the treatment of AIDS []. Substrate recognition by the retroviral integrase enzyme is critical for retroviral integration. To catalyse this recombination event, integrase must recognise and act on two types of substrates, viral DNA and host DNA, yet the necessary interactions exhibit markedly different degrees of specificity [].; GO: 0015074 DNA integration; PDB: 3AO3_A 3OVN_A 3AO5_A 3AO4_A 3AO1_A 1C6V_D 3HPG_A 3HPH_A 3OYD_A 3OYF_B ....
>COG3316 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Back Show alignment and domain information
Probab=93.37 E-value=0.24 Score=36.54 Aligned_cols=84 Identities=14% Similarity=0.111 Sum_probs=54.6
Q ss_pred CCCeEEEecccccCCCCCceeEEEEEccCCceEEEEeeeccccccchhhHHHHHHHHHhcCCCceEEEeeCcHHHHHHHH
Q 038964 23 FGDIPTFDTTYQTNAYSKPFVVIVGINQHIETISFLFGLLTSETKYTNTYLLETFLRAMNRKNSLRVIINGDQIMSKAMA 102 (111)
Q Consensus 23 f~dvv~~D~Ty~tn~y~~pl~~~~g~n~~~~~~~~~~~ll~~E~~es~~W~l~~f~~~~~~~~p~~iiTD~d~a~~~Ai~ 102 (111)
=++..-+|=||-+-+=+- ....-.||..|++.-+- |-..-+...=.=++..+++.. ..|.+|+||+.+....|++
T Consensus 69 ~~~~w~vDEt~ikv~gkw-~ylyrAid~~g~~Ld~~--L~~rRn~~aAk~Fl~kllk~~--g~p~v~vtDka~s~~~A~~ 143 (215)
T COG3316 69 AGDSWRVDETYIKVNGKW-HYLYRAIDADGLTLDVW--LSKRRNALAAKAFLKKLLKKH--GEPRVFVTDKAPSYTAALR 143 (215)
T ss_pred cccceeeeeeEEeeccEe-eehhhhhccCCCeEEEE--EEcccCcHHHHHHHHHHHHhc--CCCceEEecCccchHHHHH
Confidence 357788899987655333 23344556667765542 222222222233445555543 6789999999999999999
Q ss_pred HhCCCCCCC
Q 038964 103 TTFLDSTHK 111 (111)
Q Consensus 103 ~vfP~~~Hr 111 (111)
++-++..||
T Consensus 144 ~l~~~~ehr 152 (215)
T COG3316 144 KLGSEVEHR 152 (215)
T ss_pred hcCcchhee
Confidence 999988886
>PF01610 DDE_Tnp_ISL3: Transposase; InterPro: IPR002560 Autonomous mobile genetic elements such as transposon or insertion sequences (IS) encode an enzyme, transposase, that is required for excising and inserting the mobile element
Back Show alignment and domain information
Probab=90.95 E-value=0.45 Score=34.63 Aligned_cols=49 Identities=18% Similarity=0.184 Sum_probs=37.2
Q ss_pred eccccccchhhHHHHHH-HHHhcCCCceEEEeeCcHHHHHHHHHhCCCCCC
Q 038964 61 LLTSETKYTNTYLLETF-LRAMNRKNSLRVIINGDQIMSKAMATTFLDSTH 110 (111)
Q Consensus 61 ll~~E~~es~~W~l~~f-~~~~~~~~p~~iiTD~d~a~~~Ai~~vfP~~~H 110 (111)
++.+-+.+++.=.|..+ -.. .....++|.+|-..+..+|+++.||+|.+
T Consensus 32 i~~~r~~~~l~~~~~~~~~~~-~~~~v~~V~~Dm~~~y~~~~~~~~P~A~i 81 (249)
T PF01610_consen 32 ILPGRDKETLKDFFRSLYPEE-ERKNVKVVSMDMSPPYRSAIREYFPNAQI 81 (249)
T ss_pred EcCCccHHHHHHHHHHhCccc-cccceEEEEcCCCcccccccccccccccc
Confidence 67787877774444433 222 34678899999999999999999999874
Transposases have been grouped into various families [, , ]. This family includes the IS204 [], IS1001 [], IS1096 [] and IS1165 [] transposases. More information about these proteins can be found at Protein of the Month: Transposase [].; GO: 0003677 DNA binding, 0004803 transposase activity, 0006313 transposition, DNA-mediated
Homologous Structure Templates
Structure Templates Detected by HHsearch
Original result of HHsearch against PDB70 database
ID Alignment Graph Length
Definition
Probability
Query 111
3kks_A 152
Integrase, IN; beta-strands flanked by alpha-helic
93.56
3hph_A 219
Integrase, IN; protein-protein complex, tetramer,
90.16
1c0m_A
238
Protein (integrase); HIV, X-RAY crystallography, p
87.82
1cxq_A 162
Avian sarcoma virus integrase; mixed beta-sheet su
84.72
3l3u_A 163
POL polyprotein; DNA integration, AIDS, integrase,
83.48
>3kks_A Integrase, IN; beta-strands flanked by alpha-helices, DNA binding protein; 2.20A {Bovine immunodeficiency virus} PDB: 3kkr_A
Back Hide alignment and structure
Probab=93.56 E-value=0.38 Score=30.51 Aligned_cols=66 Identities=9% Similarity=0.122 Sum_probs=43.4
Q ss_pred CCeEEEecccccCCCCCceeEEEEEccCCceEEEEeeeccccccchhhHHHHHHHHHhcCCCceEEEeeCcHHHHH
Q 038964 24 GDIPTFDTTYQTNAYSKPFVVIVGINQHIETISFLFGLLTSETKYTNTYLLETFLRAMNRKNSLRVIINGDQIMSK 99 (111)
Q Consensus 24 ~dvv~~D~Ty~tn~y~~pl~~~~g~n~~~~~~~~~~~ll~~E~~es~~W~l~~f~~~~~~~~p~~iiTD~d~a~~~ 99 (111)
+++..+|.|+..++ ..++-+|...+-. .+.. +.+++.+...-++..+....+ |++|.||+......
T Consensus 2 n~~w~~D~t~~~~~-----y~~~~iD~~sr~~-~~~~-~~~~~~~~~~~~l~~~~~~~g---p~~i~sDnG~~f~s 67 (152)
T 3kks_A 2 SHLWQMDNTHWNKT-----IIWVAVETNSGLV-EAQV-IPEETALQVALCILQLIQRYT---VLHLHSDNGPCFTA 67 (152)
T ss_dssp CCEEEEEEEEETTE-----EEEEEEETTTCCE-EEEE-ESSSCHHHHHHHHHHHHHHSC---CSEEEECSCHHHHS
T ss_pred CcEEEEeeeeeCCE-----EEEEEEEcCCCcE-EEEE-eCCcCHHHHHHHHHHHHHHhC---CcEEecCCchHhhH
Confidence 46788999988543 4556667666544 3443 456666666555555555444 99999999987754
>3hph_A Integrase, IN; protein-protein complex, tetramer, DNA integration, endonucl magnesium, metal-binding, multifunctional enzyme; 2.64A {Maedi visna virus} PDB: 3hpg_A
Back Show alignment and structure
Probab=90.16 E-value=1.2 Score=30.34 Aligned_cols=65 Identities=11% Similarity=0.055 Sum_probs=42.9
Q ss_pred CeEEEecccccCCCCCceeEEEEEccCCceEEEEeeeccccccchhhHHHHHHHHHhcCCCceEEEeeCcHHHHH
Q 038964 25 DIPTFDTTYQTNAYSKPFVVIVGINQHIETISFLFGLLTSETKYTNTYLLETFLRAMNRKNSLRVIINGDQIMSK 99 (111)
Q Consensus 25 dvv~~D~Ty~tn~y~~pl~~~~g~n~~~~~~~~~~~ll~~E~~es~~W~l~~f~~~~~~~~p~~iiTD~d~a~~~ 99 (111)
+++.+|.|+..++ ..++-+|...+-. .+. .+.+++.+...-++..+....+ |++|.||+......
T Consensus 61 ~~w~~D~t~~~~~-----y~~~~iD~~Sr~~-~~~-~~~~~~~~~~~~~l~~~~~~~g---p~~i~sDnG~~F~s 125 (219)
T 3hph_A 61 DHWQVDYTHYEDK-----IILVWVETNSGLI-YAE-RVKGETGQEFRVQTMKWYAMFA---PKSLQSDNGPAFVA 125 (219)
T ss_dssp CEEEEEEEEETTE-----EEEEEEETTTCCE-EEE-EESCCCHHHHHHHHHHHHHHHC---CCEEEECCCHHHHS
T ss_pred ceEEEEEecCCCE-----EEEEEEeCCCCcE-EEE-EeCCCCHHHHHHHHHHHHHHhC---CcEEEeCCCchhhh
Confidence 6778999977554 4556667766654 333 3566666665555555555444 99999999988753
>1c0m_A Protein (integrase); HIV, X-RAY crystallography, protein structure, transferase; 2.53A {Rous sarcoma virus} SCOP: b.34.7.1 c.55.3.2 PDB: 1c1a_A
Back Show alignment and structure
Probab=87.82 E-value=3.4 Score=28.69 Aligned_cols=71 Identities=10% Similarity=0.015 Sum_probs=46.1
Q ss_pred CCeEEEecccccCCCCCceeEEEEEccCCceEEEEeeeccccccchhhHHHHHHHHHhcCCCceEEEeeCcHHHHH
Q 038964 24 GDIPTFDTTYQTNAYSKPFVVIVGINQHIETISFLFGLLTSETKYTNTYLLETFLRAMNRKNSLRVIINGDQIMSK 99 (111)
Q Consensus 24 ~dvv~~D~Ty~tn~y~~pl~~~~g~n~~~~~~~~~~~ll~~E~~es~~W~l~~f~~~~~~~~p~~iiTD~d~a~~~ 99 (111)
++++.+|.||.....+. ...++-+|...+-.. |+.+ .+++.+...-+|+......+ .|++|.||+......
T Consensus 10 ~~~w~~D~~~~~~~~g~-~yl~~~iD~~Sr~i~-~~~~-~~~~~~~v~~~l~~~~~~~g--~p~~i~sDnG~~f~s 80 (238)
T 1c0m_A 10 LQIWQTDFTLEPRMAPR-SWLAVTVDTASSAIV-VTQH-GRVTSVAVQHHWATAIAVLG--RPKAIKTDNGSCFTS 80 (238)
T ss_dssp TSEEEEEEEECGGGCSS-CEEEEEEETTTCCEE-EEEE-SSCCHHHHHHHHHHHHHHHC--CCSEEEECSCHHHHS
T ss_pred CCEEEEeeccccccCCC-EEEEEEEECCCCCEE-EEEc-cCCCHHHHHHHHHHHHHHhC--CCcEEEECCCCcccc
Confidence 46889999986654443 445666777665443 5544 36666666555555555444 488999999988743
>1cxq_A Avian sarcoma virus integrase; mixed beta-sheet surrounded by alpha-helices, transferase; HET: EPE; 1.02A {Avian sarcoma virus} SCOP: c.55.3.2 PDB: 1cxu_A* 1czb_A* 1cz9_A* 1asu_A* 1asw_A* 1asv_A 1a5v_A* 1a5w_A* 1a5x_A* 1vsf_A* 1vse_A* 3o4n_A* 1vsm_A 3o4q_A* 1vsd_A* 1vsh_A* 1vsi_A* 1vsj_A* 1vsk_A 1vsl_A
Back Show alignment and structure
Probab=84.72 E-value=5.9 Score=25.41 Aligned_cols=71 Identities=10% Similarity=0.022 Sum_probs=44.7
Q ss_pred CCeEEEecccccCCCCCceeEEEEEccCCceEEEEeeeccccccchhhHHHHHHHHHhcCCCceEEEeeCcHHHHH
Q 038964 24 GDIPTFDTTYQTNAYSKPFVVIVGINQHIETISFLFGLLTSETKYTNTYLLETFLRAMNRKNSLRVIINGDQIMSK 99 (111)
Q Consensus 24 ~dvv~~D~Ty~tn~y~~pl~~~~g~n~~~~~~~~~~~ll~~E~~es~~W~l~~f~~~~~~~~p~~iiTD~d~a~~~ 99 (111)
.+++.+|.||.....+. ....+.+|...+-. +|+.+ .+++.+...-+++...... ..|++|.||+......
T Consensus 11 ~~~~~~D~~~~~~~~g~-~yl~~~iD~~Sr~i-~~~~~-~~~~~~~v~~~l~~~~~~~--g~p~~i~sDnG~~f~s 81 (162)
T 1cxq_A 11 LQIWQTDFTLEPRMAPR-SWLAVTVDTASSAI-VVTQH-GRVTSVAAQHHWATAIAVL--GRPKAIKTDNGSCFTS 81 (162)
T ss_dssp CCEEEEEEEECGGGTTS-CEEEEEEETTTCCE-EEEEE-SSCCHHHHHHHHHHHHHHH--CCCSEEECCSCHHHHS
T ss_pred CCEEEeeeeecccCCCC-EEEEEEEEcCCCeE-EEEEC-CCccHHHHHHHHHHHHHHH--CCCeEEEeCCchhhhh
Confidence 56889999986554444 44566777776644 34444 3555555444444444443 3588999999987743
>3l3u_A POL polyprotein; DNA integration, AIDS, integrase, endonuclease, polynucleotidyl transferase, DNA binding, viral protein; 1.40A {Human immunodeficiency virus 1} PDB: 3ao1_A 3l3v_A* 3ovn_A* 3ao2_A* 3ao3_A* 3ao4_A* 3ao5_A* 2itg_A 1biz_A 1biu_A 1b9d_A 4dmn_A* 2b4j_A 1itg_A 1b92_A 1bl3_A 1bi4_A 1bis_A 1b9f_A 1bi4_B ...
Back Show alignment and structure
Probab=83.48 E-value=3 Score=26.73 Aligned_cols=64 Identities=17% Similarity=0.108 Sum_probs=41.7
Q ss_pred CCeEEEecccccCCCCCceeEEEEEccCCceEEEEeeeccccccchhhHHHHHHHHHhcCCCceEEEeeCcHHH
Q 038964 24 GDIPTFDTTYQTNAYSKPFVVIVGINQHIETISFLFGLLTSETKYTNTYLLETFLRAMNRKNSLRVIINGDQIM 97 (111)
Q Consensus 24 ~dvv~~D~Ty~tn~y~~pl~~~~g~n~~~~~~~~~~~ll~~E~~es~~W~l~~f~~~~~~~~p~~iiTD~d~a~ 97 (111)
.+++.+|.||..++ ..++-+|...+-. .+.. +.+++.+...-++..+...++ +++|.||+....
T Consensus 9 ~~~w~~D~t~~~g~-----y~~~~iD~~sr~~-~~~~-~~~~~~~~~~~~l~~~~~~~~---~~~i~sD~G~~f 72 (163)
T 3l3u_A 9 PGIWQLDCTHLEGK-----VILVAVHVASGYI-EAEV-IPAETGQETAYFLLKLAGRWP---VKTVHTDNGSNF 72 (163)
T ss_dssp TTEEEEEEEEETTE-----EEEEEEETTTCCE-EEEE-ESSCCHHHHHHHHHHHHHHSC---CCEEEECCCGGG
T ss_pred CCEEEEEecccCCc-----EEEEEEecCCCcE-EEEE-eCCCcHHHHHHHHHHHHHHcC---CcEEeecChHhh
Confidence 46788999998654 3556667766643 3443 456666666555555544443 589999998765
Homologous Structure Domains
Homologous Domains Detected by HHsearch
Original result of HHsearch against SCOP70(version1.75) database
ID Alignment Graph Length
Definition
Probability
Query 111
d1exqa_ 154
Retroviral integrase, catalytic domain {Human immu
92.93
d1cxqa_ 147
Retroviral integrase, catalytic domain {Rous sarco
91.7
d1c6va_ 159
Retroviral integrase, catalytic domain {Simian imm
84.3
>d1exqa_ c.55.3.2 (A:) Retroviral integrase, catalytic domain {Human immunodeficiency virus type 1 [TaxId: 11676]}
Back Hide information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: Ribonuclease H-like motif
superfamily: Ribonuclease H-like
family: Retroviral integrase, catalytic domain
domain: Retroviral integrase, catalytic domain
species: Human immunodeficiency virus type 1 [TaxId: 11676]
Probab=92.93 E-value=0.067 Score=33.78 Aligned_cols=65 Identities=14% Similarity=0.088 Sum_probs=43.7
Q ss_pred eEEEecccccCCCCCceeEEEEEccCCceEEEEeeeccccccchhhHHHHHHHHHhcCCCceEEEeeCcHHHHHH
Q 038964 26 IPTFDTTYQTNAYSKPFVVIVGINQHIETISFLFGLLTSETKYTNTYLLETFLRAMNRKNSLRVIINGDQIMSKA 100 (111)
Q Consensus 26 vv~~D~Ty~tn~y~~pl~~~~g~n~~~~~~~~~~~ll~~E~~es~~W~l~~f~~~~~~~~p~~iiTD~d~a~~~A 100 (111)
+..+|.|+..|+| .++.+|...+- +.+. .+.+++.+...-++..+... ..|++|.||+.....++
T Consensus 5 ~W~~D~~~~~~k~-----ilv~vD~~Sr~-~~~~-~~~~~~~~~v~~~l~~~~~~---~~p~~I~sDnG~eF~s~ 69 (154)
T d1exqa_ 5 IWQLDCTHLEGKV-----ILVAVHVASGY-IEAE-VIPAETGQETAYFLLKLAGR---WPVKTIHTDNGSNFTGA 69 (154)
T ss_dssp EEEEEEEEETTEE-----EEEEEETTTTE-EEEE-EESSSSHHHHHHHHHHHHHH---SCCCEEECCCCGGGTSH
T ss_pred ceeeEEEEECCEE-----EEEEEECcCCc-EEEE-EecCcchHHHHhhhhhheec---cCCCEEEEcCCccccch
Confidence 6678999988764 45778888663 3443 45667766665555554433 34899999999877543
>d1cxqa_ c.55.3.2 (A:) Retroviral integrase, catalytic domain {Rous sarcoma virus RSV [TaxId: 11886]}
Back Show information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: Ribonuclease H-like motif
superfamily: Ribonuclease H-like
family: Retroviral integrase, catalytic domain
domain: Retroviral integrase, catalytic domain
species: Rous sarcoma virus RSV [TaxId: 11886]
Probab=91.70 E-value=0.65 Score=28.42 Aligned_cols=70 Identities=9% Similarity=0.043 Sum_probs=42.9
Q ss_pred CCeEEEecccccCCCCCceeEEEEEccCCceEEEEeeeccccccchhhHHHHHHHHHhcCCCceEEEeeCcHHHH
Q 038964 24 GDIPTFDTTYQTNAYSKPFVVIVGINQHIETISFLFGLLTSETKYTNTYLLETFLRAMNRKNSLRVIINGDQIMS 98 (111)
Q Consensus 24 ~dvv~~D~Ty~tn~y~~pl~~~~g~n~~~~~~~~~~~ll~~E~~es~~W~l~~f~~~~~~~~p~~iiTD~d~a~~ 98 (111)
++++.+|-|+.....+... ..+.+|...+- +.++ .+.+++.+...-.+......++ .|+.|.||+.....
T Consensus 7 ~~~w~~D~~~~~~~~~~~~-l~v~vD~~Sr~-i~~~-~~~~~t~~~~~~~l~~~~~~~g--~p~~i~sDnG~~F~ 76 (147)
T d1cxqa_ 7 LQIWQTDFTLEPRMAPRSW-LAVTVDTASSA-IVVT-QHGRVTSVAAQHHWATAIAVLG--RPKAIKTDNGSCFT 76 (147)
T ss_dssp CCEEEEEEEECGGGTTSCE-EEEEEETTTCC-EEEE-EESSCCHHHHHHHHHHHHHHHC--CCSEEECCSCHHHH
T ss_pred CCEEEEEEEEecCCCCeEE-EEEEEecccce-eccc-ccCccchHHHHHHHHHHHHHhC--CCeEEEeccccccc
Confidence 4678899888765555543 34555665543 3333 4566666555444444444433 69999999988774
>d1c6va_ c.55.3.2 (A:) Retroviral integrase, catalytic domain {Simian immunodeficiency virus [TaxId: 11723]}
Back Show information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: Ribonuclease H-like motif
superfamily: Ribonuclease H-like
family: Retroviral integrase, catalytic domain
domain: Retroviral integrase, catalytic domain
species: Simian immunodeficiency virus [TaxId: 11723]
Probab=84.30 E-value=1.1 Score=27.62 Aligned_cols=62 Identities=15% Similarity=0.043 Sum_probs=38.4
Q ss_pred eEEEecccccCCCCCceeEEEEEccCCceEEEEeeeccccccchhhHHHHHHHHHhcCCCceEEEeeCcHHH
Q 038964 26 IPTFDTTYQTNAYSKPFVVIVGINQHIETISFLFGLLTSETKYTNTYLLETFLRAMNRKNSLRVIINGDQIM 97 (111)
Q Consensus 26 vv~~D~Ty~tn~y~~pl~~~~g~n~~~~~~~~~~~ll~~E~~es~~W~l~~f~~~~~~~~p~~iiTD~d~a~ 97 (111)
+-.+|.||..++| .++.+|...+- +.+. .+.+++.+...-++.... ....|++|.||+....
T Consensus 6 ~W~~D~t~~~~k~-----il~~vD~~Sr~-~~~~-~~~~~~a~~v~~~l~~~~---~~~~~~~I~sDnG~~F 67 (159)
T d1c6va_ 6 TWQMDCTHLEGKI-----VIVAVHVASGF-IEAE-VIPQETGRQTALFLLKLA---GRWPITHLHTDNGANF 67 (159)
T ss_dssp EEEEEEEEETTEE-----EEEEECTTSCC-EEEE-ECSSCSHHHHHHHHHHHH---TTSCCCEEECCCCSST
T ss_pred ceeeEEEEECCEE-----EEEEEECCCCe-EEEE-EeeccCHHHHHHHHHHHh---hccCCceeEecCCCcc
Confidence 4568999987764 44556766554 3343 567777766554444433 2233568999998765