Your job contains 1 sequence.
>015432
MGPIRGLKRRKKAEKKVDQNVLAAAAASDGDGDGDADADSLVAQPQPLDWWDNFSRRISG
PLFGSKTSKNFESVFKISRKTFDYICSLVKEDLAARQSNFSFSNGKPLSPNDMVAIALRR
LSSGESLQIIGDLFGLNQSTVSQVTWRFVESMEERGLHHLQWPSKETEMEDIKSKFEKIR
GFRNCCGAIDITHIVMNIPAVDPANNVWYDREKNYSMILQGIVDPEMRFRDIIAGWPGSL
TDALVLRNSGFFKLTEEGKRLDGKSLQLSEGIELREYIIGDTGFPLLPWLLTPYQGKGLS
DIEAEYNKRHSATRMVAQMALARLKDVWRIIHGVMWMPDKNRLPRIVLVCCLLHNIVIDM
EDEMLDELPLSYHHDSGYHQQTCESVDKTASVMRDNLSLYLSGKLPP
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 015432
(407 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Sequences producing High-scoring Segment Pairs:  High Score  Smallest Sum Probability P(N)  N
TAIR|locus:2099901 - symbol:AT3G55350 species:3702 "Arabi... 1398 5.3e-143 1
TAIR|locus:2077259 - symbol:AT3G63270 species:3702 "Arabi... 950 1.6e-95 1
TAIR|locus:2143104 - symbol:AT5G12010 species:3702 "Arabi... 375 1.3e-34 1
ZFIN|ZDB-GENE-050327-32 - symbol:zgc:113227 "zgc:113227" ... 373 2.2e-34 1
TAIR|locus:2123874 - symbol:AT4G29780 "AT4G29780" species... 321 2.2e-28 1
ZFIN|ZDB-GENE-081022-77 - symbol:zgc:194221 "zgc:194221" ... 285 4.6e-25 1
RGD|1584007 - symbol:Harbi1 "harbinger transposase derive... 275 5.3e-24 1
UNIPROTKB|F1SIA2 - symbol:HARBI1 "Uncharacterized protein... 274 6.8e-24 1
ZFIN|ZDB-GENE-040608-1 - symbol:harbi1 "harbinger transpo... 274 6.8e-24 1
UNIPROTKB|Q17QR8 - symbol:HARBI1 "Putative nuclease HARBI... 272 1.1e-23 1
UNIPROTKB|Q96MB7 - symbol:HARBI1 "Putative nuclease HARBI... 272 1.1e-23 1
UNIPROTKB|E1BQ99 - symbol:HARBI1 "Uncharacterized protein... 270 1.8e-23 1
UNIPROTKB|E2RCW9 - symbol:HARBI1 "Uncharacterized protein... 266 4.8e-23 1
MGI|MGI:2443194 - symbol:Harbi1 "harbinger transposase de... 266 4.8e-23 1
TAIR|locus:2207051 - symbol:AT1G72270 species:3702 "Arabi... 235 4.9e-16 1
TAIR|locus:2094088 - symbol:AT3G19120 species:3702 "Arabi... 222 8.4e-16 1
TAIR|locus:2165775 - symbol:AT5G41980 species:3702 "Arabi... 155 2.4e-08 1
ZFIN|ZDB-GENE-020415-1 - symbol:tceb2 "transcription elon... 144 5.3e-07 1
TAIR|locus:504956234 - symbol:AT1G43722 "AT1G43722" speci... 129 1.5e-05 1
ZFIN|ZDB-GENE-060531-147 - symbol:si:dkey-56d12.4 "si:dke... 125 8.1e-05 1
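The summary rows above can be read mechanically. The following is a minimal sketch, assuming each row ends with three whitespace-separated fields (high score, smallest sum probability P(N), and N) and that the database identifier is separated from the truncated description by " - ", as in this report. The names Hit and parse_hit_line are hypothetical helpers introduced for illustration; they are not part of any BLAST distribution.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One row of the hit summary table (hypothetical container)."""
    db_id: str        # e.g. "TAIR|locus:2099901"
    description: str  # truncated description, kept as-is
    score: int        # high score
    p_value: float    # smallest sum probability P(N)
    n: int            # number of segment pairs used for P(N)


def parse_hit_line(line: str) -> Hit:
    # The last three whitespace-separated tokens are score, P(N) and N;
    # everything before them is the identifier plus description.
    head, score, p_value, n = line.rsplit(None, 3)
    db_id, _, description = head.partition(" - ")
    return Hit(db_id, description, int(score), float(p_value), int(n))


# Three rows copied verbatim from the table above.
lines = [
    'TAIR|locus:2099901 - symbol:AT3G55350 species:3702 "Arabi... 1398 5.3e-143 1',
    'RGD|1584007 - symbol:Harbi1 "harbinger transposase derive... 275 5.3e-24 1',
    'ZFIN|ZDB-GENE-060531-147 - symbol:si:dkey-56d12.4 "si:dke... 125 8.1e-05 1',
]
hits = [parse_hit_line(l) for l in lines]

# Keep only hits below an illustrative (assumed) P(N) cutoff of 1e-10.
strong = [h.db_id for h in hits if h.p_value < 1e-10]
```

Splitting from the right avoids having to interpret the truncated description, which may itself contain spaces and quotes.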
>TAIR|locus:2099901
symbol:AT3G55350 species:3702 "Arabidopsis thaliana"
[GO:0005634 "nucleus" evidence=ISM] [GO:0016788 "hydrolase
activity, acting on ester bonds" evidence=IEA] EMBL:CP002686
GenomeReviews:BA000014_GR EMBL:AL132975 InterPro:IPR026103
PANTHER:PTHR22930 HOGENOM:HOG000070719 ProtClustDB:CLSN2685285
EMBL:AY087712 EMBL:BT009674 EMBL:AK117365 IPI:IPI00516908
PIR:T47674 RefSeq:NP_191095.1 UniGene:At.35030 PRIDE:Q9M2U3
DNASU:824701 EnsemblPlants:AT3G55350.1 GeneID:824701
KEGG:ath:AT3G55350 TAIR:At3g55350 eggNOG:NOG241715
InParanoid:Q9M2U3 OMA:TTHITMC PhylomeDB:Q9M2U3
Genevestigator:Q9M2U3 Uniprot:Q9M2U3
Length = 406
Score = 1398 (497.2 bits), Expect = 5.3e-143, P = 5.3e-143
Identities = 264/408 (64%), Positives = 313/408 (76%)
Query: 1 MGPIRGLKRRKKAEKKVDQNVLXXXXXXXXXXXXXXXXXXLV----AQPQPLDWWDNFSR 56
MGPI+ +K++K+AEKKVD+NVL + Q LDWWD FSR
Sbjct: 1 MGPIKTIKKKKRAEKKVDRNVLLAATAAATSASAAAALNNNDDDDDSSSQSLDWWDGFSR 60
Query: 57 RISGPLFGSKTSKNFESVFKISRKTFDYICSLVKEDLAARQSNFSFSNGKPLSPNDMVAI 116
RI G GS K FESVFKISRKTFDYICSLVK D A+ +NFS SNG PLS ND VA+
Sbjct: 61 RIYG---GSTDPKTFESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLNDRVAV 117
Query: 117 ALRRLSSGESLQIIGDLFGLNQSTVSQVTWRFVESMEERGLHHLQWPSKETEMEDIKSKF 176
ALRRL SGESL +IG+ FG+NQSTVSQ+TWRFVESMEER +HHL WPSK +++IKSKF
Sbjct: 118 ALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPSK---LDEIKSKF 174
Query: 177 EKIRGFRNCCGAIDITHIVMNIPAVDPANNVWYDREKNYSMILQGIVDPEMRFRDIIAGW 236
EKI G NCCGAIDITHIVMN+PAV+P+N VW D EKN+SM LQ +VDP+MRF D+IAGW
Sbjct: 175 EKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAGW 234
Query: 237 PGSLTDALVLRNSGFFKLTEEGKRLDGKSLQLSEGIELREYIIGDTGFPLLPWLLTPYQG 296
PGSL D +VL+NSGF+KL E+GKRL+G+ L LSE ELREYI+GD+GFPLLPWLLTPYQG
Sbjct: 235 PGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQG 294
Query: 297 KGLSDIEAEYNKRHSATRMVAQMALARLKDVWRIIHGVMWMPDKNRLPRIVLVCCLLHNI 356
K S + E+NKRHS AQMAL++LKD WRII+GVMWMPD+NRLPRI+ VCCLLHNI
Sbjct: 295 KPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHNI 354
Query: 357 VIDMEDEMLDELPLSYHHDSGYHQQTCESVDKTASVMRDNLSLYLSGK 404
+IDMED+ LD+ PLS HD Y Q++C+ D+ +SV+RD LS L GK
Sbjct: 355 IIDMEDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQLCGK 402
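The percentages on the "Identities = ... Positives = ..." lines can be checked against the raw counts. Judging only from the figures printed in this report, the percentage appears to be truncated rather than rounded (264/408 is about 64.7% and is shown as 64%). This is an observation about the numbers here, not documented program behavior. The helper name percent_truncated is hypothetical.

```python
def percent_truncated(matches: int, length: int) -> int:
    """Integer percentage with the fractional part dropped."""
    return (100 * matches) // length


# (matches, alignment length, percentage printed in the report),
# taken from the first three alignments of this report.
reported = [
    (264, 408, 64), (313, 408, 76),   # TAIR|locus:2099901
    (180, 354, 50), (239, 354, 67),   # TAIR|locus:2077259
    (102, 355, 28), (178, 355, 50),   # TAIR|locus:2143104
]

# True if truncation reproduces every printed percentage.
all_match = all(percent_truncated(m, n) == pct for m, n, pct in reported)
```

Ordinary rounding would give 65% and 77% for the first alignment, which does not match the printed values.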
>TAIR|locus:2077259
symbol:AT3G63270 species:3702 "Arabidopsis thaliana"
[GO:0005634 "nucleus" evidence=ISM] [GO:0016788 "hydrolase
activity, acting on ester bonds" evidence=IEA] EMBL:CP002686
GenomeReviews:BA000014_GR InterPro:IPR026103 PANTHER:PTHR22930
EMBL:AF370300 EMBL:AY063087 IPI:IPI00539136 RefSeq:NP_567144.1
UniGene:At.1305 PRIDE:Q94K49 EnsemblPlants:AT3G63270.1
GeneID:825502 KEGG:ath:AT3G63270 TAIR:At3g63270 eggNOG:NOG298020
HOGENOM:HOG000070719 InParanoid:Q94K49 OMA:SGLINIE PhylomeDB:Q94K49
ProtClustDB:CLSN2685285 ArrayExpress:Q94K49 Genevestigator:Q94K49
Uniprot:Q94K49
Length = 396
Score = 950 (339.5 bits), Expect = 1.6e-95, P = 1.6e-95
Identities = 180/354 (50%), Positives = 239/354 (67%)
Query: 49 DWWDNFSRRISGPLFGSKTSKNFESVFKISRKTFDYICSLVKEDLAARQ-SNFSFSNGKP 107
DWWD F R S P S F+ F+ S+ TF YICSLV+EDL +R S G+
Sbjct: 43 DWWDTFWLRNSSPSVPSDEDYAFKHFFRASKTTFSYICSLVREDLISRPPSGLINIEGRL 102
Query: 108 LSPNDMVAIALRRLSSGESLQIIGDLFGLNQSTVSQVTWRFVESMEERGLHHLQWPSKET 167
LS VAIALRRL+SG+S +G FG+ QSTVSQVTWRF+E++EER HHL+WP +
Sbjct: 103 LSVEKQVAIALRRLASGDSQVSVGAAFGVGQSTVSQVTWRFIEALEERAKHHLRWPDSD- 161
Query: 168 EMEDIKSKFEKIRGFRNCCGAIDITHIVMNIPAVDPANNVWYDREKNYSMILQGIVDPEM 227
+E+IKSKFE++ G NCCGAID THI+M +PAV A++ W D+EKNYSM LQG+ D EM
Sbjct: 162 RIEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPAVQ-ASDDWCDQEKNYSMFLQGVFDHEM 220
Query: 228 RFRDIIAGWPGSLTDALVLRNSGFFKLTEEGKRLDGKSLQLSEGIELREYIIGDTGFPLL 287
RF +++ GWPG +T + +L+ SGFFKL E + LDG LS+G ++REY++G +PLL
Sbjct: 221 RFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQILDGNPKTLSQGAQIREYVVGGISYPLL 280
Query: 288 PWLLTPYQGKGLSDIEAEYNKRHSATRMVAQMALARLKDVWRIIHGVMWMPDKNRLPRIV 347
PWL+TP+ SD +N+RH R VA A +LK WRI+ VMW PD+ +LP I+
Sbjct: 281 PWLITPHDSDHPSDSMVAFNERHEKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSII 340
Query: 348 LVCCLLHNIVIDMEDEMLDELPLSYHHDSGYHQQTCESVDKTASVMRDNLSLYL 401
LVCCLLHNI+ID D + +++PLS HHDSGY + C+ + S +R L+ +L
Sbjct: 341 LVCCLLHNIIIDCGDYLQEDVPLSGHHDSGYADRYCKQTEPLGSELRGCLTEHL 394
>TAIR|locus:2143104
symbol:AT5G12010 species:3702 "Arabidopsis thaliana"
[GO:0005634 "nucleus" evidence=ISM] [GO:0005886 "plasma membrane"
evidence=IDA] [GO:0005774 "vacuolar membrane" evidence=IDA]
[GO:0016020 "membrane" evidence=IDA] [GO:0009507 "chloroplast"
evidence=IDA] [GO:0015824 "proline transport" evidence=RCA]
GO:GO:0005886 GO:GO:0005774 EMBL:CP002688 GenomeReviews:BA000015_GR
GO:GO:0009507 EMBL:AL163812 InterPro:IPR026103 PANTHER:PTHR22930
eggNOG:NOG243843 HOGENOM:HOG000241246 ProtClustDB:CLSN2686810
EMBL:AY058074 EMBL:BT002297 IPI:IPI00541096 PIR:T48560
RefSeq:NP_196762.1 UniGene:At.5105 IntAct:Q9LYH2 PRIDE:Q9LYH2
EnsemblPlants:AT5G12010.1 GeneID:831074 KEGG:ath:AT5G12010
TAIR:At5g12010 InParanoid:Q9LYH2 OMA:YLIANSA PhylomeDB:Q9LYH2
Genevestigator:Q9LYH2 Uniprot:Q9LYH2
Length = 502
Score = 375 (137.1 bits), Expect = 1.3e-34, P = 1.3e-34
Identities = 102/355 (28%), Positives = 178/355 (50%)
Query: 50 WWDNFSRRISGPLFGSKTSKNFESVFKISRKTFDYICSLVKEDLAARQSNFSFSNGKPLS 109
WW+ SR + P ++F+ F++S+ TF+ IC + +A + + N P+
Sbjct: 161 WWEECSR-LDYP------EEDFKKAFRMSKSTFELICDELNSAVAKEDT--ALRNAIPV- 210
Query: 110 PNDMVAIALRRLSSGESLQIIGDLFGLNQSTVSQVTWRFVESMEERGL-HHLQWPSKETE 168
VA+ + RL++GE L+++ FGL ST ++ +++++ + +LQWP E+
Sbjct: 211 -RQRVAVCIWRLATGEPLRLVSKKFGLGISTCHKLVLEVCKAIKDVLMPKYLQWPDDES- 268
Query: 169 MEDIKSKFEKIRGFRNCCGAIDITHIVMNIPAVDPA---NNVWYDREK--NYSMILQGIV 223
+ +I+ +FE + G N G++ THI + P + A N +R + +YS+ +Q +V
Sbjct: 269 LRNIRERFESVSGIPNVVGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITIQAVV 328
Query: 224 DPEMRFRDIIAGWPGSLTDALVLRNSGFFKLTEEGKRLDGKSLQLSEGIELREYIIGDTG 283
+P+ F D+ GWPGS+ D VL S ++ G L G ++ G G
Sbjct: 329 NPKGVFTDLCIGWPGSMPDDKVLEKSLLYQRANNGGLLKGM------------WVAGGPG 376
Query: 284 FPLLPWLLTPYQGKGLSDIEAEYNKRHSATRMVAQMALARLKDVWRIIHGVMWMPDKNRL 343
PLL W+L PY + L+ + +N++ S + VA+ A RLK W + + ++ L
Sbjct: 377 HPLLDWVLVPYTQQNLTWTQHAFNEKMSEVQGVAKEAFGRLKGRWACLQKRTEVKLQD-L 435
Query: 344 PRIVLVCCLLHNIVIDMEDEMLDELPLSYHHDSGYHQQTCESVDKTASVMRDNLS 398
P ++ CC+LHNI E++M EL + D + SV+ A RD +S
Sbjct: 436 PTVLGACCVLHNICEMREEKMEPELMVEVIDDEVLPENVLRSVN--AMKARDTIS 488
>ZFIN|ZDB-GENE-050327-32
symbol:zgc:113227 "zgc:113227" species:7955 "Danio
rerio" [GO:0005575 "cellular_component" evidence=ND]
ZFIN:ZDB-GENE-050327-32 GeneTree:ENSGT00530000063045
InterPro:IPR026103 PANTHER:PTHR22930 eggNOG:NOG243843 EMBL:CR926129
EMBL:BC091804 IPI:IPI00506833 RefSeq:NP_001014341.1
UniGene:Dr.90965 Ensembl:ENSDART00000065568 GeneID:541506
KEGG:dre:541506 HOGENOM:HOG000198826 InParanoid:Q58EQ3 OMA:NDEWLEV
OrthoDB:EOG4C87T0 NextBio:20879288 Uniprot:Q58EQ3
Length = 415
Score = 373 (136.4 bits), Expect = 2.2e-34, P = 2.2e-34
Identities = 104/337 (30%), Positives = 175/337 (51%)
Query: 43 AQPQPLDWWDNFSRRISGPLFGSKTSKNFESVFKISRKTFDYICSLVKEDLAARQSNFSF 102
+ P+ WWD + P F T + F F++SR++F+YIC ++ L + +NF
Sbjct: 61 SHPREHRWWD-----VIVPEF---TPEEFIQNFRVSRESFEYICRRLRHMLERKDTNFRL 112
Query: 103 SNGKPLSPNDMVAIALRRLSSGESLQIIGDLFGLNQSTVSQVTWRFVESMEERGLH-HLQ 161
S P+ VAIAL +L++G + + LFG+ STV F ++ + + H++
Sbjct: 113 S--VPVKKR--VAIALCKLATGSEYRYVSQLFGVGVSTVFNCVQDFCSAVIKILVPVHMK 168
Query: 162 WPSKETEMEDIKSKFEKIRGFRNCCGAIDITHIVMNIPAVDPANNVWYDREKNYSMILQG 221
+PS E +++++ FE C G+ID HI + P +P + +R+ +S++LQ
Sbjct: 169 FPSPE-KLKEMADVFENCWNVPQCIGSIDAHHIPIIAPEKNPRG--YLNRKGWHSVVLQA 225
Query: 222 IVDPEMRFRDIIAGWPGSLTDALVLRNSGFFKLTEEGKRLDGKSLQLSEGIELREYIIGD 281
+VD F D+ G+ G+L+DA VLR S + L E L+ + +S G ++ Y+IGD
Sbjct: 226 VVDGNGLFWDLCVGFSGNLSDARVLRQSYLWSLLSERDLLNHNKVDIS-GCDVGYYLIGD 284
Query: 282 TGFPLLPWLLTPYQG-KGLSDIEAEYNKRHSATRMVAQMALARLKDVWRIIHGVMWMPD- 339
+ +PL WL+ P+ GL+ + +N R S+ R V+ ++ +LK W+ + D
Sbjct: 285 SAYPLQNWLMKPFPDIGGLTPQQESFNSRLSSARSVSDLSFKKLKARWQCLFR---RNDC 341
Query: 340 KNRL-PRIVLVCCLLHNIVIDM-----EDEMLDELPL 370
K L ++ L CC+LHNI + ED D L L
Sbjct: 342 KVELVKKMALTCCVLHNICEEKGTQFSEDHSTDHLNL 378
>TAIR|locus:2123874
symbol:AT4G29780 "AT4G29780" species:3702 "Arabidopsis
thaliana" [GO:0009611 "response to wounding" evidence=RCA]
[GO:0009612 "response to mechanical stimulus" evidence=RCA]
[GO:0009873 "ethylene mediated signaling pathway" evidence=RCA]
[GO:0010200 "response to chitin" evidence=RCA] EMBL:CP002687
GenomeReviews:CT486007_GR InterPro:IPR026103 PANTHER:PTHR22930
EMBL:BT002922 EMBL:BT005724 IPI:IPI00544260 RefSeq:NP_567834.2
UniGene:At.3318 STRING:Q84J48 EnsemblPlants:AT4G29780.1
GeneID:829100 KEGG:ath:AT4G29780 TAIR:At4g29780 eggNOG:NOG330321
HOGENOM:HOG000241246 InParanoid:Q84J48 OMA:RDHISHN PhylomeDB:Q84J48
ProtClustDB:CLSN2686810 Genevestigator:Q84J48 Uniprot:Q84J48
Length = 540
Score = 321 (118.1 bits), Expect = 2.2e-28, P = 2.2e-28
Identities = 100/348 (28%), Positives = 170/348 (48%)
Query: 57 RISGPLFGSKTSKNFESVFKISRKTFDYICSLVKEDLAARQSNFSFSNGKPLSPNDMVAI 116
R+S P F F F++S+ TF+ IC + D + N + P +P V +
Sbjct: 202 RVSRPDF---PEDEFRREFRMSKSTFNLICEEL--DTTVTKKNTMLRDAIP-APK-RVGV 254
Query: 117 ALRRLSSGESLQIIGDLFGLNQSTVSQVTWRFVESMEERGL-HHLQWPSKETEMEDIKSK 175
+ RL++G L+ + + FGL ST ++ ++ + + +L WPS ++E+ K+K
Sbjct: 255 CVWRLATGAPLRHVSERFGLGISTCHKLVIEVCRAIYDVLMPKYLLWPS-DSEINSTKAK 313
Query: 176 FEKIRGFRNCCGAIDITHIVMNIPAVDPA---NNVWYDREK--NYSMILQGIVDPEMRFR 230
FE + N G+I THI + P V A N +R + +YS+ +QG+V+ + F
Sbjct: 314 FESVHKIPNVVGSIYTTHIPIIAPKVHVAAYFNKRHTERNQKTSYSITVQGVVNADGIFT 373
Query: 231 DIIAGWPGSLTDALVLRNSGFFKLTEEGKRLDGKSLQLSEGIELREYIIGDTGFPLLPWL 290
D+ G PGSLTD +L S L+ + + + G+ +I+G++GFPL +L
Sbjct: 374 DVCIGNPGSLTDDQILEKSS---LSRQ---------RAARGMLRDSWIVGNSGFPLTDYL 421
Query: 291 LTPYQGKGLSDIEAEYNKRHSATRMVAQMALARLKDVWRIIHGVMWMPDKNRLPRIVLVC 350
L PY + L+ + +N+ + +A A RLK W + + ++ LP ++ C
Sbjct: 422 LVPYTRQNLTWTQHAFNESIGEIQGIATAAFERLKGRWACLQKRTEVKLQD-LPYVLGAC 480
Query: 351 CLLHNIVIDMEDEMLDELPLSYHHDSGYHQQTCESVDKTASVMRDNLS 398
C+LHNI ++EML EL D + S +A RD++S
Sbjct: 481 CVLHNICEMRKEEMLPELKFEVFDDVAVPENNIRSA--SAVNTRDHIS 526
Score = 231 (86.4 bits), Expect = 1.2e-16, P = 1.2e-16
Identities = 69/237 (29%), Positives = 115/237 (48%)
Query: 49 DWWDNFSRRISGPLFGSKTSKNFESVFKISRKTFDYICSLVKEDLAARQSNFSFSNGKPL 108
DWWD R+S P F F F++S+ TF+ IC + D + N + P
Sbjct: 198 DWWD----RVSRPDF---PEDEFRREFRMSKSTFNLICEEL--DTTVTKKNTMLRDAIP- 247
Query: 109 SPNDMVAIALRRLSSGESLQIIGDLFGLNQSTVSQVTWRFVESMEERGL-HHLQWPSKET 167
+P V + + RL++G L+ + + FGL ST ++ ++ + + +L WPS ++
Sbjct: 248 APK-RVGVCVWRLATGAPLRHVSERFGLGISTCHKLVIEVCRAIYDVLMPKYLLWPS-DS 305
Query: 168 EMEDIKSKFEKIRGFRNCCGAIDITHIVMNIPAVDPA---NNVWYDREK--NYSMILQGI 222
E+ K+KFE + N G+I THI + P V A N +R + +YS+ +QG+
Sbjct: 306 EINSTKAKFESVHKIPNVVGSIYTTHIPIIAPKVHVAAYFNKRHTERNQKTSYSITVQGV 365
Query: 223 VDPEMRFRDIIAGWPGSLTDALVLRNSGFFKLTEEGKRLDGKSLQLSEGIELREYII 279
V+ + F D+ G PGSLTD +L S + L + + G L +Y++
Sbjct: 366 VNADGIFTDVCIGNPGSLTDDQILEKSSLSRQRAARGMLRDSWIVGNSGFPLTDYLL 422
>ZFIN|ZDB-GENE-081022-77
symbol:zgc:194221 "zgc:194221" species:7955 "Danio
rerio" [GO:0005575 "cellular_component" evidence=ND] [GO:0003674
"molecular_function" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] ZFIN:ZDB-GENE-081022-77 GeneTree:ENSGT00530000063045
InterPro:IPR026103 PANTHER:PTHR22930 EMBL:BX324210 EMBL:BC162733
EMBL:BC162738 IPI:IPI00774426 RefSeq:NP_001129460.1
UniGene:Dr.134637 Ensembl:ENSDART00000082245 GeneID:100191015
KEGG:dre:100191015 eggNOG:NOG248361 HOGENOM:HOG000007556
HOVERGEN:HBG079725 OMA:DGRFQRY OrthoDB:EOG42JNTD NextBio:20795590
Uniprot:B3DHE2
Length = 394
Score = 285 (105.4 bits), Expect = 4.6e-25, P = 4.6e-25
Identities = 92/346 (26%), Positives = 164/346 (47%)
Query: 71 FESVFKISRKTFDYICSLVKEDLAARQSNFSFSNGKPLSPNDMVAIALRRLSSGESLQII 130
F+ F++ R+ FD + S V +A + +N+ S + P + +AI LR L++G+S + I
Sbjct: 53 FQRYFRLDREQFDSLLSKVGPQIARQDTNYRQS----IEPAERLAICLRFLATGDSYRTI 108
Query: 131 GDLFGLNQSTVSQVTWRFVESMEERGLHHLQWPSKETE-MEDIKSKFEKIRGFRNCCGAI 189
+ + STV+ + ++ + + P TE +I + F F NC G+I
Sbjct: 109 AFSYRVGVSTVAGIVAAVTRAIWDTLAQEVM-PVPTTEDWRNISTDFLHRWNFPNCLGSI 167
Query: 190 DITHIVMNIPAVDPANNVWYDREKNYSMILQGIVDPEMRFRDIIAGWPGSLTDALVLRNS 249
D H+V+ P D + +++Y+ + YS++L +VD + RFR + G G ++D VL NS
Sbjct: 168 DGKHVVIKAP--DNSGSLFYNYKGTYSVVLLAVVDSQYRFRVVDVGSYGRMSDGGVLANS 225
Query: 250 GFFKLTEEGKRLDGKSLQLS--EGIELREYI-IGDTGFPLLPWLLTPYQGKGLSDIEAEY 306
F + +G + LS E + ++ + D FPL L+ P+ G LS + +
Sbjct: 226 IFGQALRDGALGLPQDALLSGAEHFGPQPHVFVADEAFPLRRDLMRPFPGHNLSGRQRIF 285
Query: 307 NKRHSATRMVAQMALARLKDVWRIIHGVMWMPDKNRLPRIVLVCCLLHNIVIDMEDEMLD 366
N R S R++ + L WR+ G + + N + V C+LHN + +
Sbjct: 286 NYRLSRARLIVENTFGILTAQWRMYRGAIEISPAN-VDACVKATCVLHNFLRSTTSTRIP 344
Query: 367 ELPLSYHHDS-GYHQQT---CESVDKTASVMRDNLSLYLS--GKLP 406
LP + D+ G + T + + A +R+ + Y S G +P
Sbjct: 345 -LPSAADGDAAGLQEVTRVGSNNATREAIRVRETFTSYFSTEGAVP 389
>RGD|1584007
symbol:Harbi1 "harbinger transposase derived 1" species:10116
"Rattus norvegicus" [GO:0004518 "nuclease activity" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0005737 "cytoplasm"
evidence=IEA;ISO] [GO:0005813 "centrosome" evidence=IEA;ISO]
[GO:0046872 "metal ion binding" evidence=IEA] RGD:1584007
GO:GO:0005634 GO:GO:0005737 GO:GO:0005813 GO:GO:0046872
GO:GO:0090305 GO:GO:0004518 CTD:283254 eggNOG:NOG137666
GeneTree:ENSGT00530000063045 HOGENOM:HOG000231449
HOVERGEN:HBG054543 OrthoDB:EOG479F79 InterPro:IPR026103
InterPro:IPR026244 PANTHER:PTHR22930 PRINTS:PR02086 EMBL:BC158734
IPI:IPI00394536 RefSeq:NP_001107265.2 UniGene:Rn.198635
Ensembl:ENSRNOT00000065462 GeneID:690164 KEGG:rno:690164
UCSC:RGD:1584007 NextBio:740317 ArrayExpress:B0BN95
Genevestigator:B0BN95 Uniprot:B0BN95
Length = 349
Score = 275 (101.9 bits), Expect = 5.3e-24, P = 5.3e-24
Identities = 89/340 (26%), Positives = 155/340 (45%)
Query: 67 TSKNFESVFKISRKTFDYICSLVKEDLAARQSNFSFSNGKPLSPNDMVAIALRRLSSGES 126
T + S++ R+ Y+ L+ L+ R + S + +SP + AL +SG
Sbjct: 31 TDEYLMSMYGFPRQFIYYLVELLGASLS-RPTQRS----RAISPETQILAALGFYTSGSF 85
Query: 127 LQIIGDLFGLNQSTVSQVTWRFVESMEERGLHHLQWPSKETEMEDIKSKFEKIRGFRNCC 186
+GD G++Q+++S+ E++ ER + +P+ E ++ +K +F + G
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPADEAAIQSLKDEFYGLAGMPGVI 145
Query: 187 GAIDITHIVMNIPAVDPANNVWYDREKNYSMILQGIVDPEMRFRDIIAGWPGSLTDALVL 246
GA+D H+ + P + + V +R+ +S+ + D + WPGSL D VL
Sbjct: 146 GAVDCIHVAIKAPNAEDLSYV--NRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVL 203
Query: 247 RNSGFFKLTEEGKRLDGKSLQLSEGIELREYIIGDTGFPLLPWLLTPYQGKGLSDIEAEY 306
+ S S Q G+ +++GD+ F L WLLTP + E Y
Sbjct: 204 QQSSL-------------SSQFETGMPKDSWLLGDSSFFLHTWLLTPLHIPE-TPAEYRY 249
Query: 307 NKRHSATRMVAQMALARLKDVWRIIHG----VMWMPDKNRLPRIVLVCCLLHNIVIDMED 362
N+ HSAT V + L L +R + G + + P+K+ I+L CC+LHNI ++
Sbjct: 250 NRAHSATHSVIEKTLRTLCCRFRCLDGSKGALQYSPEKSS--HIILACCVLHNISLEHGM 307
Query: 363 EMLDELPLS---YHHDSGYHQQTCESVDKTASVMRDNLSL 399
++ P++ G +Q ES+D A +R L L
Sbjct: 308 DVWSS-PVTGPIEQPPEGEDEQM-ESLDLEADRIRQELIL 345
>UNIPROTKB|F1SIA2
symbol:HARBI1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0005813 "centrosome" evidence=IEA] [GO:0005737
"cytoplasm" evidence=IEA] GO:GO:0005737 GO:GO:0005813 CTD:283254
GeneTree:ENSGT00530000063045 OMA:GDSSFFL InterPro:IPR026103
InterPro:IPR026244 PANTHER:PTHR22930 PRINTS:PR02086 EMBL:CU467600
RefSeq:XP_003122875.1 UniGene:Ssc.5597 Ensembl:ENSSSCT00000014482
GeneID:100516314 KEGG:ssc:100516314 Uniprot:F1SIA2
Length = 349
Score = 274 (101.5 bits), Expect = 6.8e-24, P = 6.8e-24
Identities = 85/340 (25%), Positives = 151/340 (44%)
Query: 67 TSKNFESVFKISRKTFDYICSLVKEDLAARQSNFSFSNGKPLSPNDMVAIALRRLSSGES 126
T + S++ R+ Y+ L+ L+ R + S + +SP + AL +SG
Sbjct: 31 TDEYLMSMYGFPRQFIYYLVELLGSSLS-RPTQRS----RAISPETQILAALGFYTSGSF 85
Query: 127 LQIIGDLFGLNQSTVSQVTWRFVESMEERGLHHLQWPSKETEMEDIKSKFEKIRGFRNCC 186
+GD G++Q+++S+ E++ ER +++P+ E ++ +K +F + G
Sbjct: 86 QTRMGDAIGISQASMSRCVTNVTEALVERASQFIRFPADEASVQALKDEFYGLAGMPGVI 145
Query: 187 GAIDITHIVMNIPAVDPANNVWYDREKNYSMILQGIVDPEMRFRDIIAGWPGSLTDALVL 246
G +D H+ + P + + V +R+ +S+ + D + WPGSL D +VL
Sbjct: 146 GVVDCIHVAIKAPNAEDLSYV--NRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCVVL 203
Query: 247 RNSGFFKLTEEGKRLDGKSLQLSEGIELREYIIGDTGFPLLPWLLTPYQGKGLSDIEAEY 306
+ S S Q G+ +++GD+ F L WL+TP + E Y
Sbjct: 204 QQSSL-------------SSQFEAGMHKESWLLGDSSFFLRSWLMTPLHIPE-TPAEYRY 249
Query: 307 NKRHSATRMVAQMALARLKDVWRIIHG----VMWMPDKNRLPRIVLVCCLLHNIVIDMED 362
N HSAT V + L +R + G + + P+K I+L CC+LHNI ++
Sbjct: 250 NMAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEK--CSHIILACCVLHNISLEHGM 307
Query: 363 EMLDEL---PLSYHHDSGYHQQTCESVDKTASVMRDNLSL 399
++ P+ + Y ES+D A +R L L
Sbjct: 308 DVWSSPVTGPMEQPPEEEYEHM--ESLDLEADRIRQELML 345
>ZFIN|ZDB-GENE-040608-1
symbol:harbi1 "harbinger transposase derived 1"
species:7955 "Danio rerio" [GO:0004518 "nuclease activity"
evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
ZFIN:ZDB-GENE-040608-1 GO:GO:0005634 GO:GO:0005737 GO:GO:0046872
GO:GO:0090305 GO:GO:0004518 CTD:283254 eggNOG:NOG137666
GeneTree:ENSGT00530000063045 HOGENOM:HOG000231449
HOVERGEN:HBG054543 OMA:GDSSFFL OrthoDB:EOG479F79 InterPro:IPR026103
InterPro:IPR026244 PANTHER:PTHR22930 PRINTS:PR02086 EMBL:BC078390
EMBL:BC100116 IPI:IPI00482479 RefSeq:NP_001003734.1
UniGene:Dr.85217 STRING:Q6AZB8 Ensembl:ENSDART00000052323
Ensembl:ENSDART00000129462 GeneID:445279 KEGG:dre:445279
InParanoid:Q6AZB8 NextBio:20832025 Bgee:Q6AZB8 Uniprot:Q6AZB8
Length = 349
Score = 274 (101.5 bits), Expect = 6.8e-24, P = 6.8e-24
Identities = 76/288 (26%), Positives = 131/288 (45%)
Query: 73 SVFKISRKTFDYICSLVKEDLAARQSNFSFSNGKPLSPNDMVAIALRRLSSGESLQIIGD 132
+ F R+ Y+ L+K+ L R + +SP+ + AL +SG +GD
Sbjct: 37 NTFGFPREFIYYLVELLKDSLLRRTQR-----SRAISPDVQILAALGFYTSGSFQSKMGD 91
Query: 133 LFGLNQSTVSQVTWRFVESMEERGLHHLQWPSKETEMEDIKSKFEKIRGFRNCCGAIDIT 192
G++Q+++S+ +++ E+ + + E + K +F +I G N G +D
Sbjct: 92 AIGISQASMSRCVSNVTKALIEKAPEFIGFTRDEATKQQFKDEFYRIAGIPNVTGVVDCA 151
Query: 193 HIVMNIPAVDPANNVWYDREKNYSMILQGIVDPEMRFRDIIAGWPGSLTDALVLRNSGFF 252
HI + P D ++ V +++ +S+ Q + D WPGSLTD V + S
Sbjct: 152 HIAIKAPNADDSSYV--NKKGFHSINCQLVCDARGLLLSAETHWPGSLTDRAVFKQSNVA 209
Query: 253 KLTEEGKRLDGKSLQLSEGIELREYIIGDTGFPLLPWLLTPYQGKGLSDIEAEYNKRHSA 312
KL EE + D EG +++GD +PL WL+TP Q S + YN H+
Sbjct: 210 KLFEEQENDD-------EG-----WLLGDNRYPLKKWLMTPVQSPE-SPADYRYNLAHTT 256
Query: 313 TRMVAQMALARLKDVWRIIHG----VMWMPDKNRLPRIVLVCCLLHNI 356
T + ++ +R + G + + P+K I+ CC+LHNI
Sbjct: 257 THEIVDRTFRAIQTRFRCLDGAKGYLQYSPEK--CSHIIQACCVLHNI 302
>UNIPROTKB|Q17QR8
symbol:HARBI1 "Putative nuclease HARBI1" species:9913 "Bos
taurus" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634 "nucleus"
evidence=IEA] [GO:0005813 "centrosome" evidence=IEA] [GO:0046872
"metal ion binding" evidence=IEA] [GO:0004518 "nuclease activity"
evidence=IEA] GO:GO:0005634 GO:GO:0005737 GO:GO:0005813
GO:GO:0046872 GO:GO:0090305 GO:GO:0004518 EMBL:BC118217
IPI:IPI00696757 RefSeq:NP_001069136.1 UniGene:Bt.37438
STRING:Q17QR8 Ensembl:ENSBTAT00000006085 GeneID:514442
KEGG:bta:514442 CTD:283254 eggNOG:NOG137666
GeneTree:ENSGT00530000063045 HOGENOM:HOG000231449
HOVERGEN:HBG054543 InParanoid:Q17QR8 OMA:GDSSFFL OrthoDB:EOG479F79
NextBio:20871335 InterPro:IPR026103 InterPro:IPR026244
PANTHER:PTHR22930 PRINTS:PR02086 Uniprot:Q17QR8
Length = 349
Score = 272 (100.8 bits), Expect = 1.1e-23, P = 1.1e-23
Identities = 85/340 (25%), Positives = 151/340 (44%)
Query: 67 TSKNFESVFKISRKTFDYICSLVKEDLAARQSNFSFSNGKPLSPNDMVAIALRRLSSGES 126
T + S++ R+ Y+ L+ L+ R + S + +SP + AL +SG
Sbjct: 31 TDEYLMSMYGFPRQFIYYLVELLGASLS-RPTQRS----RAISPETQILAALGFYTSGSF 85
Query: 127 LQIIGDLFGLNQSTVSQVTWRFVESMEERGLHHLQWPSKETEMEDIKSKFEKIRGFRNCC 186
+GD G++Q+++S+ E++ ER + +P+ E ++ +K +F + G
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPADEASVQALKDEFYGLAGIPGVI 145
Query: 187 GAIDITHIVMNIPAVDPANNVWYDREKNYSMILQGIVDPEMRFRDIIAGWPGSLTDALVL 246
G +D H+ + P + + V +R+ +S+ + D + WPGSL D +VL
Sbjct: 146 GVVDCMHVAIKAPNAEDLSYV--NRKGLHSLNCLMVCDIRGALMTVETSWPGSLQDCVVL 203
Query: 247 RNSGFFKLTEEGKRLDGKSLQLSEGIELREYIIGDTGFPLLPWLLTPYQGKGLSDIEAEY 306
+ S S Q G+ +++GD+ F L WL+TP + E Y
Sbjct: 204 QQSSL-------------SSQFEAGMHKESWLLGDSSFFLRTWLMTPLHIPE-TPAEYRY 249
Query: 307 NKRHSATRMVAQMALARLKDVWRIIHG----VMWMPDKNRLPRIVLVCCLLHNIVIDMED 362
N HSAT V + L +R + G + + P+K+ I+L CC+LHNI ++
Sbjct: 250 NMAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSS--HIILACCVLHNISLEHGM 307
Query: 363 EMLDEL---PLSYHHDSGYHQQTCESVDKTASVMRDNLSL 399
++ P+ + Y ES+D A +R L L
Sbjct: 308 DVWSSPVTGPVEQPPEEEYEHM--ESLDLEADRIRQELML 345
>UNIPROTKB|Q96MB7
symbol:HARBI1 "Putative nuclease HARBI1" species:9606 "Homo
sapiens" [GO:0004518 "nuclease activity" evidence=IEA] [GO:0046872
"metal ion binding" evidence=IEA] [GO:0005634 "nucleus"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0005813
"centrosome" evidence=IDA] GO:GO:0005634 GO:GO:0005737
GO:GO:0005813 EMBL:CH471064 GO:GO:0046872 GO:GO:0090305
GO:GO:0004518 CTD:283254 eggNOG:NOG137666 HOGENOM:HOG000231449
HOVERGEN:HBG054543 OMA:GDSSFFL OrthoDB:EOG479F79 InterPro:IPR026103
InterPro:IPR026244 PANTHER:PTHR22930 PRINTS:PR02086 EMBL:AK057237
EMBL:BC036925 IPI:IPI00065459 RefSeq:NP_776172.1 UniGene:Hs.714463
STRING:Q96MB7 DMDM:74732341 PRIDE:Q96MB7 Ensembl:ENST00000326737
GeneID:283254 KEGG:hsa:283254 UCSC:uc001ncy.3 GeneCards:GC11M046672
HGNC:HGNC:26522 HPA:HPA038671 neXtProt:NX_Q96MB7
PharmGKB:PA162390577 InParanoid:Q96MB7 PhylomeDB:Q96MB7
GenomeRNAi:283254 NextBio:93767 ArrayExpress:Q96MB7 Bgee:Q96MB7
CleanEx:HS_HARBI1 Genevestigator:Q96MB7 GermOnline:ENSG00000180423
Uniprot:Q96MB7
Length = 349
Score = 272 (100.8 bits), Expect = 1.1e-23, P = 1.1e-23
Identities = 86/340 (25%), Positives = 152/340 (44%)
Query: 67 TSKNFESVFKISRKTFDYICSLVKEDLAARQSNFSFSNGKPLSPNDMVAIALRRLSSGES 126
T + S++ R+ Y+ L+ +L+ R + S + +SP V AL +SG
Sbjct: 31 TDEYLMSMYGFPRQFIYYLVELLGANLS-RPTQRS----RAISPETQVLAALGFYTSGSF 85
Query: 127 LQIIGDLFGLNQSTVSQVTWRFVESMEERGLHHLQWPSKETEMEDIKSKFEKIRGFRNCC 186
+GD G++Q+++S+ E++ ER +++P+ E ++ +K +F + G
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIRFPADEASIQALKDEFYGLAGMPGVM 145
Query: 187 GAIDITHIVMNIPAVDPANNVWYDREKNYSMILQGIVDPEMRFRDIIAGWPGSLTDALVL 246
G +D H+ + P + + V +R+ +S+ + D + WPGSL D VL
Sbjct: 146 GVVDCIHVAIKAPNAEDLSYV--NRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVL 203
Query: 247 RNSGFFKLTEEGKRLDGKSLQLSEGIELREYIIGDTGFPLLPWLLTPYQGKGLSDIEAEY 306
+ S S Q G+ +++GD+ F L WL+TP + E Y
Sbjct: 204 QQSSL-------------SSQFEAGMHKDSWLLGDSSFFLRTWLMTPLHIPE-TPAEYRY 249
Query: 307 NKRHSATRMVAQMALARLKDVWRIIHG----VMWMPDKNRLPRIVLVCCLLHNIVIDMED 362
N HSAT V + L +R + G + + P+K+ I+L CC+LHNI ++
Sbjct: 250 NMAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSS--HIILACCVLHNISLEHGM 307
Query: 363 EMLDEL---PLSYHHDSGYHQQTCESVDKTASVMRDNLSL 399
++ P+ + Y ES+D A +R L L
Sbjct: 308 DVWSSPMTGPMEQPPEEEYEHM--ESLDLEADRIRQELML 345
>UNIPROTKB|E1BQ99
symbol:HARBI1 "Uncharacterized protein" species:9031
"Gallus gallus" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005813
"centrosome" evidence=IEA] GO:GO:0005737 GO:GO:0005813 CTD:283254
GeneTree:ENSGT00530000063045 OMA:GDSSFFL InterPro:IPR026103
InterPro:IPR026244 PANTHER:PTHR22930 PRINTS:PR02086
EMBL:AADN02033491 IPI:IPI00598024 RefSeq:XP_421117.1
Ensembl:ENSGALT00000013605 GeneID:423193 KEGG:gga:423193
NextBio:20825695 Uniprot:E1BQ99
Length = 348
Score = 270 (100.1 bits), Expect = 1.8e-23, P = 1.8e-23
Identities = 82/321 (25%), Positives = 141/321 (43%)
Query: 84 YICSLVKEDLAARQSNFSFSNGKPLSPNDMVAIALRRLSSGESLQIIGDLFGLNQSTVSQ 143
+IC LV DL + + +SP V AL +SG +GD G++Q+++S+
Sbjct: 45 FICYLV--DLLGASLSRPTQRSRAISPETQVLAALGFYTSGSFQTRMGDAIGISQASMSR 102
Query: 144 VTWRFVESMEERGLHHLQWPSKETEMEDIKSKFEKIRGFRNCCGAIDITHIVMNIPAVDP 203
E++ ER + +P E ++ +K F + G G +D TH+ + P +
Sbjct: 103 CVANVTEALVERAPQFIHFPEDEAAVQSLKDDFYALAGMPGVLGVVDCTHVAIKAPNAED 162
Query: 204 ANNVWYDREKNYSMILQGIVDPEMRFRDIIAGWPGSLTDALVLRNSGFFKLTEEGKRLDG 263
+ V +R+ +S+ + D WPGS+ D VL+ + E DG
Sbjct: 163 LSYV--NRKGLHSLNCLMVCDARGALLSAETHWPGSMPDCNVLQQAALTSQFENELYKDG 220
Query: 264 KSLQLSEGIELREYIIGDTGFPLLPWLLTPYQGKGLSDIEAEYNKRHSATRMVAQMALAR 323
+++GD+ F L WL+TP + E YN HSAT V +
Sbjct: 221 -------------WLLGDSSFFLRTWLMTPLHIPE-TPAEYRYNMAHSATHNVIERTFRT 266
Query: 324 LKDVWRIIHG----VMWMPDKNRLPRIVLVCCLLHNIVIDMEDEMLDELPLSYHHDSGYH 379
++ +R + G + + P+K+ I+L CC+LHNI + ++ P + H +
Sbjct: 267 IRSRFRCLDGSKGTLQYSPEKSS--HIILACCVLHNISLQHGLDVWSA-PAAGHVEPAEE 323
Query: 380 Q-QTCESVDKTASVMRDNLSL 399
+ + ES+D A +R L L
Sbjct: 324 EYEQMESMDSEACRIRQELLL 344
>UNIPROTKB|E2RCW9
symbol:HARBI1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0005813 "centrosome" evidence=IEA]
[GO:0005737 "cytoplasm" evidence=IEA] GO:GO:0005737 GO:GO:0005813
CTD:283254 GeneTree:ENSGT00530000063045 OMA:GDSSFFL
InterPro:IPR026103 InterPro:IPR026244 PANTHER:PTHR22930
PRINTS:PR02086 EMBL:AAEX03011498 RefSeq:XP_540753.2
Ensembl:ENSCAFT00000014604 GeneID:483633 KEGG:cfa:483633
NextBio:20858002 Uniprot:E2RCW9
Length = 349
Score = 266 (98.7 bits), Expect = 4.8e-23, P = 4.8e-23
Identities = 85/339 (25%), Positives = 152/339 (44%)
Query: 67 TSKNFESVFKISRKTFDYICSLVKEDLAARQSNFSFSNGKPLSPNDMVAIALRRLSSGES 126
T + S++ R+ Y+ L+ L+ R + S + +SP + AL +SG
Sbjct: 31 TDEYLMSMYGFPRQFIYYLVELLGASLS-RPTQRS----RAISPETQILAALGFYTSGSF 85
Query: 127 LQIIGDLFGLNQSTVSQVTWRFVESMEERGLHHLQWPSKETEMEDIKSKFEKIRGFRNCC 186
+GD G++Q+++S+ E++ ER +++P+ E M+ +K +F + G
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERATQFIRFPADEASMQALKDEFYGLAGMPGVI 145
Query: 187 GAIDITHIVMNIPAVDPANNVWYDREKNYSMILQGIVDPEMRFRDIIAGWPGSLTDALVL 246
G +D H+ + P + + V +R+ +S+ + D + WPGSL D VL
Sbjct: 146 GVVDCIHVAIKAPNAEDLSYV--NRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDYAVL 203
Query: 247 RNSGFFKLTEEGKRLDGKSLQLSEGIELREYIIGDTGFPLLPWLLTPYQGKGLSDIEAEY 306
+ S E G D +++GD+ F L WL+TP + E Y
Sbjct: 204 QQSSLNSHFEAGMHKDS-------------WLLGDSSFFLRTWLMTPLHIPE-TPAEYRY 249
Query: 307 NKRHSATRMVAQMALARLKDVWRIIHG----VMWMPDKNRLPRIVLVCCLLHNIVIDMED 362
N HSAT V + L +R + G + + P+K+ I+L CC+LHNI ++
Sbjct: 250 NMAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSS--HIILACCVLHNISLEHGM 307
Query: 363 EMLDELPLSYHHDSGYHQQT--CESVDKTASVMRDNLSL 399
++ P++ + ++ ES+D A +R L L
Sbjct: 308 DVWSS-PMTGPMEQPPEEEFEHMESLDLEADRIRQELML 345
>MGI|MGI:2443194
symbol:Harbi1 "harbinger transposase derived 1"
species:10090 "Mus musculus" [GO:0003674 "molecular_function"
evidence=ND] [GO:0004518 "nuclease activity" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0005737 "cytoplasm"
evidence=IEA] [GO:0008150 "biological_process" evidence=ND]
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0046872 "metal
ion binding" evidence=IEA] MGI:MGI:2443194 GO:GO:0005634
GO:GO:0005737 GO:GO:0005813 GO:GO:0046872 GO:GO:0090305
EMBL:AL714023 GO:GO:0004518 CTD:283254 eggNOG:NOG137666
GeneTree:ENSGT00530000063045 HOGENOM:HOG000231449
HOVERGEN:HBG054543 OMA:GDSSFFL OrthoDB:EOG479F79 InterPro:IPR026103
InterPro:IPR026244 PANTHER:PTHR22930 PRINTS:PR02086 EMBL:AK041747
EMBL:AK045343 EMBL:AK080671 EMBL:AK084226 EMBL:AK147045
EMBL:BC094315 IPI:IPI00453562 IPI:IPI00473454 IPI:IPI00816924
RefSeq:NP_848839.2 UniGene:Mm.130331 STRING:Q8BR93 PRIDE:Q8BR93
Ensembl:ENSMUST00000090608 Ensembl:ENSMUST00000111322
Ensembl:ENSMUST00000142692 GeneID:241547 KEGG:mmu:241547
UCSC:uc008kwo.1 InParanoid:Q8BR93 ChiTaRS:HARBI1 NextBio:385049
Bgee:Q8BR93 Genevestigator:Q8BR93 GermOnline:ENSMUSG00000027243
Uniprot:Q8BR93
Length = 349
Score = 266 (98.7 bits), Expect = 4.8e-23, P = 4.8e-23
Identities = 88/338 (26%), Positives = 148/338 (43%)
Query: 67 TSKNFESVFKISRKTFDYICSLVKEDLAARQSNFSFSNGKPLSPNDMVAIALRRLSSGES 126
T + S++ R+ ++ L+ L+ R + S + +SP + AL +SG
Sbjct: 31 TDEYLMSMYGFPRQFIYFLVELLGASLS-RPTQRS----RAISPETQILAALGFYTSGSF 85
Query: 127 LQIIGDLFGLNQSTVSQVTWRFVESMEERGLHHLQWPSKETEMEDIKSKFEKIRGFRNCC 186
+GD G++Q+++S+ E++ ER + +P E ++ +K +F + G
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPVDEAAVQSLKDEFYGLAGMPGVI 145
Query: 187 GAIDITHIVMNIPAVDPANNVWYDREKNYSMILQGIVDPEMRFRDIIAGWPGSLTDALVL 246
G D H+ + P + + V +R+ +S+ + D + WPGSL D VL
Sbjct: 146 GVADCIHVAIKAPNAEDLSYV--NRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVL 203
Query: 247 RNSGFFKLTEEGKRLDGKSLQLSEGIELREYIIGDTGFPLLPWLLTPYQGKGLSDIEAEY 306
+ S LT Q G+ +++GD+ F L WLLTP + E Y
Sbjct: 204 QRSS---LTS----------QFETGMPKDSWLLGDSSFFLRSWLLTPLPIPETA-AEYRY 249
Query: 307 NKRHSATRMVAQMALARLKDVWRIIHG----VMWMPDKNRLPRIVLVCCLLHNIVIDME- 361
N+ HSAT V + L L +R + G + + P+K I+L CC+LHNI +D
Sbjct: 250 NRAHSATHSVIERTLQTLCCRFRCLDGSKGALQYSPEK--CSHIILACCVLHNISLDHGM 307
Query: 362 DEMLDELPLSYHHDSGYHQQTCESVDKTASVMRDNLSL 399
D +P + ES+D A +R L L
Sbjct: 308 DVWSSPVPGPIDQPPEGEDEHMESLDLEADRIRQELIL 345
>TAIR|locus:2207051
symbol:AT1G72270 species:3702 "Arabidopsis thaliana"
[GO:0005634 "nucleus" evidence=ISM] [GO:0005739 "mitochondrion"
evidence=IDA] [GO:0007059 "chromosome segregation" evidence=RCA]
[GO:0007062 "sister chromatid cohesion" evidence=RCA] [GO:0007129
"synapsis" evidence=RCA] [GO:0007131 "reciprocal meiotic
recombination" evidence=RCA] [GO:0010332 "response to gamma
radiation" evidence=RCA] [GO:0032204 "regulation of telomere
maintenance" evidence=RCA] [GO:0032504 "multicellular organism
reproduction" evidence=RCA] [GO:0042138 "meiotic DNA double-strand
break formation" evidence=RCA] [GO:0043247 "telomere maintenance in
response to DNA damage" evidence=RCA] [GO:0045132 "meiotic
chromosome segregation" evidence=RCA] EMBL:CP002684 GO:GO:0005739
KO:K14861 UniGene:At.21413 InterPro:IPR021714 Pfam:PF11707
IPI:IPI00524456 RefSeq:NP_565039.4 PRIDE:F4IBR2
EnsemblPlants:AT1G72270.1 GeneID:843559 KEGG:ath:AT1G72270
OMA:ESSPEMG ArrayExpress:F4IBR2 Uniprot:F4IBR2
Length = 2845
Score = 235 (87.8 bits), Expect = 4.9e-16, P = 4.9e-16
Identities = 54/153 (35%), Positives = 86/153 (56%)
Query: 216 SMILQGIVDPEMRFRDIIAGWPGSLTDALVLRNSGFFKLTEEGKRLDGKSLQLSEGIELR 275
S+++Q +VD RF DI AGWP ++ + R + F + EE L G +L G+ +
Sbjct: 206 SILVQALVDSNGRFVDISAGWPSTMKPEAIFRQTKLFSIAEEV--LSGAPTKLGNGVLVP 263
Query: 276 EYIIGDTGFPLLPWLLTPYQGKGLSDIEA---EYNKR-HSATRMVAQMALARLKDVWRII 331
YI+GD+ PLLPWL+TPY SD E+ E+N H+ V ++A A+++ WRI+
Sbjct: 264 RYILGDSCLPLLPWLVTPYDLT--SDEESFREEFNNVVHTGLHSV-EIAFAKVRARWRIL 320
Query: 332 HGVMWMPDKNR-LPRIVLVCCLLHNIVIDMEDE 363
W P+ +P ++ CLLHN +++ D+
Sbjct: 321 DK-KWKPETIEFMPFVITTGCLLHNFLVNSGDD 352
>TAIR|locus:2094088
symbol:AT3G19120 species:3702 "Arabidopsis thaliana"
[GO:0005634 "nucleus" evidence=ISM] [GO:0016788 "hydrolase
activity, acting on ester bonds" evidence=IEA] [GO:0009220
"pyrimidine ribonucleotide biosynthetic process" evidence=RCA]
EMBL:CP002686 EMBL:AP000419 InterPro:IPR026103 PANTHER:PTHR22930
EMBL:AY070731 EMBL:AY149933 IPI:IPI00533950 RefSeq:NP_566626.1
UniGene:At.28342 IntAct:Q9LJL8 PRIDE:Q9LJL8
EnsemblPlants:AT3G19120.1 GeneID:821446 KEGG:ath:AT3G19120
TAIR:At3g19120 HOGENOM:HOG000090855 InParanoid:Q9LJL8 OMA:YLISKIT
PhylomeDB:Q9LJL8 ProtClustDB:CLSN2688554 Genevestigator:Q9LJL8
Uniprot:Q9LJL8
Length = 446
Score = 222 (83.2 bits), Expect = 8.4e-16, P = 8.4e-16
Identities = 71/261 (27%), Positives = 123/261 (47%)
Query: 102 FSNGKPLS-PNDM-VAIALRRLSSGESLQIIGDLFGLNQSTVSQVTWRFVESMEERGLH- 158
F LS P D VA+ L RL+ G S + + + L+ +S++T V + L+
Sbjct: 138 FITASNLSLPADYAVAMVLSRLAHGCSAKTLASRYSLDPYLISKIT-NMVTRLLATKLYP 196
Query: 159 -HLQWPSKETEMEDIKSKFEKIRGFRNCCGAIDITHIVMNIPAVDPANNVWYDREKNY-S 216
++ P + + + FE++ N CGAID T + + N+ Y + Y +
Sbjct: 197 EFIKIPVGKRRLIETTQGFEELTSLPNICGAIDSTPVKLRRRTKLNPRNI-YGCKYGYDA 255
Query: 217 MILQGIVDPEMRFRDIIAGWPGSLTDALVLRNSGFFKLTEEGKRLDGKSLQLSEGIELRE 276
++LQ + D + F D+ PG D+ R+S +K G + K + + G +R
Sbjct: 256 VLLQVVADHKKIFWDVCVKAPGGEDDSSHFRDSLLYKRLTSGDIVWEKVINI-RGHHVRP 314
Query: 277 YIIGDTGFPLLPWLLTPYQGKGL-SDIEAEYNKRHSATRMVAQMALARLKDVWRIIHGVM 335
YI+GD +PLL +L+TP+ G + E ++ R V A+ LK W+I+ +
Sbjct: 315 YIVGDWCYPLLSFLMTPFSPNGSGTPPENLFDGMLMKGRSVVVEAIGLLKARWKILQSLN 374
Query: 336 WMPDKNRLPRIVLVCCLLHNI 356
N P+ ++ CC+LHN+
Sbjct: 375 --VGVNHAPQTIVACCVLHNL 393
>TAIR|locus:2165775
symbol:AT5G41980 species:3702 "Arabidopsis thaliana"
[GO:0005634 "nucleus" evidence=ISM] [GO:0016788 "hydrolase
activity, acting on ester bonds" evidence=IEA] EMBL:CP002688
GenomeReviews:BA000015_GR EMBL:AB017067 InterPro:IPR026103
PANTHER:PTHR22930 UniGene:At.21383 UniGene:At.70296 EMBL:BT004620
EMBL:AK227532 IPI:IPI00538888 RefSeq:NP_199013.1 UniGene:At.71790
PRIDE:Q9FHY5 DNASU:834203 EnsemblPlants:AT5G41980.1 GeneID:834203
KEGG:ath:AT5G41980 TAIR:At5g41980 eggNOG:NOG274281
HOGENOM:HOG000237477 InParanoid:Q9FHY5 OMA:VAMFINT PhylomeDB:Q9FHY5
ProtClustDB:CLSN2686422 Genevestigator:Q9FHY5 Uniprot:Q9FHY5
Length = 374
Score = 155 (59.6 bits), Expect = 2.4e-08, P = 2.4e-08
Identities = 54/191 (28%), Positives = 91/191 (47%)
Query: 182 FRNCCGAIDITHIVMNIPAVDPANNVWYDREKNYSMILQGIVDP---EMRFRDIIAGWPG 238
F++C G +D HI + + VD R N ++ Q ++ ++RF ++AGW G
Sbjct: 140 FKDCVGVVDSFHIPVMV-GVDEQGPF---RNGN-GLLTQNVLAASSFDLRFNYVLAGWEG 194
Query: 239 SLTDALVLRNSGFFKLTEEGKRLDGKSLQLSEGIELREYIIGDTGFPLLPWLLTPYQGKG 298
S +D VL N+ LT K LQ+ +G +Y I D +P LP + PY G
Sbjct: 195 SASDQQVL-NAA---LTRRNK------LQVPQG----KYYIVDNKYPNLPGFIAPYHGVS 240
Query: 299 L-SDIEAE--YNKRHSATRMVAQMALARLKDVWRIIHGVMWMPDKNRLPRIVLVCCLLHN 355
S EA+ +N+RH LK+ + I+ P + ++ ++V+ C LHN
Sbjct: 241 TNSREEAKEMFNERHKLLHRAIHRTFGALKERFPILLSAPPYPLQTQV-KLVIAACALHN 299
Query: 356 IV-IDMEDEML 365
V ++ D+++
Sbjct: 300 YVRLEKPDDLV 310
>ZFIN|ZDB-GENE-020415-1
symbol:tceb2 "transcription elongation factor B
(SIII), polypeptide 2 (18kD, elongin B)" species:7955 "Danio rerio"
[GO:0000079 "regulation of cyclin-dependent protein
serine/threonine kinase activity" evidence=IEA] [GO:0019901
"protein kinase binding" evidence=IEA] ZFIN:ZDB-GENE-020415-1
InterPro:IPR026103 PANTHER:PTHR22930 EMBL:BX004999 IPI:IPI00810253
RefSeq:XP_001924016.1 RefSeq:XP_709669.1 UniGene:Dr.71798
Ensembl:ENSDART00000140998 GeneID:569097 KEGG:dre:569097
GeneTree:ENSGT00510000052535 NextBio:20889509 ArrayExpress:E9QD98
Bgee:E9QD98 Uniprot:E9QD98
Length = 420
Score = 144 (55.7 bits), Expect = 5.3e-07, P = 5.3e-07
Identities = 63/291 (21%), Positives = 121/291 (41%)
Query: 81 TFDYICSLVKEDLAARQSNFSFSNGKPLSPNDMVAIALRRLSSGESLQIIGDLFGLNQST 140
T Y+ + ++ N S LS + V ++L LS S + + F L +
Sbjct: 98 TVQYVTNFLQSSNMGYSKN-RMSGRARLSMSHTVLLSLTLLSKRVSYRSVSSSFHLEKGN 156
Query: 141 VSQVTWRFVESMEERGLHHLQWPSKETEMEDI------KSKFEKI--RGFRNCCGAIDIT 192
+ ++ + F + + + +QWP+ + ++++ S+ E + RG G + T
Sbjct: 157 IHRIFFSFCDQVIAQQNRIIQWPTGQEAIQNLLPFSSWHSRSEGLEERGLPRVLGVLGDT 216
Query: 193 HIVMNIPAVDPANNVWYDREKNYSMILQGIVDPEMRFR-DIIAGWPGSLTDALVLRNSGF 251
I + +P+ P + K L+ V P+ +++ G + + S
Sbjct: 217 RIPIRLPSGKPDSETDAPDAKK----LKSEVHPDSWLNLELVCNGDGRFIYCHISKGSE- 271
Query: 252 FKLTEEGKRLDGKSLQLSEGIELREYIIGDTGFPLLPWLLTPYQGKGLSDIEAEYNKRHS 311
++ GK L + + E + +I G PL +LTP+ G S E YN+
Sbjct: 272 ---SDRGKALTERLQKHPEMLPPGACLIAGVGHPLTEQILTPFS-TGRSPQENLYNRALG 327
Query: 312 ATRMVAQMALARLKDVWRIIHGVMWMPDKNRLPRIVLVCCLLHNIVIDMED 362
A+A LK+ ++ + + M + R +VL C+LHN+ +DM D
Sbjct: 328 NHLGRFNQAVADLKERFQKLR-YLDMGNFERAKTVVLTACVLHNVFLDMGD 377
>TAIR|locus:504956234
symbol:AT1G43722 "AT1G43722" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
"nucleus" evidence=ISM] [GO:0008150 "biological_process"
evidence=ND] EMBL:CP002684 InterPro:IPR026103 PANTHER:PTHR22930
IPI:IPI00546258 RefSeq:NP_683376.1 UniGene:At.52016
EnsemblPlants:AT1G43722.1 GeneID:840961 KEGG:ath:AT1G43722
OMA:LNIMAIC Uniprot:F4ICS6
Length = 324
Score = 129 (50.5 bits), Expect = 1.5e-05, P = 1.5e-05
Identities = 55/227 (24%), Positives = 103/227 (45%)
Query: 74 VFKISRKTFDYICSLVKEDLAARQSNFSFSNGKPLSPNDMVAIALRRLSSGESLQIIGDL 133
+ ++S F +C+++ Q+N+ +S + VA+ LR E + +G
Sbjct: 69 LLRMSLPCFTTLCNML-------QTNYDLQPTLNISIEESVAMFLRICGHNEVYRDVGLR 121
Query: 134 FGLNQSTVSQVTWRFVESMEERGLHHLQWPSKETEMEDIKSKFEKIRG----FRNCCGAI 189
FG NQ TV + + + E +++ P+++ E+ I + + + F GA+
Sbjct: 122 FGRNQETVQRKFREVLTATELLACDYIRTPTRQ-ELYRIPERLQVDQRYWPYFSGFVGAM 180
Query: 190 DITHIVMNIPAVDPANNVWYDREKNYSMILQGIVDPEMRFRDIIAGWPGSLTDALVLRNS 249
D TH+ + + ++++R N S+ + I D +M F I G PGS D VL+
Sbjct: 181 DGTHVCVKVKP--DLQGMYWNRHDNASLNIMAICDLKMLFTYIWNGAPGSCYDTAVLQ-- 236
Query: 250 GFFKLTEEGKRLDGK-SLQLSEGIELREYIIGDTGFPLLPWLLTPYQ 295
+ ++ D + L SE +Y + D+G+P LL PY+
Sbjct: 237 ----IAQQS---DSEFPLPPSE-----KYYLVDSGYPNKQGLLAPYR 271
>ZFIN|ZDB-GENE-060531-147
symbol:si:dkey-56d12.4 "si:dkey-56d12.4"
species:7955 "Danio rerio" [GO:0003676 "nucleic acid binding"
evidence=IEA] InterPro:IPR006612 PROSITE:PS50950 SMART:SM00980
ZFIN:ZDB-GENE-060531-147 GO:GO:0003676 EMBL:BX784026 EMBL:BC134990
IPI:IPI00835791 RefSeq:NP_001103508.1 UniGene:Dr.89500
Ensembl:ENSDART00000104361 GeneID:799220 KEGG:dre:799220
eggNOG:NOG245613 GeneTree:ENSGT00600000084622 HOVERGEN:HBG098973
InParanoid:A4QN70 OMA:DKILRIC NextBio:20933729 Uniprot:A4QN70
Length = 464
Score = 125 (49.1 bits), Expect = 8.1e-05, P = 8.1e-05
Identities = 62/271 (22%), Positives = 115/271 (42%)
Query: 101 SFSNGKPLSPNDMVAIALRRLSSGESLQIIGDLFGLNQSTVSQVTWRFVESMEERGLHHL 160
++SN L P D + + L +L + + F ++QS VS+V +++ MEE ++
Sbjct: 207 AYSNSFQLHPWDQLLMTLMKLRLNLLQGDLAERFAVSQSIVSKVISCWIDIMEENMRDYV 266
Query: 161 QWPSKETEMEDIKSKFEKIRGFRNCCGAIDITHIVMNIPAVDPANNVWYDREKNYSMILQ 220
W KET + F + F N ID + + P + Y + I
Sbjct: 267 PWLPKETIQATMPQCFRE--QFPNTTCIIDCSETPLQKPHNLDSRGESYSHYYGQNTIKY 324
Query: 221 GI-VDPEMRFRDIIAGWPGSLTDALVLRNSGFFKLTEEGKRLDGKSLQLSEGIELREYII 279
+ + P I + G +D + NSGF + G + +++ R + I
Sbjct: 325 LVSIAPCGLIMFISPAYGGRCSDKFITANSGFLEYLRPGDEV------MAD----RGFTI 374
Query: 280 GDTGFPLLPWLLTP-YQGKG--LSDIEAEYNKRHSATRMVAQMALARLKDVWRIIHGVMW 336
D + L+ P + KG LS+ + +R + R+ + + RLK ++II +
Sbjct: 375 SDLLYEKKVKLVIPAFTKKGMQLSEEDTTNTRRIANVRVHVERVICRLK-TFKIISQTVP 433
Query: 337 MPDKNRLPRIVLVCCLLHN----IVIDMEDE 363
+ ++ +I+ +C L N I+ D+EDE
Sbjct: 434 INLTPKIDKILRICAALCNLRSDIISDVEDE 464
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.320 0.137 0.418 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 407 389 0.00093 117 3 11 22 0.38 34
34 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 20
No. of states in DFA: 621 (66 KB)
Total size of DFA: 265 KB (2141 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 30.38u 0.11s 30.49t Elapsed: 00:00:02
Total cpu time: 30.39u 0.11s 30.50t Elapsed: 00:00:02
Start: Sat May 11 00:41:57 2013 End: Sat May 11 00:41:59 2013