Your job contains 1 sequence.
>041521
MEITRFPFLNQEEDYSHLLDLLPEMESRSTFINNNNSSNNNNNNNNNLKKRRRSDDVLNK
SAAWSDILTSLILLDEEEKREQQQYSIHSHQDKLLVDDNHKRKEQAMNDYFHQLQDHYTD
LDVMDQLRTNKRSRRTASAVATVAASASASASASEDASADNPTTAGGSAQHRRLWVKDRS
KDWWDERNHPDFPEEEFWRDFRMSKATFEMICEELESTVMKKNTMLRDAIPVRQRVAVCV
WRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDELKMKQIKEEFQG
ISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITVQGVVDTKGVFTDVC
IGWPGSMPDDQVLERSALFQRADRGLLKDVWIVGNSGYPLMDWVMVPYTQKNLTWTQHAF
NEKIGDIQAVAKDAFARLKGRWACLQKRTEVKLQDLPVVLGACCVLHNICEMRNEVMDPQ
LKFDLFDDEMIPDNSVRSMASAQARDHIAHNLLHHGLAGTSFLH
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 041521
(524 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
                                                                     Smallest
                                                                        Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)     N
TAIR|locus:2143104 - symbol:AT5G12010 species:3702 "Arabi...  1563  3.2e-162 2
TAIR|locus:2123874 - symbol:AT4G29780 "AT4G29780" species...  1564  1.4e-160 1
ZFIN|ZDB-GENE-050327-32 - symbol:zgc:113227 "zgc:113227" ...   542  2.7e-52  1
TAIR|locus:2099901 - symbol:AT3G55350 species:3702 "Arabi...   386  9.2e-36  1
TAIR|locus:2077259 - symbol:AT3G63270 species:3702 "Arabi...   351  4.7e-32  1
UNIPROTKB|E1BQ99 - symbol:HARBI1 "Uncharacterized protein...   266  1.7e-22  1
UNIPROTKB|E2RCW9 - symbol:HARBI1 "Uncharacterized protein...   259  1.0e-21  1
UNIPROTKB|Q96MB7 - symbol:HARBI1 "Putative nuclease HARBI...   258  1.3e-21  1
UNIPROTKB|Q17QR8 - symbol:HARBI1 "Putative nuclease HARBI...   254  1.5e-20  1
UNIPROTKB|F1SIA2 - symbol:HARBI1 "Uncharacterized protein...   254  1.5e-20  1
MGI|MGI:2443194 - symbol:Harbi1 "harbinger transposase de...   250  1.0e-19  1
RGD|1584007 - symbol:Harbi1 "harbinger transposase derive...   248  2.0e-19  1
ZFIN|ZDB-GENE-081022-77 - symbol:zgc:194221 "zgc:194221" ...   253  2.4e-19  1
TAIR|locus:2094088 - symbol:AT3G19120 species:3702 "Arabi...   253  4.9e-19  1
ZFIN|ZDB-GENE-040608-1 - symbol:harbi1 "harbinger transpo...   238  5.5e-18  1
FB|FBgn0052095 - symbol:CG32095 species:7227 "Drosophila ...   199  6.1e-13  1
ZFIN|ZDB-GENE-060810-147 - symbol:si:dkey-197c15.6 "si:dk...   160  1.2e-08  1
TAIR|locus:2207051 - symbol:AT1G72270 species:3702 "Arabi...   181  5.7e-08  3
TAIR|locus:504956234 - symbol:AT1G43722 "AT1G43722" speci...   123  0.00010  1
>TAIR|locus:2143104
symbol:AT5G12010 species:3702 "Arabidopsis thaliana"
[GO:0005634 "nucleus" evidence=ISM] [GO:0005886 "plasma membrane"
evidence=IDA] [GO:0005774 "vacuolar membrane" evidence=IDA]
[GO:0016020 "membrane" evidence=IDA] [GO:0009507 "chloroplast"
evidence=IDA] [GO:0015824 "proline transport" evidence=RCA]
GO:GO:0005886 GO:GO:0005774 EMBL:CP002688 GenomeReviews:BA000015_GR
GO:GO:0009507 EMBL:AL163812 InterPro:IPR026103 PANTHER:PTHR22930
eggNOG:NOG243843 HOGENOM:HOG000241246 ProtClustDB:CLSN2686810
EMBL:AY058074 EMBL:BT002297 IPI:IPI00541096 PIR:T48560
RefSeq:NP_196762.1 UniGene:At.5105 IntAct:Q9LYH2 PRIDE:Q9LYH2
EnsemblPlants:AT5G12010.1 GeneID:831074 KEGG:ath:AT5G12010
TAIR:At5g12010 InParanoid:Q9LYH2 OMA:YLIANSA PhylomeDB:Q9LYH2
Genevestigator:Q9LYH2 Uniprot:Q9LYH2
Length = 502
Score = 1563 (555.3 bits), Expect = 3.2e-162, Sum P(2) = 3.2e-162
Identities = 284/466 (60%), Positives = 357/466 (76%)
Query: 59 NKSAAWSDILTSLILLDEEEKREQQQYSIHSHQDKLLVDDNHKRKEQAMNDYFHQLQDHY 118
N++ TSL+L++E EK++Q+ + S ++ N++++ + M+DY+ L D+Y
Sbjct: 38 NETKNLKGFFTSLLLMEEHEKQDQEARNAASRREMSDFQSNYRKRARTMSDYYSDLNDYY 97
Query: 119 TDLDVMDQLRTNKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPTTAGGSAQHRRLWVKD 178
D + + K+ GS Q RRLWVKD
Sbjct: 98 ADAEESGDINL-KKSRVSRAVASVAVAAASEIEAESSEITGSGSVRGTGSGQQRRLWVKD 156
Query: 179 RSKDWWDERNHPDFPEEEFWRDFRMSKATFEMICEELESTVMKKNTMLRDAIPVRQRVAV 238
RS+ WW+E + D+PEE+F + FRMSK+TFE+IC+EL S V K++T LR+AIPVRQRVAV
Sbjct: 157 RSRAWWEECSRLDYPEEDFKKAFRMSKSTFELICDELNSAVAKEDTALRNAIPVRQRVAV 216
Query: 239 CVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDELKMKQIKEEF 298
C+WRLATGEPLR+VSK+FGLGISTCHKLVLEVC AIK VLMPK+LQWPD+ ++ I+E F
Sbjct: 217 CIWRLATGEPLRLVSKKFGLGISTCHKLVLEVCKAIKDVLMPKYLQWPDDESLRNIRERF 276
Query: 299 QGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITVQGVVDTKGVFTD 358
+ +SGIPNV GSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSIT+Q VV+ KGVFTD
Sbjct: 277 ESVSGIPNVVGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITIQAVVNPKGVFTD 336
Query: 359 VCIGWPGSMPDDQVLERSALFQRADRG-LLKDVWIVGNSGYPLMDWVMVPYTQKNLTWTQ 417
+CIGWPGSMPDD+VLE+S L+QRA+ G LLK +W+ G G+PL+DWV+VPYTQ+NLTWTQ
Sbjct: 337 LCIGWPGSMPDDKVLEKSLLYQRANNGGLLKGMWVAGGPGHPLLDWVLVPYTQQNLTWTQ 396
Query: 418 HAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVKLQDLPVVLGACCVLHNICEMRNEVM 477
HAFNEK+ ++Q VAK+AF RLKGRWACLQKRTEVKLQDLP VLGACCVLHNICEMR E M
Sbjct: 397 HAFNEKMSEVQGVAKEAFGRLKGRWACLQKRTEVKLQDLPTVLGACCVLHNICEMREEKM 456
Query: 478 DPQLKFDLFDDEMIPDNSVRSMASAQARDHIAHNLLHHGLAGTSFL 523
+P+L ++ DDE++P+N +RS+ + +ARD I+HNLLHHGLAGTSFL
Sbjct: 457 EPELMVEVIDDEVLPENVLRSVNAMKARDTISHNLLHHGLAGTSFL 502
Score = 38 (18.4 bits), Expect = 3.2e-162, Sum P(2) = 3.2e-162
Identities = 7/29 (24%), Positives = 16/29 (55%)
Query: 77 EEKREQQQYSIHSHQDKLLVDDNHKRKEQ 105
E KR + ++ LL+ + H++++Q
Sbjct: 33 ESKRNNETKNLKGFFTSLLLMEEHEKQDQ 61
>TAIR|locus:2123874
symbol:AT4G29780 "AT4G29780" species:3702 "Arabidopsis
thaliana" [GO:0009611 "response to wounding" evidence=RCA]
[GO:0009612 "response to mechanical stimulus" evidence=RCA]
[GO:0009873 "ethylene mediated signaling pathway" evidence=RCA]
[GO:0010200 "response to chitin" evidence=RCA] EMBL:CP002687
GenomeReviews:CT486007_GR InterPro:IPR026103 PANTHER:PTHR22930
EMBL:BT002922 EMBL:BT005724 IPI:IPI00544260 RefSeq:NP_567834.2
UniGene:At.3318 STRING:Q84J48 EnsemblPlants:AT4G29780.1
GeneID:829100 KEGG:ath:AT4G29780 TAIR:At4g29780 eggNOG:NOG330321
HOGENOM:HOG000241246 InParanoid:Q84J48 OMA:RDHISHN PhylomeDB:Q84J48
ProtClustDB:CLSN2686810 Genevestigator:Q84J48 Uniprot:Q84J48
Length = 540
Score = 1564 (555.6 bits), Expect = 1.4e-160, P = 1.4e-160
Identities = 305/542 (56%), Positives = 384/542 (70%)
Query: 1 MEITRFPF-LNQEEDYSHLLDLLPEMESR-STFIXXXXXXXXXXXXXXXLKKRRRSDD-- 56
MEI+ FPF Q+++ SH L L +M+S STF K+ R+ D+
Sbjct: 1 MEISSFPFPYLQDDECSHFLGLFQDMDSSPSTF--GLEGFNSNDNNTNQKKRPRKDDEGG 58
Query: 57 --------VL-----NKSAAWSDILTSLILLDEEEKREQQQYSIHSHQDKLLVDDNHKRK 103
VL N AA+ DIL +L+LLDEE K++Q+Q+ ++K L++ NHK+K
Sbjct: 59 GGGGGGTEVLGAVNGNNKAAFGDILATLLLLDEEAKQQQEQWDFEFIKEKSLLEANHKKK 118
Query: 104 EQAMNDYFHQLQDHYTDLDVMDQLRTNKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPT 163
+ M+ Y++Q+QDHY+ D R+ + +
Sbjct: 119 VKTMDGYYNQMQDHYSAAGETDGSRSKRARKTAVAAVVSAVASGADTTGLAAPVPTADIA 178
Query: 164 TAGGSA-QHRRLWVKDRSKDWWDERNHPDFPEEEFWRDFRMSKATFEMICEELESTVMKK 222
+ GS HRRLWVK+R+ DWWD + PDFPE+EF R+FRMSK+TF +ICEEL++TV KK
Sbjct: 179 SGSGSGPSHRRLWVKERTTDWWDRVSRPDFPEDEFRREFRMSKSTFNLICEELDTTVTKK 238
Query: 223 NTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPKF 282
NTMLRDAIP +RV VCVWRLATG PLR VS+RFGLGISTCHKLV+EVC AI VLMPK+
Sbjct: 239 NTMLRDAIPAPKRVGVCVWRLATGAPLRHVSERFGLGISTCHKLVIEVCRAIYDVLMPKY 298
Query: 283 LQWPDELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSY 342
L WP + ++ K +F+ + IPNV GS+YTTHIPIIAPK+ VA+YFNKRHTERNQKTSY
Sbjct: 299 LLWPSDSEINSTKAKFESVHKIPNVVGSIYTTHIPIIAPKVHVAAYFNKRHTERNQKTSY 358
Query: 343 SITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALF-QRADRGLLKDVWIVGNSGYPLM 401
SITVQGVV+ G+FTDVCIG PGS+ DDQ+LE+S+L QRA RG+L+D WIVGNSG+PL
Sbjct: 359 SITVQGVVNADGIFTDVCIGNPGSLTDDQILEKSSLSRQRAARGMLRDSWIVGNSGFPLT 418
Query: 402 DWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVKLQDLPVVLG 461
D+++VPYT++NLTWTQHAFNE IG+IQ +A AF RLKGRWACLQKRTEVKLQDLP VLG
Sbjct: 419 DYLLVPYTRQNLTWTQHAFNESIGEIQGIATAAFERLKGRWACLQKRTEVKLQDLPYVLG 478
Query: 462 ACCVLHNICEMRNEVMDPQLKFDLFDDEMIPDNSVRSMASAQARDHIAHNLLHHGLAGTS 521
ACCVLHNICEMR E M P+LKF++FDD +P+N++RS ++ RDHI+HNLLH GLAGT
Sbjct: 479 ACCVLHNICEMRKEEMLPELKFEVFDDVAVPENNIRSASAVNTRDHISHNLLHRGLAGTR 538
Query: 522 FL 523
L
Sbjct: 539 TL 540
>ZFIN|ZDB-GENE-050327-32
symbol:zgc:113227 "zgc:113227" species:7955 "Danio
rerio" [GO:0005575 "cellular_component" evidence=ND]
ZFIN:ZDB-GENE-050327-32 GeneTree:ENSGT00530000063045
InterPro:IPR026103 PANTHER:PTHR22930 eggNOG:NOG243843 EMBL:CR926129
EMBL:BC091804 IPI:IPI00506833 RefSeq:NP_001014341.1
UniGene:Dr.90965 Ensembl:ENSDART00000065568 GeneID:541506
KEGG:dre:541506 HOGENOM:HOG000198826 InParanoid:Q58EQ3 OMA:NDEWLEV
OrthoDB:EOG4C87T0 NextBio:20879288 Uniprot:Q58EQ3
Length = 415
Score = 542 (195.9 bits), Expect = 2.7e-52, P = 2.7e-52
Identities = 114/311 (36%), Positives = 186/311 (59%)
Query: 174 LWVKDRSKDWWDERNHPDFPEEEFWRDFRMSKATFEMICEELESTVMKKNTMLRDAIPVR 233
+W R WWD P+F EEF ++FR+S+ +FE IC L + +K+T R ++PV+
Sbjct: 59 VWSHPREHRWWDVIV-PEFTPEEFIQNFRVSRESFEYICRRLRHMLERKDTNFRLSVPVK 117
Query: 234 QRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDELKMKQ 293
+RVA+ + +LATG R VS+ FG+G+ST V + CSA+ +L+P +++P K+K+
Sbjct: 118 KRVAIALCKLATGSEYRYVSQLFGVGVSTVFNCVQDFCSAVIKILVPVHMKFPSPEKLKE 177
Query: 294 IKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITVQGVVDTK 353
+ + F+ +P GS+ HIPIIAP+ + Y N+ K +S+ +Q VVD
Sbjct: 178 MADVFENCWNVPQCIGSIDAHHIPIIAPEKNPRGYLNR-------KGWHSVVLQAVVDGN 230
Query: 354 GVFTDVCIGWPGSMPDDQVLERSALFQR-ADRGLLK---------DV--WIVGNSGYPLM 401
G+F D+C+G+ G++ D +VL +S L+ ++R LL DV +++G+S YPL
Sbjct: 231 GLFWDLCVGFSGNLSDARVLRQSYLWSLLSERDLLNHNKVDISGCDVGYYLIGDSAYPLQ 290
Query: 402 DWVMVPYTQ-KNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVKLQDLPVVL 460
+W+M P+ LT Q +FN ++ ++V+ +F +LK RW CL +R + K++ + +
Sbjct: 291 NWLMKPFPDIGGLTPQQESFNSRLSSARSVSDLSFKKLKARWQCLFRRNDCKVELVKKMA 350
Query: 461 GACCVLHNICE 471
CCVLHNICE
Sbjct: 351 LTCCVLHNICE 361
>TAIR|locus:2099901
symbol:AT3G55350 species:3702 "Arabidopsis thaliana"
[GO:0005634 "nucleus" evidence=ISM] [GO:0016788 "hydrolase
activity, acting on ester bonds" evidence=IEA] EMBL:CP002686
GenomeReviews:BA000014_GR EMBL:AL132975 InterPro:IPR026103
PANTHER:PTHR22930 HOGENOM:HOG000070719 ProtClustDB:CLSN2685285
EMBL:AY087712 EMBL:BT009674 EMBL:AK117365 IPI:IPI00516908
PIR:T47674 RefSeq:NP_191095.1 UniGene:At.35030 PRIDE:Q9M2U3
DNASU:824701 EnsemblPlants:AT3G55350.1 GeneID:824701
KEGG:ath:AT3G55350 TAIR:At3g55350 eggNOG:NOG241715
InParanoid:Q9M2U3 OMA:TTHITMC PhylomeDB:Q9M2U3
Genevestigator:Q9M2U3 Uniprot:Q9M2U3
Length = 406
Score = 386 (140.9 bits), Expect = 9.2e-36, P = 9.2e-36
Identities = 100/325 (30%), Positives = 173/325 (53%)
Query: 179 RSKDWWD---ERNHPDFPEEEFWRD-FRMSKATFEMICEELESTVMKKNTMLRDA----I 230
+S DWWD R + + + + F++S+ TF+ IC +++ K D+ +
Sbjct: 50 QSLDWWDGFSRRIYGGSTDPKTFESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPL 109
Query: 231 PVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDELK 290
+ RVAV + RL +GE L V+ + FG+ ST ++ +++ + L WP K
Sbjct: 110 SLNDRVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAI-HHLSWPS--K 166
Query: 291 MKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITVQGVV 350
+ +IK +F+ ISG+PN G++ THI + P + + NK + +K ++S+T+Q VV
Sbjct: 167 LDEIKSKFEKISGLPNCCGAIDITHIVMNLPAVEPS---NKVWLD-GEK-NFSMTLQAVV 221
Query: 351 DTKGVFTDVCIGWPGSMPDDQVLERSALFQRADRGL--------LKD-----VWIVGNSG 397
D F DV GWPGS+ DD VL+ S ++ ++G L + +IVG+SG
Sbjct: 222 DPDMRFLDVIAGWPGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSG 281
Query: 398 YPLMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVKLQD-L 456
+PL+ W++ PY K + Q FN++ + A+ A ++LK RW + + ++ L
Sbjct: 282 FPLLPWLLTPYQGKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRL 341
Query: 457 PVVLGACCVLHNIC-EMRNEVMDPQ 480
P ++ CC+LHNI +M ++ +D Q
Sbjct: 342 PRIIFVCCLLHNIIIDMEDQTLDDQ 366
>TAIR|locus:2077259
symbol:AT3G63270 species:3702 "Arabidopsis thaliana"
[GO:0005634 "nucleus" evidence=ISM] [GO:0016788 "hydrolase
activity, acting on ester bonds" evidence=IEA] EMBL:CP002686
GenomeReviews:BA000014_GR InterPro:IPR026103 PANTHER:PTHR22930
EMBL:AF370300 EMBL:AY063087 IPI:IPI00539136 RefSeq:NP_567144.1
UniGene:At.1305 PRIDE:Q94K49 EnsemblPlants:AT3G63270.1
GeneID:825502 KEGG:ath:AT3G63270 TAIR:At3g63270 eggNOG:NOG298020
HOGENOM:HOG000070719 InParanoid:Q94K49 OMA:SGLINIE PhylomeDB:Q94K49
ProtClustDB:CLSN2685285 ArrayExpress:Q94K49 Genevestigator:Q94K49
Uniprot:Q94K49
Length = 396
Score = 351 (128.6 bits), Expect = 4.7e-32, P = 4.7e-32
Identities = 91/309 (29%), Positives = 158/309 (51%)
Query: 183 WWDERNHPDFPEEE---FWRDFRMSKATFEMICEEL-ESTVMKKNTMLRDA----IPVRQ 234
+W + P P +E F FR SK TF IC + E + + + L + + V +
Sbjct: 48 FWLRNSSPSVPSDEDYAFKHFFRASKTTFSYICSLVREDLISRPPSGLINIEGRLLSVEK 107
Query: 235 RVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDELKMKQI 294
+VA+ + RLA+G+ V FG+G ST ++ A++ L+WPD ++++I
Sbjct: 108 QVAIALRRLASGDSQVSVGAAFGVGQSTVSQVTWRFIEALEE-RAKHHLRWPDSDRIEEI 166
Query: 295 KEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITVQGVVDTKG 354
K +F+ + G+PN G++ TTHI + P + + + +Q+ +YS+ +QGV D +
Sbjct: 167 KSKFEEMYGLPNCCGAIDTTHIIMTLPAVQASDDWC------DQEKNYSMFLQGVFDHEM 220
Query: 355 VFTDVCIGWPGSMPDDQVLERSALFQRADRGLLKD-------------VWIVGNSGYPLM 401
F ++ GWPG M ++L+ S F+ + + D ++VG YPL+
Sbjct: 221 RFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQILDGNPKTLSQGAQIREYVVGGISYPLL 280
Query: 402 DWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQK-RTEVKLQDLPVVL 460
W++ P+ + + + AFNE+ +++VA AF +LKG W L K + LP ++
Sbjct: 281 PWLITPHDSDHPSDSMVAFNERHEKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSII 340
Query: 461 GACCVLHNI 469
CC+LHNI
Sbjct: 341 LVCCLLHNI 349
Score = 250 (93.1 bits), Expect = 5.8e-19, P = 5.8e-19
Identities = 67/220 (30%), Positives = 114/220 (51%)
Query: 182 DWWDE---RNH-PDFPEEE---FWRDFRMSKATFEMICEEL-ESTVMKKNTMLRDA---- 229
DWWD RN P P +E F FR SK TF IC + E + + + L +
Sbjct: 43 DWWDTFWLRNSSPSVPSDEDYAFKHFFRASKTTFSYICSLVREDLISRPPSGLINIEGRL 102
Query: 230 IPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDEL 289
+ V ++VA+ + RLA+G+ V FG+G ST ++ A++ L+WPD
Sbjct: 103 LSVEKQVAIALRRLASGDSQVSVGAAFGVGQSTVSQVTWRFIEALEE-RAKHHLRWPDSD 161
Query: 290 KMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITVQGV 349
++++IK +F+ + G+PN G++ TTHI + P + + + +Q+ +YS+ +QGV
Sbjct: 162 RIEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPAVQASDDWC------DQEKNYSMFLQGV 215
Query: 350 VDTKGVFTDVCIGWPGSMPDDQVLERSALFQRADRGLLKD 389
D + F ++ GWPG M ++L+ S F+ + + D
Sbjct: 216 FDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQILD 255
>UNIPROTKB|E1BQ99
symbol:HARBI1 "Uncharacterized protein" species:9031
"Gallus gallus" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005813
"centrosome" evidence=IEA] GO:GO:0005737 GO:GO:0005813 CTD:283254
GeneTree:ENSGT00530000063045 OMA:GDSSFFL InterPro:IPR026103
InterPro:IPR026244 PANTHER:PTHR22930 PRINTS:PR02086
EMBL:AADN02033491 IPI:IPI00598024 RefSeq:XP_421117.1
Ensembl:ENSGALT00000013605 GeneID:423193 KEGG:gga:423193
NextBio:20825695 Uniprot:E1BQ99
Length = 348
Score = 266 (98.7 bits), Expect = 1.7e-22, P = 1.7e-22
Identities = 79/287 (27%), Positives = 134/287 (46%)
Query: 192 FPEEEFWRDFRMSKATF--EMICE--ELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGE 247
F E+ ++ +S F + IC +L + + T AI +V + +G
Sbjct: 25 FKLEDVTDEYLVSTYGFPRQFICYLVDLLGASLSRPTQRSRAISPETQVLAALGFYTSGS 84
Query: 248 PLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWP-DELKMKQIKEEFQGISGIPN 306
+ G+ ++ + V V A+ P+F+ +P DE ++ +K++F ++G+P
Sbjct: 85 FQTRMGDAIGISQASMSRCVANVTEAL-VERAPQFIHFPEDEAAVQSLKDDFYALAGMPG 143
Query: 307 VGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGS 366
V G + TH+ I AP SY N+ K +S+ V D +G WPGS
Sbjct: 144 VLGVVDCTHVAIKAPNAEDLSYVNR-------KGLHSLNCLMVCDARGALLSAETHWPGS 196
Query: 367 MPDDQVLERSALFQRADRGLLKDVWIVGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGD 426
MPD VL+++AL + + L KD W++G+S + L W+M P T ++ +N
Sbjct: 197 MPDCNVLQQAALTSQFENELYKDGWLLGDSSFFLRTWLMTPLHIPE-TPAEYRYNMAHSA 255
Query: 427 IQAVAKDAFARLKGRWACLQKRTEVKLQDLPV----VLGACCVLHNI 469
V + F ++ R+ CL ++ LQ P ++ ACCVLHNI
Sbjct: 256 THNVIERTFRTIRSRFRCLDG-SKGTLQYSPEKSSHIILACCVLHNI 301
>UNIPROTKB|E2RCW9
symbol:HARBI1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0005813 "centrosome" evidence=IEA]
[GO:0005737 "cytoplasm" evidence=IEA] GO:GO:0005737 GO:GO:0005813
CTD:283254 GeneTree:ENSGT00530000063045 OMA:GDSSFFL
InterPro:IPR026103 InterPro:IPR026244 PANTHER:PTHR22930
PRINTS:PR02086 EMBL:AAEX03011498 RefSeq:XP_540753.2
Ensembl:ENSCAFT00000014604 GeneID:483633 KEGG:cfa:483633
NextBio:20858002 Uniprot:E2RCW9
Length = 349
Score = 259 (96.2 bits), Expect = 1.0e-21, P = 1.0e-21
Identities = 74/261 (28%), Positives = 123/261 (47%)
Query: 214 ELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSA 273
EL + + T AI ++ + +G + G+ ++ + V V A
Sbjct: 51 ELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEA 110
Query: 274 IKTVLMPKFLQWP-DELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKR 332
+ +F+++P DE M+ +K+EF G++G+P V G + H+ I AP SY N+
Sbjct: 111 L-VERATQFIRFPADEASMQALKDEFYGLAGMPGVIGVVDCIHVAIKAPNAEDLSYVNR- 168
Query: 333 HTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALFQRADRGLLKDVWI 392
K +S+ V D +G V WPGS+ D VL++S+L + G+ KD W+
Sbjct: 169 ------KGLHSLNCLMVCDIRGALMTVETNWPGSLQDYAVLQQSSLNSHFEAGMHKDSWL 222
Query: 393 VGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVK 452
+G+S + L W+M P T ++ +N +V + F L R+ CL ++
Sbjct: 223 LGDSSFFLRTWLMTPLHIPE-TPAEYRYNMAHSATHSVIEKTFRTLCSRFRCLDG-SKGA 280
Query: 453 LQDLPV----VLGACCVLHNI 469
LQ P ++ ACCVLHNI
Sbjct: 281 LQYSPEKSSHIILACCVLHNI 301
>UNIPROTKB|Q96MB7
symbol:HARBI1 "Putative nuclease HARBI1" species:9606 "Homo
sapiens" [GO:0004518 "nuclease activity" evidence=IEA] [GO:0046872
"metal ion binding" evidence=IEA] [GO:0005634 "nucleus"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0005813
"centrosome" evidence=IDA] GO:GO:0005634 GO:GO:0005737
GO:GO:0005813 EMBL:CH471064 GO:GO:0046872 GO:GO:0090305
GO:GO:0004518 CTD:283254 eggNOG:NOG137666 HOGENOM:HOG000231449
HOVERGEN:HBG054543 OMA:GDSSFFL OrthoDB:EOG479F79 InterPro:IPR026103
InterPro:IPR026244 PANTHER:PTHR22930 PRINTS:PR02086 EMBL:AK057237
EMBL:BC036925 IPI:IPI00065459 RefSeq:NP_776172.1 UniGene:Hs.714463
STRING:Q96MB7 DMDM:74732341 PRIDE:Q96MB7 Ensembl:ENST00000326737
GeneID:283254 KEGG:hsa:283254 UCSC:uc001ncy.3 GeneCards:GC11M046672
HGNC:HGNC:26522 HPA:HPA038671 neXtProt:NX_Q96MB7
PharmGKB:PA162390577 InParanoid:Q96MB7 PhylomeDB:Q96MB7
GenomeRNAi:283254 NextBio:93767 ArrayExpress:Q96MB7 Bgee:Q96MB7
CleanEx:HS_HARBI1 Genevestigator:Q96MB7 GermOnline:ENSG00000180423
Uniprot:Q96MB7
Length = 349
Score = 258 (95.9 bits), Expect = 1.3e-21, P = 1.3e-21
Identities = 74/261 (28%), Positives = 124/261 (47%)
Query: 214 ELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSA 273
EL + + T AI +V + +G + G+ ++ + V V A
Sbjct: 51 ELLGANLSRPTQRSRAISPETQVLAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEA 110
Query: 274 IKTVLMPKFLQWP-DELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKR 332
+ +F+++P DE ++ +K+EF G++G+P V G + H+ I AP SY N+
Sbjct: 111 L-VERASQFIRFPADEASIQALKDEFYGLAGMPGVMGVVDCIHVAIKAPNAEDLSYVNR- 168
Query: 333 HTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALFQRADRGLLKDVWI 392
K +S+ V D +G V WPGS+ D VL++S+L + + G+ KD W+
Sbjct: 169 ------KGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVLQQSSLSSQFEAGMHKDSWL 222
Query: 393 VGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVK 452
+G+S + L W+M P T ++ +N +V + F L R+ CL ++
Sbjct: 223 LGDSSFFLRTWLMTPLHIPE-TPAEYRYNMAHSATHSVIEKTFRTLCSRFRCLDG-SKGA 280
Query: 453 LQDLPV----VLGACCVLHNI 469
LQ P ++ ACCVLHNI
Sbjct: 281 LQYSPEKSSHIILACCVLHNI 301
>UNIPROTKB|Q17QR8
symbol:HARBI1 "Putative nuclease HARBI1" species:9913 "Bos
taurus" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634 "nucleus"
evidence=IEA] [GO:0005813 "centrosome" evidence=IEA] [GO:0046872
"metal ion binding" evidence=IEA] [GO:0004518 "nuclease activity"
evidence=IEA] GO:GO:0005634 GO:GO:0005737 GO:GO:0005813
GO:GO:0046872 GO:GO:0090305 GO:GO:0004518 EMBL:BC118217
IPI:IPI00696757 RefSeq:NP_001069136.1 UniGene:Bt.37438
STRING:Q17QR8 Ensembl:ENSBTAT00000006085 GeneID:514442
KEGG:bta:514442 CTD:283254 eggNOG:NOG137666
GeneTree:ENSGT00530000063045 HOGENOM:HOG000231449
HOVERGEN:HBG054543 InParanoid:Q17QR8 OMA:GDSSFFL OrthoDB:EOG479F79
NextBio:20871335 InterPro:IPR026103 InterPro:IPR026244
PANTHER:PTHR22930 PRINTS:PR02086 Uniprot:Q17QR8
Length = 349
Score = 254 (94.5 bits), Expect = 1.5e-20, P = 1.5e-20
Identities = 73/261 (27%), Positives = 123/261 (47%)
Query: 214 ELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSA 273
EL + + T AI ++ + +G + G+ ++ + V V A
Sbjct: 51 ELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEA 110
Query: 274 IKTVLMPKFLQWP-DELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKR 332
+ +F+ +P DE ++ +K+EF G++GIP V G + H+ I AP SY N+
Sbjct: 111 L-VERASQFIHFPADEASVQALKDEFYGLAGIPGVIGVVDCMHVAIKAPNAEDLSYVNR- 168
Query: 333 HTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALFQRADRGLLKDVWI 392
K +S+ V D +G V WPGS+ D VL++S+L + + G+ K+ W+
Sbjct: 169 ------KGLHSLNCLMVCDIRGALMTVETSWPGSLQDCVVLQQSSLSSQFEAGMHKESWL 222
Query: 393 VGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVK 452
+G+S + L W+M P T ++ +N +V + F L R+ CL ++
Sbjct: 223 LGDSSFFLRTWLMTPLHIPE-TPAEYRYNMAHSATHSVIEKTFRTLCSRFRCLDG-SKGA 280
Query: 453 LQDLPV----VLGACCVLHNI 469
LQ P ++ ACCVLHNI
Sbjct: 281 LQYSPEKSSHIILACCVLHNI 301
>UNIPROTKB|F1SIA2
symbol:HARBI1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0005813 "centrosome" evidence=IEA] [GO:0005737
"cytoplasm" evidence=IEA] GO:GO:0005737 GO:GO:0005813 CTD:283254
GeneTree:ENSGT00530000063045 OMA:GDSSFFL InterPro:IPR026103
InterPro:IPR026244 PANTHER:PTHR22930 PRINTS:PR02086 EMBL:CU467600
RefSeq:XP_003122875.1 UniGene:Ssc.5597 Ensembl:ENSSSCT00000014482
GeneID:100516314 KEGG:ssc:100516314 Uniprot:F1SIA2
Length = 349
Score = 254 (94.5 bits), Expect = 1.5e-20, P = 1.5e-20
Identities = 72/261 (27%), Positives = 125/261 (47%)
Query: 214 ELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSA 273
EL + + + T AI ++ + +G + G+ ++ + V V A
Sbjct: 51 ELLGSSLSRPTQRSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVTNVTEA 110
Query: 274 IKTVLMPKFLQWP-DELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKR 332
+ +F+++P DE ++ +K+EF G++G+P V G + H+ I AP SY N+
Sbjct: 111 L-VERASQFIRFPADEASVQALKDEFYGLAGMPGVIGVVDCIHVAIKAPNAEDLSYVNR- 168
Query: 333 HTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALFQRADRGLLKDVWI 392
K +S+ V D +G V WPGS+ D VL++S+L + + G+ K+ W+
Sbjct: 169 ------KGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCVVLQQSSLSSQFEAGMHKESWL 222
Query: 393 VGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVK 452
+G+S + L W+M P T ++ +N +V + F L R+ CL ++
Sbjct: 223 LGDSSFFLRSWLMTPLHIPE-TPAEYRYNMAHSATHSVIEKTFRTLCSRFRCLDG-SKGA 280
Query: 453 LQDLPV----VLGACCVLHNI 469
LQ P ++ ACCVLHNI
Sbjct: 281 LQYSPEKCSHIILACCVLHNI 301
>MGI|MGI:2443194
symbol:Harbi1 "harbinger transposase derived 1"
species:10090 "Mus musculus" [GO:0003674 "molecular_function"
evidence=ND] [GO:0004518 "nuclease activity" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0005737 "cytoplasm"
evidence=IEA] [GO:0008150 "biological_process" evidence=ND]
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0046872 "metal
ion binding" evidence=IEA] MGI:MGI:2443194 GO:GO:0005634
GO:GO:0005737 GO:GO:0005813 GO:GO:0046872 GO:GO:0090305
EMBL:AL714023 GO:GO:0004518 CTD:283254 eggNOG:NOG137666
GeneTree:ENSGT00530000063045 HOGENOM:HOG000231449
HOVERGEN:HBG054543 OMA:GDSSFFL OrthoDB:EOG479F79 InterPro:IPR026103
InterPro:IPR026244 PANTHER:PTHR22930 PRINTS:PR02086 EMBL:AK041747
EMBL:AK045343 EMBL:AK080671 EMBL:AK084226 EMBL:AK147045
EMBL:BC094315 IPI:IPI00453562 IPI:IPI00473454 IPI:IPI00816924
RefSeq:NP_848839.2 UniGene:Mm.130331 STRING:Q8BR93 PRIDE:Q8BR93
Ensembl:ENSMUST00000090608 Ensembl:ENSMUST00000111322
Ensembl:ENSMUST00000142692 GeneID:241547 KEGG:mmu:241547
UCSC:uc008kwo.1 InParanoid:Q8BR93 ChiTaRS:HARBI1 NextBio:385049
Bgee:Q8BR93 Genevestigator:Q8BR93 GermOnline:ENSMUSG00000027243
Uniprot:Q8BR93
Length = 349
Score = 250 (93.1 bits), Expect = 1.0e-19, P = 1.0e-19
Identities = 72/261 (27%), Positives = 121/261 (46%)
Query: 214 ELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSA 273
EL + + T AI ++ + +G + G+ ++ + V V A
Sbjct: 51 ELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEA 110
Query: 274 IKTVLMPKFLQWP-DELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKR 332
+ +F+ +P DE ++ +K+EF G++G+P V G H+ I AP SY N+
Sbjct: 111 L-VERASQFIHFPVDEAAVQSLKDEFYGLAGMPGVIGVADCIHVAIKAPNAEDLSYVNR- 168
Query: 333 HTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALFQRADRGLLKDVWI 392
K +S+ V D +G V WPGS+ D VL+RS+L + + G+ KD W+
Sbjct: 169 ------KGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQRSSLTSQFETGMPKDSWL 222
Query: 393 VGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVK 452
+G+S + L W++ P T ++ +N +V + L R+ CL ++
Sbjct: 223 LGDSSFFLRSWLLTPLPIPE-TAAEYRYNRAHSATHSVIERTLQTLCCRFRCLDG-SKGA 280
Query: 453 LQDLPV----VLGACCVLHNI 469
LQ P ++ ACCVLHNI
Sbjct: 281 LQYSPEKCSHIILACCVLHNI 301
>RGD|1584007
symbol:Harbi1 "harbinger transposase derived 1" species:10116
"Rattus norvegicus" [GO:0004518 "nuclease activity" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0005737 "cytoplasm"
evidence=IEA;ISO] [GO:0005813 "centrosome" evidence=IEA;ISO]
[GO:0046872 "metal ion binding" evidence=IEA] RGD:1584007
GO:GO:0005634 GO:GO:0005737 GO:GO:0005813 GO:GO:0046872
GO:GO:0090305 GO:GO:0004518 CTD:283254 eggNOG:NOG137666
GeneTree:ENSGT00530000063045 HOGENOM:HOG000231449
HOVERGEN:HBG054543 OrthoDB:EOG479F79 InterPro:IPR026103
InterPro:IPR026244 PANTHER:PTHR22930 PRINTS:PR02086 EMBL:BC158734
IPI:IPI00394536 RefSeq:NP_001107265.2 UniGene:Rn.198635
Ensembl:ENSRNOT00000065462 GeneID:690164 KEGG:rno:690164
UCSC:RGD:1584007 NextBio:740317 ArrayExpress:B0BN95
Genevestigator:B0BN95 Uniprot:B0BN95
Length = 349
Score = 248 (92.4 bits), Expect = 2.0e-19, P = 2.0e-19
Identities = 71/261 (27%), Positives = 123/261 (47%)
Query: 214 ELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSA 273
EL + + T AI ++ + +G + G+ ++ + V V A
Sbjct: 51 ELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEA 110
Query: 274 IKTVLMPKFLQWP-DELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKR 332
+ +F+ +P DE ++ +K+EF G++G+P V G++ H+ I AP SY N+
Sbjct: 111 L-VERASQFIHFPADEAAIQSLKDEFYGLAGMPGVIGAVDCIHVAIKAPNAEDLSYVNR- 168
Query: 333 HTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALFQRADRGLLKDVWI 392
K +S+ V D +G V WPGS+ D VL++S+L + + G+ KD W+
Sbjct: 169 ------KGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQQSSLSSQFETGMPKDSWL 222
Query: 393 VGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVK 452
+G+S + L W++ P T ++ +N +V + L R+ CL ++
Sbjct: 223 LGDSSFFLHTWLLTPLHIPE-TPAEYRYNRAHSATHSVIEKTLRTLCCRFRCLDG-SKGA 280
Query: 453 LQDLPV----VLGACCVLHNI 469
LQ P ++ ACCVLHNI
Sbjct: 281 LQYSPEKSSHIILACCVLHNI 301
>ZFIN|ZDB-GENE-081022-77
symbol:zgc:194221 "zgc:194221" species:7955 "Danio
rerio" [GO:0005575 "cellular_component" evidence=ND] [GO:0003674
"molecular_function" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] ZFIN:ZDB-GENE-081022-77 GeneTree:ENSGT00530000063045
InterPro:IPR026103 PANTHER:PTHR22930 EMBL:BX324210 EMBL:BC162733
EMBL:BC162738 IPI:IPI00774426 RefSeq:NP_001129460.1
UniGene:Dr.134637 Ensembl:ENSDART00000082245 GeneID:100191015
KEGG:dre:100191015 eggNOG:NOG248361 HOGENOM:HOG000007556
HOVERGEN:HBG079725 OMA:DGRFQRY OrthoDB:EOG42JNTD NextBio:20795590
Uniprot:B3DHE2
Length = 394
Score = 253 (94.1 bits), Expect = 2.4e-19, P = 2.4e-19
Identities = 78/289 (26%), Positives = 136/289 (47%)
Query: 197 FWRDFRMSKATFEMICEELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRF 256
F R FR+ + F+ + ++ + +++T R +I +R+A+C+ LATG+ R ++ +
Sbjct: 53 FQRYFRLDREQFDSLLSKVGPQIARQDTNYRQSIEPAERLAICLRFLATGDSYRTIAFSY 112
Query: 257 GLGISTCHKLVLEVCSAIKTVLMPKFLQWPDELKMKQIKEEFQGISGIPNVGGSMYTTHI 316
+G+ST +V V AI L + + P + I +F PN GS+ H+
Sbjct: 113 RVGVSTVAGIVAAVTRAIWDTLAQEVMPVPTTEDWRNISTDFLHRWNFPNCLGSIDGKHV 172
Query: 317 PIIAPKISVASYFNKRHTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERS 376
I AP S + ++N + T YS+ + VVD++ F V +G G M D VL S
Sbjct: 173 VIKAPDNSGSLFYNYKGT-------YSVVLLAVVDSQYRFRVVDVGSYGRMSDGGVLANS 225
Query: 377 ALFQRADR----GLLKDVWIVG-------------NSGYPLMDWVMVPYTQKNLTWTQHA 419
+F +A R GL +D + G + +PL +M P+ NL+ Q
Sbjct: 226 -IFGQALRDGALGLPQDALLSGAEHFGPQPHVFVADEAFPLRRDLMRPFPGHNLSGRQRI 284
Query: 420 FNEKIGDIQAVAKDAFARLKGRWACLQKRTEVKLQDLPVVLGACCVLHN 468
FN ++ + + ++ F L +W + E+ ++ + A CVLHN
Sbjct: 285 FNYRLSRARLIVENTFGILTAQWRMYRGAIEISPANVDACVKATCVLHN 333
>TAIR|locus:2094088
symbol:AT3G19120 species:3702 "Arabidopsis thaliana"
[GO:0005634 "nucleus" evidence=ISM] [GO:0016788 "hydrolase
activity, acting on ester bonds" evidence=IEA] [GO:0009220
"pyrimidine ribonucleotide biosynthetic process" evidence=RCA]
EMBL:CP002686 EMBL:AP000419 InterPro:IPR026103 PANTHER:PTHR22930
EMBL:AY070731 EMBL:AY149933 IPI:IPI00533950 RefSeq:NP_566626.1
UniGene:At.28342 IntAct:Q9LJL8 PRIDE:Q9LJL8
EnsemblPlants:AT3G19120.1 GeneID:821446 KEGG:ath:AT3G19120
TAIR:At3g19120 HOGENOM:HOG000090855 InParanoid:Q9LJL8 OMA:YLISKIT
PhylomeDB:Q9LJL8 ProtClustDB:CLSN2688554 Genevestigator:Q9LJL8
Uniprot:Q9LJL8
Length = 446
Score = 253 (94.1 bits), Expect = 4.9e-19, P = 4.9e-19
Identities = 81/302 (26%), Positives = 145/302 (48%)
Query: 198 WRD-FRMSKATFEMICEELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRF 256
WR + +S F + ++L+ + N L P VA+ + RLA G + ++ R+
Sbjct: 117 WRSLYGLSYPVFITVVDKLKPFITASNLSL----PADYAVAMVLSRLAHGCSAKTLASRY 172
Query: 257 GLGISTCHKLVLEVCSAIKTVLMPKFLQWP-DELKMKQIKEEFQGISGIPNVGGSMYTTH 315
L K+ V + T L P+F++ P + ++ + + F+ ++ +PN+ G++ +T
Sbjct: 173 SLDPYLISKITNMVTRLLATKLYPEFIKIPVGKRRLIETTQGFEELTSLPNICGAIDST- 231
Query: 316 IPIIAPKISVASYFNKRHTERNQKTSY-SITVQGVVDTKGVFTDVCIGWPGSMPDDQVLE 374
P+ K+ + N R+ K Y ++ +Q V D K +F DVC+ PG D
Sbjct: 232 -PV---KLRRRTKLNPRNIY-GCKYGYDAVLLQVVADHKKIFWDVCVKAPGGEDDSSHFR 286
Query: 375 RSALFQRADRGLLKDVW--------------IVGNSGYPLMDWVMVPYTQKNL-TWTQHA 419
S L++R G + VW IVG+ YPL+ ++M P++ T ++
Sbjct: 287 DSLLYKRLTSGDI--VWEKVINIRGHHVRPYIVGDWCYPLLSFLMTPFSPNGSGTPPENL 344
Query: 420 FNEKIGDIQAVAKDAFARLKGRWACLQKRTEVKLQDLPVVLGACCVLHNICEMRNEVMDP 479
F+ + ++V +A LK RW LQ V + P + ACCVLHN+C++ E +P
Sbjct: 345 FDGMLMKGRSVVVEAIGLLKARWKILQS-LNVGVNHAPQTIVACCVLHNLCQIAREP-EP 402
Query: 480 QL 481
++
Sbjct: 403 EI 404
>ZFIN|ZDB-GENE-040608-1
symbol:harbi1 "harbinger transposase derived 1"
species:7955 "Danio rerio" [GO:0004518 "nuclease activity"
evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
ZFIN:ZDB-GENE-040608-1 GO:GO:0005634 GO:GO:0005737 GO:GO:0046872
GO:GO:0090305 GO:GO:0004518 CTD:283254 eggNOG:NOG137666
GeneTree:ENSGT00530000063045 HOGENOM:HOG000231449
HOVERGEN:HBG054543 OMA:GDSSFFL OrthoDB:EOG479F79 InterPro:IPR026103
InterPro:IPR026244 PANTHER:PTHR22930 PRINTS:PR02086 EMBL:BC078390
EMBL:BC100116 IPI:IPI00482479 RefSeq:NP_001003734.1
UniGene:Dr.85217 STRING:Q6AZB8 Ensembl:ENSDART00000052323
Ensembl:ENSDART00000129462 GeneID:445279 KEGG:dre:445279
InParanoid:Q6AZB8 NextBio:20832025 Bgee:Q6AZB8 Uniprot:Q6AZB8
Length = 349
Score = 238 (88.8 bits), Expect = 5.5e-18, P = 5.5e-18
Identities = 76/267 (28%), Positives = 124/267 (46%)
Query: 211 ICEELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEV 270
+ E L+ +++++ R P Q +A + +G + G+ ++ + V V
Sbjct: 49 LVELLKDSLLRRTQRSRAISPDVQILAALGF-YTSGSFQSKMGDAIGISQASMSRCVSNV 107
Query: 271 CSAIKTVLMPKFLQWP-DELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYF 329
A+ P+F+ + DE +Q K+EF I+GIPNV G + HI I AP +SY
Sbjct: 108 TKAL-IEKAPEFIGFTRDEATKQQFKDEFYRIAGIPNVTGVVDCAHIAIKAPNADDSSYV 166
Query: 330 NKRHTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERS---ALFQRADRGL 386
NK K +SI Q V D +G+ WPGS+ D V ++S LF+ +
Sbjct: 167 NK-------KGFHSINCQLVCDARGLLLSAETHWPGSLTDRAVFKQSNVAKLFEEQEND- 218
Query: 387 LKDVWIVGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQ 446
+ W++G++ YPL W+M P Q + + +N + F ++ R+ CL
Sbjct: 219 -DEGWLLGDNRYPLKKWLMTP-VQSPESPADYRYNLAHTTTHEIVDRTFRAIQTRFRCLD 276
Query: 447 KRTEVKLQDLPV----VLGACCVLHNI 469
+ LQ P ++ ACCVLHNI
Sbjct: 277 G-AKGYLQYSPEKCSHIIQACCVLHNI 302
>FB|FBgn0052095
symbol:CG32095 species:7227 "Drosophila melanogaster"
[GO:0008150 "biological_process" evidence=ND] [GO:0003674
"molecular_function" evidence=ND] [GO:0005575 "cellular_component"
evidence=ND] EMBL:AE014296 InterPro:IPR026103 PANTHER:PTHR22930
EMBL:AY058280 RefSeq:NP_729755.1 UniGene:Dm.20863
EnsemblMetazoa:FBtr0076077 GeneID:317849 KEGG:dme:Dmel_CG32095
UCSC:CG32095-RA FlyBase:FBgn0052095 eggNOG:NOG243843
InParanoid:Q95U65 OMA:SEPHMLE OrthoDB:EOG4R229X GenomeRNAi:317849
NextBio:843946 Uniprot:Q95U65
Length = 429
Score = 199 (75.1 bits), Expect = 6.1e-13, P = 6.1e-13
Identities = 78/335 (23%), Positives = 145/335 (43%)
Query: 190 PDFPEEEFWRDFRMSKATFEMICEELESTVMKKN--TMLRDAIPVRQRVAVCVWRLATGE 247
P+ EE+F +++ TFE +C++L T+ + T AI + VA+ + LA+GE
Sbjct: 100 PELSEEDFLNTLHVTRGTFETLCKQLSPTLRTSDELTQREPAISTEKCVALALNFLASGE 159
Query: 248 PLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDE-LKMKQIKEEFQGISGIPN 306
L ++++RF L K + C+A+ + L Q P + + + FQ S +P
Sbjct: 160 RLSLIAERFSLPRPRTIKCLKVFCNAVMSTLGRALRQLPQNPVDCNSVAKGFQRESNMPA 219
Query: 307 -VGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITVQ-GVVDTKGVFTDVCIGWP 364
+ G + IPI + + S + ++ + + G+ T G
Sbjct: 220 ALVGVLGVCSIPIRSTGEAKNSILRMEYLLDDRMLFRELQLGCGLRATLGPMFSHAPNTL 279
Query: 365 GSMPDDQVLERSALFQRADRGLLKDVWIVGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKI 424
++P+ ++ R +L V+ YPL W++ YT +H FNE
Sbjct: 280 TAIPEFRINSRLV-----PAFVLAPVY----QNYPLRPWLLQRYTDPTAPH-EHDFNEVA 329
Query: 425 GDIQAVAKDAFARLKGRWACLQKRTEVKLQDLPVVLGACCVLHNICEMRNE--VMDPQLK 482
+Q ++ A RL RW+ L + ++ ++ A VLHN+ E +E +++
Sbjct: 330 EHLQELSDCALHRLMSRWSFLSQPLDISFHTASCIITAAAVLHNLLEELSEPHMLEWGNS 389
Query: 483 FDL--FDDEMIPDN---SVRSMASAQARDHIAHNL 512
D+ F E + D+ S A+ + RD +A +
Sbjct: 390 VDVSKFRAEPLSDSVSEDAESHAALEVRDFLARTI 424
>ZFIN|ZDB-GENE-060810-147 [details] [associations]
symbol:si:dkey-197c15.6 "si:dkey-197c15.6"
species:7955 "Danio rerio" [GO:0008150 "biological_process"
evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
[GO:0005575 "cellular_component" evidence=ND]
ZFIN:ZDB-GENE-060810-147 GeneTree:ENSGT00530000063045
InterPro:IPR026103 InterPro:IPR026244 PANTHER:PTHR22930
PRINTS:PR02086 EMBL:CR376854 IPI:IPI00901436 RefSeq:XP_697483.2
UniGene:Dr.133515 Ensembl:ENSDART00000112777 GeneID:569030
KEGG:dre:569030 NextBio:20889464 Bgee:E7FFX8 Uniprot:E7FFX8
Length = 395
Score = 160 (61.4 bits), Expect = 1.2e-08, P = 1.2e-08
Identities = 63/247 (25%), Positives = 108/247 (43%)
Query: 271 CSAIKTV------LMPKFLQWPDELKMKQ-IKEEFQGISGIPNVGGSMYTTHIPIIAPKI 323
C A+K L P+F+ +P+ + + F+ +SGIP+V G + HI + P +
Sbjct: 129 CEAVKATTKLLSDLTPEFITFPNSYNDRMGAAQAFKNLSGIPHVVGVLGYLHIRVRPPVL 188
Query: 324 SVASYFNKRHTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALFQRAD 383
Y N +SI VQ + D G V PG P+ V E S + ++
Sbjct: 189 EERMYVNTLGY-------HSIMVQVIFDADGNLFSVEQCCPGGTPEHSVWENSDIGRQFS 241
Query: 384 RGLLKDVWIVGNSGYPLMDWVMVPY-TQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRW 442
WI+G+ V+ P T + + FN+ + ++ F LK R+
Sbjct: 242 IFQHGHTWIIGSPSLLGCGHVLTPVETIRIKSNAAVQFNKAHALLYNSSQHVFGSLKSRF 301
Query: 443 ACLQKRTEVKL-QDLPVVLGACCVLHNICEMRNEVMDPQLKFDLFDDEMIPDNSVRSMAS 501
CLQ ++ + + ++ ACCVLHNI + + V P F L + + P + V ++ +
Sbjct: 302 QCLQDFGSIQSPESVACMIRACCVLHNISK-KFSVPLPA-DFSL--EPLHPPSEVLNIMA 357
Query: 502 AQARDHI 508
Q D++
Sbjct: 358 EQQFDYM 364
>TAIR|locus:2207051 [details] [associations]
symbol:AT1G72270 species:3702 "Arabidopsis thaliana"
[GO:0005634 "nucleus" evidence=ISM] [GO:0005739 "mitochondrion"
evidence=IDA] [GO:0007059 "chromosome segregation" evidence=RCA]
[GO:0007062 "sister chromatid cohesion" evidence=RCA] [GO:0007129
"synapsis" evidence=RCA] [GO:0007131 "reciprocal meiotic
recombination" evidence=RCA] [GO:0010332 "response to gamma
radiation" evidence=RCA] [GO:0032204 "regulation of telomere
maintenance" evidence=RCA] [GO:0032504 "multicellular organism
reproduction" evidence=RCA] [GO:0042138 "meiotic DNA double-strand
break formation" evidence=RCA] [GO:0043247 "telomere maintenance in
response to DNA damage" evidence=RCA] [GO:0045132 "meiotic
chromosome segregation" evidence=RCA] EMBL:CP002684 GO:GO:0005739
KO:K14861 UniGene:At.21413 InterPro:IPR021714 Pfam:PF11707
IPI:IPI00524456 RefSeq:NP_565039.4 PRIDE:F4IBR2
EnsemblPlants:AT1G72270.1 GeneID:843559 KEGG:ath:AT1G72270
OMA:ESSPEMG ArrayExpress:F4IBR2 Uniprot:F4IBR2
Length = 2845
Score = 181 (68.8 bits), Expect = 5.7e-08, Sum P(3) = 5.7e-08
Identities = 43/142 (30%), Positives = 77/142 (54%)
Query: 343 SITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALFQRADR-----------GLLKDVW 391
SI VQ +VD+ G F D+ GWP +M + + ++ LF A+ G+L +
Sbjct: 206 SILVQALVDSNGRFVDISAGWPSTMKPEAIFRQTKLFSIAEEVLSGAPTKLGNGVLVPRY 265
Query: 392 IVGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVA----KDAFARLKGRWACLQK 447
I+G+S PL+ W++ PY +LT + +F E+ ++ + AFA+++ RW L K
Sbjct: 266 ILGDSCLPLLPWLVTPY---DLTSDEESFREEFNNVVHTGLHSVEIAFAKVRARWRILDK 322
Query: 448 RTEVK-LQDLPVVLGACCVLHN 468
+ + + ++ +P V+ C+LHN
Sbjct: 323 KWKPETIEFMPFVITTGCLLHN 344
Score = 44 (20.5 bits), Expect = 5.7e-08, Sum P(3) = 5.7e-08
Identities = 22/79 (27%), Positives = 31/79 (39%)
Query: 201 FRMSKATFEMICEELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGL-G 259
FRMSK+TF + L + ++P A ++RLA G + RFG
Sbjct: 101 FRMSKSTFFSLYSILSHS----------SLP---SFAATIFRLAHGASYECLVHRFGFDS 147
Query: 260 ISTCHKLVLEVCSAIKTVL 278
S + VC I L
Sbjct: 148 TSQASRSFFTVCKLINEKL 166
Score = 37 (18.1 bits), Expect = 5.7e-08, Sum P(3) = 5.7e-08
Identities = 6/11 (54%), Positives = 8/11 (72%)
Query: 513 LHHGLAGTSFL 523
LHHG G S++
Sbjct: 509 LHHGKQGLSYI 519
>TAIR|locus:504956234 [details] [associations]
symbol:AT1G43722 "AT1G43722" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
"nucleus" evidence=ISM] [GO:0008150 "biological_process"
evidence=ND] EMBL:CP002684 InterPro:IPR026103 PANTHER:PTHR22930
IPI:IPI00546258 RefSeq:NP_683376.1 UniGene:At.52016
EnsemblPlants:AT1G43722.1 GeneID:840961 KEGG:ath:AT1G43722
OMA:LNIMAIC Uniprot:F4ICS6
Length = 324
Score = 123 (48.4 bits), Expect = 0.00010, P = 0.00010
Identities = 57/226 (25%), Positives = 99/226 (43%)
Query: 202 RMSKATFEMICEELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGIS 261
RMS F +C L++ + T+ I + + VA+ + E R V RFG
Sbjct: 71 RMSLPCFTTLCNMLQTNYDLQPTL---NISIEESVAMFLRICGHNEVYRDVGLRFGRNQE 127
Query: 262 TCHKLVLEVCSAIKTVLMPKFLQWPDELKMKQIKEEFQGISGI-PNVGG---SMYTTHIP 317
T + EV +A + +L +++ P ++ +I E Q P G +M TH+
Sbjct: 128 TVQRKFREVLTATE-LLACDYIRTPTRQELYRIPERLQVDQRYWPYFSGFVGAMDGTHVC 186
Query: 318 I-IAPKISVASYFNKRHTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERS 376
+ + P + Y+N RH S+ + + D K +FT + G PGS D VL+
Sbjct: 187 VKVKPDLQ-GMYWN-RHDNA------SLNIMAICDLKMLFTYIWNGAPGSCYDTAVLQ-- 236
Query: 377 ALFQRADRGLL---KDVWIVGNSGYPLMDWVMVPY-TQKNLTWTQH 418
+ Q++D + + + +SGYP ++ PY + +N H
Sbjct: 237 -IAQQSDSEFPLPPSEKYYLVDSGYPNKQGLLAPYRSSRNRVVRYH 281
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query                      ----- As Used -----    ----- Computed ----
Frame  MatID  Matrix name  Lambda       K      H  Lambda     K     H
   +0      0  BLOSUM62      0.321   0.135  0.416    same  same  same
              Q=9,R=2       0.244  0.0300  0.180     n/a   n/a   n/a
Query
Frame  MatID  Length  Eff.Length        E    S  W   T   X    E2  S2
   +0      0     524         481  0.00080  119  3  11  22  0.37  34
                                                       35  0.45  37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 19
No. of states in DFA: 631 (67 KB)
Total size of DFA: 325 KB (2164 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 38.58u 0.13s 38.71t Elapsed: 00:00:02
Total cpu time: 38.58u 0.13s 38.71t Elapsed: 00:00:02
Start: Fri May 10 17:36:08 2013 End: Fri May 10 17:36:10 2013