BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 000733
(1324 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297746515|emb|CBI16571.3| unnamed protein product [Vitis vinifera]
Length = 1685
Score = 1192 bits (3084), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 694/1327 (52%), Positives = 875/1327 (65%), Gaps = 91/1327 (6%)
Query: 1 MATP-TRIEIE-EEIEGGRIESNDFFFDEIGESIPIIRDDDSQFDLQNP-PSLPLAVSQL 57
MATP + I++E +++EGGR++ +D+ F +IGES+ I+ F+L +P PS PLAVS+
Sbjct: 1 MATPESVIDLEGKDLEGGRLDCDDYVFVKIGESV-TIKPQYYNFNLDSPLPSQPLAVSER 59
Query: 58 SHQLIFAAHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSND 117
S QLIF AHS GFCVART VI+ AKE+KE G+ +QELSVVDVP N V IL+LS D
Sbjct: 60 S-QLIFVAHSDGFCVARTEAVIELAKEIKEKGSGSSIQELSVVDVPIAN--VRILALSTD 116
Query: 118 ESTLAVSLYQSPHIHFFSVHSLLNKEIKPSFSCSLNGSTYVKDIRWRKKPENSFLALSNV 177
STLA S+ IHFFSV SLLNK +PSF+ SL+GS+ VKD+RWRKK +NS++ LS+
Sbjct: 117 SSTLAASV--GGDIHFFSVDSLLNKGQEPSFTRSLSGSSSVKDMRWRKKMDNSYVVLSSD 174
Query: 178 GRLYRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSFKSWV 237
G+LY G GPL+DVMD VDAVEWSVKG +AVA+KN LS+L+SK KERL +SLSFKSW+
Sbjct: 175 GKLYHGAAEGPLKDVMDGVDAVEWSVKGNSIAVARKNTLSLLSSKFKERLCMSLSFKSWI 234
Query: 238 GDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVVL 297
GD VNC+VKVDSIRWVR DCII+GCF+LT DGKEE+++VQV+ SK+GKITDASS P VL
Sbjct: 235 GDSGVNCTVKVDSIRWVRPDCIILGCFKLTADGKEESFMVQVVTSKEGKITDASSLPTVL 294
Query: 298 SFSDVFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIA 357
SF DVF ++DDI+P G+GPYLFL+Y+E+ LAI+ +RKN+D+H+VL WSLDD+KN+ A
Sbjct: 295 SFYDVFPGLVDDIVPFGSGPYLFLSYLEQCQLAITTSRKNVDNHIVLFGWSLDDKKNEAA 354
Query: 358 VVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEE-KELSPYFILMCLT 416
++DI RDK+ PRI+LQ+N +DNLI+GLC DKVSL GKV++QLG EE +ELSPY +L CLT
Sbjct: 355 ILDIGRDKYRPRIELQENDDDNLILGLCCDKVSLYGKVEIQLGAEEPRELSPYCVLFCLT 414
Query: 417 LDGKLNMFHVASVSGPSCSPDVDFT-LSDAEDDVPAVVPVDSNLPRVYSGSGLQKIEPVA 475
L+GKL MF VASV+G + +P D + L+ ED+ PA PV+ +
Sbjct: 415 LEGKLVMFQVASVTG-TPAPTQDLSPLTGDEDETPAEAPVEHD----------------- 456
Query: 476 SSFKSQGVKLKELDTDDTCGVTAKSNLKGFDKYESSTSISIS------------------ 517
+S+ KE+ + K++L F + +S S I+
Sbjct: 457 ---QSREANTKEISIKQEGEILIKNDLNTFQENKSLISACIADQILHKETIAADHEAKSL 513
Query: 518 -NSQELENKDRQQI------------QNSL--QKSTNLVQSPPKASLPEVTSFGVRDSSK 562
NSQ E +Q++ Q+ L Q+STNL S K S E V D K
Sbjct: 514 VNSQTFEADGQQRVSTIKLYQEVDGKQSGLPRQQSTNLEGSSLKTSPLEGLGNVVGDVKK 573
Query: 563 TGTQDTGGFGLGSTGFVGKFPTDTPSLSSHKDLLKSLEFGKEAQGNFGSAGLQSLSSQSQ 622
T Q + + S S + +S E KE G GS LQ+ +SQS
Sbjct: 574 TDIQK-----------ITGVGSGLGSSQSSHNFSRSFETHKELPGKIGSTNLQN-ASQSW 621
Query: 623 SCGNFISSEDSRVKLPVLPSSHSHEKTYENS--SLGAPNVSGSFVGKPLSSKDATGSLTP 680
S G F + + KL L SS E + +L P V G VG P+ KDA SL
Sbjct: 622 SGGKFTFPKSTEEKL-SLSSSFVESGRSETAGINLSIPQVPGGPVGSPIYPKDAATSLAA 680
Query: 681 VFSAKPVHGDGDRASTGAGKIESLPSVRSSQFSLPQNFASGKSHNQKLYPSKDDYKTATL 740
+ G R S AG +E + S SQ S+ +NF + KS N K YP K++Y+T L
Sbjct: 681 GNFGRISQSRGQRGSMVAGNVEPISSTLGSQLSMQENFPA-KSPNYKSYPPKENYRTPPL 739
Query: 741 SGLPNSEPNLSKQSGNIKEMTKELDMLLQSIEETGGFRDACTVFQRQKVEELEEGIGSLS 800
G NSEPNLSKQ GN+KEM KELD LLQ IE GGFRDACT+FQ+ V ELE+GIG+LS
Sbjct: 740 QGQLNSEPNLSKQFGNVKEMAKELDTLLQCIEGPGGFRDACTIFQKSSVMELEQGIGTLS 799
Query: 801 EKCGMWRSIMDERLQEIQNLFDKTVQVLARKIYTEGIVKQASDSRYWDLWNRQKLSPELE 860
E C MWRSI+D++ EI +L DKTVQVLARK+Y +GIVKQA+DSRYWDLW+RQKL+ ELE
Sbjct: 800 ENCRMWRSIIDQQHGEIHHLLDKTVQVLARKVYMQGIVKQATDSRYWDLWSRQKLASELE 859
Query: 861 LKRGHILSINQDLINQLIELERHFNSLELNKFGENDGVHGCQRRPQSRFGTSRHIQSLHS 920
LKR +IL INQDL NQLIELERHFN++ELNKFGEN G H +R QSR G SRH+QSLHS
Sbjct: 860 LKRRNILKINQDLTNQLIELERHFNAIELNKFGENGGNHVGRRALQSRSGPSRHMQSLHS 919
Query: 921 LHTTMSSQIAAAEQLSDRLSKQMALLSIESPVKQQNVKKELFETLGIPYD-ASFSSPDVT 979
LH TM+SQ+AAAEQLS+ LSKQM +LSI+S VK+QNVKKELFE +GIPYD AS SSP ++
Sbjct: 920 LHNTMNSQLAAAEQLSECLSKQMTMLSIDSTVKKQNVKKELFEAIGIPYDSASVSSPTIS 979
Query: 980 KVMDQSSIKKLIHSSGSAAANDQSRRRQSSAMKSYDPETARRRRDSLDRSWASFEPPKTT 1039
D S+K + SS S+A DQSRR Q SA+KSY+PET RRRRDSL +SWA+FEPPKT
Sbjct: 980 NTSDTPSMKNFLVSS-SSANKDQSRRNQLSALKSYEPETVRRRRDSLGQSWANFEPPKTI 1038
Query: 1040 VKRMLLQDHQKCSQVKSSLK-DKQQISPHMLEGAAIVRPKDRTTPSTSWNPLRIKGLQDA 1098
VKRM+L++ QK K S DK+Q SPH LEG+A + TTPS P R KG+ +
Sbjct: 1039 VKRMVLEEQQKVGVNKPSFSVDKRQFSPHKLEGSAFAYSNNNTTPSAFLYPSRNKGIHEI 1098
Query: 1099 SLKQTSESQST-LFKWAGDPTGPSQMTGLKSP---VFQSNIASTRSSLSATQLSPMGWQN 1154
S KQ S S ST LF+WA DP+GPSQ G +SP N S SSLSA Q SP+ Q+
Sbjct: 1099 SAKQASYSPSTSLFRWANDPSGPSQSFGSRSPTPHALPGNNLSAFSSLSAPQSSPVVDQS 1158
Query: 1155 HARNTGDVTAEKLSSGVYFDVKSNSTLTNETKSTMQSESNIFQKPTISTMSPTLTPSLLK 1214
+A T ++T E+ SSGV F KS++ NETKST+ SES++ Q P IST P T L K
Sbjct: 1159 NAMETCNLTNERSSSGVTFVEKSDAVSINETKSTLLSESHLPQTPIISTSLPARTLPLTK 1218
Query: 1215 NPSEMPISNGKGTVPESSTIGSEKHGAATTKTLFVESGKNRDAQVSTPAAVPTSSAFPGI 1274
P+EM SNGKGTV TIGS K + + F +SG + + +S AV + + PG
Sbjct: 1219 KPNEMSNSNGKGTVLAKPTIGSVKQKPVSPGSSFSQSGVSPFSPIS---AVQPAPSLPGK 1275
Query: 1275 VPIFDAA 1281
V D A
Sbjct: 1276 VFQLDIA 1282
>gi|255564792|ref|XP_002523390.1| nuclear pore complex protein nup153, putative [Ricinus communis]
gi|223537340|gb|EEF38969.1| nuclear pore complex protein nup153, putative [Ricinus communis]
Length = 1824
Score = 1126 bits (2912), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 668/1334 (50%), Positives = 863/1334 (64%), Gaps = 73/1334 (5%)
Query: 6 RIEIEEEIEGGRIESNDFFFDEIGESIPIIRDDDSQ--FDLQNPPSLPLAVSQLSHQLIF 63
+IEIE+++EG R++S D+FF+ IG+ IPI+ D FDLQ+PPS PLAVS H LIF
Sbjct: 14 KIEIEQDVEGDRVDSGDYFFNRIGKPIPILNDQSESPLFDLQSPPSSPLAVSP-QHGLIF 72
Query: 64 AAHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAV 123
HSSGF VART DV+DAA+E+K GT C+Q+LS+ DVP + ILSLS D STL V
Sbjct: 73 LVHSSGFYVARTKDVMDAAEEIK--GTSPCIQDLSIADVPIGKPH--ILSLSFDSSTLVV 128
Query: 124 SLYQSPHIHFFSVHSLLNKEIKPSFSCSLNGST-YVKDIRWRKKPENSFLALSNVGRLYR 182
++ + H+HFF V SLLNKE+KP FSCSL+ T +VKD +WR++ ++S+L LSN G LY
Sbjct: 129 TV--AAHLHFFLVDSLLNKEVKPFFSCSLSEPTSFVKDFKWRRRSDSSYLLLSNHGNLYH 186
Query: 183 GGV-NGPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSFKSWVGDCD 241
V + PL+ VMD+VDAVEWS+KG ++AVAK ++L IL+S KERL +SL FKSW+ D D
Sbjct: 187 AAVLDSPLKLVMDDVDAVEWSLKGTYLAVAKAHILHILSSNFKERLRLSLPFKSWIADSD 246
Query: 242 VNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDAS--------SE 293
+CSVKVDSIRWVR D I++GCFQ T DGKEENYLVQVI+SKDGKITD S +
Sbjct: 247 DSCSVKVDSIRWVRPDSIVVGCFQQTADGKEENYLVQVIRSKDGKITDVSVSFVALDLKK 306
Query: 294 PVVLSFSDVFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
P VLS+ D+FS +IDDILP G GPYL L+Y+ GLAI+ANRKN D HV+LL WS++D
Sbjct: 307 PSVLSYYDLFSGLIDDILPYGNGPYLLLSYLHECGLAITANRKNTDQHVLLLCWSVEDGM 366
Query: 354 NDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSPYFILM 413
++ A+VDI+RD WIPRI+LQ NG+DNLIMG +D VS+ KV+V++G+E++ELSP+ +L
Sbjct: 367 SETAIVDIDRDTWIPRIELQGNGDDNLIMGFSVDNVSIYAKVEVEVGLEQRELSPFCVLF 426
Query: 414 CLTLDGKLNMFHVASVSGPSCSPDVDFTLSDAEDDVPAVVPVDSNLPRVYSGSGLQKI-- 471
C+TL+GKL MF+VAS +G + PD D L D E+D +LP +G I
Sbjct: 427 CVTLEGKLVMFYVASAAGTTLPPDADSALDDEEED---------SLPESLAGRVQSNILS 477
Query: 472 --EPVASSFKSQGVKLKELDTDDTCGVTAKSNLKGFDKYESSTSISISNSQELENKDRQQ 529
E VA + V +E D ++ +L D S T++ E K +
Sbjct: 478 GPEQVALGLQVNDVSKREPDVSKGSELSTNKDLPSGDTRSSMTALIT------EQKPHKG 531
Query: 530 IQNSLQKSTNLVQSPPKASLPEVTSFGVRDSSKTGT--QDT---GGFGLGSTGFVGKFPT 584
+ Q++ +++ S P F +D + T QD F G+ F+ K P
Sbjct: 532 ATSEAQEAESVLNSKPSV-------FDCQDKASTTKMYQDNKIFSEFRPGTASFLEKAPP 584
Query: 585 DTPSLSSHKDLLKSLEFGKEAQGNFGSAGLQSLSSQSQSCGNFISSEDSRVKLPVLPSS- 643
PS + K L KS K+++ FGS GL SQ S I S S K L S+
Sbjct: 585 -VPSQVNGKGLQKSANLPKDSRVIFGSPGLHGAPSQPWSSEKVICSGGSDSKTSALTSTL 643
Query: 644 -HSHEKTYENSSLGAPNVSGSFVGKPLSSKDATGSLTPV--FSAKPVHGDGDRASTGAGK 700
H+ S+ A NV + GKP K G TP FS +P H +A TG
Sbjct: 644 IQGHKSDNTGLSVDAANVPLNLAGKPFHLKGNIGD-TPSANFSVRPTHTVVQKAKTGM-- 700
Query: 701 IESLPSVRSSQFSLPQNFASGKSHNQKLYPSKDDYKTATLSGLPNSEPNLSKQSGNIKEM 760
I+ LPS+RSSQ ++ A G+S N+ Y SKD +K +LS SEP LS+Q GNIKEM
Sbjct: 701 IDLLPSIRSSQLPSQESLALGRSGNRWPYSSKDAHKAPSLS---KSEPYLSRQFGNIKEM 757
Query: 761 TKELDMLLQSIEETGGFRDACTVFQRQKVEELEEGIGSLSEKCGMWRSIMDERLQEIQNL 820
KELD LL+ IEE GGF+DACT+ QR VE LEE + +LSEKC W+S+MDE+L E+Q+L
Sbjct: 758 AKELDSLLECIEEPGGFKDACTISQRGSVEALEERMQTLSEKCITWKSMMDEQLGEVQHL 817
Query: 821 FDKTVQVLARKIYTEGIVKQASDSRYWDLWNRQKLSPELELKRGHILSINQDLINQLIEL 880
DKTVQVLARKIY +GIVKQASDSRYW+LWNRQKL E ELKR HIL +NQ L NQLI+L
Sbjct: 818 LDKTVQVLARKIYMDGIVKQASDSRYWELWNRQKLGSEFELKRRHILKLNQVLTNQLIDL 877
Query: 881 ERHFNSLELNKFGENDGVHGCQRRPQSRFGTSRHIQSLHSLHTTMSSQIAAAEQLSDRLS 940
ERHFN+LEL+KF EN GV +R QSR G SR IQSLHSL+ T +SQ+AAAE LS+ LS
Sbjct: 878 ERHFNTLELHKFDENGGVPKGRREFQSRHGPSRQIQSLHSLYNTTNSQLAAAEHLSECLS 937
Query: 941 KQMALLSIESPVKQQNVKKELFETLGIPYDASFSSPDVTKVMDQSSIKKLIHSSGSAAAN 1000
KQMA+LS+ESPVKQ+N+KKELFET+GIPY+ +FSSPD TKV D SS KL+ S SA+
Sbjct: 938 KQMAVLSVESPVKQKNIKKELFETIGIPYETTFSSPDSTKVGDSSSSMKLLLSG-SASNK 996
Query: 1001 DQSRRRQSSAMKSYDPETARRRRDSLDRSWASFEPPKTTVKRMLLQDHQKCSQVKSSLKD 1060
QSRRRQ S MKS D ETARRRRDSLD+SWASFEP KTTVKR+LLQ+ QK S KSSL D
Sbjct: 997 SQSRRRQLSVMKSSDSETARRRRDSLDQSWASFEPKKTTVKRVLLQETQKTSVSKSSLMD 1056
Query: 1061 KQQISPHMLEGAAIVRPKDRTTPSTSWNPLRIKGLQDASLKQTSESQSTLFKWAGDPTGP 1120
+QQ+ +++ +A+ PKD T PST P KG+Q KQ + + T +WA D P
Sbjct: 1057 RQQLDNSVVDSSAVNHPKDLTPPSTLTYPSGNKGIQFTFQKQALDKKPTPSRWASDSLPP 1116
Query: 1121 S----QMTGLKSPVFQSNIASTRSSLSATQLSPMGWQNHARNTGDVTAEKLSSGVYFDVK 1176
S Q TGL+ P+ S A S+S Q P+ Q +R TG VT+++L SG K
Sbjct: 1117 SQSTAQATGLRPPMLGSGAA--LPSISPYQALPITGQILSRETGIVTSDEL-SGTGSTGK 1173
Query: 1177 SNSTLTNETKSTMQSESNIFQKPTISTMSPTLTPSLLKNPSEMPISNGKGTVPESSTIGS 1236
S+S LT+E+KS QSE+N+ +K ++S P P+L+K+ +EM N KG +S + +
Sbjct: 1174 SDSLLTHESKSIQQSETNLHKKSSVSMELPAQAPTLMKS-NEMLNCNAKGAGFANSRMAT 1232
Query: 1237 EKHGAATTKTLFVESG--KNRDAQVSTPAAVPTSSAFPGIVPIFD-AANKSQPGGKSFTP 1293
H TK F++S N + +A P S+ PG VP F AA++SQP K+ +
Sbjct: 1233 MSHVPTNTKGAFLKSHSISNETSFSLLTSASPLVSSHPGTVPQFSVAASRSQPSEKASSS 1292
Query: 1294 STFSLSLSASSSPM 1307
FS+ LS SSSP+
Sbjct: 1293 QAFSMPLSFSSSPV 1306
>gi|359478820|ref|XP_002283769.2| PREDICTED: uncharacterized protein LOC100245011 [Vitis vinifera]
Length = 1696
Score = 1107 bits (2862), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 662/1325 (49%), Positives = 839/1325 (63%), Gaps = 147/1325 (11%)
Query: 1 MATP-TRIEIE-EEIEGGRIESNDFFFDEIGESIPIIRDDDSQFDLQNP-PSLPLAVSQL 57
MATP + I++E +++EGGR++ +D+ F +IGES+ I+ F+L +P PS PLAVS+
Sbjct: 1 MATPESVIDLEGKDLEGGRLDCDDYVFVKIGESV-TIKPQYYNFNLDSPLPSQPLAVSER 59
Query: 58 SHQLIFAAHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSND 117
S QLIF AHS GFCVART VI+ AKE+KE G+ +QELSVVDVP N V IL+LS D
Sbjct: 60 S-QLIFVAHSDGFCVARTEAVIELAKEIKEKGSGSSIQELSVVDVPIAN--VRILALSTD 116
Query: 118 ESTLAVSLYQSPHIHFFSVHSLLNKEIKPSFSCSLNGSTYVKDIRWRKKPENSFLALSNV 177
STLA S+ IHFFSV SLLNK +PSF+ SL+GS+ VKD+RWRKK +NS++ LS+
Sbjct: 117 SSTLAASV--GGDIHFFSVDSLLNKGQEPSFTRSLSGSSSVKDMRWRKKMDNSYVVLSSD 174
Query: 178 GRLYRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSFKSWV 237
G+LY G GPL+DVMD VDAVEWSVKG +AVA+KN LS+L+SK KERL +SLSFKSW+
Sbjct: 175 GKLYHGAAEGPLKDVMDGVDAVEWSVKGNSIAVARKNTLSLLSSKFKERLCMSLSFKSWI 234
Query: 238 GDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVVL 297
GD VNC+VKVDSIRWVR DCII+GCF+LT DGKEE+++VQV+ SK+GKITDASS P VL
Sbjct: 235 GDSGVNCTVKVDSIRWVRPDCIILGCFKLTADGKEESFMVQVVTSKEGKITDASSLPTVL 294
Query: 298 SFSDVFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIA 357
SF DVF ++DDI+P G+GPYLFL+Y+E+ LAI+ +RKN+D+H+VL WSLDD+KN+ A
Sbjct: 295 SFYDVFPGLVDDIVPFGSGPYLFLSYLEQCQLAITTSRKNVDNHIVLFGWSLDDKKNEAA 354
Query: 358 VVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEE-KELSPYFILMCLT 416
++DI RDK+ PRI+LQ+N +DNLI+GLC DKVSL GKV++QLG EE +ELSPY +L CLT
Sbjct: 355 ILDIGRDKYRPRIELQENDDDNLILGLCCDKVSLYGKVEIQLGAEEPRELSPYCVLFCLT 414
Query: 417 LDGKLNMFHVASVSGPSCSPDVDFT-LSDAEDDVPAVVPVDSNLPRVYSGSGLQKIEPVA 475
L+GKL MF VASV+G + +P D + L+ ED+ PA PV+ +
Sbjct: 415 LEGKLVMFQVASVTG-TPAPTQDLSPLTGDEDETPAEAPVEHD----------------- 456
Query: 476 SSFKSQGVKLKELDTDDTCGVTAKSNLKGFDKYESSTSISIS------------------ 517
+S+ KE+ + K++L F + +S S I+
Sbjct: 457 ---QSREANTKEISIKQEGEILIKNDLNTFQENKSLISACIADQILHKETIAADHEAKSL 513
Query: 518 -NSQELENKDRQQI------------QNSL--QKSTNLVQSPPKASLPEVTSFGVRDSSK 562
NSQ E +Q++ Q+ L Q+STNL S K S E V D K
Sbjct: 514 VNSQTFEADGQQRVSTIKLYQEVDGKQSGLPRQQSTNLEGSSLKTSPLEGLGNVVGDVKK 573
Query: 563 TGTQDTGGFGLGSTGFVGKFPTDTPSLSSHKDLLKSLEFGKEAQGNFGSAGLQSLSSQSQ 622
T Q + + S S + +S E KE G GS LQ+ +SQS
Sbjct: 574 TDIQK-----------ITGVGSGLGSSQSSHNFSRSFETHKELPGKIGSTNLQN-ASQSW 621
Query: 623 SCGNFISSEDSRVKLPVLPSSHSHEKTYENSSLGAPNVSGSFVGKPLSSKDATGSLTPVF 682
S G F + + KL ++S SFV
Sbjct: 622 SGGKFTFPKSTEEKL---------------------SLSSSFV----------------- 643
Query: 683 SAKPVHGDGDRASTGAGKIESLPSVRSSQFSLPQNFASGKSHNQKLYPSKDDYKTATLSG 742
+ R+ T AG S+P V P +YP D T+ +G
Sbjct: 644 -------ESGRSET-AGINLSIPQVPGGPVGSP------------IYP--KDAATSLAAG 681
Query: 743 LPNSEPNLSKQSGNIKEMTKELDMLLQSIEETGGFRDACTVFQRQKVEELEEGIGSLSEK 802
Q +KEM KELD LLQ IE GGFRDACT+FQ+ V ELE+GIG+LSE
Sbjct: 682 NFGRISQSRGQRVQVKEMAKELDTLLQCIEGPGGFRDACTIFQKSSVMELEQGIGTLSEN 741
Query: 803 CGMWRSIMDERLQEIQNLFDKTVQVLARKIYTEGIVKQASDSRYWDLWNRQKLSPELELK 862
C MWRSI+D++ EI +L DKTVQVLARK+Y +GIVKQA+DSRYWDLW+RQKL+ ELELK
Sbjct: 742 CRMWRSIIDQQHGEIHHLLDKTVQVLARKVYMQGIVKQATDSRYWDLWSRQKLASELELK 801
Query: 863 RGHILSINQDLINQLIELERHFNSLELNKFGENDGVHGCQRRPQSRFGTSRHIQSLHSLH 922
R +IL INQDL NQLIELERHFN++ELNKFGEN G H +R QSR G SRH+QSLHSLH
Sbjct: 802 RRNILKINQDLTNQLIELERHFNAIELNKFGENGGNHVGRRALQSRSGPSRHMQSLHSLH 861
Query: 923 TTMSSQIAAAEQLSDRLSKQMALLSIESPVKQQNVKKELFETLGIPYD-ASFSSPDVTKV 981
TM+SQ+AAAEQLS+ LSKQM +LSI+S VK+QNVKKELFE +GIPYD AS SSP ++
Sbjct: 862 NTMNSQLAAAEQLSECLSKQMTMLSIDSTVKKQNVKKELFEAIGIPYDSASVSSPTISNT 921
Query: 982 MDQSSIKKLIHSSGSAAANDQSRRRQSSAMKSYDPETARRRRDSLDRSWASFEPPKTTVK 1041
D S+K + SS S+A DQSRR Q SA+KSY+PET RRRRDSL +SWA+FEPPKT VK
Sbjct: 922 SDTPSMKNFLVSS-SSANKDQSRRNQLSALKSYEPETVRRRRDSLGQSWANFEPPKTIVK 980
Query: 1042 RMLLQDHQKCSQVKSSLK-DKQQISPHMLEGAAIVRPKDRTTPSTSWNPLRIKGLQDASL 1100
RM+L++ QK K S DK+Q SPH LEG+A + TTPS P R KG+ + S
Sbjct: 981 RMVLEEQQKVGVNKPSFSVDKRQFSPHKLEGSAFAYSNNNTTPSAFLYPSRNKGIHEISA 1040
Query: 1101 KQTSESQST-LFKWAGDPTGPSQMTGLKSP---VFQSNIASTRSSLSATQLSPMGWQNHA 1156
KQ S S ST LF+WA DP+GPSQ G +SP N S SSLSA Q SP+ Q++A
Sbjct: 1041 KQASYSPSTSLFRWANDPSGPSQSFGSRSPTPHALPGNNLSAFSSLSAPQSSPVVDQSNA 1100
Query: 1157 RNTGDVTAEKLSSGVYFDVKSNSTLTNETKSTMQSESNIFQKPTISTMSPTLTPSLLKNP 1216
T ++T E+ SSGV F KS++ NETKST+ SES++ Q P IST P T L K P
Sbjct: 1101 METCNLTNERSSSGVTFVEKSDAVSINETKSTLLSESHLPQTPIISTSLPARTLPLTKKP 1160
Query: 1217 SEMPISNGKGTVPESSTIGSEKHGAATTKTLFVESGKNRDAQVSTPAAVPTSSAFPGIVP 1276
+EM SNGKGTV TIGS K + + F +SG + + +S AV + + PG V
Sbjct: 1161 NEMSNSNGKGTVLAKPTIGSVKQKPVSPGSSFSQSGVSPFSPIS---AVQPAPSLPGKVF 1217
Query: 1277 IFDAA 1281
D A
Sbjct: 1218 QLDIA 1222
>gi|334183347|ref|NP_564694.2| Nuclear pore complex protein [Arabidopsis thaliana]
gi|332195141|gb|AEE33262.1| Nuclear pore complex protein [Arabidopsis thaliana]
Length = 1816
Score = 937 bits (2421), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1298 (44%), Positives = 789/1298 (60%), Gaps = 113/1298 (8%)
Query: 5 TRIEIEEEIEGGRIESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFA 64
+R+EIEE+ EG RI +ND++F+ IGE I I++DD+Q+DL+NPPS PLA+S+ H ++F
Sbjct: 2 SRVEIEEDTEGDRISTNDYYFERIGEPI-SIKEDDAQYDLENPPSQPLAISE-RHAVLFV 59
Query: 65 AHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVS 124
AHSSGF V RTNDVI A+K NG + +Q+LS+VDVP + V ILSLS D+S LAV+
Sbjct: 60 AHSSGFFVGRTNDVISASKNSNGNGDKVFIQDLSLVDVPVGD--VRILSLSADDSILAVT 117
Query: 125 LYQSPHIHFFSVHSLLNKEIKPSFSCSLNGSTYVKDIRWRKKPENSFLALSNVGRLYRGG 184
+ + IHFFSV SLL K+ KPSFS S + S +VKD RWR+ ++S+L LSN G+L+ G
Sbjct: 118 V--AADIHFFSVDSLLKKDAKPSFSYSPDESGFVKDFRWRRNDKHSYLVLSNTGKLFHGI 175
Query: 185 VNGPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSFKSWVGDCDVNC 244
N P VMD VDAVEWS KG ++AVA+ N L I +SK E+ I+LSF SW+GD D +C
Sbjct: 176 DNAPPRHVMDAVDAVEWSSKGSYIAVAQDNSLRIFSSKFNEKRCIALSFDSWIGDSDEDC 235
Query: 245 SVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVVLSFSDVFS 304
VKVDSIRWVR +CI++GCFQL E G+EENYLVQVI+S DGKI+D S+ V LSFSD+F
Sbjct: 236 FVKVDSIRWVRNNCILLGCFQLIE-GREENYLVQVIRSPDGKISDGSTNLVALSFSDLFP 294
Query: 305 DVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRD 364
+DD++P G GP+L +Y+++ LA++ANRK+ID+H+VLL WS D+K+ ++VVDI+R+
Sbjct: 295 CSMDDLVPVGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSSGDDKSAVSVVDIDRE 354
Query: 365 KWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEE-KELSPYFILMCLTLDGKLNM 423
++PRI LQ+N +DN +MGLCID+VS+ G V V+ G +E KEL PYF+L+CLTL+GKL M
Sbjct: 355 TFLPRIGLQENNDDNTVMGLCIDRVSIEGTVNVRSGDDELKELQPYFVLVCLTLEGKLVM 414
Query: 424 FHVASVSGPSCSPDVDF-TLSDAEDDVPAVVPVDSNLPRVYSGSGLQKIEPVASSFKSQG 482
F+VASV+G S D D + SD ED ++ D L K SS K Q
Sbjct: 415 FNVASVAGRPASSDTDLASSSDIEDAYTPLIEDD-----------LSK----QSSEKHQQ 459
Query: 483 VKL------KELDTDDTCGVTAKSNLKGFDK-YES-STSISISNSQELENKDRQQIQNSL 534
+ + K L+T+ N F K +ES +S+S N+++ E + +Q
Sbjct: 460 LNIAVQNDQKHLNTEKFSTEQRLPNENIFSKEFESVKSSVSGDNNKKQEPYAEKPLQ--- 516
Query: 535 QKSTNLVQSPPKASLPEV--TSFG-VRDSSKTGTQDTGGFGLGSTGFVGKFPTDTPSLSS 591
V+ ++ +P + TSFG + S T GFG + K D + S+
Sbjct: 517 ------VEDAQQSMIPRLSGTSFGQLPMSLGYDTNKFAGFG-PALPVSEKLQKDIFAQSN 569
Query: 592 HKDLLKSLEFGKEAQGNFGSAGLQSL--------SSQSQSCGNFISSEDSRVKLPVLPSS 643
L ++E K FGS GLQ+ SSQ S G +S D V P
Sbjct: 570 SMHLQANVE-SKSTAAFFGSPGLQNAILQSPQNTSSQPWSSGKSVSPPD-FVSGPFPSMR 627
Query: 644 HSHEKTYENSSLGAPNVSGSFVGKPLSSKDA------TGSLTPVFSAKPVHGDGDRASTG 697
+ K S G +V P+S KD TG ++ + + P+ G + G
Sbjct: 628 DTQHKQSVQSGTG-------YVNPPMSIKDKSVQVIETGRVSALSNLSPLLGQNQDTNEG 680
Query: 698 AGKIESLPSVRSSQFS--LPQNFASGKSHNQKLYPSKDDYKTATLSGLPNSEPNLSKQSG 755
KIE +PS+R+SQ S + +F SH Q +KT +G E N+S Q
Sbjct: 681 VEKIEPIPSIRASQLSQQVKSSFEKSASHQQ--------HKTPLSTGPLRLEHNMSNQPS 732
Query: 756 NIKEMTKELDMLLQSIEETGGFRDACTVFQRQKVEELEEGIGSLSEKCGMWRSIMDERLQ 815
NI EM +E+D LLQSIE GGF+D+C + VEELE+G+ SL+ KC W+S + E+
Sbjct: 733 NINEMAREMDTLLQSIEGPGGFKDSCAFILKSNVEELEQGLESLAGKCQTWKSTIHEQQA 792
Query: 816 EIQNLFDKTVQVLARKIYTEGIVKQASDSRYWDLWNRQKLSPELELKRGHILSINQDLIN 875
EIQ+L DKT+QVLA+K Y EG+ KQ +D++YW LWNRQKL+PELE KR HI+ +N+DL +
Sbjct: 793 EIQHLLDKTIQVLAKKTYMEGMYKQTADNQYWQLWNRQKLNPELEAKRQHIMKLNKDLTH 852
Query: 876 QLIELERHFNSLELNKFGENDGVHGCQRRP-QSRFGTSRHIQSLHSLHTTMSSQIAAAEQ 934
QLIELER+FN LEL+++ E DG H RR +R SR +QSLHSLH TMSSQ+AAAEQ
Sbjct: 853 QLIELERYFNRLELDRYNE-DGGHPVARRGVPNRSAPSRRVQSLHSLHNTMSSQLAAAEQ 911
Query: 935 LSDRLSKQMALLSIESPVKQQNVKKELFETLGIPYDASFSSPDVTKVMDQSSIKKLIHSS 994
LS+ LSKQM L I+SPVK +NVK+ELFET+GIPYDASFSSPD K + SS K L+ SS
Sbjct: 912 LSECLSKQMTYLKIDSPVK-KNVKQELFETIGIPYDASFSSPDAVKAKNASSAKNLLLSS 970
Query: 995 GSAAANDQSRRRQSSAMKSYDPETARRRRDSLDRSWASFEPPKTTVKRMLLQDHQKCSQV 1054
A+ N QSR+RQSSAMK+ DPETARRRR+SLDR+WA+FEPPKTTVKRMLLQ+ QK
Sbjct: 971 IPASINQQSRQRQSSAMKNSDPETARRRRESLDRNWAAFEPPKTTVKRMLLQEQQKTGMN 1030
Query: 1055 KSSLKDKQQISPHMLEGAAIVRPKDRTTPSTSWNPLRIKGLQDASLKQTSESQSTLFKWA 1114
+ ++ ++ S + + +++ KD +P S N KG+ ++ + TSE+QST FK
Sbjct: 1031 QQTVLSERLRSANNTQDRSLLHVKDHASPVVSSN----KGIMESFQQDTSEAQSTPFK-- 1084
Query: 1115 GDPTGPSQMTGLKSPVFQSNIASTRSSLSATQLSPMGWQ-NHARNTGDVTAEKLSSGVYF 1173
+ P+ QSN T S +SA++ S W N + NT E S
Sbjct: 1085 -----------TRPPMPQSNSPFTISPISASKPS-FNWSGNKSSNTTSYAEESAPS---- 1128
Query: 1174 DVKSNSTLTNETKSTMQSESNIFQKPTISTMSPTLTPSLLKNPSEMPISNGKGTVPESST 1233
+K T++ S+ + ++P S T+ K E S K +
Sbjct: 1129 QIKDTRTVSQPGGSSF-----LPKRPVAS----TVLEQTEKKAGEFKFSEAKANAFVETA 1179
Query: 1234 IGSEKHGAATTKTLFVESGKNRDAQVSTPAAVPTSSAF 1271
GS + + T+ ES K AQ ST ++ +S+F
Sbjct: 1180 AGSVQRLSTTSSGSDFESSKGFGAQFSTMSSGAPASSF 1217
>gi|334183349|ref|NP_001117499.2| Nuclear pore complex protein [Arabidopsis thaliana]
gi|332195142|gb|AEE33263.1| Nuclear pore complex protein [Arabidopsis thaliana]
Length = 1819
Score = 931 bits (2407), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1301 (44%), Positives = 789/1301 (60%), Gaps = 116/1301 (8%)
Query: 5 TRIEIEEEIEGGRIESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFA 64
+R+EIEE+ EG RI +ND++F+ IGE I I++DD+Q+DL+NPPS PLA+S+ H ++F
Sbjct: 2 SRVEIEEDTEGDRISTNDYYFERIGEPI-SIKEDDAQYDLENPPSQPLAISE-RHAVLFV 59
Query: 65 AHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVS 124
AHSSGF V RTNDVI A+K NG + +Q+LS+VDVP + V ILSLS D+S LAV+
Sbjct: 60 AHSSGFFVGRTNDVISASKNSNGNGDKVFIQDLSLVDVPVGD--VRILSLSADDSILAVT 117
Query: 125 LYQSPHIHFFSVHSLLNKEIKPSFSCSLNGSTYVKDIRWRKKPENSFLALSNVGRLYRGG 184
+ + IHFFSV SLL K+ KPSFS S + S +VKD RWR+ ++S+L LSN G+L+ G
Sbjct: 118 V--AADIHFFSVDSLLKKDAKPSFSYSPDESGFVKDFRWRRNDKHSYLVLSNTGKLFHGI 175
Query: 185 VNGPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSFKSWVGDCDVNC 244
N P VMD VDAVEWS KG ++AVA+ N L I +SK E+ I+LSF SW+GD D +C
Sbjct: 176 DNAPPRHVMDAVDAVEWSSKGSYIAVAQDNSLRIFSSKFNEKRCIALSFDSWIGDSDEDC 235
Query: 245 SVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVVLSFSDVFS 304
VKVDSIRWVR +CI++GCFQL E G+EENYLVQVI+S DGKI+D S+ V LSFSD+F
Sbjct: 236 FVKVDSIRWVRNNCILLGCFQLIE-GREENYLVQVIRSPDGKISDGSTNLVALSFSDLFP 294
Query: 305 DVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRD 364
+DD++P G GP+L +Y+++ LA++ANRK+ID+H+VLL WS D+K+ ++VVDI+R+
Sbjct: 295 CSMDDLVPVGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSSGDDKSAVSVVDIDRE 354
Query: 365 KWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEE-KELSPYFILMCLTLDGKLNM 423
++PRI LQ+N +DN +MGLCID+VS+ G V V+ G +E KEL PYF+L+CLTL+GKL M
Sbjct: 355 TFLPRIGLQENNDDNTVMGLCIDRVSIEGTVNVRSGDDELKELQPYFVLVCLTLEGKLVM 414
Query: 424 FHVASVSGPSCSPDVDF-TLSDAEDDVPAVVPVDSNLPRVYSGSGLQKIEPVASSFKSQG 482
F+VASV+G S D D + SD ED ++ D L K SS K Q
Sbjct: 415 FNVASVAGRPASSDTDLASSSDIEDAYTPLIEDD-----------LSK----QSSEKHQQ 459
Query: 483 VKL------KELDTDDTCGVTAKSNLKGFDK-YES-STSISISNSQELENKDRQQIQNSL 534
+ + K L+T+ N F K +ES +S+S N+++ E + +Q
Sbjct: 460 LNIAVQNDQKHLNTEKFSTEQRLPNENIFSKEFESVKSSVSGDNNKKQEPYAEKPLQ--- 516
Query: 535 QKSTNLVQSPPKASLPEV--TSFG-VRDSSKTGTQDTGGFGLGSTGFVGKFPTDTPSLSS 591
V+ ++ +P + TSFG + S T GFG + K D + S+
Sbjct: 517 ------VEDAQQSMIPRLSGTSFGQLPMSLGYDTNKFAGFG-PALPVSEKLQKDIFAQSN 569
Query: 592 HKDLLKSLEFGKEAQGNFGSAGLQSL--------SSQSQSCGNFISSEDSRVKLPVLPSS 643
L ++E K FGS GLQ+ SSQ S G +S D V P
Sbjct: 570 SMHLQANVE-SKSTAAFFGSPGLQNAILQSPQNTSSQPWSSGKSVSPPD-FVSGPFPSMR 627
Query: 644 HSHEKTYENSSLGAPNVSGSFVGKPLSSKDA------TGSLTPVFSAKPVHGDGDRASTG 697
+ K S G +V P+S KD TG ++ + + P+ G + G
Sbjct: 628 DTQHKQSVQSGTG-------YVNPPMSIKDKSVQVIETGRVSALSNLSPLLGQNQDTNEG 680
Query: 698 AGKIESLPSVRSSQFS--LPQNFASGKSHNQKLYPSKDDYKTATLSGLPNSEPNLSKQSG 755
KIE +PS+R+SQ S + +F SH Q +KT +G E N+S Q
Sbjct: 681 VEKIEPIPSIRASQLSQQVKSSFEKSASHQQ--------HKTPLSTGPLRLEHNMSNQPS 732
Query: 756 NIKEMTKELDMLLQSIEETGGFRDACTVFQRQKVEELEEGIGSLSEKCGMWRSIMDERLQ 815
NI EM +E+D LLQSIE GGF+D+C + VEELE+G+ SL+ KC W+S + E+
Sbjct: 733 NINEMAREMDTLLQSIEGPGGFKDSCAFILKSNVEELEQGLESLAGKCQTWKSTIHEQQA 792
Query: 816 EIQNLFDKTVQVLARKIYTEGIVKQASDSRYWDLWNRQKLSPELELKRGHILSINQDLIN 875
EIQ+L DKT+QVLA+K Y EG+ KQ +D++YW LWNRQKL+PELE KR HI+ +N+DL +
Sbjct: 793 EIQHLLDKTIQVLAKKTYMEGMYKQTADNQYWQLWNRQKLNPELEAKRQHIMKLNKDLTH 852
Query: 876 QLIELERHFNSLELNKFGENDGVHGCQRRP-QSRFGTSRHIQSLHSLHTTMSSQIAAAEQ 934
QLIELER+FN LEL+++ E DG H RR +R SR +QSLHSLH TMSSQ+AAAEQ
Sbjct: 853 QLIELERYFNRLELDRYNE-DGGHPVARRGVPNRSAPSRRVQSLHSLHNTMSSQLAAAEQ 911
Query: 935 LSDRLSKQMALLSIESPVKQQNVKKELFETLGIPYDASFSSPDVTKVMDQSSIKKLIHSS 994
LS+ LSKQM L I+SPVK +NVK+ELFET+GIPYDASFSSPD K + SS K L+ SS
Sbjct: 912 LSECLSKQMTYLKIDSPVK-KNVKQELFETIGIPYDASFSSPDAVKAKNASSAKNLLLSS 970
Query: 995 GSAAANDQSRRRQSSAMKSYDPETARRRRDSLDR---SWASFEPPKTTVKRMLLQDHQKC 1051
A+ N QSR+RQSSAMK+ DPETARRRR+SLDR +WA+FEPPKTTVKRMLLQ+ QK
Sbjct: 971 IPASINQQSRQRQSSAMKNSDPETARRRRESLDRVIFNWAAFEPPKTTVKRMLLQEQQKT 1030
Query: 1052 SQVKSSLKDKQQISPHMLEGAAIVRPKDRTTPSTSWNPLRIKGLQDASLKQTSESQSTLF 1111
+ ++ ++ S + + +++ KD +P S N KG+ ++ + TSE+QST F
Sbjct: 1031 GMNQQTVLSERLRSANNTQDRSLLHVKDHASPVVSSN----KGIMESFQQDTSEAQSTPF 1086
Query: 1112 KWAGDPTGPSQMTGLKSPVFQSNIASTRSSLSATQLSPMGWQ-NHARNTGDVTAEKLSSG 1170
K + P+ QSN T S +SA++ S W N + NT E S
Sbjct: 1087 K-------------TRPPMPQSNSPFTISPISASKPS-FNWSGNKSSNTTSYAEESAPS- 1131
Query: 1171 VYFDVKSNSTLTNETKSTMQSESNIFQKPTISTMSPTLTPSLLKNPSEMPISNGKGTVPE 1230
+K T++ S+ + ++P S T+ K E S K
Sbjct: 1132 ---QIKDTRTVSQPGGSSF-----LPKRPVAS----TVLEQTEKKAGEFKFSEAKANAFV 1179
Query: 1231 SSTIGSEKHGAATTKTLFVESGKNRDAQVSTPAAVPTSSAF 1271
+ GS + + T+ ES K AQ ST ++ +S+F
Sbjct: 1180 ETAAGSVQRLSTTSSGSDFESSKGFGAQFSTMSSGAPASSF 1220
>gi|4204261|gb|AAD10642.1| Hypothetical protein [Arabidopsis thaliana]
Length = 1804
Score = 929 bits (2402), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1303 (44%), Positives = 789/1303 (60%), Gaps = 118/1303 (9%)
Query: 5 TRIEIEEEIEGGRIESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFA 64
+R+EIEE+ EG RI +ND++F+ IGE I I++DD+Q+DL+NPPS PLA+S+ H ++F
Sbjct: 2 SRVEIEEDTEGDRISTNDYYFERIGEPI-SIKEDDAQYDLENPPSQPLAISE-RHAVLFV 59
Query: 65 AHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVS 124
AHSSGF V RTNDVI A+K NG + +Q+LS+VDVP + V ILSLS D+S LAV+
Sbjct: 60 AHSSGFFVGRTNDVISASKNSNGNGDKVFIQDLSLVDVPVGD--VRILSLSADDSILAVT 117
Query: 125 LYQSPHIHFFSVHSLLNK-----EIKPSFSCSLNGSTYVKDIRWRKKPENSFLALSNVGR 179
+ + IHFFSV SLL K + KPSFS S + S +VKD RWR+ ++S+L LSN G+
Sbjct: 118 V--AADIHFFSVDSLLKKVHVSEDAKPSFSYSPDESGFVKDFRWRRNDKHSYLVLSNTGK 175
Query: 180 LYRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSFKSWVGD 239
L+ G N P VMD VDAVEWS KG ++AVA+ N L I +SK E+ I+LSF SW+GD
Sbjct: 176 LFHGIDNAPPRHVMDAVDAVEWSSKGSYIAVAQDNSLRIFSSKFNEKRCIALSFDSWIGD 235
Query: 240 CDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVVLSF 299
D +C VKVDSIRWVR +CI++GCFQL E G+EENYLVQVI+S DGKI+D S+ V LSF
Sbjct: 236 SDEDCFVKVDSIRWVRNNCILLGCFQLIE-GREENYLVQVIRSPDGKISDGSTNLVALSF 294
Query: 300 SDVFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIAVV 359
SD+F +DD++P G GP+L +Y+++ LA++ANRK+ID+H+VLL WS D+K+ ++VV
Sbjct: 295 SDLFPCSMDDLVPVGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSSGDDKSAVSVV 354
Query: 360 DINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEE-KELSPYFILMCLTLD 418
DI+R+ ++PRI LQ+N +DN +MGLCID+VS+ G V V+ G +E KEL PYF+L+CLTL+
Sbjct: 355 DIDRETFLPRIGLQENNDDNTVMGLCIDRVSIEGTVNVRSGDDELKELQPYFVLVCLTLE 414
Query: 419 GKLNMFHVASVSGPSCSPDVDF-TLSDAEDDVPAVVPVDSNLPRVYSGSGLQKIEPVASS 477
GKL MF+VASV+G S D D + SD ED ++ D L K SS
Sbjct: 415 GKLVMFNVASVAGRPASSDTDLASSSDIEDAYTPLIEDD-----------LSK----QSS 459
Query: 478 FKSQGVKL------KELDTDDTCGVTAKSNLKGFDK-YES-STSISISNSQELENKDRQQ 529
K Q + + K L+T+ N F K +ES +S+S N+++ E +
Sbjct: 460 EKHQQLNIAVQNDQKHLNTEKFSTEQRLPNENIFSKEFESVKSSVSGDNNKKQEPYAEKP 519
Query: 530 IQNSLQKSTNLVQSPPKASLPEV--TSFG-VRDSSKTGTQDTGGFGLGSTGFVGKFPTDT 586
+Q V+ ++ +P + TSFG + S T GFG + K D
Sbjct: 520 LQ---------VEDAQQSMIPRLSGTSFGQLPMSLGYDTNKFAGFG-PALPVSEKLQKDI 569
Query: 587 PSLSSHKDLLKSLEFGKEAQGNFGSAGLQSL--------SSQSQSCGNFISSEDSRVKLP 638
+ S+ L ++E K FGS GLQ+ SSQ S G +S D V P
Sbjct: 570 FAQSNSMHLQANVE-SKSTAAFFGSPGLQNAILQSPQNTSSQPWSSGKSVSPPD-FVSGP 627
Query: 639 VLPSSHSHEKTYENSSLGAPNVSGSFVGKPLSSKDA------TGSLTPVFSAKPVHGDGD 692
+ K S G +V P+S KD TG ++ + + P+ G
Sbjct: 628 FPSMRDTQHKQSVQSGTG-------YVNPPMSIKDKSVQVIETGRVSALSNLSPLLGQNQ 680
Query: 693 RASTGAGKIESLPSVRSSQFS--LPQNFASGKSHNQKLYPSKDDYKTATLSGLPNSEPNL 750
+ G KIE +PS+R+SQ S + +F SH Q +KT +G E N+
Sbjct: 681 DTNEGVEKIEPIPSIRASQLSQQVKSSFEKSASHQQ--------HKTPLSTGPLRLEHNM 732
Query: 751 SKQSGNIKEMTKELDMLLQSIEETGGFRDACTVFQRQKVEELEEGIGSLSEKCGMWRSIM 810
S Q NI EM +E+D LLQSIE GGF+D+C + VEELE+G+ SL+ KC W+S +
Sbjct: 733 SNQPSNINEMAREMDTLLQSIEGPGGFKDSCAFILKSNVEELEQGLESLAGKCQTWKSTI 792
Query: 811 DERLQEIQNLFDKTVQVLARKIYTEGIVKQASDSRYWDLWNRQKLSPELELKRGHILSIN 870
E+ EIQ+L DKT+QVLA+K Y EG+ KQ +D++YW LWNRQKL+PELE KR HI+ +N
Sbjct: 793 HEQQAEIQHLLDKTIQVLAKKTYMEGMYKQTADNQYWQLWNRQKLNPELEAKRQHIMKLN 852
Query: 871 QDLINQLIELERHFNSLELNKFGENDGVHGCQRRP-QSRFGTSRHIQSLHSLHTTMSSQI 929
+DL +QLIELER+FN LEL+++ E DG H RR +R SR +QSLHSLH TMSSQ+
Sbjct: 853 KDLTHQLIELERYFNRLELDRYNE-DGGHPVARRGVPNRSAPSRRVQSLHSLHNTMSSQL 911
Query: 930 AAAEQLSDRLSKQMALLSIESPVKQQNVKKELFETLGIPYDASFSSPDVTKVMDQSSIKK 989
AAAEQLS+ LSKQM L I+SPVK +NVK+ELFET+GIPYDASFSSPD K + SS K
Sbjct: 912 AAAEQLSECLSKQMTYLKIDSPVK-KNVKQELFETIGIPYDASFSSPDAVKAKNASSAKN 970
Query: 990 LIHSSGSAAANDQSRRRQSSAMKSYDPETARRRRDSLDRSWASFEPPKTTVKRMLLQDHQ 1049
L+ SS A+ N QSR+RQSSAMK+ DPETARRRR+SLDR+WA+FEPPKTTVKRMLLQ+ Q
Sbjct: 971 LLLSSIPASINQQSRQRQSSAMKNSDPETARRRRESLDRNWAAFEPPKTTVKRMLLQEQQ 1030
Query: 1050 KCSQVKSSLKDKQQISPHMLEGAAIVRPKDRTTPSTSWNPLRIKGLQDASLKQTSESQST 1109
K + ++ ++ S + + +++ KD +P S N KG+ ++ + TSE+QST
Sbjct: 1031 KTGMNQQTVLSERLRSANNTQDRSLLHVKDHASPVVSSN----KGIMESFQQDTSEAQST 1086
Query: 1110 LFKWAGDPTGPSQMTGLKSPVFQSNIASTRSSLSATQLSPMGWQ-NHARNTGDVTAEKLS 1168
FK + P+ QSN T S +SA++ S W N + NT E
Sbjct: 1087 PFK-------------TRPPMPQSNSPFTISPISASKPS-FNWSGNKSSNTTSYAEESAP 1132
Query: 1169 SGVYFDVKSNSTLTNETKSTMQSESNIFQKPTISTMSPTLTPSLLKNPSEMPISNGKGTV 1228
S +K T++ S+ + ++P S T+ K E S K
Sbjct: 1133 S----QIKDTRTVSQPGGSSF-----LPKRPVAS----TVLEQTEKKAGEFKFSEAKANA 1179
Query: 1229 PESSTIGSEKHGAATTKTLFVESGKNRDAQVSTPAAVPTSSAF 1271
+ GS + + T+ ES K AQ ST ++ +S+F
Sbjct: 1180 FVETAAGSVQRLSTTSSGSDFESSKGFGAQFSTMSSGAPASSF 1222
>gi|297853258|ref|XP_002894510.1| hypothetical protein ARALYDRAFT_337635 [Arabidopsis lyrata subsp.
lyrata]
gi|297340352|gb|EFH70769.1| hypothetical protein ARALYDRAFT_337635 [Arabidopsis lyrata subsp.
lyrata]
Length = 1825
Score = 926 bits (2393), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 576/1298 (44%), Positives = 789/1298 (60%), Gaps = 108/1298 (8%)
Query: 5 TRIEIEEEIEGGRIESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFA 64
+++EIEE++EG RI +ND++F++IGE I I++DD+Q+D++NPPS PLA+S+ H ++F
Sbjct: 2 SKVEIEEDMEGDRISTNDYYFEKIGEPI-SIKEDDAQYDIENPPSQPLAISE-RHGVVFV 59
Query: 65 AHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVS 124
AHSSGF V RT DVI A+K NG + +Q+LS+VDVP + V ILSLS D+S LAV+
Sbjct: 60 AHSSGFFVGRTKDVISASKNSNGNGDKVYIQDLSLVDVPVGD--VRILSLSADDSILAVT 117
Query: 125 LYQSPHIHFFSVHSLLNKEIKPSFSCSLNGSTYVKDIRWRKKPENSFLALSNVGRLYRGG 184
+ + IHFFSV SLL K+ KPSFS S + S +VKD RW +K ++S+L LSN G+L+ G
Sbjct: 118 V--AADIHFFSVDSLLKKDAKPSFSYSPDESGFVKDFRWIRKDKHSYLVLSNTGKLFHGI 175
Query: 185 VNGPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSFKSWVGDCDVNC 244
N P VMD VDAVEWS KG ++AVA+ N L IL+SK E+ I+LSF W+GD D +C
Sbjct: 176 DNAPPRHVMDAVDAVEWSSKGSYIAVAQDNSLRILSSKFNEKRCIALSFDDWIGDSDEDC 235
Query: 245 SVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVVLSFSDVFS 304
SVKVDSIRWVR +CI++GCFQL DG+EENYLVQVI+S DGKI+D SS V LSFSD+F
Sbjct: 236 SVKVDSIRWVRNNCILLGCFQLI-DGREENYLVQVIRSPDGKISDGSSNLVALSFSDLFP 294
Query: 305 DVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRD 364
+DDI+P G GP+L +Y+++ LA++ANRK+ID+H+VLL WS D K ++VVDI+R+
Sbjct: 295 CSMDDIVPVGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSPGDNKTAVSVVDIDRE 354
Query: 365 KWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEE-KELSPYFILMCLTLDGKLNM 423
++PRI LQ+NG+DN IMGLCID+VS+ G V V+ G +E KEL PYF+L+CLTL+GKL M
Sbjct: 355 TFLPRIGLQENGDDNTIMGLCIDRVSVAGTVNVRSGSDELKELPPYFVLVCLTLEGKLVM 414
Query: 424 FHVASVSGPSCSPDVDFTLSDAEDDVPAVVP-VDSNLPRVYSGSGLQKIE-PVASSFKSQ 481
F+VASV+GP S D D S +D A P ++ +L + S + + V+ + +
Sbjct: 415 FNVASVAGPPASSDADLASSSDIED--AYTPLIEDDLSKQSSEEPEEHQQLNVSVQNEQK 472
Query: 482 GVKLKELDTDDTCGVTAKSNL--KGFDKYESSTSISISNSQE------LENKDRQQIQNS 533
+ ++ T+ + N+ K F+ +SS S QE L+ +D QQ
Sbjct: 473 HLNAEKFSTEQSF---PNENIFSKEFESVKSSVSEDNKKKQEPYAEKPLQVEDGQQSMIP 529
Query: 534 LQKSTNLVQSPPKASLPEVTSFGVRDSSKTGTQDTGGFGLGSTGFVGKFPTDTPSLSSHK 593
Q T+ QSP S G T GFG + K D + S
Sbjct: 530 RQFGTSFGQSP--------VSLGYD------TNKFSGFG-PALSVSDKLQKDVSAQSKSM 574
Query: 594 DLLKSLEFGKEAQGNFGSAGLQS--------LSSQSQSCGNFISSEDSRVKLPVLPSSHS 645
L ++E K FGS GLQ+ SSQ S G +S D V P +
Sbjct: 575 HLQANVE-SKSTPALFGSPGLQNSIFQSPLNTSSQPWSSGKGVSPPD-FVPGPFPSVKDT 632
Query: 646 HEKTYENSSLGAPNVSGSFVGKPLSSKDA------TGSLTPVFSAKPVHGDGDRASTGAG 699
K S G +V P+S+K+ TG + + + G + G
Sbjct: 633 QHKQSVQSGTG-------YVNPPMSTKETPVQVVETGRASALSNLSSPLGQNWDTNEGVE 685
Query: 700 KIESLPSVRSSQFS--LPQNFASGKSHNQKLYPSKDDYKTATLSGLPNSEPNLSKQSGNI 757
KIE +PS+R+SQ S + +F SH Q +KT +G E N+SKQ NI
Sbjct: 686 KIEPIPSIRASQLSQQVKSSFEKSASHQQ--------HKTPLSAGPLRLEHNMSKQPSNI 737
Query: 758 KEMTKELDMLLQSIEETGGFRDACTVFQRQKVEELEEGIGSLSEKCGMWRSIMDERLQEI 817
EM +E+D LLQSIE GGF+D+C V + VEELE+G+ SL+ +C W+S + E+ EI
Sbjct: 738 NEMAREMDTLLQSIEGPGGFKDSCEVLLKSHVEELEQGLESLAGRCQTWKSTIHEQQAEI 797
Query: 818 QNLFDKTVQVLARKIYTEGIVKQASDSRYWDLWNRQKLSPELELKRGHILSINQDLINQL 877
Q+L DKT+QVLA+K Y EG+ KQ +D++YW LWNRQKL+PELE KR HI+ +N+DL +QL
Sbjct: 798 QHLLDKTIQVLAKKTYMEGMYKQTADNQYWQLWNRQKLNPELEAKRQHIMKLNKDLTHQL 857
Query: 878 IELERHFNSLELNKFGENDGVHGCQRRP-QSRFGTSRHIQSLHSLHTTMSSQIAAAEQLS 936
IELER+FN LEL+++ E DG H RR SR SR +QSLHSLH TMSSQ+AA+EQLS
Sbjct: 858 IELERYFNRLELDRYHE-DGGHPVARRGVPSRSAPSRRVQSLHSLHNTMSSQLAASEQLS 916
Query: 937 DRLSKQMALLSIESPVKQQNVKKELFETLGIPYDASFSSPDVTKVMDQSSIKKLIHSSGS 996
+ LSKQM L I+SPVK +NVK+ELFET+GIPYDASFSSPD K + SS K L+ SS
Sbjct: 917 ECLSKQMTYLKIDSPVK-KNVKQELFETIGIPYDASFSSPDAVKAKNASSAKNLLLSSIP 975
Query: 997 AAANDQSRRRQSSAMKSYDPETARRRRDSLDRSWASFEPPKTTVKRMLLQDHQKCSQVKS 1056
A+ N QSR+RQSSAMK+ DPETARRRR+SLDR+WA+FEPPKTTVKRMLLQ+ QK +
Sbjct: 976 ASINAQSRQRQSSAMKNSDPETARRRRESLDRNWAAFEPPKTTVKRMLLQEQQKTGMNQQ 1035
Query: 1057 SLKDKQQISPHMLEGAAIVRPKDRTTPSTSWNPLRIKGLQDASLKQTSESQSTLFKWAGD 1116
+ ++ S + + +++R K+ +P S N KG+ ++ + TSE+QSTLFK
Sbjct: 1036 IVLSERLRSVNNTQERSLLRLKNHASPVVSSN----KGIMESFQQDTSEAQSTLFK---- 1087
Query: 1117 PTGPSQMTGLKSPVFQSNIASTRSSLSATQLSPMGWQNHARNTGDVTAEKLSSGVYFDVK 1176
+ P+ QSN T S +SA++ S W + K SS + +
Sbjct: 1088 ---------TRPPMPQSNSPFTISPISASKPS-FNW----------SGNKSSSTTSYAEE 1127
Query: 1177 SNSTLTNETKSTMQSESNIF--QKPTISTMSPTLTPSLLKNPSEMPISNGKGTVPESSTI 1234
S + +T++ Q + F ++P ST+ K E S K +
Sbjct: 1128 SAPSQNKDTRTVSQPGGSNFLLKRPVASTVFEQTE----KKAGEFKFSEAKANAFVETAA 1183
Query: 1235 GSEKHGAATTKTLFVESGKNRDAQVSTPAAVPTSSAFP 1272
GS +H + T+ E K AQ+S ++ +S+FP
Sbjct: 1184 GSVQHLSTTSSGSDYEFSKGFGAQLSPMSSGGPASSFP 1221
>gi|449458781|ref|XP_004147125.1| PREDICTED: uncharacterized protein LOC101219502 [Cucumis sativus]
Length = 2161
Score = 805 bits (2078), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 559/1316 (42%), Positives = 759/1316 (57%), Gaps = 124/1316 (9%)
Query: 7 IEIEEEIEGGRIESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAH 66
I +E+ EG +I ND +F +IG+ +P+ + DS FD ++PPS PLA+S+ S LIF AH
Sbjct: 512 IPLEDAGEGEQIVRNDLYFQKIGKPVPV-KLGDSIFDPESPPSQPLALSE-SSGLIFVAH 569
Query: 67 SSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLY 126
SGF V R DVI +A+E+K GT VQ+LS+VDV V IL++S D S LA +
Sbjct: 570 LSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGK--VHILAVSTDNSVLAAVV- 626
Query: 127 QSPHIHFFSVHSLLNKEIKPSFSCSLNGSTYVKDIRWRKKPENSFLALSNVGRLYRGGVN 186
+ +H FSV SLL+K KP SCS+ S+++KD +W +K EN++L LS G+LY+G N
Sbjct: 627 -AGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLYQGSAN 685
Query: 187 GPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSFKSWVGDCDVNCSV 246
GPL VM ++DAVE SVKGKF+AVAKK+ L+I + K KERL +SL G+ D + +V
Sbjct: 686 GPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETDTDFTV 745
Query: 247 KVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVVLSFSDVFSDV 306
KVD I+WVR DCIIIGCFQ+T G EE+YLVQVI+SKDGKITD SS V+LSF D+ S
Sbjct: 746 KVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCDIHSGF 805
Query: 307 IDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRDKW 366
DILP +GP L L+Y++ LAI ANR ++DH+ LL L+ E N++AVV+I+R+
Sbjct: 806 TRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVE-NEVAVVNIDRNTS 864
Query: 367 IPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEE-KELSPYFILMCLTLDGKLNMFH 425
+P+I+LQ NG+DNL+MGLCID+VSL GKV V++G E+ +E+SPY IL+CLTL+G+L MF
Sbjct: 865 LPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGELIMFQ 924
Query: 426 VASVSGPSCSPDVDFTLSDAEDDVPAVVPVDSNLPRVYSGSGLQKIEPVASSFKSQGVKL 485
+SV+ + D EDD+ VP D +S+ +
Sbjct: 925 FSSVNETEAPHETVSACDDEEDDI--TVPTDD---------------------RSESKES 961
Query: 486 KELDTDDTCGVTAKSNLKG---FDKYESSTSISIS-NSQEL-ENKDRQQI-----QNSLQ 535
+E + D VT K + +K ++S I S N Q L N D I Q
Sbjct: 962 REANIDHRMQVTEKIAISSEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQ 1021
Query: 536 KSTNLVQSPP-KASLPE-VTSFGVRDSSKTGTQDTGGFGLGSTGFVGKFPTDTPSLSSHK 593
K + + S K+S PE + + + K + TG LGS GK D PS
Sbjct: 1022 KVDSFIYSQSLKSSAPERPPHYEIGNFDKPVLKFTG---LGSASISGK-SEDVPS----- 1072
Query: 594 DLLKSLEFGKEAQGNFGSAGLQSLSSQSQSCGNFISSEDSRVKLPVLPSSHSHEKT-YEN 652
+ KE+ GS GL + S S D +P + +S+S + + EN
Sbjct: 1073 ---QPFPNVKESTKRLGSTGLMAASELSSEKAMSFKKID---PVPSVFTSNSLQSSNTEN 1126
Query: 653 --SSLGAPNVSGSFVGKPLSSKDATGSLTPVFSAKPVHGDGDRASTGAGKIESLPSVRSS 710
S G N F GKP KD +LT G +A+ GAGKIESLP +RSS
Sbjct: 1127 YGPSFGTANAFTGFAGKPFQPKDVPSTLT---------QSGRQATGGAGKIESLPVIRSS 1177
Query: 711 QFSLPQNFASGKSHNQKLYPSKDDYKTATLSGLPNSEPNLSKQSGNIKEMTKELDMLLQS 770
Q SL F+SGK N+K S+ Y S P ++P +KEM + LD LL+S
Sbjct: 1178 QISLQDKFSSGKISNEKHDGSERYY-----SNSPLAKP--------MKEMCEGLDTLLES 1224
Query: 771 IEETGGFRDACTVFQRQKVEELEEGIGSLSEKCGMWRSIMDERLQEIQNLFDKTVQVLAR 830
IEE+GGF DACT FQ+ VE LE G+ SLS+ C +WRS M+ER QE+QNLFDK VQVL++
Sbjct: 1225 IEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQVLSK 1284
Query: 831 KIYTEGIVKQASDSRYWDLWNRQKLSPELELKRGHILSINQDLINQLIELERHFNSLELN 890
K Y EGIV Q+SDS+YW+ W+RQKLS ELELKR HIL +NQ++ NQLIELERHFN LELN
Sbjct: 1285 KTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNGLELN 1344
Query: 891 KFGENDGVHGCQRRPQSRFGTSRHIQSLHSLHTTMSSQIAAAEQLSDRLSKQMALLSIES 950
KFG N+ +R Q +FG+SRH S+HSL+ M SQ+A A+ LS+ LSKQ+A L++ES
Sbjct: 1345 KFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSKQLAALNMES 1404
Query: 951 P-VKQQNVKKELFETLGIPYDASFSSPDVTKVMDQSSIKKLIHSSGSAAANDQSRRRQSS 1009
P +K+Q+ KELFE++G+ YDASFSSP+V K+ + SS KKL+ SS S ++ SRR+Q S
Sbjct: 1405 PSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSS-KKLLLSSDSFSSKGTSRRKQQS 1463
Query: 1010 AMKSYDPETARRRRDSLDRSWASFEPPKTTVKRMLLQDHQKCSQVKSSLKDKQQISPHML 1069
K+ + ET RRRRDSLDR+ AS +PPKTTVKRMLLQ + SS +++Q
Sbjct: 1464 GTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQG------IPSS--EEKQFCSRTP 1515
Query: 1070 EGAAIVRPKDRTTPSTSWNPLRIKGLQDASLKQTSESQSTLFKWAGDPTGPSQMTGLKSP 1129
EGAA V P++ ++A SE+ T F W P PS + KS
Sbjct: 1516 EGAATV-----ARPASRITSSISSSSKNAG--HDSENPETPFMW-NSPLQPSNTSRQKSL 1567
Query: 1130 VFQSNIASTRSSLSATQLSPMGWQNHARNTGDVTAEKLSSGVYFDVKSNST---LTNETK 1186
Q + S Q S + VT+E + V KS ++ +
Sbjct: 1568 PLQKINVTPPSPPPVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSD 1627
Query: 1187 STMQSESNIFQKPTISTMSPTLTPSLLKNPSEMPISNGKGTVPESSTIGSEKHGAATTKT 1246
S +S N+ QK +I T+S P+ + + + + N K + E+H TT +
Sbjct: 1628 SVQKSNINVDQKSSIFTISSKQMPTPIDSIATSNVDNQK------TANVKERH---TTTS 1678
Query: 1247 LFVESGKNRDAQVSTPAAVPTSSAFPGIVPIFDAANKSQPGGKSFTPSTFSLSLSA 1302
F S ++ P + P +VP D + K++ KS T T S S+SA
Sbjct: 1679 PFFGSANKPES--------PFVGSMPSLVPTVDGSRKTEE-KKSVT--TISQSVSA 1723
>gi|356569182|ref|XP_003552784.1| PREDICTED: uncharacterized protein LOC100794048 [Glycine max]
Length = 1825
Score = 666 bits (1718), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/1073 (41%), Positives = 626/1073 (58%), Gaps = 76/1073 (7%)
Query: 239 DCDVN-CSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVVL 297
+ DVN S +DS++ VR D I+IGC QLTEDGKEENY++QVI+S+ G+I D SE VV
Sbjct: 312 NSDVNTVSTAIDSVKCVRPDSIVIGCVQLTEDGKEENYVIQVIRSQLGEINDGCSELVVQ 371
Query: 298 SFSDVFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIA 357
SF D++ +IDDI+P G+GPYL L Y+++ LAI+AN KN D H++LL WS D++K++
Sbjct: 372 SFCDIYQGLIDDIVPFGSGPYLLLAYLKQCQLAINANMKNTDQHIMLLGWSADEDKSEAV 431
Query: 358 VVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEK-ELSPYFILMCLT 416
V+DI R+ PRI+LQ+NG+DNL++GLCID VS+ KV VQ+ VEE+ EL P+ +L+CLT
Sbjct: 432 VIDIERENCAPRIELQENGDDNLLLGLCIDNVSIYQKVGVQIDVEERTELLPHCVLICLT 491
Query: 417 LDGKLNMFHVASVSGPSCSPDVDFTLSDAEDDVPAVVPVD--SNLPRVYSGSGLQKIEPV 474
L+GKL MFHVAS++G SP+VD L + E D +P D LP+
Sbjct: 492 LEGKLVMFHVASLAGCKTSPEVDSVLHNDE-DTSVNLPEDEGCTLPQ------------- 537
Query: 475 ASSFKSQGVKLKELDTDDTCGVTAKSN-LKGFDKYESSTSISISNSQELENKDRQQIQNS 533
+L++ ++D K+N ++ F+ + T+ N Q++ D +
Sbjct: 538 ---------RLQKQESDKLMTFPIKTNYVQTFEVSGNLTAKPSGNPQQITRTDTNYPEVE 588
Query: 534 LQKSTNLVQSPPKASLPEVTSFGVRDSSKTGTQDTGGFGLGSTGFVGKFPTDTPSLSSHK 593
L ++ ++S + +P+V +F TG Q+ G K T S+
Sbjct: 589 LVGNSESLKSNVQQVVPDVDAF-----QNTGNQNPFLPGEQQKNLGQKTATLGTSIGP-- 641
Query: 594 DLLKSLEFGKEAQGNFGSAGLQSLSSQSQSCGNFISSEDS-RVKLP-------------- 638
L+ + ++ + LQS +++++ SS DS R LP
Sbjct: 642 -LMVNSHSVSSGLSSYNN--LQS-TTKTRELWTANSSRDSQRASLPGETFSFPKKYDVSS 697
Query: 639 VLPSSHSHEKTYENS--SLGAPNVSGSFVGKPLSSKDATGSLTPVFSA-KPVHGDGDRAS 695
+ SS++ ++N ++GA NV GS GKP+ +D + SA + V G ++
Sbjct: 698 ISASSYADGVGFQNKKYTMGATNVPGSMGGKPVLVQDVNDVSPAIDSASRLVQSGGQLST 757
Query: 696 TGAGKIESLPSVRSSQFSLPQNFASGKSHNQKLYPSKDDYKTATLSGLPNSEPNLSKQSG 755
GAG ++ + + SS FS N A+ KS +K PS + + T + G+ +S+ LSKQ G
Sbjct: 758 LGAGNMQPILN-SSSHFSSDGNTAAIKSSARKFLPSNEQHGTPSKLGIFSSD--LSKQFG 814
Query: 756 NIKEMTKELDMLLQSIEETGGFRDACTVFQRQKVEELEEGIGSLSEKCGMWRSIMDERLQ 815
NI EMTKELD+LL+SIEE GGFRDACT R +E +E+G+ +LS+KC + +DE +
Sbjct: 815 NINEMTKELDLLLKSIEEAGGFRDACTRSLRSSIEAVEQGMDALSKKCKILTCQVDEHHE 874
Query: 816 EIQNLFDKTVQVLARKIYTEGIVKQASDSRYWDLWNRQKLSPELELKRGHILSINQDLIN 875
E+ L +KT++ +ARKIY EGI KQASDSRYWDLWNRQKL+ ELELKR HILS+NQDL
Sbjct: 875 EVHYLLNKTIRAMARKIYMEGIYKQASDSRYWDLWNRQKLNSELELKRQHILSLNQDLTY 934
Query: 876 QLIELERHFNSLELNKFGENDGVHGCQRRPQSRFGTSRHIQSLHSLHTTMSSQIAAAEQL 935
QLIELERHFN+LELNKF + G Q+R+G SRH SLHSLH ++SSQ+ AAE L
Sbjct: 935 QLIELERHFNALELNKFSQYGGRCLGHGPSQNRYGPSRHTLSLHSLHNSVSSQLVAAENL 994
Query: 936 SDRLSKQMALLSIESPVKQQNVKKELFETLGIPYDASFSSPDVTKVMDQSSIKKLIHSSG 995
S+ LSKQM LS++S +++ KELFET+GIPY+A+F SPD+ M KK + S
Sbjct: 995 SECLSKQMEALSLKSQTEERKNVKELFETIGIPYEAAFGSPDMKGFMKTPPSKKTLFSDL 1054
Query: 996 SAAANDQSRRRQSSAMKSYDPETARRRRDSLDRSWASFEPPKTTVKRMLLQDHQKCSQVK 1055
+ A DQSRR Q+SAMK ++PETARRRRDSLD+SW FEPPKT VKRMLLQ+ QK ++ +
Sbjct: 1055 T-ANKDQSRRNQASAMKCFEPETARRRRDSLDQSWTCFEPPKTIVKRMLLQELQKVNRNE 1113
Query: 1056 SSLKDKQQISPHMLEGAAIVRPKDRTTPSTSWNPLRIKG-LQDASLKQTSESQ-STLFKW 1113
S + LE ++ D PS + IK + D+ L+ S+ S F
Sbjct: 1114 SLFSMNKDKKVSTLEESSPCH-IDARIPSIVFPASNIKASILDSHLELEEVSEHSKAFMP 1172
Query: 1114 AGDPTGPSQMTGLKSPVFQSNIASTRSSLSATQLSPMGWQNHARNTGDVTAEKLSSGVYF 1173
A P+ ++ KS V Q N T S A +LSP + H+ T D+ AEK S+ F
Sbjct: 1173 ADSLRAPTHVSESKSSVLQKNNILTIPSQPAFRLSPTMVRGHSTETKDLAAEK-STVQKF 1231
Query: 1174 DVKSNSTLTNETKSTMQSESNIFQKPTISTMSPTLTPSLLKNPSEMPISNGKGTVPESST 1233
D+ SNS E K T+ + + Q P+I T S T TPS+ SEMP N K T+ SST
Sbjct: 1232 DLISNS----ENKPTLLWK--MPQNPSIPTYSTTETPSMKIKSSEMPFPNSKMTMATSST 1285
Query: 1234 IGSEKHGAATTKTLFVESGKNRDAQVSTPAAVPTSSAFPGIVPIFDAANKSQP 1286
G + + T ++ GK + S + + S F G V F +KS P
Sbjct: 1286 TGDKLSSSFTPESW----GKGFPSSGSHLSTISAPSTFLGKVTEFH-VDKSLP 1333
Score = 254 bits (650), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 132/248 (53%), Positives = 182/248 (73%), Gaps = 7/248 (2%)
Query: 1 MATPTRIEIEEEIEGGRIESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQ 60
M+ +IE EE+EG I ++D+FF ++GE++P+ + DS FD + PS PLA+S+ +
Sbjct: 1 MSPSKKIE-PEEVEGEMISTSDYFFVKVGEALPL-KSSDSVFDAETLPSQPLALSE-RFR 57
Query: 61 LIFAAHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDEST 120
L F AHSSGF VA+T D+ID+AKE+K+ G+ V++LS+VDVP V L+LS D ST
Sbjct: 58 LTFVAHSSGFFVAKTKDLIDSAKELKDKGSGSPVEQLSLVDVPVGR--VRALALSTDNST 115
Query: 121 LAVSLYQSPHIHFFSVHSLLNKEIKPSFSCSLNGSTYVKDIRWRKKPENSFLALSNVGRL 180
LA S+ S I F+SV S LNKE+K SFSCSLN ST+VKD+RW P++ ++ LSN+G+L
Sbjct: 116 LAASV--SGDIRFYSVESFLNKEVKQSFSCSLNDSTFVKDMRWITTPKHCYVVLSNIGKL 173
Query: 181 YRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSFKSWVGDC 240
Y G + PL+ VMDNVDAV+W +KG FVAVA K+VLSIL++K +ER+ ISLSF+SW+GD
Sbjct: 174 YYGDIGFPLKHVMDNVDAVDWGLKGSFVAVASKSVLSILSTKFEERVSISLSFRSWIGDF 233
Query: 241 DVNCSVKV 248
+ S+KV
Sbjct: 234 SADGSIKV 241
>gi|414888159|tpg|DAA64173.1| TPA: hypothetical protein ZEAMMB73_624267 [Zea mays]
Length = 1735
Score = 597 bits (1540), Expect = e-167, Method: Compositional matrix adjust.
Identities = 458/1279 (35%), Positives = 679/1279 (53%), Gaps = 141/1279 (11%)
Query: 1 MATPTRIEIEEEIEGGRIESNDFFFDEIGESIPIIRDDDSQ---FDLQNPPSLPLAVSQL 57
M P +E+ +E+EG + DF F G+ IP++ D + FDL +PP+ PLAVS
Sbjct: 1 MGAPRELELSDEVEGEEDGTTDFVFRLAGDPIPLLPTDSNPLPLFDLLSPPARPLAVSD- 59
Query: 58 SHQLIFAAHSSGFCVARTNDVIDAAKEMKENG---TRCCVQELSVVDVPFENENVDILSL 114
H +F AH SGF RT ++I+A+KE +E G TRC Q+ V DVP V +++L
Sbjct: 60 RHATVFLAHPSGFIAVRTKELIEASKEAREKGKSSTRCA-QDCCVADVPLPG--VSLIAL 116
Query: 115 SNDESTLAVSLYQSPHIHFFSVHSLL-NKEIKPSFSCSLNGSTYVKDIRWRKKPENSFLA 173
S+DES LA + I FFS+ SLL +K++ PS SCSL + VKD +W +++
Sbjct: 117 SHDESVLAACTHT--EIQFFSLASLLTHKDVVPSSSCSLERAVTVKDFKWLNHASAAYIV 174
Query: 174 LSNVGRLYRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSF 233
LSN G L G + L+DVM+NVDAV+ +G +AVA+ N L+IL+ E +SL F
Sbjct: 175 LSNGGLLCHGNLGEGLKDVMENVDAVDCCKEGNHIAVARGNKLTILSPDFTETCSMSLLF 234
Query: 234 KSWVGDCDV-NCSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASS 292
+ W D ++KVDSI WV D I+IGC +L ED EE YLVQVI++++ ++
Sbjct: 235 QLWSDGSDSKGTTIKVDSIGWVHGDSIVIGCVRLNEDDNEEGYLVQVIRTEESTFCESPG 294
Query: 293 EPVVLSFSDVFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRW-SLDD 351
+ V +++D F ++DD+LP G GP L L Y+ RWGL +++N+K++D+H+ LL+W S D
Sbjct: 295 KTDVYTYNDFFHGIMDDVLPPGAGPNLLLGYLHRWGLMVASNKKSVDEHIALLKWPSAHD 354
Query: 352 EKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSPYFI 411
++ + +++ DK+ PRI LQ+NG+DN+I+G ++ VSL K+ V +G E+KE++P I
Sbjct: 355 DEKTVTYLEMLEDKYSPRIDLQENGDDNVILGFGVENVSLFQKITVPVGPEQKEVAPQHI 414
Query: 412 LMCLTLDGKLNMFHVASVSGPSCSPDVDFTLSDAEDDVPAVVPVDSNLPRVYSGSGLQKI 471
L+CLT +GK+ ++++A +S PS P T+S ED G + I
Sbjct: 415 LLCLTGEGKVAVYYLARISDPSDLPHA--TVSTNED------------------YGEKHI 454
Query: 472 EPVASSFKSQGVKLKELDTDDTCGVTAKSNLKGFDKYESSTSISISNSQELEN------- 524
P V KEL T G KS L E S++ + SN QE N
Sbjct: 455 SPAT-------VSEKEL-TPSVAGSVYKSTLTEHGA-EPSSAQTGSNQQESINVKNSSSA 505
Query: 525 -KDRQQIQNSLQKSTNL---------VQSPPKASLPEVTSFGVRDSSKTGTQDTGGFGLG 574
K+++ NSL S+N V PP A +R + TG F
Sbjct: 506 SKEQETTDNSLFISSNKKPLDTKQVNVTGPPAA---------LRSLALTGNTKPTIFSSF 556
Query: 575 STGFVGKFPTDTPSLSSHKDLLKSLEFGKEAQGN--FGSAGLQSLSSQSQSCGNFISSED 632
ST G PT ++ +L SL+ + GN G GL S+ S G+ S++D
Sbjct: 557 ST--EGMNPTGG---NAPNELASSLQPSSRSFGNNQTGKEGLDSVQSVGIFGGSQNSNKD 611
Query: 633 SRV---KLPVLPSSHS-HEKTYENS--SLGAPNVSGSFV------GKPLSSKDATGSLTP 680
V K + SS S K E + G P++ S+ G P++ S
Sbjct: 612 GNVYGFKSSLFTSSGSVPAKIGERNEVGFGVPSLQTSYTADRKVFGAPVALSSVPSSSIS 671
Query: 681 VFSAKPVHGDGDRASTG--AGKIESLPSVRSSQFSLPQNFASGKSHNQKL-YPSKDDYKT 737
P+ +S+G E+ S+R S P + GKSHN + P++ +
Sbjct: 672 PAKPSPI----GYSSSGFRTANSEAPQSLRGSP---PSQQSMGKSHNSRTQAPAEYSRNS 724
Query: 738 ATLSGLPNSEPNLSKQSGNIKEMTKELDMLLQSIEETGGFRDACTVFQRQKVEELEEGIG 797
+G +SE +LSK+ +I EMTKELD LL IE+ GGFRDAC Q++ + E+G+
Sbjct: 725 KKGTGF-DSEQDLSKKFYSINEMTKELDALLAYIEKDGGFRDACITIQQRPLSMFEDGLQ 783
Query: 798 SLSEKCGMWRSIMDERLQEIQNLFDKTVQVLARKIYTEGIVKQASDSRYWDLWNRQKLSP 857
+ + +++ ++E+ +I++L + QV AR+ Y +GIV Q+SD++YWD+WNRQKLSP
Sbjct: 784 NFLQLLQVFKGKVEEQCSKIEDLRNNMFQVSARQTYMKGIVSQSSDNQYWDIWNRQKLSP 843
Query: 858 ELELKRGHILSINQDLINQLIELERHFNSLELNKFGENDGVHGCQRRPQSRFGTSRHIQS 917
E E+KR +IL NQDL NQL+ELERHFN+LE+N+ E V RR TS
Sbjct: 844 EFEVKRQNILRANQDLTNQLVELERHFNNLEMNRSSETGRV-ASNRRAVYSNKTSSSQTQ 902
Query: 918 LHSLHTTMSSQIAAAEQLSDRLSKQMALLSIESPVKQQN-VKKELFETLGIPYDASFSSP 976
L S+++ ++SQ+AAAEQLS+ LSKQ+++L+I SP ++Q V K+LFE++G+ +
Sbjct: 903 LSSVYSALNSQLAAAEQLSECLSKQISVLNIGSPSRKQGMVTKKLFESIGLAHTT----- 957
Query: 977 DVTKVMDQS--SIKKLIHSSGSAAANDQSRRRQSSAMKSYDPETARRRRDSLDRSWASFE 1034
D TK + +K+ + N+ S+ S+ K +PETARRRR+SLD S AS E
Sbjct: 958 DATKFSGSTPKPVKRF------PSVNEHSKGILGSS-KGGEPETARRRRESLDMSLASLE 1010
Query: 1035 PPKTTVKRMLLQDHQKCSQVKSSLKDKQQISPHMLEGAAIVRPKDRTTPSTSW---NPLR 1091
P KTTVKR+ Q K S +K+ M AAI + K + ++S R
Sbjct: 1011 PQKTTVKRITQQQRLKISSDLPFRSNKKIFDSQM---AAISQEKSSDSSNSSIVESYVSR 1067
Query: 1092 IKGL-QDASLKQTSESQSTLFKWAGDPTGPSQMTGLKSPVFQSNIASTRSSLSATQLSP- 1149
++ L +D + K + Q+ LFKW + TG SQ + K + ST S T SP
Sbjct: 1068 LRSLSEDVNDKLSGPQQNPLFKWVKESTGSSQTSEQKHFELPGQMKSTAQSSKLTPSSPP 1127
Query: 1150 ------MGWQNHARNTGDVTAEKLSSGVYFDVKSNSTLTNET----KSTMQSESNIFQKP 1199
MG Q+ + N V S G V +TLT +T KS +E NI P
Sbjct: 1128 SFSYTRMGAQD-SINPSSVP----SFGTKHTVSKPNTLTFKTTISPKSNANTEPNIL--P 1180
Query: 1200 TISTMSPTLTPSLLKNPSE 1218
+I+ +L+P +K E
Sbjct: 1181 SITAPKTSLSPLSVKTAGE 1199
>gi|356514685|ref|XP_003526034.1| PREDICTED: uncharacterized protein LOC100806517 [Glycine max]
Length = 1744
Score = 500 bits (1287), Expect = e-138, Method: Compositional matrix adjust.
Identities = 247/448 (55%), Positives = 338/448 (75%), Gaps = 8/448 (1%)
Query: 1 MATPTRIEIEEEIEGGRIESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQ 60
M+ +IE EE+EG I + D+FF ++GE++P+ + DS FD + PS PLA+S+ +
Sbjct: 1 MSPSKKIE-PEEVEGEIIGTTDYFFVKVGEALPL-KSSDSVFDAETLPSQPLALSE-RFR 57
Query: 61 LIFAAHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDEST 120
L F AHSSGF VA+T D+ID+AKE+K+ G+ V++LS+VDVP V L+LS D ST
Sbjct: 58 LTFVAHSSGFFVAKTKDLIDSAKELKDKGSGSPVEQLSLVDVPVGR--VRALALSTDNST 115
Query: 121 LAVSLYQSPHIHFFSVHSLLNKEIKPSFSCSLNGSTYVKDIRWRKKPENSFLALSNVGRL 180
LA S+ S I F+SV S LNKE+K SFSCSLN ST+VKD+RW P++S++ LSN+G+L
Sbjct: 116 LAASV--SGDIRFYSVESFLNKEVKQSFSCSLNDSTFVKDMRWITTPKHSYVVLSNIGKL 173
Query: 181 YRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSFKSWVGDC 240
Y G + PL+ VMDNVDAV+W +KG FVAVA K+VLSIL++K +ER+ ISLSF+SW+GD
Sbjct: 174 YYGDIGFPLKHVMDNVDAVDWGLKGSFVAVASKSVLSILSTKFEERVSISLSFRSWIGDF 233
Query: 241 DVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVVLSFS 300
+ S+KVDS++ VR D I+IGC QLTEDGKEENYL+QVI+S+ G+I D SE VV SF
Sbjct: 234 AADGSIKVDSVKCVRPDSIVIGCVQLTEDGKEENYLIQVIRSQLGEINDDCSELVVQSFC 293
Query: 301 DVFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIAVVD 360
D++ +IDDI+P G+GPYL L Y+++ LAI+AN KN D H++LL WS D++K++ V+D
Sbjct: 294 DIYQGLIDDIVPFGSGPYLLLAYLKQCQLAINANMKNTDQHIMLLGWSADEDKSEAVVID 353
Query: 361 INRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEK-ELSPYFILMCLTLDG 419
I R+ +PRI+LQ+NG+DNL++GLCID VS+ KV VQ+ VEE+ EL P+ +L+CLTL+G
Sbjct: 354 IERENCVPRIELQENGDDNLLLGLCIDNVSIYQKVGVQIDVEERTELLPHCVLICLTLEG 413
Query: 420 KLNMFHVASVSGPSCSPDVDFTLSDAED 447
KL MFHVAS++G SP++D L + ED
Sbjct: 414 KLVMFHVASLAGCKTSPEIDSVLHNDED 441
>gi|224052851|ref|XP_002297612.1| predicted protein [Populus trichocarpa]
gi|222844870|gb|EEE82417.1| predicted protein [Populus trichocarpa]
Length = 635
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 248/538 (46%), Positives = 313/538 (58%), Gaps = 67/538 (12%)
Query: 757 IKEMTKELDMLLQSIEETGGFRDACTVFQRQKVEELEEGIGSLSEKCGMWRSIMDERLQE 816
IK+M +ELDMLL+ I+E GGFRDACTVF R VE LEEG+G+LSE C M +++MDERL E
Sbjct: 66 IKDMAEELDMLLECIDEPGGFRDACTVFLRSSVEALEEGMGTLSENCRMLKNMMDERLGE 125
Query: 817 IQNLFDKTVQVLARKIYTEGIVKQASDSRYWDLWNRQKLSPELELKRGHILSINQDLINQ 876
IQ++ KTVQVLAR+I+ +GIVKQASD + W+LWNRQKLS ELELKR IL +NQ+L NQ
Sbjct: 126 IQHVLYKTVQVLARRIHLDGIVKQASDRQSWELWNRQKLSSELELKRRCILKLNQELTNQ 185
Query: 877 LIELERHFNSLELNKFGENDGVHGCQRRPQSRFGTSRHIQSLHSLHTTMSSQIAAAEQLS 936
LI++ERHFN+LEL FG NDG H + QSR Q+ ++ QL
Sbjct: 186 LIQVERHFNTLELRSFGGNDGSHSDRIALQSR-------------------QVPSSLQLL 226
Query: 937 DRLSKQMALLSIESPVKQQNVKKELFETLGIPYDASFSSPDVTKVMDQSSIKKLIHSSGS 996
+ K V Q+ VKKELFET+GIPYDASFSSPD T+V D SS+KKL+ S GS
Sbjct: 227 NNFLK----------VSQKEVKKELFETIGIPYDASFSSPDATEVSDTSSLKKLLLSPGS 276
Query: 997 AAANDQSRRRQSSAMKSYDPETARRRRDSLDRSWASFEPPKTTVKRMLLQDHQKCSQVKS 1056
A ASFEP TTVKR+LLQ+ QK S +S
Sbjct: 277 TATK------------------------------ASFEPTNTTVKRVLLQESQKKSVNRS 306
Query: 1057 S-LKDKQQISPHMLEGAAIVRPKDRTTPSTSWNPLRIKGLQDASLKQTSESQ-STLFKWA 1114
S LKD Q +S +++ +A V ++RT PS+ P KGLQ KQ E + + FKWA
Sbjct: 307 SLLKDSQHVSSSLVDRSA-VHQENRTFPSSFLQPSESKGLQYGFTKQAVEIKPPSAFKWA 365
Query: 1115 GDPTGPSQMTGLKSPVFQSNIASTRSSLSATQLSPMGWQNHARNTGDVTAEKLSSGVYFD 1174
P PSQ L+SP QSN A T S S+ L H R T ++ A+K S
Sbjct: 366 TGPLLPSQPLALRSPTLQSNNAMTLSVSSSLVLL--AGDKHTRETYNMNADKSKSMFSHI 423
Query: 1175 VKSNSTLTNETKSTMQSESNIFQKPTISTMSPTLTPSLLKNPSEMPISNGKGTVPESSTI 1234
+ S L NETKS Q+E+N+ +K +ST+SPT T K P E+ +S T S I
Sbjct: 424 EEPKSVLINETKSIQQNETNLNKKSAVSTVSPTQTSLFPKKPKEIFVST-YSTALAKSAI 482
Query: 1235 GSEKHGAATTKTLFVESG-KNRDAQVSTPAAVPTSSAFPGIVPIFDAA-NKSQPGGKS 1290
S KHG A TK F +S KN + S+ A P + A PG VP + A + SQP K+
Sbjct: 483 ESLKHGPANTKGSFFKSASKNHEPPCSSVGATPVAPALPGKVPHINVATSTSQPSEKA 540
>gi|242051332|ref|XP_002463410.1| hypothetical protein SORBIDRAFT_02g043336 [Sorghum bicolor]
gi|241926787|gb|EER99931.1| hypothetical protein SORBIDRAFT_02g043336 [Sorghum bicolor]
Length = 790
Score = 381 bits (979), Expect = e-102, Method: Compositional matrix adjust.
Identities = 275/813 (33%), Positives = 420/813 (51%), Gaps = 76/813 (9%)
Query: 22 DFFFDEIGESIPIIRDDDSQ---FDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
DF F G+ IP++ D + FDL +PP+ PLAVS H +F AH SGF V RT ++
Sbjct: 17 DFVFRLAGDPIPLLPTDSNPLPLFDLLSPPARPLAVSD-RHATVFLAHPSGFIVVRTKEL 75
Query: 79 IDAAKEMKENG---TRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPHIHFFS 135
I+A+KE +E G TRC Q+ V DVP V +++LS+DES LA I FFS
Sbjct: 76 IEASKEAREKGKASTRC-AQDCCVADVPLPG--VSLIALSHDESVLAAC--TDTEIQFFS 130
Query: 136 V-----HSLLNKEIKPSFSCSLNGSTYVKDIRWRKKPENSFLALSNVGRLYRGGVNGPLE 190
+ H ++ +++ PS SCSL + VKD +W +++ LSN G L G + L+
Sbjct: 131 LASLLTHKVITQDVVPSSSCSLGRAGTVKDFKWLNHASAAYIVLSNGGLLCHGSLGEGLK 190
Query: 191 DVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSFKSWVGDCDV-NCSVKVD 249
DVM+NVDAV+ +G +AVA+ N L+IL+ KE +SL F+ W + D ++KVD
Sbjct: 191 DVMENVDAVDCCKEGNHIAVARGNKLTILSPDFKETCSMSLLFQLWSDESDSEGTTIKVD 250
Query: 250 SIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVVLSFSDVFSDVIDD 309
SI WV D I+IG +L ED EE YLVQVI++++ ++ +P V ++ D F ++DD
Sbjct: 251 SIGWVHGDSIVIGSVRLNEDDNEEGYLVQVIRTEESTFCESPGKPDVYTYVDFFHGIMDD 310
Query: 310 ILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRW-SLDDEKNDIAVVDINRDKWIP 368
ILPSG GP L L Y+ RW L +++N+K+IDDH+ LL+W S D++ + +++ DK+ P
Sbjct: 311 ILPSGVGPNLLLGYLHRWDLMVASNKKSIDDHIALLKWPSAHDDEKTVTYLEMLEDKYSP 370
Query: 369 RIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSPYFILMCLTLDGKLNMFHVAS 428
RI LQ+NG+DN+I+G ++ VSL K+ V +G E++E++P IL+CLT +GKL ++++AS
Sbjct: 371 RIDLQENGDDNVILGFGVENVSLFQKITVTVGPEQREVAPQHILLCLTGEGKLAVYYLAS 430
Query: 429 VSGPSCSPDVDFTLSDAEDDV-PAVVPVDSNLPRVYSGSGLQKIEPVASSFKSQGVKLKE 487
V S + D+ E + PA VP +K+ P + S+ L E
Sbjct: 431 V-----STNEDY----GEKHISPAAVPG-------------KKLTPSVTGSVSKST-LTE 467
Query: 488 LDTDDTCGVTAKSNLKGFDKYESSTSISISNSQELENKDRQQIQNSLQKSTN-------- 539
+ + T K L +++E S+++ NS +K+++ NSL S+N
Sbjct: 468 HGAEASIAQTGKRMLASSNQHE---SMNVKNSSS-ASKEQETTGNSLLISSNKKPIVDTK 523
Query: 540 -LVQSPPKASLPEVTSFG-VRDSSKTGTQDTGGFGLGSTGFVGKFPTD-TPSLSSHKDLL 596
L + P A+LP + G + + + G+ TG G P PSL
Sbjct: 524 QLNVTAPPAALPSLELTGNTKPTISSSFSTVNNEGMNPTG--GNAPNGLAPSLQPSSRSF 581
Query: 597 KSLEFGKEAQGNFGSAGLQSLSSQSQSCGNFISSEDSRVKLPVLPSSHSHEKTYENSSLG 656
+ + GKE + S G+ S S G+ + S + E+ +
Sbjct: 582 GNSQTGKEGLDSAQSVGIFGGSQNSNKDGDVYGFKSSGFTSSGSVPAKIGERNEVGFGVP 641
Query: 657 APNVSGSF----VGKPLSSKDATGSLTPVFSAKPVHGDGDRASTGAGKIESLPSVRSSQF 712
+P S + G P++ + PV G G E+ S+R S
Sbjct: 642 SPQTSYTIDRKVFGAPVALSSVPSPSISPAKSAPV---GSSIGFRTGNSEAPQSLRGSP- 697
Query: 713 SLPQNFASGKSHNQKLYPSKDDYKTATLSGLPNSEPNLSK------QSGNIKEMTKELDM 766
P + GKS+N + D + + + +SE +LSK Q I EMTKELD
Sbjct: 698 --PSQQSMGKSYNSRTQTPADYSRNSKMGTRFDSEQDLSKKFYSVRQLNMINEMTKELDA 755
Query: 767 LLQSIEETGGFRDACTVFQRQKVEELEEGIGSL 799
LL IE+ GGFRDAC FQ++ + E+ +L
Sbjct: 756 LLAYIEKDGGFRDACITFQQRPLSMFEDSCKTL 788
>gi|222637707|gb|EEE67839.1| hypothetical protein OsJ_25626 [Oryza sativa Japonica Group]
Length = 1755
Score = 363 bits (932), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 194/463 (41%), Positives = 288/463 (62%), Gaps = 13/463 (2%)
Query: 1 MATPTRIEIEEEIEGGRIESNDFFFDEIGESIPIIRDDDSQ---FDLQNPPSLPLAVSQL 57
MA P +++ +E+EG + DF F G+ IP++ S FDLQ+PPS PLAVS
Sbjct: 1 MAPPRELDLSDEVEGEEDGTTDFVFRLAGDPIPLLPTTSSPLPLFDLQSPPSRPLAVSN- 59
Query: 58 SHQLIFAAHSSGFCVARTNDVIDAAKEMKENG--TRCCVQELSVVDVPFENENVDILSLS 115
+F AH +GF A T +I+A+KE +E G T C ++ V D+P V +L LS
Sbjct: 60 RRAAVFLAHPNGFMAATTKALIEASKEAREKGKSTTRCARDCCVADIPLPG--VSLLELS 117
Query: 116 NDESTLAVSLYQSPHIHFFSVHSLL-NKEIKPSFSCSLNGSTYVKDIRWRKKPENSFLAL 174
DES LA IHFFS SLL +K+++P SC+L GS+ VKD +W +F+ L
Sbjct: 118 RDESVLAACAGSV--IHFFSASSLLTDKDVEPLSSCTLEGSSTVKDFKWLNHASKAFIVL 175
Query: 175 SNVGRLYRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSFK 234
S G L +G + L+D+M+NVDAV+ +G + ++KKN L+IL+S KE + L F+
Sbjct: 176 SKDGLLSQGSLGEGLKDIMENVDAVDCCKEGTHIVLSKKNTLNILSSDFKEICCMPLLFQ 235
Query: 235 SWVGDCDVN-CSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSE 293
W D D + S+KVDSI WVR D I++G +L E+G EE YLVQVI+S + SS+
Sbjct: 236 LWSDDSDSDDASIKVDSIGWVRDDSIVVGSVRLNEEGNEEGYLVQVIRSGGNTFFENSSK 295
Query: 294 PVVLSFSDVFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRW-SLDDE 352
VV ++ D F ++DD+LPSG GP L L Y+ RW L +++N+K+ID+H+ LL+W S D+
Sbjct: 296 SVVFTYVDFFRGIMDDVLPSGVGPNLLLGYLHRWDLMVTSNKKSIDEHISLLKWSSATDD 355
Query: 353 KNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSPYFIL 412
K + +++ DK+ PRI LQ+NG+DN+I+G ++ VSL K+ V +G E+KE++P IL
Sbjct: 356 KKTVVYLEMLEDKYSPRIDLQENGDDNVILGFGVENVSLFQKITVTVGPEQKEVAPQHIL 415
Query: 413 MCLTLDGKLNMFHVASVSGPSCSPDVDFTLSDAEDDVPAVVPV 455
+ LT +GKLN++++A +S PS P + + D + + PV
Sbjct: 416 LYLTGEGKLNIYYLARISDPSELPQTKLSAIEDSDVMKEISPV 458
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 178/460 (38%), Positives = 259/460 (56%), Gaps = 32/460 (6%)
Query: 694 ASTG--AGKIESLPSVRSSQFSLPQNFASGKSHNQKLYPSKDDYKTATLSGLPNSEPNLS 751
+STG G +E+ P+ R S LPQ + K H+ + + D K L + ++E +LS
Sbjct: 664 SSTGFRTGNLEAFPTSRGS--PLPQE-SIDKPHD-RTHAVVDHSKNFKLGAMFDTEQDLS 719
Query: 752 KQSGNIKEMTKELDMLLQSIEETGGFRDACTVFQRQKVEELEEGIGSLSEKCGMWRSIMD 811
K+ +I +MTKELD LL IE+ GGFRDAC FQ + + E + + E ++++ +
Sbjct: 720 KKFYSINDMTKELDTLLSYIEKDGGFRDACITFQERPLSMFEGDLQNFLELLQIFKNKIA 779
Query: 812 ERLQEIQNLFDK------TVQVLARKIYTEGIVKQASDSRYWDLWNRQKLSPELELKRGH 865
E+ +++L +K T +V AR+ Y +GIV Q+SD++YWD+WNRQKLSPE E KR +
Sbjct: 780 EQCSRVEDLRNKMFQGTSTTRVSARQAYMKGIVDQSSDTQYWDIWNRQKLSPEFEAKRQN 839
Query: 866 ILSINQDLINQLIELERHFNSLELNKFGENDGVHGCQRRPQSRFGTSRHIQSLHSLHTTM 925
IL NQ+L NQL+ELERHFN+LE+NKFGE V +R S S Q + S++ +
Sbjct: 840 ILKANQNLTNQLVELERHFNNLEMNKFGETGRVASSRRAIYSSKARSSQTQ-ISSVYNAL 898
Query: 926 SSQIAAAEQLSDRLSKQMALLSIESP-VKQQNVKKELFETLGIPYDASFSSPDVTKVMDQ 984
+SQ+AAAEQLSD LSKQ++ L+I SP K+ +V KELFE++G+ D + + +
Sbjct: 899 NSQLAAAEQLSDCLSKQISALNISSPSTKRGSVAKELFESIGL--DHTMDAAKFSGGTPS 956
Query: 985 SSIKKLIHSSGSAAANDQSRRRQSSAMKSYDPETARRRRDSLDRSWASFEPPKTTVKRML 1044
S+K+ + + S K+ +PETARRRR+SLDRS AS EP KTTVKR+
Sbjct: 957 KSVKRY-------PSTREHITSISGPSKTAEPETARRRRESLDRSLASLEPQKTTVKRIA 1009
Query: 1045 LQDHQKCSQVKSSLKDKQQISPHMLEGAAIVRPKDRTTPSTSWNPLRIKGLQDAS----- 1099
Q K S +K+ M AA+ + K ++P++S L S
Sbjct: 1010 QQQRLKISSDLPFRSNKKIFDSQM---AAMSQEKSSSSPTSSIVESYANKLHYPSEVLHE 1066
Query: 1100 -LKQTSESQSTLFKWAGDPTGPSQMTGLKSPVFQSNIAST 1138
K + +TLFKW + GPSQ K P + S+
Sbjct: 1067 KTKPSGPQHNTLFKWVKESAGPSQGPQHKYPELPGQLKSS 1106
>gi|242041743|ref|XP_002468266.1| hypothetical protein SORBIDRAFT_01g042706 [Sorghum bicolor]
gi|241922120|gb|EER95264.1| hypothetical protein SORBIDRAFT_01g042706 [Sorghum bicolor]
Length = 655
Score = 346 bits (888), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 182/436 (41%), Positives = 271/436 (62%), Gaps = 18/436 (4%)
Query: 7 IEIEEEIEGGRIESNDFFFDEIGESIPIIRDDDSQ---FDLQNPPSLPLAVSQLSHQLIF 63
+E+ +E+EG + DF F G+ IP++ D + FDL +PP+ PL VS H +F
Sbjct: 2 LELSDEVEGEEDGTTDFVFRLAGDPIPLLPTDSNPLPLFDLLSPPARPLVVSD-RHATVF 60
Query: 64 AAHSSGFCVARTNDVIDAAKEMKENG---TRCCVQELSVVDVPFENENVDILSLSNDEST 120
AH SGF RT ++I+A+KE +E G TRC Q+ V DVP V +++LS+DES
Sbjct: 61 LAHPSGFIAVRTKELIEASKEAREKGKASTRC-AQDCCVADVPLPG--VSLIALSHDESV 117
Query: 121 LAVSLYQSPHIHFFSVHSLLN-----KEIKPSFSCSLNGSTYVKDIRWRKKPENSFLALS 175
LA I FFS+ SLL +++ PS SCSL VKD +W S++ LS
Sbjct: 118 LAAC--TDTEIQFFSLASLLTHKCAAQDVVPSSSCSLGRVGTVKDFKWLNHASASYIVLS 175
Query: 176 NVGRLYRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSFKS 235
N G L G + L+DVM+NVDAV+ +G +AVA+ N L+IL+ +E +SL F+
Sbjct: 176 NGGLLCHGSLGEGLKDVMENVDAVDCCKEGNHIAVARGNKLTILSPDFRETSSMSLLFQL 235
Query: 236 WVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPV 295
W + D + VDSI WV D I+IGC +L ED EE YLVQVI++++ ++ +P
Sbjct: 236 WSDESDSEGTTAVDSIGWVHGDSIVIGCVRLNEDDNEEGYLVQVIRTEESTFCESPGKPD 295
Query: 296 VLSFSDVFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRW-SLDDEKN 354
V ++ D F ++DD LP G GP L L Y+ RW L +++N+K+ID+H+ LL+W S D++
Sbjct: 296 VYTYVDFFHGIMDDALPPGVGPNLLLGYLHRWDLMVASNKKSIDEHIALLKWPSAHDDEK 355
Query: 355 DIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSPYFILMC 414
+ +++ DK+ PRI LQ+NG+DN+I+G ++ VSL K+ V +G E+KE++P IL+C
Sbjct: 356 TVTYLEMLEDKYSPRIDLQENGDDNVILGFGVENVSLFQKITVTVGPEQKEVAPQHILLC 415
Query: 415 LTLDGKLNMFHVASVS 430
LT +GKL ++++ASVS
Sbjct: 416 LTGEGKLAVYYLASVS 431
>gi|449531745|ref|XP_004172846.1| PREDICTED: uncharacterized protein LOC101226494, partial [Cucumis
sativus]
Length = 874
Score = 345 bits (886), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 189/323 (58%), Positives = 241/323 (74%), Gaps = 11/323 (3%)
Query: 757 IKEMTKELDMLLQSIEETGGFRDACTVFQRQKVEELEEGIGSLSEKCGMWRSIMDERLQE 816
+KEM + LD LL+SIEE+GGF DACT FQ+ VE LE G+ SLS+ C +WRS M+ER QE
Sbjct: 1 MKEMCEGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQE 60
Query: 817 IQNLFDKTVQVLARKIYTEGIVKQASDSRYWDLWNRQKLSPELELKRGHILSINQDLINQ 876
+QNLFDK VQVL++K Y EGIV Q+SDS+YW+ W+RQKLS ELELKR HIL +NQ++ NQ
Sbjct: 61 VQNLFDKMVQVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQ 120
Query: 877 LIELERHFNSLELNKFGENDGVHGCQRRPQSRFGTSRHIQSLHSLHTTMSSQIAAAEQLS 936
LIELERHFN LELNKFG N+ +R Q +FG+SRH S+HSL+ M SQ+A A+ LS
Sbjct: 121 LIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLS 180
Query: 937 DRLSKQMALLSIESP-VKQQNVKKELFETLGIPYDASFSSPDVTKVMDQSSIKKLIHSSG 995
+ LSKQ+A L++ESP +K+Q+ KELFE++G+ YDASFSSP+V K+ + SS KKL+ SS
Sbjct: 181 ESLSKQLAALNMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSS-KKLLLSSD 239
Query: 996 SAAANDQSRRRQSSAMKSYDPETARRRRDSLDRSWASFEPPKTTVKRMLLQDHQKCSQVK 1055
S ++ SRR+Q S K+ + ET RRRRDSLDR+ AS +PPKTTVKRMLLQ +
Sbjct: 240 SFSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQG------IP 293
Query: 1056 SSLKDKQQISPHMLEGAAIV-RP 1077
SS +++Q EGAA V RP
Sbjct: 294 SS--EEKQFCSRTPEGAATVARP 314
>gi|357116006|ref|XP_003559776.1| PREDICTED: uncharacterized protein LOC100824238 [Brachypodium
distachyon]
Length = 963
Score = 337 bits (864), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 209/520 (40%), Positives = 303/520 (58%), Gaps = 54/520 (10%)
Query: 1 MATPTRIEIEEEIEGGRIESNDFFFDEIGESIPIIRDDDSQ---FDLQNPPSLPLAVSQL 57
M P +E+ +E+EG + + DF F G+ IP++ S FDL +PP+ PLAVS
Sbjct: 1 MVAPRELELSDEVEGPQDGTADFVFRLAGDPIPVLPPASSPPPLFDLHSPPARPLAVSD- 59
Query: 58 SHQLIFAAHSSGFCVARTNDVIDAAKEMKENG---TRCCVQELSVVDVPFENENVDILSL 114
H +F AH +GF RT +I+A+KE + G TRC ++ V DVP V +L+L
Sbjct: 60 RHAAVFLAHPNGFMAVRTKALIEASKEARAKGKASTRC-ARDCCVADVPLPG--VSLLAL 116
Query: 115 SNDESTLAVSLYQSPHIHFFSVHSLL-NKEIKPSFSCSLNGSTYVKDIRWRKKPENSFLA 173
S D+S LA IHFFS SLL +K+++PS SCSL S VKD +W +++
Sbjct: 117 SGDDSVLAAC--TGTEIHFFSPASLLTDKDVEPSSSCSLGQSDTVKDFKWLNHDSKAYIV 174
Query: 174 LSNVGRLYRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSF 233
LSN G L +G + L+DVM++VDAV+ G + VA++N L IL+S LKE ++L F
Sbjct: 175 LSNSGLLSQGSLREGLKDVMESVDAVDCCKDGNHIVVARENSLRILSSDLKETCCMTLLF 234
Query: 234 KSWVGDCDVNCS-VKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASS 292
+ W D D + +KVD+I WVR D I++GC QL E+G EENY+VQVI+S ++SS
Sbjct: 235 QLWPDDTDPEDTDIKVDAIGWVRDDSIVVGCVQLNEEGNEENYVVQVIRSGGDTFFESSS 294
Query: 293 EPVVLSFSDVFSDVIDDILPSGTGPYLFLTYVERW----------------------GLA 330
+PVV ++ D F ++DDILPS GP L L Y+ RW L
Sbjct: 295 KPVVFTYVDFFQGIMDDILPSRVGPNLLLGYLHRWYAKPVLNFVIEFVNDSSCVPYGDLM 354
Query: 331 ISANRKNIDDHVVLLRW--SLDDEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDK 388
+ +N+K+I+DH+ LL+W + DDE+N + +++ DK+ PRI LQ+NG+DN+I+G ++
Sbjct: 355 VVSNKKSINDHISLLKWPSTTDDERN-VVYLEMLEDKYSPRIDLQENGDDNVILGFGVEN 413
Query: 389 VSLCGKVKVQLGVEEKELSPYFILMCLTLDGKLNMFHVASVSGPSCSPDVDFTLSDAEDD 448
VSL K+ V +G E+KE++P IL+CLT +GKL ++++AS D+ F +SD D
Sbjct: 414 VSLFQKITVLVGPEQKEVAPQHILLCLTGEGKLIIYYLASY----VDIDMLFRISDPS-D 468
Query: 449 VPAVVPV---DSNL-----PRVYSGSGLQK--IEPVASSF 478
+P P DSN+ P SG L PVA S
Sbjct: 469 LPQTSPYTIEDSNVNKTNSPATVSGKDLTPSVTSPVAKSL 508
Score = 227 bits (579), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 142/338 (42%), Positives = 206/338 (60%), Gaps = 20/338 (5%)
Query: 694 ASTG--AGKIESLPSVRSSQFSLPQNFASGKSHNQKLYPSKDDYKTATLSGLPNSEPNLS 751
+STG G +E++ + R S PQ GK HN + + S D L N+E +LS
Sbjct: 640 SSTGFATGNLEAVATSRVSPS--PQQ-TIGKPHNSRAHTSADSRSFK----LGNTEQDLS 692
Query: 752 KQSGNIKEMTKELDMLLQSIEETGGFRDACTVFQRQKVEELEEGIGSLSEKCGMWRSIMD 811
K+ +I +MT+ELD LL IE+ GGF DAC Q+ + LE + + E ++++ ++
Sbjct: 693 KKFYSINDMTEELDTLLAYIEKDGGFIDACITSQQHPLSVLEGDLQNFMELLQVFKNKVE 752
Query: 812 ERLQEIQNLFDKTVQVLARKIYTEGIVKQASDSRYWDLWNRQKLSPELELKRGHILSINQ 871
E+ + ++L +K QV AR+ Y +GI+ Q+SD++YWD+WNRQKLSPE E+KR +IL NQ
Sbjct: 753 EKYSKTEDLRNKMFQVSARQAYMKGILSQSSDTQYWDIWNRQKLSPEFEVKRQNILKANQ 812
Query: 872 DLINQLIELERHFNSLELNKFGENDGVHGCQRRPQSRFGTSRHIQSLHSLHTTMSSQIAA 931
+L NQL+ELERHFN+LE+NKFG+ V +R S S Q L S++ ++SQ+AA
Sbjct: 813 NLTNQLVELERHFNNLEMNKFGDTGRVASSRRAVYSNKSRSSQTQ-LSSVYNALNSQLAA 871
Query: 932 AEQLSDRLSKQMALLSIESPVKQQ-NVKKELFETLGIPYDASFSSPDVTKVMDQSSIKKL 990
AEQLS+ LSKQ++ L+I SP K++ V KELFE++G+ + P + KL
Sbjct: 872 AEQLSECLSKQVSALNISSPSKKRGTVTKELFESIGLAH------PSDAPKFSAGTPSKL 925
Query: 991 IHSSGSAAANDQSRRRQSSAMKSYDPETARRRRDSLDR 1028
I S + +S S K +PETARRRR+SLDR
Sbjct: 926 IQGFPSTKEHTKSMLGPS---KIAEPETARRRRESLDR 960
>gi|218200281|gb|EEC82708.1| hypothetical protein OsI_27378 [Oryza sativa Indica Group]
Length = 1755
Score = 258 bits (658), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 178/460 (38%), Positives = 259/460 (56%), Gaps = 32/460 (6%)
Query: 694 ASTG--AGKIESLPSVRSSQFSLPQNFASGKSHNQKLYPSKDDYKTATLSGLPNSEPNLS 751
+STG G +E+ P+ R S LPQ + K H+ + + D K L + ++E +LS
Sbjct: 664 SSTGFRTGNLEAFPTSRGS--PLPQE-SIDKPHD-RTHAVVDHSKNFKLGAMFDTEQDLS 719
Query: 752 KQSGNIKEMTKELDMLLQSIEETGGFRDACTVFQRQKVEELEEGIGSLSEKCGMWRSIMD 811
K+ +I +MTKELD LL IE+ GGFRDAC FQ + + E + + E ++++ +
Sbjct: 720 KKFYSINDMTKELDTLLSYIEKDGGFRDACITFQERPLSMFEGDLQNFLELLQIFKNKIA 779
Query: 812 ERLQEIQNLFDK------TVQVLARKIYTEGIVKQASDSRYWDLWNRQKLSPELELKRGH 865
E+ +++L +K T +V AR+ Y +GIV Q+SD++YWD+WNRQKLSPE E KR +
Sbjct: 780 EQCSRVEDLRNKMFQGTSTTRVSARQAYMKGIVDQSSDTQYWDIWNRQKLSPEFEAKRQN 839
Query: 866 ILSINQDLINQLIELERHFNSLELNKFGENDGVHGCQRRPQSRFGTSRHIQSLHSLHTTM 925
IL NQ+L NQL+ELERHFN+LE+NKFGE V +R S S Q + S++ +
Sbjct: 840 ILKANQNLTNQLVELERHFNNLEMNKFGETGRVASSRRAIYSSKARSSQTQ-ISSVYNAL 898
Query: 926 SSQIAAAEQLSDRLSKQMALLSIESP-VKQQNVKKELFETLGIPYDASFSSPDVTKVMDQ 984
+SQ+AAAEQLSD LSKQ++ L+I SP K+ +V KELFE++G+ D + + +
Sbjct: 899 NSQLAAAEQLSDCLSKQISALNISSPSTKRGSVAKELFESIGL--DHTMDAAKFSGGTPS 956
Query: 985 SSIKKLIHSSGSAAANDQSRRRQSSAMKSYDPETARRRRDSLDRSWASFEPPKTTVKRML 1044
S+K+ + + S K+ +PETARRRR+SLDRS AS EP KTTVKR+
Sbjct: 957 KSVKRY-------PSTREHITSISGPSKTAEPETARRRRESLDRSLASLEPQKTTVKRIA 1009
Query: 1045 LQDHQKCSQVKSSLKDKQQISPHMLEGAAIVRPKDRTTPSTSWNPLRIKGLQDAS----- 1099
Q K S +K+ M AA+ + K ++P++S L S
Sbjct: 1010 QQQRLKISSDLPFRSNKKIFDSQM---AAMSQEKSSSSPTSSIVESYANKLHYPSEVLHE 1066
Query: 1100 -LKQTSESQSTLFKWAGDPTGPSQMTGLKSPVFQSNIAST 1138
K + +TLFKW + GPSQ K P + S+
Sbjct: 1067 KTKPSGPQHNTLFKWVKESAGPSQGPQHKYPELPGQLKSS 1106
>gi|449523796|ref|XP_004168909.1| PREDICTED: uncharacterized LOC101219502, partial [Cucumis sativus]
Length = 470
Score = 209 bits (532), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 117/241 (48%), Positives = 161/241 (66%), Gaps = 6/241 (2%)
Query: 7 IEIEEEIEGGRIESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAH 66
I +E+ EG +I ND +F +IG+ +P+ + DS FD ++PPS PLA+S+ S LIF AH
Sbjct: 236 IPLEDAGEGEQIVRNDLYFQKIGKPVPV-KLGDSIFDPESPPSQPLALSE-SSGLIFVAH 293
Query: 67 SSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLY 126
SGF V R DVI +A+E+K GT VQ+LS+VDV V IL++S D S LA +
Sbjct: 294 LSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGK--VHILAVSTDNSVLAAVV- 350
Query: 127 QSPHIHFFSVHSLLNKEIKPSFSCSLNGSTYVKDIRWRKKPENSFLALSNVGRLYRGGVN 186
+ +H FSV SLL+K KP SCS+ S+++KD +W +K EN++L LS G+LY+G N
Sbjct: 351 -AGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLYQGSAN 409
Query: 187 GPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSFKSWVGDCDVNCSV 246
GPL VM ++DAVE SVKGKF+AVAKK+ L+I + K KERL +SL G+ D + +V
Sbjct: 410 GPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETDTDFTV 469
Query: 247 K 247
K
Sbjct: 470 K 470
>gi|33146490|dbj|BAC79599.1| hypothetical protein [Oryza sativa Japonica Group]
gi|50509404|dbj|BAD30983.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 304
Score = 206 bits (525), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 122/289 (42%), Positives = 172/289 (59%), Gaps = 12/289 (4%)
Query: 1 MATPTRIEIEEEIEGGRIESNDFFFDEIGESIPIIRDDDSQ---FDLQNPPSLPLAVSQL 57
MA P +++ +E+EG + DF F G+ IP++ S FDLQ+PPS PLAVS
Sbjct: 1 MAPPRELDLSDEVEGEEDGTTDFVFRLAGDPIPLLPTTSSPLPLFDLQSPPSRPLAVSN- 59
Query: 58 SHQLIFAAHSSGFCVARTNDVIDAAKEMKENG--TRCCVQELSVVDVPFENENVDILSLS 115
+F AH +GF A T +I+A+KE +E G T C ++ V D+P V +L LS
Sbjct: 60 RRAAVFLAHPNGFMAATTKALIEASKEAREKGKSTTRCARDCCVADIPLPG--VSLLELS 117
Query: 116 NDESTLAVSLYQSPHIHFFSVHSLL-NKEIKPSFSCSLNGSTYVKDIRWRKKPENSFLAL 174
DES LA IHFFS SLL +K+++P SC+L GS+ VKD +W +F+ L
Sbjct: 118 RDESVLAAC--AGSVIHFFSASSLLTDKDVEPLSSCTLEGSSTVKDFKWLNHASKAFIVL 175
Query: 175 SNVGRLYRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSFK 234
S G L +G + L+D+M+NVDAV+ +G + ++KKN L+IL+S KE + L F+
Sbjct: 176 SKDGLLSQGSLGEGLKDIMENVDAVDCCKEGTHIVLSKKNTLNILSSDFKEICCMPLLFQ 235
Query: 235 SW-VGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQS 282
W + S+KVDSI WVR D I++G +L E+G EE YLVQVI+S
Sbjct: 236 LWSDDSDSDDASIKVDSIGWVRDDSIVVGSVRLNEEGNEEGYLVQVIRS 284
>gi|147843028|emb|CAN83314.1| hypothetical protein VITISV_001464 [Vitis vinifera]
Length = 276
Score = 202 bits (514), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 115/201 (57%), Positives = 148/201 (73%), Gaps = 9/201 (4%)
Query: 1 MATP-TRIEIE-EEIEGGRIESNDFFFDEIGESIPIIRDDDSQFDLQNP-PSLPLAVSQL 57
MATP + I++E +++EGGR++ +D+ F +IGES+ I + F+L +P PS PLAVS+
Sbjct: 1 MATPESVIDLEGKDLEGGRLDCDDYVFVKIGESVTI-KPQYYNFNLDSPLPSQPLAVSER 59
Query: 58 SHQLIFAAHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSND 117
S QLIF AHS GFCVART VI+ AKE+KE G+ +QELSVVDVP N V IL+LS D
Sbjct: 60 S-QLIFVAHSDGFCVARTEAVIELAKEIKEKGSGSSIQELSVVDVPIAN--VRILALSTD 116
Query: 118 ESTLAVSLYQSPHIHFFSVHSLLNKEIKPSFSCSLNGSTYVKDIRWRKKPENSFLALSNV 177
STLA S+ IHFFSV SLLNK +PSF+ SL+GS+ VKD+RWRKK +NS++ LS+
Sbjct: 117 SSTLAASV--GGDIHFFSVDSLLNKGQEPSFTRSLSGSSSVKDMRWRKKMDNSYVVLSSD 174
Query: 178 GRLYRGGVNGPLEDVMDNVDA 198
G+LY G GPL+DVMD VDA
Sbjct: 175 GKLYHGAAEGPLKDVMDGVDA 195
>gi|242041741|ref|XP_002468265.1| hypothetical protein SORBIDRAFT_01g042703 [Sorghum bicolor]
gi|241922119|gb|EER95263.1| hypothetical protein SORBIDRAFT_01g042703 [Sorghum bicolor]
Length = 193
Score = 177 bits (449), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 93/193 (48%), Positives = 136/193 (70%), Gaps = 2/193 (1%)
Query: 775 GGFRDACTVFQRQKVEELEEGIGSLSEKCGMWRSIMDERLQEIQNLFDKTVQVLARKIYT 834
GGFRDAC FQ++ + E+G+ +L E +++ ++E+ +I++L +K QV AR+ Y
Sbjct: 2 GGFRDACITFQQRPLSMFEDGLQNLLELLQVFKGKVEEQCSKIEDLRNKVFQVSARQTYM 61
Query: 835 EGIVKQASDSRYWDLWNRQKLSPELELKRGHILSINQDLINQLIELERHFNSLELNKFGE 894
+GIV Q+SD++YWD+WN QKLSPE E+KR IL NQ+L NQL+ELERHFN+LE+N+ E
Sbjct: 62 KGIVSQSSDNQYWDIWNCQKLSPEFEVKRQIILKANQNLTNQLVELERHFNNLEMNRSSE 121
Query: 895 NDGVHGCQRRPQSRFGTSRHIQSLHSLHTTMSSQIAAAEQLSDRLSKQMALLSIESPVKQ 954
V +R S S Q L S++ ++SQ+AAAEQLS+ LSKQ+++L+I SP ++
Sbjct: 122 TGRVASNRRAVYSNKTRSSQTQ-LSSVYNALNSQLAAAEQLSECLSKQISVLNIGSPTRK 180
Query: 955 QN-VKKELFETLG 966
Q V KELFE++G
Sbjct: 181 QGAVTKELFESIG 193
>gi|224073112|ref|XP_002303977.1| predicted protein [Populus trichocarpa]
gi|222841409|gb|EEE78956.1| predicted protein [Populus trichocarpa]
Length = 104
Score = 158 bits (400), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 80/104 (76%), Positives = 93/104 (89%)
Query: 925 MSSQIAAAEQLSDRLSKQMALLSIESPVKQQNVKKELFETLGIPYDASFSSPDVTKVMDQ 984
MSSQ+AAAEQLS+ LSKQM++LSIESPV+Q+NVKKELFET+GIPYDASFSSPD TKV D
Sbjct: 1 MSSQLAAAEQLSECLSKQMSMLSIESPVRQKNVKKELFETIGIPYDASFSSPDATKVGDT 60
Query: 985 SSIKKLIHSSGSAAANDQSRRRQSSAMKSYDPETARRRRDSLDR 1028
+S+KKL+ SSGSAA +SRR QSSAMKS D ET+RRRRDSLD+
Sbjct: 61 TSLKKLLLSSGSAATKGKSRRHQSSAMKSSDSETSRRRRDSLDQ 104
>gi|413925763|gb|AFW65695.1| hypothetical protein ZEAMMB73_930105 [Zea mays]
Length = 549
Score = 122 bits (307), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 61/150 (40%), Positives = 93/150 (62%), Gaps = 12/150 (8%)
Query: 778 RDACTVFQRQKVEELEEGIGSLSEKCGMWRSIMDERLQEIQNLFDKTVQ----------- 826
RDAC Q++ + E+G+ + + +++ ++E+ +I++L + Q
Sbjct: 148 RDACITIQQRSLSMFEDGLQNFLQLLQVFKRKVEEQCSKIEDLRNNMFQGEIFVCSFDYL 207
Query: 827 -VLARKIYTEGIVKQASDSRYWDLWNRQKLSPELELKRGHILSINQDLINQLIELERHFN 885
+ AR+ Y +GIV Q+SD++YWD+WNRQKLSPE E+KR +IL NQDL NQL+ELERHFN
Sbjct: 208 FISARQTYMKGIVSQSSDNQYWDIWNRQKLSPEFEVKRQNILRANQDLTNQLVELERHFN 267
Query: 886 SLELNKFGENDGVHGCQRRPQSRFGTSRHI 915
+LE+N+ GE V R S +SR +
Sbjct: 268 NLEMNRSGETGRVASNLRAVYSNKSSSRQV 297
>gi|414877944|tpg|DAA55075.1| TPA: hypothetical protein ZEAMMB73_147712, partial [Zea mays]
Length = 399
Score = 105 bits (263), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 55/138 (39%), Positives = 84/138 (60%), Gaps = 22/138 (15%)
Query: 778 RDACTVFQRQKVEELEEGIGSLSEKCGMWRSIMDERLQEIQNL---------FDKT---- 824
RDAC Q++ + E+G+ + + +++S ++E+ +I+ L FDK
Sbjct: 139 RDACITIQQRSLSMFEDGLQNFLQLLQVFKSKVEEQCSKIEYLRNNMFQGEIFDKYFMSF 198
Query: 825 VQVLARKIYTEGIVKQASDSRYWDLWNRQKLSPELELKRGHILSIN---------QDLIN 875
+ + + Y +GIV Q+SD++YWD+WNRQKLSPE E+KR + L N QDL N
Sbjct: 199 LPLFVMQTYMKGIVSQSSDNQYWDIWNRQKLSPEFEVKRQNSLRANQENNMSFVIQDLTN 258
Query: 876 QLIELERHFNSLELNKFG 893
QL+ELERHFN+LE+N+ G
Sbjct: 259 QLVELERHFNNLEMNRSG 276
>gi|414877943|tpg|DAA55074.1| TPA: hypothetical protein ZEAMMB73_147712 [Zea mays]
Length = 561
Score = 105 bits (261), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 55/138 (39%), Positives = 83/138 (60%), Gaps = 22/138 (15%)
Query: 778 RDACTVFQRQKVEELEEGIGSLSEKCGMWRSIMDERLQEIQNL---------FDKTVQ-- 826
RDAC Q++ + E+G+ + + +++S ++E+ +I+ L FDK
Sbjct: 139 RDACITIQQRSLSMFEDGLQNFLQLLQVFKSKVEEQCSKIEYLRNNMFQGEIFDKYFMSF 198
Query: 827 --VLARKIYTEGIVKQASDSRYWDLWNRQKLSPELELKRGHILSIN---------QDLIN 875
+ + Y +GIV Q+SD++YWD+WNRQKLSPE E+KR + L N QDL N
Sbjct: 199 LPLFVMQTYMKGIVSQSSDNQYWDIWNRQKLSPEFEVKRQNSLRANQENNMSFVIQDLTN 258
Query: 876 QLIELERHFNSLELNKFG 893
QL+ELERHFN+LE+N+ G
Sbjct: 259 QLVELERHFNNLEMNRSG 276
>gi|226495089|ref|NP_001144022.1| uncharacterized protein LOC100276843 [Zea mays]
gi|195635607|gb|ACG37272.1| hypothetical protein [Zea mays]
Length = 161
Score = 101 bits (251), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 86/149 (57%), Gaps = 10/149 (6%)
Query: 1 MATPTRIEIEEEIEGGRIESNDFFFDEIGESIPIIRDDDSQ---FDLQNPPSLPLAVSQL 57
M P +E+ +E+EG + DF F G+ IP++ D + FDL +PP+ PLAVS
Sbjct: 1 MGAPRELELSDEVEGEEDGTTDFVFRLAGDPIPLLPTDSNPLPLFDLLSPPARPLAVSD- 59
Query: 58 SHQLIFAAHSSGFCVARTNDVIDAAKEMKENG--TRCCVQELSVVDVPFENENVDILSLS 115
H +F AH SGF RT ++I+A+KE +E G + C Q+ V DVP V +++LS
Sbjct: 60 RHATVFLAHPSGFIAVRTKELIEASKEAREKGKSSTXCAQDCCVADVPL--PGVSLIALS 117
Query: 116 NDESTLAVSLYQSPHIHFFSVHSLLNKEI 144
+DES LA + I FFS+ SLL ++
Sbjct: 118 HDESVLAACTHT--EIQFFSLASLLTHKV 144
>gi|384249461|gb|EIE22942.1| hypothetical protein COCSUDRAFT_63339 [Coccomyxa subellipsoidea
C-169]
Length = 1892
Score = 98.6 bits (244), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 91/354 (25%), Positives = 152/354 (42%), Gaps = 28/354 (7%)
Query: 94 VQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPHIHFFSVHSLLNK-EIKPSFSCSL 152
+E+ +P +V +LS+DE LA + S I+ +S+ L+NK + P +
Sbjct: 11 AEEVCACTLPVAAASV---ALSSDEELLAA--WDSKMIYVWSLSMLINKRDTMPLTEWLV 65
Query: 153 NGSTYVKDIRWRK------KPENSFLALSNVGRLYRGGVNGPLEDVMDN----VDAVEWS 202
T VK + WR PE + + + G+L RG + G L+ + D V V WS
Sbjct: 66 PDGTPVKQLAWRPLQAPDAVPEAAVVTMR--GKLLRGSLGGRLQALPDVAAGVVSCVAWS 123
Query: 203 VKGKFVAVAKKNVLSILTSKLKERLPISLSFKSWVGDCDVNCSVKVDSIRWVRRDCIIIG 262
G+ +A+A N +++ + K D + SV VD++ W + +++
Sbjct: 124 ADGQLLALASANCITVQHVESGRHFRTEFDSKE---AKDADGSVTVDTLAWSGPNSLLVS 180
Query: 263 C-FQLTEDGKEENYLVQVIQSKDGKI-TDASSEPVVLSFSDVFSDVID-DILPSGTGPYL 319
ED + E + + + A ++P L ++ I+ D +GPY
Sbjct: 181 ANLHPNEDLEAEMDIAPLALVRWAAWDVHAGADPEELQLTEFIPPPINPDAAARDSGPYA 240
Query: 320 FLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRDKWIPRIKLQDNGNDN 379
+ V WG + A+ K DDH+ L E VD+ D RI + NDN
Sbjct: 241 RIALVPAWGAIVVAHSKASDDHIRLFEHVPGGEAKG---VDVTEDWSAIRIPTAMDYNDN 297
Query: 380 LIMGLCIDKVSLCGKVKVQLGVEEKELSPYFILMCLTLDGKLNMFHVASVSGPS 433
+ GL ID S +V + E L+P +L+ T DG L +F A ++ P+
Sbjct: 298 YVTGLGIDFTSTEVEV-MPPDPEASALAPSPVLLVATSDGALRLFTFAHMTKPT 350
>gi|242051330|ref|XP_002463409.1| hypothetical protein SORBIDRAFT_02g043333 [Sorghum bicolor]
gi|241926786|gb|EER99930.1| hypothetical protein SORBIDRAFT_02g043333 [Sorghum bicolor]
Length = 231
Score = 95.5 bits (236), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 89/240 (37%), Positives = 126/240 (52%), Gaps = 24/240 (10%)
Query: 918 LHSLHTTMSSQIAAAEQLSDRLSKQMALLSIESPVKQQ-NVKKELFETLGIPY--DAS-F 973
L S++ ++SQ+AAAEQLS+ LSKQ+++L+I SP ++Q V KELFE++G+ + DA+ F
Sbjct: 4 LSSVYNALNSQLAAAEQLSECLSKQISVLNIGSPTRKQGTVTKELFESIGLAHTTDATKF 63
Query: 974 SSPDVTKVMDQSSIKKLIHSSGSAAANDQSRRRQSSAMKSYDPETARRRRDSLDRSWASF 1033
S V S+ + HS G + KS +PETARRRR+SLD S AS
Sbjct: 64 SGSTPKPVKRFPSVNE--HSKGILGPS-----------KSGEPETARRRRESLDMSLASL 110
Query: 1034 EPPKTTVKRMLLQDHQKCSQVKSSLKDKQQISPHMLEGAAIVRPKDRTTPSTSWNPLRIK 1093
EP KTTVKR+ Q K S +K+ M AAI + K + ++S +
Sbjct: 111 EPQKTTVKRIAQQQRLKISSDLPFRSNKKIFDSQM---AAISQEKSSDSSNSSIVESYVS 167
Query: 1094 GL----QDASLKQTSESQSTLFKWAGDPTGPSQMTGLKSPVFQSNIASTRSSLSATQLSP 1149
L +D K + Q+ LFKW + TG SQ++ K + ST S T SP
Sbjct: 168 RLHSLSEDVKEKPSEPQQNPLFKWIKESTGSSQISEQKHIELPGQMKSTAHSSKLTPSSP 227
>gi|21954068|gb|AAK59573.2| unknown protein [Arabidopsis thaliana]
Length = 829
Score = 93.6 bits (231), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 63/161 (39%), Positives = 91/161 (56%), Gaps = 19/161 (11%)
Query: 1012 KSYDPETARRRRDSLDRSWASFEPPKTTVKRMLLQDHQKCSQVKSSLKDKQQISPHMLEG 1071
K+ DPETARRRR+SLDR+WA+FEPPKTTVKRMLLQ+ QK + ++ ++ S + +
Sbjct: 1 KNSDPETARRRRESLDRNWAAFEPPKTTVKRMLLQEQQKTGMNQQTVLSERLRSANNTQD 60
Query: 1072 AAIVRPKDRTTPSTSWNPLRIKGLQDASLKQTSESQSTLFKWAGDPTGPSQMTGLKSPVF 1131
+++ KD +P S N KG+ ++ + TSE+QST FK + P+
Sbjct: 61 RSLLHVKDHASPVVSSN----KGIMESFQQDTSEAQSTPFK-------------TRPPMP 103
Query: 1132 QSNIASTRSSLSATQLSPMGWQ-NHARNTGDVTAEKLSSGV 1171
QSN T S +SA++ S W N + NT E S +
Sbjct: 104 QSNSPFTISPISASKPS-FNWSGNKSSNTTSYAEESAPSQI 143
>gi|302792987|ref|XP_002978259.1| hypothetical protein SELMODRAFT_443776 [Selaginella moellendorffii]
gi|300154280|gb|EFJ20916.1| hypothetical protein SELMODRAFT_443776 [Selaginella moellendorffii]
Length = 1089
Score = 83.6 bits (205), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 105/432 (24%), Positives = 174/432 (40%), Gaps = 66/432 (15%)
Query: 13 IEGGRIESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCV 72
+E E+ D F IG + ++ +P + L+VS L F A +GF V
Sbjct: 2 VEKAPEENVDIAFKAIGGGVSLLSPGAKASSDYHPARV-LSVSN-RFGLTFFATPAGFGV 59
Query: 73 ARTNDVIDAAKEMK-ENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPHI 131
T DV++ + K + + C+ + E E V I+ LS DE TLAV + +
Sbjct: 60 V-TADVLELSSGPKPASASEKCLASV-------ELEKVSIVELSKDELTLAVCAGFA--V 109
Query: 132 HFFSVHSLLNKEIKPSFSCS-LNGSTYVKDIRWRKKPENSFLALSNVGRLYRGGVNGPLE 190
F+ V SL K F + +KD +W ++ N G+L G +
Sbjct: 110 RFYRVPSLALKTDTRRFEIRDIGDDALIKDFQWCSLDQDK-----NKGKLLVGKIGEEPT 164
Query: 191 DVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSFKSWVGDCDVNCSVKVDS 250
+ +NV A WS GK + A L+I +S + I+ + C+ VD+
Sbjct: 165 FMKENVSAACWSQFGKQIMYATSMKLTIASSDFAKSFIINPRVE--FPGCEEGLETHVDA 222
Query: 251 IRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGK---------ITDASSEPVVLSFSD 301
+RW+ I+I Q +D EE ++ S G+ I++AS+E L+ +
Sbjct: 223 LRWISPQSIVISYVQSEDD--EEITCPLLVLSASGRQQLDEVLISISEASTECSALAVTR 280
Query: 302 VFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIAVVDI 361
F + I+P PY+ V +
Sbjct: 281 RFPAIEASIIPPMNSPYML--------------------------------AGGPKVFAL 308
Query: 362 NRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSPYFILMCLTLDGKL 421
+ D W I + D+ +DN + GL ID+ SL + + EK L P +++C +LD +L
Sbjct: 309 SDDLWRADISMFDSSSDNALTGLAIDRTSLLKQPDPR--NPEKLLPPSPVMLCTSLDSRL 366
Query: 422 NMFHVASVSGPS 433
++ V + G S
Sbjct: 367 LLYSVLRLDGES 378
>gi|148228420|ref|NP_001079666.1| Nuclear pore complex protein Nup214-like [Xenopus laevis]
gi|5764080|emb|CAB53357.1| nucleoporin CAN [Xenopus laevis]
Length = 2037
Score = 70.9 bits (172), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 114/437 (26%), Positives = 190/437 (43%), Gaps = 69/437 (15%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E+ DF F ++ + +R D DL S L +S + L+F G V T D+
Sbjct: 12 ETKDFQFRQLKK----VRLFDYPADLPKQRSNLLVISN-KYGLLFVGGFMGLKVFHTKDI 66
Query: 79 IDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAV---SLYQSPHIHFFS 135
+ K KEN + V + VP N + L+LS+D TL+V S Q + F+
Sbjct: 67 LVTVKP-KENANKTVVGPQGI-HVPM-NSPIHHLALSSDNLTLSVCMTSAEQGSSVSFYD 123
Query: 136 VHSLLNKEIK---PSFSCSL--NGSTYVKDIRWRKK-PENSFLALSNVGRLYRGGVNGPL 189
V +LLN+ + P SC L + S+ V D++W P + LS+ G + V +
Sbjct: 124 VRTLLNESKQNKMPFASCKLLRDPSSSVTDLQWNPTLPSMVAVCLSD-GSISVLQVTDTV 182
Query: 190 EDVMDNVDA------VEWSVKGKFVAVAKKN--VLSILTSKLKERLPISLSFKSWVGDCD 241
V N+ A V WS KGK +AV K+N V+ L S ++++ SF D
Sbjct: 183 -SVFANLPATLGVTSVCWSPKGKQLAVGKQNGTVVQYLPSLQEKKVIPCPSF------YD 235
Query: 242 VNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVVLSFSD 301
+ VKV + W+ + DG E VI + K D +E L+F++
Sbjct: 236 SDNPVKVLDVLWL--STYVFTVVYAAADGSLEASPQLVIVTLPKK-EDKRAE-RFLNFTE 291
Query: 302 VFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNID--------DHVVLLRWSLDDEK 353
+ + + FL Y+E W + ++A+ ++D D V +W L+D
Sbjct: 292 TCYSICSE-----RQHHFFLNYIEDWEILLAASAASVDVGVIARPPDQVGWEQWLLEDSS 346
Query: 354 NDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKEL-SPYFIL 412
+ + +N +D L MG+ +D C ++++ + E ++ P +L
Sbjct: 347 RA-------------EMPMTENNDDTLPMGVALDYT--C---QLEVFISESQILPPVPVL 388
Query: 413 MCLTLDGKLNMFHVASV 429
+ L+ DG L FHV ++
Sbjct: 389 LLLSTDGVLCPFHVVNL 405
>gi|28422157|gb|AAH46835.1| MGC52541 protein [Xenopus laevis]
Length = 694
Score = 66.6 bits (161), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 112/437 (25%), Positives = 189/437 (43%), Gaps = 69/437 (15%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E+ DF F ++ + +R D DL S L +S + L+F G V T D+
Sbjct: 12 ETKDFQFRQLKK----VRLFDYPADLPKQRSNLLVISN-KYGLLFVGGFMGLKVFHTKDI 66
Query: 79 IDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLY---QSPHIHFFS 135
+ K KEN + V + VP N + L+LS+D TL+V + Q + F+
Sbjct: 67 LVTVKP-KENANKTVVGPQGI-HVPM-NSPIHHLALSSDNLTLSVCMTSAEQGSSVSFYD 123
Query: 136 VHSLLNKEIK---PSFSCSL--NGSTYVKDIRWRKK-PENSFLALSNVGRLYRGGVNGPL 189
V +LLN+ + P SC L + S+ V D++W P + LS+ G + V +
Sbjct: 124 VRTLLNESKQNKMPFASCKLLRDPSSSVTDLQWNPTLPSMVAVCLSD-GSISVLQVTDTV 182
Query: 190 EDVMDNVDA------VEWSVKGKFVAVAKKN--VLSILTSKLKERLPISLSFKSWVGDCD 241
V N+ A V WS KGK +AV K+N V+ L S ++++ SF D
Sbjct: 183 S-VFANLPATLGVTSVCWSPKGKQLAVGKQNGTVVQYLPSLQEKKVIPCPSFY------D 235
Query: 242 VNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVVLSFSD 301
+ VKV + W+ + DG E VI + K D +E L+F++
Sbjct: 236 SDNPVKVLDVLWLST--YVFTVVYAAADGSLEASPQLVIVTLPKK-EDKRAE-RFLNFTE 291
Query: 302 VFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNID--------DHVVLLRWSLDDEK 353
+ + + FL Y+E W + ++A+ ++D D V +W L+D
Sbjct: 292 TCYSICSE-----RQHHFFLNYIEDWEILLAASAASVDVGVIARPPDQVGWEQWLLEDSS 346
Query: 354 NDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKEL-SPYFIL 412
+ + +N +D L MG+ +D ++++ + E ++ P +L
Sbjct: 347 RA-------------EMPMTENNDDTLPMGVALDYT-----CQLEVFISESQILPPVPVL 388
Query: 413 MCLTLDGKLNMFHVASV 429
+ L+ DG L FHV ++
Sbjct: 389 LLLSTDGVLCPFHVVNL 405
>gi|329664276|ref|NP_001192372.1| nuclear pore complex protein Nup214 [Bos taurus]
gi|296482127|tpg|DAA24242.1| TPA: nucleoporin 214kDa-like [Bos taurus]
Length = 2087
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 111/438 (25%), Positives = 190/438 (43%), Gaps = 77/438 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S L+VS + L+FA +SG V T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLSVSN-KYGLVFAGGASGLHVFSTKNL 67
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+V + S + I FF
Sbjct: 68 LIQNKPGDDPNKIVDKVQSL-LVPMKFP---IHHLALSCDNLTLSVCMMSSEYGSIIAFF 123
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKKPENSF--------LALSNVGRLY 181
V + N K+ K F+ + V D++W + +A+ V
Sbjct: 124 DVRTFSNEAKQQKRPFAYHKLLKDAGGLVIDMKWNPAVSSMVAVCLADGSIAVLQVTETV 183
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P +
Sbjct: 184 K--VCATLPSTV-GVTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFY----- 235
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVVLS 298
+ + V+V + W IG ++ T V + DG + A VV++
Sbjct: 236 --EADHPVRVLDVLW-------IGTYEFT----------IVYAAADGTLETAPD--VVMA 274
Query: 299 F--------SDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWS 348
+VF + ++ S T +L+Y+E W L ++A+ + + +L R S
Sbjct: 275 LLPKKEEKHPEVFMNFMEPCYGSCTERQHRYYLSYIEEWDLVLAASAASTEVS-ILARQS 333
Query: 349 LDDEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSP 408
+ + D +R + + + D +D+L MG+ ID + +V++ + EEK L P
Sbjct: 334 SQNSWESWLLEDSSRAE----LPVTDKSDDSLPMGVAIDYTN---QVEITI-TEEKTLPP 385
Query: 409 YFILMCLTLDGKLNMFHV 426
+LM L+ DG L F++
Sbjct: 386 APVLMLLSTDGVLCPFYM 403
>gi|224073554|ref|XP_002195441.1| PREDICTED: nuclear pore complex protein Nup214 [Taeniopygia
guttata]
Length = 2027
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 106/437 (24%), Positives = 185/437 (42%), Gaps = 74/437 (16%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS DL S LAVS + L+FA + + T D+
Sbjct: 12 EMKDFQFRALKK----VRIFDSPDDLPKERSSLLAVSN-KYGLVFAGGGTSLQIFHTRDL 66
Query: 79 IDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQS---PHIHFFS 135
+ K+ E+ + V+ + VP + + L+LS D TL+V + S I FF
Sbjct: 67 L-MQKQQGEDPNKI-VENIQSTLVPMKFP-MQHLALSCDNLTLSVCMMSSEIGSFISFFD 123
Query: 136 VHSLLN--KEIKPSFSCSL-----NGSTYVKDIRWRKKPENSFLALSNVGRLYRGGVNGP 188
V + LN K+ K +F+C + V D++W S +A+ L G ++
Sbjct: 124 VRTFLNQAKQQKHAFACHKLAKDSGSAAVVNDLKWSPA-SPSMVAIC----LSDGSIS-- 176
Query: 189 LEDVMDNVD------------AVEWSVKGKFVAVAKKN--VLSILTSKLKERLPISLSFK 234
+ V D+V +V WS KGK +A ++N V+ L + ++++ F
Sbjct: 177 VLQVTDSVKVYATLPSSAAVTSVCWSPKGKQLAAGRQNGTVVQYLPNLQEKKVIPCPPF- 235
Query: 235 SWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDAS 291
D + VKV + W+ + DG E ++ ++ K+ K +
Sbjct: 236 -----YDSDNPVKVLDVLWI--GTYVFAIVYAEADGTLEMSPQVVIAILPKKEEKRPEK- 287
Query: 292 SEPVVLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSL 349
F + ++ S T + FL +VE W L ++A+ + + V +L
Sbjct: 288 -----------FVNFMEPCYTSCTERQHHYFLNFVEEWDLVLAASAASTE--VSILARQG 334
Query: 350 DDEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSPY 409
D + D +R + + L D +D L +G+ +D S + + + +EK L P
Sbjct: 335 DQNWESWVLEDSSRAE----LPLTDKSDDTLPVGVAVDYTS---NMPIVIS-DEKILPPA 386
Query: 410 FILMCLTLDGKLNMFHV 426
+L+ L+ DG L FH+
Sbjct: 387 PVLILLSTDGVLCPFHM 403
>gi|335281115|ref|XP_001926630.3| PREDICTED: nuclear pore complex protein Nup214 [Sus scrofa]
Length = 2079
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 114/439 (25%), Positives = 192/439 (43%), Gaps = 79/439 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+V + S + I FF
Sbjct: 68 LIQNKPGDDPNKIVDKVQSL-LVPMKFP---IHHLALSCDNLTLSVCMMSSEYGSIIAFF 123
Query: 135 SVHSLLN--KEIKPSFS---CSLNGSTYVKDIRWRKKPE--------NSFLALSNVGRLY 181
V + N K+ K F+ S + V D++W + +A+ V
Sbjct: 124 DVRTFSNEAKQQKRPFAYHKLSKDAGGMVIDMKWNPTVASMVAVCLADGSIAVLQVTETV 183
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P + S
Sbjct: 184 K--VCATLPSTV-GVTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYVS--- 237
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVVLS 298
+ V+V + W IG ++ T V + DG + +S VV++
Sbjct: 238 ----DHPVRVLDVLW-------IGTYEFT----------VVYAAADGTLE--TSPDVVMA 274
Query: 299 F--------SDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWS 348
+VF + ++ S T + +L YVE W L ++A+ +I+ +L R S
Sbjct: 275 LLPKKEEKHPEVFVNFMEPCYGSCTERQHHYYLNYVEEWDLVLAASASSIEVS-ILARQS 333
Query: 349 LDDEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGV-EEKELS 407
+ D +R + + + D +D+L +G+ ID + +V++ V +EK L
Sbjct: 334 GQSNWESWLLEDSSRAE----LPVTDKSDDSLPVGVAIDFTT-----QVEITVNDEKTLP 384
Query: 408 PYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 385 PAPVLMLLSTDGVLCPFYM 403
>gi|426222924|ref|XP_004005629.1| PREDICTED: nuclear pore complex protein Nup214 [Ovis aries]
Length = 2083
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 111/443 (25%), Positives = 188/443 (42%), Gaps = 84/443 (18%)
Query: 26 DEIGESIPIIRDDDSQFD-------LQNPPSLPLAVSQL-----SHQLIFAAHSSGFCVA 73
DE+ IP D QF +P LP S L + L+FA +SG V
Sbjct: 3 DEMDAVIPEREMKDFQFRALKKVRIFDSPEELPKERSSLLSVSNKYGLVFAGGASGLHVF 62
Query: 74 RTNDVIDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH-- 130
T +++ K + N VQ L +V + F + L+LS D TL+V + S +
Sbjct: 63 STKNLLIQNKPGDDPNKIVDKVQSL-LVPMKFP---IHHLALSCDNLTLSVCMMSSEYGS 118
Query: 131 -IHFFSVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKKPENSF--------LALSN 176
I FF V + N K+ K F+ + V D++W + +A+
Sbjct: 119 IIAFFDVRTFSNEAKQQKRPFAYHKLLKDAGGLVIDMKWNPTVSSMVAVCLADGSIAVLQ 178
Query: 177 VGRLYRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSF 233
V + V L + V +V WS KGK +AV K+N V + T + K+ +P +
Sbjct: 179 VTETVK--VCATLPSTV-GVTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFY 235
Query: 234 KSWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSE 293
+ + V+V + W IG ++ T V + DG + A
Sbjct: 236 -------EADHPVRVLDVLW-------IGTYEFT----------IVYAAADGTLETAPD- 270
Query: 294 PVVLSF--------SDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVV 343
VV++ +VF + ++ S T + +L++VE W L ++A+ + + +
Sbjct: 271 -VVMALLPKKEEKHPEVFMNFMEPCYGSCTERQHHYYLSFVEEWDLVLAASAASTEVS-I 328
Query: 344 LLRWSLDDEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEE 403
L R S + + D +R + + + D +D+L MG+ ID + +V++ + EE
Sbjct: 329 LARQSSQNSWESWLLEDSSRAE----LPVTDKSDDSLPMGVAIDYTN---QVEITIN-EE 380
Query: 404 KELSPYFILMCLTLDGKLNMFHV 426
K L P +LM L+ DG L F++
Sbjct: 381 KTLPPAPVLMLLSTDGVLCPFYM 403
>gi|147820641|emb|CAN72099.1| hypothetical protein VITISV_021728 [Vitis vinifera]
Length = 433
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/94 (43%), Positives = 53/94 (56%), Gaps = 3/94 (3%)
Query: 691 GDRASTGAGKIESLPSVRSSQFSLPQNFASGKSHNQKLYPSKDDYKTATLSGLPNSEPNL 750
G R S G +E + S +Q S+ F + KS N K Y K++Y+T L G NSE +L
Sbjct: 74 GQRYSMVVGNVEPISSTLGTQLSM--QFPA-KSPNYKSYHLKENYRTLLLQGQLNSESHL 130
Query: 751 SKQSGNIKEMTKELDMLLQSIEETGGFRDACTVF 784
SKQ GN+KEM KELD LLQ I+ D +F
Sbjct: 131 SKQFGNVKEMAKELDTLLQCIKGPEKAVDPFDIF 164
>gi|307106989|gb|EFN55233.1| hypothetical protein CHLNCDRAFT_52628 [Chlorella variabilis]
Length = 1773
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 92/206 (44%), Gaps = 30/206 (14%)
Query: 246 VKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVVLSFSDVFSD 305
++VDS+ WV +++ +L G EE Y + + G S L+ S+ F++
Sbjct: 242 LQVDSVAWVGEAALLVSS-KLATGGAEEPYAPLCMLTWQGAAPTEGS----LALSEFFAN 296
Query: 306 VI--DDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLL-----RWSLDDEKNDIAV 358
I DD+ GP+L V +W A+ A+RK DDHV L R D +D
Sbjct: 297 NISGDDV---ARGPWLQTATVPQWSAAVYAHRKANDDHVKLAALVGGRPVAADVTDDRLA 353
Query: 359 VDI----NRDKWIP-----RIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEE--KELS 407
+ I + + +P R QD DN ++GL +D C +V V ++ +L
Sbjct: 354 IRIPNAPGKHRLLPARATLRRARQD--ADNFVVGLGVDLT--CTEVPVLHPTDDTAPDLP 409
Query: 408 PYFILMCLTLDGKLNMFHVASVSGPS 433
P +L+ T DG L + ++ PS
Sbjct: 410 PQPVLLVATSDGVLRCYTFGRMATPS 435
>gi|344271519|ref|XP_003407585.1| PREDICTED: nuclear pore complex protein Nup214 [Loxodonta africana]
Length = 2100
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 111/445 (24%), Positives = 192/445 (43%), Gaps = 88/445 (19%)
Query: 26 DEIGESIPIIRDDDSQFD-------LQNPPSLP------LAVSQLSHQLIFAAHSSGFCV 72
DE+ IP D QF ++P LP LAVS + L+FA +SG +
Sbjct: 3 DEMDAMIPEREMKDFQFRALKKVRIFESPDKLPKERTSLLAVSN-KYGLVFAGGTSGLQI 61
Query: 73 ARTNDVIDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH- 130
T +++ K + N VQ L +V + F + L+LS D TL+V + S +
Sbjct: 62 FPTKNLLIQNKPGDDPNKIVDKVQSL-LVPMKFP---IHHLALSCDNLTLSVCMMSSEYG 117
Query: 131 --IHFFSVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALS 175
I FF V + N K+ K F+ + V D++W P + +A+
Sbjct: 118 SIIAFFDVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPAVPSMVAVCLADGSIAVL 177
Query: 176 NVGRLYRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLS 232
V + V L + V +V WS KGK +AV K+N V + T + K+ +P
Sbjct: 178 QVTETVK--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPF 234
Query: 233 FKSWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITD 289
++S + V+V + W+ I DG E + ++ ++ K+ K
Sbjct: 235 YES-------DHPVRVLDVLWIGTYVFTI--VYAAADGTLETSPDVVMALLPKKEDK--- 282
Query: 290 ASSEPVVLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRW 347
++F + ++ S T + FL+Y+E W L ++A+ + + + +L R
Sbjct: 283 ---------HPEIFVNFMEPCYSSCTERQHHYFLSYIEEWDLVLAASAASTEVN-ILARQ 332
Query: 348 SLDDEKNDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGV 401
S D+ IN + W+ + + D +D+L MG+ ID ++++ +
Sbjct: 333 S--DQ--------INWESWLLEDSGRAELPVTDKNDDSLPMGVAID---YTNQMEITIN- 378
Query: 402 EEKELSPYFILMCLTLDGKLNMFHV 426
+EK L P +LM L+ DG L F++
Sbjct: 379 DEKTLPPAPVLMLLSTDGVLCPFYM 403
>gi|410903520|ref|XP_003965241.1| PREDICTED: nuclear pore complex protein Nup214-like [Takifugu
rubripes]
Length = 1576
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 94/395 (23%), Positives = 164/395 (41%), Gaps = 66/395 (16%)
Query: 59 HQLIFAAHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDE 118
+ L F F V +T D+++A K E + V+ ++ + + L LS DE
Sbjct: 47 YGLTFVGFVRTFKVYQTQDILNADKS--EGNSNEVVEGITALAEVTGELVLHHLDLSCDE 104
Query: 119 STLAV---SLYQSPHIHFFSVHSLLNKEI--KPSFSC---SLNGSTYVKDIRWRKKPENS 170
TL+V S + F+ V + +NK K F+ + T V D++W
Sbjct: 105 LTLSVCGTSAEGGLSLRFYDVRTFINKARVQKLPFASVVPPVPPGTLVLDLKWNPTQATK 164
Query: 171 FLALSNVGRLYRGGVNGPLEDVMDNVDAVE------------WSVKGKFVAVAKKNV-LS 217
GR+ + DV D+V V WS KGK VA K N +S
Sbjct: 165 LAVCMTDGRMQ-------ILDVADSVKMVSELPASSGITCVCWSPKGKQVAAGKMNATVS 217
Query: 218 ILTSKLKER--LPISLSFKSWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE-- 273
T L+E+ +P + S + +KV + W++ + DG E
Sbjct: 218 QYTPALEEKKVIPCPHFYTS-------DEPIKVLDVLWLK--TFVFAVVYAAADGSPETP 268
Query: 274 -NYLVQVIQSKDGKITDASSEPVVLSFSD-VFSDVIDDILPSGTGPYLFLTYVERWGLAI 331
+ ++ + KD K+ E L+FSD V+ + + FL++VE W L
Sbjct: 269 PDLVLISLPKKDEKV-----ETKYLNFSDTVYGSCTERQH------HYFLSHVEDWDLVF 317
Query: 332 SANRKNIDDHVVLLRWSLDDEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSL 391
+A+ +I+ V+ + DD+ ++ +++ +P + D D L +GL ID S
Sbjct: 318 AASAASIEVSVIARQ---DDKIWELWILEDASRAELPVTETND---DTLPLGLAIDYTS- 370
Query: 392 CGKVKVQLGVEEKELSPYFILMCLTLDGKLNMFHV 426
+ ++ + +EK L P ++ L+ +G L F +
Sbjct: 371 --QTEIHI-TDEKILPPAPTMLMLSTEGLLCPFAL 402
>gi|440900084|gb|ELR51293.1| Nuclear pore complex protein Nup214, partial [Bos grunniens mutus]
Length = 2113
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 112/440 (25%), Positives = 191/440 (43%), Gaps = 81/440 (18%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S L+VS + L+FA +SG V T ++
Sbjct: 37 EMKDFQFRALKK----VRIFDSPEELPKERSSLLSVSN-KYGLVFAGGASGLHVFSTKNL 91
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+V + S + I FF
Sbjct: 92 LIQNKPGDDPNKIVDKVQSL-LVPMKFP---IHHLALSCDSLTLSVCMTSSEYGSVIAFF 147
Query: 135 SVHSLLN--KEIKPSFSCS-----LNGSTYVKDIRWRKKPENSF--------LALSNVGR 179
V + N K+ K F+ + G V D++W + +A+ V
Sbjct: 148 DVRTFSNEAKQQKRPFAYHKLLKDVGG--LVIDMKWNPAVSSMVAVCLADGSIAVLQVTE 205
Query: 180 LYRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSW 236
+ V L + V +V WS KGK +AV K+N V + T + K+ +P +
Sbjct: 206 TVK--VCATLPSTV-GVTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFY--- 259
Query: 237 VGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVV 296
+ + V+V + W IG ++ T V + DG + A VV
Sbjct: 260 ----EADHPVRVLDVLW-------IGTYEFT----------IVYAAADGTLETAPD--VV 296
Query: 297 LSF--------SDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLR 346
++ +VF + ++ S T +L+Y+E W L ++A+ + + +L R
Sbjct: 297 MALLPKKEEKHPEVFMNFMEPCYGSCTERQHRYYLSYIEEWDLVLAASAASTEVS-ILAR 355
Query: 347 WSLDDEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKEL 406
S + + D +R + + + D +D+L MG+ ID + +V++ + EEK L
Sbjct: 356 QSSQNSWESWLLEDSSRAE----LPVTDKSDDSLPMGVAIDYTN---QVEITIN-EEKTL 407
Query: 407 SPYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 408 PPAPVLMLLSTDGVLCPFYM 427
>gi|296191040|ref|XP_002743452.1| PREDICTED: nuclear pore complex protein Nup214 isoform 1
[Callithrix jacchus]
Length = 2078
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 105/438 (23%), Positives = 187/438 (42%), Gaps = 77/438 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFFS 135
+ K + V ++ + VP + + L+LS D TL+V + + I FF
Sbjct: 68 LIQNKPGDD--PNKIVDKVQGLLVPMKFP-IHHLALSCDNLTLSVCMMSGEYGSIIAFFD 124
Query: 136 VHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLYR 182
V + N K+ K F+ + V D++W P + +A+ V +
Sbjct: 125 VRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETVK 184
Query: 183 GGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVGD 239
V L + V +V WS KGK +AV K+N V + T + K+ +P +
Sbjct: 185 --VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFY------ 235
Query: 240 CDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPVV 296
+ + V+V + W+ I DG E + ++ ++ K+ K
Sbjct: 236 -EADHPVRVLDVLWIGTYVFTI--VYAAADGTLETSPDVVMALLPKKEEK---------- 282
Query: 297 LSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKN 354
++F + ++ S T + +L+YVE W L ++A+ + + ++ +N
Sbjct: 283 --HPEIFVNFMEPCYGSCTERQHHYYLSYVEEWDLVLAASAASTEVSILA-------RQN 333
Query: 355 DIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSP 408
D IN + W+ + + D +D+L MG+ ID +V+V + +EK L P
Sbjct: 334 D----QINWESWLLEDSSRAELPVTDKSDDSLPMGVAID---YTNQVEVTIS-DEKTLPP 385
Query: 409 YFILMCLTLDGKLNMFHV 426
+LM L+ DG L F++
Sbjct: 386 APVLMLLSTDGVLCPFYM 403
>gi|296191042|ref|XP_002743453.1| PREDICTED: nuclear pore complex protein Nup214 isoform 2
[Callithrix jacchus]
Length = 2088
Score = 63.9 bits (154), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 105/438 (23%), Positives = 187/438 (42%), Gaps = 77/438 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFFS 135
+ K + V ++ + VP + + L+LS D TL+V + + I FF
Sbjct: 68 LIQNKPGDD--PNKIVDKVQGLLVPMKFP-IHHLALSCDNLTLSVCMMSGEYGSIIAFFD 124
Query: 136 VHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLYR 182
V + N K+ K F+ + V D++W P + +A+ V +
Sbjct: 125 VRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETVK 184
Query: 183 GGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVGD 239
V L + V +V WS KGK +AV K+N V + T + K+ +P +
Sbjct: 185 --VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFY------ 235
Query: 240 CDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPVV 296
+ + V+V + W+ I DG E + ++ ++ K+ K
Sbjct: 236 -EADHPVRVLDVLWIGTYVFTI--VYAAADGTLETSPDVVMALLPKKEEK---------- 282
Query: 297 LSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKN 354
++F + ++ S T + +L+YVE W L ++A+ + + ++ +N
Sbjct: 283 --HPEIFVNFMEPCYGSCTERQHHYYLSYVEEWDLVLAASAASTEVSILA-------RQN 333
Query: 355 DIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSP 408
D IN + W+ + + D +D+L MG+ ID +V+V + +EK L P
Sbjct: 334 D----QINWESWLLEDSSRAELPVTDKSDDSLPMGVAID---YTNQVEVTIS-DEKTLPP 385
Query: 409 YFILMCLTLDGKLNMFHV 426
+LM L+ DG L F++
Sbjct: 386 APVLMLLSTDGVLCPFYM 403
>gi|413916829|gb|AFW56761.1| hypothetical protein ZEAMMB73_598462 [Zea mays]
Length = 480
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 37/47 (78%), Gaps = 5/47 (10%)
Query: 850 WNR-----QKLSPELELKRGHILSINQDLINQLIELERHFNSLELNK 891
W+R +KLSPE E+KR +IL NQDL NQL+ELERHFN+LE+N+
Sbjct: 46 WDRIYGTVRKLSPEFEVKRQNILRANQDLTNQLVELERHFNNLEMNR 92
>gi|403256526|ref|XP_003920924.1| PREDICTED: nuclear pore complex protein Nup214 [Saimiri boliviensis
boliviensis]
Length = 2133
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 112/450 (24%), Positives = 188/450 (41%), Gaps = 88/450 (19%)
Query: 21 NDFFFDEIGESIPIIRDDDSQFD-------LQNPPSLP------LAVSQLSHQLIFAAHS 67
D DE+ IP D QF +P LP LAVS + L+FA +
Sbjct: 39 GDTMGDEMDAMIPEREMKDFQFRALKKVRIFDSPEELPKERSSLLAVSN-KYGLVFAGGA 97
Query: 68 SGFCVARTNDVIDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLY 126
SG + T +++ K + N VQ L +V + F + L+LS D TL+V +
Sbjct: 98 SGLQIFPTKNLLIQNKPGDDPNKIVDKVQGL-LVPMKFP---IHHLALSCDNLTLSVCMM 153
Query: 127 QSPH---IHFFSVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NS 170
+ I FF V + N K+ K F+ + V D++W P +
Sbjct: 154 SGEYGSIIAFFDVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADG 213
Query: 171 FLALSNVGRLYRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERL 227
+A+ V + V L + V +V WS KGK +AV K+N V + T + K+ +
Sbjct: 214 SIAVLQVTETVK--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVI 270
Query: 228 PISLSFKSWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKD 284
P ++S + V+V + W+ I DG E + ++ ++ K+
Sbjct: 271 PCPPFYES-------DHPVRVLDVLWIGTYVFTI--VYAAADGTLETSPDVVMALLPKKE 321
Query: 285 GKITDASSEPVVLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHV 342
K ++F + ++ S T + +L+YVE W L ++A+ + + V
Sbjct: 322 EK------------HPEIFVNFMEPCYGSCTERQHHYYLSYVEEWDLVLAASAASTE--V 367
Query: 343 VLLRWSLDDEKNDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVK 396
+L D IN + W+ + + D +D+L MG+ ID +V+
Sbjct: 368 SILARQSD---------QINWESWLLEDSSRAELPVTDKSDDSLPMGVAID---YTNQVE 415
Query: 397 VQLGVEEKELSPYFILMCLTLDGKLNMFHV 426
V + +EK L P +LM L+ DG L F++
Sbjct: 416 VTIN-DEKTLPPAPVLMLLSTDGVLCPFYM 444
>gi|395506303|ref|XP_003757474.1| PREDICTED: nuclear pore complex protein Nup214 [Sarcophilus
harrisii]
Length = 2139
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 94/402 (23%), Positives = 178/402 (44%), Gaps = 68/402 (16%)
Query: 52 LAVSQLSHQLIFAAHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDI 111
LAVS + L+FA ++GF V T +++ + ++ V+++ + +P + V
Sbjct: 42 LAVSN-KYGLVFAGGATGFRVFPTKNLLIQNRPGED--PNKIVEKVPNIFIPMKFP-VYH 97
Query: 112 LSLSNDESTLAVSLYQSPH--IHFFSVHSLLN--KEIKPSFS---CSLNGSTYVKDIRWR 164
L+LS D TL+V + S I FF V + N K+ K F+ + + V D++W
Sbjct: 98 LALSCDNLTLSVCMVSSEFGLIAFFDVRTFSNEAKQQKRPFAYHKLTKDAGGVVVDMKWN 157
Query: 165 KK-PENSFLALSNVGRLYRGGVNGPLEDVMD-----NVDAVEWSVKGKFVAVAKKN--VL 216
P + + L++ G + V ++ V +V WS KGK +AV K N V+
Sbjct: 158 PTVPTMAAVCLAD-GSIAVLQVTDAVKVSASLPPSVAVTSVCWSPKGKQLAVGKHNGTVI 216
Query: 217 SIL-TSKLKERLPISLSFKSWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE-- 273
L T + K+ +P + + + V+V + W+ + DG E
Sbjct: 217 QYLPTLQEKKVIPCPPFY-------EADHPVRVLDVLWI--GTYVFSIVYAAADGTLETS 267
Query: 274 -NYLVQVIQSKDGKITDASSEPVVLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLA 330
+ ++ V+ K+ K ++F + ++ S T + +L Y+E W L
Sbjct: 268 PDVVMTVLPKKEDK------------HPEIFVNFMEPCYGSCTERQHHYYLNYIEEWDLV 315
Query: 331 ISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRDKWI------PRIKLQDNGNDNLIMGL 384
++A+ + + ++ +ND N + W+ + + D +D+L +G+
Sbjct: 316 LAASAASTEVSILA-------RQND----QTNWESWLLEDSSRAELPVTDKSDDSLPVGV 364
Query: 385 CIDKVSLCGKVKVQLGVEEKELSPYFILMCLTLDGKLNMFHV 426
ID S +V++ + +EK L P +LM L+ DG L F++
Sbjct: 365 AIDYTS---QVEIPIN-DEKTLPPAPVLMLLSTDGVLCPFYM 402
>gi|417414060|gb|JAA53332.1| Putative nuclear pore complex nup214/can component, partial
[Desmodus rotundus]
Length = 2086
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 99/406 (24%), Positives = 176/406 (43%), Gaps = 75/406 (18%)
Query: 52 LAVSQLSHQLIFAAHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDI 111
LAVS + L+FA +SG + T +++ K + V ++ + VP + +
Sbjct: 28 LAVSN-KYGLVFAGGASGLQIFPTKNLLIQNKPGDD--PNKIVDKVQGLLVPMKFP-IRH 83
Query: 112 LSLSNDESTLAVSLYQSPH---IHFFSVHSLLN--KEIKPSFS---CSLNGSTYVKDIRW 163
L+LS D TL+V + S + I FF V + N K+ K F+ S + V D++W
Sbjct: 84 LALSCDNLTLSVCMMSSEYGSIIAFFDVRTFSNEAKQQKRPFAYHKLSKDAGGMVIDMKW 143
Query: 164 RKK-PE-------NSFLALSNVGRLYRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN- 214
P + +A+ V + V L + V +V WS KGK +AV K+N
Sbjct: 144 NPAVPSMVAVCLADGSIAVLQVTETVK--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNG 200
Query: 215 --VLSILTSKLKERLPISLSFKSWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKE 272
V + T + K+ +P ++S + V+V + W+ I DG
Sbjct: 201 TVVQYLPTLQEKKVIPCPPFYES-------DHPVRVLDVLWIGTYVFTIA--YAAADGSL 251
Query: 273 E---NYLVQVIQSKDGKITDASSEPVVLSFSDVFSDVIDDILPSGT--GPYLFLTYVERW 327
E + ++ ++ K+ K +VF + ++ T + +L+Y+E W
Sbjct: 252 ETSPDVVMALLPKKEEK------------HPEVFVNFMEPCYGCCTERQHHYYLSYIEEW 299
Query: 328 GLAISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRDKWI------PRIKLQDNGNDNLI 381
L ++A+ + + V +L D IN + W+ + + D +D+L
Sbjct: 300 DLVLAASAASTE--VSILARQSDQ---------INWESWLLEDSSRAELPVTDKNDDSLP 348
Query: 382 MGLCIDKVSLCGKVKVQLGV-EEKELSPYFILMCLTLDGKLNMFHV 426
+G+ ID + +V++ V EEK L P +LM L+ DG L F++
Sbjct: 349 VGVAIDHTN-----QVEITVNEEKTLPPAPVLMLLSTDGVLCPFYM 389
>gi|194225945|ref|XP_001917308.1| PREDICTED: LOW QUALITY PROTEIN: nuclear pore complex protein Nup214
[Equus caballus]
Length = 2087
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 110/439 (25%), Positives = 188/439 (42%), Gaps = 79/439 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+V + S + I FF
Sbjct: 68 LIQNKPGDDPNKIVDKVQSL-LVPMKFP---IHHLALSCDNLTLSVCMMSSEYGSIIAFF 123
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLY 181
V + N K+ K F+ + V D++W P + +A+ V
Sbjct: 124 DVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETV 183
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 184 K--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES--- 237
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPV 295
+ V+V + W+ I DG E + ++ ++ K+ K
Sbjct: 238 ----DHPVRVLDVLWIGTYVFTI--VYAAADGTLETSPDVVMALLPKKEDK--------- 282
Query: 296 VLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
+VF + ++ S + +L+YVE W L ++A+ + + V +L D
Sbjct: 283 ---HPEVFVNFMEPCYGSCAERQHHYYLSYVEEWDLVLAASAASTE--VSILARQSDQ-- 335
Query: 354 NDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELS 407
IN + W+ + + D +D+L MG+ ID +V++ + +EK L
Sbjct: 336 -------INWESWLLEDSSRAELPVTDKSDDSLPMGVAID---YTNQVEITIN-DEKTLP 384
Query: 408 PYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 385 PAPVLMLLSTDGVLCPFYM 403
>gi|431898925|gb|ELK07295.1| Nuclear pore complex protein Nup214 [Pteropus alecto]
Length = 2096
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 111/440 (25%), Positives = 188/440 (42%), Gaps = 81/440 (18%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRLFDSPDELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+V + S + I FF
Sbjct: 68 LIQNKPGDDPNKIVDKVQSL-LVPMKFP---IHHLALSCDNLTLSVCMMSSEYGSIIAFF 123
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLY 181
V + N K+ K F+ + V D++W P + +A+ V
Sbjct: 124 DVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETV 183
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 184 K--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES--- 237
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPV 295
+ V+V + WV + DG E + ++ ++ K+ K
Sbjct: 238 ----DHPVRVLDVLWV--GTYVFTVVYAAADGTSETSPDVVMALLPKKEEK--------- 282
Query: 296 VLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
+VF + ++ S T +L YVE W L ++A+ + + V +L D
Sbjct: 283 ---HPEVFVNFMEPCYGSCTERQHRYYLNYVEEWDLVLAASAASTE--VSILARQSD--- 334
Query: 354 NDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGV-EEKEL 406
IN + W+ + + D +D+L +G+ ID + +V++ V +EK L
Sbjct: 335 ------QINWESWLLEDSSRAELPVTDKNDDSLPVGVAIDYTN-----QVEVAVNDEKTL 383
Query: 407 SPYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 384 PPAPVLMLLSTDGVLCPFYM 403
>gi|345805929|ref|XP_850680.2| PREDICTED: LOW QUALITY PROTEIN: nuclear pore complex protein Nup214
[Canis lupus familiaris]
Length = 2086
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 108/445 (24%), Positives = 188/445 (42%), Gaps = 88/445 (19%)
Query: 26 DEIGESIPIIRDDDSQFD-------LQNPPSLP------LAVSQLSHQLIFAAHSSGFCV 72
DE+ IP D QF +P LP LAVS + L+FA +SG +
Sbjct: 3 DEMDAMIPEREMKDFQFRALKKVRIFDSPEELPKDRSSLLAVSN-KYGLVFAGGASGLQI 61
Query: 73 ARTNDVIDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH- 130
T +++ K + N VQ L +V + F + L+LS D TL+V + S +
Sbjct: 62 FPTKNLLIQNKPGDDPNKIVDKVQSL-LVPMKFP---IHHLALSCDNLTLSVCMMSSEYG 117
Query: 131 --IHFFSVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKKPENSF--------LALS 175
I FF V + N K+ K F+ + V D++W + +A+
Sbjct: 118 SIIAFFDVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMMAVCLADGSIAVL 177
Query: 176 NVGRLYRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLS 232
+ + V L + V +V WS KGK +AV K+N V + T + K+ +P
Sbjct: 178 QITETVK--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPF 234
Query: 233 FKSWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITD 289
++S + V+V + W+ I DG E + ++ ++ K+ K
Sbjct: 235 YES-------DHPVRVLDVLWIGTYVFTI--VYAAADGTLETSPDVVMALLPKKEEK--- 282
Query: 290 ASSEPVVLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRW 347
++F + ++ S T + +L+Y+E W L ++A+ + + V +L
Sbjct: 283 ---------HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILAR 331
Query: 348 SLDDEKNDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGV 401
D IN + W+ + + D +D+L +G+ ID + +V++ L
Sbjct: 332 QTD---------QINWESWLLEDSSRAELPVTDKSDDSLPVGVAIDYTN---QVEITLN- 378
Query: 402 EEKELSPYFILMCLTLDGKLNMFHV 426
+EK L P +LM L+ DG L F++
Sbjct: 379 DEKTLPPAPVLMLLSTDGVLCPFYM 403
>gi|397503652|ref|XP_003822433.1| PREDICTED: nuclear pore complex protein Nup214 isoform 3 [Pan
paniscus]
Length = 2143
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 107/440 (24%), Positives = 189/440 (42%), Gaps = 81/440 (18%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 54 EMKDFQFKALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 108
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+ + S + I FF
Sbjct: 109 LIQNKPGDDPNKIVDKVQGL-LVPMKFP---IHHLALSCDNLTLSACMMSSEYGSIIAFF 164
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLY 181
V + N K+ K F+ + V D++W P + +A+ V
Sbjct: 165 DVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETV 224
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 225 K--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES--- 278
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPV 295
+ V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 279 ----DHPVRVLDVLWI--GTYVFAIVYAAADGTLETSPDVVMALLPKKEEK--------- 323
Query: 296 VLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
++F + ++ S T + +L+Y+E W L ++A+ + + V +L D
Sbjct: 324 ---HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILARQSD--- 375
Query: 354 NDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGV-EEKEL 406
IN + W+ + + D +D+L MG+ +D + +V++ V +EK L
Sbjct: 376 ------QINWESWLLEDSSRAELPVTDKSDDSLPMGVVVDYTN-----QVEITVSDEKTL 424
Query: 407 SPYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 425 PPAPVLMLLSTDGVLCPFYM 444
>gi|397503648|ref|XP_003822431.1| PREDICTED: nuclear pore complex protein Nup214 isoform 1 [Pan
paniscus]
Length = 2131
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 107/440 (24%), Positives = 189/440 (42%), Gaps = 81/440 (18%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 54 EMKDFQFKALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 108
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+ + S + I FF
Sbjct: 109 LIQNKPGDDPNKIVDKVQGL-LVPMKFP---IHHLALSCDNLTLSACMMSSEYGSIIAFF 164
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLY 181
V + N K+ K F+ + V D++W P + +A+ V
Sbjct: 165 DVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETV 224
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 225 K--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES--- 278
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPV 295
+ V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 279 ----DHPVRVLDVLWI--GTYVFAIVYAAADGTLETSPDVVMALLPKKEEK--------- 323
Query: 296 VLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
++F + ++ S T + +L+Y+E W L ++A+ + + V +L D
Sbjct: 324 ---HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILARQSD--- 375
Query: 354 NDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGV-EEKEL 406
IN + W+ + + D +D+L MG+ +D + +V++ V +EK L
Sbjct: 376 ------QINWESWLLEDSSRAELPVTDKSDDSLPMGVVVDYTN-----QVEITVSDEKTL 424
Query: 407 SPYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 425 PPAPVLMLLSTDGVLCPFYM 444
>gi|285957|dbj|BAA03515.1| KIAA0023 [Homo sapiens]
gi|168274292|dbj|BAG09566.1| nucleoporin 214kDa [synthetic construct]
Length = 2093
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 106/439 (24%), Positives = 188/439 (42%), Gaps = 79/439 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+ + S + I FF
Sbjct: 68 LIQNKPGDDPNKIVDKVQGL-LVPMKFP---IHHLALSCDNLTLSACMMSSEYGSIIAFF 123
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLY 181
V + N K+ K F+ + V D++W P + +A+ V
Sbjct: 124 DVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETV 183
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 184 K--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES--- 237
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPV 295
+ V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 238 ----DHPVRVLDVLWI--GTYVFAIVYAAADGTLETSPDVVMALLPKKEEK--------- 282
Query: 296 VLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
++F + ++ S T + +L+Y+E W L ++A+ + + V +L D
Sbjct: 283 ---HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILARQSD--- 334
Query: 354 NDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELS 407
IN + W+ + + D +D+L MG+ +D +V++ + +EK L
Sbjct: 335 ------QINWESWLLEDSSRAELPVTDKSDDSLPMGVVVD---YTNQVEITIS-DEKTLP 384
Query: 408 PYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 385 PAPVLMLLSTDGVLCPFYM 403
>gi|119608367|gb|EAW87961.1| nucleoporin 214kDa, isoform CRA_b [Homo sapiens]
gi|119608368|gb|EAW87962.1| nucleoporin 214kDa, isoform CRA_b [Homo sapiens]
Length = 2131
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 106/439 (24%), Positives = 188/439 (42%), Gaps = 79/439 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 54 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 108
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+ + S + I FF
Sbjct: 109 LIQNKPGDDPNKIVDKVQGL-LVPMKFP---IHHLALSCDNLTLSACMMSSEYGSIIAFF 164
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLY 181
V + N K+ K F+ + V D++W P + +A+ V
Sbjct: 165 DVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETV 224
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 225 K--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES--- 278
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPV 295
+ V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 279 ----DHPVRVLDVLWI--GTYVFAIVYAAADGTLETSPDVVMALLPKKEEK--------- 323
Query: 296 VLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
++F + ++ S T + +L+Y+E W L ++A+ + + V +L D
Sbjct: 324 ---HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILARQSD--- 375
Query: 354 NDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELS 407
IN + W+ + + D +D+L MG+ +D +V++ + +EK L
Sbjct: 376 ------QINWESWLLEDSSRAELPVTDKSDDSLPMGVVVD---YTNQVEITIS-DEKTLP 425
Query: 408 PYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 426 PAPVLMLLSTDGVLCPFYM 444
>gi|119608369|gb|EAW87963.1| nucleoporin 214kDa, isoform CRA_c [Homo sapiens]
Length = 2144
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 106/439 (24%), Positives = 188/439 (42%), Gaps = 79/439 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 54 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 108
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+ + S + I FF
Sbjct: 109 LIQNKPGDDPNKIVDKVQGL-LVPMKFP---IHHLALSCDNLTLSACMMSSEYGSIIAFF 164
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLY 181
V + N K+ K F+ + V D++W P + +A+ V
Sbjct: 165 DVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETV 224
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 225 K--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES--- 278
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPV 295
+ V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 279 ----DHPVRVLDVLWI--GTYVFAIVYAAADGTLETSPDVVMALLPKKEEK--------- 323
Query: 296 VLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
++F + ++ S T + +L+Y+E W L ++A+ + + V +L D
Sbjct: 324 ---HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILARQSD--- 375
Query: 354 NDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELS 407
IN + W+ + + D +D+L MG+ +D +V++ + +EK L
Sbjct: 376 ------QINWESWLLEDSSRAELPVTDKSDDSLPMGVVVD---YTNQVEITIS-DEKTLP 425
Query: 408 PYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 426 PAPVLMLLSTDGVLCPFYM 444
>gi|397503650|ref|XP_003822432.1| PREDICTED: nuclear pore complex protein Nup214 isoform 2 [Pan
paniscus]
Length = 2111
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 104/439 (23%), Positives = 188/439 (42%), Gaps = 79/439 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 54 EMKDFQFKALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 108
Query: 79 IDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFFS 135
+ K + V ++ + VP + + L+LS D TL+ + S + I FF
Sbjct: 109 LIQNKPGDD--PNKIVDKVQGLLVPMKFP-IHHLALSCDNLTLSACMMSSEYGSIIAFFD 165
Query: 136 VHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLYR 182
V + N K+ K F+ + V D++W P + +A+ V +
Sbjct: 166 VRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETVK 225
Query: 183 GGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVGD 239
V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 226 --VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES---- 278
Query: 240 CDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPVV 296
+ V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 279 ---DHPVRVLDVLWI--GTYVFAIVYAAADGTLETSPDVVMALLPKKEEK---------- 323
Query: 297 LSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKN 354
++F + ++ S T + +L+Y+E W L ++A+ + + V +L D
Sbjct: 324 --HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILARQSD---- 375
Query: 355 DIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGV-EEKELS 407
IN + W+ + + D +D+L MG+ +D + +V++ V +EK L
Sbjct: 376 -----QINWESWLLEDSSRAELPVTDKSDDSLPMGVVVDYTN-----QVEITVSDEKTLP 425
Query: 408 PYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 426 PAPVLMLLSTDGVLCPFYM 444
>gi|29653|emb|CAA45535.1| putative oncogene [Homo sapiens]
Length = 2090
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 107/446 (23%), Positives = 189/446 (42%), Gaps = 93/446 (20%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+ + S + I FF
Sbjct: 68 LIQNKPGDDPNKIVDKVQGL-LVPMKFP---IHHLALSCDNLTLSACMMSSEYGSIIAFF 123
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKKPENSFLALSNVGRLYRGGVNGPL 189
V + N K+ K F+ + + V D++W S +A+ L G +
Sbjct: 124 DVRTFSNEAKQQKRPFAYHKLLKDAAGMVIDMKWNPT-VPSMVAVC----LADGSI---- 174
Query: 190 EDVMD---------------NVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISL 231
DV+ V +V WS KGK +AV K+N V + T + K+ +P
Sbjct: 175 -DVLQVTETVKVCATLPSTVAVTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPP 233
Query: 232 SFKSWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKIT 288
++S + V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 234 FYES-------DHPVRVLDVLWI--GTYVFAIVYAAADGTLETSPDVVMALLPKKEEK-- 282
Query: 289 DASSEPVVLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLR 346
++F + ++ S T + +L+Y+E W L ++A+ + + V +L
Sbjct: 283 ----------HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILA 330
Query: 347 WSLDDEKNDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLG 400
D IN + W+ + + D +D+L MG+ +D +V++ +
Sbjct: 331 RQSD---------QINWESWLLEDSSRAELPVTDKSDDSLPMGVVVD---YTNQVEITIS 378
Query: 401 VEEKELSPYFILMCLTLDGKLNMFHV 426
+EK L P +LM L+ DG L F++
Sbjct: 379 -DEKTLPPAPVLMLLSTDGVLCPFYM 403
>gi|410043329|ref|XP_001167010.3| PREDICTED: nuclear pore complex protein Nup214 isoform 9 [Pan
troglodytes]
Length = 2091
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 106/439 (24%), Positives = 188/439 (42%), Gaps = 79/439 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 54 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 108
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+ + S + I FF
Sbjct: 109 LIQNKPGDDPNKIVDKVQGL-LVPMKFP---IHHLALSCDNLTLSACMMSSEYGSIIAFF 164
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLY 181
V + N K+ K F+ + V D++W P + +A+ V
Sbjct: 165 DVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETV 224
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 225 K--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES--- 278
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPV 295
+ V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 279 ----DHPVRVLDVLWI--GTYVFAIVYAAADGTLETSPDVVMALLPKKEEK--------- 323
Query: 296 VLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
++F + ++ S T + +L+Y+E W L ++A+ + + V +L D
Sbjct: 324 ---HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILARQSD--- 375
Query: 354 NDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELS 407
IN + W+ + + D +D+L MG+ +D +V++ + +EK L
Sbjct: 376 ------QINWESWLLEDSSRAELPVTDKSDDSLPMGVVVD---YTNQVEITIS-DEKTLP 425
Query: 408 PYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 426 PAPVLMLLSTDGVLCPFYM 444
>gi|410226244|gb|JAA10341.1| nucleoporin 214kDa [Pan troglodytes]
gi|410267680|gb|JAA21806.1| nucleoporin 214kDa [Pan troglodytes]
gi|410296292|gb|JAA26746.1| nucleoporin 214kDa [Pan troglodytes]
gi|410336455|gb|JAA37174.1| nucleoporin 214kDa [Pan troglodytes]
Length = 2090
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 106/439 (24%), Positives = 188/439 (42%), Gaps = 79/439 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+ + S + I FF
Sbjct: 68 LIQNKPGDDPNKIVDKVQGL-LVPMKFP---IHHLALSCDNLTLSACMMSSEYGSIIAFF 123
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLY 181
V + N K+ K F+ + V D++W P + +A+ V
Sbjct: 124 DVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETV 183
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 184 K--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES--- 237
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPV 295
+ V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 238 ----DHPVRVLDVLWI--GTYVFAIVYAAADGTLETSPDVVMALLPKKEEK--------- 282
Query: 296 VLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
++F + ++ S T + +L+Y+E W L ++A+ + + V +L D
Sbjct: 283 ---HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILARQSD--- 334
Query: 354 NDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELS 407
IN + W+ + + D +D+L MG+ +D +V++ + +EK L
Sbjct: 335 ------QINWESWLLEDSSRAELPVTDKSDDSLPMGVVVD---YTNQVEITIS-DEKTLP 384
Query: 408 PYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 385 PAPVLMLLSTDGVLCPFYM 403
>gi|426363408|ref|XP_004048832.1| PREDICTED: nuclear pore complex protein Nup214 [Gorilla gorilla
gorilla]
Length = 2248
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 103/438 (23%), Positives = 187/438 (42%), Gaps = 77/438 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 131 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 185
Query: 79 IDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFFS 135
+ K + V ++ + VP + + L+LS D TL+ + S + I FF
Sbjct: 186 LIQNKPGDD--PNKIVDKVQGLLVPMKFP-IHHLALSCDNLTLSACMMSSEYGSIIAFFD 242
Query: 136 VHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLYR 182
V + N K+ K F+ + V D++W P + +A+ V +
Sbjct: 243 VRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETVK 302
Query: 183 GGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVGD 239
V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 303 --VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES---- 355
Query: 240 CDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPVV 296
+ V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 356 ---DHPVRVLDVLWI--GTYVFAIVYAAADGTLETSPDVVMALLPKKEEK---------- 400
Query: 297 LSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKN 354
++F + ++ S T + +L+Y+E W L ++A+ + + V +L D
Sbjct: 401 --HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILARQSD---- 452
Query: 355 DIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSP 408
IN + W+ + + D +D+L MG+ +D +V++ + +EK L P
Sbjct: 453 -----QINWESWLLEDSSRAELPVTDKSDDSLPMGVVVD---YTNQVEITIS-DEKTLPP 503
Query: 409 YFILMCLTLDGKLNMFHV 426
+LM L+ DG L F++
Sbjct: 504 APVLMLLSTDGVLCPFYM 521
>gi|145346525|ref|XP_001417737.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577965|gb|ABO96030.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1803
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 74/329 (22%), Positives = 144/329 (43%), Gaps = 36/329 (10%)
Query: 109 VDILSLSNDESTLAVSLYQSPHIHFFSVHSLLNKEIKPSFSCSLNGSTYVKDIRWRKKPE 168
++I+S++ DE A + + F++ + + + + V+ ++W
Sbjct: 104 IEIVSMTPDERAFATCDRRG-RVEFYAAKDSSDGKARGEKFGEMTLEAPVRAVKWCAN-S 161
Query: 169 NSFLALSNVGRLYRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTS-----KL 223
SF+AL+ +Y V +++ NV V G +A A NV+SI +S +
Sbjct: 162 ASFIALTGDALMYVEAVGKEPKEIATNVTCVSARANGT-LAWASGNVISIASSVDPAQTV 220
Query: 224 KERLPISLSFKSWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSK 283
KE + IS F D SV++D + +V I+ + ED + + V++
Sbjct: 221 KETIEIS-PFH------DPEDSVEIDGV-YVHSADKILFTSRSVED--PADCSLAVLKKV 270
Query: 284 DGKITDASSEPVVLSFSDVFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVV 343
DG E S D+ S+V+D TGP L + W + + +RK D+ ++
Sbjct: 271 DGNWVCTRLE----SAFDIDSEVVDL-----TGPVLDASAFSPWNVVFATHRKAWDNQLL 321
Query: 344 LLRWSLDDEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEE 403
L+ + D D V+++ D+ + + ++ +N + GL +D L G L ++
Sbjct: 322 TLQITRD---ADPCVLEVEDDRCSASVPMTEDDENNYVTGLGLD---LTGAGGTMLNPQD 375
Query: 404 K---ELSPYFILMCLTLDGKLNMFHVASV 429
K EL+ ++ T DG++ + A++
Sbjct: 376 KSAPELAKGPAIVFSTTDGRITILKCANL 404
>gi|126297778|ref|XP_001364829.1| PREDICTED: nuclear pore complex protein Nup214 isoform 1
[Monodelphis domestica]
Length = 2089
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 109/445 (24%), Positives = 186/445 (41%), Gaps = 88/445 (19%)
Query: 26 DEIGESIPIIRDDDSQFD-------LQNPPSLP------LAVSQLSHQLIFAAHSSGFCV 72
DE+ IP D QF P LP LAVS + L+FA +SGF V
Sbjct: 3 DEMDAMIPEREMKDFQFRALKKVRIFDAPEELPKERSSLLAVSN-KYGLVFAGGASGFRV 61
Query: 73 ARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH-- 130
T +++ + ++ V+++ + +P + V L+LS D TL+V + S
Sbjct: 62 FPTKNLLIQNRPGED--PNKIVEKVPNILIPMKFP-VHHLALSCDNLTLSVCMMSSEFGS 118
Query: 131 -IHFFSVHSLLN--KEIKPSFS---CSLNGSTYVKDIRWRKKPENSFLALSNVGRLYRGG 184
I FF V + N K+ K F+ + V D++W N + L G
Sbjct: 119 LIAFFDVRTFSNETKQQKRPFAYHKLMKDTGGMVIDMKW-----NPTIPTMVAVCLADGS 173
Query: 185 VNGPLEDVMDNVDAVE---------------WSVKGKFVAVAKKN---VLSILTSKLKER 226
+ V+ DAV+ WS KGK +AV K+N V + T + K+
Sbjct: 174 I-----AVLQVTDAVKVSASLPPSVAVTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKV 228
Query: 227 LPISLSFKSWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSK 283
+P ++S + V+V + W+ I DG E + ++ V+ K
Sbjct: 229 IPCPPFYES-------DHPVRVLDVLWIGTYVFTI--VYAAADGTLETSPDVVMAVLPKK 279
Query: 284 DGKITDASSEPVVLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDH 341
+ K ++F + ++ S T + FL Y+E W L ++A+ + +
Sbjct: 280 EEK------------HPEMFVNFMEPCYGSCTERQHHYFLNYIEDWDLVLAASAASTEVS 327
Query: 342 VVLLRWSLDDEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGV 401
+L R S + D +R + + + D +D+L +G+ ID S +V++ +
Sbjct: 328 -ILARQSDQTNWESWLLEDSSRAE----LPVTDKSDDSLPVGVGIDYTS---QVEIPIN- 378
Query: 402 EEKELSPYFILMCLTLDGKLNMFHV 426
+EK L P +LM L+ DG L F++
Sbjct: 379 DEKTLPPAPVLMLLSTDGVLCPFYM 403
>gi|33946327|ref|NP_005076.3| nuclear pore complex protein Nup214 [Homo sapiens]
gi|205831380|sp|P35658.2|NU214_HUMAN RecName: Full=Nuclear pore complex protein Nup214; AltName:
Full=214 kDa nucleoporin; AltName: Full=Nucleoporin
Nup214; AltName: Full=Protein CAN
Length = 2090
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 106/439 (24%), Positives = 188/439 (42%), Gaps = 79/439 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+ + S + I FF
Sbjct: 68 LIQNKPGDDPNKIVDKVQGL-LVPMKFP---IHHLALSCDNLTLSACMMSSEYGSIIAFF 123
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLY 181
V + N K+ K F+ + V D++W P + +A+ V
Sbjct: 124 DVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETV 183
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 184 K--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES--- 237
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPV 295
+ V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 238 ----DHPVRVLDVLWI--GTYVFAIVYAAADGTLETSPDVVMALLPKKEEK--------- 282
Query: 296 VLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
++F + ++ S T + +L+Y+E W L ++A+ + + V +L D
Sbjct: 283 ---HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILARQSD--- 334
Query: 354 NDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELS 407
IN + W+ + + D +D+L MG+ +D +V++ + +EK L
Sbjct: 335 ------QINWESWLLEDSSRAELPVTDKSDDSLPMGVVVD---YTNQVEITIS-DEKTLP 384
Query: 408 PYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 385 PAPVLMLLSTDGVLCPFYM 403
>gi|41016075|dbj|BAD07398.1| KIAA0023 splice variant 1 [Homo sapiens]
Length = 2091
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 106/439 (24%), Positives = 188/439 (42%), Gaps = 79/439 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 14 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 68
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+ + S + I FF
Sbjct: 69 LIQNKPGDDPNKIVDKVQGL-LVPMKFP---IHHLALSCDNLTLSACMMSSEYGSIIAFF 124
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLY 181
V + N K+ K F+ + V D++W P + +A+ V
Sbjct: 125 DVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETV 184
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 185 K--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES--- 238
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPV 295
+ V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 239 ----DHPVRVLDVLWI--GTYVFAIVYAAADGTLETSPDVVMALLPKKEEK--------- 283
Query: 296 VLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
++F + ++ S T + +L+Y+E W L ++A+ + + V +L D
Sbjct: 284 ---HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILARQSD--- 335
Query: 354 NDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELS 407
IN + W+ + + D +D+L MG+ +D +V++ + +EK L
Sbjct: 336 ------QINWESWLLEDSSRAELPVTDKSDDSLPMGVVVD---YTNQVEITIS-DEKTLP 385
Query: 408 PYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 386 PAPVLMLLSTDGVLCPFYM 404
>gi|76779204|gb|AAI05999.1| NUP214 protein [Homo sapiens]
Length = 2080
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 106/439 (24%), Positives = 188/439 (42%), Gaps = 79/439 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+ + S + I FF
Sbjct: 68 LIQNKPGDDPNKIVDKVQGL-LVPMKFP---IHHLALSCDNLTLSACMMSSEYGSIIAFF 123
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLY 181
V + N K+ K F+ + V D++W P + +A+ V
Sbjct: 124 DVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETV 183
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 184 K--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES--- 237
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPV 295
+ V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 238 ----DHPVRVLDVLWI--GTYVFAIVYAAADGTLETSPDVVMALLPKKEEK--------- 282
Query: 296 VLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
++F + ++ S T + +L+Y+E W L ++A+ + + V +L D
Sbjct: 283 ---HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILARQSD--- 334
Query: 354 NDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELS 407
IN + W+ + + D +D+L MG+ +D +V++ + +EK L
Sbjct: 335 ------QINWESWLLEDSSRAELPVTDKSDDSLPMGVVVD---YTNQVEITIS-DEKTLP 384
Query: 408 PYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 385 PAPVLMLLSTDGVLCPFYM 403
>gi|60688059|gb|AAH45620.2| Nucleoporin 214kDa [Homo sapiens]
Length = 2090
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 106/439 (24%), Positives = 188/439 (42%), Gaps = 79/439 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+ + S + I FF
Sbjct: 68 LIQNKPGDDPNKIVDKVQGL-LVPMKFP---IHHLALSCDNLTLSACMMSSEYGSIIAFF 123
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLY 181
V + N K+ K F+ + V D++W P + +A+ V
Sbjct: 124 DVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETV 183
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 184 K--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES--- 237
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPV 295
+ V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 238 ----DHPVRVLDVLWI--GTYVFAIVYAAADGTLETSPDVVMALLPKKEEK--------- 282
Query: 296 VLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
++F + ++ S T + +L+Y+E W L ++A+ + + V +L D
Sbjct: 283 ---HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILARQSD--- 334
Query: 354 NDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELS 407
IN + W+ + + D +D+L MG+ +D +V++ + +EK L
Sbjct: 335 ------QINWESWLLEDSSRAELPVTDKSDDSLPMGVVVD---YTNQVEITIS-DEKTLP 384
Query: 408 PYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 385 PAPVLMLLSTDGVLCPFYM 403
>gi|410979388|ref|XP_003996067.1| PREDICTED: nuclear pore complex protein Nup214 [Felis catus]
Length = 1907
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 105/439 (23%), Positives = 189/439 (43%), Gaps = 79/439 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+V + S + I FF
Sbjct: 68 LIQNKPGDDPNKIVDKVQSL-LVPMKFP---IHHLALSCDNLTLSVCMMSSEYGSIIAFF 123
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKKPENSF--------LALSNVGRLY 181
V + N K+ K F+ + V D++W + +A+ +
Sbjct: 124 DVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVASMMAVCLADGSIAVLQITETV 183
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K++ V + T + K+ +P ++S
Sbjct: 184 K--VCATLPSTV-AVTSVCWSPKGKQLAVGKQDGTVVQYLPTLQEKKVIPCPPFYES--- 237
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPV 295
+ V+V + W+ I DG E + ++ ++ K+ K
Sbjct: 238 ----DHPVRVLDVLWIGTYVFTI--VYAAADGTLETSPDVVMALLPKKEEK--------- 282
Query: 296 VLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
++F + ++ S T + +L+Y+E W L ++A+ + + V +L D
Sbjct: 283 ---HPEIFVNFMEPCYGSCTERKHHYYLSYIEEWDLVLAASAASTE--VSILARQTDQ-- 335
Query: 354 NDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELS 407
+N + W+ + + D +D+L MG+ ID + +V++ + +EK L
Sbjct: 336 -------VNWESWLLEDSSRAELPVTDKSDDSLPMGVAIDYTN---QVEITIN-DEKTLP 384
Query: 408 PYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 385 PAPVLMLLSTDGVLCPFYM 403
>gi|297270027|ref|XP_001118571.2| PREDICTED: nuclear pore complex protein Nup214-like [Macaca
mulatta]
Length = 2139
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 104/438 (23%), Positives = 188/438 (42%), Gaps = 77/438 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 54 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 108
Query: 79 IDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFFS 135
+ K + V ++ + VP + + L+LS D TL+V + S + I FF
Sbjct: 109 LIQNKPGDD--PNKIVDKVQGLLVPMKFP-IHHLALSCDNLTLSVCMMSSEYGSIIAFFD 165
Query: 136 VHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLYR 182
V + N K+ K F+ + V D++W P + +A+ V +
Sbjct: 166 VRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETVK 225
Query: 183 GGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVGD 239
V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 226 --VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES---- 278
Query: 240 CDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPVV 296
+ V+V + W+ I DG E + ++ ++ K+ K
Sbjct: 279 ---DHPVRVLDVLWIGTYVFTI--VYAAADGTLETSPDVVMALLPKKEEK---------- 323
Query: 297 LSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKN 354
++F + ++ S + +L+Y+E W L ++A+ + + V +L D
Sbjct: 324 --HPEIFVNFMEPCYGSCAERQHHYYLSYIEEWDLVLAASAASTE--VSILARQSDQ--- 376
Query: 355 DIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSP 408
IN + W+ + + D +D+L MG+ +D + +V++ + +EK L P
Sbjct: 377 ------INWESWLLEDSSRAELPVTDKSDDSLPMGVAVDYTN---QVEITIN-DEKTLPP 426
Query: 409 YFILMCLTLDGKLNMFHV 426
+LM L+ DG L F++
Sbjct: 427 TPVLMLLSTDGVLCPFYM 444
>gi|380789569|gb|AFE66660.1| nuclear pore complex protein Nup214 [Macaca mulatta]
gi|383408345|gb|AFH27386.1| nuclear pore complex protein Nup214 [Macaca mulatta]
gi|384940334|gb|AFI33772.1| nuclear pore complex protein Nup214 [Macaca mulatta]
Length = 2090
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 107/439 (24%), Positives = 188/439 (42%), Gaps = 79/439 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F + L+LS D TL+V + S + I FF
Sbjct: 68 LIQNKPGDDPNKIVDKVQGL-LVPMKFP---IHHLALSCDNLTLSVCMMSSEYGSIIAFF 123
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLY 181
V + N K+ K F+ + V D++W P + +A+ V
Sbjct: 124 DVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETV 183
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 184 K--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES--- 237
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPV 295
+ V+V + W+ I DG E + ++ ++ K+ K
Sbjct: 238 ----DHPVRVLDVLWIGTYVFTI--VYAAADGTLETSPDVVMALLPKKEEK--------- 282
Query: 296 VLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
++F + ++ S + +L+Y+E W L ++A+ + + V +L D
Sbjct: 283 ---HPEIFVNFMEPCYGSCAERQHHYYLSYIEEWDLVLAASAASTE--VSILARQSDQ-- 335
Query: 354 NDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELS 407
IN + W+ + + D +D+L MG+ +D +V++ + +EK L
Sbjct: 336 -------INWESWLLEDSSRAELPVTDKSDDSLPMGVAVD---YTNQVEITIN-DEKTLP 384
Query: 408 PYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 385 PTPVLMLLSTDGVLCPFYM 403
>gi|302845433|ref|XP_002954255.1| hypothetical protein VOLCADRAFT_94982 [Volvox carteri f. nagariensis]
gi|300260460|gb|EFJ44679.1| hypothetical protein VOLCADRAFT_94982 [Volvox carteri f. nagariensis]
Length = 1205
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 103/436 (23%), Positives = 178/436 (40%), Gaps = 38/436 (8%)
Query: 4 PTRIEIEEEIEGGRIESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIF 63
PT E+++ E F+F IG I + L+ P + + + H L+F
Sbjct: 752 PTVAELDQA------EDVGFYFKYIGG----ITFANLSSALEAPGATSMVTASSKHGLVF 801
Query: 64 AAHSSGFCVARTNDVID--AAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTL 121
+ SG RT+D++ K E+ ++ + P + L+LS DES L
Sbjct: 802 FSDLSGVYAVRTDDLLPRLCTKVASESYVTPRPADVCLWHKPLGG--ITQLALSTDESVL 859
Query: 122 AVSLYQSPHIHFFSVHSLLNKEIKPSFSCSLNGSTYVKD----IR-WRKKPENSFLALSN 176
A + + S + S S G V D +R R +SF+ +++
Sbjct: 860 AAVDGPAGSLGGIKQFSWCKAVEHRAGSGSGPGVFLVVDADRSLRVGRYGDADSFVVVAS 919
Query: 177 VGRLYRGGVNGPLEDVMDNVD--AVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSFK 234
G G P D D + A+ +S G+ A + S +++ + L
Sbjct: 920 SG--VEGADWCPAAD-GDTLGSLALAYSTGGQLAISAVVSGSSGSPLAVRQLACLPLEHP 976
Query: 235 SWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEEN----YLVQVIQSKDGKITDA 290
+G V +S+RW+ D +++G QL +DG + Y ++ S A
Sbjct: 977 QGLG-------VVFESVRWLLPDTLLLGGSQLVDDGNGDEGGDEYSSLLVLSGWAPNAVA 1029
Query: 291 SSEPVVLSFSDVFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLD 350
+ L ++D+ V +D P +GPYL V RWG + +RK +DDHV L+R +
Sbjct: 1030 PPQGAAL-YTDLPMFVEEDSTPRLSGPYLHAVAVPRWGALVRCHRKAVDDHVRLMRTAEG 1088
Query: 351 D--EKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSP 408
+ V + + + R+ +G N ++GL +D + L E+ L P
Sbjct: 1089 EFGSGGSPQAVAVTEEPMMIRLPNGPDGGSNYVLGLGLDTSMPLDPLPNPLSAEKPPLVP 1148
Query: 409 YFILMCLTLDGKLNMF 424
+L+ LT DG L ++
Sbjct: 1149 PPVLLILTSDGVLRVY 1164
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 53/191 (27%), Positives = 90/191 (47%), Gaps = 9/191 (4%)
Query: 246 VKVDSIRWVRRDCIIIGCFQLTEDGKEEN----YLVQVIQSKDGKITDASSEPVVLSF-S 300
V +S+RW+ D +++G QL +DG + Y ++ S G +A + P + +
Sbjct: 234 VVFESVRWLLPDTLLLGGSQLVDDGNGDEGGDEYSSLLVLS--GWAPNAVAPPQGAALHT 291
Query: 301 DVFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDD--EKNDIAV 358
D+ V +D P +GPYL V RWG + +RK +DDHV L+R + +
Sbjct: 292 DLPMFVEEDSTPRLSGPYLHAVAVPRWGALVRCHRKAVDDHVRLMRTAEGEFGSGGSPQA 351
Query: 359 VDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSPYFILMCLTLD 418
V + + + R+ +G N ++GL +D + L E+ L P +L+ LT D
Sbjct: 352 VAVTEEPMMIRLPNGPDGGSNYVLGLGLDTSMPLDPLPNPLSAEKPPLVPPPVLLILTSD 411
Query: 419 GKLNMFHVASV 429
G L ++ V
Sbjct: 412 GVLRVYGFGHV 422
>gi|432887582|ref|XP_004074925.1| PREDICTED: nuclear pore complex protein Nup214-like [Oryzias
latipes]
Length = 1989
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 94/396 (23%), Positives = 162/396 (40%), Gaps = 71/396 (17%)
Query: 59 HQLIFAAHSSGFCVARTNDVIDAAKE---MKENGTRCCVQELSVVDVPFENENVDILSLS 115
+ L F F V +T D++ A K GT L+ V V + L+LS
Sbjct: 47 YGLTFVGLERTFKVYQTQDILSADKADPGFIVKGT----PPLATVTVELA---LHHLALS 99
Query: 116 NDESTLAVSLYQSPHIH--FFSVHSLLN-----KEIKPSFSCSLNGSTYVKDIRWRKKPE 168
DE TL+V + ++ F+ V + +N K +F + T V+D++W
Sbjct: 100 CDELTLSVCGVEEENLFLTFYDVRTFINGKSTQKLPFATFRPDVPPGTLVQDLKWNPANV 159
Query: 169 NSFLALSNVGRLYRGGVNGPLEDVMD------------NVDAVEWSVKGKFVAVAKKN-V 215
+ A + G + + DV D + + WS KGK VA K N
Sbjct: 160 SMLAACLSDGSMM-------ILDVTDLVKMQAKLPSTSGITCIGWSPKGKQVAAGKMNGT 212
Query: 216 LSILTSKLKER--LPISLSFKSWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE 273
+S T L+E+ +P + S VK + W+R DG E
Sbjct: 213 VSQYTPALEEKKVIPCPHFYTS-------EDPVKALDVLWLR--TFAFAVTYAAADGSLE 263
Query: 274 N--YLVQV-IQSKDGKITDASSEPVVLSFSDVFSDVIDDILPSGTGPYLFLTYVERWGLA 330
LV V + KD K+ + L+F D+ + + FL+++E W L
Sbjct: 264 TPPELVLVSLPKKDEKV-----DTKYLNFGDIVYGTCTE-----RQHHYFLSHIEDWDLV 313
Query: 331 ISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVS 390
+A+ +I+ V+ + DD+ ++ +++ D +++ ++ D L +GL ID S
Sbjct: 314 FAASAASIEVSVIARQ---DDKMWELWILE---DAGRAELRVTESNEDTLPLGLAIDYTS 367
Query: 391 LCGKVKVQLGVEEKELSPYFILMCLTLDGKLNMFHV 426
+ ++ + +EK L P LM L+ +G L F +
Sbjct: 368 ---QQEIYI-TDEKTLPPAPTLMMLSTEGLLCSFAL 399
>gi|291411416|ref|XP_002721985.1| PREDICTED: nucleoporin 214kDa [Oryctolagus cuniculus]
Length = 2089
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 111/451 (24%), Positives = 190/451 (42%), Gaps = 100/451 (22%)
Query: 26 DEIGESIPIIRDDDSQFD-------LQNPPSLP------LAVSQLSHQLIFAAHSSGFCV 72
DE+ IP D QF +P LP LAVS + L+FA +SG V
Sbjct: 3 DEMDAMIPEREMKDFQFRALRKVRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQV 61
Query: 73 ARTNDVIDAAKEMKENGTRCC--VQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH 130
T +++ K + ++ + VQ L +V + F + L+LS D TL+V + S +
Sbjct: 62 FPTKNLLIQNK-LGDDPNKIVDKVQSL-LVPMKFP---IHHLALSCDNLTLSVCMMSSEY 116
Query: 131 ---IHFFSVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKKPENSFLALSNVGRLYR 182
I FF V + N K+ K F+ + V D++W S +A+ L
Sbjct: 117 GSIITFFDVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPT-VPSMVAVC----LAD 171
Query: 183 GGVNGPLEDVMDNVDA------------VEWSVKGKFVAVAKKN---VLSILTSKLKERL 227
G + + V D V V WS KGK +AV K+N V + T + K+ +
Sbjct: 172 GSIA--VLQVTDTVKVCATLPSTAAVTCVCWSPKGKQLAVGKQNGTVVQHLPTLQEKKVI 229
Query: 228 PISLSFKSWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKD 284
P ++S + V+V + W+ I DG E + ++ ++ K+
Sbjct: 230 PCPPFYES-------DHPVRVLDVLWIGTYVFTI--VYAAADGTLETSPDVVMALLPKKE 280
Query: 285 GKITDASSEPVVLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHV 342
K ++F + ++ S T + +L++VE W L ++A+ + + V
Sbjct: 281 EK------------HPEIFVNFMEPCYGSCTERQHHYYLSHVEEWDLVLAASAASTE--V 326
Query: 343 VLLRWSLDDEKNDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVK 396
+L D IN + W+ + + D +D+L +G+ ID + +
Sbjct: 327 SILARQSD---------QINWESWLLEDSSRAELPVTDKSDDSLPVGVAIDYTN-----Q 372
Query: 397 VQLGV-EEKELSPYFILMCLTLDGKLNMFHV 426
V++ V +EK L P +LM L+ DG L F++
Sbjct: 373 VEITVNDEKTLPPAPVLMLLSTDGVLCPFYM 403
>gi|332255363|ref|XP_003276802.1| PREDICTED: nuclear pore complex protein Nup214 isoform 2 [Nomascus
leucogenys]
Length = 2120
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 108/440 (24%), Positives = 188/440 (42%), Gaps = 81/440 (18%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +S + T ++
Sbjct: 54 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASSLQIFPTKNL 108
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F V L+LS D TL+V + + I FF
Sbjct: 109 LIQNKPGDDPNKIVDKVQGL-LVPMKFP---VHHLALSCDNLTLSVCMMSGEYGSIIAFF 164
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLY 181
V + N K+ K F+ + V D++W P + +A+ V
Sbjct: 165 DVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETV 224
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 225 K--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES--- 278
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPV 295
+ V+V + W+ I DG E + ++ ++ K+ K
Sbjct: 279 ----DHPVRVLDVLWIGTYVFTI--VYAAADGTLETSPDVVMALLPKKEEK--------- 323
Query: 296 VLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
++F + ++ S T + +L+Y+E W L ++A+ + + V +L D
Sbjct: 324 ---HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILARQTD--- 375
Query: 354 NDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGV-EEKEL 406
IN + W+ + + D +D+L MG+ +D + +V++ V +EK L
Sbjct: 376 ------QINWESWLLEDSSRAELPVTDKSDDSLPMGVAVDYTN-----QVEITVSDEKTL 424
Query: 407 SPYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 425 PPAPVLMLLSTDGVLCPFYM 444
>gi|441623187|ref|XP_004088890.1| PREDICTED: nuclear pore complex protein Nup214 [Nomascus
leucogenys]
Length = 2130
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 108/440 (24%), Positives = 188/440 (42%), Gaps = 81/440 (18%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +S + T ++
Sbjct: 54 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASSLQIFPTKNL 108
Query: 79 IDAAKEMKE-NGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFF 134
+ K + N VQ L +V + F V L+LS D TL+V + + I FF
Sbjct: 109 LIQNKPGDDPNKIVDKVQGL-LVPMKFP---VHHLALSCDNLTLSVCMMSGEYGSIIAFF 164
Query: 135 SVHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLY 181
V + N K+ K F+ + V D++W P + +A+ V
Sbjct: 165 DVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETV 224
Query: 182 RGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVG 238
+ V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 225 K--VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES--- 278
Query: 239 DCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPV 295
+ V+V + W+ I DG E + ++ ++ K+ K
Sbjct: 279 ----DHPVRVLDVLWIGTYVFTI--VYAAADGTLETSPDVVMALLPKKEEK--------- 323
Query: 296 VLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEK 353
++F + ++ S T + +L+Y+E W L ++A+ + + V +L D
Sbjct: 324 ---HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILARQTD--- 375
Query: 354 NDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGV-EEKEL 406
IN + W+ + + D +D+L MG+ +D + +V++ V +EK L
Sbjct: 376 ------QINWESWLLEDSSRAELPVTDKSDDSLPMGVAVDYTN-----QVEITVSDEKTL 424
Query: 407 SPYFILMCLTLDGKLNMFHV 426
P +LM L+ DG L F++
Sbjct: 425 PPAPVLMLLSTDGVLCPFYM 444
>gi|119608366|gb|EAW87960.1| nucleoporin 214kDa, isoform CRA_a [Homo sapiens]
Length = 1133
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 102/436 (23%), Positives = 187/436 (42%), Gaps = 73/436 (16%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFFS 135
+ K + V ++ + VP + + L+LS D TL+ + S + I FF
Sbjct: 68 LIQNKPGDDPNK--IVDKVQGLLVPMKFP-IHHLALSCDNLTLSACMMSSEYGSIIAFFD 124
Query: 136 VHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PENSFLALSNVGRLYRGGVNGPL 189
V + N K+ K F+ + V D++W P + L++ G + V +
Sbjct: 125 VRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLAD-GSIAVLQVTETV 183
Query: 190 EDVMD-----NVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVGDCD 241
+ V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 184 KVCATLPSTVAVTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES------ 237
Query: 242 VNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPVVLS 298
+ V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 238 -DHPVRVLDVLWI--GTYVFAIVYAAADGTLETSPDVVMALLPKKEEK------------ 282
Query: 299 FSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDI 356
++F + ++ S T + +L+Y+E W L ++A+ + + V +L D
Sbjct: 283 HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTE--VSILARQSD------ 334
Query: 357 AVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSPYF 410
IN + W+ + + D +D+L MG+ +D +V++ + +EK L P
Sbjct: 335 ---QINWESWLLEDSSRAELPVTDKSDDSLPMGVVVD---YTNQVEITIS-DEKTLPPAP 387
Query: 411 ILMCLTLDGKLNMFHV 426
+LM L+ DG L F++
Sbjct: 388 VLMLLSTDGVLCPFYM 403
>gi|303276296|ref|XP_003057442.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461794|gb|EEH59087.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 2567
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/161 (21%), Positives = 73/161 (45%), Gaps = 6/161 (3%)
Query: 307 IDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRDKW 366
+DD+ GP+L ++ WG++++ +RK D+ V ++R + + + V+++ D+
Sbjct: 342 VDDL----RGPFLHAAAIDAWGVSLACHRKAEDNQVCVVRCAGRGDADGPCVLEVEDDRC 397
Query: 367 IPRIKL--QDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSPYFILMCLTLDGKLNMF 424
+P I L +D ++N ++G+ +D G++ +L + T D KL +
Sbjct: 398 VPSIPLAGEDEDDENFVVGVAVDHTGAGGRMLHPADKSRDKLPQGPTVFVATADAKLVVM 457
Query: 425 HVASVSGPSCSPDVDFTLSDAEDDVPAVVPVDSNLPRVYSG 465
+ ++ S + A +VPA + P G
Sbjct: 458 RLGNLDDAKASEYASACVEAATKEVPAAWAATATSPTTSPG 498
>gi|255076851|ref|XP_002502091.1| predicted protein [Micromonas sp. RCC299]
gi|226517356|gb|ACO63349.1| predicted protein [Micromonas sp. RCC299]
Length = 2621
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 95/416 (22%), Positives = 159/416 (38%), Gaps = 78/416 (18%)
Query: 65 AHSSGFCVARTNDVIDAAKEMKENGTR---CCVQELSVVDVPFENENVDILSLSNDESTL 121
A G V T ++ A KE R C + + +P +V ++ S E+ L
Sbjct: 56 ADGGGVRVIETTALVAHAASTKEKMPREQLCPPDAFATIAIP----DVSCVAFSPCETIL 111
Query: 122 AVSLYQSPHIHFFSVHSLLNKEIKPS-------FSCSLNG----STYVKDIRWRKKPEN- 169
AV + P + F +L+ +P+ F+ + G S V+D W+ +
Sbjct: 112 AVCV--GPQVRFLETRALVACAREPTATPTPEPFAMASLGADESSDRVRDFLWQPGGASG 169
Query: 170 --SFLAL-----SNVGRLYRG------GVNGPLEDVMDNVDAVEWSVKGK-------FVA 209
++LAL S+ G+L+ G GV D+ D V AV + VA
Sbjct: 170 GLAYLALAGPEDSDEGKLFVGTAGADDGV-----DIADRVVAVAMAPPPDEDPDVSIPVA 224
Query: 210 VAKKNVLSILTSKLKERLPISLSFKSWVG---DCDVNCSVKVDSIRWVRRDCI-IIGCFQ 265
++ +KLK+R + +G D D V V+ + + R + +I +
Sbjct: 225 AWATAAGDVVYAKLKDRGLAAEMRGVAIGKIDDGDGEAEVDVEGLAFGPRPGLALIVAAR 284
Query: 266 LTEDGKEENYLVQVIQSKDGKITDASSEPVVLSFSDVFS--------------DVIDDIL 311
ED + LV DA + P V +F DV S D+ DD
Sbjct: 285 EREDPDRHHVLV----------LDAGT-PEVSNFWDVVSPPPAPSCTRLEGTFDIDDDET 333
Query: 312 PSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRDKWIPRIK 371
GPYL V +WG+A++++RK D+ + L + + I+ D+ +P I
Sbjct: 334 AGLRGPYLHGLTVRQWGVALASHRKAWDNQLCALE--VAGSGAGPRALQIDDDRCVPAIP 391
Query: 372 LQDNGND-NLIMGLCIDKVSLCGKVKVQLGVEEKELSPYFILMCLTLDGKLNMFHV 426
L D N + GL D G++ L ++ T D ++ + V
Sbjct: 392 LAGADEDSNFVTGLAADHGGRGGRMPHPADKSRPPLPQGPTVLVATADHRVTVMRV 447
>gi|301758876|ref|XP_002915256.1| PREDICTED: nuclear pore complex protein Nup214-like [Ailuropoda
melanoleuca]
Length = 2083
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 101/438 (23%), Positives = 186/438 (42%), Gaps = 77/438 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFFS 135
+ K + V ++ + VP + + L+LS D TL+V + S + I FF
Sbjct: 68 LIQNKPGDD--PNKIVDKVQGLLVPMKFP-IHHLALSCDNLTLSVCMMSSEYGSIIAFFD 124
Query: 136 VHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKKPENSF--------LALSNVGRLYR 182
V + N K+ K F+ + V D++W + +A+ + +
Sbjct: 125 VRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMMAVCLADGSIAVLQITETVK 184
Query: 183 GGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVGD 239
V L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 185 --VCATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES---- 237
Query: 240 CDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPVV 296
+ V+V + W+ I DG E + ++ ++ K+ K
Sbjct: 238 ---DHPVRVLDVLWIGTYVFTI--VYAAADGTLETSPDVVMALLPKKEEK---------- 282
Query: 297 LSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKN 354
++F + ++ S T + +L+Y+E W ++A+ + + V +L D
Sbjct: 283 --HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDFVLAASAASTE--VSILARQTD---- 334
Query: 355 DIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSP 408
N + W+ + + D +D+L +G+ ID + +V++ + +EK L P
Sbjct: 335 -----QTNWESWLLEDSSRAELPVTDKSDDSLPVGVAIDYTN---QVEITIN-DEKTLPP 385
Query: 409 YFILMCLTLDGKLNMFHV 426
+LM L+ DG L F++
Sbjct: 386 APVLMLLSTDGVLCPFYM 403
>gi|281349501|gb|EFB25085.1| hypothetical protein PANDA_003258 [Ailuropoda melanoleuca]
Length = 2078
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 100/435 (22%), Positives = 185/435 (42%), Gaps = 77/435 (17%)
Query: 22 DFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDVIDA 81
DF F + + +R DS +L S LAVS + L+FA +SG + T +++
Sbjct: 1 DFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNLLIQ 55
Query: 82 AKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFFSVHS 138
K + V ++ + VP + + L+LS D TL+V + S + I FF V +
Sbjct: 56 NKPGDD--PNKIVDKVQGLLVPMKFP-IHHLALSCDNLTLSVCMMSSEYGSIIAFFDVRT 112
Query: 139 LLN--KEIKPSFSCSL---NGSTYVKDIRWRKKPENSF--------LALSNVGRLYRGGV 185
N K+ K F+ + V D++W + +A+ + + V
Sbjct: 113 FSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMMAVCLADGSIAVLQITETVK--V 170
Query: 186 NGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVGDCDV 242
L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 171 CATLPSTVA-VTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES------- 222
Query: 243 NCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPVVLSF 299
+ V+V + W+ I DG E + ++ ++ K+ K
Sbjct: 223 DHPVRVLDVLWIGTYVFTI--VYAAADGTLETSPDVVMALLPKKEEK------------H 268
Query: 300 SDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIA 357
++F + ++ S T + +L+Y+E W ++A+ + + V +L D
Sbjct: 269 PEIFVNFMEPCYGSCTERQHHYYLSYIEEWDFVLAASAASTE--VSILARQTD------- 319
Query: 358 VVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSPYFI 411
N + W+ + + D +D+L +G+ ID + +V++ + +EK L P +
Sbjct: 320 --QTNWESWLLEDSSRAELPVTDKSDDSLPVGVAIDYTN---QVEITIN-DEKTLPPAPV 373
Query: 412 LMCLTLDGKLNMFHV 426
LM L+ DG L F++
Sbjct: 374 LMLLSTDGVLCPFYM 388
>gi|301613482|ref|XP_002936237.1| PREDICTED: nuclear pore complex protein Nup214 [Xenopus (Silurana)
tropicalis]
Length = 2070
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 106/441 (24%), Positives = 188/441 (42%), Gaps = 75/441 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E+ DF F ++ + IR DS DL S L +S + +F G T DV
Sbjct: 12 ETKDFQFRQLKK----IRLFDSPVDLPKERSNLLVISN-KYGFLFVGGPMGLKFFHTKDV 66
Query: 79 IDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAV---SLYQSPHIHFFS 135
+ A + ++ V + VP + L+LS+D TL+V S Q + F+
Sbjct: 67 LVAVRPGED--ANKIVMGPQGIHVPMSCP-IHHLALSSDNLTLSVCMTSAEQGTFVSFYD 123
Query: 136 VHSLLN--KEIKPSFSCS---LNGSTYVKDIRWRKK-PENSFLALSNVGRLYRGGVNGPL 189
V +LLN K+ K F+ + S+ V D++W P + LS+ G + V +
Sbjct: 124 VRTLLNESKQQKTPFASYKLLRDPSSSVTDLQWNPALPTMVAVCLSD-GSISVLQVTDTV 182
Query: 190 EDVMD-----NVDAVEWSVKGKFVAVAKKN--VLSILTSKLKERLPISLSFKSWVGDC-D 241
+ V +V WS KGK +AV K+N V+ L S L+E+ I C D
Sbjct: 183 SMFANLPAALGVTSVCWSPKGKQLAVGKQNGTVVQYLPS-LQEKKVIPCP------SCYD 235
Query: 242 VNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPVVLS 298
+ VKV + W+ + DG E +V ++ K+ K + L+
Sbjct: 236 PDNPVKVLDVLWL--STYVFTVVYAAADGSLEASPQLVVVILPKKEDKRAER-----FLN 288
Query: 299 FSDVFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLL--------RWSLD 350
F+++ + + + FL Y+E W + ++A+ ++D V+ +W L+
Sbjct: 289 FTEMCFSICSE-----RQHHFFLNYIEDWEVLLAASAASVDVGVIARPPDQAGWEQWLLE 343
Query: 351 DEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKEL-SPY 409
D + + ++ +D L MG+ +D C ++++ + E ++ P
Sbjct: 344 DSSRA-------------EMPMTESNDDTLPMGVVLDYT--C---QLEVFISESQILPPV 385
Query: 410 FILMCLTLDGKLNMFHVASVS 430
+L+ L+ DG L FHV +++
Sbjct: 386 PVLLLLSTDGVLCPFHVVNLT 406
>gi|170285220|gb|AAI61092.1| LOC100145463 protein [Xenopus (Silurana) tropicalis]
Length = 853
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 104/441 (23%), Positives = 187/441 (42%), Gaps = 75/441 (17%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E+ DF F ++ + IR DS DL S L +S + +F G V T DV
Sbjct: 12 ETKDFQFRQLKK----IRLFDSPVDLPKERSNLLVISN-KYGFLFVGGPMGLKVFHTKDV 66
Query: 79 IDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLY---QSPHIHFFS 135
+ A + E+ + V + VP + L+LS+D TL+V + Q + F+
Sbjct: 67 LVAVRP-GEDANKI-VMGPQGIHVPMSCP-IHHLALSSDNLTLSVCMTSAEQGTFVSFYD 123
Query: 136 VHSLLN-----KEIKPSFSCSLNGSTYVKDIRWRKK-PENSFLALSNVGRLYRGGVNGPL 189
V +LLN K S+ + S+ V D++W P + LS+ G + V +
Sbjct: 124 VRTLLNESKQQKTPFASYKLLRDPSSSVTDLQWNPALPTMVAVCLSD-GSISVLQVTDTV 182
Query: 190 EDVMD-----NVDAVEWSVKGKFVAVAKKN--VLSILTS-KLKERLPISLSFKSWVGDCD 241
+ V +V WS KGK +AV K+N V+ L S + K+ +P + D
Sbjct: 183 SMFANLPAALGVTSVCWSPKGKQLAVGKQNGTVVQYLPSLQEKKVIPCPSCY-------D 235
Query: 242 VNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPVVLS 298
+ VKV + W+ + DG E +V ++ K+ K + L+
Sbjct: 236 PDNPVKVLDVLWL--STYVFTVVYAAADGSLEASPQLVVVILPKKEDKRAER-----FLN 288
Query: 299 FSDVFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLL--------RWSLD 350
F+++ + S + FL Y+E W + ++A+ ++D V+ +W L+
Sbjct: 289 FTEMCFSIC-----SERQHHFFLNYIEDWEVLLAASAASVDVGVIARPPDQAGWEQWLLE 343
Query: 351 DEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKEL-SPY 409
D + + ++ +D L MG+ +D ++++ + E ++ P
Sbjct: 344 DSSRA-------------EMPMTESNDDTLPMGVVLDYT-----CQLEVFISESQILPPV 385
Query: 410 FILMCLTLDGKLNMFHVASVS 430
+L+ L+ DG L FHV +++
Sbjct: 386 PVLLLLSTDGVLCPFHVVNLT 406
>gi|307170863|gb|EFN62974.1| Nuclear pore complex protein Nup214 [Camponotus floridanus]
Length = 1547
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 99/414 (23%), Positives = 163/414 (39%), Gaps = 69/414 (16%)
Query: 42 FDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVD 101
FDL+ P +P A + L+ A G +D I K +EN + EL +
Sbjct: 24 FDLKTPTYIPAACN-----LLVADKKRGLLYIGHDDKITVLKPAEENNAEWKI-ELQLPS 77
Query: 102 VPFENENVDILSLSNDESTLAVSLYQSPHIHFFSVHSL----LNKEIKPSFSCSLNGSTY 157
V V ++++ D + +AV++ I + HSL LN I+ + S L+ +Y
Sbjct: 78 V------VSKIAINCDYNYIAVAVALKSTILIYDAHSLTRNSLNLLIETTSSYKLD--SY 129
Query: 158 VKDIRWRKKPENSFLALSN---VG----RLYRGGVNGPLEDVMDNVD------AVEWSVK 204
V DIRW F +++ +G R+ V P ++ N+ WS K
Sbjct: 130 VTDIRWNPTVPTMFCTITSDCTIGSFQLRIDENSV--PKISILSNISNEPQALCAAWSPK 187
Query: 205 GKFVAVAKKNVLSILTSKLKERLPISLSFKSWVGDCDVNCSVKVDSIRWVRRDCIIIGCF 264
GK VAV KN + + P +L + + +V SI W+ C
Sbjct: 188 GKQVAVGCKN------GDIVQLKPDTLKIARTIAGPSPSIG-EVISILWISNYQF---CA 237
Query: 265 QLTEDGKEENYLVQVIQSKDGKI--TDASSEPVVLSFSDVFSDVIDDILPSGTGPYL--F 320
+ N L +I + G+I T E + F D G G L +
Sbjct: 238 AYLNSERHTNVL--IIDAPKGEINATFTCFEDITYGFPDA----------EGEGSILRYY 285
Query: 321 LTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRDKWIPRIKLQDNGNDNL 380
+V WGL I+ + + + VL N +VD R + +P ++ +
Sbjct: 286 FEHVPEWGLIIAGSSTS-SEIAVLGTTDGGANWNQWQLVDSGRAQ-LPLMRT----TEIY 339
Query: 381 IMGLCIDKVSLCGKVKVQLGVEEKELSPYFILMCLTLDGKLNMFHVASVSGPSC 434
+GL +DK + K+ G + P IL L G+L FH+ +++ P+C
Sbjct: 340 PLGLAVDKSPIN---KLPWGTDTTLPHPVPILHILGTSGQLCSFHMVNLT-PNC 389
>gi|348535067|ref|XP_003455023.1| PREDICTED: nuclear pore complex protein Nup214-like [Oreochromis
niloticus]
Length = 1997
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 96/397 (24%), Positives = 168/397 (42%), Gaps = 55/397 (13%)
Query: 52 LAVSQLSHQLIFAAHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDI 111
LA+S + L F F V T D++ A K + + V+ + V+ +
Sbjct: 41 LAISN-KYGLTFVGLDRTFKVYLTRDILAADK--GDGNSNEIVKGIPVLAEVTVELVLHH 97
Query: 112 LSLSNDESTLAV---SLYQSPHIHFFSVHSLLNK----EIKPSF-SCSLNGSTYVKDIRW 163
L+LS DE TL+V S + F+ V + +NK ++ +F ++ V+D++W
Sbjct: 98 LALSCDELTLSVCGMSEEAGLSLTFYDVRTFMNKGKLQKLPFAFHQPAVAPDVLVQDLKW 157
Query: 164 RKKPENSFLALSNVGRLYRGGVNGPLEDVMDNVDA------------VEWSVKGKFVAVA 211
+ S LA+ L G + + DV D+V+ + WS KGK VA
Sbjct: 158 NPV-QVSVLAVC----LSDGSIM--ILDVTDSVNVQARLPSSSGITCICWSPKGKQVAAG 210
Query: 212 KKN-VLSILTSKLKERLPISLSFKSWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDG 270
K + ++ T L+E+ I G+ VK + W++ + DG
Sbjct: 211 KMDGTVTQYTPGLEEKKVIPCPQFFTPGE-----PVKAVDVLWLK--TYVFAVAYAAGDG 263
Query: 271 KEENYLVQVIQSKDGKITDASSEPVVLSFSD-VFSDVIDDILPSGTGPYLFLTYVERWGL 329
E V+ S K D EP L+F + VF + + FL++VE W L
Sbjct: 264 SLETPPELVLISLPKK--DEKVEPKFLNFGEFVFGSCTERQH------HYFLSHVEDWDL 315
Query: 330 AISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKV 389
+A+ +I+ V+ + D+K ++ V + D + + + D L +GL ID
Sbjct: 316 VFAASAASIEVSVI----ARQDDKVNMWEVWLLEDSSRAELPVTETNEDTLPLGLAIDYT 371
Query: 390 SLCGKVKVQLGVEEKELSPYFILMCLTLDGKLNMFHV 426
S + ++ + +EK L P ++ L+ +G L F +
Sbjct: 372 S---QQEIHI-TDEKTLPPAPTMLMLSTEGLLCPFAL 404
>gi|405962689|gb|EKC28340.1| hypothetical protein CGI_10011562 [Crassostrea gigas]
Length = 2007
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 89/420 (21%), Positives = 150/420 (35%), Gaps = 91/420 (21%)
Query: 46 NPPSLPL-AVSQL-----SHQLIFAAHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSV 99
+PP P V+QL + L F ++GF V +T+DV D +++ R L V
Sbjct: 52 DPPDSPFQGVTQLIACSSKYGLTFLGTTNGFKVIKTSDVTDI--DIRHAAERTT---LIV 106
Query: 100 VDVPFENE-----NVDILSLSNDESTLAVSLYQSP--HIHFFSVHSLLNKEIKPSFSCSL 152
D P V L LS DE TLAV + + I+ + V N+ +
Sbjct: 107 ADPPIRTSVQVEGTVSHLCLSCDELTLAVVVTCNGIIQIYIYDVKGFANQGQNVDPFQKI 166
Query: 153 NGSTYVKDIRWRKKPENSFLALSNVGRLYRGGVNGPLEDVMDNVDAVE------------ 200
+ V D+ W + + GR L +V +N V
Sbjct: 167 RVTCEVLDMAWNPTQPTLLVVCMSDGRAQ-------LLEVAENFKIVASLPPVVSACSVC 219
Query: 201 WSVKGKFVAVAKKN-VLSILTSKLKERLPISLSFKSWVGDCDV--NCSVKVDSIRWVRRD 257
WS KGK + + N L +LK++ + W + + +V + W+
Sbjct: 220 WSPKGKQLVIGTGNGTLMQFDHELKKK-------RDWDRPSVLPEDQQFEVKGVSWISTF 272
Query: 258 CIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVVLSFSDVFSDVIDDILPSGTG- 316
+ F ++ +V SKDG + F + +D +G G
Sbjct: 273 MFLASYFPRGSP-DDQPTVVLTSGSKDG--------------APAFINFLDPCYGNGEGI 317
Query: 317 -PYLFLTYVERWGLAISANR---------KNIDDHVVLLRWSLDDEKNDIAVVDINRDKW 366
+ Y+ +W + + + K+ DD RW+LDD
Sbjct: 318 ASTMHFNYIPQWEMVLCTSSTACETAIVGKHFDDKNTFERWTLDDAARA----------- 366
Query: 367 IPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSPYFILMCLTLDGKLNMFHV 426
+ L ++ D +G ID S + Q+ V E+ P + M L+ DG L +++
Sbjct: 367 --ELPLTEDYADTFPLGAAIDFSS-----QFQVPVGEQRYPPCPMFMLLSTDGVLVTYYM 419
>gi|28972035|dbj|BAC65471.1| mKIAA0023 protein [Mus musculus]
Length = 2112
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 110/463 (23%), Positives = 192/463 (41%), Gaps = 100/463 (21%)
Query: 11 EEIEGGRIESNDFFFDEIGESIPIIRDDDSQFD-------LQNPPSLP------LAVSQL 57
E EGG + DE+ IP D QF +P LP L +S
Sbjct: 21 EHPEGGTMG------DEMDAMIPEREMKDFQFRALKKVRIFDSPEELPKERSSVLTISN- 73
Query: 58 SHQLIFAAHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSND 117
+ ++FA ++G V T ++ K + V + ++VP + V L+LS D
Sbjct: 74 KYGMLFAGGTNGLNVFPTKSLLIQNKPGDD--PNKIVDTIQGLNVPMKFP-VHHLALSCD 130
Query: 118 ESTLAVSLYQSPH---IHFFSVHSLLN--KEIKPSFS---CSLNGSTYVKDIRWRKKPEN 169
TL+ + S + I FF V + N K +K F+ S + S V D++W
Sbjct: 131 SLTLSACMMSSEYGSIIAFFDVRTFSNQAKPLKRPFTYHKVSNDASGMVNDMKWNPT-VP 189
Query: 170 SFLALSNVGRLYRGGVNGPLEDVMD------------NVDAVEWSVKGKFVAVAKKN--- 214
S +A+ L G ++ + V D V V WS KGK +AV K+N
Sbjct: 190 SMVAVC----LADGSIS--VLQVTDVVKVCATLPPSTGVTCVCWSPKGKQLAVGKQNGTV 243
Query: 215 VLSILTSKLKERLPISLSFKSWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE- 273
V + T + K+ +P ++S + V+V + W+ I DG E
Sbjct: 244 VQYLPTLQEKKVIPCPPFYES-------DHPVRVLDVLWIGTYVFTI--VYAGADGTLET 294
Query: 274 --NYLVQVIQSKDGKITDASSEPVVLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGL 329
+ ++ ++ K+ K ++F + ++ S T + +L+Y+E W L
Sbjct: 295 CPDVVMALLPKKEEK------------HPEIFVNFMEPCYSSCTERQHHYYLSYIEEWDL 342
Query: 330 AISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRDKWI------PRIKLQDNGNDNLIMG 383
++A+ + + ++ +ND N + W+ + + D +D+L MG
Sbjct: 343 VLAASAASTEVSILA-------RQND----QTNWESWLLEDSSRAELPVTDKSDDSLPMG 391
Query: 384 LCIDKVSLCGKVKVQLGVEEKELSPYFILMCLTLDGKLNMFHV 426
+ ID + +V+V + EEK L P +L+ L+ DG L F++
Sbjct: 392 VAIDYTN---EVEVTIN-EEKTLPPAPVLLLLSTDGVLCPFYM 430
>gi|237823848|pdb|3FMO|A Chain A, Crystal Structure Of The Nucleoporin Nup214 In Complex
With The Dead- Box Helicase Ddx19
gi|237823850|pdb|3FMP|A Chain A, Crystal Structure Of The Nucleoporin Nup214 In Complex
With The Dead- Box Helicase Ddx19
gi|237823852|pdb|3FMP|C Chain C, Crystal Structure Of The Nucleoporin Nup214 In Complex
With The Dead- Box Helicase Ddx19
Length = 450
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 103/436 (23%), Positives = 190/436 (43%), Gaps = 73/436 (16%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFFS 135
+ K + V ++ + VP + + L+LS D TL+ + S + I FF
Sbjct: 68 LIQNKPGDDPNK--IVDKVQGLLVPMKFP-IHHLALSCDNLTLSACMMSSEYGSIIAFFD 124
Query: 136 VHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PENSFLALSNVGRLYRGGVNGPL 189
V + N K+ K F+ + V D++W P + L++ G + V +
Sbjct: 125 VRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLAD-GSIAVLQVTETV 183
Query: 190 EDVMD-----NVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVGDCD 241
+ V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 184 KVCATLPSTVAVTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES------ 237
Query: 242 VNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPVVLS 298
+ V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 238 -DHPVRVLDVLWI--GTYVFAIVYAAADGTLETSPDVVMALLPKKEEK------------ 282
Query: 299 FSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDI 356
++F + ++ S T + +L+Y+E W L ++A+ + + +L R S D+
Sbjct: 283 HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTEVS-ILARQS--DQ---- 335
Query: 357 AVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSPYF 410
IN + W+ + + D +D+L MG+ +D + +V++ + +EK L P
Sbjct: 336 ----INWESWLLEDSSRAELPVTDKSDDSLPMGVVVDYTN---QVEITIS-DEKTLPPAP 387
Query: 411 ILMCLTLDGKLNMFHV 426
+LM L+ DG L F++
Sbjct: 388 VLMLLSTDGVLCPFYM 403
>gi|147835462|emb|CAN61117.1| hypothetical protein VITISV_022568 [Vitis vinifera]
Length = 492
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/59 (45%), Positives = 37/59 (62%)
Query: 872 DLINQLIELERHFNSLELNKFGENDGVHGCQRRPQSRFGTSRHIQSLHSLHTTMSSQIA 930
DL NQLIE ERHFN++ELNKFG+N G + + GT+ + Q+L S T + +A
Sbjct: 354 DLTNQLIESERHFNAIELNKFGKNGGSRVSYSQALTNGGTNTYSQALPSRSTIYNQGLA 412
>gi|395844574|ref|XP_003795033.1| PREDICTED: nuclear pore complex protein Nup214 [Otolemur garnettii]
Length = 2097
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 58/116 (50%), Gaps = 23/116 (19%)
Query: 318 YLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRDKWI------PRIK 371
+ FL+YVE W L ++A+ + + V +L D IN + W+ +
Sbjct: 304 HYFLSYVEEWDLVLAASAASTE--VSILARQSD---------QINWESWLLEDSSRAELP 352
Query: 372 LQDNGNDNLIMGLCIDKVSLCGKVKVQLGV-EEKELSPYFILMCLTLDGKLNMFHV 426
+ D +D+L MG+ ID + +V++ + +EK L P +LM L+ DG L F++
Sbjct: 353 VTDKNDDSLPMGVAIDYTN-----QVEIAISDEKTLPPAPVLMLLSTDGVLCPFYM 403
>gi|124378033|ref|NP_758472.2| nuclear pore complex protein Nup214 [Mus musculus]
gi|206558312|sp|Q80U93.2|NU214_MOUSE RecName: Full=Nuclear pore complex protein Nup214; AltName:
Full=214 kDa nucleoporin; AltName: Full=Nucleoporin
Nup214
Length = 2085
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 105/442 (23%), Positives = 188/442 (42%), Gaps = 85/442 (19%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S L +S + ++FA ++G V T +
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSVLTISN-KYGMLFAGGTNGLNVFPTKSL 67
Query: 79 IDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFFS 135
+ K + V + ++VP + V L+LS D TL+ + S + I FF
Sbjct: 68 LIQNKPGDD--PNKIVDTIQGLNVPMKFP-VHHLALSCDSLTLSACMMSSEYGSIIAFFD 124
Query: 136 VHSLLN--KEIKPSFS---CSLNGSTYVKDIRWRKKPENSFLALSNVGRLYRGGVNGPLE 190
V + N K +K F+ S + S V D++W S +A+ L G ++ +
Sbjct: 125 VRTFSNQAKPLKRPFTYHKVSNDASGMVNDMKWNPT-VPSMVAVC----LADGSIS--VL 177
Query: 191 DVMD------------NVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKS 235
V D V V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 178 QVTDVVKVCATLPPSTGVTCVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES 237
Query: 236 WVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASS 292
+ V+V + W+ I DG E + ++ ++ K+ K
Sbjct: 238 -------DHPVRVLDVLWIGTYVFTI--VYAGADGTLETCPDVVMALLPKKEEK------ 282
Query: 293 EPVVLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLD 350
++F + ++ S T + +L+Y+E W L ++A+ + + ++
Sbjct: 283 ------HPEIFVNFMEPCYSSCTERQHHYYLSYIEEWDLVLAASAASTEVSILA------ 330
Query: 351 DEKNDIAVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEK 404
+ND N + W+ + + D +D+L MG+ ID + +V+V + EEK
Sbjct: 331 -RQND----QTNWESWLLEDSSRAELPVTDKSDDSLPMGVAIDYTN---EVEVTIN-EEK 381
Query: 405 ELSPYFILMCLTLDGKLNMFHV 426
L P +L+ L+ DG L F++
Sbjct: 382 TLPPAPVLLLLSTDGVLCPFYM 403
>gi|126031606|pdb|2OIT|A Chain A, Crystal Structure Of The N-Terminal Domain Of The Human
Proto-Oncogene Nup214CAN
Length = 434
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 103/436 (23%), Positives = 190/436 (43%), Gaps = 73/436 (16%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFFS 135
+ K + V ++ + VP + + L+LS D TL+ + S + I FF
Sbjct: 68 LIQNKPGDDPNK--IVDKVQGLLVPMKFP-IHHLALSCDNLTLSACMMSSEYGSIIAFFD 124
Query: 136 VHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PENSFLALSNVGRLYRGGVNGPL 189
V + N K+ K F+ + V D++W P + L++ G + V +
Sbjct: 125 VRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLAD-GSIAVLQVTETV 183
Query: 190 EDVMD-----NVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVGDCD 241
+ V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 184 KVCATLPSTVAVTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES------ 237
Query: 242 VNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPVVLS 298
+ V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 238 -DHPVRVLDVLWI--GTYVFAIVYAAADGTLETSPDVVMALLPKKEEK------------ 282
Query: 299 FSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDI 356
++F + ++ S T + +L+Y+E W L ++A+ + + +L R S D+
Sbjct: 283 HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTEVS-ILARQS--DQ---- 335
Query: 357 AVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSPYF 410
IN + W+ + + D +D+L MG+ +D + +V++ + +EK L P
Sbjct: 336 ----INWESWLLEDSSRAELPVTDKSDDSLPMGVVVDYTN---QVEITIS-DEKTLPPAP 387
Query: 411 ILMCLTLDGKLNMFHV 426
+LM L+ DG L F++
Sbjct: 388 VLMLLSTDGVLCPFYM 403
>gi|260789029|ref|XP_002589550.1| hypothetical protein BRAFLDRAFT_81507 [Branchiostoma floridae]
gi|229274730|gb|EEN45561.1| hypothetical protein BRAFLDRAFT_81507 [Branchiostoma floridae]
Length = 1316
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 73/336 (21%), Positives = 123/336 (36%), Gaps = 85/336 (25%)
Query: 112 LSLSNDESTLAVSLYQSPHIHF--FSVHSLLNKEIKPSF---SCSLNGSTYVKDIRWRKK 166
L LS DE TL+V + + F + L+ + K F S N V D++W
Sbjct: 97 LGLSCDELTLSVCYTREDGLALVMFDTRAFLSNDPKAPFCKTKLSTNKGVTVADVKWNPA 156
Query: 167 PENSFLALSNVGRLYRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTSKLKER 226
N L + G L + +V DN N + ++ +
Sbjct: 157 HPNMLACLRSDGTLV-------ILEVTDN------------------NSMKMMAN----- 186
Query: 227 LPISLSFKSWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGK 286
F + G +N I W+ ++ +T+D + + +V + KD +
Sbjct: 187 ------FDNSKGITAMN------DILWLSTYVFVVLYDVITDDASQPSLVVVSLPKKDEQ 234
Query: 287 ITDASSEPVVLSFSDV-FSDVIDDILPSGTGPYLFLTYVERWGLAISANR---------K 336
+ +F ++ F D L FL +V+ WGL + A+ K
Sbjct: 235 RSTN-----FTNFEEICFGGGEDRDL------KYFLHHVDEWGLVVCASSRSMEAAVIGK 283
Query: 337 NIDDHVVLLRWSLDDEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVK 396
N++D V W+LDD + N D+ P MG+ +D L
Sbjct: 284 NLEDKNVWELWNLDDAARAELPLSTNHDETYP-------------MGMAVD---LTSTNH 327
Query: 397 VQLGVEEKELSPYFILMCLTLDGKLNMFHVASVSGP 432
+ +G E + L P I M L+ DG L F++ + + P
Sbjct: 328 IPIG-ENQSLPPMPIFMLLSTDGLLCPFYMVNTAPP 362
>gi|350416222|ref|XP_003490879.1| PREDICTED: hypothetical protein LOC100748493 [Bombus impatiens]
Length = 1616
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 87/351 (24%), Positives = 138/351 (39%), Gaps = 61/351 (17%)
Query: 109 VDILSLSNDESTLAVSLYQSPHIHFFSVHSLLNKEIKPSFSC---SLNGSTYVKDIRWRK 165
V L+L+ D + LAV+ P + + +L ++ +L+G V D+RW
Sbjct: 77 VSKLTLNCDCTYLAVA--HGPMVLIYDAEALTKNNLQLLHKVKISTLSGDILVYDLRWNP 134
Query: 166 KPENSFLALSNVGRLYRGG------------------VNGPLEDVMDNVDAVEWSVKGKF 207
L V Y G + L + N WS KGK
Sbjct: 135 AVPR---MLCTVASDYTIGSFKIKEKKKATTELDTFELKEKLNNEHINALCAAWSPKGKQ 191
Query: 208 VAVAKKNVLSILTSKLKERLPISLSF---KSWVGDCDVNCSVKVDSIRWVRRDCIIIGCF 264
+ V KN SI+ +LK L I+ + ++G+ V SI WV
Sbjct: 192 IVVGCKNG-SIV--QLKPDLKIARTIPGPNPYIGE--------VISILWVSNYQFCAAYL 240
Query: 265 QLTEDGKEENYLVQVIQSKDGKITDASSEPVVLSFSDVFSDVIDDILPSGTGPYLFLTYV 324
G E+ V VI + G+ T+A V ++ + + I D GT P + +V
Sbjct: 241 -----GNEQGINVLVIDAPKGE-TNA-----VFTYYEDITYAITDSEGEGTIPRYYFDHV 289
Query: 325 ERWGLAISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGL 384
WGL I+A+ N + +L N +VD R + +P I + ++ +GL
Sbjct: 290 PEWGLIIAAS-SNSSEIAILGSTDKGVTWNQWQLVDSGRAE-LPLI----HTTESYPVGL 343
Query: 385 CIDKVSLCGKVKVQLGVEEKELSPYFILMCLTLDGKLNMFHVASVSGPSCS 435
IDK + K+ G + P IL L G+L FH+ ++S P C+
Sbjct: 344 AIDKSPV---RKLPWGTDSTLPHPVPILHILATSGQLCSFHMVNLS-PKCT 390
>gi|351697298|gb|EHB00217.1| Nuclear pore complex protein Nup214 [Heterocephalus glaber]
Length = 2193
Score = 48.9 bits (115), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 97/409 (23%), Positives = 176/409 (43%), Gaps = 81/409 (19%)
Query: 52 LAVSQLSHQLIFAAHSSGFCVARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDI 111
LA+S + L+FA +SG V T +++ ++ ++ + V ++ + VP + V
Sbjct: 63 LAISN-KYGLLFAGGASGLQVFSTKNLL-VQNKLGDDPNKI-VDKVESLLVPMKFP-VHH 118
Query: 112 LSLSNDESTLAVSLYQSPH---IHFFSVHSLLN--KEIKPSFS---CSLNGSTYVKDIRW 163
L+LS D TL+V + + I FF V + N K+ K F+ S + V D++W
Sbjct: 119 LALSCDNLTLSVCMMSGEYGSIIAFFDVRTFSNEAKQQKRPFAYHKLSRDAGGMVIDMKW 178
Query: 164 RKKPENSF---LALSNVGRLYRGGVNGPLEDVMD---------NVDAVEWSVKGKFVAVA 211
+ LA ++ L + DV+ V +V WS KGK +AV
Sbjct: 179 NPAVPSMVAVCLADGSIAVLQ-------VTDVVKVCATLPPTVAVTSVCWSPKGKQLAVG 231
Query: 212 KKN---VLSILTSKLKERLPISLSFKSWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTE 268
K+N V + T + K+ +P + + + V+V + W+ I
Sbjct: 232 KQNGTVVQYLPTLQEKKVIPCPPFY-------EADHPVRVLDVLWI--GTYIFTIVYAAA 282
Query: 269 DGKEEN---YLVQVIQSKDGKITDASSEPVVLSFSDVFSDVIDDILPSGT--GPYLFLTY 323
DG E ++ ++ K+ K ++F + ++ S + +L+Y
Sbjct: 283 DGTLETSPEVVMALLPKKEEK------------HPEIFVNFMEPCYGSCAERQHHYYLSY 330
Query: 324 VERWGLAISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRDKWI------PRIKLQDNGN 377
VE W L ++A+ + + V +L D IN + W+ + + D +
Sbjct: 331 VEEWDLVLAASAASTE--VSILARQSDQ---------INWESWLLEDSSRAELPVTDKSD 379
Query: 378 DNLIMGLCIDKVSLCGKVKVQLGVEEKELSPYFILMCLTLDGKLNMFHV 426
D+L MG+ ID + +V+V + EEK L P +L+ L+ DG L F++
Sbjct: 380 DSLPMGVAIDYTN---QVEVAIN-EEKTLPPAPVLLLLSTDGVLCPFYM 424
>gi|223674047|pdb|3FHC|A Chain A, Crystal Structure Of Human Dbp5 In Complex With Nup214
Length = 405
Score = 48.9 bits (115), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 103/436 (23%), Positives = 190/436 (43%), Gaps = 73/436 (16%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPKERSSLLAVSN-KYGLVFAGGASGLQIFPTKNL 67
Query: 79 IDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFFS 135
+ K + V ++ + VP + + L+LS D TL+ + S + I FF
Sbjct: 68 LIQNKPGDDPNK--IVDKVQGLLVPMKFP-IHHLALSCDNLTLSACMMSSEYGSIIAFFD 124
Query: 136 VHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PENSFLALSNVGRLYRGGVNGPL 189
V + N K+ K F+ + V D++W P + L++ G + V +
Sbjct: 125 VRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLAD-GSIAVLQVTETV 183
Query: 190 EDVMD-----NVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVGDCD 241
+ V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 184 KVCATLPSTVAVTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES------ 237
Query: 242 VNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPVVLS 298
+ V+V + W+ + DG E + ++ ++ K+ K
Sbjct: 238 -DHPVRVLDVLWIGT--YVFAIVYAAADGTLETSPDVVMALLPKKEEK------------ 282
Query: 299 FSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDI 356
++F + ++ S T + +L+Y+E W L ++A+ + + +L R S D+
Sbjct: 283 HPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTEVS-ILARQS--DQ---- 335
Query: 357 AVVDINRDKWI------PRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSPYF 410
IN + W+ + + D +D+L MG+ +D + +V++ + +EK L P
Sbjct: 336 ----INWESWLLEDSSRAELPVTDKSDDSLPMGVVVDYTN---QVEITIS-DEKTLPPAP 387
Query: 411 ILMCLTLDGKLNMFHV 426
+LM L+ DG L F++
Sbjct: 388 VLMLLSTDGVLCPFYM 403
>gi|340711449|ref|XP_003394288.1| PREDICTED: hypothetical protein LOC100642319 [Bombus terrestris]
Length = 1604
Score = 48.5 bits (114), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 102/243 (41%), Gaps = 35/243 (14%)
Query: 195 NVDAVEWSVKGKFVAVAKKNVLSILTSKLKERLPISLSF---KSWVGDCDVNCSVKVDSI 251
N WS KGK + V KN SI+ +LK L I+ + ++G+ V SI
Sbjct: 179 NALCAAWSPKGKQIVVGCKNG-SIV--QLKPDLKIARTIPGPNPYIGE--------VISI 227
Query: 252 RWVRRDCIIIGCFQLTEDGKEENYLVQVIQSKDGKITDASSEPVVLSFSDVFSDVIDDIL 311
WV G E+ V VI + G+ T+A V + D+ + D
Sbjct: 228 LWVSNYQFCAAYL-----GNEQGINVLVIDAPKGE-TNA----VFTCYEDITYGITDSEA 277
Query: 312 PSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRDKWIPRIK 371
GT P + +V WGL I+A+ N + +L N +VD R + +P I
Sbjct: 278 -EGTIPRYYFDHVPEWGLIIAAS-SNSSEIAILGSTDKGVTWNQWQLVDSGRAE-LPLI- 333
Query: 372 LQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKELSPYFILMCLTLDGKLNMFHVASVSG 431
+ ++ +GL IDK + K+ G + P IL L G+L FH+ ++S
Sbjct: 334 ---HTTESYPVGLAIDKSPV---RKLPWGADSTLPHPVPILHILATSGQLCSFHMVNLS- 386
Query: 432 PSC 434
P C
Sbjct: 387 PKC 389
>gi|412989003|emb|CCO15594.1| predicted protein [Bathycoccus prasinos]
Length = 1697
Score = 48.1 bits (113), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 61/128 (47%), Gaps = 7/128 (5%)
Query: 307 IDDILPSGTGPYLFLTYVERWGLAISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRDKW 366
ID + TG L + + W +A+ A+ + D+ V+L+ S + V++I+ D+
Sbjct: 277 IDPTQDAATGNPLHVVPINGWDVALCAHARAWDNQVLLVSTSASEP---CRVLEIDDDRC 333
Query: 367 IPRIKLQDNGNDNLIMGLCIDKVSLCGKV--KVQLGVEEKELSPYFILMCLTLDGKLNMF 424
I L + +N + G+ +D + CG + G + L P IL T DG++ F
Sbjct: 334 FCTIPLAFDDENNFVSGVAVDYGASCGAMLNPSDGGAGDLPLGPTVILS--TSDGRIACF 391
Query: 425 HVASVSGP 432
+A ++ P
Sbjct: 392 RLACLTDP 399
>gi|67523259|ref|XP_659690.1| hypothetical protein AN2086.2 [Aspergillus nidulans FGSC A4]
gi|40745762|gb|EAA64918.1| hypothetical protein AN2086.2 [Aspergillus nidulans FGSC A4]
gi|259487456|tpe|CBF86150.1| TPA: Nuclear pore complex protein An-Nup159 (Eurofung) [Aspergillus
nidulans FGSC A4]
Length = 1394
Score = 47.4 bits (111), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 54/232 (23%), Positives = 92/232 (39%), Gaps = 32/232 (13%)
Query: 48 PSLPLAVSQLSHQLIFAAHSSGFCVARTNDVIDA--AKEMKENGTRCCVQELSVVDVPFE 105
P LP + + ++ A + CVA T V A A + KE Q + + +P
Sbjct: 16 PELPDPICGATKGIVVGAGPNTLCVASTESVRAAISADDEKEKVKTKPFQPQATISLPGR 75
Query: 106 NENVDILSLSNDESTLAVSLYQSPHIHFFSVHSLLNKEIKPSFSCSLNGSTYVKDIRWRK 165
++ + ++ +S L ++ H+ F SLL +P+ S NG+T+
Sbjct: 76 PTHI---AFASGDSALVLATESGTHLSVFETGSLLQPNAQPAISIPTNGATFRTVAPNPA 132
Query: 166 KPENSFLAL----SNVGRLYRG---------GVNGPLEDVMDNVDAVEWSVKGK-FVAVA 211
+ E+S +L +N G L G NG + + +V +V WS KGK VA
Sbjct: 133 QAEDSHSSLVALVTNAGELLMADLKAGNLVTGANGNI--LKADVSSVGWSNKGKQLVAGL 190
Query: 212 KKNVLSILTSK--LKERLPISLSFKSWVGDCDVNCSVKVDSIRWVRRDCIII 261
++T K+ +P D+ V SI W+ D ++
Sbjct: 191 VDGTGYVMTPDGVQKDLIPKP---------PDLTDPCHVSSIAWLENDIFLM 233
>gi|432116583|gb|ELK37376.1| Nuclear pore complex protein Nup214 [Myotis davidii]
Length = 1946
Score = 47.0 bits (110), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 104/441 (23%), Positives = 188/441 (42%), Gaps = 83/441 (18%)
Query: 19 ESNDFFFDEIGESIPIIRDDDSQFDLQNPPSLPLAVSQLSHQLIFAAHSSGFCVARTNDV 78
E DF F + + +R DS +L S LAVS + L+FA +SG + T ++
Sbjct: 13 EMKDFQFRALKK----VRIFDSPEELPRERSSLLAVSN-KYGLVFAGGASGLHIFPTKNL 67
Query: 79 IDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH---IHFFS 135
+ K + V ++ + VP + + L+LS D TL+ + S + I FF
Sbjct: 68 LIQNKPGDDPNK--IVDKVQGLLVPMKFP-IHHLALSCDNLTLSACMMSSEYGSIIAFFD 124
Query: 136 VHSLLN--KEIKPSFSCSL---NGSTYVKDIRWRKK-PE-------NSFLALSNVGRLYR 182
V + N K+ K F+ + V D++W P + +A+ V +
Sbjct: 125 VRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETVK 184
Query: 183 GGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKN---VLSILTSKLKERLPISLSFKSWVGD 239
V G L + V +V WS KGK +AV K+N V + T + K+ +P ++S
Sbjct: 185 --VCGTLPSAV-AVTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYES---- 237
Query: 240 CDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEE---NYLVQVIQSKDGKITDASSEPVV 296
+ V+V + W+ I DG E + ++ ++ KD K
Sbjct: 238 ---DHPVRVLDVLWIGTYVFTI--VYAAADGTLETSPDVVMALLPKKDEK---------- 282
Query: 297 LSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNIDDHVVL-----LRWSL 349
+VF + ++ T + +L+++E W L ++A+ + + ++ + W L
Sbjct: 283 --HPEVFVNFMEPCYGCCTERQHHYYLSHMEEWDLVLAASAASTEVSILARQSDQINWEL 340
Query: 350 ----DDEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQLGVEEKE 405
D + ++ V D N +D+L MG+ ID + +V++ + +EK
Sbjct: 341 WLLEDSSRAELPVTDKN--------------DDSLPMGVAIDYTN---QVEIAI-TDEKT 382
Query: 406 LSPYFILMCLTLDGKLNMFHV 426
L P +L+ L+ DG L F++
Sbjct: 383 LPPAPVLLLLSTDGVLCPFYM 403
>gi|348570390|ref|XP_003470980.1| PREDICTED: nuclear pore complex protein Nup214 [Cavia porcellus]
Length = 2143
Score = 46.2 bits (108), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 108/453 (23%), Positives = 181/453 (39%), Gaps = 104/453 (22%)
Query: 26 DEIGESIPIIRDDDSQFD-------LQNPPSLP------LAVSQLSHQLIFAAHSSGFCV 72
DE+ IP D QF +P LP LA+S + L+FA +SG V
Sbjct: 3 DEMDAMIPEREMKDFQFRALKKVRIFDSPEELPKERSNLLAISN-KYGLLFAGGASGLQV 61
Query: 73 ARTNDVIDAAKEMKENGTRCCVQELSVVDVPFENENVDILSLSNDESTLAVSLYQSPH-- 130
T +++ + ++ V ++ VP + V L+LS D TL+V + H
Sbjct: 62 FPTKNLLIQNRPGED--PNKVVDKVQSWLVPMKFP-VHHLALSCDNLTLSVCMLSGEHGS 118
Query: 131 -IHFFSVHSLLN--KEIKPSFS---CSLNGSTYVKDIRWRKKPENSFLALSNVGRLYRGG 184
I F V +L N K+ K F+ S + V D++W S +A+ L G
Sbjct: 119 VIAFLDVRTLANEAKQQKRPFAYHKLSKDVGGMVIDMKWNPT-VPSMVAVC----LADGS 173
Query: 185 VNGPLEDVMDNVDAVE---------------WSVKGKFVAVAKKN---VLSILTSKLKER 226
+ V+ DAV+ WS KGK +AV K++ V + T + K+
Sbjct: 174 IA-----VLQVTDAVKVCATLPPSVAVTSVCWSPKGKQLAVGKQDGTVVQYLPTLQEKKV 228
Query: 227 LPISLSFKSWVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEEN---YLVQVIQSK 283
+P + + + V+V + W+ I DG E ++ ++ K
Sbjct: 229 IPCPPFY-------EADHPVRVLDVLWIGTYVFTI--VYAAADGTLETSPEVVMALLPKK 279
Query: 284 DGKITDASSEPVVLSFSDVFSDVIDDILPSGT--GPYLFLTYVERWGLAISANRKNID-- 339
+ K +VF + ++ S + +L Y+E W L ++A+ + +
Sbjct: 280 EEK------------HPEVFVNFMEPCYGSCAERQHHYYLNYIEEWDLVLAASAASTEVS 327
Query: 340 ------DHVVLLRWSLDDEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCG 393
D V W L+D + + D +D+L MG+ ID +
Sbjct: 328 ILARQSDQVNWESWLLEDSSRA-------------ELPVTDKSDDSLPMGVAIDYTN--- 371
Query: 394 KVKVQLGVEEKELSPYFILMCLTLDGKLNMFHV 426
+V+V L EEK L P +L+ L+ DG L F++
Sbjct: 372 QVEVALN-EEKTLPPAPVLLLLSTDGVLCPFYM 403
>gi|380029820|ref|XP_003698563.1| PREDICTED: uncharacterized protein LOC100866545 [Apis florea]
Length = 1501
Score = 45.8 bits (107), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 81/332 (24%), Positives = 134/332 (40%), Gaps = 52/332 (15%)
Query: 109 VDILSLSNDESTLAVSLYQSPHIHFFSVHSL------LNKEIKPSFSCSLNGSTYVKDIR 162
V L+L+ D S LAV+ + S I + +L L ++K S S NG +V D+R
Sbjct: 77 VSKLTLNCDCSYLAVAAHGS-IILIYDAQALTKNNLQLLHKVKVSTS---NGDIFVNDLR 132
Query: 163 WRKKPENSFLALSNVGRL-YRGGVNGPLEDVMDNVDAVEWSVKGKFVAVAKKNVLSILTS 221
W A+S + + V D + WS KGK + V KN +
Sbjct: 133 WNP-------AISGIKKKEIESKVLEKFNDDGLHALCAAWSPKGKQIVVGCKNGNIV--- 182
Query: 222 KLKERLPISLSFKS---WVGDCDVNCSVKVDSIRWVRRDCIIIGCFQLTEDGKEENYLVQ 278
+ K L ++ + ++G+ V SI W+ G E+ V
Sbjct: 183 QFKPDLKVARTISGPNPYIGE--------VVSILWISNYQFCAAYL-----GNEQGINVL 229
Query: 279 VIQSKDGKITDASSEPVVLSFSDVFSDVIDDILPSGTGPYLFLTYVERWGLAISANRKNI 338
+I + G+ + + + D+ V D GT P + +V WGL I+A+ N
Sbjct: 230 IIDAPKGE-----ANAIFTCYEDITYGV-TDAEGEGTIPRYYFDHVPEWGLIIAASS-NS 282
Query: 339 DDHVVLLRWSLDDEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLCIDKVSLCGKVKVQ 398
+ VL N +VD R + +P I + ++ +GL IDK ++ K+
Sbjct: 283 SEVAVLGSTDKGITWNQWQLVDSGRAE-LPLI----HTTESYPVGLAIDKSAV---RKLP 334
Query: 399 LGVEEKELSPYFILMCLTLDGKLNMFHVASVS 430
G + P IL L G+L FH+ +++
Sbjct: 335 WGADSTLPHPVPILHILATSGQLCSFHMVNLT 366
>gi|147839370|emb|CAN67812.1| hypothetical protein VITISV_026390 [Vitis vinifera]
Length = 576
Score = 44.3 bits (103), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 20/28 (71%), Positives = 26/28 (92%), Gaps = 1/28 (3%)
Query: 819 NLFD-KTVQVLARKIYTEGIVKQASDSR 845
N+FD +TVQVLARK+Y +GIVKQA+DS+
Sbjct: 537 NIFDSRTVQVLARKVYMQGIVKQATDSK 564
>gi|328777287|ref|XP_395921.4| PREDICTED: hypothetical protein LOC412464 [Apis mellifera]
Length = 1517
Score = 43.5 bits (101), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 87/345 (25%), Positives = 140/345 (40%), Gaps = 61/345 (17%)
Query: 109 VDILSLSNDESTLAVSLYQSPHIHFFSVHSL------LNKEIKPSFSCSLNGSTYVKDIR 162
V L+L+ D S LAV+ + S I + +L L ++K S S NG +V D+R
Sbjct: 77 VSKLTLNCDCSYLAVAAHGS-IILIYDAQALTKNNLQLLHKVKVSTS---NGDIFVNDLR 132
Query: 163 WRKKPENSFLALSNVGRLYRGG-----------VNGPLEDVMDN---VDAVEWSVKGKFV 208
W P S + L + Y G + LE D+ WS KGK +
Sbjct: 133 WN--PAISGM-LCTIASDYTIGSFKIKEKKKEIESKVLEKFNDDGLHALCAAWSPKGKQI 189
Query: 209 AVAKKNVLSILTSKLKERLPISLSFKS---WVGDCDVNCSVKVDSIRWVRRDCIIIGCFQ 265
V KN + + K L ++ + ++G+ V SI W+
Sbjct: 190 VVGCKNGNIV---QFKPDLKVARTISGPNPYIGE--------VVSILWISNYQFCAAYL- 237
Query: 266 LTEDGKEENYLVQVIQSKDGKITDASSEPVVLSFSDVFSDVIDDILPSGTGPYLFLTYVE 325
G E+ V +I + G+ T+A + + D+ V D GT P + +V
Sbjct: 238 ----GNEQGINVLIIDAPKGE-TNA----IFTCYEDITYGV-TDAEGEGTIPRYYFDHVP 287
Query: 326 RWGLAISANRKNIDDHVVLLRWSLDDEKNDIAVVDINRDKWIPRIKLQDNGNDNLIMGLC 385
WGL I+A+ N + VL N +VD R + +P I + ++ +GL
Sbjct: 288 EWGLIIAASS-NSSEVAVLGSTDKGITWNQWQLVDSGRAE-LPLI----HTTESYPVGLA 341
Query: 386 IDKVSLCGKVKVQLGVEEKELSPYFILMCLTLDGKLNMFHVASVS 430
IDK ++ K+ G + P IL L G+L FH+ +++
Sbjct: 342 IDKSAV---RKLPWGADSTLPHPVPILHILATSGQLCSFHMVNLT 383
>gi|86652276|gb|ABD14581.1| zonadhesin variant 6 [Pongo pygmaeus]
Length = 439
Score = 41.2 bits (95), Expect = 4.3, Method: Composition-based stats.
Identities = 44/139 (31%), Positives = 60/139 (43%), Gaps = 9/139 (6%)
Query: 1162 VTAEKLSSGVYFDVKSNSTLTNET-KSTMQSESNI--FQKPTISTMSPTLTPSLLKNPSE 1218
++ EKL+ S LT T K T+ +E +KPTIST PT++P +E
Sbjct: 16 ISTEKLTIPTEKPTISTEKLTIPTEKPTIPTEKPTIPMEKPTISTEKPTISPEKPSISTE 75
Query: 1219 MP-ISNGKGTVP-ESSTIGSEKHGAATTK-TLFVESGKNRDAQVSTPAAVPTSSAFPGIV 1275
P I K T+P E STI +EK T K T+F E + + P PT P
Sbjct: 76 KPTIPTEKPTIPIEESTISTEKLNIPTEKPTIFTEKPTIPTEKPTIPTEKPT---IPSEK 132
Query: 1276 PIFDAANKSQPGGKSFTPS 1294
P + P K P+
Sbjct: 133 PTIPTEKLTIPTEKPAIPT 151
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.312 0.128 0.365
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 20,622,202,257
Number of Sequences: 23463169
Number of extensions: 885720658
Number of successful extensions: 2072943
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 55
Number of HSP's successfully gapped in prelim test: 1355
Number of HSP's that attempted gapping in prelim test: 2059029
Number of HSP's gapped (non-prelim): 9692
length of query: 1324
length of database: 8,064,228,071
effective HSP length: 155
effective length of query: 1169
effective length of database: 8,722,404,172
effective search space: 10196490477068
effective search space used: 10196490477068
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 83 (36.6 bits)