BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 009045
(546 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9M8Y0|SEC_ARATH Probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC OS=Arabidopsis
thaliana GN=SEC PE=2 SV=1
Length = 977
Score = 51.2 bits (121), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 62/264 (23%), Positives = 109/264 (41%), Gaps = 38/264 (14%)
Query: 148 PKNAHAHFLLGLMYQRLGQPLKAVSSYEKAEEILLRCEADIARPELLSLVQIHHAQCLLP 207
P A+ LG +Y+ LG+P +A+ Y+ A + +R + +A + S+
Sbjct: 254 PAFPDAYLNLGNVYKALGRPTEAIMCYQHALQ--MRPNSAMAFGNIASIY---------- 301
Query: 208 ESSGDNSLDKELEPEELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSS 267
E +L+ + K+++ D R +N LG L GR+ ++ +
Sbjct: 302 -----------YEQGQLDLAIRHYKQALSRDPRFLEAYNNLGNALKDIGRVDEAVRCYNQ 350
Query: 268 LLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLI-----LKDQNHPAALI-----NYA 317
LA+ PN+ + NLG Y + M ++ F+ + L + A+I NY+
Sbjct: 351 CLALQPNHPQAMANLGNIYMEWNMMGPASSLFKATLAVTTGLSAPFNNLAIIYKQQGNYS 410
Query: 318 ALLLCKYGSVLA----GAGANTGEGACLDQASAVNVAKECLLAALKADPKAAHIWANLAN 373
+ C Y VL A A G + V A + + A+ P A ANLA+
Sbjct: 411 DAISC-YNEVLRIDPLAADALVNRGNTYKEIGRVTEAIQDYMHAINFRPTMAEAHANLAS 469
Query: 374 AYYLTGDHRSSGKCLEKAAKLEPN 397
AY +G ++ ++A L P+
Sbjct: 470 AYKDSGHVEAAITSYKQALLLRPD 493
Score = 42.7 bits (99), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 38/175 (21%), Positives = 74/175 (42%), Gaps = 21/175 (12%)
Query: 222 EELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGN 281
+E + +++ +E+++ + A + + + G +I + + PN D N
Sbjct: 101 QEYDMCIARNEEALRIQPQFAECYGNMANAWKEKGDTDRAIRYYLIAIELRPNFADAWSN 160
Query: 282 LGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLCKYGSVLAGAGANTGEGACL 341
L AY + G + ++ +C Q AL L +L A +N G +
Sbjct: 161 LASAYMRKGRLSEATQCCQQ----------------ALSL---NPLLVDAHSNLGN--LM 199
Query: 342 DQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEP 396
++ A C L A++ P A W+NLA + +GD + + ++A KL+P
Sbjct: 200 KAQGLIHEAYSCYLEAVRIQPTFAIAWSNLAGLFMESGDLNRALQYYKEAVKLKP 254
Score = 39.7 bits (91), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 36/174 (20%), Positives = 65/174 (37%), Gaps = 21/174 (12%)
Query: 224 LEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLG 283
+ E S E+++ A+ W+ L + ++SG L ++ + + P D NLG
Sbjct: 205 IHEAYSCYLEAVRIQPTFAIAWSNLAGLFMESGDLNRALQYYKEAVKLKPAFPDAYLNLG 264
Query: 284 IAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLCKYGSVLAGAGANTGEGACLDQ 343
Y G ++ C+Q + N A N A++ +
Sbjct: 265 NVYKALGRPTEAIMCYQHALQMRPNSAMAFGNIASIYY---------------------E 303
Query: 344 ASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEPN 397
+++A AL DP+ + NL NA G + +C + L+PN
Sbjct: 304 QGQLDLAIRHYKQALSRDPRFLEAYNNLGNALKDIGRVDEAVRCYNQCLALQPN 357
Score = 35.4 bits (80), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 43/177 (24%), Positives = 74/177 (41%), Gaps = 22/177 (12%)
Query: 355 LAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCMSTRYAVAVSRIKDAE 414
L A++ P A W+NLA+AY G + +C ++A L P + A S + +
Sbjct: 145 LIAIELRPNFADAWSNLASAYMRKGRLSEATQCCQQALSLNPLLVD-----AHSNLGNLM 199
Query: 415 RSQEPTEQLSWAGNEMASILREGDPVQIEPPIAWAGFAAVQKTHHEVAAAFETEENELSK 474
++Q +E S E +Q IAW+ A + ++ A + + E K
Sbjct: 200 KAQGLI-------HEAYSCYLEAVRIQPTFAIAWSNLAGLFMESGDLNRALQYYK-EAVK 251
Query: 475 MEECAGAGESAFLDQASAVNV------AKECLLAALKADPKAAHIWANLANAYYLTG 525
++ A A+L+ + A C AL+ P +A + N+A+ YY G
Sbjct: 252 LKP---AFPDAYLNLGNVYKALGRPTEAIMCYQHALQMRPNSAMAFGNIASIYYEQG 305
Score = 33.5 bits (75), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 64/306 (20%), Positives = 103/306 (33%), Gaps = 55/306 (17%)
Query: 245 WNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLIL 304
W+ L ++ GRL + L+++P D NLG G + ++ C+ + +
Sbjct: 158 WSNLASAYMRKGRLSEATQCCQQALSLNPLLVDAHSNLGNLMKAQGLIHEAYSCYLEAVR 217
Query: 305 KDQNHPAALINYAALLLCKYGSVLAGAGANTGEGACLDQASAVNVAKECLLAALKADPKA 364
A N A L + ++ +N A + A+K P
Sbjct: 218 IQPTFAIAWSNLAGLFM---------------------ESGDLNRALQYYKEAVKLKPAF 256
Query: 365 AHIWANLANAYYLTGDHRSSGKCLEKAAKLEPN-------CMSTRYA-----VAVSRIKD 412
+ NL N Y G + C + A ++ PN S Y +A+ K
Sbjct: 257 PDAYLNLGNVYKALGRPTEAIMCYQHALQMRPNSAMAFGNIASIYYEQGQLDLAIRHYKQ 316
Query: 413 A-ERSQEPTEQLSWAGNEMASILREGDPVQ-------IEP--PIAWAGFAAVQKTHHEVA 462
A R E + GN + I R + V+ ++P P A A + + +
Sbjct: 317 ALSRDPRFLEAYNNLGNALKDIGRVDEAVRCYNQCLALQPNHPQAMANLGNIYMEWNMMG 376
Query: 463 AAFETEENELSKMEECAGAGESAFLD-------QASAVNVAKECLLAALKADPKAAHIWA 515
A + L K G SA + Q + A C L+ DP AA
Sbjct: 377 PA-----SSLFKATLAVTTGLSAPFNNLAIIYKQQGNYSDAISCYNEVLRIDPLAADALV 431
Query: 516 NLANAY 521
N N Y
Sbjct: 432 NRGNTY 437
Score = 33.1 bits (74), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 34/157 (21%), Positives = 56/157 (35%), Gaps = 21/157 (13%)
Query: 219 LEPEELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDC 278
+E +L L KE+++ + LG + GR +I L + PN+
Sbjct: 234 MESGDLNRALQYYKEAVKLKPAFPDAYLNLGNVYKALGRPTEAIMCYQHALQMRPNSAMA 293
Query: 279 IGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLCKYGSVLAGAGANTGEG 338
GN+ Y++ G ++ + + ++ + +D P L Y L G
Sbjct: 294 FGNIASIYYEQGQLDLAIRHYKQALSRD---PRFLEAYNNL------------------G 332
Query: 339 ACLDQASAVNVAKECLLAALKADPKAAHIWANLANAY 375
L V+ A C L P ANL N Y
Sbjct: 333 NALKDIGRVDEAVRCYNQCLALQPNHPQAMANLGNIY 369
>sp|Q9FNS4|MBB1_CHLRE PsbB mRNA maturation factor Mbb1, chloroplastic OS=Chlamydomonas
reinhardtii GN=MBB1 PE=2 SV=1
Length = 662
Score = 48.1 bits (113), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/273 (19%), Positives = 108/273 (39%), Gaps = 54/273 (19%)
Query: 125 QSGLGTSSSSRE--EKGLVHVARK-MPKNAHAHFLLGLMYQRLGQPLKAVSSYEKAEEIL 181
+ G G + +R+ +G+ RK +NA+ + LG M +LG+ +A S +E+
Sbjct: 240 EKGQGNFTRARDLWMQGIQRCRRKPQSQNAYLYNALGCMAAQLGRVGEARSWFEEGTRSA 299
Query: 182 LRCEADIARPELLSLVQIHHAQCLLPESSGDNSLDKELEPEEL----------------- 224
E + V + A +L GD ++ + L + L
Sbjct: 300 ----------EGAASVALWQAWAVLEAKQGDPTVVRYLFRKALGANPRSRYVHLAWALWE 349
Query: 225 ------EEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDC 278
+ L+ L+ + + ++ L+ ++GR++ + + L DP++
Sbjct: 350 RRQGNPQHCLALLRRGCELNPTDPALYQAWALVEKQAGRIERARELFEQGLRADPSDLYM 409
Query: 279 IGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLCKYGSVLAGAGANTGEG 338
G+ + G+M+++ + FQ+ + D P+ + + A G
Sbjct: 410 WQAYGVMEAEQGNMDRARQLFQEGVWADPRSPSTVYVFHAW------------------G 451
Query: 339 ACLDQASAVNVAKECLLAALKADPKAAHIWANL 371
A QA V A+E AA++ DPK+ WA+
Sbjct: 452 ALEWQAGNVQTARELFKAAVRVDPKSETTWASW 484
>sp|Q3UV71|TMTC1_MOUSE Transmembrane and TPR repeat-containing protein 1 OS=Mus musculus
GN=Tmtc1 PE=2 SV=2
Length = 942
Score = 47.0 bits (110), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 73/336 (21%), Positives = 134/336 (39%), Gaps = 50/336 (14%)
Query: 222 EELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGN 281
E+ EE + LKES++ A +++L +L + R + + + + + P++ D N
Sbjct: 622 EKTEEAIMLLKESIKYGPDFADAYSSLASLLAEQERFKEAEDIYQAGIKNCPDSSDLHNN 681
Query: 282 LGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLCKYGSVLAGAGANTGEGACL 341
+ SG E++ +Q I +H A++N G + G N+
Sbjct: 682 YAVFLVDSGFPEKAVAHYQQAIQLSPSHHVAVVNL--------GRLYRSLGENSK----- 728
Query: 342 DQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCMST 401
A+E ALK + A + + L YY TG H+ + + +A L+P+
Sbjct: 729 --------AEEWYRRALKVA-RTAEVLSPLGALYYNTGRHKEALEVYREAVSLQPSQREL 779
Query: 402 RYAVA-----VSRIKDAERSQEPTEQLSWAGNEMASILREGDPVQIEPPIAWAGFAAVQK 456
R A+A + + K+AE+ + S + +P +E + +A+
Sbjct: 780 RLALAQVLAVMGQTKEAEK--------------ITSHIVSEEPRCLE---CYRLLSAIHS 822
Query: 457 THHEVAAAFETEENELS-KMEECAGAGESAF-----LDQASAVNVAKECLLAALKADPKA 510
A E E L K ++ E F L + + ++ A E AA+ DP
Sbjct: 823 KQEHHGKALEAIEKALQLKPKDPKVISELFFTKGNQLREQNLLDKAFESYEAAVTLDPDQ 882
Query: 511 AHIWANLANAYYLTGDHRSSGKCLEKVLMVYCSSNL 546
A W N+ ++ G + S+ E+ L + S L
Sbjct: 883 AQAWMNMGGIRHIQGSYVSARAYYERALKLVPDSKL 918
>sp|Q8CGY8|OGT1_MOUSE UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit OS=Mus musculus GN=Ogt PE=1 SV=2
Length = 1046
Score = 46.6 bits (109), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/155 (24%), Positives = 62/155 (40%), Gaps = 21/155 (13%)
Query: 245 WNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLIL 304
++ LG + + G+LQ +I L + P+ D NL A +GDME + + +
Sbjct: 92 YSNLGNVYKERGQLQEAIEHYRHALRLKPDFIDGYINLAAALVAAGDMEGAVQAY----- 146
Query: 305 KDQNHPAALINYAALLLCKYGSVLAGAGANTGEGACLDQASAVNVAKECLLAALKADPKA 364
+ + Y L C + G L + AK C L A++ P
Sbjct: 147 ------VSALQYNPDLYC----------VRSDLGNLLKALGRLEEAKACYLKAIETQPNF 190
Query: 365 AHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCM 399
A W+NL + G+ + EKA L+PN +
Sbjct: 191 AVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFL 225
Score = 41.2 bits (95), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 69/330 (20%), Positives = 129/330 (39%), Gaps = 67/330 (20%)
Query: 223 ELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNL 282
LEE + +++++ AV W+ LG + G + +I + +DPN D
Sbjct: 172 RLEEAKACYLKAIETQPNFAVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFLD----- 226
Query: 283 GIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLCKYGSVLAGAGANTGEGACL- 341
AY G++ + A+ F + AA + AL L +V+ G AC+
Sbjct: 227 --AYINLGNVLKEARIFDRAV-------AAYLR--ALSLSPNHAVV------HGNLACVY 269
Query: 342 DQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCMST 401
+ +++A + A++ P + NLANA G + C A +L P
Sbjct: 270 YEQGLIDLAIDTYRRAIELQPHFPDAYCNLANALKEKGSVAEAEDCYNTALRLCPT---- 325
Query: 402 RYAVAVSRIKDAERSQ--------------EPTEQLSWAGNEMASILREGDPVQIEPPIA 447
+A +++ + + +R Q E + + A + +AS+L++ +Q A
Sbjct: 326 -HADSLNNLANIKREQGNIEEAVRLYRKALEVFPEFAAAHSNLASVLQQQGKLQ----EA 380
Query: 448 WAGFAAVQKTHHEVAAAFETEENELSKMEECAGAGESAFLDQASAVNVAKECLLAALKAD 507
+ + A A+ N L +M++ G A +C A++ +
Sbjct: 381 LMHYKEAIRISPTFADAYSNMGNTLKEMQDVQG---------------ALQCYTRAIQIN 425
Query: 508 PKAAHIWANLANAYYLTGDHRSSGKCLEKV 537
P A +NLA+ H+ SG E +
Sbjct: 426 PAFADAHSNLASI------HKDSGNIPEAI 449
>sp|Q27HV0|OGT1_PIG UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit OS=Sus scrofa GN=OGT PE=2 SV=1
Length = 1046
Score = 46.2 bits (108), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/155 (24%), Positives = 62/155 (40%), Gaps = 21/155 (13%)
Query: 245 WNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLIL 304
++ LG + + G+LQ +I L + P+ D NL A +GDME + + +
Sbjct: 92 YSNLGNVYKERGQLQEAIEHYRHALRLKPDFIDGYINLAAALVAAGDMEGAVQAY----- 146
Query: 305 KDQNHPAALINYAALLLCKYGSVLAGAGANTGEGACLDQASAVNVAKECLLAALKADPKA 364
+ + Y L C + G L + AK C L A++ P
Sbjct: 147 ------VSALQYNPDLYC----------VRSDLGNLLKALGRLEEAKACYLKAIETQPNF 190
Query: 365 AHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCM 399
A W+NL + G+ + EKA L+PN +
Sbjct: 191 AVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFL 225
Score = 40.8 bits (94), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 69/330 (20%), Positives = 129/330 (39%), Gaps = 67/330 (20%)
Query: 223 ELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNL 282
LEE + +++++ AV W+ LG + G + +I + +DPN D
Sbjct: 172 RLEEAKACYLKAIETQPNFAVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFLD----- 226
Query: 283 GIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLCKYGSVLAGAGANTGEGACL- 341
AY G++ + A+ F + AA + AL L +V+ G AC+
Sbjct: 227 --AYINLGNVLKEARIFDRAV-------AAYLR--ALSLSPNHAVV------HGNLACVY 269
Query: 342 DQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCMST 401
+ +++A + A++ P + NLANA G + C A +L P
Sbjct: 270 YEQGLIDLAIDTYRRAIELQPHFPDAYCNLANALKEKGSVAEAEDCYNTALRLCPT---- 325
Query: 402 RYAVAVSRIKDAERSQ--------------EPTEQLSWAGNEMASILREGDPVQIEPPIA 447
+A +++ + + +R Q E + + A + +AS+L++ +Q A
Sbjct: 326 -HADSLNNLANIKREQGNIEEAVRLYRKALEVFPEFAAAHSNLASVLQQQGKLQ----EA 380
Query: 448 WAGFAAVQKTHHEVAAAFETEENELSKMEECAGAGESAFLDQASAVNVAKECLLAALKAD 507
+ + A A+ N L +M++ G A +C A++ +
Sbjct: 381 LMHYKEAIRISPTFADAYSNMGNTLKEMQDVQG---------------ALQCYTRAIQIN 425
Query: 508 PKAAHIWANLANAYYLTGDHRSSGKCLEKV 537
P A +NLA+ H+ SG E +
Sbjct: 426 PAFADAHSNLASI------HKDSGNIPEAI 449
>sp|P81436|OGT1_RABIT UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit OS=Oryctolagus cuniculus GN=OGT PE=1
SV=2
Length = 1046
Score = 46.2 bits (108), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/155 (24%), Positives = 62/155 (40%), Gaps = 21/155 (13%)
Query: 245 WNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLIL 304
++ LG + + G+LQ +I L + P+ D NL A +GDME + + +
Sbjct: 92 YSNLGNVYKERGQLQEAIEHYRHALRLKPDFIDGYINLAAALVAAGDMEGAVQAY----- 146
Query: 305 KDQNHPAALINYAALLLCKYGSVLAGAGANTGEGACLDQASAVNVAKECLLAALKADPKA 364
+ + Y L C + G L + AK C L A++ P
Sbjct: 147 ------VSALQYNPDLYC----------VRSDLGNLLKALGRLEEAKACYLKAIETQPNF 190
Query: 365 AHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCM 399
A W+NL + G+ + EKA L+PN +
Sbjct: 191 AVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFL 225
Score = 40.8 bits (94), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 69/330 (20%), Positives = 129/330 (39%), Gaps = 67/330 (20%)
Query: 223 ELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNL 282
LEE + +++++ AV W+ LG + G + +I + +DPN D
Sbjct: 172 RLEEAKACYLKAIETQPNFAVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFLD----- 226
Query: 283 GIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLCKYGSVLAGAGANTGEGACL- 341
AY G++ + A+ F + AA + AL L +V+ G AC+
Sbjct: 227 --AYINLGNVLKEARIFDRAV-------AAYLR--ALSLSPNHAVV------HGNLACVY 269
Query: 342 DQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCMST 401
+ +++A + A++ P + NLANA G + C A +L P
Sbjct: 270 YEQGLIDLAIDTYRRAIELQPHFPDAYCNLANALKEKGSVAEAEDCYNTALRLCPT---- 325
Query: 402 RYAVAVSRIKDAERSQ--------------EPTEQLSWAGNEMASILREGDPVQIEPPIA 447
+A +++ + + +R Q E + + A + +AS+L++ +Q A
Sbjct: 326 -HADSLNNLANIKREQGNIEEAVRLYRKALEVFPEFAAAHSNLASVLQQQGKLQ----EA 380
Query: 448 WAGFAAVQKTHHEVAAAFETEENELSKMEECAGAGESAFLDQASAVNVAKECLLAALKAD 507
+ + A A+ N L +M++ G A +C A++ +
Sbjct: 381 LMHYKEAIRISPTFADAYSNMGNTLKEMQDVQG---------------ALQCYTRAIQIN 425
Query: 508 PKAAHIWANLANAYYLTGDHRSSGKCLEKV 537
P A +NLA+ H+ SG E +
Sbjct: 426 PAFADAHSNLASI------HKDSGNIPEAI 449
>sp|O15294|OGT1_HUMAN UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit OS=Homo sapiens GN=OGT PE=1 SV=3
Length = 1046
Score = 46.2 bits (108), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/155 (24%), Positives = 62/155 (40%), Gaps = 21/155 (13%)
Query: 245 WNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLIL 304
++ LG + + G+LQ +I L + P+ D NL A +GDME + + +
Sbjct: 92 YSNLGNVYKERGQLQEAIEHYRHALRLKPDFIDGYINLAAALVAAGDMEGAVQAY----- 146
Query: 305 KDQNHPAALINYAALLLCKYGSVLAGAGANTGEGACLDQASAVNVAKECLLAALKADPKA 364
+ + Y L C + G L + AK C L A++ P
Sbjct: 147 ------VSALQYNPDLYC----------VRSDLGNLLKALGRLEEAKACYLKAIETQPNF 190
Query: 365 AHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCM 399
A W+NL + G+ + EKA L+PN +
Sbjct: 191 AVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFL 225
Score = 40.8 bits (94), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 69/330 (20%), Positives = 129/330 (39%), Gaps = 67/330 (20%)
Query: 223 ELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNL 282
LEE + +++++ AV W+ LG + G + +I + +DPN D
Sbjct: 172 RLEEAKACYLKAIETQPNFAVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFLD----- 226
Query: 283 GIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLCKYGSVLAGAGANTGEGACL- 341
AY G++ + A+ F + AA + AL L +V+ G AC+
Sbjct: 227 --AYINLGNVLKEARIFDRAV-------AAYLR--ALSLSPNHAVV------HGNLACVY 269
Query: 342 DQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCMST 401
+ +++A + A++ P + NLANA G + C A +L P
Sbjct: 270 YEQGLIDLAIDTYRRAIELQPHFPDAYCNLANALKEKGSVAEAEDCYNTALRLCPT---- 325
Query: 402 RYAVAVSRIKDAERSQ--------------EPTEQLSWAGNEMASILREGDPVQIEPPIA 447
+A +++ + + +R Q E + + A + +AS+L++ +Q A
Sbjct: 326 -HADSLNNLANIKREQGNIEEAVRLYRKALEVFPEFAAAHSNLASVLQQQGKLQ----EA 380
Query: 448 WAGFAAVQKTHHEVAAAFETEENELSKMEECAGAGESAFLDQASAVNVAKECLLAALKAD 507
+ + A A+ N L +M++ G A +C A++ +
Sbjct: 381 LMHYKEAIRISPTFADAYSNMGNTLKEMQDVQG---------------ALQCYTRAIQIN 425
Query: 508 PKAAHIWANLANAYYLTGDHRSSGKCLEKV 537
P A +NLA+ H+ SG E +
Sbjct: 426 PAFADAHSNLASI------HKDSGNIPEAI 449
>sp|P56558|OGT1_RAT UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit OS=Rattus norvegicus GN=Ogt PE=1 SV=1
Length = 1036
Score = 46.2 bits (108), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 38/155 (24%), Positives = 62/155 (40%), Gaps = 21/155 (13%)
Query: 245 WNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLIL 304
++ LG + + G+LQ +I L + P+ D NL A +GDME + + +
Sbjct: 82 YSNLGNVYKERGQLQEAIEHYRHALRLKPDFIDGYINLAAALVAAGDMEGAVQAY----- 136
Query: 305 KDQNHPAALINYAALLLCKYGSVLAGAGANTGEGACLDQASAVNVAKECLLAALKADPKA 364
+ + Y L C + G L + AK C L A++ P
Sbjct: 137 ------VSALQYNPDLYC----------VRSDLGNLLKALGRLEEAKACYLKAIETQPNF 180
Query: 365 AHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCM 399
A W+NL + G+ + EKA L+PN +
Sbjct: 181 AVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFL 215
Score = 40.8 bits (94), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 69/330 (20%), Positives = 129/330 (39%), Gaps = 67/330 (20%)
Query: 223 ELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNL 282
LEE + +++++ AV W+ LG + G + +I + +DPN D
Sbjct: 162 RLEEAKACYLKAIETQPNFAVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFLD----- 216
Query: 283 GIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLCKYGSVLAGAGANTGEGACL- 341
AY G++ + A+ F + AA + AL L +V+ G AC+
Sbjct: 217 --AYINLGNVLKEARIFDRAV-------AAYLR--ALSLSPNHAVV------HGNLACVY 259
Query: 342 DQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCMST 401
+ +++A + A++ P + NLANA G + C A +L P
Sbjct: 260 YEQGLIDLAIDTYRRAIELQPHFPDAYCNLANALKEKGSVAEAEDCYNTALRLCPT---- 315
Query: 402 RYAVAVSRIKDAERSQ--------------EPTEQLSWAGNEMASILREGDPVQIEPPIA 447
+A +++ + + +R Q E + + A + +AS+L++ +Q A
Sbjct: 316 -HADSLNNLANIKREQGNIEEAVRLYRKALEVFPEFAAAHSNLASVLQQQGKLQ----EA 370
Query: 448 WAGFAAVQKTHHEVAAAFETEENELSKMEECAGAGESAFLDQASAVNVAKECLLAALKAD 507
+ + A A+ N L +M++ G A +C A++ +
Sbjct: 371 LMHYKEAIRISPTFADAYSNMGNTLKEMQDVQG---------------ALQCYTRAIQIN 415
Query: 508 PKAAHIWANLANAYYLTGDHRSSGKCLEKV 537
P A +NLA+ H+ SG E +
Sbjct: 416 PAFADAHSNLASI------HKDSGNIPEAI 439
>sp|O94474|SKI3_SCHPO Superkiller protein 3 OS=Schizosaccharomyces pombe (strain 972 /
ATCC 24843) GN=ski3 PE=3 SV=1
Length = 1389
Score = 45.8 bits (107), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 43/169 (25%), Positives = 67/169 (39%), Gaps = 20/169 (11%)
Query: 258 LQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQNH----PAAL 313
L+ S S S L DPN +LG+ Y DM ++ KCFQ D + A
Sbjct: 557 LEDSFSAFVSSLRKDPNYAPAYTSLGLYYRDIHDMVRATKCFQKAFELDASQVEAAEALA 616
Query: 314 INYAALLLCKYGSVLAGAGANTGE----------------GACLDQASAVNVAKECLLAA 357
+A + V++ NT E G A + A +A
Sbjct: 617 KTFAEANEWELVEVISRRVLNTSENDLKRKKKFNWHHTSLGVLELNAKNFHKAIVHFQSA 676
Query: 358 LKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCMSTRYAVA 406
L+ PK + W+ L AY +G + S+ K +A+ L+P+ +Y +A
Sbjct: 677 LRISPKDTNAWSGLGEAYARSGRYVSALKAFNRASILDPDDWYVKYFIA 725
Score = 40.0 bits (92), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 60/269 (22%), Positives = 106/269 (39%), Gaps = 46/269 (17%)
Query: 139 GLVHVARKMPKNAHAHFLLGLMYQRLGQPLKAVSSYEKAEEILLRCEADIARPELLSLVQ 198
V RK P A A+ LGL Y+ + ++A ++KA E L Q
Sbjct: 563 AFVSSLRKDPNYAPAYTSLGLYYRDIHDMVRATKCFQKAFE--------------LDASQ 608
Query: 199 IHHAQCLLPESSGDNSLDKELEPEELEEILSKLKESMQSDTRQAVVWN----TLGLILLK 254
+ A+ L + N E E +E I ++ + ++D ++ +N +LG++ L
Sbjct: 609 VEAAEALAKTFAEAN------EWELVEVISRRVLNTSENDLKRKKKFNWHHTSLGVLELN 662
Query: 255 SGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALI 314
+ +I S L + P + + LG AY +SG + K F + D +
Sbjct: 663 AKNFHKAIVHFQSALRISPKDTNAWSGLGEAYARSGRYVSALKAFNRASILDPDD----- 717
Query: 315 NYAALLLCKYGSVLAGAGANTGEG----ACLDQASAVNVAKECLLAALKADPKAAHIWAN 370
KY +A + GE + L + AV + C+ +L A +
Sbjct: 718 -----WYVKY--FIATLEKDMGEYEVAVSTLSEILAVRSKELCVQVSL------AETYVR 764
Query: 371 LANAYYLTGDHRSSGKCLEKAAKLEPNCM 399
LA Y+ G + + LEK+ ++ N +
Sbjct: 765 LAKLYHARGFYSRAADSLEKSIQICCNVL 793
>sp|Q6FM42|PEX5_CANGA Peroxisomal targeting signal receptor OS=Candida glabrata (strain
ATCC 2001 / CBS 138 / JCM 3761 / NBRC 0622 / NRRL Y-65)
GN=PEX5 PE=3 SV=1
Length = 590
Score = 45.4 bits (106), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/206 (20%), Positives = 87/206 (42%), Gaps = 21/206 (10%)
Query: 223 ELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNL 282
+L E + +++ D + W LG++ +++ + + +S L + L +DPNN + + NL
Sbjct: 301 KLSEAALAFEAAIKQDPKHVDAWLKLGIVQIQNEKELNGMSALETCLKLDPNNLEAMKNL 360
Query: 283 GIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAAL-------------LLCKYGSVLA 329
I+Y G + SA + D +P N A L + ++ S++
Sbjct: 361 AISYINEG-YDMSAYNMLNR-WADTKYP-GFYNSAELEGKRDEHENIHSKMTRRFLSLVN 417
Query: 330 GAGANTGE-----GACLDQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSS 384
+ + G + +C AALK +P +W L + + +
Sbjct: 418 RINSVDPDIQLCLGLLFYANDEFDRTIDCFQAALKVNPNDELMWNRLGASLANSNRSEEA 477
Query: 385 GKCLEKAAKLEPNCMSTRYAVAVSRI 410
+ +A +L+P+ + RY +AVS +
Sbjct: 478 IQAYHRALQLKPSFVRARYNLAVSSM 503
>sp|P14922|CYC8_YEAST General transcriptional corepressor CYC8 OS=Saccharomyces
cerevisiae (strain ATCC 204508 / S288c) GN=CYC8 PE=1
SV=2
Length = 966
Score = 45.1 bits (105), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/176 (22%), Positives = 73/176 (41%), Gaps = 33/176 (18%)
Query: 244 VWNTLGLILLKSGRLQSSISVLSSLLAVDPN---NCDCIGNLGIAYFQSGDMEQSAKCFQ 300
+W+ +G++ + G L + + +L +DP+ + LGI Y G Q+ +CF+
Sbjct: 152 LWHGIGILYDRYGSLDYAEEAFAKVLELDPHFEKANEIYFRLGIIYKHQGKWSQALECFR 211
Query: 301 DLILKDQNHPAALINYAALLLCKYGSVLAGAGANTG--------------------EGAC 340
++ + PA L + + + GSVL G G + C
Sbjct: 212 YILPQP---PAPLQEWD--IWFQLGSVLESMGEWQGAKEAYEHVLAQNQHHAKVLQQLGC 266
Query: 341 LDQASAVNV-----AKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKA 391
L S V A + LL +L+ADP A W +L + + D+ ++ ++A
Sbjct: 267 LYGMSNVQFYDPQKALDYLLKSLEADPSDATTWYHLGRVHMIRTDYTAAYDAFQQA 322
>sp|P21296|FLBA_CAUCR Protein FlbA OS=Caulobacter crescentus (strain ATCC 19089 / CB15)
GN=flbA PE=4 SV=2
Length = 596
Score = 44.7 bits (104), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 74/180 (41%), Gaps = 21/180 (11%)
Query: 235 MQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQ 294
++ D R + W+ L + K+G SS+ + LA+ P++ G+LG F+ E
Sbjct: 84 LEKDERLGLAWHILAIAREKTGDFASSLRAYEAALALLPDHGPVAGDLGRLAFRMNMPEL 143
Query: 295 SAKCFQDLILKDQNHPAALINYAALLLCKYGSVLAGAGANTGEGACLDQASAVNVAKECL 354
+AK F L + + A L C L + + + A E L
Sbjct: 144 AAKFFAHYRLARPDD----VEGANNLAC-----------------ALRELNRESEAIEVL 182
Query: 355 LAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCMSTRYAVAVSRIKDAE 414
AAL A+P+AA +W L GD S +++ +L P+ + A +R+ E
Sbjct: 183 KAALGANPEAAVLWNTLGTVLCNIGDAAGSIVFFDESLRLAPDFSKAYHNRAFARLDLGE 242
Score = 38.5 bits (88), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/160 (25%), Positives = 67/160 (41%), Gaps = 16/160 (10%)
Query: 153 AHFLLGLMYQRLGQPLKAVSSYEKAEEILLR---CEADIAR-------PELLSLVQIHHA 202
A +L + ++ G ++ +YE A +L D+ R PEL + H+
Sbjct: 93 AWHILAIAREKTGDFASSLRAYEAALALLPDHGPVAGDLGRLAFRMNMPELAAKFFAHYR 152
Query: 203 QCLLPESSGDNSLD---KELEPEELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQ 259
+ G N+L +EL E E + LK ++ ++ AV+WNTLG +L G
Sbjct: 153 LARPDDVEGANNLACALRELNRES--EAIEVLKAALGANPEAAVLWNTLGTVLCNIGDAA 210
Query: 260 SSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQS-AKC 298
SI L + P+ N A G++E + A C
Sbjct: 211 GSIVFFDESLRLAPDFSKAYHNRAFARLDLGEIEAALADC 250
>sp|P10505|APC3_SCHPO Anaphase-promoting complex subunit 3 OS=Schizosaccharomyces pombe
(strain 972 / ATCC 24843) GN=nuc2 PE=1 SV=3
Length = 665
Score = 43.9 bits (102), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 67/148 (45%), Gaps = 21/148 (14%)
Query: 249 GLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQN 308
G+ LL +L+ +++ SL N + LGI YF+ D E+S + FQ L+D +
Sbjct: 336 GVYLLAQYKLREALNCFQSLPIEQQNTPFVLAKLGITYFELVDYEKSEEVFQK--LRDLS 393
Query: 309 HPAALINYAALLLCKYGSVLAGAGANTGEGACLDQASAVNVAKECLLAALKADPKAAHIW 368
P+ + K V + A + + L +A E L+ +P + W
Sbjct: 394 -PSRV---------KDMEVFSTALWHLQKSVPLSY-----LAHE----TLETNPYSPESW 434
Query: 369 ANLANAYYLTGDHRSSGKCLEKAAKLEP 396
LAN + L +H + KC+ +A +L+P
Sbjct: 435 CILANCFSLQREHSQALKCINRAIQLDP 462
Score = 40.4 bits (93), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 35/159 (22%), Positives = 63/159 (39%), Gaps = 13/159 (8%)
Query: 222 EELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGN 281
EE E+ + +++++ + R W LG++ LK+GR + ++PNN I
Sbjct: 479 EEYEKSKTSFRKAIRVNVRHYNAWYGLGMVYLKTGRNDQADFHFQRAAEINPNNSVLITC 538
Query: 282 LGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLCKYG-----------SVLAG 330
+G+ Y + D +++ + D+ A A +L+ + +A
Sbjct: 539 IGMIYERCKDYKKALDFYDRACKLDEKSSLARFKKAKVLILLHDHDKALVELEQLKAIAP 598
Query: 331 AGANTGE--GACLDQASAVNVAKECLLAALKADPKAAHI 367
AN G Q N+A + A D KA HI
Sbjct: 599 DEANVHFLLGKIFKQMRKKNLALKHFTIAWNLDGKATHI 637
>sp|Q8IYB4|PEX5R_HUMAN PEX5-related protein OS=Homo sapiens GN=PEX5L PE=1 SV=2
Length = 626
Score = 43.5 bits (101), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/233 (21%), Positives = 84/233 (36%), Gaps = 53/233 (22%)
Query: 221 PEELEEILSKLKE------------SMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSL 268
P EE L +LKE ++ D A W LG+ ++ Q++I L
Sbjct: 327 PGAFEEGLKRLKEGDLPVTILFMEAAILQDPGDAEAWQFLGITQAENENEQAAIVALQRC 386
Query: 269 LAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLCKYG--- 325
L + PNN + L ++Y +G + + ++ I + +P Y L+ K G
Sbjct: 387 LELQPNNLKALMALAVSYTNTGHQQDACDALKNWI---KQNP----KYKYLVKSKKGSPG 439
Query: 326 ------------SVLAGAGA-----------------NTGEGACLDQASAVNVAKECLLA 356
SVL G TG G + N A + A
Sbjct: 440 LTRRMSKSPVDSSVLEGVKELYLEAAHQNGDMIDPDLQTGLGVLFHLSGEFNRAIDAFNA 499
Query: 357 ALKADPKAAHIWANLANAYYLTGDHRSSG-KCLEKAAKLEPNCMSTRYAVAVS 408
AL P+ +W L A GD + +A +++P + +RY + +S
Sbjct: 500 ALTVRPEDYSLWNRLG-ATLANGDRSEEAVEAYTRALEIQPGFIRSRYNLGIS 551
>sp|Q7K4B6|TMTC3_DROME Transmembrane and TPR repeat-containing protein CG4050
OS=Drosophila melanogaster GN=CG4050 PE=2 SV=1
Length = 926
Score = 42.7 bits (99), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 56/138 (40%), Gaps = 21/138 (15%)
Query: 202 AQCLLPESSGDNSLDKELEPEELEEIL----------SKLKE---------SMQSDTRQA 242
A+ L P++ S + P L + ++L+E SM+SD QA
Sbjct: 574 AKALFPQAKPGVSYHARIAPNHLNVFINLANLIAKNQTRLEEADHLYRQAISMRSDYVQA 633
Query: 243 VVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDL 302
+ G IL+K R + V L D N D NLG+ + + G +Q+ F
Sbjct: 634 YI--NRGDILMKLNRTAQAQEVYEQALLYDNENADIYYNLGVVFLEQGKSQQAQVYFNKA 691
Query: 303 ILKDQNHPAALINYAALL 320
I H AL+N A LL
Sbjct: 692 IELYPEHEQALLNSAILL 709
>sp|Q8LGU6|CD27B_ARATH Cell division cycle protein 27 homolog B OS=Arabidopsis thaliana
GN=CDC27B PE=1 SV=1
Length = 744
Score = 42.7 bits (99), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 22/101 (21%), Positives = 48/101 (47%)
Query: 222 EELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGN 281
E+ E + + +++ DTR W LG+I L+ +L+ S ++P++ +
Sbjct: 566 EDFENGMKSYQNALRVDTRHYNAWYGLGMIYLRQEKLEFSEHHFRMAFLINPSSSVIMSY 625
Query: 282 LGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLC 322
LG + E++ + + I+ D+ +P + A +L+C
Sbjct: 626 LGTSLHALKRSEEALEIMEQAIVADRKNPLPMYQKANILVC 666
>sp|P33292|PEX5_PICPA Peroxisomal targeting signal receptor OS=Komagataella pastoris
GN=PEX5 PE=3 SV=2
Length = 576
Score = 42.4 bits (98), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 80/216 (37%), Gaps = 26/216 (12%)
Query: 231 LKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSG 290
+ ++Q D + W LG + ++ + I+ L L +DP N + L I+Y G
Sbjct: 301 FEAAVQQDPKHVDAWLKLGEVQTQNEKESDGIAALEKCLELDPTNLAALMTLAISYINDG 360
Query: 291 ------------------DMEQSAKCFQ-DLILKD---QNHPAA-LINYAALLLCKYGSV 327
D+ A+ DL D QN L AA L S+
Sbjct: 361 YDNAAYATLERWIETKYPDIASRARSSNPDLDGGDRIEQNKRVTELFMKAAQLSPDVASM 420
Query: 328 LAGAGANTGEGACLDQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKC 387
A TG G + +C AA++ +P A W L A + +
Sbjct: 421 --DADVQTGLGVLFYSMEEFDKTIDCFKAAIEVEPDKALNWNRLGAALANYNKPEEAVEA 478
Query: 388 LEKAAKLEPNCMSTRYAVAVSRIKDAERSQEPTEQL 423
+A +L PN + RY + VS I + R +E E L
Sbjct: 479 YSRALQLNPNFVRARYNLGVSFI-NMGRYKEAVEHL 513
Score = 37.0 bits (84), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 17/69 (24%), Positives = 35/69 (50%)
Query: 222 EELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGN 281
EE ++ + K +++ + +A+ WN LG L + + ++ S L ++PN N
Sbjct: 436 EEFDKTIDCFKAAIEVEPDKALNWNRLGAALANYNKPEEAVEAYSRALQLNPNFVRARYN 495
Query: 282 LGIAYFQSG 290
LG+++ G
Sbjct: 496 LGVSFINMG 504
>sp|P19737|Y425_SYNP2 TPR repeat-containing protein SYNPCC7002_A0425 OS=Synechococcus sp.
(strain ATCC 27264 / PCC 7002 / PR-6)
GN=SYNPCC7002_A0425 PE=4 SV=2
Length = 387
Score = 42.4 bits (98), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 24/90 (26%), Positives = 45/90 (50%)
Query: 226 EILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIA 285
E ++ + + + + A +N LG L +SG +S+I+ + PNN L
Sbjct: 115 EAVTAYRRATELEDDNAEFFNALGFNLAQSGDNRSAINAYQRATQLQPNNLAYSLGLATV 174
Query: 286 YFQSGDMEQSAKCFQDLILKDQNHPAALIN 315
F++GD +Q+ ++ ++ KD N+ AL N
Sbjct: 175 QFRAGDYDQALVAYRKVLAKDSNNTMALQN 204
>sp|Q8IUR5|TMTC1_HUMAN Transmembrane and TPR repeat-containing protein 1 OS=Homo sapiens
GN=TMTC1 PE=1 SV=3
Length = 882
Score = 41.6 bits (96), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 46/199 (23%), Positives = 90/199 (45%), Gaps = 27/199 (13%)
Query: 222 EELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGN 281
E+ EE ++ LK+S++ A +++L +L + R + + + + + P++ D N
Sbjct: 562 EKKEEAITLLKDSIKYGPEFADAYSSLASLLAEQERFKEAEEIYQTGIKNCPDSSDLHNN 621
Query: 282 LGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLCKYGSVLAGAGANTGEGACL 341
G+ +G E++ +Q I +H A++N G + G N+
Sbjct: 622 YGVFLVDTGLPEKAVAHYQQAIKLSPSHHVAMVN--------LGRLYRSLGENS------ 667
Query: 342 DQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCMST 401
+A+E AL+ K A I + L YY TG + + + ++AA L+P+
Sbjct: 668 -------MAEEWYKRALQVAHK-AEILSPLGALYYNTGRYEEALQIYQEAAALQPSQREL 719
Query: 402 RYAVA-----VSRIKDAER 415
R A+A + + K+AE+
Sbjct: 720 RLALAQVLAVMGQTKEAEK 738
>sp|O18158|OGT1_CAEEL UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
OS=Caenorhabditis elegans GN=ogt-1 PE=1 SV=2
Length = 1151
Score = 41.6 bits (96), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 40/181 (22%), Positives = 74/181 (40%), Gaps = 21/181 (11%)
Query: 219 LEPEELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDC 278
+ + LE+ + +++ + + A ++ LG + G+LQ ++ + + P D
Sbjct: 170 FQTKNLEKSMQYSMLAIKVNNQCAEAYSNLGNYYKEKGQLQDALENYKLAVKLKPEFIDA 229
Query: 279 IGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLCKYGSVLAGAGANTGEG 338
NL A GD+EQ+ + + + Q +P L C + G
Sbjct: 230 YINLAAALVSGGDLEQAVTAYFNAL---QINPD--------LYC----------VRSDLG 268
Query: 339 ACLDQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNC 398
L + AK C L A++ P+ A W+NL + G+ + EKA L+PN
Sbjct: 269 NLLKAMGRLEEAKVCYLKAIETQPQFAVAWSNLGCVFNSQGEIWLAIHHFEKAVTLDPNF 328
Query: 399 M 399
+
Sbjct: 329 L 329
Score = 35.0 bits (79), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 70/308 (22%), Positives = 125/308 (40%), Gaps = 45/308 (14%)
Query: 151 AHAHFLLGLMYQRLGQPLKAVSSYEKAEEILLRCEADIARPELLSLVQIHHAQCLLPESS 210
A A+ LG Y+ GQ A+ +Y+ A ++ +PE + I+ A L+ S
Sbjct: 193 AEAYSNLGNYYKEKGQLQDALENYKLAVKL---------KPEFID-AYINLAAALV--SG 240
Query: 211 GDNSLDKELEPEELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLA 270
GD LE+ ++ ++Q + V + LG +L GRL+ + +
Sbjct: 241 GD-----------LEQAVTAYFNALQINPDLYCVRSDLGNLLKAMGRLEEAKVCYLKAIE 289
Query: 271 VDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALL---------L 321
P NLG + G++ + F+ + D N A IN +L +
Sbjct: 290 TQPQFAVAWSNLGCVFNSQGEIWLAIHHFEKAVTLDPNFLDAYINLGNVLKEARIFDRAV 349
Query: 322 CKYGSVLAGAGANT---GEGACL-DQASAVNVAKECLLAALKADPKAAHIWANLANAYYL 377
Y L +G + G AC+ + +++A + A+ P + NLANA
Sbjct: 350 SAYLRALNLSGNHAVVHGNLACVYYEQGLIDLAIDTYKKAIDLQPHFPDAYCNLANALKE 409
Query: 378 TGDHRSSGKCLEKAAKLEPNCMSTRYAVA-----VSRIKDAER----SQEPTEQLSWAGN 428
G + + KA +L P ++ +A +I+DA R + E + + A +
Sbjct: 410 KGSVVEAEQMYMKALELCPTHADSQNNLANIKREQGKIEDATRLYLKALEIYPEFAAAHS 469
Query: 429 EMASILRE 436
+ASIL++
Sbjct: 470 NLASILQQ 477
>sp|Q6CT48|PEX5_KLULA Peroxisomal targeting signal receptor OS=Kluyveromyces lactis
(strain ATCC 8585 / CBS 2359 / DSM 70799 / NBRC 1267 /
NRRL Y-1140 / WM37) GN=PEX5 PE=3 SV=1
Length = 566
Score = 41.6 bits (96), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 42/205 (20%), Positives = 77/205 (37%), Gaps = 19/205 (9%)
Query: 223 ELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNL 282
+L E + ++Q D W LGL+ ++ + + IS L L++DP N + +
Sbjct: 293 KLSEAALAFEAAVQEDPAHVDAWLKLGLVQTQNEKEMNGISALEQCLSLDPTNQQALMTI 352
Query: 283 GIAYFQSGDMEQSAKCFQDLILKDQNHPAAL------------INYAALLLCKYGSVL-- 328
I+Y G + +A + L D +P N + ++ KY V
Sbjct: 353 SISYINEG-YDLTAFSMLNRWL-DSKYPELTRSPTIDEANIDRFNLSKQVITKYLQVANA 410
Query: 329 ---AGAGANTGEGACLDQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSG 385
G G +C AL+ +P +W L + + +
Sbjct: 411 LPQVDPEVQLGLGTLFYANEEFGKTIDCFRTALEVNPNDELMWNRLGASLANSNRSEEAI 470
Query: 386 KCLEKAAKLEPNCMSTRYAVAVSRI 410
+ KA L+P+ + RY +A+S +
Sbjct: 471 QAYHKALALKPSFVRARYNLAISSM 495
>sp|Q920N5|PEX5_CRIGR Peroxisomal targeting signal 1 receptor OS=Cricetulus griseus
GN=PEX5 PE=1 SV=1
Length = 640
Score = 41.2 bits (95), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 80/192 (41%), Gaps = 22/192 (11%)
Query: 221 PEELEEILSKLKE------------SMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSL 268
P+ EE L +L+E ++Q D + W LG ++ + +IS L
Sbjct: 337 PQAFEEGLRRLEEGDLPNAVLLFEAAVQQDPKHMEAWQYLGTTQAENEQELLAISALRRC 396
Query: 269 LAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLL-CKYGSV 327
L + P+N + L +++ Q+ + +D + + PA YA L+ + G+
Sbjct: 397 LELKPDNRTALMALAVSFTNESLQRQACETLRDWL---RYSPA----YAHLVTPGEEGAS 449
Query: 328 LAGAGANTGEGACLDQASAVNVAKECLLAALKADPKA--AHIWANLANAYYLTGDHRSSG 385
AG G + L S KE LAA++ DP + + L + L+G++ +
Sbjct: 450 GAGLGPSKRVLGSLLSDSLFLEVKELFLAAVRLDPTSIDPDVQCGLGVLFNLSGEYDKAV 509
Query: 386 KCLEKAAKLEPN 397
C A + PN
Sbjct: 510 DCFTAALSVRPN 521
>sp|Q99144|PEX5_YARLI Peroxisomal targeting signal receptor OS=Yarrowia lipolytica
(strain CLIB 122 / E 150) GN=PAY32 PE=3 SV=1
Length = 598
Score = 41.2 bits (95), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 48/211 (22%), Positives = 82/211 (38%), Gaps = 27/211 (12%)
Query: 224 LEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLG 283
L E + ++Q +T A W LG ++ + +I L + ++P N + NL
Sbjct: 320 LSEAALAFEAAVQKNTEHAEAWGRLGACQAQNEKEDPAIRALERCIKLEPGNLSALMNLS 379
Query: 284 IAYFQSGDMEQSAKC---------FQDLILKDQNHPAALINYAALLL-CKYGSVLAGAGA 333
++Y G E +A + +++ + +N L N L + + A
Sbjct: 380 VSYTNEG-YENAAYATLERWLATKYPEVVDQARNQEPRLGNEDKFQLHSRVTELFIRAAQ 438
Query: 334 NTGEGACLDQASAVNV------------AKECLLAALKADPKAAHIWANLANAYYLTGDH 381
+ +GA +D V + A +C AA+ P A +W L L H
Sbjct: 439 LSPDGANIDADVQVGLGVLFYGNEEYDKAIDCFNAAIAVRPDDALLWNRLGAT--LANSH 496
Query: 382 RSSG--KCLEKAAKLEPNCMSTRYAVAVSRI 410
RS KA +L P+ + RY + VS I
Sbjct: 497 RSEEAIDAYYKALELRPSFVRARYNLGVSCI 527
>sp|Q8NBP0|TTC13_HUMAN Tetratricopeptide repeat protein 13 OS=Homo sapiens GN=TTC13 PE=2
SV=3
Length = 860
Score = 41.2 bits (95), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 25/97 (25%), Positives = 45/97 (46%), Gaps = 1/97 (1%)
Query: 231 LKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSG 290
++S++ + Q + GL G L+ +I L + D +LG AY + G
Sbjct: 272 FQQSLELNKNQPIAMLYKGLTFFHRGLLKEAIESFKEALKQKVDFIDAYKSLGQAYRELG 331
Query: 291 DMEQSAKCFQDLILKDQNHPAALINYAALLLCKYGSV 327
+ E + + FQ +L +QNH L ++L +GS+
Sbjct: 332 NFEAATESFQKALLLNQNHVQTL-QLRGMMLYHHGSL 367
>sp|Q2M2R8|PEX5_RAT Peroxisomal targeting signal 1 receptor OS=Rattus norvegicus
GN=Pex5 PE=1 SV=2
Length = 640
Score = 41.2 bits (95), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 80/192 (41%), Gaps = 22/192 (11%)
Query: 221 PEELEEILSKLKE------------SMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSL 268
P+ EE L +L+E ++Q D + W LG ++ + +IS L
Sbjct: 337 PQPFEEGLRRLEEGDLPNAVLLFEAAVQQDPKHMEAWQYLGTTQAENEQELLAISALRRC 396
Query: 269 LAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLL-CKYGSV 327
L + P+N + L +++ Q+ + +D + + PA YA L+ + G+
Sbjct: 397 LELKPDNRTALMALAVSFTNESLQRQACETLRDWL---RYSPA----YAHLVAPGEEGAS 449
Query: 328 LAGAGANTGEGACLDQASAVNVAKECLLAALKADPKA--AHIWANLANAYYLTGDHRSSG 385
AG G + L S KE LAA++ DP + + L + L+G++ +
Sbjct: 450 GAGLGPSKRVLGSLLSDSLFLEVKELFLAAVRLDPTSIDPDVQCGLGVLFNLSGEYDKAV 509
Query: 386 KCLEKAAKLEPN 397
C A + PN
Sbjct: 510 DCFTAALSVRPN 521
>sp|Q58741|Y1345_METJA TPR repeat-containing protein MJ1345 OS=Methanocaldococcus
jannaschii (strain ATCC 43067 / DSM 2661 / JAL-1 / JCM
10045 / NBRC 100440) GN=MJ1345 PE=4 SV=1
Length = 314
Score = 40.8 bits (94), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 37/174 (21%), Positives = 76/174 (43%), Gaps = 23/174 (13%)
Query: 148 PKNAHAHFLLGLMYQRLGQPLKAVSSYEKAEEILLRCEADIARPELLSLVQIHHAQCLLP 207
PK A+FL G + LG+ E+A+E+ L+ + R E S + + + +
Sbjct: 76 PKYILANFLKGALLVSLGK-------LEEAKEVFLK----LCRLEK-SDLPVKYVTAFIL 123
Query: 208 ESSGDNSLDKELEPEELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSS 267
+ G E + L + + ++ + A+ W G IL + G+L+ S+ +
Sbjct: 124 KKLG-----------EYDYALKIIDKILKKYPKSAIAWAEKGEILYREGKLKKSLECFDN 172
Query: 268 LLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLL 321
L ++P +C + G F+ G ++ KC + + ++ AL+ +L+
Sbjct: 173 ALKINPKDCQSLLYKGEILFKLGRYGEALKCLKKVFERNNKDIRALMYIIQILI 226
>sp|Q752X0|PEX5_ASHGO Peroxisomal targeting signal receptor OS=Ashbya gossypii (strain
ATCC 10895 / CBS 109.51 / FGSC 9923 / NRRL Y-1056)
GN=PEX5 PE=3 SV=5
Length = 569
Score = 40.4 bits (93), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 44/202 (21%), Positives = 79/202 (39%), Gaps = 16/202 (7%)
Query: 223 ELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNL 282
+L E + ++Q D W LGL+ ++ + S I+ L L DP+N + +
Sbjct: 297 KLSEAALAFEAAVQQDPGHVDAWLRLGLVQTQNEKELSGINALEQCLKADPHNLMALMTV 356
Query: 283 GIAYFQSGDMEQSAKCFQDLILKDQNHPAAL---------INYAALLLCKYGSVL----- 328
I+Y G + SA L+ + +PA + N + L++ +Y V
Sbjct: 357 AISYINEG-YDVSAFTMLGRWLETK-YPAFVEEPLDRVDRYNLSRLIIEQYLRVANALPE 414
Query: 329 AGAGANTGEGACLDQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCL 388
G G + +C AAL P +W L + + + +
Sbjct: 415 VDPDVQLGLGILFYANEDFDKTIDCFRAALAVRPDDECMWNRLGASLANSNRSEEAIQAY 474
Query: 389 EKAAKLEPNCMSTRYAVAVSRI 410
+A +L+P + RY +AVS +
Sbjct: 475 HRAIQLKPTFVRARYNLAVSSM 496
>sp|Q925N3|PEX5R_RAT PEX5-related protein OS=Rattus norvegicus GN=Pex5l PE=1 SV=1
Length = 602
Score = 40.4 bits (93), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 51/235 (21%), Positives = 85/235 (36%), Gaps = 53/235 (22%)
Query: 221 PEELEEILSKLKE------------SMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSL 268
P EE L +LKE ++ D A W LG+ ++ Q++I L
Sbjct: 303 PGAFEEGLKRLKEGDLPVTILFMEAAILQDPGNAEAWQFLGITQAENENEQAAIVALQRC 362
Query: 269 LAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLCKYG--- 325
L + PNN + L ++Y + + + + ++ I + +P Y L+ K G
Sbjct: 363 LELQPNNLKALMALAVSYTNTSHQQDACEALKNWI---KQNP----KYKYLVKNKKGSPG 415
Query: 326 ------------SVLAGA-----------------GANTGEGACLDQASAVNVAKECLLA 356
SVL G TG G + N A + A
Sbjct: 416 LTRRMSKSPVDSSVLEGVKDLYLEAAHQNGDMIDPDLQTGLGVLFHLSGEFNRAIDAFNA 475
Query: 357 ALKADPKAAHIWANLANAYYLTGDHRSSG-KCLEKAAKLEPNCMSTRYAVAVSRI 410
AL P+ +W L A GD + +A +++P + +RY + +S I
Sbjct: 476 ALTVRPEDYSLWNRLG-ATLANGDRSEEAVEAYTRALEIQPGFIRSRYNLGISCI 529
>sp|Q8BRH0|TMTC3_MOUSE Transmembrane and TPR repeat-containing protein 3 OS=Mus musculus
GN=Tmtc3 PE=2 SV=2
Length = 920
Score = 40.4 bits (93), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 65/262 (24%), Positives = 109/262 (41%), Gaps = 42/262 (16%)
Query: 234 SMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDME 293
SM+ D +QA + + G +LLK + + L +D NN D NL I Y + +
Sbjct: 562 SMRPDFKQAYI--SRGELLLKMNKPLKAKEAYLKALELDRNNADLWYNLAIVYIELKEPN 619
Query: 294 QSAKCFQDLILKDQNHPAALINYAALLLCKYGSVLAGAGANTGEGACLDQASAVNVAKEC 353
++ K F + + H AL N A L+ +GE + A++
Sbjct: 620 EALKNFNRALELNPKHKLALFNSAILM------------QESGEVKLRPE------ARKR 661
Query: 354 LLAALKADPKAAHIWANLANAYYLTGDHRSSGKC---LEKAAKLEPNCMSTRYAVAVSRI 410
LL + +P+ A+ + NL L D + + ++KA KL+P+ S + +A+
Sbjct: 662 LLNYVNEEPQDANGYFNLG---MLAMDDKKDSEAESWMKKAIKLQPDFRSALFNLALLYS 718
Query: 411 KDA-ERSQEPT--EQLSWAGNEMASILREGDPV---QIEPPIAWAGF--------AAVQK 456
+ A E P E L + + ++ +GD + + + P A F + VQ
Sbjct: 719 QTAKELKALPILEELLKYYPDHTKGLILKGDILMNQKKDIPGAKKCFEKILEMDPSNVQG 778
Query: 457 THHEVAAAFETEENELSKMEEC 478
H+ F EE EL K E C
Sbjct: 779 KHNLCVVYF--EEKELLKAERC 798
>sp|Q8C437|PEX5R_MOUSE PEX5-related protein OS=Mus musculus GN=Pex5l PE=1 SV=2
Length = 567
Score = 40.4 bits (93), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 51/235 (21%), Positives = 85/235 (36%), Gaps = 53/235 (22%)
Query: 221 PEELEEILSKLKE------------SMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSL 268
P EE L +LKE ++ D A W LG+ ++ Q++I L
Sbjct: 268 PGAFEEGLKRLKEGDLPVTILFMEAAILQDPGDAEAWQFLGITQAENENEQAAIVALQRC 327
Query: 269 LAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLCKYG--- 325
L + PNN + L ++Y + + + + ++ I + +P Y L+ K G
Sbjct: 328 LELQPNNLKALMALAVSYTNTSHQQDACEALKNWI---KQNP----KYKYLVKNKKGSPG 380
Query: 326 ------------SVLAGAGA-----------------NTGEGACLDQASAVNVAKECLLA 356
SVL G TG G + N A + A
Sbjct: 381 LTRRMSKSPVDSSVLEGVKELYLEAAHQNGDMIDPDLQTGLGVLFHLSGEFNRAIDAFNA 440
Query: 357 ALKADPKAAHIWANLANAYYLTGDHRSSG-KCLEKAAKLEPNCMSTRYAVAVSRI 410
AL P+ +W L A GD + +A +++P + +RY + +S I
Sbjct: 441 ALTVRPEDYSLWNRLG-ATLANGDRSEEAVEAYTRALEIQPGFIRSRYNLGISCI 494
>sp|Q8ILC1|STI1L_PLAF7 STI1-like protein OS=Plasmodium falciparum (isolate 3D7)
GN=PF14_0324 PE=4 SV=1
Length = 564
Score = 40.0 bits (92), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 39/176 (22%), Positives = 71/176 (40%), Gaps = 32/176 (18%)
Query: 271 VDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALL--LCKYGSVL 328
+DP+ + N G YF++ D + K + + I ++ N N AA L L +Y S L
Sbjct: 373 IDPDKAEEHKNKGNEYFKNNDFPNAKKEYDEAIRRNPNDAKLYSNRAAALTKLIEYPSAL 432
Query: 329 AGAGANTGEGACLDQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCL 388
E ++ A++ DP ++ N ++ D+ + +
Sbjct: 433 -----------------------EDVMKAIELDPTFVKAYSRKGNLHFFMKDYYKALQAY 469
Query: 389 EKAAKLEPN---CMS--TRYAVAVSRIKDAERSQEPTEQLSWAGNEMASILREGDP 439
K +L+PN C+ R A + + +E+ E + S A E+ I+ DP
Sbjct: 470 NKGLELDPNNKECLEGYQRCAFKIDEMSKSEKVDEEQFKKSMADPEIQQII--SDP 523
>sp|Q6DFB8|TTC37_XENLA Tetratricopeptide repeat protein 37 OS=Xenopus laevis GN=ttc37 PE=2
SV=1
Length = 1564
Score = 39.7 bits (91), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 32/53 (60%)
Query: 354 LLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCMSTRYAVA 406
L AAL+ADPK ++ W L AY G + ++ K KA++L P+ + + Y +A
Sbjct: 587 LHAALRADPKDSNCWECLGEAYLSRGGYTTALKSFMKASELNPDSIYSVYKIA 639
>sp|Q6PGP7|TTC37_HUMAN Tetratricopeptide repeat protein 37 OS=Homo sapiens GN=TTC37 PE=1
SV=1
Length = 1564
Score = 38.9 bits (89), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 31/53 (58%)
Query: 354 LLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCMSTRYAVA 406
L AAL+ADPK + W +L AY G + ++ K KA++L P + + + VA
Sbjct: 587 LQAALRADPKDFNCWESLGEAYLSRGGYTTALKSFTKASELNPESIYSVFKVA 639
Score = 35.8 bits (81), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 54/109 (49%), Gaps = 7/109 (6%)
Query: 194 LSLVQIHHAQCLLPESSGDNSLDKELEPEELEEILSKLKESMQSDTRQAVVWNTLGLILL 253
L + AQ L S N L KEL LE+ L LK++++ D+ + WN LG++
Sbjct: 782 LGINYYRQAQHLAETGSNMNDL-KEL----LEKSLHCLKKAVRLDSNNHLYWNALGVVAC 836
Query: 254 KSGRLQSSISVLSSLLAVDPNNCDCIG--NLGIAYFQSGDMEQSAKCFQ 300
SG +++ + ++ + + NLG+ Y + ++EQ+ + F+
Sbjct: 837 YSGIGNYALAQHCFIKSIQSEQINAVAWTNLGVLYLTNENIEQAHEAFK 885
>sp|Q01495|PEX5_PICAN Peroxisomal targeting signal receptor OS=Pichia angusta GN=PEX5
PE=3 SV=1
Length = 569
Score = 38.9 bits (89), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 19/69 (27%), Positives = 34/69 (49%)
Query: 222 EELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGN 281
EE + L + +++ + A+ WN LG L S + + +I S L ++PN N
Sbjct: 430 EEYSKTLDCFQAAIEHNPNDALAWNRLGASLANSNKPEQAIEAYSRTLQLNPNFVRARYN 489
Query: 282 LGIAYFQSG 290
LG+++ G
Sbjct: 490 LGVSFINMG 498
>sp|P17883|SKI3_YEAST Superkiller protein 3 OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=SKI3 PE=1 SV=2
Length = 1432
Score = 38.5 bits (88), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 28/57 (49%)
Query: 352 ECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEPNCMSTRYAVAVS 408
E +AL+ DP W L AY+ G +S K +KA +L P+ +Y A+S
Sbjct: 723 EWFQSALRVDPNDVESWVGLGQAYHACGRIEASIKVFDKAIQLRPSHTFAQYFKAIS 779
>sp|Q13099|IFT88_HUMAN Intraflagellar transport protein 88 homolog OS=Homo sapiens
GN=IFT88 PE=2 SV=2
Length = 833
Score = 38.5 bits (88), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 18/60 (30%), Positives = 32/60 (53%), Gaps = 1/60 (1%)
Query: 244 VWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLI 303
+ +G+ +++G+ +I+ ++++ PN NL I YF GD E+ K FQ LI
Sbjct: 283 IMQNIGVTFIQAGQYSDAINSYEHIMSMAPN-LKAGYNLTICYFAIGDREKMKKAFQKLI 341
>sp|Q1RMV0|PEX5_BOVIN Peroxisomal targeting signal 1 receptor OS=Bos taurus GN=PEX5 PE=2
SV=1
Length = 640
Score = 38.1 bits (87), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 43/192 (22%), Positives = 80/192 (41%), Gaps = 22/192 (11%)
Query: 221 PEELEEILSKLKE------------SMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSL 268
P+ EE L +L+E ++Q D + W LG ++ + +IS L
Sbjct: 337 PQPFEEGLRRLQEGDLPNAVLLFEAAVQQDPKHMEAWQYLGTTQAENEQELLAISALRKC 396
Query: 269 LAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLL-CKYGSV 327
L + P+N + L +++ Q+ + +D + + PA YA L+ + G+
Sbjct: 397 LELKPDNRTALMALAVSFTNESLQRQACETLRDWL---RYTPA----YAHLVAPGEEGAG 449
Query: 328 LAGAGANTGEGACLDQASAVNVAKECLLAALKADPKA--AHIWANLANAYYLTGDHRSSG 385
G G++ L S KE LAA++ DP + + L + L+G++ +
Sbjct: 450 GVGLGSSKRILGSLLSDSLFLEVKELFLAAVRLDPTSIDPDVQCGLGVLFNLSGEYDKAV 509
Query: 386 KCLEKAAKLEPN 397
C A + P+
Sbjct: 510 DCFTAALSVRPD 521
>sp|Q04364|YMP8_YEAST TPR repeat-containing protein YMR018W OS=Saccharomyces cerevisiae
(strain ATCC 204508 / S288c) GN=YMR018W PE=1 SV=1
Length = 514
Score = 38.1 bits (87), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 19/63 (30%), Positives = 30/63 (47%)
Query: 242 AVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQD 301
+WN G IL + S+I+ + + PN NL IAY GD +++K +
Sbjct: 394 GTIWNRYGAILANTKSYHSAINAYNKCKQLRPNFTRVRYNLAIAYMNKGDYVKASKMLIE 453
Query: 302 LIL 304
+IL
Sbjct: 454 VIL 456
Score = 37.7 bits (86), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 54/128 (42%), Gaps = 24/128 (18%)
Query: 264 VLSSLLAV---DPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALL 320
+L SLL + + D L I Y+ ++QS KC + L+L+ N+ +
Sbjct: 345 ILESLLNMGIEKKDQYDIYSVLSILYYSDQKIKQSQKCLEFLLLEKPNN--------GTI 396
Query: 321 LCKYGSVLAGAGANTGEGACLDQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGD 380
+YG++LA + SA+N +C + P + NLA AY GD
Sbjct: 397 WNRYGAILANTKS---------YHSAINAYNKCK----QLRPNFTRVRYNLAIAYMNKGD 443
Query: 381 HRSSGKCL 388
+ + K L
Sbjct: 444 YVKASKML 451
Score = 36.6 bits (83), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 44/209 (21%), Positives = 81/209 (38%), Gaps = 18/209 (8%)
Query: 224 LEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLG 283
L E++ + ++ D W LG++ ++ + L + L +DPNN + NL
Sbjct: 235 LNEVVMAFEAAISQDPSHINAWLKLGIVNFENESESNGELALRNCLNLDPNNTIALENLA 294
Query: 284 IAYFQSGDMEQSAKCFQDLILKDQN---HPAALINYAALLLCKYGSVLAG---AGANTG- 336
I + + +S K F IL + P+A N ++ + LA + N G
Sbjct: 295 IHHINQQNESESLKLFHKWILSKFSKVFQPSAGENKDSINKIPKKAHLAHILESLLNMGI 354
Query: 337 -EGACLDQASAVNV----------AKECLLAALKADPKAAHIWANLANAYYLTGDHRSSG 385
+ D S +++ +++CL L P IW T + S+
Sbjct: 355 EKKDQYDIYSVLSILYYSDQKIKQSQKCLEFLLLEKPNNGTIWNRYGAILANTKSYHSAI 414
Query: 386 KCLEKAAKLEPNCMSTRYAVAVSRIKDAE 414
K +L PN RY +A++ + +
Sbjct: 415 NAYNKCKQLRPNFTRVRYNLAIAYMNKGD 443
>sp|Q9DBB4|NAA16_MOUSE N-alpha-acetyltransferase 16, NatA auxiliary subunit OS=Mus
musculus GN=Naa16 PE=2 SV=1
Length = 864
Score = 38.1 bits (87), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 22/93 (23%), Positives = 41/93 (44%)
Query: 225 EEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGI 284
EE +++ ++SD R V W+ GL+ + +I + L +D +N + +L +
Sbjct: 63 EEAYEFVRKGLRSDVRSHVCWHVYGLLQRSDKKYDEAIKCYRNALKLDKDNLQILRDLSL 122
Query: 285 AYFQSGDMEQSAKCFQDLILKDQNHPAALINYA 317
Q D+E + L+ A+ I YA
Sbjct: 123 LQIQMRDLEGYRETRYQLLQLRPTQRASWIGYA 155
>sp|O09012|PEX5_MOUSE Peroxisomal targeting signal 1 receptor OS=Mus musculus GN=Pex5
PE=2 SV=2
Length = 639
Score = 37.7 bits (86), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 41/191 (21%), Positives = 78/191 (40%), Gaps = 22/191 (11%)
Query: 221 PEELEEILSKLKE------------SMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSL 268
P+ EE L +L+E ++Q D + W LG ++ + +IS L
Sbjct: 338 PQPFEEGLHRLEEGDLPNAVLLFEAAVQQDPKHMEAWQYLGTTQAENEQELLAISALRRC 397
Query: 269 LAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALLLCKYGSVL 328
L + P+N + L +++ Q+ + +D + + PA YA L+
Sbjct: 398 LELKPDNRTALMALAVSFTNESLQRQACETLRDWL---RYSPA----YAHLVAPGEEGAT 450
Query: 329 AGAGANTGEGACLDQASAVNVAKECLLAALKADPKA--AHIWANLANAYYLTGDHRSSGK 386
+ G+ L + + V K+ LAA++ DP + + L + L+G++ +
Sbjct: 451 GAGPSKRILGSLLSDSLFLEV-KDLFLAAVRLDPTSIDPDVQCGLGVLFNLSGEYDKAVD 509
Query: 387 CLEKAAKLEPN 397
C A + PN
Sbjct: 510 CFTAALSVRPN 520
>sp|Q6YZI0|SPY_ORYSJ Probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY OS=Oryza sativa
subsp. japonica GN=SPY PE=3 SV=1
Length = 927
Score = 37.7 bits (86), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 41/196 (20%), Positives = 81/196 (41%), Gaps = 22/196 (11%)
Query: 144 ARKMPKNAHAHFLLGLMYQRLGQPLKAVSSYEKAEEI-----LLRCEADIARPELLSLVQ 198
A + P A A+ +G++Y+ G+ A++ YE+ I + + IA +L + V+
Sbjct: 204 ALERPLYAEAYCNMGVIYKNRGELEAAIACYERCLTISPNFEIAKNNMAIALTDLGTKVK 263
Query: 199 I---------HHAQCLLPESSGDNSLDK------ELEPEELEEILSKLKESMQSDTRQAV 243
I ++ + L +++ E+ E+ + +L ++ + R A
Sbjct: 264 IEGDINQGVAYYKKALFYNWHYADAMYNLGVAYGEMLNFEMAIVFYEL--ALHFNPRCAE 321
Query: 244 VWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLI 303
N LG+I L ++ L++ PN + NLG+ Y G M+ ++ Q I
Sbjct: 322 ACNNLGVIYKDRDNLDKAVECYQMALSIKPNFSQSLNNLGVVYTVQGKMDAASSMIQKAI 381
Query: 304 LKDQNHPAALINYAAL 319
+ + A N L
Sbjct: 382 FANSTYAEAYNNLGVL 397
Score = 33.9 bits (76), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 40/180 (22%), Positives = 76/180 (42%), Gaps = 22/180 (12%)
Query: 144 ARKMPKNAHAHFLLGLMYQRLGQPLKAVSSYEKAEEILLRCEADIA-RP--ELLSLVQIH 200
R P NA A G++Y+ G ++A +Y+KA AD + +P E L++V
Sbjct: 95 VRIDPGNACALTYCGMIYKDEGHLVEAAEAYQKAR------NADPSYKPAAEFLAIVLTD 148
Query: 201 HAQCLLPESSGDNSLDKELEPEELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQS 260
L + +G+ EE + K E+++ D+ A + LG++ + +
Sbjct: 149 LGTSL--KLAGNT-----------EEGIQKYCEALEVDSHYAPAYYNLGVVYSEMMQFDL 195
Query: 261 SISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALL 320
+++ P + N+G+ Y G++E + C++ + N A N A L
Sbjct: 196 ALTCYEKAALERPLYAEAYCNMGVIYKNRGELEAAIACYERCLTISPNFEIAKNNMAIAL 255
>sp|O82422|SPY_HORVU Probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY OS=Hordeum
vulgare GN=SPY PE=2 SV=1
Length = 944
Score = 37.7 bits (86), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 22/86 (25%), Positives = 37/86 (43%)
Query: 234 SMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDME 293
++ + R A N LG+I L ++ L++ PN + NLG+ Y G M+
Sbjct: 312 ALHFNPRCAEACNNLGVIYKDRDNLDKAVECYQMALSIKPNFSQSLNNLGVVYTVQGKMD 371
Query: 294 QSAKCFQDLILKDQNHPAALINYAAL 319
+A + IL + + A N L
Sbjct: 372 AAASMIEKAILANPTYAEAYNNLGVL 397
Score = 36.2 bits (82), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 63/293 (21%), Positives = 104/293 (35%), Gaps = 50/293 (17%)
Query: 148 PKNAHAHFLLGLMYQRLGQPLKAVSSYEKAEEILLRCEADI---ARPELLSLVQIHHAQC 204
PKNA A G++Y+ G ++A +Y+KA AD A E L++V
Sbjct: 99 PKNACALTHCGMIYKDEGHLVEAAEAYQKAR------SADPSYKAASEFLAIVLTDLGTS 152
Query: 205 LLPESSGDNSLDKELEPEELEEILSK--------LKESMQSDTR-------------QAV 243
L + ++ + K E E++ + E MQ D A
Sbjct: 153 LKLAGNTEDGIQKYCEALEVDSHYAPAYYNLGVVYSEMMQFDVALTCYEKAALERPLYAE 212
Query: 244 VWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQ-------SGDMEQSA 296
+ +G+I G L ++I+ L + PN N+ IA GD+ Q
Sbjct: 213 AYCNMGVIYKNRGELDAAIACYDRCLTISPNFEIAKNNMAIALTDLGTKVKIEGDINQGV 272
Query: 297 KCFQDLILKDQNHPAALIN----YAALLLCKYGSVLA---------GAGANTGEGACLDQ 343
++ + + ++ A+ N Y +L + V A A G
Sbjct: 273 AYYKKALFYNWHYADAMYNLGVAYGEMLNFEMAIVFYELALHFNPRCAEACNNLGVIYKD 332
Query: 344 ASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEP 396
++ A EC AL P + NL Y + G ++ +EKA P
Sbjct: 333 RDNLDKAVECYQMALSIKPNFSQSLNNLGVVYTVQGKMDAAASMIEKAILANP 385
Score = 32.7 bits (73), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 34/78 (43%), Gaps = 4/78 (5%)
Query: 324 YGSVLAGAGANT----GEGACLDQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTG 379
Y +VL GAN G+G CL S A +C A+K DPK A + Y G
Sbjct: 57 YTTVLDKDGANVEALIGKGICLQAQSLPRQALDCFTEAVKVDPKNACALTHCGMIYKDEG 116
Query: 380 DHRSSGKCLEKAAKLEPN 397
+ + +KA +P+
Sbjct: 117 HLVEAAEAYQKARSADPS 134
>sp|Q12797|ASPH_HUMAN Aspartyl/asparaginyl beta-hydroxylase OS=Homo sapiens GN=ASPH PE=1
SV=3
Length = 758
Score = 37.7 bits (86), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 16/65 (24%), Positives = 34/65 (52%)
Query: 256 GRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALIN 315
G ++ S+ L L+ + PN+ +LG+ Y GD + + K +++++ N A ++
Sbjct: 434 GHMRGSLLTLQRLVQLFPNDTSLKNDLGVGYLLIGDNDNAKKVYEEVLSVTPNDGFAKVH 493
Query: 316 YAALL 320
Y +L
Sbjct: 494 YGFIL 498
>sp|Q6P3X3|TTC27_HUMAN Tetratricopeptide repeat protein 27 OS=Homo sapiens GN=TTC27 PE=1
SV=1
Length = 843
Score = 37.7 bits (86), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 21/73 (28%), Positives = 33/73 (45%), Gaps = 6/73 (8%)
Query: 352 ECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEP------NCMSTRYAV 405
EC ++K +P +W +L AY D++ S K ++ LEP N +ST Y
Sbjct: 548 ECFERSVKINPMQLGVWFSLGCAYLALEDYQGSAKAFQRCVTLEPDNAEAWNNLSTSYIR 607
Query: 406 AVSRIKDAERSQE 418
++K QE
Sbjct: 608 LKQKVKAFRTLQE 620
Score = 36.2 bits (82), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 67/162 (41%), Gaps = 39/162 (24%)
Query: 161 YQRLGQPLKAVSSYEKAEEILLRCEADIARPELLSLVQIHHAQCLLPESSGDNS-LDK-- 217
Y+R GQ + KAEEIL ++ + E SL CLL + GD+S DK
Sbjct: 476 YERAGQ-------HGKAEEIL---RQELEKKETPSLY------CLLGDVLGDHSCYDKAW 519
Query: 218 -----------------ELEPEELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQS 260
L +E +E + + S++ + Q VW +LG L Q
Sbjct: 520 ELSRYRSARAQRSKALLHLRNKEFQECVECFERSVKINPMQLGVWFSLGCAYLALEDYQG 579
Query: 261 SISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDL 302
S + ++P+N + NL +Y + ++Q K F+ L
Sbjct: 580 SAKAFQRCVTLEPDNAEAWNNLSTSYIR---LKQKVKAFRTL 618
>sp|Q5RBW9|TTC27_PONAB Tetratricopeptide repeat protein 27 OS=Pongo abelii GN=TTC27 PE=2
SV=1
Length = 843
Score = 37.7 bits (86), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 21/73 (28%), Positives = 33/73 (45%), Gaps = 6/73 (8%)
Query: 352 ECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKAAKLEP------NCMSTRYAV 405
EC ++K +P +W +L AY D++ S K ++ LEP N +ST Y
Sbjct: 548 ECFERSVKINPMQLGVWFSLGCAYLALEDYQGSAKAFQRCVTLEPDNAEAWNNLSTSYIR 607
Query: 406 AVSRIKDAERSQE 418
++K QE
Sbjct: 608 LKQKVKAFRTLQE 620
Score = 35.0 bits (79), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 39/162 (24%)
Query: 161 YQRLGQPLKAVSSYEKAEEILLRCEADIARPELLSLVQIHHAQCLLPESSGDNS-LDK-- 217
Y+R GQ + KAEEIL ++ + + SL CLL + GD+S DK
Sbjct: 476 YERAGQ-------HGKAEEIL---RQELEKKQTPSLY------CLLGDVLGDHSCYDKAW 519
Query: 218 -----------------ELEPEELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQS 260
L +E +E + + S++ + Q VW +LG L Q
Sbjct: 520 ELSRYRSARAQRSKALLHLRNKEFQECVECFERSVKINPMQLGVWFSLGCAYLALEDYQG 579
Query: 261 SISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDL 302
S + ++P+N + NL +Y + ++Q K F+ L
Sbjct: 580 SAKAFQRCVTLEPDNAEAWNNLSTSYIR---LKQKVKAFRTL 618
>sp|Q9WX63|BCSC3_GLUXY Cellulose synthase 1 operon protein C OS=Gluconacetobacter xylinus
GN=bcsCI PE=3 SV=1
Length = 1325
Score = 37.7 bits (86), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 16/58 (27%), Positives = 31/58 (53%)
Query: 249 GLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKD 306
G L SGRL ++ S L ++ ++ D +G +G+ + GD ++ + FQ+ + D
Sbjct: 299 GFQQLNSGRLSAAEQSFQSALQINSHDADSLGGMGLVSMRQGDAAEARRYFQEAMAAD 356
>sp|P19450|BCSC1_GLUXY Cellulose synthase operon protein C OS=Gluconacetobacter xylinus
GN=bcsC PE=3 SV=1
Length = 1319
Score = 37.7 bits (86), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 64/293 (21%), Positives = 116/293 (39%), Gaps = 57/293 (19%)
Query: 249 GLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQN 308
G L +GRL ++ S L ++ ++ D +G +G+ + GD ++ + F++ + D
Sbjct: 298 GYQQLNAGRLAAAEQSFQSALQINSHDADSLGGMGLVSMRQGDTAEARRYFEEAMAADPK 357
Query: 309 HPAALINYAALLLCKYGSVLAGAGANTGEGACLDQASAVNVAKEC--LLAALKADPKAAH 366
++ LAG A +GE A + Q A + E LA L P
Sbjct: 358 -----------TADRWRPALAGM-AVSGEYASVRQLIAAHQYTEAKQQLATLARQP---- 401
Query: 367 IWANLANAYYLTGD-HRSSGKCLEKAAKLEPNCMSTRYAVAVSRIKDAERSQEPTEQLSW 425
A + D RS+G+ AA+ E + +R EP QL+
Sbjct: 402 --GQYTGATLMLADLQRSTGQI--AAAEQEYRGILSR---------------EPNNQLAL 442
Query: 426 AGNEMASILREGDPVQIEPPIAWAGFAAVQKTHHEVAAAFETEENELSKMEECAGAGESA 485
G +A + +A A ++ V + ++ E+ A A +++
Sbjct: 443 MG--LARV-----------DMAQGNTAEARQLLSRVGPQYASQVGEIEVSGLMAAASQTS 489
Query: 486 FLDQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKVL 538
D A V++ +E + A P+ + NLANA GD +G+ ++ +L
Sbjct: 490 --DSARKVSILREAMAQA----PRDPWVRINLANALQQQGDVAEAGRVMQPIL 536
>sp|Q8RVB2|SPY_SOLLC Probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY OS=Solanum
lycopersicum GN=SPY PE=2 SV=1
Length = 931
Score = 37.4 bits (85), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 46/196 (23%), Positives = 80/196 (40%), Gaps = 33/196 (16%)
Query: 225 EEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGI 284
E +L K +S++S ++ + L + +GRL + S + VDP N + + GI
Sbjct: 72 ESVLEKDSKSIES-----LIGKGICLQMQNTGRL--AFESFSEAIKVDPQNACALTHCGI 124
Query: 285 AYFQSGDMEQSAKCFQDLILKDQNH-PAALINYAALLLCKYGSVLAGAGANTGEG----- 338
Y G + ++A+ ++ + D ++ PAA A++L G+ L AG NT EG
Sbjct: 125 LYKDEGRLVEAAESYEKALKADPSYTPAA--ECLAIVLTDIGTSLKLAG-NTQEGIQKYY 181
Query: 339 -----------------ACLDQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDH 381
+ ++A C A P A + N+ + GD
Sbjct: 182 EAIKIDSHYAPAYYNLGVVYSEMMQYDMALNCYEKAALERPMYAEAYCNMGVIFKNRGDL 241
Query: 382 RSSGKCLEKAAKLEPN 397
S+ C E+ + PN
Sbjct: 242 ESAIACYERCLAVSPN 257
Score = 35.8 bits (81), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 19/74 (25%), Positives = 32/74 (43%)
Query: 246 NTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILK 305
N LG+I L ++ L++ PN + NLG+ Y G M+ +A + I+
Sbjct: 338 NNLGVIYKDRDNLDKAVECYQLALSIKPNFSQSLNNLGVVYTVQGKMDAAASMIEKAIIA 397
Query: 306 DQNHPAALINYAAL 319
+ + A N L
Sbjct: 398 NPTYAEAYNNLGVL 411
Score = 34.7 bits (78), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 38/174 (21%), Positives = 75/174 (43%), Gaps = 18/174 (10%)
Query: 148 PKNAHAHFLLGLMYQRLGQPLKAVSSYEKAEEILLRCEADIA-RPELLSLVQIHHAQCLL 206
P+NA A G++Y+ G+ ++A SYEKA L+ + E L++V L
Sbjct: 113 PQNACALTHCGILYKDEGRLVEAAESYEKA----LKADPSYTPAAECLAIVLTDIGTSL- 167
Query: 207 PESSGDNSLDKELEPEELEEILSKLKESMQSDTRQAVVWNTLGLILLKSGRLQSSISVLS 266
+ +G+ +E + K E+++ D+ A + LG++ + + +++
Sbjct: 168 -KLAGNT-----------QEGIQKYYEAIKIDSHYAPAYYNLGVVYSEMMQYDMALNCYE 215
Query: 267 SLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLILKDQNHPAALINYAALL 320
P + N+G+ + GD+E + C++ + N A N A L
Sbjct: 216 KAALERPMYAEAYCNMGVIFKNRGDLESAIACYERCLAVSPNFEIAKNNMAIAL 269
>sp|A1Z8E9|BBS4_DROME Bardet-Biedl syndrome 4 protein homolog OS=Drosophila melanogaster
GN=BBS4 PE=3 SV=1
Length = 486
Score = 37.4 bits (85), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 43/186 (23%), Positives = 76/186 (40%), Gaps = 29/186 (15%)
Query: 143 VARKMPKNAHAHFLLGLM-------YQRLGQPLKAVSSYEKAEEILLRC-EADIARPELL 194
VA + A +F L + Y RL + + Y+KA EIL C E+L
Sbjct: 158 VASQQQDEARTYFELAVQSGRKLESYVRLAELYRKDKQYQKAIEILENCLHLTPENSEVL 217
Query: 195 SLVQIHHAQCLLPESSGDN-----SLDKELEPE-------------ELEEILSKLKESMQ 236
+ + + + + + D S++++ P+ +++ LSK +
Sbjct: 218 IEISVLYLKINETQKAHDRLAEVVSIERKCSPKGLLAFGAILQSRNDIDGALSKYSQIAN 277
Query: 237 SDTRQAVVWNTLGLILLKSGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSA 296
++ A +WN +GL K + +IS L + + P N + + NL + Y S EQ A
Sbjct: 278 AEPEIAELWNNIGLCFFKKQKFIVAISSLRKSVWLSPLNYNALYNLSLIYIAS---EQYA 334
Query: 297 KCFQDL 302
F L
Sbjct: 335 SAFHTL 340
Score = 35.0 bits (79), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 48/277 (17%), Positives = 114/277 (41%), Gaps = 27/277 (9%)
Query: 135 REEKGLVHVARKMPKNAHAHFLLGLMYQRLGQPLKAVSSYEKAEEILLRCEADIARPELL 194
REE + R + K+A + Y+ +G+ L + + +A + E +R +
Sbjct: 78 REEGNHIEALRHLQKSAELNPRNIETYKEIGRTLYIMGRFSQALGVFREAEQRSSRQD-- 135
Query: 195 SLVQIHHAQCLLPESSGDNSLDKELEPEELEEILSKLKESMQSDTRQAVVWNTLGLILLK 254
+I+H L + K++ ++ +E + + ++QS R+ + L + K
Sbjct: 136 --HEIYHYLGELLYRAATTQSQKDVASQQQDEARTYFELAVQSG-RKLESYVRLAELYRK 192
Query: 255 SGRLQSSISVLSSLLAVDPNNCDCIGNLGIAYFQSGDMEQSAKCFQDLI-LKDQNHPAAL 313
+ Q +I +L + L + P N + + + + Y + + +++ +++ ++ + P L
Sbjct: 193 DKQYQKAIEILENCLHLTPENSEVLIEISVLYLKINETQKAHDRLAEVVSIERKCSPKGL 252
Query: 314 INYAALLLCKYGSVLAGAGANTGEGACLDQASAVNVAKECLLAALKADPKAAHIWANLAN 373
+ + A+L + N +GA + N A+P+ A +W N+
Sbjct: 253 LAFGAILQSR----------NDIDGALSKYSQIAN-----------AEPEIAELWNNIGL 291
Query: 374 AYYLTGDHRSSGKCLEKAAKLEPNCMSTRYAVAVSRI 410
++ + L K+ L P + Y +++ I
Sbjct: 292 CFFKKQKFIVAISSLRKSVWLSPLNYNALYNLSLIYI 328
>sp|Q6B950|YCF37_GRATL Uncharacterized protein ycf37 OS=Gracilaria tenuistipitata var.
liui GN=ycf37 PE=3 SV=1
Length = 148
Score = 37.4 bits (85), Expect = 0.34, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 29/61 (47%), Gaps = 5/61 (8%)
Query: 335 TGE-----GACLDQASAVNVAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLE 389
TGE G+C +AK+ L A K P HI NLAN Y GD +++ K +
Sbjct: 86 TGEYYNCIGSCYQAIKMYKIAKKYYLKAYKRTPSRQHILKNLANIYNTVGDIKNAKKIYQ 145
Query: 390 K 390
+
Sbjct: 146 R 146
Score = 33.5 bits (75), Expect = 4.4, Method: Composition-based stats.
Identities = 16/43 (37%), Positives = 24/43 (55%)
Query: 495 VAKECLLAALKADPKAAHIWANLANAYYLTGDHRSSGKCLEKV 537
+AK+ L A K P HI NLAN Y GD +++ K +++
Sbjct: 105 IAKKYYLKAYKRTPSRQHILKNLANIYNTVGDIKNAKKIYQRL 147
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.312 0.127 0.360
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 192,465,304
Number of Sequences: 539616
Number of extensions: 7641367
Number of successful extensions: 19908
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 82
Number of HSP's successfully gapped in prelim test: 79
Number of HSP's that attempted gapping in prelim test: 19154
Number of HSP's gapped (non-prelim): 691
length of query: 546
length of database: 191,569,459
effective HSP length: 122
effective length of query: 424
effective length of database: 125,736,307
effective search space: 53312194168
effective search space used: 53312194168
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 64 (29.3 bits)