RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10286
(748 letters)
>gnl|CDD|222983 PHA03098, PHA03098, kelch-like protein; Provisional.
Length = 534
Score = 217 bits (555), Expect = 3e-62
Identities = 113/498 (22%), Positives = 219/498 (43%), Gaps = 25/498 (5%)
Query: 196 DACADFLKKRFHPNNVLDYYVL--FSCRAMEALINFAYSGRVTIHSQNVQSLMVVASFLQ 253
+ +++ KK F NN + + + +I + Y+G++ I S NV+ ++ +A++L
Sbjct: 31 SSSSEYFKKMFK-NNFKENEINLNIDYDSFNEVIKYIYTGKINITSNNVKDILSIANYLI 89
Query: 254 MQKVADACADFLKKRFHPNNVLGIRQFADTLNCLQLSEAADKYVQQYFHEVSMSDEFIGL 313
+ + + C +++ K NN + I +F+ C +L AA Y++ + +FI L
Sbjct: 90 IDFLINLCINYIIKIIDDNNCIDIYRFSFFYGCKKLYSAAYNYIRNNIELIYNDPDFIYL 149
Query: 314 GVNEVNDIVKRSELHLMSEEQVFEAVMRWVKHNASERAPSLPRLLAAVRLPLLSPHYLAD 373
NE+ I+ +L++ SE+ V E +++W+ + + + +L +R+ LS +
Sbjct: 150 SKNELIKILSDDKLNVSSEDVVLEIIIKWLTSKKNNKYKDICLILKVLRITFLSEEGIKK 209
Query: 374 RVATEALIRSSHECRDLVDEARDFHLMPERRFLLAGEKTTPRRCNYVMGHIFAVGGLTKA 433
+ I+ + + ++ + ++ L K PR + + +
Sbjct: 210 LKRWKLRIKKK---KIVFNKRCIKIIYSKKYNL---NKILPRSSTFGSIIYIHIT---MS 260
Query: 434 GDSLSTVEVFDPLVGRWQMAEE--------ETLSNAVISTKSCLTKAGDSLSTVEVFDPL 485
+ + + + PL + + L+N + + K S+++V +D
Sbjct: 261 IFTYNYITNYSPLSEINTIIDIHYVYCFGSVVLNNVIYFIG-GMNKNNLSVNSVVSYDTK 319
Query: 486 VGRWQMAEAMSMLRSRVGVAVMKNRLYAFGGYNGSERLSTVEEFDPVRRVWNKVSPMCFK 545
W + R GV V NR+Y GG S L+TVE + P W + P+ F
Sbjct: 320 TKSWNKVPELIYPRKNPGVTVFNNRIYVIGGIYNSISLNTVESWKPGESKWREEPPLIFP 379
Query: 546 RSAVGAAALNDKLYVCGG-YDGVSSLNTVECYEPDKDQWRIVKSMQKHRSAGGVIAFDSY 604
R +N+ +YV GG L TVEC+ + ++W + G I D
Sbjct: 380 RYNPCVVNVNNLIYVIGGISKNDELLKTVECFSLNTNKWSKGSPLPISHYGGCAIYHDGK 439
Query: 605 VYALGGH---DGLSIFDSVERYDPKTDEWTSVKPMLTKRCRLGVAALNNKIYVCGGYDGA 661
+Y +GG D + +++ VE Y+P T++WT + + R + NNKIYV GG
Sbjct: 440 IYVIGGISYIDNIKVYNIVESYNPVTNKWTELSSLNFPRINASLCIFNNKIYVVGGDKYE 499
Query: 662 IFLQSVEMYDPITDEWKM 679
++ +E+YD T+ W +
Sbjct: 500 YYINEIEVYDDKTNTWTL 517
Score = 148 bits (376), Expect = 6e-38
Identities = 66/238 (27%), Positives = 112/238 (47%), Gaps = 5/238 (2%)
Query: 503 GVAVMKNRLYAFGGYNGS-ERLSTVEEFDPVRRVWNKVSPMCFKRSAVGAAALNDKLYVC 561
G V+ N +Y GG N + +++V +D + WNKV + + R G N+++YV
Sbjct: 289 GSVVLNNVIYFIGGMNKNNLSVNSVVSYDTKTKSWNKVPELIYPRKNPGVTVFNNRIYVI 348
Query: 562 GGYDGVSSLNTVECYEPDKDQWRIVKSMQKHRSAGGVIAFDSYVYALGG-HDGLSIFDSV 620
GG SLNTVE ++P + +WR + R V+ ++ +Y +GG + +V
Sbjct: 349 GGIYNSISLNTVESWKPGESKWREEPPLIFPRYNPCVVNVNNLIYVIGGISKNDELLKTV 408
Query: 621 ERYDPKTDEWTSVKPMLTKRCRLGVAALNNKIYVCGGY---DGAIFLQSVEMYDPITDEW 677
E + T++W+ P+ + KIYV GG D VE Y+P+T++W
Sbjct: 409 ECFSLNTNKWSKGSPLPISHYGGCAIYHDGKIYVIGGISYIDNIKVYNIVESYNPVTNKW 468
Query: 678 KMIASMNVMRSRVALVANMGKLWAIGGYDGVSNLPTVEVYDPSTDSWAFVAPMCAHEG 735
++S+N R +L K++ +GG + +EVYD T++W G
Sbjct: 469 TELSSLNFPRINASLCIFNNKIYVVGGDKYEYYINEIEVYDDKTNTWTLFCKFPKVIG 526
Score = 119 bits (300), Expect = 3e-28
Identities = 53/216 (24%), Positives = 93/216 (43%), Gaps = 13/216 (6%)
Query: 517 YNGSERLSTVEEFDPVRRVWNKVS---PMCFKRSAVGAAALNDKLYVCGGYDG-VSSLNT 572
+ + + P+ + + CF G+ LN+ +Y GG + S+N+
Sbjct: 258 TMSIFTYNYITNYSPLSEINTIIDIHYVYCF-----GSVVLNNVIYFIGGMNKNNLSVNS 312
Query: 573 VECYEPDKDQWRIVKSMQKHRSAGGVIAFDSYVYALGGHDGLSIFDSVERYDPKTDEWTS 632
V Y+ W V + R GV F++ +Y +GG ++VE + P +W
Sbjct: 313 VVSYDTKTKSWNKVPELIYPRKNPGVTVFNNRIYVIGGIYNSISLNTVESWKPGESKWRE 372
Query: 633 VKPMLTKRCRLGVAALNNKIYVCGG-YDGAIFLQSVEMYDPITDEWKMIASMNVMRSRVA 691
P++ R V +NN IYV GG L++VE + T++W + + +
Sbjct: 373 EPPLIFPRYNPCVVNVNNLIYVIGGISKNDELLKTVECFSLNTNKWSKGSPLPISHYGGC 432
Query: 692 LVANMGKLWAIGGY---DGVSNLPTVEVYDPSTDSW 724
+ + GK++ IGG D + VE Y+P T+ W
Sbjct: 433 AIYHDGKIYVIGGISYIDNIKVYNIVESYNPVTNKW 468
Score = 89.4 bits (222), Expect = 1e-18
Identities = 47/196 (23%), Positives = 79/196 (40%), Gaps = 20/196 (10%)
Query: 414 PRRCNYVM---GHIFAVGGLTKAGDSLSTVEVFDPLVGRWQMAEEETL----SNAVISTK 466
PR+ V I+ +GG+ SL+TVE + P +W+ EE L N +
Sbjct: 332 PRKNPGVTVFNNRIYVIGGI-YNSISLNTVESWKPGESKWR--EEPPLIFPRYNPCVVNV 388
Query: 467 SCL-------TKAGDSLSTVEVFDPLVGRWQMAEAMSMLRSRVGVAVMKNRLYAFGGY-- 517
+ L +K + L TVE F +W + + ++Y GG
Sbjct: 389 NNLIYVIGGISKNDELLKTVECFSLNTNKWSKGSPLPISHYGGCAIYHDGKIYVIGGISY 448
Query: 518 -NGSERLSTVEEFDPVRRVWNKVSPMCFKRSAVGAAALNDKLYVCGGYDGVSSLNTVECY 576
+ + + VE ++PV W ++S + F R N+K+YV GG +N +E Y
Sbjct: 449 IDNIKVYNIVESYNPVTNKWTELSSLNFPRINASLCIFNNKIYVVGGDKYEYYINEIEVY 508
Query: 577 EPDKDQWRIVKSMQKH 592
+ + W + K
Sbjct: 509 DDKTNTWTLFCKFPKV 524
Score = 48.6 bits (116), Expect = 1e-05
Identities = 26/91 (28%), Positives = 55/91 (60%), Gaps = 4/91 (4%)
Query: 125 IVLAATIPYFQAMFTSDMAESKQREITMQGIDAVAMEALINFVYSGRVTIHSQNVQSLMV 184
I+L+++ YF+ MF ++ K+ EI + ID + +I ++Y+G++ I S NV+ ++
Sbjct: 28 IILSSSSEYFKKMFKNN---FKENEINLN-IDYDSFNEVIKYIYTGKINITSNNVKDILS 83
Query: 185 VASFLQMQKVADACADFLKKRFHPNNVLDYY 215
+A++L + + + C +++ K NN +D Y
Sbjct: 84 IANYLIIDFLINLCINYIIKIIDDNNCIDIY 114
>gnl|CDD|149006 pfam07707, BACK, BTB And C-terminal Kelch. This domain is found
associated with pfam00651 and pfam01344. The BACK domain
is found juxtaposed to the BTB domain; they are
separated by as little as two residues. This family
appears to be closely related to the BTB domain (Finn
RD, personal observation).
Length = 101
Score = 128 bits (325), Expect = 3e-35
Identities = 50/101 (49%), Positives = 71/101 (70%)
Query: 274 VLGIRQFADTLNCLQLSEAADKYVQQYFHEVSMSDEFIGLGVNEVNDIVKRSELHLMSEE 333
LGI +FADT C +L+E A +++ + F EVS S+EF+ L ++ +++ EL++ SEE
Sbjct: 1 CLGIYRFADTYGCEELAEVALRFILKNFLEVSKSEEFLELSKEQLLELLSSDELNVPSEE 60
Query: 334 QVFEAVMRWVKHNASERAPSLPRLLAAVRLPLLSPHYLADR 374
+VFEAV++WVKH+ R LP LL+AVRLPLLSP YL D
Sbjct: 61 EVFEAVIKWVKHDVENRKKHLPELLSAVRLPLLSPEYLLDV 101
>gnl|CDD|197943 smart00875, BACK, BTB And C-terminal Kelch. The BACK domain is
found juxtaposed to the BTB domain; they are separated
by as little as two residues.
Length = 101
Score = 120 bits (303), Expect = 2e-32
Identities = 48/101 (47%), Positives = 70/101 (69%)
Query: 274 VLGIRQFADTLNCLQLSEAADKYVQQYFHEVSMSDEFIGLGVNEVNDIVKRSELHLMSEE 333
LGIR+FAD +L+E A +++ Q F EVS S+EF+ L + ++ +++ +L++ SEE
Sbjct: 1 CLGIRRFADAHGLEELAEKALRFILQNFSEVSSSEEFLELPLEQLLELLSSDDLNVSSEE 60
Query: 334 QVFEAVMRWVKHNASERAPSLPRLLAAVRLPLLSPHYLADR 374
+VFEAV+RWVKH+ +R LP LL VRLPLLSP YL +
Sbjct: 61 EVFEAVLRWVKHDPEKRREHLPELLEHVRLPLLSPDYLLEV 101
>gnl|CDD|216043 pfam00651, BTB, BTB/POZ domain. The BTB (for BR-C, ttk and bab) or
POZ (for Pox virus and Zinc finger) domain is present
near the N-terminus of a fraction of zinc finger
(pfam00096) proteins and in proteins that contain the
pfam01344 motif such as Kelch and a family of pox virus
proteins. The BTB/POZ domain mediates homomeric
dimerisation and in some instances heteromeric
dimerisation. The structure of the dimerised PLZF
BTB/POZ domain has been solved and consists of a tightly
intertwined homodimer. The central scaffolding of the
protein is made up of a cluster of alpha-helices flanked
by short beta-sheets at both the top and bottom of the
molecule. POZ domains from several zinc finger proteins
have been shown to mediate transcriptional repression
and to interact with components of histone deacetylase
co-repressor complexes including N-CoR and SMRT. The POZ
or BTB domain is also known as BR-C/Ttk or ZiN.
Length = 101
Score = 101 bits (253), Expect = 1e-25
Identities = 42/139 (30%), Positives = 65/139 (46%), Gaps = 38/139 (27%)
Query: 65 MEEIRRQGKLCDVTIKVDDQSFTCHRIVLAATIPYFQAMFTSDMAESKQREITMQGIDAV 124
+ E+R G+LCDVT+ V D+ F H+ VLAA PYF+A+FT +
Sbjct: 1 LNELRENGELCDVTLVVGDKEFHAHKAVLAACSPYFKALFTGNKEV-------------- 46
Query: 125 IVLAATIPYFQAMFTSDMAESKQREITMQGIDAVAMEALINFVYSGRVTIHSQNVQSLMV 184
EIT++ + EAL+ F+Y+G++ I +NV L+
Sbjct: 47 ------------------------EITLEDVSPEDFEALLEFIYTGKLEITEENVDDLLA 82
Query: 185 VASFLQMQKVADACADFLK 203
+A LQ+ + D C +FL
Sbjct: 83 LADKLQIPALIDKCEEFLI 101
Score = 55.3 bits (134), Expect = 2e-09
Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 1/71 (1%)
Query: 197 ACADFLKKRFHPNNVLDYYVL-FSCRAMEALINFAYSGRVTIHSQNVQSLMVVASFLQMQ 255
AC+ + K F N ++ + S EAL+ F Y+G++ I +NV L+ +A LQ+
Sbjct: 31 ACSPYFKALFTGNKEVEITLEDVSPEDFEALLEFIYTGKLEITEENVDDLLALADKLQIP 90
Query: 256 KVADACADFLK 266
+ D C +FL
Sbjct: 91 ALIDKCEEFLI 101
>gnl|CDD|197585 smart00225, BTB, Broad-Complex, Tramtrack and Bric a brac. Domain
in Broad-Complex, Tramtrack and Bric a brac. Also known
as POZ (poxvirus and zinc finger) domain. Known to be a
protein-protein interaction motif found at the N-termini
of several C2H2-type transcription factors as well as
Shaw-type potassium channels. Known structure reveals a
tightly intertwined dimer formed via interactions
between N-terminal strand and helix structures. However
in a subset of BTB/POZ domains, these two secondary
structures appear to be missing. Be aware SMART predicts
BTB/POZ domains without the beta1- and alpha1-secondary
structures.
Length = 97
Score = 90.1 bits (224), Expect = 1e-21
Identities = 38/130 (29%), Positives = 62/130 (47%), Gaps = 34/130 (26%)
Query: 76 DVTIKVDDQSFTCHRIVLAATIPYFQAMFTSDMAESKQREITMQGIDAVIVLAATIPYFQ 135
DVT+ V + F H+ VLAA S PYF+
Sbjct: 1 DVTLVVGGKKFHAHKAVLAA--------------HS--------------------PYFK 26
Query: 136 AMFTSDMAESKQREITMQGIDAVAMEALINFVYSGRVTIHSQNVQSLMVVASFLQMQKVA 195
A+F+SD ES + EI + + AL+NF+Y+G++ + +NV+ L+ +A +LQ+ +
Sbjct: 27 ALFSSDFKESDKSEIYLDDVSPEDFRALLNFLYTGKLDLPEENVEELLELADYLQIPGLV 86
Query: 196 DACADFLKKR 205
+ C +FL K
Sbjct: 87 ELCEEFLLKL 96
Score = 54.2 bits (131), Expect = 4e-09
Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 5/77 (6%)
Query: 197 ACADFLKKRFHPNNVLD-----YYVLFSCRAMEALINFAYSGRVTIHSQNVQSLMVVASF 251
A + + K F + Y S AL+NF Y+G++ + +NV+ L+ +A +
Sbjct: 20 AHSPYFKALFSSDFKESDKSEIYLDDVSPEDFRALLNFLYTGKLDLPEENVEELLELADY 79
Query: 252 LQMQKVADACADFLKKR 268
LQ+ + + C +FL K
Sbjct: 80 LQIPGLVELCEEFLLKL 96
>gnl|CDD|128874 smart00612, Kelch, Kelch domain.
Length = 47
Score = 72.6 bits (179), Expect = 3e-16
Identities = 22/47 (46%), Positives = 29/47 (61%)
Query: 651 KIYVCGGYDGAIFLQSVEMYDPITDEWKMIASMNVMRSRVALVANMG 697
KIYV GG+DG L+SVE+YDP T++W + SM RS + G
Sbjct: 1 KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47
Score = 67.2 bits (165), Expect = 3e-14
Identities = 21/46 (45%), Positives = 28/46 (60%)
Query: 605 VYALGGHDGLSIFDSVERYDPKTDEWTSVKPMLTKRCRLGVAALNN 650
+Y +GG DG SVE YDP+T++WT + M T R GVA +N
Sbjct: 2 IYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47
Score = 64.5 bits (158), Expect = 2e-13
Identities = 19/46 (41%), Positives = 28/46 (60%)
Query: 557 KLYVCGGYDGVSSLNTVECYEPDKDQWRIVKSMQKHRSAGGVIAFD 602
K+YV GG+DG L +VE Y+P+ ++W + SM RS GV +
Sbjct: 1 KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVIN 46
Score = 57.6 bits (140), Expect = 7e-11
Identities = 17/47 (36%), Positives = 26/47 (55%)
Query: 510 RLYAFGGYNGSERLSTVEEFDPVRRVWNKVSPMCFKRSAVGAAALND 556
++Y GG++G +RL +VE +DP W + M RS G A +N
Sbjct: 1 KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47
Score = 51.4 bits (124), Expect = 1e-08
Identities = 20/45 (44%), Positives = 27/45 (60%)
Query: 698 KLWAIGGYDGVSNLPTVEVYDPSTDSWAFVAPMCAHEGGVGVGVI 742
K++ +GG+DG L +VEVYDP T+ W + M G GV VI
Sbjct: 1 KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVI 45
Score = 41.0 bits (97), Expect = 5e-05
Identities = 15/37 (40%), Positives = 20/37 (54%)
Query: 473 GDSLSTVEVFDPLVGRWQMAEAMSMLRSRVGVAVMKN 509
G L +VEV+DP +W +M RS GVAV+
Sbjct: 11 GQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47
Score = 34.1 bits (79), Expect = 0.012
Identities = 12/30 (40%), Positives = 16/30 (53%), Gaps = 1/30 (3%)
Query: 423 HIFAVGGLTKAGDSLSTVEVFDPLVGRWQM 452
I+ VGG G L +VEV+DP +W
Sbjct: 1 KIYVVGGFD-GGQRLKSVEVYDPETNKWTP 29
>gnl|CDD|201739 pfam01344, Kelch_1, Kelch motif. The kelch motif was initially
discovered in Kelch. In this protein there are six
copies of the motif. It has been shown that the
Drosophila ring canal kelch protein is related to
Galactose Oxidase, for which a structure has been
solved. The kelch motif forms a beta sheet. Several of
these sheets associate to form a beta propeller
structure as found in pfam00064, pfam00400 and
pfam00415.
Length = 46
Score = 68.1 bits (167), Expect = 1e-14
Identities = 24/46 (52%), Positives = 27/46 (58%)
Query: 639 KRCRLGVAALNNKIYVCGGYDGAIFLQSVEMYDPITDEWKMIASMN 684
R GV L KIYV GGYDG L SVE+YDP T+ W + SM
Sbjct: 1 PRSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSMP 46
Score = 64.6 bits (158), Expect = 2e-13
Identities = 21/46 (45%), Positives = 28/46 (60%)
Query: 545 KRSAVGAAALNDKLYVCGGYDGVSSLNTVECYEPDKDQWRIVKSMQ 590
RS G L K+YV GGYDG SL++VE Y+P+ + W + SM
Sbjct: 1 PRSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSMP 46
Score = 61.5 bits (150), Expect = 2e-12
Identities = 18/45 (40%), Positives = 26/45 (57%)
Query: 592 HRSAGGVIAFDSYVYALGGHDGLSIFDSVERYDPKTDEWTSVKPM 636
RS GV+ +Y +GG+DG SVE YDP+T+ W+ + M
Sbjct: 1 PRSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSM 45
Score = 58.8 bits (143), Expect = 2e-11
Identities = 19/44 (43%), Positives = 28/44 (63%)
Query: 499 RSRVGVAVMKNRLYAFGGYNGSERLSTVEEFDPVRRVWNKVSPM 542
RS GV V+ ++Y GGY+G + LS+VE +DP W+K+ M
Sbjct: 2 RSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSM 45
Score = 56.5 bits (137), Expect = 2e-10
Identities = 21/45 (46%), Positives = 30/45 (66%)
Query: 686 MRSRVALVANMGKLWAIGGYDGVSNLPTVEVYDPSTDSWAFVAPM 730
RS +V GK++ IGGYDG +L +VEVYDP T++W+ + M
Sbjct: 1 PRSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSM 45
Score = 36.1 bits (84), Expect = 0.003
Identities = 15/41 (36%), Positives = 20/41 (48%), Gaps = 5/41 (12%)
Query: 415 RRCNYVM----GHIFAVGGLTKAGDSLSTVEVFDPLVGRWQ 451
R + G I+ +GG G SLS+VEV+DP W
Sbjct: 1 PRSGAGVVVLGGKIYVIGGYDG-GQSLSSVEVYDPETNTWS 40
>gnl|CDD|165086 PHA02713, PHA02713, hypothetical protein; Provisional.
Length = 557
Score = 70.8 bits (173), Expect = 1e-12
Identities = 43/169 (25%), Positives = 75/169 (44%), Gaps = 19/169 (11%)
Query: 505 AVMKNRLYAFGGYN-GSERLSTVEEFDPVRRVWNKVSPMCFKRSAVGAAALNDKLYVCGG 563
A++ N + GGYN + L+ V + + ++ ++ PM R A ++D +Y GG
Sbjct: 300 AIVDNEIIIAGGYNFNNPSLNKVYKINIENKIHVELPPMIKNRCRFSLAVIDDTIYAIGG 359
Query: 564 YDGVSSLNTVECYEPDKDQWRIVKSMQKHRSAGGVIAFDSYVYALGG------------- 610
+G + T+ECY D+W+++ M S+ G+ D Y+Y +GG
Sbjct: 360 QNGTNVERTIECYTMGDDKWKMLPDMPIALSSYGMCVLDQYIYIIGGRTEHIDYTSVHHM 419
Query: 611 -----HDGLSIFDSVERYDPKTDEWTSVKPMLTKRCRLGVAALNNKIYV 654
+ + V RYD + W ++ T R GV + + IYV
Sbjct: 420 NSIDMEEDTHSSNKVIRYDTVNNIWETLPNFWTGTIRPGVVSHKDDIYV 468
Score = 50.8 bits (121), Expect = 2e-06
Identities = 22/70 (31%), Positives = 37/70 (52%)
Query: 635 PMLTKRCRLGVAALNNKIYVCGGYDGAIFLQSVEMYDPITDEWKMIASMNVMRSRVALVA 694
PM+ RCR +A +++ IY GG +G +++E Y D+WKM+ M + S +
Sbjct: 337 PMIKNRCRFSLAVIDDTIYAIGGQNGTNVERTIECYTMGDDKWKMLPDMPIALSSYGMCV 396
Query: 695 NMGKLWAIGG 704
++ IGG
Sbjct: 397 LDQYIYIIGG 406
Score = 46.5 bits (110), Expect = 4e-05
Identities = 36/138 (26%), Positives = 63/138 (45%), Gaps = 1/138 (0%)
Query: 551 AAALNDKLYVCGGYD-GVSSLNTVECYEPDKDQWRIVKSMQKHRSAGGVIAFDSYVYALG 609
+A +++++ + GGY+ SLN V + + M K+R + D +YA+G
Sbjct: 299 SAIVDNEIIIAGGYNFNNPSLNKVYKINIENKIHVELPPMIKNRCRFSLAVIDDTIYAIG 358
Query: 610 GHDGLSIFDSVERYDPKTDEWTSVKPMLTKRCRLGVAALNNKIYVCGGYDGAIFLQSVEM 669
G +G ++ ++E Y D+W + M G+ L+ IY+ GG I SV
Sbjct: 359 GQNGTNVERTIECYTMGDDKWKMLPDMPIALSSYGMCVLDQYIYIIGGRTEHIDYTSVHH 418
Query: 670 YDPITDEWKMIASMNVMR 687
+ I E +S V+R
Sbjct: 419 MNSIDMEEDTHSSNKVIR 436
Score = 36.1 bits (83), Expect = 0.073
Identities = 26/131 (19%), Positives = 52/131 (39%), Gaps = 36/131 (27%)
Query: 74 LCDVTIKV-DDQSFTCHRIVLAATIPYFQAMFTSDMAESKQREITMQGIDAVIVLAATIP 132
LCDV I + D + H+ +LAA YF+ +FT+ M
Sbjct: 25 LCDVIITIGDGEEIKAHKTILAAGSKYFRTLFTTPM------------------------ 60
Query: 133 YFQAMFTSDMAESKQREITMQGIDAVAMEALINFVYSGRVTIHSQNVQSLMVVASFLQMQ 192
+ + +Q D A++ ++ ++Y+ ++ S NV ++ A +L +
Sbjct: 61 ---------IIRDLVTRVNLQMFDKDAVKNIVQYLYNRHIS--SMNVIDVLKCADYLLID 109
Query: 193 KVADACADFLK 203
+ C ++K
Sbjct: 110 DLVTDCESYIK 120
Score = 33.1 bits (75), Expect = 0.63
Identities = 29/112 (25%), Positives = 48/112 (42%), Gaps = 7/112 (6%)
Query: 623 YDPKTDEWTSVKPMLTKRCRLGVAALNNKIYVCGGYD-GAIFLQSVEMYDPITDEWKMIA 681
Y+ T E++ + + A ++N+I + GGY+ L V I E K+
Sbjct: 277 YNINTMEYSVISTIPNHIINYASAIVDNEIIIAGGYNFNNPSLNKVY---KINIENKIHV 333
Query: 682 SMNVM---RSRVALVANMGKLWAIGGYDGVSNLPTVEVYDPSTDSWAFVAPM 730
+ M R R +L ++AIGG +G + T+E Y D W + M
Sbjct: 334 ELPPMIKNRCRFSLAVIDDTIYAIGGQNGTNVERTIECYTMGDDKWKMLPDM 385
Score = 31.5 bits (71), Expect = 1.9
Identities = 33/176 (18%), Positives = 66/176 (37%), Gaps = 23/176 (13%)
Query: 478 TVEVFDPLVGRWQMAEAMSMLRSRVGVAVMKNRLYAFGG------YNGSERLSTVE---- 527
T+E + +W+M M + S G+ V+ +Y GG Y +++++
Sbjct: 368 TIECYTMGDDKWKMLPDMPIALSSYGMCVLDQYIYIIGGRTEHIDYTSVHHMNSIDMEED 427
Query: 528 --------EFDPVRRVWNKVSPMCFKRSAVGAAALNDKLY-VCGGYDGVSSLNTVECYEP 578
+D V +W + G + D +Y VC D + + Y
Sbjct: 428 THSSNKVIRYDTVNNIWETLPNFWTGTIRPGVVSHKDDIYVVCDIKDEKNVKTCIFRYNT 487
Query: 579 DK-DQWRIVKSMQKHRSAGGVIAFDSYVYALGGHDGLSIFDSVERYDPKTDEWTSV 633
+ + W ++ + + SA I D+ + L ++ + D+ Y T EW +
Sbjct: 488 NTYNGWELITTTESRLSALHTILHDNTIMMLHCYESYMLQDTFNVY---TYEWNHI 540
>gnl|CDD|165153 PHA02790, PHA02790, Kelch-like protein; Provisional.
Length = 480
Score = 63.1 bits (153), Expect = 3e-10
Identities = 57/219 (26%), Positives = 92/219 (42%), Gaps = 23/219 (10%)
Query: 511 LYAFGGYNGSERLSTVEEFDPVRRVWNKVSPMCFKRSAVGAAALNDKLYVCGGYDGVSSL 570
+Y GG+ +E + + + W + PM R N+KLYV GG +S
Sbjct: 274 VYLIGGWMNNEIHNNAIAVNYISNNWIPIPPMNSPRLYASGVPANNKLYVVGGLPNPTS- 332
Query: 571 NTVECYEPDKDQWRIVKSMQKHRSAGGVIAFDSYVYALGGHDGLSIFDSVERYD-PKTDE 629
VE + W + S+ K R V + ++ +Y +GGH S D+ Y P D+
Sbjct: 333 --VERWFHGDAAWVNMPSLLKPRCNPAVASINNVIYVIGGH---SETDTTTEYLLPNHDQ 387
Query: 630 W----TSVKPMLTKRCRLGVAALNNKIYVCGGYDGAIFLQSVEMYDPITDEWKMIASMNV 685
W ++ P K C L ++++ G ++ E Y ++ W +I
Sbjct: 388 WQFGPSTYYPHY-KSCAL---VFGRRLFLVG--------RNAEFYCESSNTWTLIDDPIY 435
Query: 686 MRSRVALVANMGKLWAIGGYDGVSNLPTVEVYDPSTDSW 724
R L+ KL IGG+ S + T+EVY+ T SW
Sbjct: 436 PRDNPELIIVDNKLLLIGGFYRGSYIDTIEVYNNRTYSW 474
Score = 60.1 bits (145), Expect = 3e-09
Identities = 88/467 (18%), Positives = 183/467 (39%), Gaps = 72/467 (15%)
Query: 222 AMEALINFAYSGRVTIHSQNVQSLMVVASFLQMQKVADACADFLKKRFHPNN-----VLG 276
++ +++ ++Y+G+V I S NV +L+ + ++ + C +F+ + F ++G
Sbjct: 73 SLTSIVIYSYTGKVYIDSHNVVNLLRASILTSVEFIIYTCINFILRDFRKEYCVECYMMG 132
Query: 277 IRQFADTLNCLQLSEAADKYVQQYFHEVS--MSDEFIGLGVNEVNDIVKRSELHLMSEEQ 334
I L C ++ ++F E+ + D F L + I++ EL++ E+
Sbjct: 133 IEYGLSNLLC-----HTKDFIAKHFLELEDDIIDNFDYLSMKL---ILESDELNVPDEDY 184
Query: 335 VFEAVMRWVKHNASERAPSLPRLLAAVRLPLLSPHYLADRVATEALIRSSHECRDLVDEA 394
V + V++W + L + +R LSP R + + ++D
Sbjct: 185 VVDFVIKWYMKRRNRLGNLLLLIKNVIRSNYLSP-------------RGINNVKWILDCT 231
Query: 395 RDFHL--MPERRFLLAGEKTTPRRCNYVMGHIFAVGGLTKAGDSLSTVEVFDPLVGRWQM 452
+ FH P + + + P + ++ IF + T G+ + L+G W
Sbjct: 232 KIFHCDKQPRKSYKYPFIEY-PMNMDQII-DIFHMCTSTHVGEVVY-------LIGGWMN 282
Query: 453 AEEETLSNAVISTKSCLTKAGDSLSTVEVFDPLVGRWQMAEAMSMLRSRVGVAVMKNRLY 512
E + N I+ + ++ W M+ R N+LY
Sbjct: 283 NE---IHNNAIAV-NYISN----------------NWIPIPPMNSPRLYASGVPANNKLY 322
Query: 513 AFGGYNGSERLSTVEEFDPVRRVWNKVSPMCFKRSAVGAAALNDKLYVCGGYDGVSSLNT 572
GG ++VE + W + + R A++N+ +YV GG+ + T
Sbjct: 323 VVGGLPNP---TSVERWFHGDAAWVNMPSLLKPRCNPAVASINNVIYVIGGHSETDT--T 377
Query: 573 VECYEPDKDQWRIVKSMQKHRSAGGVIAFDSYVYALGGHDGLSIFDSVERYDPKTDEWTS 632
E P+ DQW+ S + F ++ +G + E Y ++ WT
Sbjct: 378 TEYLLPNHDQWQFGPSTYYPHYKSCALVFGRRLFLVG--------RNAEFYCESSNTWTL 429
Query: 633 VKPMLTKRCRLGVAALNNKIYVCGGYDGAIFLQSVEMYDPITDEWKM 679
+ + R + ++NK+ + GG+ ++ ++E+Y+ T W +
Sbjct: 430 IDDPIYPRDNPELIIVDNKLLLIGGFYRGSYIDTIEVYNNRTYSWNI 476
Score = 36.6 bits (84), Expect = 0.056
Identities = 22/116 (18%), Positives = 58/116 (50%), Gaps = 6/116 (5%)
Query: 108 MAESKQREITMQGIDAVIVLAATI-----PYFQAMFTSDMAESKQREITM-QGIDAVAME 161
++ +K+ + ++ I I++ +TI PYF+ ++K + +D ++
Sbjct: 16 LSMTKKFKTIIEAIGGNIIVNSTILKKLSPYFRTHLRQKYTKNKDPVTRVCLDLDIHSLT 75
Query: 162 ALINFVYSGRVTIHSQNVQSLMVVASFLQMQKVADACADFLKKRFHPNNVLDYYVL 217
+++ + Y+G+V I S NV +L+ + ++ + C +F+ + F ++ Y++
Sbjct: 76 SIVIYSYTGKVYIDSHNVVNLLRASILTSVEFIIYTCINFILRDFRKEYCVECYMM 131
>gnl|CDD|177844 PLN02193, PLN02193, nitrile-specifier protein.
Length = 470
Score = 60.4 bits (146), Expect = 2e-09
Identities = 61/257 (23%), Positives = 100/257 (38%), Gaps = 59/257 (22%)
Query: 484 PLVGRWQMAEAMSM---LRSRVGVAVMKNRLYAFGGYNGSERLSTVEEFDPVRRVWNKVS 540
L+G+W E LR G+A + N++Y+FGG EF P +
Sbjct: 148 KLLGKWIKVEQKGEGPGLRCSHGIAQVGNKIYSFGG-----------EFTPNQ------- 189
Query: 541 PMCFKRSAVGAAALNDKLYVCGGYDGVSSLNTVECYEPDKDQWRIVKSMQK--HRSAGGV 598
++ LYV ++ + W I + H S GV
Sbjct: 190 ------------PIDKHLYV---------------FDLETRTWSISPATGDVPHLSCLGV 222
Query: 599 --IAFDSYVYALGGHDGLSIFDSVERYDPKTDEW---TSVKPMLTKRCRLGVAALNNKIY 653
++ S +Y GG D ++ +D T+EW T V+ T R +AA +Y
Sbjct: 223 RMVSIGSTLYVFGGRDASRQYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHSMAADEENVY 282
Query: 654 VCGGYDGAIFLQSVEMYDPITDEWKMIASMN---VMRSRVALVANMGKLWAIGGYDGVSN 710
V GG L++++ Y+ + +W ++ +R L GK+W + G++G
Sbjct: 283 VFGGVSATARLKTLDSYNIVDKKWFHCSTPGDSFSIRGGAGLEVVQGKVWVVYGFNGCE- 341
Query: 711 LPTVEVYDPSTDSWAFV 727
+ V YDP D W V
Sbjct: 342 VDDVHYYDPVQDKWTQV 358
Score = 58.0 bits (140), Expect = 9e-09
Identities = 50/190 (26%), Positives = 80/190 (42%), Gaps = 14/190 (7%)
Query: 481 VFDPLVGRWQMAEAMSMLR--SRVGVAVMK--NRLYAFGGYNGSERLSTVEEFDPVRRVW 536
VFD W ++ A + S +GV ++ + LY FGG + S + + FD W
Sbjct: 197 VFDLETRTWSISPATGDVPHLSCLGVRMVSIGSTLYVFGGRDASRQYNGFYSFDTTTNEW 256
Query: 537 NKVSPM---CFKRSAVGAAALNDKLYVCGGYDGVSSLNTVECYEPDKDQWRIVKSMQKH- 592
++P+ RS AA + +YV GG + L T++ Y +W +
Sbjct: 257 KLLTPVEEGPTPRSFHSMAADEENVYVFGGVSATARLKTLDSYNIVDKKWFHCSTPGDSF 316
Query: 593 --RSAGGVIAFDSYVYALGGHDGLSIFDSVERYDPKTDEWTSVKPMLTKRCRLGV---AA 647
R G+ V+ + G +G + D V YDP D+WT V+ + V AA
Sbjct: 317 SIRGGAGLEVVQGKVWVVYGFNGCEV-DDVHYYDPVQDKWTQVETFGVRPSERSVFASAA 375
Query: 648 LNNKIYVCGG 657
+ I + GG
Sbjct: 376 VGKHIVIFGG 385
Score = 51.9 bits (124), Expect = 7e-07
Identities = 39/151 (25%), Positives = 65/151 (43%), Gaps = 13/151 (8%)
Query: 482 FDPLVGRWQM---AEAMSMLRSRVGVAVMKNRLYAFGGYNGSERLSTVEEFDPVRRVWNK 538
FD W++ E RS +A + +Y FGG + + RL T++ ++ V + W
Sbjct: 249 FDTTTNEWKLLTPVEEGPTPRSFHSMAADEENVYVFGGVSATARLKTLDSYNIVDKKWFH 308
Query: 539 VSP---MCFKRSAVGAAALNDKLYVCGGYDGVSSLNTVECYEPDKDQWRIVKSMQKHRSA 595
S R G + K++V G++G ++ V Y+P +D+W V++ S
Sbjct: 309 CSTPGDSFSIRGGAGLEVVQGKVWVVYGFNGC-EVDDVHYYDPVQDKWTQVETFGVRPSE 367
Query: 596 GGVIAFDSYVYALGGHDGLSIFDSVERYDPK 626
V A A+G H + IF DP
Sbjct: 368 RSVFA----SAAVGKH--IVIFGGEIAMDPL 392
Score = 35.7 bits (82), Expect = 0.079
Identities = 38/164 (23%), Positives = 71/164 (43%), Gaps = 16/164 (9%)
Query: 593 RSAGGVIAFDSYVYALGGH--DGLSIFDSVERYDPKTDEWTSVKP----MLTKRCRLGV- 645
R + G+ + +Y+ GG I + +D +T W S+ P + C LGV
Sbjct: 166 RCSHGIAQVGNKIYSFGGEFTPNQPIDKHLYVFDLETRTW-SISPATGDVPHLSC-LGVR 223
Query: 646 -AALNNKIYVCGGYDGAIFLQSVEMYDPITDEWKMIASMN---VMRSRVALVANMGKLWA 701
++ + +YV GG D + +D T+EWK++ + RS ++ A+ ++
Sbjct: 224 MVSIGSTLYVFGGRDASRQYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHSMAADEENVYV 283
Query: 702 IGGYDGVSNLPTVEVYDPSTDSWAFVAP---MCAHEGGVGVGVI 742
GG + L T++ Y+ W + + GG G+ V+
Sbjct: 284 FGGVSATARLKTLDSYNIVDKKWFHCSTPGDSFSIRGGAGLEVV 327
>gnl|CDD|206134 pfam13964, Kelch_6, Kelch motif.
Length = 50
Score = 47.8 bits (114), Expect = 2e-07
Identities = 16/49 (32%), Positives = 24/49 (48%), Gaps = 1/49 (2%)
Query: 640 RCRLGVAALNNKIYVCGGY-DGAIFLQSVEMYDPITDEWKMIASMNVMR 687
R ++ KIYV GGY +G+ V +YDP T W+ + + R
Sbjct: 2 RTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPLPTPR 50
Score = 44.7 bits (106), Expect = 2e-06
Identities = 14/50 (28%), Positives = 23/50 (46%), Gaps = 1/50 (2%)
Query: 592 HRSAGGVIAFDSYVYALGG-HDGLSIFDSVERYDPKTDEWTSVKPMLTKR 640
R+ ++ +Y GG +G + V YDP+T W + P+ T R
Sbjct: 1 PRTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPLPTPR 50
Score = 38.6 bits (90), Expect = 4e-04
Identities = 14/47 (29%), Positives = 24/47 (51%), Gaps = 1/47 (2%)
Query: 546 RSAVGAAALNDKLYVCGGY-DGVSSLNTVECYEPDKDQWRIVKSMQK 591
R+ A ++ K+YV GGY +G + N V Y+P+ W + +
Sbjct: 2 RTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPLPT 48
Score = 38.2 bits (89), Expect = 5e-04
Identities = 15/45 (33%), Positives = 24/45 (53%), Gaps = 1/45 (2%)
Query: 499 RSRVGVAVMKNRLYAFGGY-NGSERLSTVEEFDPVRRVWNKVSPM 542
R+ + ++Y FGGY NGS+ + V +DP W K+ P+
Sbjct: 2 RTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPL 46
Score = 37.8 bits (88), Expect = 7e-04
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 686 MRSRVALVANMGKLWAIGGYDGVSN-LPTVEVYDPSTDSWAFVAPM 730
R+ + V+ GK++ GGY S V VYDP T SW + P+
Sbjct: 1 PRTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPL 46
Score = 35.5 bits (82), Expect = 0.004
Identities = 10/31 (32%), Positives = 16/31 (51%)
Query: 422 GHIFAVGGLTKAGDSLSTVEVFDPLVGRWQM 452
G I+ GG + + + V V+DP G W+
Sbjct: 12 GKIYVFGGYSNGSKASNKVLVYDPETGSWEK 42
>gnl|CDD|237617 PRK14131, PRK14131, N-acetylneuraminic acid mutarotase;
Provisional.
Length = 376
Score = 52.3 bits (126), Expect = 5e-07
Identities = 41/136 (30%), Positives = 58/136 (42%), Gaps = 21/136 (15%)
Query: 541 PMCFKRSAVGAAAL-NDKLYVCGGYDGVS--SLNTVECYEPDKDQWRIVKSMQKHRSAGG 597
P+ FK G A+ N+ +YV G G S L+ P K W + +
Sbjct: 26 PVPFKN---GTGAIDNNTVYVGLGSAGTSWYKLDL---NAPSK-GWTKIAAFPGGPREQA 78
Query: 598 VIAF-DSYVYALGG-----HDG-LSIFDSVERYDPKTDEWTSVKPMLTKRCRLG---VAA 647
V AF D +Y GG +G +FD V +YDPKT+ W + + G V+
Sbjct: 79 VAAFIDGKLYVFGGIGKTNSEGSPQVFDDVYKYDPKTNSWQKL-DTRSPVGLAGHVAVSL 137
Query: 648 LNNKIYVCGGYDGAIF 663
N K Y+ GG + IF
Sbjct: 138 HNGKAYITGGVNKNIF 153
Score = 37.3 bits (87), Expect = 0.026
Identities = 21/78 (26%), Positives = 30/78 (38%), Gaps = 16/78 (20%)
Query: 499 RSRVGVAVMKNRLYAFGGYNGSER------LSTVEEFDPVRRVWNKV---SPMCFKRSAV 549
R + A + +LY FGG + V ++DP W K+ SP
Sbjct: 75 REQAVAAFIDGKLYVFGGIGKTNSEGSPQVFDDVYKYDPKTNSWQKLDTRSP----VGLA 130
Query: 550 GAAAL---NDKLYVCGGY 564
G A+ N K Y+ GG
Sbjct: 131 GHVAVSLHNGKAYITGGV 148
Score = 33.8 bits (78), Expect = 0.27
Identities = 27/90 (30%), Positives = 39/90 (43%), Gaps = 12/90 (13%)
Query: 644 GVAAL-NNKIYVCGGYDGAIFLQSVEMYDPITDEWKMIASM-NVMRSRVALVANMGKLWA 701
G A+ NN +YV G G + +++ P + W IA+ R + GKL+
Sbjct: 32 GTGAIDNNTVYVGLGSAGTSWY-KLDLNAP-SKGWTKIAAFPGGPREQAVAAFIDGKLYV 89
Query: 702 IGGYDGVSN-------LPTVEVYDPSTDSW 724
GG G +N V YDP T+SW
Sbjct: 90 FGGI-GKTNSEGSPQVFDDVYKYDPKTNSW 118
>gnl|CDD|177814 PLN02153, PLN02153, epithiospecifier protein.
Length = 341
Score = 48.8 bits (116), Expect = 5e-06
Identities = 53/238 (22%), Positives = 97/238 (40%), Gaps = 38/238 (15%)
Query: 499 RSRVGVAVMKNRLYAFGG-YNGSERLST-VEEFDPVRRVWNKVSP-------MCFKRSAV 549
R G+AV+ ++LY+FGG +E + + FD W+ C V
Sbjct: 23 RCSHGIAVVGDKLYSFGGELKPNEHIDKDLYVFDFNTHTWSIAPANGDVPRISCL---GV 79
Query: 550 GAAALNDKLYVCGGYDGVSSLNTVECYEPDKDQWRIVKSMQKH-----RSAGGVIAFDSY 604
A+ KLY+ GG D + Y+ K++W + + + R+ + + +++
Sbjct: 80 RMVAVGTKLYIFGGRDEKREFSDFYSYDTVKNEWTFLTKLDEEGGPEARTFHSMASDENH 139
Query: 605 VYALGG--HDGL----SIFDSVERYDPKTDEWTSVK---PMLTKRCRLGVAALNNKIYVC 655
VY GG GL F ++E Y+ +W + KR G A + KI+V
Sbjct: 140 VYVFGGVSKGGLMKTPERFRTIEAYNIADGKWVQLPDPGENFEKRGGAGFAVVQGKIWVV 199
Query: 656 GGYDGAIF--------LQSVEMYDPITDEWKMIASMNVMRSRVALVANMGKLWAIGGY 705
G+ +I +V+ +DP + +W + + S ++ A+ +G Y
Sbjct: 200 YGFATSILPGGKSDYESNAVQFFDPASGKWTEVETTGAKPSARSVFAHA----VVGKY 253
Score = 43.1 bits (101), Expect = 3e-04
Identities = 32/151 (21%), Positives = 59/151 (39%), Gaps = 19/151 (12%)
Query: 593 RSAGGVIAFDSYVYALGGH--DGLSIFDSVERYDPKTDEWTSVKP---MLTKRCRLGV-- 645
R + G+ +Y+ GG I + +D T W+ + C LGV
Sbjct: 23 RCSHGIAVVGDKLYSFGGELKPNEHIDKDLYVFDFNTHTWSIAPANGDVPRISC-LGVRM 81
Query: 646 AALNNKIYVCGGYDGAIFLQSVEMYDPITDEWKMIASMNVM-----RSRVALVANMGKLW 700
A+ K+Y+ GG D YD + +EW + ++ R+ ++ ++ ++
Sbjct: 82 VAVGTKLYIFGGRDEKREFSDFYSYDTVKNEWTFLTKLDEEGGPEARTFHSMASDENHVY 141
Query: 701 AIGGYD--GVSNLP----TVEVYDPSTDSWA 725
GG G+ P T+E Y+ + W
Sbjct: 142 VFGGVSKGGLMKTPERFRTIEAYNIADGKWV 172
Score = 38.8 bits (90), Expect = 0.009
Identities = 50/210 (23%), Positives = 84/210 (40%), Gaps = 37/210 (17%)
Query: 481 VFDPLVGRWQMAEA------MSMLRSRVGVAVMKNRLYAFGGYNGSERLSTVEEFDPVRR 534
VFD W +A A +S L R+ VAV +LY FGG + S +D V+
Sbjct: 54 VFDFNTHTWSIAPANGDVPRISCLGVRM-VAV-GTKLYIFGGRDEKREFSDFYSYDTVKN 111
Query: 535 VWNKVSPMCFK-----RSAVGAAALNDKLYVCGGYDGVSSLN------TVECYEPDKDQW 583
W ++ + + R+ A+ + +YV GG + T+E Y +W
Sbjct: 112 EWTFLTKLDEEGGPEARTFHSMASDENHVYVFGGVSKGGLMKTPERFRTIEAYNIADGKW 171
Query: 584 RIV----KSMQKHRSAGGVIA---------FDSYVYALGGHDGLSIFDSVERYDPKTDEW 630
+ ++ +K AG + F + + G D S ++V+ +DP + +W
Sbjct: 172 VQLPDPGENFEKRGGAGFAVVQGKIWVVYGFATSILPGGKSDYES--NAVQFFDPASGKW 229
Query: 631 TSVKPMLTK---RCRLGVAALNNKIYVCGG 657
T V+ K R A + I + GG
Sbjct: 230 TEVETTGAKPSARSVFAHAVVGKYIIIFGG 259
Score = 34.6 bits (79), Expect = 0.19
Identities = 29/104 (27%), Positives = 44/104 (42%), Gaps = 9/104 (8%)
Query: 640 RCRLGVAALNNKIYVCGG--YDGAIFLQSVEMYDPITDEWKMIASMNVMRSRVA-----L 692
RC G+A + +K+Y GG + + ++D T W IA N R++ +
Sbjct: 23 RCSHGIAVVGDKLYSFGGELKPNEHIDKDLYVFDFNTHTWS-IAPANGDVPRISCLGVRM 81
Query: 693 VANMGKLWAIGGYDGVSNLPTVEVYDPSTDSWAFVAPMCAHEGG 736
VA KL+ GG D YD + W F+ + EGG
Sbjct: 82 VAVGTKLYIFGGRDEKREFSDFYSYDTVKNEWTFLTKL-DEEGG 124
>gnl|CDD|116261 pfam07646, Kelch_2, Kelch motif. The kelch motif was initially
discovered in Kelch. In this protein there are six
copies of the motif. It has been shown that Drosophila
ring canal kelch protein is related to Galactose Oxidase
for which a structure has been solved. The kelch motif
forms a beta sheet. Several of these sheets associate to
form a beta propeller structure as found in pfam00064,
pfam00400 and pfam00415.
Length = 48
Score = 41.6 bits (98), Expect = 3e-05
Identities = 12/47 (25%), Positives = 18/47 (38%), Gaps = 2/47 (4%)
Query: 499 RSRVGVAVMKNRLYAFGGY--NGSERLSTVEEFDPVRRVWNKVSPMC 543
R V+ +LY GG G S + DP VW ++ +
Sbjct: 2 RYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTELPALP 48
Score = 40.8 bits (96), Expect = 6e-05
Identities = 11/46 (23%), Positives = 21/46 (45%), Gaps = 2/46 (4%)
Query: 546 RSAVGAAALNDKLYVCGGY--DGVSSLNTVECYEPDKDQWRIVKSM 589
R + + KLYV GG G S + + +P+ + W + ++
Sbjct: 2 RYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTELPAL 47
Score = 33.9 bits (78), Expect = 0.018
Identities = 11/41 (26%), Positives = 17/41 (41%), Gaps = 2/41 (4%)
Query: 640 RCRLGVAALNNKIYVCGGY--DGAIFLQSVEMYDPITDEWK 678
R + K+YV GG G + + + DP T+ W
Sbjct: 2 RYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWT 42
Score = 32.7 bits (75), Expect = 0.046
Identities = 10/46 (21%), Positives = 18/46 (39%), Gaps = 2/46 (4%)
Query: 593 RSAGGVIAFDSYVYALGG--HDGLSIFDSVERYDPKTDEWTSVKPM 636
R + +Y +GG G + DP+T+ WT + +
Sbjct: 2 RYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTELPAL 47
Score = 32.0 bits (73), Expect = 0.087
Identities = 14/47 (29%), Positives = 21/47 (44%), Gaps = 2/47 (4%)
Query: 687 RSRVALVANMGKLWAIGGY--DGVSNLPTVEVYDPSTDSWAFVAPMC 731
R A V GKL+ +GG G + + V DP T+ W + +
Sbjct: 2 RYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTELPALP 48
>gnl|CDD|211835 TIGR03548, mutarot_permut, cyclically-permuted mutarotase family
protein. Members of this protein family show
essentially full-length homology, cyclically permuted,
to YjhT from Escherichia coli. YjhT was shown to act as
a mutarotase for sialic acid, and by this ability to be
able to act as a virulence factor. Members of the YjhT
family (TIGR03547) and this cyclically-permuted family
have multiple repeats of the beta-propeller-forming
Kelch repeat.
Length = 323
Score = 44.8 bits (106), Expect = 1e-04
Identities = 39/170 (22%), Positives = 65/170 (38%), Gaps = 12/170 (7%)
Query: 509 NRLYAFGGYNGSERLSTVEEF---DPVRRVWNK-VSPMCFKRSAVGAAALNDKLYVCGGY 564
N +Y GG N SE S+V + + + + + F A + KLYV GG
Sbjct: 73 NGIYYIGGSNSSESFSSVYRITLDESKEALIIETLPSLPFAMDNGSATYKDGKLYVGGGN 132
Query: 565 DGVSSLNTVECYEPDKDQWRIVKSMQ-KHRSAGGVIAFDSYVYALGGHDGLSIFDSVERY 623
N C+ + +W + R + + +Y GG D + +Y
Sbjct: 133 ANGKPSNKFYCFNLETQEWEELPDFPGAPRVQPVCVKLQNELYVFGGGDN-IAYTDGYKY 191
Query: 624 DPKTDEWTSVKPMLTK---RCRLGVAAL---NNKIYVCGGYDGAIFLQSV 667
PKT W V L+ LG A++ + + GG++ +F +V
Sbjct: 192 SPKTGTWEKVADPLSDGEPISLLGAASIAINESLLLCIGGFNYDVFNDAV 241
>gnl|CDD|234253 TIGR03547, muta_rot_YjhT, mutatrotase, YjhT family. Members of
this protein family contain multiple copies of the
beta-propeller-forming Kelch repeat. All are full-length
homologs to YjhT of Escherichia coli, which has been
identified as a mutarotase for sialic acid. This protein
improves bacterial ability to obtain host sialic acid,
and thus serves as a virulence factor. Some bacteria
carry what appears to be a cyclically permuted homolog
of this protein.
Length = 346
Score = 43.9 bits (104), Expect = 2e-04
Identities = 39/134 (29%), Positives = 56/134 (41%), Gaps = 17/134 (12%)
Query: 541 PMCFKRSAVGAAAL-NDKLYVCGGYDGVSSLNTVECYEPDKDQWRIVKSMQKHRSAGGVI 599
P+ FK G A+ DK+YV G G S ++ +P K W+ + V
Sbjct: 5 PVGFKN---GTGAIIGDKVYVGLGSAGTS-WYKLDLKKPSK-GWQKIADFPGGPRNQAVA 59
Query: 600 AF-DSYVYALGG-----HDG-LSIFDSVERYDPKTDEWTSVKPMLTKRCRLGVAAL---N 649
A D +Y GG +G +FD V RYDPK + W + + LG + N
Sbjct: 60 AAIDGKLYVFGGIGKANSEGSPQVFDDVYRYDPKKNSWQKL-DTRSPVGLLGASGFSLHN 118
Query: 650 NKIYVCGGYDGAIF 663
+ Y GG + IF
Sbjct: 119 GQAYFTGGVNKNIF 132
Score = 35.8 bits (83), Expect = 0.066
Identities = 48/206 (23%), Positives = 76/206 (36%), Gaps = 38/206 (18%)
Query: 469 LTKAGDSLSTVEVFDPLVGRWQ-MAEAMSMLRSRVGVAVMKNRLYAFGGY--NGSER--- 522
L AG S +++ P G WQ +A+ R++ A + +LY FGG SE
Sbjct: 24 LGSAGTSWYKLDLKKPSKG-WQKIADFPGGPRNQAVAAAIDGKLYVFGGIGKANSEGSPQ 82
Query: 523 -LSTVEEFDPVRRVWNKV---SPMCFKRSAVGAAAL---NDKLYVCGGYDGVSSLNTVEC 575
V +DP + W K+ SP +GA+ N + Y GG + N +
Sbjct: 83 VFDDVYRYDPKKNSWQKLDTRSP----VGLLGASGFSLHNGQAYFTGGVN----KNIFDG 134
Query: 576 YEPDKDQWRIVKSMQKHRSAGGVIAFDSYVYALGGHDGLSIF--DSVERYDPKTDEWTSV 633
Y D + + D + A F +V YDP T++W ++
Sbjct: 135 YFAD-----LSA------ADKDSEPKDKLIAAYFSQPPEDYFWNKNVLSYDPSTNQWRNL 183
Query: 634 --KPMLTKRCRLGVAALNNKIYVCGG 657
P L + NK+ + G
Sbjct: 184 GENPFLG-TAGSAIVHKGNKLLLING 208
Score = 33.8 bits (78), Expect = 0.29
Identities = 25/89 (28%), Positives = 38/89 (42%), Gaps = 10/89 (11%)
Query: 644 GVAAL-NNKIYVCGGYDGAIFLQSVEMYDPITDEWKMIASM-NVMRSRVALVANMGKLWA 701
G A+ +K+YV G G + + +++ P W+ IA R++ A GKL+
Sbjct: 11 GTGAIIGDKVYVGLGSAGTSWYK-LDLKKPSKG-WQKIADFPGGPRNQAVAAAIDGKLYV 68
Query: 702 IGGY--DGVSNLPTVE----VYDPSTDSW 724
GG P V YDP +SW
Sbjct: 69 FGGIGKANSEGSPQVFDDVYRYDPKKNSW 97
Score = 32.7 bits (75), Expect = 0.66
Identities = 20/73 (27%), Positives = 32/73 (43%), Gaps = 16/73 (21%)
Query: 646 AALNNKIYVCGGYDGA------IFLQSVEMYDPITDEWKMIASMNVMRSRVALVA----- 694
AA++ K+YV GG A V YDP + W+ + + RS V L+
Sbjct: 60 AAIDGKLYVFGGIGKANSEGSPQVFDDVYRYDPKKNSWQKLDT----RSPVGLLGASGFS 115
Query: 695 -NMGKLWAIGGYD 706
+ G+ + GG +
Sbjct: 116 LHNGQAYFTGGVN 128
>gnl|CDD|222113 pfam13415, Kelch_3, Galactose oxidase, central domain.
Length = 48
Score = 38.4 bits (90), Expect = 3e-04
Identities = 11/36 (30%), Positives = 17/36 (47%), Gaps = 1/36 (2%)
Query: 649 NNKIYVCGGYDG-AIFLQSVEMYDPITDEWKMIASM 683
KIYV GG L + +YD T+ W+ + +
Sbjct: 1 GGKIYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDL 36
Score = 37.3 bits (87), Expect = 0.001
Identities = 11/48 (22%), Positives = 20/48 (41%), Gaps = 1/48 (2%)
Query: 508 KNRLYAFGGYNGS-ERLSTVEEFDPVRRVWNKVSPMCFKRSAVGAAAL 554
++Y FGG RL+ + +D W K+ + R+ A +
Sbjct: 1 GGKIYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDLPGPRAGHAATVI 48
Score = 31.1 bits (71), Expect = 0.15
Identities = 10/36 (27%), Positives = 16/36 (44%), Gaps = 1/36 (2%)
Query: 555 NDKLYVCGGYDGVS-SLNTVECYEPDKDQWRIVKSM 589
K+YV GG LN + Y+ D + W + +
Sbjct: 1 GGKIYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDL 36
Score = 30.3 bits (69), Expect = 0.29
Identities = 8/41 (19%), Positives = 14/41 (34%), Gaps = 1/41 (2%)
Query: 602 DSYVYALGGHDGLSIF-DSVERYDPKTDEWTSVKPMLTKRC 641
+Y GG + + YD T+ W + + R
Sbjct: 1 GGKIYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDLPGPRA 41
Score = 29.9 bits (68), Expect = 0.38
Identities = 10/35 (28%), Positives = 17/35 (48%), Gaps = 1/35 (2%)
Query: 697 GKLWAIGGYDGVSN-LPTVEVYDPSTDSWAFVAPM 730
GK++ GG L + VYD T++W + +
Sbjct: 2 GKIYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDL 36
>gnl|CDD|225597 COG3055, COG3055, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 381
Score = 42.7 bits (101), Expect = 5e-04
Identities = 33/128 (25%), Positives = 46/128 (35%), Gaps = 21/128 (16%)
Query: 550 GAAAL-NDKLYVCGGYDGVS--SLNTVECYEPDKDQWRIVKSMQKHRSAGGVIAFDSYVY 606
GA AL D +YV G G + L+ +P K +I R+ +Y
Sbjct: 40 GAGALIGDTVYVGLGSAGTAFYVLDL---KKPGKGWTKIADFPGGARNQAVAAVIGGKLY 96
Query: 607 ALGGH-----DGLSIFDSVERYDPKTDEWT---SVKPML---TKRCRLGVAALNNKIYVC 655
GG+ +F+ RYDP T+ W + P L KIY
Sbjct: 97 VFGGYGKSVSSSPQVFNDAYRYDPSTNSWHKLDTRSPTGLVGASTFSLN----GTKIYFF 152
Query: 656 GGYDGAIF 663
GG + IF
Sbjct: 153 GGVNQNIF 160
Score = 36.2 bits (84), Expect = 0.057
Identities = 30/88 (34%), Positives = 40/88 (45%), Gaps = 9/88 (10%)
Query: 644 GVAAL-NNKIYVCGGYDGAIFLQSVEMYDPITDEWKMIASM-NVMRSRVALVANMGKLWA 701
G AL + +YV G G F +++ P W IA R++ GKL+
Sbjct: 40 GAGALIGDTVYVGLGSAGTAFY-VLDLKKP-GKGWTKIADFPGGARNQAVAAVIGGKLYV 97
Query: 702 IGGY-DGVSNLPTVE----VYDPSTDSW 724
GGY VS+ P V YDPST+SW
Sbjct: 98 FGGYGKSVSSSPQVFNDAYRYDPSTNSW 125
Score = 35.0 bits (81), Expect = 0.12
Identities = 20/69 (28%), Positives = 29/69 (42%), Gaps = 9/69 (13%)
Query: 505 AVMKNRLYAFGGYNGSER-----LSTVEEFDPVRRVWNKV---SPMCFKRSAVGAAALND 556
AV+ +LY FGGY S + +DP W+K+ SP A +
Sbjct: 89 AVIGGKLYVFGGYGKSVSSSPQVFNDAYRYDPSTNSWHKLDTRSPTGLV-GASTFSLNGT 147
Query: 557 KLYVCGGYD 565
K+Y GG +
Sbjct: 148 KIYFFGGVN 156
Score = 34.7 bits (80), Expect = 0.17
Identities = 16/68 (23%), Positives = 26/68 (38%), Gaps = 7/68 (10%)
Query: 646 AALNNKIYVCGGYDGA-----IFLQSVEMYDPITDEWKMIASMNVMRS--RVALVANMGK 698
A + K+YV GGY + YDP T+ W + + + N K
Sbjct: 89 AVIGGKLYVFGGYGKSVSSSPQVFNDAYRYDPSTNSWHKLDTRSPTGLVGASTFSLNGTK 148
Query: 699 LWAIGGYD 706
++ GG +
Sbjct: 149 IYFFGGVN 156
>gnl|CDD|205596 pfam13418, Kelch_4, Galactose oxidase, central domain.
Length = 49
Score = 37.6 bits (88), Expect = 8e-04
Identities = 13/40 (32%), Positives = 18/40 (45%), Gaps = 1/40 (2%)
Query: 504 VAVMKNRLYAFGGYNGS-ERLSTVEEFDPVRRVWNKVSPM 542
++ RLY FGG N LS V FD W ++ +
Sbjct: 8 TSIGDGRLYLFGGENEDGSVLSDVWVFDLSTNTWTRLPSL 47
Score = 30.3 bits (69), Expect = 0.26
Identities = 10/36 (27%), Positives = 17/36 (47%), Gaps = 1/36 (2%)
Query: 602 DSYVYALGGHDGL-SIFDSVERYDPKTDEWTSVKPM 636
D +Y GG + S+ V +D T+ WT + +
Sbjct: 12 DGRLYLFGGENEDGSVLSDVWVFDLSTNTWTRLPSL 47
Score = 29.5 bits (67), Expect = 0.54
Identities = 9/36 (25%), Positives = 19/36 (52%), Gaps = 1/36 (2%)
Query: 649 NNKIYVCGGYDG-AIFLQSVEMYDPITDEWKMIASM 683
+ ++Y+ GG + L V ++D T+ W + S+
Sbjct: 12 DGRLYLFGGENEDGSVLSDVWVFDLSTNTWTRLPSL 47
Score = 28.8 bits (65), Expect = 0.97
Identities = 12/35 (34%), Positives = 20/35 (57%), Gaps = 1/35 (2%)
Query: 697 GKLWAIGGYDG-VSNLPTVEVYDPSTDSWAFVAPM 730
G+L+ GG + S L V V+D ST++W + +
Sbjct: 13 GRLYLFGGENEDGSVLSDVWVFDLSTNTWTRLPSL 47
>gnl|CDD|224531 COG1615, COG1615, Uncharacterized conserved protein [Function
unknown].
Length = 885
Score = 33.2 bits (76), Expect = 0.69
Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 1/51 (1%)
Query: 496 SMLRSRVGVAVMKNRLYAF-GGYNGSERLSTVEEFDPVRRVWNKVSPMCFK 545
S+ GV ++N + A Y+G+ + DPV W K+ P FK
Sbjct: 573 SLPTDANGVNYIRNSVKALVDAYDGTVTFYQSDLNDPVINAWMKIFPGTFK 623
>gnl|CDD|132110 TIGR03066, Gem_osc_para_1, Gemmata obscuriglobus paralogous family
TIGR03066. This model represents an uncharacterized
paralogous family in Gemmata obscuriglobus UQM 2246, a
member of the Planctomycetes. This family shows sequence
similarity to TIGR03067, which is also found in Gemmata
obscuriglobus as well as in a few other species
[Hypothetical proteins, Conserved].
Length = 111
Score = 30.4 bits (69), Expect = 1.1
Identities = 10/36 (27%), Positives = 17/36 (47%), Gaps = 5/36 (13%)
Query: 425 FAVGGLTKAGDSLSTVEVFDP--LVGRWQMAEEETL 458
A+ G +A D E D LVG+W+ ++ +
Sbjct: 3 LALAGGGRADDK---DEKIDAKKLVGKWETSKTKDD 35
>gnl|CDD|221245 pfam11822, DUF3342, Domain of unknown function (DUF3342). This
family of proteins are functionally uncharacterized.
This family is found in bacteria. This presumed domain
is typically between 170 to 303 amino acids in length.
The N-terminal half of this family is a BTB-like domain.
Length = 302
Score = 31.6 bits (72), Expect = 1.4
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 178 NVQSLMVVASFLQMQKVADACADFLKKR 205
NV S+++ +SFLQM+++ D C +
Sbjct: 79 NVVSILLSSSFLQMEQLVDECLMYCHAH 106
Score = 31.6 bits (72), Expect = 1.4
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 241 NVQSLMVVASFLQMQKVADACADFLKKR 268
NV S+++ +SFLQM+++ D C +
Sbjct: 79 NVVSILLSSSFLQMEQLVDECLMYCHAH 106
>gnl|CDD|235047 PRK02509, PRK02509, hypothetical protein; Provisional.
Length = 973
Score = 31.9 bits (73), Expect = 1.6
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 517 YNGSERLSTVEEFDPVRRVWNKVSPMCFK 545
YNG + + DP+ + W+K+ P FK
Sbjct: 724 YNGDVQFYVADPNDPLIQTWSKIFPQLFK 752
>gnl|CDD|148056 pfam06223, Phage_tail_T, Minor tail protein T. Minor tail protein
T is located at the distal end and is involved in the
assembly of the initiator complex for tail
polymerisation.
Length = 103
Score = 28.7 bits (64), Expect = 3.9
Identities = 11/36 (30%), Positives = 18/36 (50%)
Query: 278 RQFADTLNCLQLSEAADKYVQQYFHEVSMSDEFIGL 313
R+ ++ +L E AD +V F +V + EF L
Sbjct: 12 RRMLAEMSATELGEWADHFVTHSFSDVLLDAEFATL 47
>gnl|CDD|235525 PRK05595, PRK05595, replicative DNA helicase; Provisional.
Length = 444
Score = 30.2 bits (68), Expect = 4.0
Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 8/54 (14%)
Query: 111 SKQREITMQ-GIDAVIVLAATIPYFQAMFTSDMAESKQREIT--MQGIDAVAME 161
SK R + ++ GID ++ I Y Q M +ES+Q+E++ + I A+A E
Sbjct: 301 SKCRRLKIEHGIDMIL-----IDYLQLMSGGKGSESRQQEVSEISRSIKALAKE 349
>gnl|CDD|132488 TIGR03447, mycothiol_MshC, cysteine--1-D-myo-inosityl
2-amino-2-deoxy-alpha-D-glucopyranoside ligase. Members
of this protein family are MshC,
l-cysteine:1-D-myo-inosityl
2-amino-2-deoxy-alpha-D-glucopyranoside ligase, an
enzyme that uses ATP to ligate a Cys residue to a
mycothiol precursor molecule, in the second to last step
in mycothiol biosynthesis. This enzyme shows
considerable homology to Cys--tRNA ligases, and many
instances are misannotated as such. Mycothiol is found
in Mycobacterium tuberculosis, Corynebacterium
glutamicum, Streptomyces coelicolor, and various other
members of the Actinobacteria. Mycothiol is an analog to
glutathione [Biosynthesis of cofactors, prosthetic
groups, and carriers, Glutathione and analogs].
Length = 411
Score = 30.1 bits (68), Expect = 4.6
Identities = 20/65 (30%), Positives = 29/65 (44%), Gaps = 11/65 (16%)
Query: 359 AAVRLPLLSPHYLADRVATEALIRSSHECRDL---------VDEARDFHLMPERRFLLAG 409
AA+RL LL+ HY DR T+A++ + +A D L+ R LA
Sbjct: 310 AAIRLGLLAGHYRQDRDWTDAVLAEAEARLARWRAALALPDAPDATD--LIARLRQHLAN 367
Query: 410 EKTTP 414
+ TP
Sbjct: 368 DLDTP 372
>gnl|CDD|217678 pfam03699, UPF0182, Uncharacterized protein family (UPF0182). This
family contains uncharacterized integral membrane
proteins.
Length = 771
Score = 30.2 bits (69), Expect = 5.5
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 517 YNGSERLSTVEEFDPVRRVWNKVSPMCFK 545
Y+G+ V+ DP+ + W K+ P FK
Sbjct: 603 YDGTVDFYVVDPEDPILKTWQKIFPGLFK 631
>gnl|CDD|200475 cd11336, AmyAc_MTSase, Alpha amylase catalytic domain found in
maltooligosyl trehalose synthase (MTSase).
Maltooligosyl trehalose synthase (MTSase) domain. MTSase
and maltooligosyl trehalose trehalohydrolase (MTHase)
work together to produce trehalose. MTSase is
responsible for converting the alpha-1,4-glucosidic
linkage to an alpha,alpha-1,1-glucosidic linkage at the
reducing end of the maltooligosaccharide through an
intramolecular transglucosylation reaction, while MTHase
hydrolyzes the penultimate alpha-1,4 linkage of the
reducing end, resulting in the release of trehalose. The
Alpha-amylase family comprises the largest family of
glycoside hydrolases (GH), with the majority of enzymes
acting on starch, glycogen, and related oligo- and
polysaccharides. These proteins catalyze the
transformation of alpha-1,4 and alpha-1,6 glucosidic
linkages with retention of the anomeric center. The
protein is described as having 3 domains: A, B, C. A is
a (beta/alpha) 8-barrel; B is a loop between the beta 3
strand and alpha 3 helix of A; C is the C-terminal
extension characterized by a Greek key. The majority of
the enzymes have an active site cleft found between
domains A and B where a triad of catalytic residues
(Asp, Glu and Asp) performs catalysis. Other members of
this family have lost the catalytic activity as in the
case of the human 4F2hc, or only have 2 residues that
serve as the catalytic nucleophile and the acid/base,
such as Thermus A4 beta-galactosidase with 2 Glu
residues (GH42) and human alpha-galactosidase with 2 Asp
residues (GH31). The family members are quite extensive
and include: alpha amylase, maltosyltransferase,
cyclodextrin glycotransferase, maltogenic amylase,
neopullulanase, isoamylase, 1,4-alpha-D-glucan
maltotetrahydrolase, 4-alpha-glucotransferase,
oligo-1,6-glucosidase, amylosucrase, sucrose
phosphorylase, and amylomaltase.
Length = 660
Score = 29.8 bits (68), Expect = 6.2
Identities = 31/117 (26%), Positives = 41/117 (35%), Gaps = 35/117 (29%)
Query: 323 KRSE-----LHLMSE--EQVFEAVMRWVKHNASERAPSLPR----------LLAAVRLPL 365
KR E L ++SE E+ EAV RW + NA R P L+ A L
Sbjct: 449 KRGEDVRARLAVLSELPEEWAEAVRRWRRLNAPLRTGPAPDPNDEYLLYQTLVGAWPLDG 508
Query: 366 LSPHYLADRVAT-------EALIRSSH---------ECRDLVDEARDFHLMPERRFL 406
+ A+R+A EA + +S VD P R FL
Sbjct: 509 DALADFAERLAAYMLKALREAKLHTSWTDPDEAYEEAVAAFVDAL--LDPPPSRAFL 563
>gnl|CDD|133321 cd04121, Rab40, Rab GTPase family 40 (Rab40) contains Rab40a,
Rab40b and Rab40c. The Rab40 subfamily contains Rab40a,
Rab40b, and Rab40c, which are all highly homologous. In
rat, Rab40c is localized to the perinuclear recycling
compartment (PRC), and is distributed in a
tissue-specific manor, with high expression in brain,
heart, kidney, and testis, low expression in lung and
liver, and no expression in spleen and skeletal muscle.
Rab40c is highly expressed in differentiated
oligodendrocytes but minimally expressed in
oligodendrocyte progenitors, suggesting a role in the
vesicular transport of myelin components. Unlike most
other Ras-superfamily proteins, Rab40c was shown to have
a much lower affinity for GTP, and an affinity for GDP
that is lower than for GTP. GTPase activating proteins
(GAPs) interact with GTP-bound Rab and accelerate the
hydrolysis of GTP to GDP. Guanine nucleotide exchange
factors (GEFs) interact with GDP-bound Rabs to promote
the formation of the GTP-bound state. Rabs are further
regulated by guanine nucleotide dissociation inhibitors
(GDIs), which facilitate Rab recycling by masking
C-terminal lipid binding and promoting cytosolic
localization. Most Rab GTPases contain a lipid
modification site at the C-terminus, with sequence
motifs CC, CXC, or CCX. Lipid binding is essential for
membrane attachment, a key feature of most Rab proteins.
Length = 189
Score = 28.7 bits (64), Expect = 7.1
Identities = 12/30 (40%), Positives = 18/30 (60%), Gaps = 1/30 (3%)
Query: 336 FEAVMRWVKHNASERAPSLPRLLAAVRLPL 365
F+ + RW+K E AP +P++L RL L
Sbjct: 94 FDGIDRWIKE-IDEHAPGVPKILVGNRLHL 122
>gnl|CDD|183518 PRK12418, PRK12418, cysteinyl-tRNA synthetase; Provisional.
Length = 384
Score = 29.5 bits (67), Expect = 7.7
Identities = 13/28 (46%), Positives = 19/28 (67%)
Query: 359 AAVRLPLLSPHYLADRVATEALIRSSHE 386
AA+RL LL+ HY ADR T+A++ +
Sbjct: 283 AAIRLALLAGHYRADREWTDAVLAEAEA 310
>gnl|CDD|129733 TIGR00647, MG103, conserved hypothetical protein. [Hypothetical
proteins, Conserved].
Length = 279
Score = 29.1 bits (65), Expect = 8.4
Identities = 29/148 (19%), Positives = 46/148 (31%), Gaps = 23/148 (15%)
Query: 185 VASFLQM--------QKVADACADFLKKRFHPNNVLDY-------YVLFSCRAMEALINF 229
+A Q + LKK+F + + V FS A ++
Sbjct: 25 LAGLFQNNLKLLFTRHWSFKVQTELLKKQFDVSLEFLFRKKMRLKIVRFSFNADLKILGE 84
Query: 230 AYSGRVTIHSQNVQSLMVVASFLQMQKVADACADFLKKRFH----PNNVLGIRQFADTLN 285
TI V+ ++ +FL V+D FH NN I Q N
Sbjct: 85 LLVFDRTISKLLVKKSYMIGAFLSGGSVSDL---EHSSNFHLQISSNNEEQIDQLCKLFN 141
Query: 286 CLQLSEAADKYVQQYFHEVSMSDEFIGL 313
+L+ + + Y EF+ L
Sbjct: 142 EFKLTSKTLQLI-VYLKSYEKICEFLKL 168
>gnl|CDD|238867 cd01829, SGNH_hydrolase_peri2, SGNH_peri2; putative periplasmic
member of the SGNH-family of hydrolases, a diverse
family of lipases and esterases. The tertiary fold of
the enzyme is substantially different from that of the
alpha/beta hydrolase family and unique among all known
hydrolases; its active site closely resembles the
Ser-His-Asp(Glu) triad found in other serine hydrolases.
Length = 200
Score = 28.8 bits (65), Expect = 9.2
Identities = 8/29 (27%), Positives = 10/29 (34%)
Query: 605 VYALGGHDGLSIFDSVERYDPKTDEWTSV 633
V LG +D I D + EW
Sbjct: 64 VVFLGANDRQDIRDGDGYLKFGSPEWEEE 92
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.134 0.399
Gapped
Lambda K H
0.267 0.0714 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 38,220,656
Number of extensions: 3748712
Number of successful extensions: 3154
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3088
Number of HSP's successfully gapped: 102
Length of query: 748
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 644
Effective length of database: 6,324,786
Effective search space: 4073162184
Effective search space used: 4073162184
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 63 (28.1 bits)