RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14726
(712 letters)
>gnl|CDD|222983 PHA03098, PHA03098, kelch-like protein; Provisional.
Length = 534
Score = 249 bits (637), Expect = 3e-74
Identities = 134/534 (25%), Positives = 233/534 (43%), Gaps = 40/534 (7%)
Query: 93 CDATIVL-ADNSHFPIHRNILCACSDYFRALFTTSLHDQEKFSLVEIP-GVDASVMTSLI 150
CD +I++ +H+ IL + S+YF+ +F + F EI +D +I
Sbjct: 10 CDESIIIVNGGGIIKVHKIILSSSSEYFKKMF------KNNFKENEINLNIDYDSFNEVI 63
Query: 151 EYAYLRKLHITEDNVASLLLATDFFCMSKAQDLCCDFIKRLTTPRNCFGILLFSRDHFCK 210
+Y Y K++IT +NV +L ++ + +LC ++I ++ NC I FS + CK
Sbjct: 64 KYIYTGKINITSNNVKDILSIANYLIIDFLINLCINYIIKIIDDNNCIDIYRFSFFYGCK 123
Query: 211 KLEEWTRKYILRNFVDVARESEEIVFISEQELEEIISSDELNVKSEETVWELIIRWIDYD 270
KL YI N + + +++S+ EL +I+S D+LNV SE+ V E+II+W+
Sbjct: 124 KLYSAAYNYIRNNIELIYN-DPDFIYLSKNELIKILSDDKLNVSSEDVVLEIIIKWLTSK 182
Query: 271 PENRKKHIVSLMKNIRLGLLETEFF------REKVKHHPYVQSTPECRPIIIETFKFLYD 324
N+ K I ++K +R+ L E + ++K V + C II L
Sbjct: 183 KNNKYKDICLILKVLRITFLSEEGIKKLKRWKLRIKKKKIVFNK-RCIKIIYSKKYNL-- 239
Query: 325 LELIAQIDGEVPTPEIARPRVPHEILFAIGGWSGGSPTDFVETYDTRADRWVKIEQVDPM 384
+ +P + I +++ Y +I + +
Sbjct: 240 -------NKILPRSS------TFGSIIYIHITMSIFTYNYITNY----SPLSEINTIIDI 282
Query: 385 GPRAYHGTAVIGYNIYVIGGFDGNEY-YNSCRCFNAVTKVWKEIAPMNFKRCYVSTAVHH 443
G+ V+ IY IGG + N NS ++ TK W ++ + + R V +
Sbjct: 283 HYVYCFGSVVLNNVIYFIGGMNKNNLSVNSVVSYDTKTKSWNKVPELIYPRKNPGVTVFN 342
Query: 444 DVIYAMGGYNGQRRQNSVEKYNYTENQWSLIAPMNVERSDASATTLQGKIYITGG-FNGH 502
+ IY +GG N+VE + E++W P+ R + + IY+ GG
Sbjct: 343 NRIYVIGGIYNSISLNTVESWKPGESKWREEPPLIFPRYNPCVVNVNNLIYVIGGISKND 402
Query: 503 NCLNSCEVYDPECNQWTLIEPMRHRRSGVSCIAYHECIYVIGGF---NGMSRMCNGEKYN 559
L + E + N+W+ P+ G I + IYVIGG + + E YN
Sbjct: 403 ELLKTVECFSLNTNKWSKGSPLPISHYGGCAIYHDGKIYVIGGISYIDNIKVYNIVESYN 462
Query: 560 PVTKTWSQVPDMYNPRSNFAIEVIDDMIFAIGGFNGVTTIYHVECYDEKTDEWY 613
PVT W+++ + PR N ++ + ++ I+ +GG I +E YD+KT+ W
Sbjct: 463 PVTNKWTELSSLNFPRINASLCIFNNKIYVVGGDKYEYYINEIEVYDDKTNTWT 516
Score = 103 bits (259), Expect = 4e-23
Identities = 56/197 (28%), Positives = 85/197 (43%), Gaps = 12/197 (6%)
Query: 334 EVPTPEIARPR------VPHEILFAIGGWSGGSPTDFVETYDTRADRWVKIEQVDPMGPR 387
PE+ PR V + ++ IGG + VE++ +W E+ + PR
Sbjct: 323 WNKVPELIYPRKNPGVTVFNNRIYVIGGIYNSISLNTVESWKPGESKW--REEPPLIFPR 380
Query: 388 AYHGTAVIGYNIYVIGG-FDGNEYYNSCRCFNAVTKVWKEIAPMNFKRCYVSTAVHHDVI 446
+ IYVIGG +E + CF+ T W + +P+ H I
Sbjct: 381 YNPCVVNVNNLIYVIGGISKNDELLKTVECFSLNTNKWSKGSPLPISHYGGCAIYHDGKI 440
Query: 447 YAMGGY---NGQRRQNSVEKYNYTENQWSLIAPMNVERSDASATTLQGKIYITGGFNGHN 503
Y +GG + + N VE YN N+W+ ++ +N R +AS KIY+ GG
Sbjct: 441 YVIGGISYIDNIKVYNIVESYNPVTNKWTELSSLNFPRINASLCIFNNKIYVVGGDKYEY 500
Query: 504 CLNSCEVYDPECNQWTL 520
+N EVYD + N WTL
Sbjct: 501 YINEIEVYDDKTNTWTL 517
Score = 67.1 bits (164), Expect = 2e-11
Identities = 38/174 (21%), Positives = 66/174 (37%), Gaps = 6/174 (3%)
Query: 463 KYNYTENQWSLIAPMNVERSDAS--ATTLQGKIYITGGFNGHNCLNSCEVYDPECNQWTL 520
K + + +I + ++T IYI N Y P T+
Sbjct: 221 KIVFNKRCIKIIYSKKYNLNKILPRSSTFGSIIYIHIT-MSIFTYNYITNYSPLSEINTI 279
Query: 521 IEPMRHRRSGVSCIAYHECIYVIGGFNG-MSRMCNGEKYNPVTKTWSQVPDMYNPRSNFA 579
I+ H + + IY IGG N + + Y+ TK+W++VP++ PR N
Sbjct: 280 ID--IHYVYCFGSVVLNNVIYFIGGMNKNNLSVNSVVSYDTKTKSWNKVPELIYPRKNPG 337
Query: 580 IEVIDDMIFAIGGFNGVTTIYHVECYDEKTDEWYEATDMNIYRSALSACVIMGL 633
+ V ++ I+ IGG ++ VE + +W E + R + L
Sbjct: 338 VTVFNNRIYVIGGIYNSISLNTVESWKPGESKWREEPPLIFPRYNPCVVNVNNL 391
>gnl|CDD|149006 pfam07707, BACK, BTB And C-terminal Kelch. This domain is found
associated with pfam00651 and pfam01344. The BACK domain
is found juxtaposed to the BTB domain; they are
separated by as little as two residues. This family
appears to be closely related to the BTB domain (Finn
RD, personal observation).
Length = 101
Score = 119 bits (302), Expect = 3e-32
Identities = 45/101 (44%), Positives = 73/101 (72%), Gaps = 1/101 (0%)
Query: 197 CFGILLFSRDHFCKKLEEWTRKYILRNFVDVARESEEIVFISEQELEEIISSDELNVKSE 256
C GI F+ + C++L E ++IL+NF++V+ +SEE + +S+++L E++SSDELNV SE
Sbjct: 1 CLGIYRFADTYGCEELAEVALRFILKNFLEVS-KSEEFLELSKEQLLELLSSDELNVPSE 59
Query: 257 ETVWELIIRWIDYDPENRKKHIVSLMKNIRLGLLETEFFRE 297
E V+E +I+W+ +D ENRKKH+ L+ +RL LL E+ +
Sbjct: 60 EEVFEAVIKWVKHDVENRKKHLPELLSAVRLPLLSPEYLLD 100
>gnl|CDD|197943 smart00875, BACK, BTB And C-terminal Kelch. The BACK domain is
found juxtaposed to the BTB domain; they are separated
by as little as two residues.
Length = 101
Score = 111 bits (279), Expect = 3e-29
Identities = 41/102 (40%), Positives = 70/102 (68%), Gaps = 1/102 (0%)
Query: 197 CFGILLFSRDHFCKKLEEWTRKYILRNFVDVARESEEIVFISEQELEEIISSDELNVKSE 256
C GI F+ H ++L E ++IL+NF +V+ SEE + + ++L E++SSD+LNV SE
Sbjct: 1 CLGIRRFADAHGLEELAEKALRFILQNFSEVSS-SEEFLELPLEQLLELLSSDDLNVSSE 59
Query: 257 ETVWELIIRWIDYDPENRKKHIVSLMKNIRLGLLETEFFREK 298
E V+E ++RW+ +DPE R++H+ L++++RL LL ++ E
Sbjct: 60 EEVFEAVLRWVKHDPEKRREHLPELLEHVRLPLLSPDYLLEV 101
>gnl|CDD|165153 PHA02790, PHA02790, Kelch-like protein; Provisional.
Length = 480
Score = 115 bits (290), Expect = 3e-27
Identities = 118/525 (22%), Positives = 220/525 (41%), Gaps = 82/525 (15%)
Query: 96 TIVLADNSHFPIHRNILCACSDYFRALFTTSLHDQEKFSLVEIP------GVDASVMTSL 149
TI+ A + ++ IL S YFR H ++K++ + P +D +TS+
Sbjct: 24 TIIEAIGGNIIVNSTILKKLSPYFRT------HLRQKYTKNKDPVTRVCLDLDIHSLTSI 77
Query: 150 IEYAYLRKLHITEDNVASLLLATDFFCMSKAQDLCCDFIKRLTTPRNCFGILLFSRDHFC 209
+ Y+Y K++I NV +LL A+ + C +FI R C + ++
Sbjct: 78 VIYSYTGKVYIDSHNVVNLLRASILTSVEFIIYTCINFILRDFRKEYCVECYMMGIEYGL 137
Query: 210 KKLEEWTRKYILRNFVDVARESEEIVFISEQELEEIISSDELNVKSEETVWELIIRWIDY 269
L T+ +I ++F+++ E + I ++ I+ SDELNV E+ V + +I+W
Sbjct: 138 SNLLCHTKDFIAKHFLEL--EDDIIDNFDYLSMKLILESDELNVPDEDYVVDFVIKWY-M 194
Query: 270 DPENRKKHIVSLMKNIRLGLLETEFFREKVKHHPYVQSTPECRPIIIETFKFLYDLELIA 329
NR +++ L+KN+ + + + P I K++ D I
Sbjct: 195 KRRNRLGNLLLLIKNV----IRSNYLS----------------PRGINNVKWILDCTKIF 234
Query: 330 QIDGEVPTPEIARPRVPHEILFAIGGWSGGSPTDFVETYDTRADRWVKIEQVDPMGPRAY 389
D +PR ++ F+E Y D+ + I +
Sbjct: 235 HCD--------KQPRKSYKY-------------PFIE-YPMNMDQIIDI----------F 262
Query: 390 H--GTAVIGYNIYVIGGFDGNEYYNSCRCFNAVTKVWKEIAPMNFKRCYVSTAVHHDVIY 447
H + +G +Y+IGG+ NE +N+ N ++ W I PMN R Y S ++ +Y
Sbjct: 263 HMCTSTHVGEVVYLIGGWMNNEIHNNAIAVNYISNNWIPIPPMNSPRLYASGVPANNKLY 322
Query: 448 AMGGYNGQRRQNSVEKYNYTENQWSLIAPMNVERSDASATTLQGKIYITGGFNGHNCLNS 507
+GG SVE++ + + W + + R + + ++ IY+ GG + + +
Sbjct: 323 VVGGLPN---PTSVERWFHGDAAWVNMPSLLKPRCNPAVASINNVIYVIGGHSETD--TT 377
Query: 508 CEVYDPECNQWTLIEPMRHRRSGVSCIAYHECIYVIGGFNGMSRMCNGEKYNPVTKTWSQ 567
E P +QW + + + ++++G N E Y + TW+
Sbjct: 378 TEYLLPNHDQWQFGPSTYYPHYKSCALVFGRRLFLVGR--------NAEFYCESSNTWTL 429
Query: 568 VPDMYNPRSNFAIEVIDDMIFAIGGFNGVTTIYHVECYDEKTDEW 612
+ D PR N + ++D+ + IGGF + I +E Y+ +T W
Sbjct: 430 IDDPIYPRDNPELIIVDNKLLLIGGFYRGSYIDTIEVYNNRTYSW 474
>gnl|CDD|216043 pfam00651, BTB, BTB/POZ domain. The BTB (for BR-C, ttk and bab) or
POZ (for Pox virus and Zinc finger) domain is present
near the N-terminus of a fraction of zinc finger
(pfam00096) proteins and in proteins that contain the
pfam01344 motif such as Kelch and a family of pox virus
proteins. The BTB/POZ domain mediates homomeric
dimerisation and in some instances heteromeric
dimerisation. The structure of the dimerised PLZF
BTB/POZ domain has been solved and consists of a tightly
intertwined homodimer. The central scaffolding of the
protein is made up of a cluster of alpha-helices flanked
by short beta-sheets at both the top and bottom of the
molecule. POZ domains from several zinc finger proteins
have been shown to mediate transcriptional repression
and to interact with components of histone deacetylase
co-repressor complexes including N-CoR and SMRT. The POZ
or BTB domain is also known as BR-C/Ttk or ZiN.
Length = 101
Score = 98.8 bits (247), Expect = 8e-25
Identities = 39/107 (36%), Positives = 54/107 (50%), Gaps = 6/107 (5%)
Query: 83 LNEFREKNLLCDATIVLADNSHFPIHRNILCACSDYFRALFTTSLHDQEKFSLVEIPGVD 142
LNE RE LCD T+V + F H+ +L ACS YF+ALFT + + + V
Sbjct: 1 LNELRENGELCDVTLV-VGDKEFHAHKAVLAACSPYFKALFTGNK-----EVEITLEDVS 54
Query: 143 ASVMTSLIEYAYLRKLHITEDNVASLLLATDFFCMSKAQDLCCDFIK 189
+L+E+ Y KL ITE+NV LL D + D C +F+
Sbjct: 55 PEDFEALLEFIYTGKLEITEENVDDLLALADKLQIPALIDKCEEFLI 101
>gnl|CDD|197585 smart00225, BTB, Broad-Complex, Tramtrack and Bric a brac. Domain
in Broad-Complex, Tramtrack and Bric a brac. Also known
as POZ (poxvirus and zinc finger) domain. Known to be a
protein-protein interaction motif found at the N-termini
of several C2H2-type transcription factors as well as
Shaw-type potassium channels. Known structure reveals a
tightly intertwined dimer formed via interactions
between N-terminal strand and helix structures. However
in a subset of BTB/POZ domains, these two secondary
structures appear to be missing. Be aware SMART predicts
BTB/POZ domains without the beta1- and alpha1-secondary
structures.
Length = 97
Score = 84.3 bits (209), Expect = 1e-19
Identities = 30/99 (30%), Positives = 51/99 (51%), Gaps = 2/99 (2%)
Query: 94 DATIVLADNSHFPIHRNILCACSDYFRALFTTSLHDQEKFSLVEIPGVDASVMTSLIEYA 153
D T+V F H+ +L A S YF+ALF++ + +K S + + V +L+ +
Sbjct: 1 DVTLV-VGGKKFHAHKAVLAAHSPYFKALFSSDFKESDK-SEIYLDDVSPEDFRALLNFL 58
Query: 154 YLRKLHITEDNVASLLLATDFFCMSKAQDLCCDFIKRLT 192
Y KL + E+NV LL D+ + +LC +F+ +L
Sbjct: 59 YTGKLDLPEENVEELLELADYLQIPGLVELCEEFLLKLL 97
>gnl|CDD|128874 smart00612, Kelch, Kelch domain.
Length = 47
Score = 69.9 bits (172), Expect = 3e-15
Identities = 20/47 (42%), Positives = 29/47 (61%)
Query: 539 CIYVIGGFNGMSRMCNGEKYNPVTKTWSQVPDMYNPRSNFAIEVIDD 585
IYV+GGF+G R+ + E Y+P T W+ +P M PRS + VI+
Sbjct: 1 KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47
Score = 61.0 bits (149), Expect = 3e-12
Identities = 22/47 (46%), Positives = 27/47 (57%)
Query: 492 KIYITGGFNGHNCLNSCEVYDPECNQWTLIEPMRHRRSGVSCIAYHE 538
KIY+ GGF+G L S EVYDPE N+WT + M RSG +
Sbjct: 1 KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47
Score = 60.3 bits (147), Expect = 7e-12
Identities = 16/47 (34%), Positives = 25/47 (53%)
Query: 445 VIYAMGGYNGQRRQNSVEKYNYTENQWSLIAPMNVERSDASATTLQG 491
IY +GG++G +R SVE Y+ N+W+ + M RS + G
Sbjct: 1 KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47
Score = 58.7 bits (143), Expect = 2e-11
Identities = 15/46 (32%), Positives = 21/46 (45%)
Query: 399 IYVIGGFDGNEYYNSCRCFNAVTKVWKEIAPMNFKRCYVSTAVHHD 444
IYV+GGFDG + S ++ T W + M R AV +
Sbjct: 2 IYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47
Score = 56.4 bits (137), Expect = 1e-10
Identities = 17/46 (36%), Positives = 24/46 (52%)
Query: 587 IFAIGGFNGVTTIYHVECYDEKTDEWYEATDMNIYRSALSACVIMG 632
I+ +GGF+G + VE YD +T++W M RS VI G
Sbjct: 2 IYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47
Score = 42.5 bits (101), Expect = 1e-05
Identities = 17/48 (35%), Positives = 25/48 (52%), Gaps = 4/48 (8%)
Query: 350 LFAIGGWSGGSPTDFVETYDTRADRWVKIEQVDPM-GPRAYHGTAVIG 396
++ +GG+ GG VE YD ++W + M PR+ HG AVI
Sbjct: 2 IYVVGGFDGGQRLKSVEVYDPETNKW---TPLPSMPTPRSGHGVAVIN 46
>gnl|CDD|177844 PLN02193, PLN02193, nitrile-specifier protein.
Length = 470
Score = 75.4 bits (185), Expect = 3e-14
Identities = 50/165 (30%), Positives = 74/165 (44%), Gaps = 20/165 (12%)
Query: 395 IGYNIYVIGGFDGNEYYNSCRCFNAVTKVWKEIAPMN---FKRCYVSTAVHHDVIYAMGG 451
IG +YV GG D + YN F+ T WK + P+ R + S A + +Y GG
Sbjct: 227 IGSTLYVFGGRDASRQYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHSMAADEENVYVFGG 286
Query: 452 YNGQRRQNSVEKYNYTENQW---SLIAPMNVERSDASATTLQGKIYITGGFNGHNCLNSC 508
+ R +++ YN + +W S R A +QGK+++ GFNG C
Sbjct: 287 VSATARLKTLDSYNIVDKKWFHCSTPGDSFSIRGGAGLEVVQGKVWVVYGFNG------C 340
Query: 509 EV-----YDPECNQWTLIEPMRHR---RSGVSCIAYHECIYVIGG 545
EV YDP ++WT +E R RS + A + I + GG
Sbjct: 341 EVDDVHYYDPVQDKWTQVETFGVRPSERSVFASAAVGKHIVIFGG 385
Score = 70.8 bits (173), Expect = 8e-13
Identities = 63/253 (24%), Positives = 95/253 (37%), Gaps = 63/253 (24%)
Query: 374 RWVKIEQV-DPMGPRAYHGTAVIGYNIYVIGG-FDGNEYYNS-CRCFNAVTKVWKEIAPM 430
+W+K+EQ + G R HG A +G IY GG F N+ + F+ T+ W I+P
Sbjct: 152 KWIKVEQKGEGPGLRCSHGIAQVGNKIYSFGGEFTPNQPIDKHLYVFDLETRTW-SISPA 210
Query: 431 N--------FKRCYVSTAVHHDVIYAMGGYNGQRRQNSVEKYNYTENQWSLIAPMNVERS 482
VS +Y GG + R+ N ++ T N+W L+ P
Sbjct: 211 TGDVPHLSCLGVRMVSIG---STLYVFGGRDASRQYNGFYSFDTTTNEWKLLTP------ 261
Query: 483 DASATTLQGKIYITGGFNGHNCLNSCEVYDPECNQWTLIEPMRHRRSGVSCIAYHECIYV 542
+E RS S A E +YV
Sbjct: 262 --------------------------------------VEEGPTPRSFHSMAADEENVYV 283
Query: 543 IGGFNGMSRMCNGEKYNPVTKTW---SQVPDMYNPRSNFAIEVIDDMIFAIGGFNGVTTI 599
GG + +R+ + YN V K W S D ++ R +EV+ ++ + GFNG +
Sbjct: 284 FGGVSATARLKTLDSYNIVDKKWFHCSTPGDSFSIRGGAGLEVVQGKVWVVYGFNGC-EV 342
Query: 600 YHVECYDEKTDEW 612
V YD D+W
Sbjct: 343 DDVHYYDPVQDKW 355
>gnl|CDD|201739 pfam01344, Kelch_1, Kelch motif. The kelch motif was initially
discovered in Kelch. In this protein there are six
copies of the motif. It has been shown that the
Drosophila ring canal kelch protein is related to
Galactose Oxidase, for which a structure has been
solved. The kelch motif forms a beta sheet. Several of
these sheets associate to form a beta propeller
structure as found in pfam00064, pfam00400 and
pfam00415.
Length = 46
Score = 63.1 bits (154), Expect = 7e-13
Identities = 22/45 (48%), Positives = 28/45 (62%)
Query: 481 RSDASATTLQGKIYITGGFNGHNCLNSCEVYDPECNQWTLIEPMR 525
RS A L GKIY+ GG++G L+S EVYDPE N W+ + M
Sbjct: 2 RSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSMP 46
Score = 61.1 bits (149), Expect = 3e-12
Identities = 16/46 (34%), Positives = 22/46 (47%)
Query: 433 KRCYVSTAVHHDVIYAMGGYNGQRRQNSVEKYNYTENQWSLIAPMN 478
R V IY +GGY+G + +SVE Y+ N WS + M
Sbjct: 1 PRSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSMP 46
Score = 60.8 bits (148), Expect = 4e-12
Identities = 19/45 (42%), Positives = 27/45 (60%)
Query: 527 RRSGVSCIAYHECIYVIGGFNGMSRMCNGEKYNPVTKTWSQVPDM 571
RSG + IYVIGG++G + + E Y+P T TWS++P M
Sbjct: 1 PRSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSM 45
Score = 58.1 bits (141), Expect = 4e-11
Identities = 17/46 (36%), Positives = 26/46 (56%)
Query: 386 PRAYHGTAVIGYNIYVIGGFDGNEYYNSCRCFNAVTKVWKEIAPMN 431
PR+ G V+G IYVIGG+DG + +S ++ T W ++ M
Sbjct: 1 PRSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSMP 46
Score = 57.3 bits (139), Expect = 7e-11
Identities = 16/46 (34%), Positives = 26/46 (56%)
Query: 574 PRSNFAIEVIDDMIFAIGGFNGVTTIYHVECYDEKTDEWYEATDMN 619
PRS + V+ I+ IGG++G ++ VE YD +T+ W + M
Sbjct: 1 PRSGAGVVVLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKLPSMP 46
Score = 38.4 bits (90), Expect = 3e-04
Identities = 12/34 (35%), Positives = 17/34 (50%)
Query: 345 VPHEILFAIGGWSGGSPTDFVETYDTRADRWVKI 378
V ++ IGG+ GG VE YD + W K+
Sbjct: 9 VLGGKIYVIGGYDGGQSLSSVEVYDPETNTWSKL 42
>gnl|CDD|165086 PHA02713, PHA02713, hypothetical protein; Provisional.
Length = 557
Score = 67.7 bits (165), Expect = 1e-11
Identities = 109/561 (19%), Positives = 206/561 (36%), Gaps = 73/561 (13%)
Query: 58 IENAIKNKKNFSSIDRAMSVQAMQALNEFREKNLLCDATIVLADNSHFPIHRNILCACSD 117
I++ N++ S+I + + ++LCD I + D H+ IL A S
Sbjct: 3 IDDIKHNRRVVSNISNLL------------DDDILCDVIITIGDGEEIKAHKTILAAGSK 50
Query: 118 YFRALFTTSLHDQEKFSLVEIPGVDASVMTSLIEYAYLRKLHITEDNVASLLLATDFFCM 177
YFR LFTT + ++ + V + D + ++++Y Y R HI+ NV +L D+ +
Sbjct: 51 YFRTLFTTPMIIRDLVTRVNLQMFDKDAVKNIVQYLYNR--HISSMNVIDVLKCADYLLI 108
Query: 178 SKAQDLCCDFIKRLTTPRNCFGIL--LFSRDHFCKKLEEWTRKYILRNFVDVARESEEIV 235
C +IK T C + L+ H + ++ ++ ++ N +
Sbjct: 109 DDLVTDCESYIKDYTNHDTCIYMYHRLYEMSHI--PIVKYIKRMLMSNIPTLITTDAFKK 166
Query: 236 FISEQELEEIISSDELNVKSEETVWELIIRWIDYDPENRKK--------HIVSLMKNIRL 287
+ E + I ++D + + E ++++W++Y+ ++ I +L K RL
Sbjct: 167 TVFEILFDIISTNDNVYLYREGYKVTILLKWLEYNYITEEQLLCILSCIDIQNLDKKSRL 226
Query: 288 GLLETEFFREKVKHHPYVQSTPECRPIIIETFKFLY--------------------DLEL 327
L + ++ + R II + + +
Sbjct: 227 LLYSNKTINMYPSCIQFLLDNKQNRNIIPRQLCLVCHDTKYNVCNPCILVYNINTMEYSV 286
Query: 328 IAQIDGEVPTPEIARPRVPHEILFAIGGWSGGSPTDFVETYDTRADRWVKIEQVDPMGPR 387
I+ I + A V +EI+ A GG++ +P+ + Y + + +E + R
Sbjct: 287 ISTIPNHIIN--YASAIVDNEIIIA-GGYNFNNPS-LNKVYKINIENKIHVELPPMIKNR 342
Query: 388 AYHGTAVIGYNIYVIGGFDGNEYYNSCRCFNAVTKVWKEIAPMNFKRCYVSTAVHHDVIY 447
AVI IY IGG +G + C+ WK + M V IY
Sbjct: 343 CRFSLAVIDDTIYAIGGQNGTNVERTIECYTMGDDKWKMLPDMPIALSSYGMCVLDQYIY 402
Query: 448 AMGG------YNGQRRQNSVE------------KYNYTENQWSLIAPMNVERSDASATTL 489
+GG Y NS++ +Y+ N W + +
Sbjct: 403 IIGGRTEHIDYTSVHHMNSIDMEEDTHSSNKVIRYDTVNNIWETLPNFWTGTIRPGVVSH 462
Query: 490 QGKIYITGGFNGHNCLNSCEV-YDPEC-NQWTLIEPMRHRRSGVSCIAYHECIYVIGGFN 547
+ IY+ + +C Y+ N W LI R S + I + I ++ +
Sbjct: 463 KDDIYVVCDIKDEKNVKTCIFRYNTNTYNGWELITTTESRLSALHTILHDNTIMMLHCYE 522
Query: 548 GMSRMCNGEKYNPVTKTWSQV 568
S M + +N T W+ +
Sbjct: 523 --SYMLQ-DTFNVYTYEWNHI 540
Score = 58.9 bits (142), Expect = 5e-09
Identities = 40/131 (30%), Positives = 61/131 (46%), Gaps = 2/131 (1%)
Query: 502 HNCLNSC-EVYDPECNQWTLIEPMRHRRSGVSCIAYHECIYVIGGFNGMSRMCNG-EKYN 559
+N N C VY+ ++++I + + + I + GG+N + N K N
Sbjct: 267 YNVCNPCILVYNINTMEYSVISTIPNHIINYASAIVDNEIIIAGGYNFNNPSLNKVYKIN 326
Query: 560 PVTKTWSQVPDMYNPRSNFAIEVIDDMIFAIGGFNGVTTIYHVECYDEKTDEWYEATDMN 619
K ++P M R F++ VIDD I+AIGG NG +ECY D+W DM
Sbjct: 327 IENKIHVELPPMIKNRCRFSLAVIDDTIYAIGGQNGTNVERTIECYTMGDDKWKMLPDMP 386
Query: 620 IYRSALSACVI 630
I S+ CV+
Sbjct: 387 IALSSYGMCVL 397
Score = 51.9 bits (124), Expect = 9e-07
Identities = 43/154 (27%), Positives = 70/154 (45%), Gaps = 8/154 (5%)
Query: 464 YNYTENQWSLIAPMNVERSDASATTLQGKIYITGGFNGHN-CLNSCEVYDPECNQWTLIE 522
YN ++S+I+ + + ++ + +I I GG+N +N LN +VY +E
Sbjct: 277 YNINTMEYSVISTIPNHIINYASAIVDNEIIIAGGYNFNNPSLN--KVYKINIENKIHVE 334
Query: 523 --PMRHRRSGVSCIAYHECIYVIGGFNGMSRMCNGEKYNPVTKTWSQVPDMYNPRSNFAI 580
PM R S + IY IGG NG + E Y W +PDM S++ +
Sbjct: 335 LPPMIKNRCRFSLAVIDDTIYAIGGQNGTNVERTIECYTMGDDKWKMLPDMPIALSSYGM 394
Query: 581 EVIDDMIFAIGG---FNGVTTIYHVECYDEKTDE 611
V+D I+ IGG T+++H+ D + D
Sbjct: 395 CVLDQYIYIIGGRTEHIDYTSVHHMNSIDMEEDT 428
>gnl|CDD|177814 PLN02153, PLN02153, epithiospecifier protein.
Length = 341
Score = 64.6 bits (157), Expect = 5e-11
Identities = 65/255 (25%), Positives = 108/255 (42%), Gaps = 41/255 (16%)
Query: 375 WVKIEQ---VDPMGPRAYHGTAVIGYNIYVIGG-FDGNEYYNS-CRCFNAVTKVWKEIAP 429
W+K+EQ P GPR HG AV+G +Y GG NE+ + F+ T W IAP
Sbjct: 9 WIKVEQKGGKGP-GPRCSHGIAVVGDKLYSFGGELKPNEHIDKDLYVFDFNTHTW-SIAP 66
Query: 430 MNFKRCYVST------AVHHDVIYAMGGYNGQRRQNSVEKYNYTENQWSLIAPMNVE--- 480
N +S AV +Y GG + +R + Y+ +N+W+ + ++ E
Sbjct: 67 ANGDVPRISCLGVRMVAVGTK-LYIFGGRDEKREFSDFYSYDTVKNEWTFLTKLDEEGGP 125
Query: 481 --RSDASATTLQGKIYITGGFNGHNCLN------SCEVYDPECNQWT-LIEPMRH--RRS 529
R+ S + + +Y+ GG + + + E Y+ +W L +P + +R
Sbjct: 126 EARTFHSMASDENHVYVFGGVSKGGLMKTPERFRTIEAYNIADGKWVQLPDPGENFEKRG 185
Query: 530 GVSCIAYHECIYVIGGFNGMSRMCNGEK---------YNPVTKTWSQVPDM---YNPRSN 577
G I+V+ GF S + G+ ++P + W++V + RS
Sbjct: 186 GAGFAVVQGKIWVVYGF-ATSILPGGKSDYESNAVQFFDPASGKWTEVETTGAKPSARSV 244
Query: 578 FAIEVIDDMIFAIGG 592
FA V+ I GG
Sbjct: 245 FAHAVVGKYIIIFGG 259
>gnl|CDD|211835 TIGR03548, mutarot_permut, cyclically-permuted mutarotase family
protein. Members of this protein family show
essentially full-length homology, cyclically permuted,
to YjhT from Escherichia coli. YjhT was shown to act as
a mutarotase for sialic acid, and by this ability to be
able to act as a virulence factor. Members of the YjhT
family (TIGR03547) and this cyclically-permuted family
have multiple repeats of the beta-propeller-forming
Kelch repeat.
Length = 323
Score = 55.5 bits (134), Expect = 3e-08
Identities = 47/184 (25%), Positives = 81/184 (44%), Gaps = 14/184 (7%)
Query: 423 VWKEIAPMNFKRCYVSTAVHHDVIYAMGGYNGQRRQNSVEKYNYTENQWSL----IAPMN 478
W + + + Y ++ + IY +GG N +SV + E++ +L + +
Sbjct: 52 KWVKAGQLPYAAAYGASISTENGIYYIGGSNSSESFSSVYRITLDESKEALIIETLPSLP 111
Query: 479 VERSDASATTLQGKIYITGGFNGHNCLNSCEVYDPECNQWTLIE--PMRHRRSGVSCIAY 536
+ SAT GK+Y+ GG N ++ E +W + P R V C+
Sbjct: 112 FAMDNGSATYKDGKLYVGGGNANGKPSNKFYCFNLETQEWEELPDFPGAPRVQPV-CVKL 170
Query: 537 HECIYVIGGFNGMSRMCNGEKYNPVTKTWSQVPDMY---NPRS---NFAIEVIDDMIFAI 590
+YV GG + ++ +G KY+P T TW +V D P S +I + + ++ I
Sbjct: 171 QNELYVFGGGDNIAYT-DGYKYSPKTGTWEKVADPLSDGEPISLLGAASIAINESLLLCI 229
Query: 591 GGFN 594
GGFN
Sbjct: 230 GGFN 233
Score = 34.7 bits (80), Expect = 0.15
Identities = 35/173 (20%), Positives = 60/173 (34%), Gaps = 27/173 (15%)
Query: 345 VPHEILFAIGGWSGGSPTDFVETYDTRADRWVKIEQVDPMGPRAY---HGTAV-IGYNIY 400
+ I + G S S + Y D + ++ + + +G+A +Y
Sbjct: 71 TENGIYYIGGSNSSESFSS---VYRITLDESKEALIIETLPSLPFAMDNGSATYKDGKLY 127
Query: 401 VIGGFDGNEYYNSCRCFNAVTKVWKEIA--PMNFKRCYVSTAVHHDVIYAMGGYNGQRRQ 458
V GG + N CFN T+ W+E+ P + V + +++ GG N
Sbjct: 128 VGGGNANGKPSNKFYCFNLETQEWEELPDFPGAPRVQPVCVKLQNELYVFGGGDN----I 183
Query: 459 NSVEKYNYT--ENQWSLIAPMNVERSDASATTLQG---------KIYITGGFN 500
+ Y Y+ W + SD +L G + GGFN
Sbjct: 184 AYTDGYKYSPKTGTWE---KVADPLSDGEPISLLGAASIAINESLLLCIGGFN 233
>gnl|CDD|206134 pfam13964, Kelch_6, Kelch motif.
Length = 50
Score = 48.6 bits (116), Expect = 9e-08
Identities = 15/50 (30%), Positives = 26/50 (52%), Gaps = 1/50 (2%)
Query: 386 PRAYHGTAVIGYNIYVIGGF-DGNEYYNSCRCFNAVTKVWKEIAPMNFKR 434
PR H +G IYV GG+ +G++ N ++ T W+++ P+ R
Sbjct: 1 PRTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPLPTPR 50
Score = 48.6 bits (116), Expect = 1e-07
Identities = 17/49 (34%), Positives = 28/49 (57%), Gaps = 1/49 (2%)
Query: 528 RSGVSCIAYHECIYVIGGFNGMSRMCN-GEKYNPVTKTWSQVPDMYNPR 575
R+G S ++ IYV GG++ S+ N Y+P T +W ++P + PR
Sbjct: 2 RTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPLPTPR 50
Score = 45.9 bits (109), Expect = 9e-07
Identities = 20/49 (40%), Positives = 27/49 (55%), Gaps = 1/49 (2%)
Query: 481 RSDASATTLQGKIYITGGF-NGHNCLNSCEVYDPECNQWTLIEPMRHRR 528
R+ SA ++ GKIY+ GG+ NG N VYDPE W + P+ R
Sbjct: 2 RTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPLPTPR 50
Score = 42.1 bits (99), Expect = 2e-05
Identities = 15/49 (30%), Positives = 19/49 (38%), Gaps = 1/49 (2%)
Query: 434 RCYVSTAVHHDVIYAMGGY-NGQRRQNSVEKYNYTENQWSLIAPMNVER 481
R S IY GGY NG + N V Y+ W + P+ R
Sbjct: 2 RTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPLPTPR 50
Score = 37.8 bits (88), Expect = 6e-04
Identities = 11/50 (22%), Positives = 22/50 (44%), Gaps = 1/50 (2%)
Query: 574 PRSNFAIEVIDDMIFAIGGFNGVTTIYH-VECYDEKTDEWYEATDMNIYR 622
PR+ + + I+ GG++ + + V YD +T W + + R
Sbjct: 1 PRTGHSAVSVGGKIYVFGGYSNGSKASNKVLVYDPETGSWEKLPPLPTPR 50
>gnl|CDD|222113 pfam13415, Kelch_3, Galactose oxidase, central domain.
Length = 48
Score = 40.7 bits (96), Expect = 5e-05
Identities = 15/47 (31%), Positives = 21/47 (44%), Gaps = 1/47 (2%)
Query: 491 GKIYITGGFNGHN-CLNSCEVYDPECNQWTLIEPMRHRRSGVSCIAY 536
GKIY+ GG LN VYD + N W + + R+G +
Sbjct: 2 GKIYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDLPGPRAGHAATVI 48
Score = 39.2 bits (92), Expect = 2e-04
Identities = 14/42 (33%), Positives = 19/42 (45%), Gaps = 1/42 (2%)
Query: 540 IYVIGGFNGMSRMCNG-EKYNPVTKTWSQVPDMYNPRSNFAI 580
IYV GG N Y+ T TW ++ D+ PR+ A
Sbjct: 4 IYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDLPGPRAGHAA 45
Score = 37.7 bits (88), Expect = 6e-04
Identities = 10/46 (21%), Positives = 17/46 (36%), Gaps = 1/46 (2%)
Query: 398 NIYVIGGFDG-NEYYNSCRCFNAVTKVWKEIAPMNFKRCYVSTAVH 442
IYV GG N ++ T W+++ + R + V
Sbjct: 3 KIYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDLPGPRAGHAATVI 48
Score = 31.1 bits (71), Expect = 0.16
Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 1/48 (2%)
Query: 584 DDMIFAIGGFNGVTTIYH-VECYDEKTDEWYEATDMNIYRSALSACVI 630
I+ GG T + + YD T+ W + D+ R+ +A VI
Sbjct: 1 GGKIYVFGGLGDGGTRLNDLWVYDLDTNTWEKLGDLPGPRAGHAATVI 48
>gnl|CDD|205596 pfam13418, Kelch_4, Galactose oxidase, central domain.
Length = 49
Score = 40.3 bits (95), Expect = 7e-05
Identities = 13/46 (28%), Positives = 26/46 (56%), Gaps = 2/46 (4%)
Query: 481 RSDASATTLQ-GKIYITGGFNG-HNCLNSCEVYDPECNQWTLIEPM 524
R+ ++T++ G++Y+ GG N + L+ V+D N WT + +
Sbjct: 2 RAYHTSTSIGDGRLYLFGGENEDGSVLSDVWVFDLSTNTWTRLPSL 47
Score = 39.6 bits (93), Expect = 1e-04
Identities = 13/47 (27%), Positives = 22/47 (46%), Gaps = 2/47 (4%)
Query: 386 PRAYHGTAVIGYN-IYVIGGFDGN-EYYNSCRCFNAVTKVWKEIAPM 430
PRAYH + IG +Y+ GG + + + F+ T W + +
Sbjct: 1 PRAYHTSTSIGDGRLYLFGGENEDGSVLSDVWVFDLSTNTWTRLPSL 47
Score = 31.8 bits (73), Expect = 0.073
Identities = 9/44 (20%), Positives = 20/44 (45%), Gaps = 1/44 (2%)
Query: 435 CYVSTAVHHDVIYAMGGYNGQ-RRQNSVEKYNYTENQWSLIAPM 477
+ ST++ +Y GG N + V ++ + N W+ + +
Sbjct: 4 YHTSTSIGDGRLYLFGGENEDGSVLSDVWVFDLSTNTWTRLPSL 47
Score = 31.5 bits (72), Expect = 0.10
Identities = 9/33 (27%), Positives = 19/33 (57%), Gaps = 1/33 (3%)
Query: 540 IYVIGGFNG-MSRMCNGEKYNPVTKTWSQVPDM 571
+Y+ GG N S + + ++ T TW+++P +
Sbjct: 15 LYLFGGENEDGSVLSDVWVFDLSTNTWTRLPSL 47
>gnl|CDD|237617 PRK14131, PRK14131, N-acetylneuraminic acid mutarotase;
Provisional.
Length = 376
Score = 43.5 bits (103), Expect = 3e-04
Identities = 36/127 (28%), Positives = 52/127 (40%), Gaps = 22/127 (17%)
Query: 391 GT-AVIGYNIYVIGGFDGNEYYNSCRCFNAVTKVWKEIAPMNF---KRCYVSTAVHHDVI 446
GT A+ +YV G G +Y NA +K W +IA F R A +
Sbjct: 32 GTGAIDNNTVYVGLGSAGTSWYK--LDLNAPSKGWTKIA--AFPGGPREQAVAAFIDGKL 87
Query: 447 YAMGGYNGQ-------RRQNSVEKYNYTENQWS---LIAPMNVERSDASATTLQG-KIYI 495
Y GG G+ + + V KY+ N W +P+ + + A +L K YI
Sbjct: 88 YVFGG-IGKTNSEGSPQVFDDVYKYDPKTNSWQKLDTRSPVGL--AGHVAVSLHNGKAYI 144
Query: 496 TGGFNGH 502
TGG N +
Sbjct: 145 TGGVNKN 151
Score = 43.5 bits (103), Expect = 3e-04
Identities = 25/93 (26%), Positives = 35/93 (37%), Gaps = 11/93 (11%)
Query: 465 NYTENQWSLIAPM-NVERSDASATTLQGKIYITGGFNGHN------CLNSCEVYDPECNQ 517
N W+ IA R A A + GK+Y+ GG N + YDP+ N
Sbjct: 58 NAPSKGWTKIAAFPGGPREQAVAAFIDGKLYVFGGIGKTNSEGSPQVFDDVYKYDPKTNS 117
Query: 518 WTLIEPMRHRRS--GVSCIAYHEC-IYVIGGFN 547
W + R G ++ H Y+ GG N
Sbjct: 118 WQKL-DTRSPVGLAGHVAVSLHNGKAYITGGVN 149
Score = 38.8 bits (91), Expect = 0.008
Identities = 23/89 (25%), Positives = 40/89 (44%), Gaps = 11/89 (12%)
Query: 374 RWVKIEQVDPMGPRAYHGTAVIGYNIYVIGGF-----DG-NEYYNSCRCFNAVTKVWKEI 427
W KI P GPR A I +YV GG +G + ++ ++ T W+++
Sbjct: 63 GWTKIAAF-PGGPREQAVAAFIDGKLYVFGGIGKTNSEGSPQVFDDVYKYDPKTNSWQKL 121
Query: 428 ---APMNFKRCYVSTAVHHDVIYAMGGYN 453
+P+ +V+ ++H+ Y GG N
Sbjct: 122 DTRSPVGLA-GHVAVSLHNGKAYITGGVN 149
Score = 31.5 bits (72), Expect = 1.6
Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 11/69 (15%)
Query: 350 LFAIGGW----SGGSPTDF--VETYDTRADRWVKIEQVDPMGPRAYHGTAVIGYNIYVIG 403
L+ GG S GSP F V YD + + W K++ P+G + ++ Y+ G
Sbjct: 87 LYVFGGIGKTNSEGSPQVFDDVYKYDPKTNSWQKLDTRSPVGLAGHVAVSLHNGKAYITG 146
Query: 404 G-----FDG 407
G FDG
Sbjct: 147 GVNKNIFDG 155
>gnl|CDD|234253 TIGR03547, muta_rot_YjhT, mutatrotase, YjhT family. Members of
this protein family contain multiple copies of the
beta-propeller-forming Kelch repeat. All are full-length
homologs to YjhT of Escherichia coli, which has been
identified as a mutarotase for sialic acid. This protein
improves bacterial ability to obtain host sialic acid,
and thus serves as a virulence factor. Some bacteria
carry what appears to be a cyclically permuted homolog
of this protein.
Length = 346
Score = 43.1 bits (102), Expect = 3e-04
Identities = 36/126 (28%), Positives = 54/126 (42%), Gaps = 18/126 (14%)
Query: 390 HGT-AVIGYNIYVIGGFDGNEYYNSCRCFNAVTKVWKEIAPMNF---KRCYVSTAVHHDV 445
+GT A+IG +YV G G +Y +K W++IA +F R A
Sbjct: 10 NGTGAIIGDKVYVGLGSAGTSWYK--LDLKKPSKGWQKIA--DFPGGPRNQAVAAAIDGK 65
Query: 446 IYAMGGYNGQ-------RRQNSVEKYNYTENQWS-LIAPMNVERSDASATTLQ-GKIYIT 496
+Y GG G+ + + V +Y+ +N W L V AS +L G+ Y T
Sbjct: 66 LYVFGG-IGKANSEGSPQVFDDVYRYDPKKNSWQKLDTRSPVGLLGASGFSLHNGQAYFT 124
Query: 497 GGFNGH 502
GG N +
Sbjct: 125 GGVNKN 130
Score = 42.7 bits (101), Expect = 5e-04
Identities = 26/97 (26%), Positives = 35/97 (36%), Gaps = 19/97 (19%)
Query: 465 NYTENQWSLIAPM-NVERSDASATTLQGKIYITGGFNGHN-------CLNSCEVYDPECN 516
W IA R+ A A + GK+Y+ GG G + YDP+ N
Sbjct: 37 KKPSKGWQKIADFPGGPRNQAVAAAIDGKLYVFGG-IGKANSEGSPQVFDDVYRYDPKKN 95
Query: 517 QWTLIEPMRHRRS-----GVSCIAYHEC-IYVIGGFN 547
W ++ RS G S + H Y GG N
Sbjct: 96 SWQKLD----TRSPVGLLGASGFSLHNGQAYFTGGVN 128
Score = 35.8 bits (83), Expect = 0.060
Identities = 32/153 (20%), Positives = 53/153 (34%), Gaps = 24/153 (15%)
Query: 375 WVKIEQVDPMGPRAYHGTAVIGYNIYVIGGFDGNE------YYNSCRCFNAVTKVWKEI- 427
W KI P GPR A I +YV GG ++ ++ W+++
Sbjct: 43 WQKIADF-PGGPRNQAVAAAIDGKLYVFGGIGKANSEGSPQVFDDVYRYDPKKNSWQKLD 101
Query: 428 --APMNFKRCYVSTAVHHDVIYAMGGYNGQRRQNSVEKYNYTENQWSLIAPMNVERSDAS 485
+P+ ++H+ Y GG N +N + Y + + S + E D
Sbjct: 102 TRSPVGLLG-ASGFSLHNGQAYFTGGVN----KNIFDGY-FAD--LSAADK-DSEPKDKL 152
Query: 486 ATTLQGKIYITGGFNGHNCLNSCEVYDPECNQW 518
+ +N + YDP NQW
Sbjct: 153 IAAYFSQPPEDYFWN-----KNVLSYDPSTNQW 180
>gnl|CDD|225597 COG3055, COG3055, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 381
Score = 40.4 bits (95), Expect = 0.003
Identities = 23/79 (29%), Positives = 30/79 (37%), Gaps = 15/79 (18%)
Query: 480 ERSDASATTLQGKIYITGGFNGHN-----CLNSCEVYDPECNQWTLIEPMRHRRS----- 529
R+ A A + GK+Y+ GG+ N YDP N W RS
Sbjct: 82 ARNQAVAAVIGGKLYVFGGYGKSVSSSPQVFNDAYRYDPSTNSWH----KLDTRSPTGLV 137
Query: 530 GVSCIAYHEC-IYVIGGFN 547
G S + + IY GG N
Sbjct: 138 GASTFSLNGTKIYFFGGVN 156
Score = 30.8 bits (70), Expect = 2.3
Identities = 28/108 (25%), Positives = 40/108 (37%), Gaps = 14/108 (12%)
Query: 355 GWSGGSPTDFVETYDTRADRWVKIEQVDPMGPRAYHGTAVIGYNIYVIGGFDGNE----- 409
G +G + +V W KI P G R AVIG +YV GG+ +
Sbjct: 54 GSAGTA--FYVLDLKKPGKGWTKIADF-PGGARNQAVAAVIGGKLYVFGGYGKSVSSSPQ 110
Query: 410 YYNSCRCFNAVTKVWKE---IAPMNFKRCYVSTAVHHDV-IYAMGGYN 453
+N ++ T W + +P ST + IY GG N
Sbjct: 111 VFNDAYRYDPSTNSWHKLDTRSPTGLVG--ASTFSLNGTKIYFFGGVN 156
>gnl|CDD|116261 pfam07646, Kelch_2, Kelch motif. The kelch motif was initially
discovered in Kelch. In this protein there are six
copies of the motif. It has been shown that Drosophila
ring canal kelch protein is related to Galactose Oxidase
for which a structure has been solved. The kelch motif
forms a beta sheet. Several of these sheets associate to
form a beta propeller structure as found in pfam00064,
pfam00400 and pfam00415.
Length = 48
Score = 35.0 bits (81), Expect = 0.007
Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 2/46 (4%)
Query: 481 RSDASATTLQGKIYITGGFNGHNCLNSCEV--YDPECNQWTLIEPM 524
R ++ + GK+Y+ GG G L+S ++ DPE N WT + +
Sbjct: 2 RYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTELPAL 47
Score = 33.5 bits (77), Expect = 0.021
Identities = 14/47 (29%), Positives = 22/47 (46%), Gaps = 2/47 (4%)
Query: 386 PRAYHGTAVIGYNIYVIGGFDGN--EYYNSCRCFNAVTKVWKEIAPM 430
PR H + V+G +YV+GG G + + T VW E+ +
Sbjct: 1 PRYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTELPAL 47
Score = 33.5 bits (77), Expect = 0.023
Identities = 10/48 (20%), Positives = 21/48 (43%), Gaps = 4/48 (8%)
Query: 527 RRSGVSCIAYHECIYVIGGFNGMSRMCNGEK---YNPVTKTWSQVPDM 571
R + + +YV+GG G+ + +P T W+++P +
Sbjct: 1 PRYPHASVVVGGKLYVVGGSTGLG-DLSSSDLWVLDPETNVWTELPAL 47
Score = 32.3 bits (74), Expect = 0.052
Identities = 11/47 (23%), Positives = 20/47 (42%), Gaps = 2/47 (4%)
Query: 574 PRSNFAIEVIDDMIFAIGGF--NGVTTIYHVECYDEKTDEWYEATDM 618
PR A V+ ++ +GG G + + D +T+ W E +
Sbjct: 1 PRYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTELPAL 47
Score = 28.1 bits (63), Expect = 1.6
Identities = 8/46 (17%), Positives = 18/46 (39%), Gaps = 2/46 (4%)
Query: 434 RCYVSTAVHHDVIYAMGGY--NGQRRQNSVEKYNYTENQWSLIAPM 477
R ++ V +Y +GG G + + + N W+ + +
Sbjct: 2 RYPHASVVVGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTELPAL 47
>gnl|CDD|222419 pfam13854, Kelch_5, Kelch motif. The kelch motif was initially
discovered in Kelch. In this protein there are six
copies of the motif. It has been shown that the
Drosophila ring canal kelch protein is related to
Galactose Oxidase for which a structure has been solved.
The kelch motif forms a beta sheet. Several of these
sheets associate to form a beta propeller structure as
found in pfam00064, pfam00400 and pfam00415.
Length = 42
Score = 33.7 bits (78), Expect = 0.015
Identities = 10/29 (34%), Positives = 15/29 (51%), Gaps = 2/29 (6%)
Query: 386 PRAYHGTAVIGYNIYVIGGFDG--NEYYN 412
PRA H V+G IY+ GG+ + +
Sbjct: 4 PRANHCAVVVGGEIYLYGGYTSGNGQSSD 32
Score = 28.7 bits (65), Expect = 0.74
Identities = 7/27 (25%), Positives = 11/27 (40%)
Query: 574 PRSNFAIEVIDDMIFAIGGFNGVTTIY 600
PR+N V+ I+ GG+
Sbjct: 4 PRANHCAVVVGGEIYLYGGYTSGNGQS 30
>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein. This model
describes nucleoid occlusion protein, a close homolog to
ParB chromosome partitioning proteins including Spo0J in
Bacillus subtilis. Its gene often is located near the
gene for the Spo0J ortholog. This protein bind a
specific DNA sequence and blocks cytokinesis from
happening until chromosome segregation is complete.
Length = 255
Score = 35.9 bits (84), Expect = 0.053
Identities = 17/48 (35%), Positives = 25/48 (52%), Gaps = 6/48 (12%)
Query: 242 LEEIISSDELNVKSEETVWELIIRWIDY--DPENRKKHIVSLMKNIRL 287
L EII LNVK E ELI + ++ P+ +KK K++R+
Sbjct: 174 LNEIIE-KGLNVKQTE---ELIKKLLEKPEKPKKKKKRRKGFSKDVRI 217
>gnl|CDD|205932 pfam13758, Prefoldin_3, Prefoldin subunit. This family includes
prefoldin subunits that are not detected by pfam02996.
Length = 97
Score = 32.3 bits (74), Expect = 0.17
Identities = 20/71 (28%), Positives = 39/71 (54%), Gaps = 9/71 (12%)
Query: 216 TRKYILRNFVDVARESEEIVFISEQELEEIISSDELNVKSEETVWELIIRWIDYDPENRK 275
TR+ +LR + R+ + ++E+E++EI+ + ++ + V +LI R IDY +N
Sbjct: 29 TREDLLR----IGRDFGGSL-VNEKEIKEILGEKQGITRTRQQVVDLISRRIDYVKQN-- 81
Query: 276 KHIVSLMKNIR 286
I +L K +
Sbjct: 82 --IATLEKRLE 90
>gnl|CDD|215908 pfam00413, Peptidase_M10, Matrixin. The members of this family are
enzymes that cleave peptides. These proteases require
zinc for catalysis.
Length = 159
Score = 31.8 bits (73), Expect = 0.54
Identities = 13/37 (35%), Positives = 17/37 (45%)
Query: 337 TPEIARPRVPHEILFAIGGWSGGSPTDFVETYDTRAD 373
TP+++R V I A WS +P F E AD
Sbjct: 15 TPDLSRDEVRRAIRRAFKVWSEVTPLTFTEVPTGTAD 51
>gnl|CDD|170033 PRK09644, PRK09644, RNA polymerase sigma factor SigM; Provisional.
Length = 165
Score = 31.4 bits (71), Expect = 0.97
Identities = 18/79 (22%), Positives = 35/79 (44%), Gaps = 2/79 (2%)
Query: 200 ILLFSRDHFCKKLEEWTRKYILRNFVDVARESEEIVFISEQELEEIISSDELNVKSEETV 259
I L D+ +K++ W K F+D R+ +++ F+ E+E I + +
Sbjct: 41 IYLEDYDN--QKVKPWLFKVAYHTFIDFVRKEKKVSFVGTDEIEAIQAESTEEYVVAKNS 98
Query: 260 WELIIRWIDYDPENRKKHI 278
+E +I+ I P + I
Sbjct: 99 YEKLIQIIHTLPVIEAQAI 117
>gnl|CDD|236567 PRK09559, PRK09559, putative global regulator; Reviewed.
Length = 327
Score = 31.2 bits (71), Expect = 1.8
Identities = 19/74 (25%), Positives = 33/74 (44%), Gaps = 2/74 (2%)
Query: 98 VLADNSHFPIHRNILCACSDYFRALFTTS--LHDQEKFSLVEIPGVDASVMTSLIEYAYL 155
V AD S +++L A D +++ + F+ +E V + +T L +YA
Sbjct: 45 VTADVSQLTEDQHLLAAHCDAKGKMWSNLRLFRRGDGFAWIERRSVRENQLTELKKYAVF 104
Query: 156 RKLHITEDNVASLL 169
K+ I D+ LL
Sbjct: 105 SKVTIAPDDERVLL 118
>gnl|CDD|222785 PHA00368, PHA00368, internal virion protein D.
Length = 1315
Score = 30.9 bits (70), Expect = 2.8
Identities = 24/111 (21%), Positives = 39/111 (35%), Gaps = 20/111 (18%)
Query: 571 MYNPRSNFAIEVIDDMIFAIGGFNGVTTIYHVECYDEKTDEWYEATDMNIYRSALSACVI 630
YN + + + D F+ GG + E Y + I R L A +
Sbjct: 564 TYNDLYDAMEDAMKDPEFSTGGAKMSR--------EAARQEIYRRVALAIERPELQANLT 615
Query: 631 MGLPNVYDYIHQHRDSLMEEKRQKLLAIEGRQRQNQSEEGNSSQQSVLPSS 681
V D + +H D KR+ + +N + GN+ S+ P S
Sbjct: 616 KAERKVMDILKEHFDL----KREMM--------ENPAIFGNTKAVSIFPGS 654
>gnl|CDD|181899 PRK09485, mmuM, homocysteine methyltransferase; Provisional.
Length = 304
Score = 30.6 bits (70), Expect = 3.1
Identities = 9/16 (56%), Positives = 10/16 (62%)
Query: 555 GEKYNPVTKTWSQVPD 570
GE Y+ VTKTW D
Sbjct: 250 GEVYDAVTKTWHGPAD 265
>gnl|CDD|223185 COG0107, HisF, Imidazoleglycerol-phosphate synthase [Amino acid
transport and metabolism].
Length = 256
Score = 30.2 bits (69), Expect = 3.3
Identities = 18/52 (34%), Positives = 22/52 (42%), Gaps = 14/52 (26%)
Query: 323 YDLELIAQIDGEVPTPEIARPRVPHEILFAIGGWSGGSPTDFVETY-DTRAD 373
YDLEL + V P IA GG G P FVE + + +AD
Sbjct: 186 YDLELTRAVREAVNIPVIAS-----------GG--AGKPEHFVEAFTEGKAD 224
>gnl|CDD|227780 COG5493, COG5493, Uncharacterized conserved protein containing a
coiled-coil domain [Function unknown].
Length = 231
Score = 29.8 bits (67), Expect = 4.3
Identities = 20/96 (20%), Positives = 44/96 (45%), Gaps = 10/96 (10%)
Query: 211 KLEEWTRKYILRNFVDVARESEEIVFISEQELEEIISSDELNVKSEETVWELIIRWIDYD 270
KL W + ++ ++ +E+E+ ++EL + ++E + +L+ R+ + +
Sbjct: 35 KLTPWQQLATKQDVEELRKETEQR----QKELADEKLEVRKQKATKEDL-KLLQRFQEEE 89
Query: 271 PENRKKHIVSLMKNI-----RLGLLETEFFREKVKH 301
K+ I L I R G+L + FR+ V+
Sbjct: 90 FRATKEDIKRLETIITGLGARWGILSEDAFRQGVRE 125
>gnl|CDD|220221 pfam09402, MSC, Man1-Src1p-C-terminal domain. MAN1 is an integral
protein of the inner nuclear membrane which binds to
chromatin associated proteins and plays a role in
nuclear organisation. The C terminal nucleoplasmic
region forms a DNA binding winged helix and binds to
Smad. This C-terminal tail is also found in S.
cerevisiae and is thought to consist of three conserved
helices followed by two downstream strands.
Length = 326
Score = 29.7 bits (67), Expect = 5.2
Identities = 21/91 (23%), Positives = 37/91 (40%), Gaps = 10/91 (10%)
Query: 210 KKLEEWTRKYILRNFVDVARESEEIVFISEQELEEIIS---SDELNVKSEETVWELIIRW 266
++L + K+ E + ISE EL E++S S +N + E +W+ I+
Sbjct: 104 EELRKRNAKHE-------CGEDDLSPGISENELYELLSEKKSPWINEEEFEDLWKAAIKE 156
Query: 267 IDYDPENRKKHIVSLMKNIRLGLLETEFFRE 297
+ +PE + L KN + R
Sbjct: 157 LKENPEVVIRPEPILSKNGDGSTETQKTLRS 187
>gnl|CDD|240082 cd04731, HisF, The cyclase subunit of imidazoleglycerol phosphate
synthase (HisF). Imidazole glycerol phosphate synthase
(IGPS) catalyzes the fifth step of histidine
biosynthesis, the formation of the imidazole ring. IGPS
converts
N1-(5'-phosphoribulosyl)-formimino-5-aminoimidazole-4-
carboxamide ribonucleotide (PRFAR) to imidazole glycerol
phosphate (ImGP) and 5'-(5-aminoimidazole-4-carboxamide)
ribonucleotide (AICAR). This conversion involves two
tightly coupled reactions in distinct active sites of
IGPS. The two catalytic domains can be fused, like in
fungi and plants, or peformed by a heterodimer
(HisH-glutaminase and HisF-cyclase), like in bacteria.
Length = 243
Score = 29.4 bits (67), Expect = 5.4
Identities = 17/46 (36%), Positives = 18/46 (39%), Gaps = 17/46 (36%)
Query: 323 YDLELIAQIDGEVPTPEIARPRVPHEILFAIGGWSGG--SPTDFVE 366
YDLELI + V P IA SGG P FVE
Sbjct: 180 YDLELIRAVSSAVNIPVIA---------------SGGAGKPEHFVE 210
>gnl|CDD|129873 TIGR00791, gntP, gluconate transporter. This family includes known
gluconate transporters of E. coli and Bacillus species
as well as an idonate transporter from E. coli
[Transport and binding proteins, Carbohydrates, organic
alcohols, and acids].
Length = 440
Score = 29.2 bits (66), Expect = 7.5
Identities = 10/36 (27%), Positives = 15/36 (41%)
Query: 115 CSDYFRALFTTSLHDQEKFSLVEIPGVDASVMTSLI 150
SL Q+ FS E+P S+ T+L+
Sbjct: 197 LGRIVPPSPEASLGSQKTFSEEEMPSFGISLFTALL 232
>gnl|CDD|236761 PRK10795, PRK10795, penicillin-binding protein 2; Provisional.
Length = 634
Score = 29.4 bits (66), Expect = 7.9
Identities = 13/50 (26%), Positives = 26/50 (52%), Gaps = 4/50 (8%)
Query: 42 EDMASRLDLTEDDMTHIENAIKNKKNFSSIDRAMSVQAMQ----ALNEFR 87
+ + S +DLT+DD+ + F+SI ++ +Q A+N++R
Sbjct: 104 DALRSVVDLTDDDIAAFRKERARSRRFTSIPVKTNLTEVQVARFAVNQYR 153
>gnl|CDD|185600 PTZ00410, PTZ00410, NAD-dependent SIR2; Provisional.
Length = 349
Score = 29.1 bits (65), Expect = 8.0
Identities = 11/23 (47%), Positives = 13/23 (56%)
Query: 668 EEGNSSQQSVLPSSNSNVSSSDG 690
E +SS S S +S SSSDG
Sbjct: 265 EGRSSSSSSRSSSDSSTSSSSDG 287
>gnl|CDD|217114 pfam02574, S-methyl_trans, Homocysteine S-methyltransferase. This
is a family of related homocysteine S-methyltransferases
enzymes: 5-methyltetrahydrofolate--homocysteine
S-methyltransferases also known EC:2.1.1.13;
Betaine--homocysteine S-methyltransferase (vitamin B12
dependent), EC:2.1.1.5; and Homocysteine
S-methyltransferase, EC:2.1.1.10.
Length = 303
Score = 28.9 bits (65), Expect = 8.8
Identities = 15/48 (31%), Positives = 19/48 (39%), Gaps = 1/48 (2%)
Query: 555 GEKYNPVTKTWSQVPDMYNPRSNFAIEVIDDMIFAIGGFNGVTTIYHV 602
GE Y+ K + PD P E + IGG G TT H+
Sbjct: 248 GEPYDAGKKEYDGTPDELAPSYWSLKEFVASGARIIGGCCG-TTPDHI 294
>gnl|CDD|153336 cd07652, F-BAR_Rgd1, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Saccharomyces cerevisiae
Rho GTPase activating protein Rgd1 and similar proteins.
F-BAR domains are dimerization modules that bind and
bend membranes and are found in proteins involved in
membrane dynamics and actin reorganization.
Saccharomyces cerevisiae Rgd1 is a GTPase activating
protein (GAP) with activity towards Rho3p and Rho4p,
which are involved in bud growth and cytokinesis,
respectively. At low pH, S. cerevisiae Rgd1 is required
for cell survival and the activation of the protein
kinase C pathway, which is important in cell integrity
and the maintenance of cell shape. It contains an
N-terminal F-BAR domain and a C-terminal Rho GAP domain.
The F-BAR domain of S. cerevisiae Rgd1 binds to
phosphoinositides and plays an important role in the
localization of the protein to the bud tip/neck during
the cell cycle. F-BAR domains form banana-shaped dimers
with a positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 234
Score = 28.5 bits (64), Expect = 9.9
Identities = 11/42 (26%), Positives = 17/42 (40%), Gaps = 2/42 (4%)
Query: 52 EDDMTHIENAIKNKKNF-SSIDRAMSVQAMQALNEFREKNLL 92
E H A+K+ + ID A+ +Q Q E +L
Sbjct: 194 ELLSRHRPEAVKDLFDLILEIDAALRLQ-YQKYALPNELLVL 234
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.135 0.417
Gapped
Lambda K H
0.267 0.0714 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 36,708,738
Number of extensions: 3605118
Number of successful extensions: 3281
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3203
Number of HSP's successfully gapped: 93
Length of query: 712
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 608
Effective length of database: 6,324,786
Effective search space: 3845469888
Effective search space used: 3845469888
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.1 bits)