RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= 008170
(575 letters)
>gnl|CDD|166363 PLN02722, PLN02722, indole-3-acetamide amidohydrolase.
Length = 422
Score = 473 bits (1220), Expect = e-164
Identities = 213/421 (50%), Positives = 277/421 (65%), Gaps = 6/421 (1%)
Query: 48 DFGAFLERFELIPFPQPPPPAARQPLAGLKFAIKDVFDVKGYVTGFGSPDWKRDHHEAER 107
D+GAF+E+F L P + PL GL FA+KD+FDV+GYVTGFG+PDW R H A
Sbjct: 6 DYGAFMEKFVL----SPTSSSHDLPLHGLTFAVKDIFDVEGYVTGFGNPDWARTHSAATS 61
Query: 108 TAVVVTLLLKNGATCIGKTVLDEFAFGITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVA 167
TA V +L+ GATC+GKT++DE A+ I GEN HYGTP NP P +PGGSSSGSAVAV
Sbjct: 62 TAPAVLAVLRGGATCVGKTIMDEMAYSINGENAHYGTPTNPIAPDRVPGGSSSGSAVAVG 121
Query: 168 AQLVDFALGTDTIGCVRVPASFCGILGYRPSHGTVSMIGVLPNSQSLDTVGLLARNASIL 227
A+LVDF+LGTDT G VRVPAS+CGI G+RPSHG VS +GV+P +QS DTVG AR+ IL
Sbjct: 122 AKLVDFSLGTDTGGSVRVPASYCGIFGFRPSHGAVSTVGVIPMAQSFDTVGWFARDPVIL 181
Query: 228 HRVGHVLLQLNAVEPRRARRLIFADDIFQLSKVPKLKTIHVISKAIESLSGYQN-PEHMN 286
RVGHVLLQ V P + ++I A+D FQLS +P + + V+ K++E L G + +H+N
Sbjct: 182 KRVGHVLLQQPDVNPIKPSQIIIAEDCFQLSSIPHDRLVQVLVKSVEKLFGGGDIVKHVN 241
Query: 287 VGQYIASNVPSLKG-LRAQSTSPENGTATLKALSSVMLSLQRYEFKTIYEEWVKSAKPRL 345
+G Y+ VPSLK + + E +L ALSS M LQRYEFK + EW+ + KP
Sbjct: 242 LGDYVEDKVPSLKHFMSKEIKEQEYNIPSLAALSSAMRLLQRYEFKINHGEWITAVKPEF 301
Query: 346 GYNVFERVLEAINTTQDNIKILYKVRNEMRAALQRLLKDDKILVIPTVSDPPLKLNTKKT 405
G + ER+ EA+ TT++ I V+ E+RAAL LL + +LVIPTV PP KL T
Sbjct: 302 GPGISERIWEAVRTTEEKIDACQSVKTELRAALTTLLGEFGVLVIPTVPGPPPKLQADPT 361
Query: 406 YSAEFHDRTLVLSSIGSMSGCCQVAIPIGKYNEHPISISFITYHGGDKFLLDTVLDLYDS 465
F R L SI +SG CQV+IP+G ++ P+S+S + HG D FLL+ V LY +
Sbjct: 362 TLESFRARAFSLLSIAGVSGFCQVSIPLGLHDNLPVSVSLLAKHGSDGFLLNLVESLYGT 421
Query: 466 L 466
+
Sbjct: 422 I 422
>gnl|CDD|181375 PRK08310, PRK08310, amidase; Provisional.
Length = 395
Score = 351 bits (902), Expect = e-116
Identities = 156/410 (38%), Positives = 204/410 (49%), Gaps = 32/410 (7%)
Query: 49 FGAFLERFELIPFPQPPPPAARQ-PLAGLKFAIKDVFDVKGYVTGFGSPDWKRDHHEAER 107
F AF I P P P A PLAGL+FA+KDVFDV GYVTG G+PDW + A R
Sbjct: 7 FNAF------IAKPDKPLPHAASGPLAGLRFAVKDVFDVAGYVTGCGNPDWLAESPVATR 60
Query: 108 TAVVVTLLLKNGATCIGKTVLDEFAFGITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVA 167
TA V LL GA +GKT DE AF + G+N HYGTPVNP P +PGGSSSGSA AVA
Sbjct: 61 TAPAVEKLLAAGARFVGKTQTDELAFSLNGQNAHYGTPVNPAAPDRVPGGSSSGSAAAVA 120
Query: 168 AQLVDFALGTDTIGCVRVPASFCGILGYRPSHGTVSMIGVLPNSQSLDTVGLLARNASIL 227
L DFALGTDT G VR PASFCG+ G RP+HG +S+ GV+P + S DTVG AR+ ++L
Sbjct: 121 GGLADFALGTDTGGSVRAPASFCGLYGLRPTHGRISLEGVMPLAPSFDTVGWFARDIALL 180
Query: 228 HRVGHVLLQLNAVEPRRARRLIFADDIFQLSKVPKLKTIHVISKAIESLSGYQNPEHMNV 287
RVG VLL +A E +RL+ D+F L +P V
Sbjct: 181 ERVGEVLLGDDAQEFPLTQRLLIPVDLFALL----------------------DPA---V 215
Query: 288 GQYIASNVPSLKGLRAQSTSPENGTATLKALSSVMLSLQRYEFKTIYEEWVKSAKPRLGY 347
+ + + L+ + +L LQ E + W+ S P+LG
Sbjct: 216 RAALEAALARLRPHLGPAKPASVPPLSLDEWYEAFRVLQAAEAWETHGAWISSGNPQLGP 275
Query: 348 NVFERVLEAINTTQDNIKILYKVRNEMRAALQRLLKDDKILVIPTVSDPPLKLNTKKTYS 407
V +R T D ++ R L LL D +L++PTV
Sbjct: 276 GVADRFAAGAEVTADQVEAARARRAAFARELAALLGPDAVLLLPTVPGAAPLRGAPFEAL 335
Query: 408 AEFHDRTLVLSSIGSMSGCCQVAIPIGKYNEHPISISFITYHGGDKFLLD 457
+ +R L L I ++G Q+++P+ + P +S I G D+ LL
Sbjct: 336 EAYRERALRLLCIAGLAGLPQISLPLASVDGAPFGLSLIGPRGSDRSLLA 385
>gnl|CDD|223232 COG0154, GatA, Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit
and related amidases [Translation, ribosomal structure
and biogenesis].
Length = 475
Score = 235 bits (602), Expect = 5e-71
Identities = 120/448 (26%), Positives = 185/448 (41%), Gaps = 37/448 (8%)
Query: 30 VLAETLRR--RRNPKLNGRVDFGAFLERFELIPFPQPPPPAA---------RQPLAGLKF 78
++ L R NP LN AF+ AA PLAG+
Sbjct: 26 LVEAYLARIEALNPDLN------AFVAVDPEAALALAEAAAADARLAAGEPLGPLAGVPI 79
Query: 79 AIKDVFDVKGYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTVLDEFAFGITGE 138
A+KD D G T GS + + A VV L GA +GKT +DEFA G + E
Sbjct: 80 AVKDNIDTAGLPTTAGSKALE--DYVPPYDATVVERLRAAGAVILGKTNMDEFAMGSSTE 137
Query: 139 NKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVRVPASFCGILGYRPS 198
N +G NP +PGGSS GSA AVAA LV ALG+DT G +R+PA+FCG++G +P+
Sbjct: 138 NSAFGPTRNPWNLERVPGGSSGGSAAAVAAGLVPLALGSDTGGSIRIPAAFCGLVGLKPT 197
Query: 199 HGTVSMIGVLPNSQSLDTVGLLARNASILHRVGHVLLQLNAVEPRRARRLIFADDIFQLS 258
+G VS GV+P + SLD +G LAR + V+ + + +
Sbjct: 198 YGRVSRYGVVPLASSLDQIGPLARTVRDAALLLDVIAGPDPRDSPLPPPPPVPPAL--AG 255
Query: 259 KVPKLKTIHVISKAIESLSGYQNPEHMNVGQYIASNVPSLKGLRAQ----STSPENGTAT 314
K K I V + G P +V + V +L+ A+ S +
Sbjct: 256 KDLKGLRIGVPKE-----LGGGGPLDPDVRAAFEAAVKALEAAGAEVVEVSLPLLSDDYA 310
Query: 315 LKALSSVMLSLQRYEFKTIYEEWVKSAKPRLGYNVFERVLE-AINTTQDNIKILYK--VR 371
L A +RY + + + K+ G V R++ + Y+ +
Sbjct: 311 LAAYYLARFDGERYGLRAA-DLYGKTRAEGFGPEVKRRIMLGTYLLSAGYYDAYYRRAQK 369
Query: 372 NEMRAALQRLLKDDKILVIPTVSDPPLKLNTKKTY--SAEFHDRTLVLSSIGSMSGCCQV 429
+R A +L ++ +L+ PT P K+ ++ V + +++G +
Sbjct: 370 TLIRRAFDKLFEEVDVLLTPTTPTPAPKIGESESDGDDPLEMYLLDVFTVPANLAGLPAI 429
Query: 430 AIPIGKYNEH-PISISFITYHGGDKFLL 456
++P G + P+ + I D LL
Sbjct: 430 SVPAGFTADGLPVGLQLIGPAFDDATLL 457
>gnl|CDD|216494 pfam01425, Amidase, Amidase.
Length = 431
Score = 203 bits (519), Expect = 2e-59
Identities = 119/406 (29%), Positives = 177/406 (43%), Gaps = 41/406 (10%)
Query: 72 PLAGLKFAIKDVFDVKGYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTVLDEF 131
PL G+ ++KD DVKG T GS + + A VV L GA +GKT +DEF
Sbjct: 46 PLHGVPISLKDNIDVKGVPTTAGSKALE--GYPPPYDATVVERLRAAGAVILGKTNMDEF 103
Query: 132 AFGITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVRVPASFCG 191
A G T EN +G NP S PGGSS GSA AVAA LV A+GTDT G +R+PA+FCG
Sbjct: 104 AMGSTTENSAFGPTRNPWDLSRTPGGSSGGSAAAVAAGLVPLAIGTDTGGSIRIPAAFCG 163
Query: 192 ILGYRPSHGTVSMIGVLPNSQSLDTVGLLARNASILHRVGHVLLQLNAVEPRRARRLIFA 251
++G +P++G VS GV+P S SLD VG LAR + V+ + +P A +
Sbjct: 164 LVGLKPTYGRVSRYGVVPYSSSLDHVGPLARTVEDAALLLDVIAGYDPADPTSAPSPVP- 222
Query: 252 DDIFQLSKVPKLKTIHV----------------ISKAIESLSGYQ-NPEHMNVGQYIASN 294
D L K K I V + KA +L + +
Sbjct: 223 DFAEPLKKSLKGLRIGVPREDFYFSLDPEVQRAVRKAAAALEALGHEVVEVEPP-SLKHA 281
Query: 295 VPSLKGLRAQSTSPENGTATLKALSSVMLSLQRYEFKTIYEEWVKSAKPRLGYNVFERVL 354
+P +P ++ L L + ++ + + +E K R+ + L
Sbjct: 282 LPLYYI-----IAPAEASSNLSDLDELYPRIRD---ELLGDEV----KRRIE--LGAYAL 327
Query: 355 EAINTTQDNIKILYKVRNEMRAALQRLLKDDKILVIPTVSDPPLKLNTKKTYSAEFHDRT 414
A + +K KVR +R L ++ +L+ PT P +L S
Sbjct: 328 SAGYSGAYYLK-AQKVRRLLRREFAGLFEELDVLLSPTTPTPAPRLGE-AADSPTVLYNL 385
Query: 415 LVL---SSIGSMSGCCQVAIPIG-KYNEHPISISFITYHGGDKFLL 456
L + +++G +++P G + P+ + I G D+ LL
Sbjct: 386 LDFTANTVPANLAGLPAISLPAGFSEDGLPVGLQIIGPPGDDETLL 431
>gnl|CDD|232842 TIGR00132, gatA, aspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase,
A subunit. In many species, Gln--tRNA ligase is
missing. tRNA(Gln) is misacylated with Glu after which a
heterotrimeric amidotransferase converts Glu to Gln.
This model represents the amidase chain of that
heterotrimer, encoded by the gatA gene. In the Archaea,
Asn--tRNA ligase is also missing. This amidase subunit
may also function in the conversion of Asp-tRNA(Asn) to
Asn-tRNA(Asn), presumably with a different recognition
unit to replace gatB. Both Methanococcus jannaschii and
Methanobacterium thermoautotrophicum have both authentic
gatB and a gatB-related gene, but only one gene like
gatA. It has been shown that gatA can be expressed only
when gatC is also expressed. In most species expressing
the amidotransferase, the gatC ortholog is about 90
residues in length, but in Mycoplasma genitalium and
Mycoplasma pneumoniae the gatC equivalent is as the
C-terminal domain of a much longer protein. Not
surprisingly, the Mycoplasmas also represent the most
atypical lineage of gatA orthology. This orthology group
is more narrowly defined here than in Proc Natl Acad Aci
USA 94, 11819-11826 (1997). In particular, a Rhodococcus
homolog found in association with nitrile hydratase
genes and described as an enantiomer-selective amidase
active on several 2-aryl propionamides, is excluded
here. It is likely, however, that the amidase subunit
GatA is not exclusively a part of the Glu-tRNA(Gln)
amidotransferase heterotrimer and restricted to that
function in all species [Protein synthesis, tRNA
aminoacylation].
Length = 460
Score = 181 bits (461), Expect = 5e-51
Identities = 116/414 (28%), Positives = 180/414 (43%), Gaps = 44/414 (10%)
Query: 72 PLAGLKFAIKDVFDVKGYVTGFGS---PDWKRDHHEAERTAVVVTLLLKNGATCIGKTVL 128
PLAG+ A+KD KG VT S ++ + A V+ L + GA IGKT +
Sbjct: 55 PLAGIPIAVKDNISTKGIVTTCASKILENYIPPYD-----ATVIERLKQAGALIIGKTNM 109
Query: 129 DEFAFGITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVRVPAS 188
DEFA G + E +G NP +PGGSS GSA AVAA L F+LG+DT G +R PAS
Sbjct: 110 DEFAMGSSTETSAFGPTKNPWNLDRVPGGSSGGSAAAVAADLAPFSLGSDTGGSIRQPAS 169
Query: 189 FCGILGYRPSHGTVSMIGVLPNSQSLDTVGLLARNASILHRVGHVLLQLNAVEPRRARRL 248
FCG++G++P++G VS G++ + SLD +G AR + + V+ + + A+
Sbjct: 170 FCGVVGFKPTYGRVSRYGLVAYASSLDQIGPFARTVEDIALLLDVISGHDKRDSTSAKV- 228
Query: 249 IFADDIFQLSKVPKLKTIHVISKAIESLSGYQNPEHMNVGQYIASNVPSLKGLRAQST-- 306
D F LK + V ++ S + E + + L+ L A+
Sbjct: 229 --PDPEFFEELKKDLKGLKV--GVVKEFSEEMDKEVQEK---FENALEVLEELGAEIVEV 281
Query: 307 ---SPENGTATLKALSSVMLS--LQRY----------EFKTIYEEWVKSAKPRLGYNVFE 351
+ +S S L RY E ++ E + K+ G V
Sbjct: 282 SFPHVKYALPIYYIISPSEASSNLARYDGIRYGYRIEEPNSLKELYAKTRAEGFGEEVKR 341
Query: 352 R-VLEAINTTQDNIKILY----KVRNEMRAALQRLLKDDKILVIPTVSDPPLKLNTKKTY 406
R +L + Y KVR + +L ++ ++V PT P K+ K
Sbjct: 342 RIMLGNYALSAGYYDKYYLKAQKVRTLIIDDFLKLFEEVDVIVSPTAPTLPFKIGEKLDD 401
Query: 407 SAEFH--DRTLVLSSIGSMSGCCQVAIPIGKYNEH-PISISFITYHGGDKFLLD 457
E + D +L+ +++G +++P G + PI + I DK LL
Sbjct: 402 PLEMYLSD---ILTVPANLAGLPAISVPCGVKEKGLPIGLQIIGKCFDDKTLLQ 452
>gnl|CDD|234572 PRK00012, gatA, aspartyl/glutamyl-tRNA amidotransferase subunit A;
Reviewed.
Length = 459
Score = 173 bits (442), Expect = 4e-48
Identities = 73/175 (41%), Positives = 94/175 (53%), Gaps = 25/175 (14%)
Query: 72 PLAGLKFAIKDVFDVKGYVT--------GFGSP-DWKRDHHEAERTAVVVTLLLKNGATC 122
PLAG+ AIKD KG T + P D A VV L GA
Sbjct: 55 PLAGIPIAIKDNICTKGIRTTCASKILENYVPPYD-----------ATVVEKLKAAGAVI 103
Query: 123 IGKTVLDEFAFGITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGC 182
+GKT +DEFA G + EN +G NP +PGGSS GSA AVAA L ALG+DT G
Sbjct: 104 LGKTNMDEFAMGSSTENSAFGPTKNPWDLERVPGGSSGGSAAAVAAGLAPAALGSDTGGS 163
Query: 183 VRVPASFCGILGYRPSHGTVSMIGVLPNSQSLDTVGLLARN---ASILHRV--GH 232
+R PA+FCG++G +P++G VS G++ + SLD +G AR A++L GH
Sbjct: 164 IRQPAAFCGVVGLKPTYGRVSRYGLIAFASSLDQIGPFARTVEDAALLLNAIAGH 218
Score = 29.7 bits (68), Expect = 4.9
Identities = 15/96 (15%), Positives = 41/96 (42%), Gaps = 12/96 (12%)
Query: 369 KVRNEMRAALQRLLKDDKILVIPTVSDPPLKLNTKKT-----YSAEFHDRTLVLSSIGSM 423
KVR ++ ++ + +++ PT K+ K Y ++ + + ++
Sbjct: 365 KVRTLIKQDFEKAFEKVDVILGPTAPTTAFKIGEKTDDPLAMYLSD------IFTVPANL 418
Query: 424 SGCCQVAIPIGKYNEH-PISISFITYHGGDKFLLDT 458
+G +++P G + P+ + I + ++ LL+
Sbjct: 419 AGLPAISVPAGFDDGGLPVGLQLIGKYFDEETLLNV 454
>gnl|CDD|236409 PRK09201, PRK09201, amidase; Provisional.
Length = 465
Score = 173 bits (441), Expect = 6e-48
Identities = 81/172 (47%), Positives = 100/172 (58%), Gaps = 6/172 (3%)
Query: 63 QPPPPAARQPLAGLKFAIKDVFDVKGYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATC 122
+P P LAG+ FA+K++FDV G T GS RD A R A V L GA
Sbjct: 67 EPLGP-----LAGVPFAVKNLFDVAGLTTLAGS-KINRDRPPATRDATAVRRLEAAGAVL 120
Query: 123 IGKTVLDEFAFGITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGC 182
+G +DE+A+G T EN HYG NP + I GGSS GSA AVAA LV F LG+DT G
Sbjct: 121 VGALNMDEYAYGFTTENSHYGATRNPHDLTRIAGGSSGGSAAAVAAGLVPFTLGSDTNGS 180
Query: 183 VRVPASFCGILGYRPSHGTVSMIGVLPNSQSLDTVGLLARNASILHRVGHVL 234
+RVPAS CGI G +P++G +S G P SLD +G AR+ + L V VL
Sbjct: 181 IRVPASLCGIFGLKPTYGRLSRAGSFPFVASLDHIGPFARSVADLALVYDVL 232
>gnl|CDD|235921 PRK07056, PRK07056, amidase; Provisional.
Length = 454
Score = 170 bits (433), Expect = 5e-47
Identities = 76/170 (44%), Positives = 102/170 (60%), Gaps = 8/170 (4%)
Query: 68 AARQPLAGLKFAIKDVFDVKGYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTV 127
AA PLAG+ ++KD+FDV G VT GS D A A V L + GA IG+T
Sbjct: 67 AAPSPLAGIPVSVKDLFDVAGQVTRAGSRVLA-DAPPAAADAPAVARLRRAGAVLIGRTN 125
Query: 128 LDEFAFGITGENKHYGTPVNPQM----PSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCV 183
+ EFAF G N HYGTP NP IPGGSSSG+AV+VA + ALGTDT G +
Sbjct: 126 MTEFAFSGLGLNPHYGTPRNPWRRDVGDGRIPGGSSSGAAVSVADGMAAAALGTDTGGSI 185
Query: 184 RVPASFCGILGYRPSHGTVSMIGVLPNSQSLDTVGLLARN---ASILHRV 230
R+PA+ CG+ G++P+ V + G +P S +LD++G LAR+ +++ V
Sbjct: 186 RIPAALCGLTGFKPTARRVPLQGAVPLSTTLDSIGPLARSVACCALVDAV 235
>gnl|CDD|235698 PRK06102, PRK06102, hypothetical protein; Provisional.
Length = 452
Score = 168 bits (427), Expect = 4e-46
Identities = 81/153 (52%), Positives = 99/153 (64%), Gaps = 3/153 (1%)
Query: 72 PLAGLKFAIKDVFDVKGYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTVLDEF 131
L G+ A KD+FDV G VT GS + A R A VV LL + G IG+T + EF
Sbjct: 69 LLDGIPIAWKDLFDVAGSVTTAGSV-VLANAAPASRDAAVVALLARAGMVSIGRTNMSEF 127
Query: 132 AFGITGENKHYGTPVNPQMPSH--IPGGSSSGSAVAVAAQLVDFALGTDTIGCVRVPASF 189
AF G N HYGTPVNP+ IPGGSSSGSAVAVAA LV A+GTDT G VR+PA+F
Sbjct: 128 AFSGLGLNPHYGTPVNPRSTDVPRIPGGSSSGSAVAVAAGLVPVAMGTDTGGSVRIPAAF 187
Query: 190 CGILGYRPSHGTVSMIGVLPNSQSLDTVGLLAR 222
G++GY+ + G SM GV P ++SLD++G L R
Sbjct: 188 NGLVGYKATRGRYSMDGVFPLAKSLDSLGPLCR 220
Score = 38.5 bits (90), Expect = 0.009
Identities = 24/94 (25%), Positives = 38/94 (40%), Gaps = 4/94 (4%)
Query: 344 RLGYNVFERVLEAINTTQDNIKILYKVRNEMRAALQRLLKDDKILVIPTVSD--PPL-KL 400
R+ V +R T + L + R + A + R L +L PTV+ PPL L
Sbjct: 324 RMDPRVVKRTRLGRKITASDYIALLEARERLIAQVTREL-GGALLATPTVAHVAPPLAPL 382
Query: 401 NTKKTYSAEFHDRTLVLSSIGSMSGCCQVAIPIG 434
+ +TL + G+ C V++P G
Sbjct: 383 EADDDLFFATNLKTLRNTMPGNFLDMCGVSLPCG 416
>gnl|CDD|233982 TIGR02715, amido_AtzE, amidohydrolase, AtzE family. Members of
this protein family are aminohydrolases related to, but
distinct from, glutamyl-tRNA(Gln) amidotransferase
subunit A. The best characterized member is the biuret
hydrolase of Pseudomonas sp. ADP, which hydrolyzes
ammonia from the three-nitrogen compound biuret to yield
allophanate. Allophanate is also an intermediate in urea
degradation by the urea carboxylase/allophanate
hydrolase pathway, an alternative to urease [Unknown
function, Enzymes of unknown specificity].
Length = 452
Score = 167 bits (424), Expect = 8e-46
Identities = 79/163 (48%), Positives = 100/163 (61%), Gaps = 1/163 (0%)
Query: 72 PLAGLKFAIKDVFDVKGYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTVLDEF 131
PLAG+ FA+K++FDV G T G+ RD A+R A +V L GA +G +DEF
Sbjct: 64 PLAGVPFAVKNLFDVAGLTTLAGA-KINRDLAPAKRDATLVQRLSAAGAVLVGALNMDEF 122
Query: 132 AFGITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVRVPASFCG 191
A+G T EN HYG NP + I GGSS GSA AVAA LV F+LG+DT G +RVPAS CG
Sbjct: 123 AYGFTTENAHYGPTRNPHDLTRIAGGSSGGSAAAVAAGLVPFSLGSDTNGSIRVPASLCG 182
Query: 192 ILGYRPSHGTVSMIGVLPNSQSLDTVGLLARNASILHRVGHVL 234
+ G +P++G +S GV P SLD VG AR+ L V+
Sbjct: 183 VFGLKPTYGRLSRQGVFPFVASLDHVGPFARSVEDLALAYDVM 225
>gnl|CDD|233981 TIGR02713, allophanate_hyd, allophanate hydrolase. Allophanate
hydrolase catalyzes the second reaction in an
ATP-dependent two-step degradation of urea to ammonia
and C02, following the action of the biotin-containing
urea carboxylase. The yeast enzyme, a fusion of
allophanate hydrolase to urea carboxylase, is designated
urea amidolyase [Central intermediary metabolism,
Nitrogen metabolism].
Length = 561
Score = 166 bits (422), Expect = 1e-44
Identities = 78/177 (44%), Positives = 99/177 (55%), Gaps = 3/177 (1%)
Query: 66 PPAARQPLAGLKFAIKDVFDVKGYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGK 125
R PL G+ FA+KD DV G T P + + E A VV LL GA +GK
Sbjct: 25 ARPERLPLYGVPFAVKDNIDVAGLPTTAACPAFA---YTPEEDATVVALLRAAGAIVVGK 81
Query: 126 TVLDEFAFGITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVRV 185
T LD+FA G+ G YG N P++I GGSSSGSAVAVA LV FALGTDT G RV
Sbjct: 82 TNLDQFATGLVGTRSPYGAVRNAFDPAYISGGSSSGSAVAVARGLVPFALGTDTAGSGRV 141
Query: 186 PASFCGILGYRPSHGTVSMIGVLPNSQSLDTVGLLARNASILHRVGHVLLQLNAVEP 242
PA+ I+G +P+ G VS GV+P +SLD V + A + +V + +A +P
Sbjct: 142 PAALNNIVGLKPTKGLVSTTGVVPACRSLDCVSIFALTVADAEQVLRIAAAPDARDP 198
>gnl|CDD|236177 PRK08186, PRK08186, allophanate hydrolase; Provisional.
Length = 600
Score = 166 bits (422), Expect = 2e-44
Identities = 78/181 (43%), Positives = 98/181 (54%), Gaps = 4/181 (2%)
Query: 68 AARQPLAGLKFAIKDVFDVKGYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTV 127
A PL G+ FA+KD DV G T P + ER A VV L GA IGKT
Sbjct: 64 PAALPLYGVPFAVKDNIDVAGLPTTAACPAFAYT---PERDATVVARLRAAGAIVIGKTN 120
Query: 128 LDEFAFGITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVRVPA 187
LD+FA G+ G YG N P ++ GGSSSGSAVAVA LV FALGTDT G RVPA
Sbjct: 121 LDQFATGLVGTRSPYGAVRNAFDPEYVSGGSSSGSAVAVALGLVSFALGTDTAGSGRVPA 180
Query: 188 SFCGILGYRPSHGTVSMIGVLPNSQSLDTVGLLARNASILHRVGHVLLQLNAVEPRRARR 247
+F I+G +P+ G +S GV+P ++LD V + A V V+ + +P +R
Sbjct: 181 AFNNIVGLKPTLGLLSTRGVVPACRTLDCVSVFALTVDDADAVLAVMAGFDPADP-YSRA 239
Query: 248 L 248
Sbjct: 240 N 240
>gnl|CDD|236030 PRK07488, PRK07488, indole acetimide hydrolase; Validated.
Length = 472
Score = 156 bits (397), Expect = 8e-42
Identities = 77/194 (39%), Positives = 94/194 (48%), Gaps = 4/194 (2%)
Query: 31 LAETL--RRRRNPKLNGRVDFGAFLERFELIPFPQPPPPAARQPLAGLKFAIKDVFDVKG 88
L E L R LN A A LAG+ IKD + G
Sbjct: 30 LVEALLARAAALAPLNAFTTVDAEGALAAARRIDAQRAAGAALLLAGVPIVIKDNINTAG 89
Query: 89 YVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTVLDEFAFGITGENKHYGTPVNP 148
T G+P A VV LL GA +GK + E AFGIT N +G NP
Sbjct: 90 MPTTAGTPALLG--FVPATDAPVVQRLLDAGAVPLGKANMHELAFGITSNNGAFGAVRNP 147
Query: 149 QMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVRVPASFCGILGYRPSHGTVSMIGVL 208
P+ I GGSS G+A AVAA+L LGTDT G VR+PA+ CG++G RP+ G S GV+
Sbjct: 148 YDPARIAGGSSGGTAAAVAARLAPAGLGTDTGGSVRIPAALCGVVGLRPTVGRYSGDGVV 207
Query: 209 PNSQSLDTVGLLAR 222
P S + DTVG +AR
Sbjct: 208 PISHTRDTVGPIAR 221
>gnl|CDD|168315 PRK05962, PRK05962, amidase; Validated.
Length = 424
Score = 142 bits (360), Expect = 4e-37
Identities = 71/151 (47%), Positives = 94/151 (62%), Gaps = 1/151 (0%)
Query: 72 PLAGLKFAIKDVFDVKGYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTVLDEF 131
PL G +IKD+FDV G T GS +RD A A++V L GA IGKT + EF
Sbjct: 45 PLDGRIVSIKDLFDVAGEPTLAGSV-IRRDAPPAGADALIVQRLRNAGAVIIGKTHMTEF 103
Query: 132 AFGITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVRVPASFCG 191
AF G N HYG P N P+ IPGGSSSG+AV+VA + A+G+DT G VR+PA+ G
Sbjct: 104 AFTPVGLNPHYGEPGNAIDPARIPGGSSSGAAVSVAEGTSEIAIGSDTGGSVRIPAALNG 163
Query: 192 ILGYRPSHGTVSMIGVLPNSQSLDTVGLLAR 222
++G++P+ + + G P S SLD++G LAR
Sbjct: 164 LVGFKPTARRIPLEGAFPLSPSLDSIGPLAR 194
>gnl|CDD|235945 PRK07139, PRK07139, amidase; Provisional.
Length = 439
Score = 126 bits (318), Expect = 2e-31
Identities = 101/399 (25%), Positives = 163/399 (40%), Gaps = 49/399 (12%)
Query: 72 PLAGLKFAIKDVFDVKGYVTGFGSPDWK--RDHHEAERTAVVVTLLLKNGATCIGKTVLD 129
PLA F IKD F T S + + + A VV L+ GA + K D
Sbjct: 36 PLANCVFTIKDNFATSEGPTHASSKSLENFKPSYNA----TVVQKLINAGAKPVAKVHCD 91
Query: 130 EFAFGITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVRVPASF 189
E G TG +G NP S + GGSSSGSA A + + FA+G+DT VR+PASF
Sbjct: 92 ELGLGGTGLFSAFGLIKNPLDSSKLVGGSSSGSA-ATFNKNISFAIGSDTGDSVRLPASF 150
Query: 190 CGILGYRPSHGTVSMIGVLPNSQSLDTVGLLARNASILHRVGHVLLQLNAVEPRRARRLI 249
G +G++PS+G +S G+ + SLDTV N + + VL + L
Sbjct: 151 IGKVGFKPSYGAISRYGLFAYASSLDTVAYFTHNVNDAIILSKVLFG------KDENDLT 204
Query: 250 FADDIFQLSKVPKLKTIHVIS--------------KAIESLSGYQNPEHMNVGQYIASNV 295
D K K K + + K I L + N+
Sbjct: 205 SVDVKINNVKKTKPKKVAYLDCFKELEEYVAKKYKKLINIL------KSENIEVEKIK-- 256
Query: 296 PSLKGLRAQST-----SPENGTATLKALSSVMLSLQRYEFKTIYEEWVKSAKPRLGYNVF 350
K L+A S ++ L L+ + R + + E + + G V
Sbjct: 257 IDEKLLKAIKPVYKIISYSEASSNLANLNGIAFG-NREKGSSWEEIMINTRSEGFGKMVQ 315
Query: 351 ER-VLEAINTTQDNIKILY----KVRNEMRAALQRLLKDDKILVIPTVSDPPLKLNTKKT 405
+R +L + ++N + + KVR ++ + + I++ P +D ++ +
Sbjct: 316 KRLILGSYFLEEENQEKYFLKAKKVRRVIKNYYESIHNKFDIVIYPAYADIAPDIDENEN 375
Query: 406 YSAEFHDRTLVLSSIGSMSGCCQVAIPIGKYNEHPISIS 444
S + D L +I ++ G ++IP+GKYN P +++
Sbjct: 376 KSDNYMDYIL---TISNLVGNPSLSIPLGKYNNLPFNLA 411
>gnl|CDD|235723 PRK06170, PRK06170, amidase; Provisional.
Length = 490
Score = 119 bits (300), Expect = 9e-29
Identities = 59/166 (35%), Positives = 83/166 (50%), Gaps = 9/166 (5%)
Query: 69 ARQPLAGLKFAIKDVFDVKGYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTVL 128
R PL G+ +K+ F+V G T +G PD + + AV V L GA +GKT +
Sbjct: 72 ERGPLLGIPVTVKESFNVAGLPTTWGFPDLR--DYVPAEDAVAVARLKAAGAVILGKTNV 129
Query: 129 DEFAFGITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVRVPAS 188
N+ YGT NP + PGGSS GSA A+AA ++G+D G +RVPA
Sbjct: 130 PLGLQDWQSYNEIYGTTNNPWDLARTPGGSSGGSAAALAAGFGALSIGSDIGGSLRVPAH 189
Query: 189 FCGILGYRPSHGTVSMIG-------VLPNSQSLDTVGLLARNASIL 227
+CG+ ++P+ G V + G LP L G +AR+A L
Sbjct: 190 YCGVYAHKPTLGLVPLRGHIPPPAPALPGQADLAVAGPMARSARDL 235
>gnl|CDD|235979 PRK07235, PRK07235, amidase; Provisional.
Length = 502
Score = 118 bits (298), Expect = 2e-28
Identities = 62/166 (37%), Positives = 82/166 (49%), Gaps = 18/166 (10%)
Query: 66 PPAARQPLAGLKFAIKDVFDVKGYVTGFGS-------PDWKRDHHEAERTAVVVTLLLKN 118
AA LAG A+KD V G GS P A VVT LL
Sbjct: 81 KGAAEGKLAGKTVALKDNVAVAGVPMMNGSSTLEGFVPS---------FDATVVTRLLDA 131
Query: 119 GATCIGKTVLDEFAFGITGENKHYGTPV-NPQMPSHIPGGSSSGSAVAVAAQLVDFALGT 177
GAT +GK ++ F G + PV NP+ P + GGSSSGSA VAA VD A+G
Sbjct: 132 GATIVGKATCEDLCFS-GGSHTSDPGPVHNPRDPGYSAGGSSSGSAALVAAGEVDMAIGG 190
Query: 178 DTIGCVRVPASFCGILGYRPSHGTVSMIGVLPNSQSLDTVGLLARN 223
D G +R+P+++CGI G +P+HG V G P +++D +G +
Sbjct: 191 DQGGSIRIPSAWCGIYGMKPTHGLVPYTGAFPIERTIDHLGPMTAT 236
>gnl|CDD|236028 PRK07486, PRK07486, amidase; Provisional.
Length = 484
Score = 111 bits (280), Expect = 3e-26
Identities = 69/203 (33%), Positives = 92/203 (45%), Gaps = 15/203 (7%)
Query: 30 VLAETLRR--RRNPKLNGRVDFGAFLERFELIPFP-----QPPPPAARQPLAGLKFAIKD 82
V+ L R NP +N V A +R L+ R L G+ A KD
Sbjct: 31 VMRAYLAHIERVNPAVNAIV---ALRDRDALLAEAAEKDAALARGEYRGWLHGMPQAPKD 87
Query: 83 VFDVKGYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTVLDEFAFGITGENKHY 142
+ KG T GSP + + A+VV + GA IGKT EF G N Y
Sbjct: 88 LAPTKGIRTTLGSPIFA--DQVPQEDAIVVERMRAAGAIFIGKTNTPEFGLGSHTYNPVY 145
Query: 143 GTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVRVPASFCGILGYRPSHGTV 202
G NP PS GGSS G+A A+A +++ A G+D +G +R PA+F + G+RPS G V
Sbjct: 146 GATRNPYDPSRSAGGSSGGAAAALALRMLPVADGSDMMGSLRNPAAFNNVYGFRPSQGRV 205
Query: 203 SMIGVLPNS--QSLDTVGLLARN 223
G + Q L T G + R
Sbjct: 206 PH-GPGGDVFVQQLGTEGPMGRT 227
>gnl|CDD|180437 PRK06169, PRK06169, putative amidase; Provisional.
Length = 466
Score = 106 bits (267), Expect = 1e-24
Identities = 60/162 (37%), Positives = 82/162 (50%), Gaps = 20/162 (12%)
Query: 72 PLAGLKFAIKDVFDVKGYVTGFGS----PD--WKRDHHEAERTAVVVTLLLKNGATCIGK 125
L G+ +IKD+F +G+ T GS D W D A V L + GA +GK
Sbjct: 71 LLDGVPVSIKDIFLTRGWPTLRGSRAIDADGPWDVD-------APAVARLREAGAVLLGK 123
Query: 126 TVLDEFAF-GITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVR 184
T EF + G+T ++ YG NP GGSS G+A AVA + ++GTD G VR
Sbjct: 124 TTTPEFGWKGVT-DSPLYGITRNPWDTRLTAGGSSGGAAAAVALGMGPLSVGTDGGGSVR 182
Query: 185 VPASFCGILGYRPSHGTVSMIGVLPNSQ--SLDTVGLLARNA 224
+PASFCG G++P+ G V + P S +L VG + R
Sbjct: 183 IPASFCGTFGFKPTFGRVPL---YPASPFGTLAHVGPMTRTV 221
>gnl|CDD|171524 PRK12470, PRK12470, amidase; Provisional.
Length = 462
Score = 103 bits (258), Expect = 1e-23
Identities = 63/156 (40%), Positives = 81/156 (51%), Gaps = 9/156 (5%)
Query: 70 RQPLAGLKFAIKDVFDVKGYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTVLD 129
R PL G+ AIKD DV G VT +GS A A VV L GA IGKT +
Sbjct: 70 RLPLLGVPIAIKDDVDVAGEVTTYGSAGHGP---AATSDAEVVRRLRAAGAVIIGKTNVP 126
Query: 130 EFAFGITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVRVPASF 189
E E+ +G NP P+ PGGSS GSA AVAA L ALG+D G +R+P+++
Sbjct: 127 ELMIMPFTESLAFGATRNPWDPNRTPGGSSGGSAAAVAAGLAPVALGSDGGGSIRIPSTW 186
Query: 190 CGILGYRPSHGTVSMIGVLPNS---QSLDTVGLLAR 222
CG+ G +P +S+ P+ Q L G +AR
Sbjct: 187 CGLFGLKPQRDRISL---EPHDGAWQGLSVNGPIAR 219
>gnl|CDD|235686 PRK06061, PRK06061, amidase; Provisional.
Length = 483
Score = 103 bits (259), Expect = 2e-23
Identities = 61/163 (37%), Positives = 77/163 (47%), Gaps = 21/163 (12%)
Query: 69 ARQPLAGLKFAIKDVFDVKGYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTVL 128
R PL G+ A+KD DV G T FG+ + A + VV L GA +GKT
Sbjct: 78 DRLPLLGVPIAVKDDVDVAGVPTAFGTAG---EVPPATADSEVVRRLRAAGAVIVGKTNT 134
Query: 129 DEF---------AFGITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDT 179
E AFG T NP H PGGSS GSA AVAA LV A+G+D
Sbjct: 135 CELGQWPFTSGPAFGHTR---------NPWSRDHTPGGSSGGSAAAVAAGLVTAAIGSDG 185
Query: 180 IGCVRVPASFCGILGYRPSHGTVSMIGVLPNSQSLDTVGLLAR 222
G VR+PA++ ++G +P G +S + L G LAR
Sbjct: 186 AGSVRIPAAWTHLVGIKPQRGRISTWPLPEAFNGLTVNGPLAR 228
>gnl|CDD|181154 PRK07869, PRK07869, amidase; Provisional.
Length = 468
Score = 100 bits (250), Expect = 2e-22
Identities = 57/173 (32%), Positives = 74/173 (42%), Gaps = 8/173 (4%)
Query: 29 VVLAETLRRRR-NPKLNGRVDFGAFLERFELIPFPQPPPPAARQPLAGLKFAIKDVFDVK 87
VV A R NP LN F+ P + +G+ IKD DV
Sbjct: 34 VVEAAIARAEAVNPALNALA-----YAAFDRARDRAARPGSQGGFFSGVPTFIKDNVDVA 88
Query: 88 GYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTVLDEFAFGITGENKHYGTPVN 147
G T GS W A+ + L G +GKT L EF F + E+ G N
Sbjct: 89 GLPTMHGSDAWTP--RPAKADSDFARQFLATGLISLGKTQLPEFGFSASTEHPRLGPVRN 146
Query: 148 PQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVRVPASFCGILGYRPSHG 200
P + G SS GSA VAA +V A D G +R+PA+ CG++G +PS G
Sbjct: 147 PWNTDYSAGASSGGSAALVAAGVVPIAHANDGGGSIRIPAACCGLVGLKPSRG 199
>gnl|CDD|236161 PRK08137, PRK08137, amidase; Provisional.
Length = 497
Score = 99.5 bits (248), Expect = 4e-22
Identities = 58/161 (36%), Positives = 77/161 (47%), Gaps = 8/161 (4%)
Query: 68 AARQPLAGLKFAIKDVFDV-KGYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKT 126
R PL G+ +KD D T GS + R A +V L GA +GK
Sbjct: 68 KVRGPLHGIPVLLKDNIDAADPMPTTAGS--LALAGNRPTRDAFLVARLRDAGAVILGKA 125
Query: 127 VLDEFA-----FGITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIG 181
L E+A +G + G NP P GSSSGS AVAA L A+GT+T G
Sbjct: 126 NLSEWANFRSTRSSSGWSARGGLTRNPYALDRSPCGSSSGSGAAVAAGLAAVAIGTETDG 185
Query: 182 CVRVPASFCGILGYRPSHGTVSMIGVLPNSQSLDTVGLLAR 222
+ PA+ G++G +P+ G VS G++P S S DT G + R
Sbjct: 186 SITCPAAINGLVGLKPTVGLVSRDGIVPISHSQDTAGPMTR 226
>gnl|CDD|235855 PRK06707, PRK06707, amidase; Provisional.
Length = 536
Score = 96.9 bits (241), Expect = 3e-21
Identities = 58/173 (33%), Positives = 85/173 (49%), Gaps = 15/173 (8%)
Query: 61 FPQPPPPAARQPLAGLKFAIKD-VFDVKGYVTGFGS---PDWKRDHHEAERTAVVVTLLL 116
Q + L G+ +KD V K T G+ DW A++ A +V L
Sbjct: 125 LDQERSRNKKSNLYGIPVVVKDNVQTAKVMPTSAGTYVLKDWI-----ADQDATIVKQLK 179
Query: 117 KNGATCIGKTVLDEFAFGI-----TGENKHYGTPVNPQMPSHI-PGGSSSGSAVAVAAQL 170
+ GA +GK + E+A + +G + G +NP P GSSSGSA VAA
Sbjct: 180 EEGAFVLGKANMSEWANYLSFTMPSGYSGKKGQNLNPYGPIKFDTSGSSSGSATVVAADF 239
Query: 171 VDFALGTDTIGCVRVPASFCGILGYRPSHGTVSMIGVLPNSQSLDTVGLLARN 223
A+GT+T G + PA+ ++G RPS G VS G++P +++LDT G +AR
Sbjct: 240 APLAVGTETTGSIVAPAAQQSVVGLRPSLGMVSRTGIIPLAETLDTAGPMART 292
>gnl|CDD|236029 PRK07487, PRK07487, amidase; Provisional.
Length = 469
Score = 93.9 bits (234), Expect = 2e-20
Identities = 61/166 (36%), Positives = 78/166 (46%), Gaps = 15/166 (9%)
Query: 72 PLAGLKFAIKDVFDVKGYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTVLDEF 131
PLAG+ +K D G+ T G K A + VV L K GA IG+T F
Sbjct: 72 PLAGVPVTVKVNVDQAGFATTNGVRLQK--DLIAPADSPVVDNLRKAGAVIIGRTNTPAF 129
Query: 132 AFGITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVRVPASFCG 191
++ +N +G +NP PS PGGSS G+A AVAA + A GTD G +R PA CG
Sbjct: 130 SYRWFTDNPLHGRTLNPWDPSLTPGGSSGGAAAAVAAGIGAIAHGTDIGGSIRYPAYACG 189
Query: 192 ILGYRPSHGTVSM---------IGVLPNSQSLDTVGLLARNASILH 228
+ G RP+ G V IG Q + G LAR + L
Sbjct: 190 VHGLRPTLGRVPAYNASSPERPIGA----QLMSVQGPLARTVADLR 231
>gnl|CDD|180715 PRK06828, PRK06828, amidase; Provisional.
Length = 491
Score = 86.8 bits (215), Expect = 4e-18
Identities = 55/170 (32%), Positives = 86/170 (50%), Gaps = 14/170 (8%)
Query: 70 RQPLAGLKFAIKDVFDVKGYV-TGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTVL 128
R PL G+ +KD + + T G+ + H + A +VT L + GA +GK +
Sbjct: 77 RGPLHGIPVLLKDNIETNDSMHTSAGT--IALEQHISSEDAFLVTKLREAGAVILGKANM 134
Query: 129 DEFAFGIT-----GENKHYGTPVNPQ---MPSHIPGGSSSGSAVAVAAQLVDFALGTDTI 180
E A ++ G + G +NP GGSS+GSA+AVAA ++GT+T
Sbjct: 135 TELANFMSFEMWAGYSARGGQTINPYGTGEDDMFVGGSSTGSAIAVAANFTVVSVGTETD 194
Query: 181 GCVRVPASFCGILGYRPSHGTVSMIGVLPNSQSLDTVGLLAR---NASIL 227
G + PA ++G +P+ G +S G++P + S DT G AR +A+IL
Sbjct: 195 GSILSPAVQNSVVGIKPTVGLISRRGIIPFTYSQDTAGPFARTVTDAAIL 244
>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat.
Length = 69
Score = 75.8 bits (187), Expect = 3e-17
Identities = 28/69 (40%), Positives = 43/69 (62%), Gaps = 1/69 (1%)
Query: 487 DASELLKEKGNAAFKGKQWNKAVNYYSEAIKLNGTSATYYSNRAAAYLELGC-FQQAEED 545
D +E LK GNA FK +++A+ Y +A++L+ +A Y N A AYL+LG +++A ED
Sbjct: 1 DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALED 60
Query: 546 CSKTISLDK 554
K + LD
Sbjct: 61 LEKALELDP 69
Score = 32.7 bits (75), Expect = 0.055
Identities = 10/34 (29%), Positives = 15/34 (44%)
Query: 522 SATYYSNRAAAYLELGCFQQAEEDCSKTISLDKK 555
+A N A +LG + +A E K + LD
Sbjct: 2 NAEALKNLGNALFKLGDYDEAIEAYEKALELDPD 35
>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
34 amino acids
[WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
in a variety of organisms including bacteria,
cyanobacteria, yeast, fungi, plants, and humans in
various subcellular locations; involved in a variety of
functions including protein-protein interactions, but
common features in the interaction partners have not
been defined; involved in chaperone, cell-cycle,
transciption, and protein transport complexes; the
number of TPR motifs varies among proteins (1,3-11,13
15,16,19); 5-6 tandem repeats generate a right-handed
helical structure with an amphipathic channel that is
thought to accomodate an alpha-helix of a target
protein; it has been proposed that TPR proteins
preferably interact with WD-40 repeat proteins, but in
many instances several TPR-proteins seem to aggregate to
multi-protein complexes; examples of TPR-proteins
include, Cdc16p, Cdc23p and Cdc27p components of the
cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
targeting signals, the Tom70p co-receptor for
mitochondrial targeting signals, Ser/Thr phosphatase 5C
and the p110 subunit of O-GlcNAc transferase; three
copies of the repeat are present here.
Length = 100
Score = 73.6 bits (181), Expect = 5e-16
Identities = 25/66 (37%), Positives = 40/66 (60%)
Query: 490 ELLKEKGNAAFKGKQWNKAVNYYSEAIKLNGTSATYYSNRAAAYLELGCFQQAEEDCSKT 549
E L GN +K +++A+ YY +A++L+ +A Y N AAAY +LG +++A ED K
Sbjct: 1 EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKA 60
Query: 550 ISLDKK 555
+ LD
Sbjct: 61 LELDPD 66
Score = 54.7 bits (132), Expect = 2e-09
Identities = 19/69 (27%), Positives = 36/69 (52%)
Query: 487 DASELLKEKGNAAFKGKQWNKAVNYYSEAIKLNGTSATYYSNRAAAYLELGCFQQAEEDC 546
D ++ A +K ++ +A+ Y +A++L+ +A Y N AY +LG +++A E
Sbjct: 32 DNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAY 91
Query: 547 SKTISLDKK 555
K + LD
Sbjct: 92 EKALELDPN 100
>gnl|CDD|183377 PRK11910, PRK11910, amidase; Provisional.
Length = 615
Score = 79.7 bits (196), Expect = 1e-15
Identities = 58/183 (31%), Positives = 83/183 (45%), Gaps = 13/183 (7%)
Query: 68 AARQPLAGLKFAIKDVFDVKGYVTGFGS---PDWKRDHHEAERTAVVVTLLLKNGATCIG 124
+ L G+ +KD K T G+ DW + A +V L NGA +G
Sbjct: 225 TNKSALYGMPVLLKDNIGTKELPTSAGTVALKDWV-----IGKDATIVENLKANGALILG 279
Query: 125 KTVLDEFAFGI-----TGENKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDT 179
KT + E+A G+ G + G NP + P GSSSGSA A + A+GT+T
Sbjct: 280 KTNMSEWAAGMDEDLPNGYSGKKGQSKNPYSSNLDPSGSSSGSATAATSDFAAIAIGTET 339
Query: 180 IGCVRVPASFCGILGYRPSHGTVSMIGVLPNSQSLDTVGLLARNASILHRVGHVLLQLNA 239
G + PAS +GY+PS G V+ G++P S DT G L R + + + L +
Sbjct: 340 NGSIITPASAQSAVGYKPSQGLVNNKGIIPLSSRFDTPGPLTRTVNDAYLTTNALTNTTS 399
Query: 240 VEP 242
P
Sbjct: 400 NPP 402
>gnl|CDD|235915 PRK07042, PRK07042, amidase; Provisional.
Length = 464
Score = 74.2 bits (183), Expect = 4e-14
Identities = 44/135 (32%), Positives = 63/135 (46%), Gaps = 2/135 (1%)
Query: 68 AARQPLAGLKFAIKDVFDVKGYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTV 127
PL G+ IK+ +G G+ D A A L + GA + KT
Sbjct: 67 EPLGPLDGVPVTIKENIATRGVPVPLGTA--ATDLPPAAADAPPAARLREAGAVILAKTT 124
Query: 128 LDEFAFGITGENKHYGTPVNPQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVRVPA 187
+ ++ +G + +G NP PGGSS+G+ A AA LGTD G VR+PA
Sbjct: 125 MPDYGMLSSGLSSFHGLTRNPWDLDQNPGGSSAGAGAAAAAGYGPLHLGTDIGGSVRLPA 184
Query: 188 SFCGILGYRPSHGTV 202
+CGI+G +PS G +
Sbjct: 185 GWCGIVGLKPSLGRI 199
>gnl|CDD|180626 PRK06565, PRK06565, amidase; Validated.
Length = 566
Score = 74.8 bits (184), Expect = 5e-14
Identities = 55/156 (35%), Positives = 76/156 (48%), Gaps = 9/156 (5%)
Query: 72 PLAGLKFAIKDVFDVKGYVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTVLDEF 131
PL G+ + KD + VKG GSP +K A+R A + L GA C+GKT +
Sbjct: 75 PLDGIPYTAKDSYLVKGLTAASGSPAFK--DLVAQRDAFTIERLRAAGAICLGKTNMPPM 132
Query: 132 AFGITGENK-HYG---TPVNPQ-MPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVRVP 186
A G G + YG +P N + + GSS+G+ A AA F L +T R P
Sbjct: 133 ANG--GMQRGVYGRAESPYNAAYLTAPFASGSSNGAGTATAASFSAFGLAEETWSSGRGP 190
Query: 187 ASFCGILGYRPSHGTVSMIGVLPNSQSLDTVGLLAR 222
AS G+ Y PS G +S+ G P + ++D V AR
Sbjct: 191 ASNNGLCAYTPSRGVISVRGNWPLTPTMDVVVPYAR 226
>gnl|CDD|180608 PRK06529, PRK06529, amidase; Provisional.
Length = 482
Score = 71.0 bits (174), Expect = 5e-13
Identities = 55/175 (31%), Positives = 87/175 (49%), Gaps = 30/175 (17%)
Query: 40 NPKLNGRVDFGAFLERFELIPFPQPPPPAARQ----------PLAGLKFAIKDVF-DVKG 88
NP LN V ER+E A + P AG+ +KD+ ++KG
Sbjct: 35 NPTLNAIVS-----ERYE----------EALEEAKQRDFSGKPFAGVPIFLKDLGQELKG 79
Query: 89 YVTGFGSPDWKRDHHEAERTAVVVTLLLKNGATCIGKTVLDEFAFGITGENKHYGTPVN- 147
++ GS +K +++A +T + V L G +G++ EF F ++ +G PVN
Sbjct: 80 QLSTSGSRLFK--NYQATKTDLYVKRLEDLGFIILGRSNTPEFGFKNISDSSLHG-PVNL 136
Query: 148 PQMPSHIPGGSSSGSAVAVAAQLVDFALGTDTIGCVRVPASFCGILGYRPSHGTV 202
P S GGSS G+A V++ +V A +D G +R+PASF G++G +PS G +
Sbjct: 137 PFDNSRNAGGSSGGAAALVSSGIVALAAASDGGGSIRIPASFNGLIGLKPSRGRI 191
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 59.2 bits (143), Expect = 3e-09
Identities = 28/62 (45%), Positives = 40/62 (64%), Gaps = 1/62 (1%)
Query: 492 LKEKGNAAFKGKQWNKAVNYYSEAIKLNGTSATYYSNRAAAYLELGCFQQAEEDCSKTIS 551
LKEKGN A++ K +NKA+ YS+AI+ YYSNRAA + LG +++ ED + +
Sbjct: 130 LKEKGNKAYRNKDFNKAIKLYSKAIECK-PDPVYYSNRAACHNALGDWEKVVEDTTAALE 188
Query: 552 LD 553
LD
Sbjct: 189 LD 190
Score = 36.9 bits (85), Expect = 0.031
Identities = 22/79 (27%), Positives = 34/79 (43%), Gaps = 10/79 (12%)
Query: 506 NKAVNYYSEAIKLNGTSATYYSNRAAAYLELGCFQQAEEDCSKTISLDKK---------- 555
+KA + +A+KLN Y +RA + G F QA +D K+I LD
Sbjct: 382 DKAEEDFDKALKLNSEDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPDFIFSHIQLGV 441
Query: 556 VRARVGYVDCSMLIYLLCI 574
+ + G + SM + C
Sbjct: 442 TQYKEGSIASSMATFRRCK 460
>gnl|CDD|215568 PLN03088, PLN03088, SGT1, suppressor of G2 allele of SKP1;
Provisional.
Length = 356
Score = 51.7 bits (124), Expect = 5e-07
Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 488 ASELLKEKGNAAFKGKQWNKAVNYYSEAIKLNGTSATYYSNRAAAYLELGCFQQAEEDCS 547
A +L +K AF + AV+ Y++AI L+ +A Y++RA A ++LG F +A D +
Sbjct: 2 AKDLE-DKAKEAFVDDDFALAVDLYTQAIDLDPNNAELYADRAQANIKLGNFTEAVADAN 60
Query: 548 KTISLD 553
K I LD
Sbjct: 61 KAIELD 66
>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat.
Length = 34
Score = 38.2 bits (90), Expect = 2e-04
Identities = 11/33 (33%), Positives = 16/33 (48%)
Query: 523 ATYYSNRAAAYLELGCFQQAEEDCSKTISLDKK 555
A N AYL+LG + +A E K + L+
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPN 33
Score = 35.1 bits (82), Expect = 0.003
Identities = 11/31 (35%), Positives = 21/31 (67%)
Query: 489 SELLKEKGNAAFKGKQWNKAVNYYSEAIKLN 519
++ L GNA K ++++A+ YY +A++LN
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELN 31
>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats. Repeats present in 4
or more copies in proteins. Contain a minimum of 34
amino acids each and self-associate via a "knobs and
holes" mechanism.
Length = 34
Score = 36.7 bits (86), Expect = 0.001
Identities = 12/33 (36%), Positives = 16/33 (48%)
Query: 523 ATYYSNRAAAYLELGCFQQAEEDCSKTISLDKK 555
A N AYL+LG + +A E K + LD
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPN 33
Score = 36.3 bits (85), Expect = 0.001
Identities = 11/31 (35%), Positives = 20/31 (64%)
Query: 489 SELLKEKGNAAFKGKQWNKAVNYYSEAIKLN 519
+E L GNA K +++A+ YY +A++L+
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELD 31
>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat.
Length = 34
Score = 36.0 bits (84), Expect = 0.002
Identities = 11/34 (32%), Positives = 19/34 (55%)
Query: 511 YYSEAIKLNGTSATYYSNRAAAYLELGCFQQAEE 544
Y +A++L+ +A Y N A L LG + +A +
Sbjct: 1 LYEKALELDPNNAEAYYNLALLLLNLGQYDEALQ 34
>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat.
Length = 65
Score = 35.4 bits (82), Expect = 0.005
Identities = 10/60 (16%), Positives = 20/60 (33%)
Query: 494 EKGNAAFKGKQWNKAVNYYSEAIKLNGTSATYYSNRAAAYLELGCFQQAEEDCSKTISLD 553
AA + +++A+ A+ +A A L G +A ++ D
Sbjct: 2 ALARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAAD 61
>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat. This Pfam entry
includes outlying Tetratricopeptide-like repeats (TPR)
that are not matched by pfam00515.
Length = 34
Score = 34.0 bits (79), Expect = 0.007
Identities = 9/30 (30%), Positives = 17/30 (56%)
Query: 490 ELLKEKGNAAFKGKQWNKAVNYYSEAIKLN 519
E L G A +K + +A+ Y +A++L+
Sbjct: 2 EALYNLGLAYYKLGDYEEALEAYEKALELD 31
Score = 32.9 bits (76), Expect = 0.022
Identities = 11/33 (33%), Positives = 16/33 (48%)
Query: 523 ATYYSNRAAAYLELGCFQQAEEDCSKTISLDKK 555
A N AY +LG +++A E K + LD
Sbjct: 1 AEALYNLGLAYYKLGDYEEALEAYEKALELDPN 33
>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat.
Length = 78
Score = 35.4 bits (82), Expect = 0.008
Identities = 16/76 (21%), Positives = 33/76 (43%), Gaps = 7/76 (9%)
Query: 487 DASELLKEKGNAAFKGKQWNKAVNYYSEAIKL-------NGTSATYYSNRAAAYLELGCF 539
D + L + +++A+ +A++L + +A +N A YL LG +
Sbjct: 3 DLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDY 62
Query: 540 QQAEEDCSKTISLDKK 555
+A E K ++L +
Sbjct: 63 DEALEYLEKALALREA 78
Score = 30.8 bits (70), Expect = 0.26
Identities = 9/34 (26%), Positives = 16/34 (47%)
Query: 523 ATYYSNRAAAYLELGCFQQAEEDCSKTISLDKKV 556
A +N A LG + +A E K + L +++
Sbjct: 5 AAALNNLALVLRRLGDYDEALELLEKALELAREL 38
>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
Length = 291
Score = 38.3 bits (87), Expect = 0.009
Identities = 16/78 (20%), Positives = 32/78 (41%), Gaps = 1/78 (1%)
Query: 486 IDASELLKEKGNAAFKGKQWNKAVNYYSEAIKLNGTSATYYSNRA-AAYLELGCFQQAEE 544
+ +E L G ++ +A+ +A+ L+ + A A ELG +++A E
Sbjct: 92 PNLAEALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYELGDYEEALE 151
Query: 545 DCSKTISLDKKVRARVGY 562
K + LD ++
Sbjct: 152 LYEKALELDPELNELAEA 169
Score = 36.0 bits (81), Expect = 0.047
Identities = 27/111 (24%), Positives = 43/111 (38%), Gaps = 10/111 (9%)
Query: 455 LLDTVLDLYDSLQEQVNIASNLVPLPDTNGSIDASELLKEKGNAAFKGKQWNKAVNYYSE 514
LL L Y+ E + A L P PD ++ A L E G + +A+ Y +
Sbjct: 103 LLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYELG-------DYEEALELYEK 155
Query: 515 AIKLNGTS---ATYYSNRAAAYLELGCFQQAEEDCSKTISLDKKVRARVGY 562
A++L+ A A LG +++A E K + L+ A
Sbjct: 156 ALELDPELNELAEALLALGALLEALGRYEEALELLEKALKLNPDDDAEALL 206
>gnl|CDD|211358 cd08352, Glo_EDI_BRP_like_1, This conserved domain belongs to a
superfamily including the bleomycin resistance protein,
glyoxalase I, and type I ring-cleaving dioxygenases.
This protein family belongs to a conserved domain
superfamily that is found in a variety of structurally
related metalloproteins, including the bleomycin
resistance protein, glyoxalase I, and type I
ring-cleaving dioxygenases. A bound metal ion is
required for protein activities for the members of this
superfamily. A variety of metal ions have been found in
the catalytic centers of these proteins including
Fe(II), Mn(II), Zn(II), Ni(II) and Mg(II). The protein
superfamily contains members with or without domain
swapping. The proteins of this family share three
conserved metal binding amino acids with the type I
extradiol dioxygenases, which shows no domain swapping.
Length = 125
Score = 35.8 bits (83), Expect = 0.013
Identities = 22/76 (28%), Positives = 35/76 (46%), Gaps = 7/76 (9%)
Query: 23 VSVVGFVVLAETLRRRRNP-KLNGRVDFGAFLERFELIPFPQPPPPAARQPLAGLK---F 78
V V+G ++ ET R R KL+ ++ +E F FP PPP + GL+ F
Sbjct: 22 VEVLGLTLIRETYRPERKSYKLDLALNGEYQIELFS---FPNPPPRPSYPEACGLRHLAF 78
Query: 79 AIKDVFDVKGYVTGFG 94
A++D+ + G
Sbjct: 79 AVEDIEAAVAELKAHG 94
>gnl|CDD|227343 COG5010, TadD, Flp pilus assembly protein TadD, contains TPR
repeats [Intracellular trafficking and secretion].
Length = 257
Score = 34.3 bits (79), Expect = 0.11
Identities = 17/79 (21%), Positives = 33/79 (41%), Gaps = 3/79 (3%)
Query: 487 DASELLKEKGNAAFKGKQWNKAVNYYSEAIKLNGTSATYYSNRAAAYLELGCFQQAEEDC 546
ELL +G + + +AV+ +A +L T ++ AA +LG F +A
Sbjct: 98 KDRELLAAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAY 157
Query: 547 SKTISL---DKKVRARVGY 562
+ + L + + +G
Sbjct: 158 RQALELAPNEPSIANNLGM 176
>gnl|CDD|212593 cd11719, FANC, Fanconi anemia ID complex proteins FANCI and FANCD2.
The Fanconi anemia ID complex consists of two subunits,
Fanconi anemia I and Fanconi anemia D2 (FANCI-FANCD2)
and plays a central role in the repair of DNA
interstrand cross-links (ICLs). The complex is activated
via DNA damage-induced phosphorylation by ATR (ataxia
telangiectasia and Rad3-related) and monoubiquitination
by the FA core complex ubiquitin ligase, and it binds to
DNA at the ICL site, recognizing branched DNA
structures. Defects in the complex cause Fanconi anemia,
a cancer predisposition syndrome.
Length = 977
Score = 35.3 bits (80), Expect = 0.11
Identities = 7/108 (6%), Positives = 22/108 (20%), Gaps = 18/108 (16%)
Query: 283 EHMNVGQYIASNVPSLKGLRAQSTSPENGTATLKALS-SVMLSLQRYEFKTIYEEWVKSA 341
++ S L T+P + ++ ++ + + + E
Sbjct: 198 RELDKQHRE---EQSSDELSELITAPADELYHVEGTVILHIVFAIKLDCELGRELLKHL- 253
Query: 342 KPRLGYNVFERVLEAINTTQDNIK-----ILYKVRNEMRAALQRLLKD 384
+ L +++ L
Sbjct: 254 --------KAGQQGDPSKCLCPFSIALLLSLTRIQRFEEQVFDLLKTS 293
>gnl|CDD|177870 PLN02225, PLN02225, 1-deoxy-D-xylulose-5-phosphate synthase.
Length = 701
Score = 33.5 bits (76), Expect = 0.32
Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 5/91 (5%)
Query: 264 KTIHVISKAIESLSGYQNPEHMNVGQYIASNVPSLKGLRAQSTSP---ENGTATLKALSS 320
+ + VI A +++ Q E M+ Y+ SN+ + S P E A++ ALSS
Sbjct: 213 RVVAVIDNA--TITAGQAYEAMSNAGYLDSNMIVILNDSRHSLHPNMEEGSKASISALSS 270
Query: 321 VMLSLQRYEFKTIYEEWVKSAKPRLGYNVFE 351
+M +Q + + E K+ R+G ++E
Sbjct: 271 IMSKIQSSKIFRKFRELAKAMTKRIGKGMYE 301
>gnl|CDD|225613 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis
[Coenzyme metabolism].
Length = 400
Score = 33.1 bits (76), Expect = 0.35
Identities = 21/59 (35%), Positives = 29/59 (49%), Gaps = 1/59 (1%)
Query: 486 IDASELLKEKGNAAFKGKQWNKAVNYYSEAIKLNGTSATYYSNRAAAYLELGCFQQAEE 544
+ LL G A K K W KA A+KL SA+ Y+ A A +LG ++AE+
Sbjct: 325 PEDPLLLSTLGRLALKNKLWGKASEALEAALKL-RPSASDYAELADALDQLGEPEEAEQ 382
>gnl|CDD|225464 COG2912, COG2912, Uncharacterized conserved protein [Function
unknown].
Length = 269
Score = 32.3 bits (74), Expect = 0.51
Identities = 14/45 (31%), Positives = 19/45 (42%)
Query: 503 KQWNKAVNYYSEAIKLNGTSATYYSNRAAAYLELGCFQQAEEDCS 547
QW A+ + LN +R Y +LGC+ A ED S
Sbjct: 195 LQWELALRVAERLLDLNPEDPYEIRDRGLIYAQLGCYHVALEDLS 239
>gnl|CDD|214587 smart00255, TIR, Toll - interleukin 1 - resistance.
Length = 140
Score = 31.1 bits (71), Expect = 0.58
Identities = 20/106 (18%), Positives = 32/106 (30%), Gaps = 28/106 (26%)
Query: 337 WVKSAKPRLGYNVFERVLEA----INTTQDNIK-------IL-------YKVRNEMRAAL 378
+ G VF E + + I+ +L +E+ AAL
Sbjct: 21 HLLEKLRGYGLCVFIDDFEPGGGDLEEIDEAIEKSRIAIVVLSPNYAESEWCLDELVAAL 80
Query: 379 QRLLKDDKILVIP-----TVSDPP-----LKLNTKKTYSAEFHDRT 414
+ L++ + VIP SD + KK Y D
Sbjct: 81 ENALEEGGLRVIPIFYEVIPSDVRKQPGKFRKVFKKNYLKWPEDEK 126
>gnl|CDD|183156 PRK11478, PRK11478, putative lyase; Provisional.
Length = 129
Score = 31.0 bits (70), Expect = 0.74
Identities = 23/85 (27%), Positives = 37/85 (43%), Gaps = 18/85 (21%)
Query: 7 IIKTNTSNPKVWVVIGVSVVGFVVLAETLRRRRNP-----KLNGRVDFGAFLERFELIPF 61
II T+ + K + ++GF + +E R R+ LNG+ EL F
Sbjct: 12 IIATDYAVSKAFYC---DILGFTLQSEVYREARDSWKGDLALNGQY-------VIELFSF 61
Query: 62 PQPPPPAARQPLAGLK---FAIKDV 83
P PP +R GL+ F++ D+
Sbjct: 62 PFPPERPSRPEACGLRHLAFSVDDI 86
>gnl|CDD|237279 PRK13030, PRK13030, 2-oxoacid ferredoxin oxidoreductase;
Provisional.
Length = 1159
Score = 32.3 bits (74), Expect = 0.80
Identities = 60/273 (21%), Positives = 90/273 (32%), Gaps = 73/273 (26%)
Query: 28 FV-VLAETLRRRRNPKLNGRVDFGAFLERFELIPFPQPPPPAARQPLAGLKFAIKDVFDV 86
FV V TLR+ + A R + +P P P A +D+
Sbjct: 678 FVTVEGATLRKAAGAAFDA----AALAARVDALPLPATPADA-------------APYDL 720
Query: 87 KGYVTGFGSPDWKRDHHEAERTAVV-VTLLLKNGATCIGK--TVLDEFAFGITGENKHYG 143
VTG G T VV V LL A GK +VLD TG + G
Sbjct: 721 --LVTGVGG------------TGVVTVGALLAMAAHLEGKSASVLD-----FTGLAQKGG 761
Query: 144 TPVNPQMPSHI---PGGSSSGSAVAVAAQLVDFALGTDTIGCVRVPASFCGILGYRPSHG 200
+ SH+ V + Q D L D + V AS + R
Sbjct: 762 AVL-----SHVRIAASPEWLH-QVRIDTQQADLLLACDMV----VGASPDALQTVRHGRT 811
Query: 201 TVSM-IGVLPNSQSL--DTVGL--LARNASILHRVGHVLLQLNAVEPRRARRLIFADDI- 254
+ +P + + L A + H G +L +++ + + D I
Sbjct: 812 RAVVNTHEIPTADFVRNPDANLPADALLDKMRHAAGD--ERLASLDAQALAEKLLGDAIG 869
Query: 255 ---------FQLSKVPKLKTIHVISKAIESLSG 278
+QL VP ++ + +AIE L+G
Sbjct: 870 ANILMLGYAWQLGLVP--LSLAALMRAIE-LNG 899
>gnl|CDD|220662 pfam10265, DUF2217, Uncharacterized conserved protein (DUF2217).
This is a family of conserved proteins of from 500 -
600 residues found from worms to humans. Its function
is not known.
Length = 515
Score = 32.0 bits (73), Expect = 0.86
Identities = 15/51 (29%), Positives = 20/51 (39%)
Query: 1 MSKSFNIIKTNTSNPKVWVVIGVSVVGFVVLAETLRRRRNPKLNGRVDFGA 51
S + +K + KV VV +VLA LRRRR + G
Sbjct: 2 SQSSLSQLKLSPGTKKVLFSTAAGVVSLIVLAHFLRRRRGKRKMEGEQLGT 52
>gnl|CDD|205550 pfam13371, TPR_9, Tetratricopeptide repeat.
Length = 73
Score = 29.1 bits (66), Expect = 1.3
Identities = 11/49 (22%), Positives = 19/49 (38%)
Query: 497 NAAFKGKQWNKAVNYYSEAIKLNGTSATYYSNRAAAYLELGCFQQAEED 545
+ + + +A+ + L +R Y +LGCFQ A D
Sbjct: 3 AIYLREEDFERALAVVERLLLLAPDDPYERRDRGLLYAQLGCFQAALAD 51
>gnl|CDD|235179 PRK03947, PRK03947, prefoldin subunit alpha; Reviewed.
Length = 140
Score = 29.5 bits (67), Expect = 2.2
Identities = 11/37 (29%), Positives = 18/37 (48%)
Query: 345 LGYNVFERVLEAINTTQDNIKILYKVRNEMRAALQRL 381
GY+ + + EAI + L K ++ ALQ+L
Sbjct: 84 AGYSAEKDLDEAIEILDKRKEELEKALEKLEEALQKL 120
>gnl|CDD|225913 COG3378, COG3378, Phage associated DNA primase [General function
prediction only].
Length = 517
Score = 30.1 bits (68), Expect = 3.5
Identities = 19/78 (24%), Positives = 29/78 (37%), Gaps = 16/78 (20%)
Query: 40 NPKLNGRVDFGAFLERFELIPFPQPPPPAARQPLAGLKFAI--KDVFD--VKGYV----T 91
N R D A R ++PF + PPA R K A + + V G++
Sbjct: 334 NHPPRIRGDDEAIWRRLLIVPFEKQFPPAERDDKLDEKLAAERPGILNWLVAGFLDLQRE 393
Query: 92 GFGSP--------DWKRD 101
G P ++KR+
Sbjct: 394 GLDQPQAVLDALEEYKRE 411
>gnl|CDD|233013 TIGR00540, TPR_hemY_coli, heme biosynthesis-associated TPR protein.
Members of this protein family are uncharacterized
tetratricopeptide repeat (TPR) proteins invariably found
in heme biosynthesis gene clusters. The absence of any
invariant residues other than Ala argues against this
protein serving as an enzyme per se. The gene symbol
hemY assigned in E. coli is unfortunate in that an
unrelated protein, protoporphyrinogen oxidase (HemG in
E. coli) is designated HemY in Bacillus subtilis
[Unknown function, General].
Length = 367
Score = 29.6 bits (67), Expect = 5.4
Identities = 14/57 (24%), Positives = 25/57 (43%), Gaps = 5/57 (8%)
Query: 490 ELLKEKGNAAFKGKQWNKAVNYYSEAIKLNGTSATYYSNRAAAYL--ELGCFQQAEE 544
LL G + + W KA +Y ++ L + T ++ A L +LG + A +
Sbjct: 309 LLLLALGRLCLRQQLWGKAQSYLEASLSL---APTEEAHLELAQLFEQLGDTEAAAQ 362
>gnl|CDD|220525 pfam10022, DUF2264, Uncharacterized protein conserved in bacteria
(DUF2264). Members of this family of hypothetical
bacterial proteins have no known function.
Length = 361
Score = 29.1 bits (66), Expect = 6.1
Identities = 11/31 (35%), Positives = 17/31 (54%), Gaps = 3/31 (9%)
Query: 508 AVNYYSEA---IKLNGTSATYYSNRAAAYLE 535
+ Y+SE I L G + +Y +AA +LE
Sbjct: 17 LLPYFSEGGARINLPGETGAHYDRKAADHLE 47
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 29.2 bits (66), Expect = 6.7
Identities = 37/201 (18%), Positives = 73/201 (36%), Gaps = 40/201 (19%)
Query: 232 HVLLQLNAVEPRRARRLIFADDIFQLSKVPKLKTIHVISKAIESLSGYQNPEHMNVGQY- 290
+ L + ++ R+L D + + ++S + L V +
Sbjct: 611 LLRLSASKLKKLEPRKLSLTDLFASSEEEDDSLKLSLLSTLLSLLDKA-------VDLWK 663
Query: 291 -------IASNV-PSLKGLRAQS-TSPENGTATLKALSSVM---------LSLQRYEFKT 332
I LK LR PE + L LS ++ L+LQ+++
Sbjct: 664 DLSAFPEIFEPFLALLKHLRKLPKPLPEKLQSLLDKLSKLLALALLERRPLALQKHKPLA 723
Query: 333 IYEEWVKSAKPRL--GYNVFERVLEAINTTQDNIKILYKVRNEMRAALQRLLKDDKILVI 390
I K+ P+ +N ++ + + K+ +++ E + A++ L KD++ +
Sbjct: 724 I-----KTFAPKFEENFNPDKKSYDPDRERAELNKLKAQLKKERKGAMRELRKDNRFIA- 777
Query: 391 PTVSDPPLKLNTKKTYSAEFH 411
KL KK AE+H
Sbjct: 778 ------REKLKEKKEKDAEYH 792
>gnl|CDD|182595 PRK10624, PRK10624, L-1,2-propanediol oxidoreductase; Provisional.
Length = 382
Score = 29.2 bits (66), Expect = 6.8
Identities = 15/39 (38%), Positives = 20/39 (51%), Gaps = 6/39 (15%)
Query: 263 LKTIHVISKAIESL--SGYQNPEHMNVGQYIA----SNV 295
LK I +I+ A+ + E M +GQYIA SNV
Sbjct: 218 LKAIEIIAGALRGAVAGDKEAGEGMALGQYIAGMGFSNV 256
>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
lipoprotein. This protein family occurs in strictly
within a subset of Gram-negative bacterial species with
the proposed PEP-CTERM/exosortase system, analogous to
the LPXTG/sortase system common in Gram-positive
bacteria. This protein occurs in a species if and only
if a transmembrane histidine kinase (TIGR02916) and a
DNA-binding response regulator (TIGR02915) also occur.
The present of tetratricopeptide repeats (TPR) suggests
protein-protein interaction, possibly for the regulation
of PEP-CTERM protein expression, since many PEP-CTERM
proteins in these genomes are preceded by a proposed DNA
binding site for the response regulator.
Length = 899
Score = 29.3 bits (66), Expect = 7.1
Identities = 21/93 (22%), Positives = 35/93 (37%), Gaps = 4/93 (4%)
Query: 465 SLQEQVNIASNLVPLPDTNGSIDASELLKEKGNAAFKGKQWNKAVNYYSEAIKLNGTSAT 524
LQ + + +P A+ELL +G A Q A Y +A+ ++ S
Sbjct: 101 LLQGKFQQVLDELPGKTLLDDEGAAELLALRGLAYLGLGQLELAQKSYEQALAIDPRSL- 159
Query: 525 YYSNRAAAYLEL--GCFQQAEEDCSKTISLDKK 555
Y+ A L L F +A + ++ D
Sbjct: 160 -YAKLGLAQLALAENRFDEARALIDEVLTADPG 191
>gnl|CDD|224952 COG2041, COG2041, Sulfite oxidase and related enzymes [General
function prediction only].
Length = 271
Score = 28.6 bits (64), Expect = 7.9
Identities = 14/42 (33%), Positives = 19/42 (45%), Gaps = 2/42 (4%)
Query: 445 FITYHGGDKFLLDTVLDLYDSLQEQVNIAS--NLVPLPDTNG 484
++ +H D T L L D+L +A N PLP NG
Sbjct: 159 YVMFHSLDGPDYTTGLPLDDALHPLTLLAYGMNGEPLPPENG 200
>gnl|CDD|215040 PLN00046, PLN00046, photosystem I reaction center subunit O;
Provisional.
Length = 141
Score = 27.9 bits (62), Expect = 8.7
Identities = 14/63 (22%), Positives = 24/63 (38%), Gaps = 13/63 (20%)
Query: 11 NTSNPKVWV-----VIGVSVVGFVVLAETLRRRRNPKLNGRVDFGAFLERF--ELIPFPQ 63
+ + W+ VIG ++G++ + P + G G F + EL FP
Sbjct: 49 KKTFDRDWLRKDLNVIGFGLIGWLAPSSI------PAIGGNSLTGLFFDSIGTELAHFPT 102
Query: 64 PPP 66
P
Sbjct: 103 GPA 105
>gnl|CDD|220577 pfam10111, Glyco_tranf_2_2, Glycosyltransferase like family 2.
Members of this family of prokaryotic proteins include
putative glucosyltransferase, which are involved in
bacterial capsule biosynthesis.
Length = 278
Score = 28.5 bits (64), Expect = 9.0
Identities = 20/79 (25%), Positives = 27/79 (34%), Gaps = 2/79 (2%)
Query: 216 TVGLLARNASILHRVGHVLLQLNAVEPRRARRLIFADDIFQLSKVPKLKTIHVISKAIES 275
V L R IL R+ L +L+ +I D LS +LK I + AI
Sbjct: 5 PVYLSLRTRDILKRLQSCLNRLSYFLSDPDVEIIVVDGDSPLSFAKELKKIIAKNGAINY 64
Query: 276 LSGYQNPEHMNVGQYIASN 294
+S A N
Sbjct: 65 ISH--KTHKDLFSPAKARN 81
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.135 0.395
Gapped
Lambda K H
0.267 0.0723 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 29,559,850
Number of extensions: 2944221
Number of successful extensions: 3070
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3020
Number of HSP's successfully gapped: 84
Length of query: 575
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 473
Effective length of database: 6,413,494
Effective search space: 3033582662
Effective search space used: 3033582662
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (27.7 bits)