RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14277
(217 letters)
>gnl|CDD|215689 pfam00067, p450, Cytochrome P450. Cytochrome P450s are
haem-thiolate proteins involved in the oxidative
degradation of various compounds. They are particularly
well known for their role in the degradation of
environmental toxins and mutagens. They can be divided
into 4 classes, according to the method by which
electrons from NAD(P)H are delivered to the catalytic
site. Sequence conservation is relatively low within the
family - there are only 3 absolutely conserved residues
- but their general topography and structural fold are
highly conserved. The conserved core is composed of a
coil termed the 'meander', a four-helix bundle, helices
J and K, and two sets of beta-sheets. These constitute
the haem-binding loop (with an absolutely conserved
cysteine that serves as the 5th ligand for the haem
iron), the proton-transfer groove and the absolutely
conserved EXXR motif in helix K. While prokaryotic P450s
are soluble proteins, most eukaryotic P450s are
associated with microsomal membranes. their general
enzymatic function is to catalyze regiospecific and
stereospecific oxidation of non-activated hydrocarbons
at physiological temperatures.
Length = 461
Score = 160 bits (407), Expect = 5e-47
Identities = 62/156 (39%), Positives = 91/156 (58%), Gaps = 4/156 (2%)
Query: 55 MQEVVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDL 114
+E+ + V L G + ++ST + L+ELA + EVQ+K R E+ +V TY+DL
Sbjct: 259 DEELRATVLELFFAGTDTTSSTLSWALYELAKHPEVQEKLREEIDEVI-GDKRSPTYDDL 317
Query: 115 AKMTYLEQVISEALRLYPLG-NGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATL 173
M YL+ VI E LRL+P+ L RE T+D IP ++I KG LV + YAL D +
Sbjct: 318 QNMPYLDAVIKETLRLHPVVPLLLPREVTKDTVIPG--YLIPKGTLVIVNLYALHRDPEV 375
Query: 174 WSDPLEFNPDRFAPENESKIVPGSYVPFGDGPRTCI 209
+ +P EF+P+RF EN +++PFG GPR C+
Sbjct: 376 FPNPEEFDPERFLDENGKFRKSFAFLPFGAGPRNCL 411
>gnl|CDD|225035 COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis,
transport, and catabolism].
Length = 411
Score = 109 bits (275), Expect = 3e-28
Identities = 41/154 (26%), Positives = 64/154 (41%), Gaps = 28/154 (18%)
Query: 56 QEVVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLA 115
E+ + L+V GHE + + L+ L + + K R E +
Sbjct: 235 DEIRDELITLLVAGHETTANALAWALYALLRHPDQLAKLRAEPDR--------------- 279
Query: 116 KMTYLEQVISEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWS 175
LE V+ E LRLYP R T+D + + I G +V + A D ++
Sbjct: 280 --PLLEAVVEETLRLYPPVPLARRVATED--VELGGYRIPAGTVVLLSIGAANRDPEVFP 335
Query: 176 DPLEFNPDRFAPENESKIVPGSYVPFGDGPRTCI 209
DP EF+P+RF +++PFG GP C+
Sbjct: 336 DPDEFDPERF---------NNAHLPFGGGPHRCL 360
>gnl|CDD|215164 PLN02290, PLN02290, cytokinin trans-hydroxylase.
Length = 516
Score = 101 bits (254), Expect = 4e-25
Identities = 62/179 (34%), Positives = 82/179 (45%), Gaps = 16/179 (8%)
Query: 32 NNNNNNNNNKNNINSPNFLGVIRMQEVVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQ 91
N +N N+N +Q ++ GHE + T L LA N Q
Sbjct: 300 NEMEKKRSNGFNLN---------LQLIMDECKTFFFAGHETTALLLTWTLMLLASNPTWQ 350
Query: 92 DKARREVQKVYKEGGGKVTYEDLAKMTYLEQVISEALRLYPLGNGLFRECTQDYAIPDSP 151
DK R EV +V GG + + L+K+T L VI+E+LRLYP L R +D + D
Sbjct: 351 DKVRAEVAEVC--GGETPSVDHLSKLTLLNMVINESLRLYPPATLLPRMAFEDIKLGDL- 407
Query: 152 HVIQKGVLVHIPTYALQTDATLW-SDPLEFNPDRFAPENESKIVPGSYVPFGDGPRTCI 209
I KG+ + IP A+ LW D EFNPDRFA ++PF GPR CI
Sbjct: 408 -HIPKGLSIWIPVLAIHHSEELWGKDANEFNPDRFA--GRPFAPGRHFIPFAAGPRNCI 463
>gnl|CDD|178524 PLN02936, PLN02936, epsilon-ring hydroxylase.
Length = 489
Score = 84.8 bits (210), Expect = 3e-19
Identities = 48/149 (32%), Positives = 75/149 (50%), Gaps = 8/149 (5%)
Query: 65 LIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKMTYLEQVI 124
++V GHE + S T L+ L+ N E KA+ E+ +V G TYED+ ++ YL + I
Sbjct: 286 MLVAGHETTGSVLTWTLYLLSKNPEALRKAQEELDRVL--QGRPPTYEDIKELKYLTRCI 343
Query: 125 SEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDR 184
+E++RLYP L R + +P + + G + I Y + +W EF P+R
Sbjct: 344 NESMRLYPHPPVLIRRAQVEDVLPGG-YKVNAGQDIMISVYNIHRSPEVWERAEEFVPER 402
Query: 185 FAPE----NESKIVPGSYVPFGDGPRTCI 209
F + NE+ Y+PF GPR C+
Sbjct: 403 FDLDGPVPNETN-TDFRYIPFSGGPRKCV 430
>gnl|CDD|215393 PLN02738, PLN02738, carotene beta-ring hydroxylase.
Length = 633
Score = 83.8 bits (207), Expect = 1e-18
Identities = 48/148 (32%), Positives = 72/148 (48%), Gaps = 7/148 (4%)
Query: 65 LIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKMTYLEQVI 124
+++ GHE S + T + L+ V K + EV V G T ED+ K+ Y +VI
Sbjct: 399 MLIAGHETSAAVLTWTFYLLSKEPSVVAKLQEEVDSVL--GDRFPTIEDMKKLKYTTRVI 456
Query: 125 SEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDR 184
+E+LRLYP L R ++ + P I++G + I + L W D +FNP+R
Sbjct: 457 NESLRLYPQPPVLIRRSLENDMLGGYP--IKRGEDIFISVWNLHRSPKHWDDAEKFNPER 514
Query: 185 F---APENESKIVPGSYVPFGDGPRTCI 209
+ P SY+PFG GPR C+
Sbjct: 515 WPLDGPNPNETNQNFSYLPFGGGPRKCV 542
>gnl|CDD|215171 PLN02302, PLN02302, ent-kaurenoic acid oxidase.
Length = 490
Score = 77.1 bits (190), Expect = 1e-16
Identities = 47/143 (32%), Positives = 71/143 (49%), Gaps = 8/143 (5%)
Query: 69 GHENSTSTSTNVLHELAYNQEVQDKARREVQKVYK---EGGGKVTYEDLAKMTYLEQVIS 125
GHE+S + L + EV KA+ E +++ K G +T +D+ KM YL QVI
Sbjct: 299 GHESSGHLTMWATIFLQEHPEVLQKAKAEQEEIAKKRPPGQKGLTLKDVRKMEYLSQVID 358
Query: 126 EALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDRF 185
E LRL + +FRE D + + + I KG V + D ++ +P EF+P R+
Sbjct: 359 ETLRLINISLTVFREAKTDVEV--NGYTIPKGWKVLAWFRQVHMDPEVYPNPKEFDPSRW 416
Query: 186 APENESKIVPGSYVPFGDGPRTC 208
K G+++PFG G R C
Sbjct: 417 D-NYTPK--AGTFLPFGLGSRLC 436
>gnl|CDD|215371 PLN02687, PLN02687, flavonoid 3'-monooxygenase.
Length = 517
Score = 76.8 bits (189), Expect = 2e-16
Identities = 54/156 (34%), Positives = 76/156 (48%), Gaps = 21/156 (13%)
Query: 65 LIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGK-VTYEDLAKMTYLEQV 123
L G + ++ST + EL + ++ KA+ E+ V G + V+ DL ++TYL+ V
Sbjct: 305 LFTAGTDTTSSTVEWAIAELIRHPDILKKAQEELDAVV--GRDRLVSESDLPQLTYLQAV 362
Query: 124 ISEALRLYP-----LGNGLFRECTQD-YAIPDSPHVIQKGVLVHIPTYALQTDATLWSDP 177
I E RL+P L EC + Y IP KG + + +A+ D W DP
Sbjct: 363 IKETFRLHPSTPLSLPRMAAEECEINGYHIP-------KGATLLVNVWAIARDPEQWPDP 415
Query: 178 LEFNPDRFAP--ENESKIVPGS---YVPFGDGPRTC 208
LEF PDRF P E+ V GS +PFG G R C
Sbjct: 416 LEFRPDRFLPGGEHAGVDVKGSDFELIPFGAGRRIC 451
>gnl|CDD|215354 PLN02655, PLN02655, ent-kaurene oxidase.
Length = 466
Score = 75.5 bits (186), Expect = 5e-16
Identities = 46/139 (33%), Positives = 63/139 (45%), Gaps = 15/139 (10%)
Query: 76 TSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKMTYLEQVISEALRLY---P 132
T+ ++ELA N + Q++ RE+++V G +VT EDL + YL V E LR Y P
Sbjct: 281 TTEWAMYELAKNPDKQERLYREIREVC--GDERVTEEDLPNLPYLNAVFHETLRKYSPVP 338
Query: 133 LGNGLF-RECTQ--DYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDRFAPEN 189
L F E T Y IP G + I Y D W +P E++P+RF E
Sbjct: 339 LLPPRFVHEDTTLGGYDIP-------AGTQIAINIYGCNMDKKRWENPEEWDPERFLGEK 391
Query: 190 ESKIVPGSYVPFGDGPRTC 208
+ FG G R C
Sbjct: 392 YESADMYKTMAFGAGKRVC 410
>gnl|CDD|173595 PTZ00404, PTZ00404, cytochrome P450; Provisional.
Length = 482
Score = 75.1 bits (185), Expect = 8e-16
Identities = 41/153 (26%), Positives = 70/153 (45%), Gaps = 7/153 (4%)
Query: 58 VVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKM 117
+++ + + G + S ++ ++ L E+Q+KA E++ G KV D
Sbjct: 284 ILATILDFFLAGVDTSATSLEWMVLMLCNYPEIQEKAYNEIKSTVN-GRNKVLLSDRQST 342
Query: 118 TYLEQVISEALRLYPLGN-GLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSD 176
Y +I E LR P+ GL R + D I H I K + I Y+L + + +
Sbjct: 343 PYTVAIIKETLRYKPVSPFGLPRSTSNDIIIGGG-HFIPKDAQILINYYSLGRNEKYFEN 401
Query: 177 PLEFNPDRFAPENESKIVPGSYVPFGDGPRTCI 209
P +F+P RF +S +++PF GPR C+
Sbjct: 402 PEQFDPSRFL-NPDS---NDAFMPFSIGPRNCV 430
>gnl|CDD|215086 PLN00168, PLN00168, Cytochrome P450; Provisional.
Length = 519
Score = 72.7 bits (178), Expect = 5e-15
Identities = 44/145 (30%), Positives = 72/145 (49%), Gaps = 10/145 (6%)
Query: 72 NSTSTSTN-VLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKMTYLEQVISEALRL 130
++TST+ ++ EL N +Q K E++ + +V+ ED+ KM YL+ V+ E LR
Sbjct: 320 DTTSTALQWIMAELVKNPSIQSKLHDEIKAKTGDDQEEVSEEDVHKMPYLKAVVLEGLRK 379
Query: 131 YPLGNGLF-RECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDRFAP-- 187
+P + + + +D + ++I KG V+ + D W P+EF P+RF
Sbjct: 380 HPPAHFVLPHKAAEDMEV--GGYLIPKGATVNFMVAEMGRDEREWERPMEFVPERFLAGG 437
Query: 188 ENESKIVPGS----YVPFGDGPRTC 208
+ E V GS +PFG G R C
Sbjct: 438 DGEGVDVTGSREIRMMPFGVGRRIC 462
>gnl|CDD|215583 PLN03112, PLN03112, cytochrome P450 family protein; Provisional.
Length = 514
Score = 71.0 bits (174), Expect = 2e-14
Identities = 44/144 (30%), Positives = 62/144 (43%), Gaps = 13/144 (9%)
Query: 74 TSTSTN--VLHELAYNQEVQDKARREVQKVYKEGGGK-VTYEDLAKMTYLEQVISEALRL 130
TS TN + E+ N V K + E+ V G + V DL + YL V+ E R+
Sbjct: 311 TSAVTNEWAMAEVIKNPRVLRKIQEELDSVV--GRNRMVQESDLVHLNYLRCVVRETFRM 368
Query: 131 YPLGNGLF-RECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDRFAPEN 189
+P G L E + I + + I V I T+ L + +W D EF P+R P
Sbjct: 369 HPAGPFLIPHESLRATTI--NGYYIPAKTRVFINTHGLGRNTKIWDDVEEFRPERHWPAE 426
Query: 190 ESK--IVPGS---YVPFGDGPRTC 208
S+ I G +PF G R C
Sbjct: 427 GSRVEISHGPDFKILPFSAGKRKC 450
>gnl|CDD|177725 PLN00110, PLN00110, flavonoid 3',5'-hydroxylase (F3'5'H);
Provisional.
Length = 504
Score = 65.6 bits (160), Expect = 1e-12
Identities = 53/180 (29%), Positives = 81/180 (45%), Gaps = 22/180 (12%)
Query: 47 PNFLGVIRM-QEVVSGVFV-----------LIVGGHENSTSTSTNVLHELAYNQEVQDKA 94
P+FL V+ QE +G + L G + S+S L E+ N + +A
Sbjct: 267 PDFLDVVMANQENSTGEKLTLTNIKALLLNLFTAGTDTSSSVIEWSLAEMLKNPSILKRA 326
Query: 95 RREVQKVYKEGGGKVTYE-DLAKMTYLEQVISEALRLYP-LGNGLFRECTQDYAIPDSPH 152
E+ +V G + E DL K+ YL+ + E+ R +P L R TQ A + +
Sbjct: 327 HEEMDQVI--GRNRRLVESDLPKLPYLQAICKESFRKHPSTPLNLPRVSTQ--ACEVNGY 382
Query: 153 VIQKGVLVHIPTYALQTDATLWSDPLEFNPDRFAPENESKIVPG----SYVPFGDGPRTC 208
I K + + +A+ D +W +P EF P+RF E +KI P +PFG G R C
Sbjct: 383 YIPKNTRLSVNIWAIGRDPDVWENPEEFRPERFLSEKNAKIDPRGNDFELIPFGAGRRIC 442
>gnl|CDD|215235 PLN02426, PLN02426, cytochrome P450, family 94, subfamily C
protein.
Length = 502
Score = 65.5 bits (160), Expect = 2e-12
Identities = 48/159 (30%), Positives = 70/159 (44%), Gaps = 17/159 (10%)
Query: 58 VVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKM 117
VVS F+L G + S T+ L+ + EV R E +V ++E++ +M
Sbjct: 298 VVS--FLL--AGRDTVASALTSFFWLLSKHPEVASAIREEADRVMGPNQEAASFEEMKEM 353
Query: 118 TYLEQVISEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLW-SD 176
YL + E++RL+P + +D +PD V KG V YA+ +W D
Sbjct: 354 HYLHAALYESMRLFPPVQFDSKFAAEDDVLPDGTFV-AKGTRVTYHPYAMGRMERIWGPD 412
Query: 177 PLEFNPDR------FAPENESKIVPGSYVPFGDGPRTCI 209
LEF P+R F PEN K Y F G R C+
Sbjct: 413 CLEFKPERWLKNGVFVPENPFK-----YPVFQAGLRVCL 446
>gnl|CDD|165828 PLN02183, PLN02183, ferulate 5-hydroxylase.
Length = 516
Score = 64.5 bits (157), Expect = 4e-12
Identities = 48/154 (31%), Positives = 67/154 (43%), Gaps = 27/154 (17%)
Query: 68 GGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYE-DLAKMTYLEQVISE 126
GG E S + EL + E + ++E+ V G + E DL K+TYL+ + E
Sbjct: 315 GGTETVASAIEWAMAELMKSPEDLKRVQQELADVV--GLNRRVEESDLEKLTYLKCTLKE 372
Query: 127 ALRLYPLGNGLFRECTQD-----YAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFN 181
LRL+P L E +D Y IP V+ I +A+ D W DP F
Sbjct: 373 TLRLHPPIPLLLHETAEDAEVAGYFIPKRSRVM-------INAWAIGRDKNSWEDPDTFK 425
Query: 182 PDRF----APENESKIVPGS---YVPFGDGPRTC 208
P RF P+ GS ++PFG G R+C
Sbjct: 426 PSRFLKPGVPD-----FKGSHFEFIPFGSGRRSC 454
>gnl|CDD|177826 PLN02169, PLN02169, fatty acid (omega-1)-hydroxylase/midchain
alkane hydroxylase.
Length = 500
Score = 63.9 bits (155), Expect = 5e-12
Identities = 44/151 (29%), Positives = 72/151 (47%), Gaps = 11/151 (7%)
Query: 62 VFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKMTYLE 121
+F L++ G + ++S T L+ + +V K R E+ + EDL K+ YL
Sbjct: 306 IFSLVLAGRDTTSSALTWFFWLLSKHPQVMAKIRHEINTKFDN-------EDLEKLVYLH 358
Query: 122 QVISEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLW-SDPLEF 180
+SE++RLYP + + +P S H + + I YAL ++W D L+F
Sbjct: 359 AALSESMRLYPPLPFNHKAPAKPDVLP-SGHKVDAESKIVICIYALGRMRSVWGEDALDF 417
Query: 181 NPDRFAPENESKIVPGSY--VPFGDGPRTCI 209
P+R+ +N SY + F GPRTC+
Sbjct: 418 KPERWISDNGGLRHEPSYKFMAFNSGPRTCL 448
>gnl|CDD|178373 PLN02774, PLN02774, brassinosteroid-6-oxidase.
Length = 463
Score = 63.6 bits (155), Expect = 5e-12
Identities = 42/156 (26%), Positives = 76/156 (48%), Gaps = 8/156 (5%)
Query: 56 QEVVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVY--KEGGGKVTYED 113
+E++ + ++ G+E ++TS + L + + + R+E + K + + D
Sbjct: 263 EEIIDQIITILYSGYETVSTTSMMAVKYLHDHPKALQELRKEHLAIRERKRPEDPIDWND 322
Query: 114 LAKMTYLEQVISEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATL 173
M + VI E RL + NG+ R+ TQD + + +VI KG +++ T + D L
Sbjct: 323 YKSMRFTRAVIFETSRLATIVNGVLRKTTQDMEL--NGYVIPKGWRIYVYTREINYDPFL 380
Query: 174 WSDPLEFNPDRFAPEN-ESKIVPGSYVPFGDGPRTC 208
+ DP+ FNP R+ ++ ES + FG G R C
Sbjct: 381 YPDPMTFNPWRWLDKSLESH---NYFFLFGGGTRLC 413
>gnl|CDD|215221 PLN02394, PLN02394, trans-cinnamate 4-monooxygenase.
Length = 503
Score = 62.4 bits (152), Expect = 2e-11
Identities = 45/136 (33%), Positives = 64/136 (47%), Gaps = 19/136 (13%)
Query: 83 ELAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKMTYLEQVISEALRLY-------PLGN 135
EL + E+Q K R E+ V G +VT D K+ YL+ V+ E LRL+ P N
Sbjct: 319 ELVNHPEIQKKLRDELDTVLGPGN-QVTEPDTHKLPYLQAVVKETLRLHMAIPLLVPHMN 377
Query: 136 GLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDRFAPEN---ESK 192
L Y IP + +LV+ + L + LW +P EF P+RF E E+
Sbjct: 378 -LEDAKLGGYDIPA-----ESKILVN--AWWLANNPELWKNPEEFRPERFLEEEAKVEAN 429
Query: 193 IVPGSYVPFGDGPRTC 208
++PFG G R+C
Sbjct: 430 GNDFRFLPFGVGRRSC 445
>gnl|CDD|178550 PLN02966, PLN02966, cytochrome P450 83A1.
Length = 502
Score = 61.3 bits (148), Expect = 4e-11
Identities = 45/148 (30%), Positives = 73/148 (49%), Gaps = 6/148 (4%)
Query: 65 LIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGK-VTYEDLAKMTYLEQV 123
++V G + + + + L +V KA+ EV++ KE G VT +D+ + Y +
Sbjct: 297 IVVAGTDTAAAAVVWGMTYLMKYPQVLKKAQAEVREYMKEKGSTFVTEDDVKNLPYFRAL 356
Query: 124 ISEALRLYPLGNGLF-RECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWS-DPLEFN 181
+ E LR+ P+ L R C QD I + + I G V++ +A+ D W +P EF
Sbjct: 357 VKETLRIEPVIPLLIPRACIQDTKI--AGYDIPAGTTVNVNAWAVSRDEKEWGPNPDEFR 414
Query: 182 PDRF-APENESKIVPGSYVPFGDGPRTC 208
P+RF E + K ++PFG G R C
Sbjct: 415 PERFLEKEVDFKGTDYEFIPFGSGRRMC 442
>gnl|CDD|178773 PLN03234, PLN03234, cytochrome P450 83B1; Provisional.
Length = 499
Score = 61.2 bits (148), Expect = 5e-11
Identities = 47/158 (29%), Positives = 75/158 (47%), Gaps = 8/158 (5%)
Query: 56 QEVVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYEDLA 115
+ V + + ++V G + + + + L E KA+ EV+ V + G V+ ED+
Sbjct: 287 ENVKAMILDIVVPGTDTAAAVVVWAMTYLIKYPEAMKKAQDEVRNVIGDKG-YVSEEDIP 345
Query: 116 KMTYLEQVISEALRLYPLGNGLF-RECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLW 174
+ YL+ VI E+LRL P+ L RE D I + I ++ + +A+ D W
Sbjct: 346 NLPYLKAVIKESLRLEPVIPILLHRETIADAKI--GGYDIPAKTIIQVNAWAVSRDTAAW 403
Query: 175 SD-PLEFNPDRFAPENESKIVPGS---YVPFGDGPRTC 208
D P EF P+RF E++ G +PFG G R C
Sbjct: 404 GDNPNEFIPERFMKEHKGVDFKGQDFELLPFGSGRRMC 441
>gnl|CDD|177847 PLN02196, PLN02196, abscisic acid 8'-hydroxylase.
Length = 463
Score = 59.6 bits (144), Expect = 2e-10
Identities = 47/158 (29%), Positives = 80/158 (50%), Gaps = 14/158 (8%)
Query: 56 QEVVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVY--KEGGGKVTYED 113
+++ + +I + + S T +L LA N V + E + KE G +T+ED
Sbjct: 263 EQIADNIIGVIFAARDTTASVLTWILKYLAENPSVLEAVTEEQMAIRKDKEEGESLTWED 322
Query: 114 LAKMTYLEQVISEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYA-LQTDAT 172
KM +VI E LR+ + + FRE +D + ++I KG V +P + + A
Sbjct: 323 TKKMPLTSRVIQETLRVASILSFTFREAVED--VEYEGYLIPKGWKV-LPLFRNIHHSAD 379
Query: 173 LWSDPLEFNPDRF--APENESKIVPGSYVPFGDGPRTC 208
++SDP +F+P RF AP+ P +++PFG+G +C
Sbjct: 380 IFSDPGKFDPSRFEVAPK------PNTFMPFGNGTHSC 411
>gnl|CDD|166628 PLN02987, PLN02987, Cytochrome P450, family 90, subfamily A.
Length = 472
Score = 58.8 bits (142), Expect = 2e-10
Identities = 42/156 (26%), Positives = 71/156 (45%), Gaps = 6/156 (3%)
Query: 56 QEVVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKV--YKEGGGKVTYED 113
+E+V + L+V G+E +++ T + L + + E +K+ K + + D
Sbjct: 266 EEIVDFLVALLVAGYETTSTIMTLAVKFLTETPLALAQLKEEHEKIRAMKSDSYSLEWSD 325
Query: 114 LAKMTYLEQVISEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATL 173
M + + V++E LR+ + G+FR D + + I KG V A+ D
Sbjct: 326 YKSMPFTQCVVNETLRVANIIGGIFRRAMTDIEVKG--YTIPKGWKVFASFRAVHLDHEY 383
Query: 174 WSDPLEFNPDRFAPENESKIVPGS-YVPFGDGPRTC 208
+ D FNP R+ N VP + + PFG GPR C
Sbjct: 384 FKDARTFNPWRWQ-SNSGTTVPSNVFTPFGGGPRLC 418
>gnl|CDD|215276 PLN02500, PLN02500, cytochrome P450 90B1.
Length = 490
Score = 58.7 bits (142), Expect = 3e-10
Identities = 48/175 (27%), Positives = 76/175 (43%), Gaps = 35/175 (20%)
Query: 56 QEVVSGVFVLIVGGHENST----------STSTNVLHELAYNQEVQDKARREVQKVYKEG 105
++++ + L+ GHE S+ + EL +E + AR + Q G
Sbjct: 278 EQILDLILSLLFAGHETSSVAIALAIFFLQGCPKAVQEL--REEHLEIARAKKQS----G 331
Query: 106 GGKVTYEDLAKMTYLEQVISEALRLYPLGNGLFRECTQD-----YAIPDSPHVIQKGVLV 160
++ +ED KM + + VI+E LRL + L R+ +D Y IP V+ V
Sbjct: 332 ESELNWEDYKKMEFTQCVINETLRLGNVVRFLHRKALKDVRYKGYDIPSGWKVLPVIAAV 391
Query: 161 HIPTYALQTDATLWSDPLEFNPDRFAPENESKIVPGS-------YVPFGDGPRTC 208
H+ D++L+ P FNP R+ N GS ++PFG GPR C
Sbjct: 392 HL-------DSSLYDQPQLFNPWRWQQNNNRGGSSGSSSATTNNFMPFGGGPRLC 439
>gnl|CDD|215627 PLN03195, PLN03195, fatty acid omega-hydroxylase; Provisional.
Length = 516
Score = 56.7 bits (137), Expect = 1e-09
Identities = 41/168 (24%), Positives = 72/168 (42%), Gaps = 28/168 (16%)
Query: 66 IVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGK----------------- 108
++ G + + +T + ++ + N V +K E++ + KE +
Sbjct: 301 VIAGRDTTATTLSWFVYMIMMNPHVAEKLYSELKALEKERAKEEDPEDSQSFNQRVTQFA 360
Query: 109 --VTYEDLAKMTYLEQVISEALRLYPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYA 166
+TY+ L K+ YL VI+E LRLYP + +D +PD ++ G +V Y+
Sbjct: 361 GLLTYDSLGKLQYLHAVITETLRLYPAVPQDPKGILEDDVLPDGT-KVKAGGMVTYVPYS 419
Query: 167 LQTDATLW-SDPLEFNPDRFAPE----NESKIVPGSYVPFGDGPRTCI 209
+ W D F P+R+ + N S P + F GPR C+
Sbjct: 420 MGRMEYNWGPDAASFKPERWIKDGVFQNAS---PFKFTAFQAGPRICL 464
>gnl|CDD|215350 PLN02648, PLN02648, allene oxide synthase.
Length = 480
Score = 53.4 bits (129), Expect = 2e-08
Identities = 43/130 (33%), Positives = 64/130 (49%), Gaps = 18/130 (13%)
Query: 87 NQEVQDKARREVQKVYKEGGGKVTYEDLAKMTYLEQVISEALRLYP---LGNGLFRECTQ 143
+E+Q + EV+ K GGG VT+ L KM ++ V+ EALR+ P G RE
Sbjct: 303 GEELQARLAEEVRSAVKAGGGGVTFAALEKMPLVKSVVYEALRIEPPVPFQYGRARE--- 359
Query: 144 DYAIP--DSPHVIQKGVLVHIPTYALQ----TDATLWSDPLEFNPDRFAPENESKIVPGS 197
D+ I D+ I+KG ++ + Q D ++ P EF PDRF E K++
Sbjct: 360 DFVIESHDAAFEIKKGEML----FGYQPLVTRDPKVFDRPEEFVPDRFMGEEGEKLLK-- 413
Query: 198 YVPFGDGPRT 207
YV + +G T
Sbjct: 414 YVFWSNGRET 423
>gnl|CDD|166612 PLN02971, PLN02971, tryptophan N-hydroxylase.
Length = 543
Score = 53.1 bits (127), Expect = 2e-08
Identities = 43/156 (27%), Positives = 68/156 (43%), Gaps = 23/156 (14%)
Query: 65 LIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGK---VTYEDLAKMTYLE 121
L++ +N ++ + E+ E+ KA E+ +V GK V D+ K+ Y++
Sbjct: 335 LVMAAPDNPSNAVEWAMAEMINKPEILHKAMEEIDRVV----GKERFVQESDIPKLNYVK 390
Query: 122 QVISEALRLYPLGN------GLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWS 175
+I EA RL+P+ L Y IP KG V + Y L + +WS
Sbjct: 391 AIIREAFRLHPVAAFNLPHVALSDTTVAGYHIP-------KGSQVLLSRYGLGRNPKVWS 443
Query: 176 DPLEFNPDRFAPE-NESKIVPGS--YVPFGDGPRTC 208
DPL F P+R E +E + ++ F G R C
Sbjct: 444 DPLSFKPERHLNECSEVTLTENDLRFISFSTGKRGC 479
>gnl|CDD|215600 PLN03141, PLN03141, 3-epi-6-deoxocathasterone 23-monooxygenase;
Provisional.
Length = 452
Score = 51.3 bits (123), Expect = 8e-08
Identities = 33/125 (26%), Positives = 60/125 (48%), Gaps = 7/125 (5%)
Query: 84 LAYNQEVQDKARREVQKVYKEGGGKVTYEDLAKMTYLEQVISEALRLYPLGNGLFRECTQ 143
+A Q ++ + + K + G + + D + + + VI+E LR+ + NG+ R+ +
Sbjct: 283 VALQQLTEENMKLKRLKA--DTGEPLYWTDYMSLPFTQNVITETLRMGNIINGVMRKAMK 340
Query: 144 DYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDRFAPENESKIVPGSYVPFGD 203
D I ++I KG V ++ D + +P +FNP R+ E + S+ PFG
Sbjct: 341 DVEIKG--YLIPKGWCVLAYFRSVHLDEENYDNPYQFNPWRW---QEKDMNNSSFTPFGG 395
Query: 204 GPRTC 208
G R C
Sbjct: 396 GQRLC 400
>gnl|CDD|178592 PLN03018, PLN03018, homomethionine N-hydroxylase.
Length = 534
Score = 42.3 bits (99), Expect = 1e-04
Identities = 37/147 (25%), Positives = 64/147 (43%), Gaps = 12/147 (8%)
Query: 71 ENSTSTSTNVLHELAYNQEVQDKARREVQKVYKEGGGKVTYE-DLAKMTYLEQVISEALR 129
+N + L E+ N E+ KA +E+ +V G ++ E D+ + YL+ E R
Sbjct: 328 DNPANNMEWTLGEMLKNPEILRKALKELDEVV--GKDRLVQESDIPNLNYLKACCRETFR 385
Query: 130 LYPLGNGLFRECT-QDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDRF--- 185
++P + + QD + + I KG +H+ L + +W DPL + P+R
Sbjct: 386 IHPSAHYVPPHVARQDTTL--GGYFIPKGSHIHVCRPGLGRNPKIWKDPLVYEPERHLQG 443
Query: 186 -APENESKIVPGS--YVPFGDGPRTCI 209
E +V +V F G R C+
Sbjct: 444 DGITKEVTLVETEMRFVSFSTGRRGCV 470
>gnl|CDD|240427 PTZ00470, PTZ00470, glycoside hydrolase family 47 protein;
Provisional.
Length = 522
Score = 37.8 bits (88), Expect = 0.003
Identities = 14/25 (56%), Positives = 17/25 (68%)
Query: 24 TTTTNNNNNNNNNNNNNKNNINSPN 48
+ NN ++N NNNNNN N INS N
Sbjct: 8 LSVHNNADDNYNNNNNNNNQINSNN 32
Score = 37.8 bits (88), Expect = 0.003
Identities = 14/30 (46%), Positives = 19/30 (63%)
Query: 22 TTTTTTNNNNNNNNNNNNNKNNINSPNFLG 51
+ ++N NNNNNNNN+ N N+PN G
Sbjct: 8 LSVHNNADDNYNNNNNNNNQINSNNPNNNG 37
Score = 37.0 bits (86), Expect = 0.005
Identities = 12/28 (42%), Positives = 16/28 (57%)
Query: 21 TTTTTTTNNNNNNNNNNNNNKNNINSPN 48
+ ++N NNNNNNNN N+ N N
Sbjct: 8 LSVHNNADDNYNNNNNNNNQINSNNPNN 35
Score = 35.9 bits (83), Expect = 0.012
Identities = 11/29 (37%), Positives = 15/29 (51%)
Query: 20 TTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
+ +N NNNNNNNN +N + N
Sbjct: 8 LSVHNNADDNYNNNNNNNNQINSNNPNNN 36
Score = 35.9 bits (83), Expect = 0.012
Identities = 13/32 (40%), Positives = 15/32 (46%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
+ NNNNNNNN N+N N N N
Sbjct: 8 LSVHNNADDNYNNNNNNNNQINSNNPNNNGNN 39
Score = 32.4 bits (74), Expect = 0.15
Identities = 12/32 (37%), Positives = 15/32 (46%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
+ NNNNNNN N+NN NN +
Sbjct: 9 SVHNNADDNYNNNNNNNNQINSNNPNNNGNNQ 40
Score = 29.3 bits (66), Expect = 1.9
Identities = 12/32 (37%), Positives = 13/32 (40%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
NNN N+NN NNN NN S
Sbjct: 13 NADDNYNNNNNNNNQINSNNPNNNGNNQASKL 44
>gnl|CDD|219312 pfam07150, DUF1390, Protein of unknown function (DUF1390). This
family consists of several Paramecium bursaria
chlorella virus 1 (PBCV-1) proteins of around 250
residues in length. The function of this family is
unknown.
Length = 229
Score = 35.1 bits (81), Expect = 0.016
Identities = 12/29 (41%), Positives = 13/29 (44%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNIN 45
TT + T NNN NN N N NI
Sbjct: 53 ATTVSNINTGVVNNNVNNINKTINTKNIT 81
Score = 30.4 bits (69), Expect = 0.55
Identities = 10/29 (34%), Positives = 11/29 (37%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNIN 45
T + T NN NN N N IN
Sbjct: 55 TVSNINTGVVNNNVNNINKTINTKNITIN 83
Score = 29.7 bits (67), Expect = 1.0
Identities = 10/42 (23%), Positives = 14/42 (33%)
Query: 19 TTTTTTTTTNNNNNNNNNNNNNKNNINSPNFLGVIRMQEVVS 60
TT + NNN NN N N + + + V
Sbjct: 52 LATTVSNINTGVVNNNVNNINKTINTKNITINLTVPDKSTVE 93
>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain. This is a
short domain found in bacterial type II/III secretory
system proteins. The architecture of these proteins
suggest that this family may be functionally analogous
to pfam03958.
Length = 95
Score = 32.7 bits (75), Expect = 0.029
Identities = 6/30 (20%), Positives = 23/30 (76%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINS 46
T+ T+ + +++ +N++++++N++N + +S
Sbjct: 19 TSVTSGSVSSSGSNSSSSSSNSSNGGSSSS 48
Score = 31.5 bits (72), Expect = 0.072
Identities = 5/31 (16%), Positives = 22/31 (70%)
Query: 18 TTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
T+ T+ + +++ +N++++++N++ +S +
Sbjct: 19 TSVTSGSVSSSGSNSSSSSSNSSNGGSSSSS 49
Score = 26.9 bits (60), Expect = 2.9
Identities = 2/30 (6%), Positives = 21/30 (70%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINS 46
+ + +++ + ++++++N++N ++ ++ +
Sbjct: 23 SGSVSSSGSNSSSSSSNSSNGGSSSSSSSG 52
>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
Provisional.
Length = 481
Score = 34.0 bits (77), Expect = 0.049
Identities = 14/36 (38%), Positives = 16/36 (44%), Gaps = 4/36 (11%)
Query: 17 TTTTTTTTTTTNNNNNN----NNNNNNNKNNINSPN 48
T T T T TNN N + N N N N N + N
Sbjct: 315 NKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSN 350
Score = 33.2 bits (75), Expect = 0.080
Identities = 12/27 (44%), Positives = 15/27 (55%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNN 43
T T T+T + N N N N+N N N N
Sbjct: 325 TNNTNTSTPSKNTNTNTNSNTNTNSNT 351
Score = 33.2 bits (75), Expect = 0.082
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINS 46
T T T+T + N N N N+N N +N N+
Sbjct: 324 NTNNTNTSTPSKNTNTNTNSNTNTNSNTNA 353
Score = 32.9 bits (74), Expect = 0.11
Identities = 13/32 (40%), Positives = 17/32 (53%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
T T+T + T N N+N N N+N N S N
Sbjct: 328 TNTSTPSKNTNTNTNSNTNTNSNTNANQGSSN 359
Score = 32.5 bits (73), Expect = 0.17
Identities = 14/32 (43%), Positives = 19/32 (59%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
T+T + T TN N+N N N+N N N +S N
Sbjct: 329 NTSTPSKNTNTNTNSNTNTNSNTNANQGSSNN 360
Score = 31.7 bits (71), Expect = 0.29
Identities = 12/30 (40%), Positives = 16/30 (53%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINS 46
T T T T+T + N N N N+N N N+
Sbjct: 322 NTNTNNTNTSTPSKNTNTNTNSNTNTNSNT 351
Score = 31.7 bits (71), Expect = 0.30
Identities = 13/32 (40%), Positives = 18/32 (56%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
T T T T+T + N N N N+N N N+ + N
Sbjct: 323 TNTNNTNTSTPSKNTNTNTNSNTNTNSNTNAN 354
Score = 29.8 bits (66), Expect = 1.1
Identities = 12/29 (41%), Positives = 16/29 (55%)
Query: 18 TTTTTTTTTTNNNNNNNNNNNNNKNNINS 46
+ T T T +N N N+N N N +N NS
Sbjct: 334 SKNTNTNTNSNTNTNSNTNANQGSSNNNS 362
Score = 29.0 bits (64), Expect = 2.2
Identities = 11/37 (29%), Positives = 19/37 (51%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPNFLGVI 53
+ T T T + N N+N N N + NN ++ + +I
Sbjct: 334 SKNTNTNTNSNTNTNSNTNANQGSSNNNSNSSASAII 370
Score = 28.6 bits (63), Expect = 3.1
Identities = 13/37 (35%), Positives = 14/37 (37%), Gaps = 6/37 (16%)
Query: 18 TTTTTTTTTTNNNNNNNNNNN------NNKNNINSPN 48
+T T T N N N NN N N N NS
Sbjct: 309 STNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNT 345
>gnl|CDD|151307 pfam10858, DUF2659, Protein of unknown function (DUF2659). This
bacterial family of proteins has no known function.
Length = 220
Score = 33.4 bits (76), Expect = 0.063
Identities = 26/92 (28%), Positives = 35/92 (38%), Gaps = 11/92 (11%)
Query: 6 IFF-FYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNN---INSPNFLGVIRMQEVVSG 61
IFF L I+ + T NNNN NN KN + S + E
Sbjct: 19 IFFKKLLPIVIIISLIAITIMVINNNNKARQIANNQKNGDIFVKSVD-------LESTKN 71
Query: 62 VFVLIVGGHENSTSTSTNVLHELAYNQEVQDK 93
L V EN S S + E+A+ ++V K
Sbjct: 72 NSELAVQTLENLVSNSNTKIQEIAHLEQVAIK 103
>gnl|CDD|214408 MTH00036, ATP8, ATP synthase F0 subunit 8; Validated.
Length = 54
Score = 30.0 bits (68), Expect = 0.11
Identities = 10/33 (30%), Positives = 19/33 (57%)
Query: 11 LFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNN 43
L + FT +TN +N+++N +++N NN
Sbjct: 19 LVFIVFTYLLLNNWPSTNASNSSSNTSSSNNNN 51
Score = 30.0 bits (68), Expect = 0.13
Identities = 8/34 (23%), Positives = 17/34 (50%)
Query: 6 IFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNN 39
F +F +T + +++N +++NNNN
Sbjct: 18 SLVFIVFTYLLLNNWPSTNASNSSSNTSSSNNNN 51
Score = 26.2 bits (58), Expect = 2.9
Identities = 9/42 (21%), Positives = 18/42 (42%)
Query: 5 GIFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNINS 46
F + L F T + N +N+++N + +N N+
Sbjct: 10 LFNFLLAWSLVFIVFTYLLLNNWPSTNASNSSSNTSSSNNNN 51
>gnl|CDD|240339 PTZ00265, PTZ00265, multidrug resistance protein (mdr1);
Provisional.
Length = 1466
Score = 32.7 bits (74), Expect = 0.14
Identities = 19/71 (26%), Positives = 32/71 (45%), Gaps = 7/71 (9%)
Query: 24 TTTTNNNNNNNNNNNNNKNNINSPNFLGVIRMQEVVSGVFVLIVGGHENSTSTSTNVLHE 83
T NNN NN ++NN NN N+ N + +G +++ G H+ + +
Sbjct: 671 TKDNKENNNKNNKDDNNNNNNNNNNKIN-------NAGSYIIEQGTHDALMKNKNGIYYT 723
Query: 84 LAYNQEVQDKA 94
+ NQ+V K
Sbjct: 724 MINNQKVSSKK 734
>gnl|CDD|214445 MTH00166, ND6, NADH dehydrogenase subunit 6; Provisional.
Length = 160
Score = 30.9 bits (71), Expect = 0.25
Identities = 9/54 (16%), Positives = 15/54 (27%), Gaps = 4/54 (7%)
Query: 1 MFTPGIFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNI----NSPNFL 50
++ + +N NNN N N ++ N P L
Sbjct: 76 FKFSLKLLILFLLMLLLMLNNFIKNSEMSNFNNNFNFFENSFSLSKLYNFPTML 129
>gnl|CDD|177251 MTH00208, ND5, NADH dehydrogenase subunit 5; Provisional.
Length = 628
Score = 31.9 bits (73), Expect = 0.26
Identities = 17/59 (28%), Positives = 30/59 (50%)
Query: 8 FFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNINSPNFLGVIRMQEVVSGVFVLI 66
F L I F ++ +T+N + NNNNN+ N NS + +++ +S + +LI
Sbjct: 17 FIILLISIFFSSNLPNISTSNASLGGNNNNNSIINYNNSNFSISLLKTLAFLSLINLLI 75
Score = 29.2 bits (66), Expect = 1.9
Identities = 13/48 (27%), Positives = 22/48 (45%)
Query: 6 IFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNINSPNFLGVI 53
IFF T+ + NN+ N NN+N + + + + FL +I
Sbjct: 24 IFFSSNLPNISTSNASLGGNNNNNSIINYNNSNFSISLLKTLAFLSLI 71
Score = 26.9 bits (60), Expect = 9.9
Identities = 12/37 (32%), Positives = 19/37 (51%)
Query: 6 IFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKN 42
I F+ L +T+ + NNNN+ N NN+N +
Sbjct: 22 ISIFFSSNLPNISTSNASLGGNNNNNSIINYNNSNFS 58
>gnl|CDD|178602 PLN03034, PLN03034, phosphoglycerate kinase; Provisional.
Length = 481
Score = 31.5 bits (71), Expect = 0.35
Identities = 12/20 (60%), Positives = 15/20 (75%)
Query: 183 DRFAPENESKIVPGSYVPFG 202
D+FAP+ SKIVP S +P G
Sbjct: 350 DKFAPDANSKIVPASAIPDG 369
>gnl|CDD|205936 pfam13763, DUF4167, Domain of unknown function (DUF4167).
Length = 80
Score = 29.2 bits (66), Expect = 0.40
Identities = 9/21 (42%), Positives = 12/21 (57%)
Query: 28 NNNNNNNNNNNNNKNNINSPN 48
NNN NNN + N +N + N
Sbjct: 5 NNNRNNNRSGGNPRNRVFDSN 25
Score = 28.0 bits (63), Expect = 0.98
Identities = 9/18 (50%), Positives = 10/18 (55%)
Query: 28 NNNNNNNNNNNNNKNNIN 45
NNNN NNN + N N
Sbjct: 2 GRNNNNRNNNRSGGNPRN 19
Score = 26.9 bits (60), Expect = 2.8
Identities = 9/19 (47%), Positives = 11/19 (57%)
Query: 30 NNNNNNNNNNNKNNINSPN 48
NNNN NNN++ N N
Sbjct: 1 RGRNNNNRNNNRSGGNPRN 19
Score = 26.5 bits (59), Expect = 3.9
Identities = 9/22 (40%), Positives = 11/22 (50%)
Query: 28 NNNNNNNNNNNNNKNNINSPNF 49
NNNN NNN + N + F
Sbjct: 1 RGRNNNNRNNNRSGGNPRNRVF 22
>gnl|CDD|150787 pfam10154, DUF2362, Uncharacterized conserved protein (DUF2362).
This is a family of proteins conserved from nematodes to
humans. The function is not known.
Length = 501
Score = 31.3 bits (71), Expect = 0.41
Identities = 5/21 (23%), Positives = 9/21 (42%)
Query: 23 TTTTTNNNNNNNNNNNNNKNN 43
+ T+ + NN +N N
Sbjct: 150 GKSLTDQDVNNLAAQHNENQN 170
Score = 29.4 bits (66), Expect = 1.7
Identities = 7/23 (30%), Positives = 12/23 (52%)
Query: 22 TTTTTTNNNNNNNNNNNNNKNNI 44
+ T + NN +N N+N+I
Sbjct: 150 GKSLTDQDVNNLAAQHNENQNSI 172
>gnl|CDD|219426 pfam07489, Tir_receptor_C, Translocated intimin receptor (Tir)
C-terminus. Intimin and its translocated intimin
receptor (Tir) are bacterial proteins that mediate
adhesion between mammalian cells and attaching and
effacing (A/E) pathogens. A unique and essential
feature of A/E bacterial pathogens is the formation of
actin-rich pedestals beneath the intimately adherent
bacteria and localised destruction of the intestinal
brush border. The bacterial outer membrane adhesin,
intimin, is necessary for the production of the A/E
lesion and diarrhoea. The A/E bacteria translocate
their own receptor for intimin, Tir, into the membrane
of mammalian cells using the type III secretion system.
The translocated Tir triggers additional host
signalling events and actin nucleation, which are
essential for lesion formation. This family represents
the Tir C-terminal domain which has been reported to
bind uninfected host cells and beta-1 integrins
although the role of intimin binding to integrins is
unclear. This intimin C-terminal domain has also been
shown to be sufficient for Tir recognition.
Length = 222
Score = 30.7 bits (69), Expect = 0.44
Identities = 14/31 (45%), Positives = 17/31 (54%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSP 47
TTTTTTTTTT N NN + N ++
Sbjct: 61 TTTTTTTTTTHTTVENKPANNTPAQGNTDTS 91
Score = 29.1 bits (65), Expect = 1.5
Identities = 14/22 (63%), Positives = 15/22 (68%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNN 38
TTTTTTTTTTT+ N NN
Sbjct: 60 TTTTTTTTTTTHTTVENKPANN 81
Score = 27.2 bits (60), Expect = 5.8
Identities = 13/22 (59%), Positives = 14/22 (63%)
Query: 18 TTTTTTTTTTNNNNNNNNNNNN 39
TTTTTTTTTT + N NN
Sbjct: 60 TTTTTTTTTTTHTTVENKPANN 81
>gnl|CDD|214442 MTH00160, ND2, NADH dehydrogenase subunit 2; Provisional.
Length = 335
Score = 30.6 bits (70), Expect = 0.50
Identities = 6/48 (12%), Positives = 17/48 (35%)
Query: 6 IFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNINSPNFLGVI 53
I ++ + +++ N N + N ++ + G+I
Sbjct: 280 ITLYFYLRISYSSLMLNYNKNKWNLKKKKNKFSLILNFLSFISLFGLI 327
Score = 30.2 bits (69), Expect = 0.66
Identities = 12/50 (24%), Positives = 17/50 (34%), Gaps = 7/50 (14%)
Query: 1 MFTPGIFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNINSPNFL 50
M + +FYL I + ++ N N N N NFL
Sbjct: 276 MSSLITLYFYLRISY-------SSLMLNYNKNKWNLKKKKNKFSLILNFL 318
Score = 28.3 bits (64), Expect = 3.3
Identities = 8/39 (20%), Positives = 10/39 (25%)
Query: 7 FFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNIN 45
F Y I N N+NN + K
Sbjct: 197 FLIYSIISLSIILLFNKFNIFFLNQLFNSNNKSKKIKFL 235
>gnl|CDD|165923 PLN02282, PLN02282, phosphoglycerate kinase.
Length = 401
Score = 30.7 bits (69), Expect = 0.52
Identities = 23/72 (31%), Positives = 32/72 (44%), Gaps = 13/72 (18%)
Query: 131 YPLGNGLFRECTQDYAIPDSPHVIQKGVLVHIPTYALQTDATLWSDPLEFNPDRFAPENE 190
Y +G+ L E D A KGV + +PT D + D+FAP+
Sbjct: 236 YSVGSSLVEEDKLDLATSLIEKAKAKGVSLLLPT-----DVVI--------ADKFAPDAN 282
Query: 191 SKIVPGSYVPFG 202
SK+VP S +P G
Sbjct: 283 SKVVPASAIPDG 294
>gnl|CDD|205235 pfam13054, DUF3915, Protein of unknown function (DUF3915). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 120 amino acids in length.
Length = 116
Score = 29.1 bits (65), Expect = 0.84
Identities = 8/27 (29%), Positives = 17/27 (62%)
Query: 29 NNNNNNNNNNNNKNNINSPNFLGVIRM 55
++ +N +++KNN N F G++R+
Sbjct: 82 AMGDDKDNKDDDKNNQNRNKFTGILRV 108
>gnl|CDD|218869 pfam06043, Reo_P9, Reovirus P9-like family.
Length = 333
Score = 29.8 bits (67), Expect = 0.91
Identities = 16/42 (38%), Positives = 20/42 (47%), Gaps = 3/42 (7%)
Query: 25 TTTNNNNNNNNNNNNNKNNINSPNFLGVIRMQEVVSGVFVLI 66
TTTNN NN N N NN N N R + G++ L+
Sbjct: 18 TTTNNQNNFNTNTQNNSNT---ENRNSTKRKPILDDGIYHLL 56
>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160. This
family consists of several special lobe-specific silk
protein SSP160 sequences which appear to be specific to
Chironomus (Midge) species.
Length = 758
Score = 30.1 bits (67), Expect = 1.00
Identities = 10/28 (35%), Positives = 20/28 (71%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNI 44
+TT + +TTTTN+ + N+ +++N + I
Sbjct: 295 STTNSNSTTTTNSTTSTNSTSSSNSSTI 322
Score = 29.7 bits (66), Expect = 1.3
Identities = 9/32 (28%), Positives = 20/32 (62%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
+ + +T+ +TTN+N+ N+ + N+ +S N
Sbjct: 287 SNSNSTSNSTTNSNSTTTTNSTTSTNSTSSSN 318
Score = 29.4 bits (65), Expect = 1.9
Identities = 9/37 (24%), Positives = 19/37 (51%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPNFLGVI 53
+T+ +TT + + N+ + N+ ++ NS G I
Sbjct: 290 NSTSNSTTNSNSTTTTNSTTSTNSTSSSNSSTIAGCI 326
Score = 28.2 bits (62), Expect = 3.8
Identities = 9/32 (28%), Positives = 20/32 (62%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
+T+ + +TT+NN+ ++N+ N+ +S N
Sbjct: 114 NSTSNSNSTTSNNSTTSSNSTTTTSNSTSSSN 145
Score = 27.0 bits (59), Expect = 10.0
Identities = 8/30 (26%), Positives = 20/30 (66%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINS 46
+TT+ +TT++N+ +N+ ++ N+ +S
Sbjct: 120 NSTTSNNSTTSSNSTTTTSNSTSSSNSTSS 149
>gnl|CDD|201496 pfam00895, ATP-synt_8, ATP synthase protein 8.
Length = 54
Score = 27.3 bits (61), Expect = 1.1
Identities = 7/42 (16%), Positives = 9/42 (21%)
Query: 6 IFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNINSP 47
IF L ++ T N N P
Sbjct: 12 IFLSSWLTLLIIIQLKISSHTPPNPPTKKTTLKKKTTPWNWP 53
>gnl|CDD|168775 PRK07034, PRK07034, hypothetical protein; Provisional.
Length = 536
Score = 29.0 bits (64), Expect = 2.0
Identities = 16/33 (48%), Positives = 19/33 (57%), Gaps = 1/33 (3%)
Query: 28 NNNNNNNNNNNNNKNNINSPNFLGVIRMQEVVS 60
N NNNN N +NNN N N+ N V RM +S
Sbjct: 95 NGNNNNPNGDNNNPNGSNN-NIARVQRMPHGIS 126
>gnl|CDD|239175 cd02774, MopB_Res-Cmplx1_Nad11-M, MopB_Res_Cmplx1_Nad11_M:
Mitochondrial-encoded NADH-quinone
oxidoreductase/respiratory complex I, the second domain
of the Nad11/75-kDa subunit of some protists.
NADH-quinone oxidoreductase is the first
energy-transducting complex in the respiratory chain and
functions as a redox pump that uses the redox energy to
translocate H+ ions across the membrane, resulting in a
significant contribution to energy production. The nad11
gene codes for the largest (75-kDa) subunit of the
mitochondrial NADH-quinone oxidoreductase, it
constitutes the electron input part of the enzyme, or
the so-called NADH dehydrogenase fragment. The Nad11
subunit is made of two domains: the first contains three
binding sites for FeS clusters (the fer2 domain), the
second domain (this CD), is of unknown function or, as
postulated, has lost an ancestral formate dehydrogenase
activity that became redundant during the evolution of
the complex I enzyme. Although only vestigial sequence
evidence remains of a molybdopterin binding site, this
protein domain belongs to the molybdopterin_binding
(MopB) superfamily of proteins.
Length = 366
Score = 28.5 bits (64), Expect = 2.5
Identities = 12/41 (29%), Positives = 21/41 (51%)
Query: 28 NNNNNNNNNNNNNKNNINSPNFLGVIRMQEVVSGVFVLIVG 68
N N+NN NNN N++ N+L ++ + L++G
Sbjct: 116 LNTNSNNFLENNNYFNLDLENYLFNNSLKNLDKSDLCLLIG 156
>gnl|CDD|221315 pfam11921, DUF3439, Domain of unknown function (DUF3439). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 46 to 94 amino acids in length. This domain is
found associated with pfam01462, pfam00560.
Length = 122
Score = 27.6 bits (61), Expect = 2.6
Identities = 11/11 (100%), Positives = 11/11 (100%)
Query: 17 TTTTTTTTTTT 27
TTTTTTTTTTT
Sbjct: 48 TTTTTTTTTTT 58
Score = 27.2 bits (60), Expect = 3.2
Identities = 11/11 (100%), Positives = 11/11 (100%)
Query: 17 TTTTTTTTTTT 27
TTTTTTTTTTT
Sbjct: 47 TTTTTTTTTTT 57
>gnl|CDD|224119 COG1198, PriA, Primosomal protein N' (replication factor Y) -
superfamily II helicase [DNA replication, recombination,
and repair].
Length = 730
Score = 28.8 bits (65), Expect = 2.7
Identities = 8/20 (40%), Positives = 10/20 (50%)
Query: 144 DYAIPDSPHVIQKGVLVHIP 163
DY IP+ Q G V +P
Sbjct: 17 DYLIPEGLEPDQPGSRVRVP 36
>gnl|CDD|177303 PHA00735, PHA00735, hypothetical protein.
Length = 808
Score = 28.7 bits (64), Expect = 3.1
Identities = 16/32 (50%), Positives = 18/32 (56%), Gaps = 1/32 (3%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
T TT+TTTT N + N N NNI SPN
Sbjct: 695 TQPTTSTTTTVNTSTLNYNYTQPFSNNI-SPN 725
>gnl|CDD|235670 PRK06009, flgD, flagellar basal body rod modification protein;
Reviewed.
Length = 140
Score = 27.4 bits (61), Expect = 3.4
Identities = 11/34 (32%), Positives = 24/34 (70%)
Query: 17 TTTTTTTTTTTNNNNNNNNNNNNNKNNINSPNFL 50
TT+ T+++TT++ + +N+ + NK ++N +FL
Sbjct: 6 TTSATSSSTTSSTSTASNSTSAANKASVNYDSFL 39
>gnl|CDD|234766 PRK00447, PRK00447, hypothetical protein; Provisional.
Length = 144
Score = 27.3 bits (61), Expect = 3.6
Identities = 15/56 (26%), Positives = 25/56 (44%), Gaps = 2/56 (3%)
Query: 48 NFLGVIRMQEVVSGVFVLIVGGHENSTSTSTNVLHELAYNQEVQDKARREVQKVYK 103
NF + + G + ++VG E T S LH L + + D AR+ + K +
Sbjct: 26 NFFDFEKFEAEDEGEYKILVG--EARTLKSLQKLHRLLRGERILDTARKYLMKGIE 79
>gnl|CDD|187593 cd05332, 11beta-HSD1_like_SDR_c, 11beta-hydroxysteroid
dehydrogenase type 1 (11beta-HSD1)-like, classical (c)
SDRs. Human 11beta_HSD1 catalyzes the NADP(H)-dependent
interconversion of cortisone and cortisol. This subgroup
also includes human dehydrogenase/reductase SDR family
member 7C (DHRS7C) and DHRS7B. These proteins have the
GxxxGxG nucleotide binding motif and S-Y-K catalytic
triad characteristic of the SDRs, but have an atypical
C-terminal domain that contributes to homodimerization
contacts. SDRs are a functionally diverse family of
oxidoreductases that have a single domain with a
structurally conserved Rossmann fold (alpha/beta folding
pattern with a central beta-sheet), an NAD(P)(H)-binding
region, and a structurally diverse C-terminal region.
Classical SDRs are typically about 250 residues long,
while extended SDRs are approximately 350 residues.
Sequence identity between different SDR enzymes are
typically in the 15-30% range, but the enzymes share the
Rossmann fold NAD-binding motif and characteristic
NAD-binding and catalytic sequence patterns. These
enzymes catalyze a wide range of activities including
the metabolism of steroids, cofactors, carbohydrates,
lipids, aromatic compounds, and amino acids, and act in
redox sensing. Classical SDRs have an TGXXX[AG]XG
cofactor binding motif and a YXXXK active site motif,
with the Tyr residue of the active site motif serving as
a critical catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase (15-PGDH)
numbering). In addition to the Tyr and Lys, there is
often an upstream Ser (Ser-138, 15-PGDH numbering)
and/or an Asn (Asn-107, 15-PGDH numbering) contributing
to the active site; while substrate binding is in the
C-terminal region, which determines specificity. The
standard reaction mechanism is a 4-pro-S hydride
transfer and proton relay involving the conserved Tyr
and Lys, a water molecule stabilized by Asn, and
nicotinamide. Extended SDRs have additional elements in
the C-terminal region, and typically have a TGXXGXXG
cofactor binding motif. Complex (multidomain) SDRs such
as ketoreductase domains of fatty acid synthase have a
GGXGXXG NAD(P)-binding motif and an altered active site
motif (YXXXN). Fungal type ketoacyl reductases have a
TGXXXGX(1-2)G NAD(P)-binding motif. Some atypical SDRs
have lost catalytic activity and/or have an unusual
NAD(P)-binding motif and missing or unusual active site
residues. Reactions catalyzed within the SDR family
include isomerization, decarboxylation, epimerization,
C=N bond reduction, dehydratase activity,
dehalogenation, Enoyl-CoA reduction, and
carbonyl-alcohol oxidoreduction.
Length = 257
Score = 27.9 bits (63), Expect = 4.1
Identities = 15/46 (32%), Positives = 24/46 (52%), Gaps = 8/46 (17%)
Query: 94 ARRE--VQKVYKE----GGGKVTY--EDLAKMTYLEQVISEALRLY 131
ARRE +++V E G D++ + EQV+ EAL+L+
Sbjct: 34 ARREERLEEVKSECLELGAPSPHVVPLDMSDLEDAEQVVEEALKLF 79
>gnl|CDD|215570 PLN03091, PLN03091, hypothetical protein; Provisional.
Length = 459
Score = 28.0 bits (62), Expect = 4.2
Identities = 8/23 (34%), Positives = 19/23 (82%)
Query: 25 TTTNNNNNNNNNNNNNKNNINSP 47
+++++ NN+NNNN++ +N+ +P
Sbjct: 190 SSSSSKINNSNNNNHSNSNLMTP 212
Score = 27.6 bits (61), Expect = 6.4
Identities = 8/19 (42%), Positives = 17/19 (89%)
Query: 22 TTTTTTNNNNNNNNNNNNN 40
+++++ NN+NNNN++N+N
Sbjct: 190 SSSSSKINNSNNNNHSNSN 208
Score = 27.2 bits (60), Expect = 6.7
Identities = 8/19 (42%), Positives = 17/19 (89%)
Query: 21 TTTTTTTNNNNNNNNNNNN 39
+++++ NN+NNNN++N+N
Sbjct: 190 SSSSSKINNSNNNNHSNSN 208
>gnl|CDD|214411 MTH00044, ND4, NADH dehydrogenase subunit 4; Validated.
Length = 458
Score = 27.6 bits (62), Expect = 4.9
Identities = 6/42 (14%), Positives = 13/42 (30%), Gaps = 7/42 (16%)
Query: 12 FILFFTTTTTTTTTT-------TNNNNNNNNNNNNNKNNINS 46
+ F T N+ + + +N + I+S
Sbjct: 392 ILSFPIIGLATVFGAIYSLLIYQLTNSGSFPSWISNVSPISS 433
>gnl|CDD|205311 pfam13130, DUF3952, Domain of unknown function (DUF3952). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria, and is approximately
110 amino acids in length. There is a conserved VMSAS
sequence motif.
Length = 107
Score = 26.6 bits (59), Expect = 5.1
Identities = 8/29 (27%), Positives = 12/29 (41%)
Query: 14 LFFTTTTTTTTTTTNNNNNNNNNNNNNKN 42
+ TT TT +N + NNN +
Sbjct: 77 SLYGETTQEITTDIDNEKEKDKNNNYKEE 105
>gnl|CDD|181285 PRK08199, PRK08199, thiamine pyrophosphate protein; Validated.
Length = 557
Score = 27.5 bits (62), Expect = 5.6
Identities = 14/43 (32%), Positives = 18/43 (41%), Gaps = 4/43 (9%)
Query: 142 TQDYAIPDSPHVIQKGVLVHIPTYAL----QTDATLWSDPLEF 180
TQ Y + D P Q V VH L + D + +DP F
Sbjct: 286 TQGYTLLDIPVPRQTLVHVHPDAEELGRVYRPDLAIVADPAAF 328
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 27.6 bits (62), Expect = 5.6
Identities = 7/25 (28%), Positives = 15/25 (60%)
Query: 88 QEVQDKARREVQKVYKEGGGKVTYE 112
+E+ +KA +EV +E G + ++
Sbjct: 279 EEMVEKATKEVDNAIREEGEQAAFD 303
>gnl|CDD|233208 TIGR00956, 3a01205, Pleiotropic Drug Resistance (PDR) Family
protein. [Transport and binding proteins, Other].
Length = 1394
Score = 27.8 bits (62), Expect = 6.1
Identities = 11/17 (64%), Positives = 11/17 (64%)
Query: 3 TPGIFFFYLFILFFTTT 19
T G FFFYL ILF T
Sbjct: 528 TAGRFFFYLLILFICTL 544
>gnl|CDD|227188 COG4851, CamS, Protein involved in sex pheromone biosynthesis
[General function prediction only].
Length = 382
Score = 27.5 bits (61), Expect = 6.4
Identities = 13/33 (39%), Positives = 18/33 (54%), Gaps = 2/33 (6%)
Query: 23 TTTTTNNNNNNNNNNNNNKNNINS--PNFLGVI 53
+TT NN N+N+N K I PN+ GV+
Sbjct: 273 STTAANNARNDNDNFLRFKEEIQGYFPNYTGVV 305
>gnl|CDD|177060 CHL00138, rps5, ribosomal protein S5; Validated.
Length = 143
Score = 26.5 bits (59), Expect = 6.8
Identities = 14/56 (25%), Positives = 23/56 (41%), Gaps = 9/56 (16%)
Query: 26 TTNNNNNNNNNNNNNKNNINSPNF----LGVIRMQEVVSG----VF-VLIVGGHEN 72
N N K+NI + + + R+ +VV G F ++V G+EN
Sbjct: 1 MLFLLKKMYNKKKNRKSNIKENKWEERVIQIKRVSKVVKGGKKLSFRAIVVIGNEN 56
>gnl|CDD|224842 COG1931, COG1931, Uncharacterized protein conserved in archaea
[Function unknown].
Length = 140
Score = 26.6 bits (59), Expect = 7.2
Identities = 11/30 (36%), Positives = 14/30 (46%)
Query: 71 ENSTSTSTNVLHELAYNQEVQDKARREVQK 100
E T S L EL Q + D AR ++K
Sbjct: 44 EGRTLKSLERLRELLRKQRILDTARMVLEK 73
>gnl|CDD|219019 pfam06420, Mgm101p, Mitochondrial genome maintenance MGM101. The
mgm101 gene was identified as essential for maintenance
of the mitochondrial genome in Saccharomyces
cerevisiae. Based on its DNA-binding activity, and
experimental work with a temperature-sensitive mgm101
mutant, it has been proposed that the mgm101 gene
product performs an essential function in the repair of
oxidatively damaged mitochondrial DNA.
Length = 241
Score = 26.8 bits (59), Expect = 8.7
Identities = 11/35 (31%), Positives = 16/35 (45%)
Query: 15 FFTTTTTTTTTTTNNNNNNNNNNNNNKNNINSPNF 49
F +TT+ + T NN NN+ N I + F
Sbjct: 4 FISTTSLRVPSNTKRNNGNNSKFNKYIKGIFNREF 38
>gnl|CDD|220296 pfam09580, Spore_YhcN_YlaJ, Sporulation lipoprotein YhcN/YlaJ
(Spore_YhcN_YlaJ). This entry contains YhcN and YlaJ,
which are predicted lipoproteins that have been
detected as spore proteins but not vegetative proteins
in Bacillus subtilis. Both appear to be expressed under
control of the RNA polymerase sigma-G factor. The
YlaJ-like members of this family have a low-complexity,
strongly acidic, 40-residue C-terminal domain.
Length = 169
Score = 26.5 bits (59), Expect = 9.0
Identities = 13/32 (40%), Positives = 14/32 (43%)
Query: 20 TTTTTTTTNNNNNNNNNNNNNKNNINSPNFLG 51
T T N NN NN NNNN + N G
Sbjct: 24 TDNDGNTNNVNNRNNRNNNNGYGRLTKNNVTG 55
>gnl|CDD|131944 TIGR02898, spore_YhcN_YlaJ, sporulation lipoprotein, YhcN/YlaJ
family. YhcN and YlaJ are predicted lipoproteins that
have been detected as spore proteins but not vegetative
proteins in Bacillus subtilis. Both appear to be
expressed under control of the RNA polymerase sigma-G
factor. The YlaJ-like members of this family have a
low-complexity, strongly acidic 40-residue C-terminal
domain that is not included in the seed alignment for
this model. A portion of the low-complexity region
between the lipoprotein signal sequence and the main
conserved region of the protein family was also excised
from the seed alignment [Cellular processes,
Sporulation and germination].
Length = 158
Score = 26.2 bits (58), Expect = 9.7
Identities = 13/43 (30%), Positives = 15/43 (34%)
Query: 6 IFFFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNNKNNINSPN 48
I F L +L T T NNN N +KNN
Sbjct: 1 ILFIILLLLLVLTGCTNAQKKNPPNNNVGETNVMSKNNNGMNT 43
>gnl|CDD|151365 pfam10918, DUF2718, Protein of unknown function (DUF2718). This
viral family of proteins has no known function.
Length = 140
Score = 26.1 bits (57), Expect = 9.7
Identities = 7/33 (21%), Positives = 12/33 (36%), Gaps = 1/33 (3%)
Query: 28 NNNNNNNNNNNNNKNNINSPNFLGVI-RMQEVV 59
N N N + + G + +M E+V
Sbjct: 107 YCYPYPNKNAKNQPKRVEEKSEKGFLEKMVEMV 139
>gnl|CDD|219031 pfam06444, NADH_dehy_S2_C, NADH dehydrogenase subunit 2
C-terminus. This family consists of the C-terminal
region specific to the eukaryotic NADH dehydrogenase
subunit 2 protein and is found in conjunction with
pfam00361.
Length = 56
Score = 24.8 bits (55), Expect = 9.8
Identities = 10/33 (30%), Positives = 14/33 (42%)
Query: 8 FFYLFILFFTTTTTTTTTTTNNNNNNNNNNNNN 40
FFYL + + TT T T+T+ N
Sbjct: 2 FFYLRLSYSTTLTLPPNTSTSTTKWRLKPKKNT 34
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.135 0.397
Gapped
Lambda K H
0.267 0.0729 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,061,052
Number of extensions: 1018099
Number of successful extensions: 2823
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2410
Number of HSP's successfully gapped: 188
Length of query: 217
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 124
Effective length of database: 6,812,680
Effective search space: 844772320
Effective search space used: 844772320
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 57 (25.7 bits)