RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17711
(335 letters)
>gnl|CDD|215484 PLN02894, PLN02894, hydrolase, alpha/beta fold family protein.
Length = 402
Score = 131 bits (331), Expect = 4e-35
Identities = 80/257 (31%), Positives = 118/257 (45%), Gaps = 38/257 (14%)
Query: 2 GRSSRPRFS-TDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
G SSRP F+ E E + S EEWRK L ILLGHSFGGY+A YA+++P+ V+
Sbjct: 143 GGSSRPDFTCKSTEETEAWFIDSFEEWRKAKNLSNFILLGHSFGGYVAAKYALKHPEHVQ 202
Query: 61 HLILADPWGFPQKSIDPQ------KASKIPLWARMIGNLY--KNFNPLWPVRFVGPLGQW 112
HLIL P GF +S D +A+ W + N NF P +R +GP G
Sbjct: 203 HLILVGPAGFSSESDDKSEWLTKFRAT----WKGAVLNHLWESNFTPQKIIRGLGPWG-- 256
Query: 113 VVEKMRPDLPKKFT-----------PVLKEDSSAITEYIFQCNVQAPSGESAFHTLTEGL 161
P+L +++T + +E+S +T+Y++ SGE +
Sbjct: 257 ------PNLVRRYTTARFGAHSTGDILSEEESKLLTDYVYHTLAAKASGELCLKYIFSFG 310
Query: 162 GYAKRPMLHRVDQLAAHVPVTVIYGSRSWVDNSSGDKIKEARSQ-NSFVQVKSVTGAGHH 220
+A++P+L + VP T IYG W++ EAR + ++ V GH
Sbjct: 311 AFARKPLLESASEWK--VPTTFIYGRHDWMNYEGA---VEARKRMKVPCEIIRVPQGGHF 365
Query: 221 VYADRADVFNKMVNDTC 237
V+ D F+ V C
Sbjct: 366 VFLDNPSGFHSAVLYAC 382
>gnl|CDD|201306 pfam00561, Abhydrolase_1, alpha/beta hydrolase fold. This
catalytic domain is found in a very wide range of
enzymes.
Length = 226
Score = 98.0 bits (244), Expect = 4e-24
Identities = 56/241 (23%), Positives = 87/241 (36%), Gaps = 31/241 (12%)
Query: 1 FGRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
FGRSS P+ + L +E L L ++ L+GHS GG +A AYA +YPDRVK
Sbjct: 11 FGRSSPPKD--FADYRFDDLAEDLEALLDALGLDKVNLVGHSMGGLIALAYAAKYPDRVK 68
Query: 61 HLILADPWGFPQKSIDPQKASKIPLWARMIGNLYKNFNPLWPVRFVGPLGQWVVEKMRPD 120
L+L PL R GNL + R VE +
Sbjct: 69 ALVLVGTV--------HPAGLSSPLTPR--GNLLGLLLDNFFNRLYDS-----VEALLGR 113
Query: 121 LPKKFTPVLKEDSSAITEYIFQCNVQAPSGESAFHTLTEG-----LGYAKRPMLHRVDQL 175
K+F + + S + ++ A L L + + L +D
Sbjct: 114 AIKQFQALGRPFVSDFLKQFELSSLIRFGETLALDGLLGYALGYDLVWDRSAALKDID-- 171
Query: 176 AAHVPVTVIYGSRSW-VDNSSGDKIKEARSQNSFVQVKSVTGAGHHVYADRADVFNKMVN 234
VP +I+G V + +K+ V + AGH ++ D +++
Sbjct: 172 ---VPTLIIWGDDDPLVPPDASEKLAALFPNAQLVVID---DAGHLAQLEKPDEVAELIL 225
Query: 235 D 235
Sbjct: 226 K 226
>gnl|CDD|223669 COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta
hydrolase superfamily) [General function prediction
only].
Length = 282
Score = 83.5 bits (205), Expect = 2e-18
Identities = 46/236 (19%), Positives = 80/236 (33%), Gaps = 20/236 (8%)
Query: 1 FGRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
GRS +S + L L++++L+GHS GG +A A A+++PDRV+
Sbjct: 61 HGRSDPAGYSLS------AYADDLAALLDALGLEKVVLVGHSMGGAVALALALRHPDRVR 114
Query: 61 HLILADPWGFPQKSIDPQKASKIPLWARMIGNLYKNFNPLWPVRFVGPLGQWVVEKMRPD 120
L+L P P + + +L + + LG +
Sbjct: 115 GLVLIGPAPPPGLLEAALRQPAGAAPLAALADLLLGLDAAAFAALLAALG------LLAA 168
Query: 121 LPKKFTPVLKEDSSAITEYIFQCNVQAPSGESAFHTLTEGLGYAKRPMLHRVDQLAAHVP 180
L L E A + L L R L R+ VP
Sbjct: 169 LAAAARAGLAEALRAPLLGAAAAAFARAARADLAAALLALLDRDLRAALARIT-----VP 223
Query: 181 VTVIYGSRS-WVDNSSGDKIKEARSQNSFVQVKSVTGAGHHVYADRADVFNKMVND 235
+I+G V ++ A ++ ++ + GAGH + + + F +
Sbjct: 224 TLIIHGEDDPVVPAELARRLAAALPNDA--RLVVIPGAGHFPHLEAPEAFAAALLA 277
>gnl|CDD|225176 COG2267, PldB, Lysophospholipase [Lipid metabolism].
Length = 298
Score = 65.5 bits (160), Expect = 4e-12
Identities = 51/249 (20%), Positives = 83/249 (33%), Gaps = 46/249 (18%)
Query: 1 FGRSSR-PRFSTDPETVERQLVTSIEEWRKELKLQ----EMILLGHSFGGYLAFAYAIQY 55
GRS R R D V ++ + + + + LLGHS GG +A Y +Y
Sbjct: 72 HGRSPRGQRGHVD---SFADYVDDLDAFVETIAEPDPGLPVFLLGHSMGGLIALLYLARY 128
Query: 56 PDRVKHLILADPWGFPQKSIDPQKASKI--PLWARMIGNLYKNFNPLWPVRFVGPL-GQW 112
P R+ L+L+ P +I +++ L R+ L + N L V
Sbjct: 129 PPRIDGLVLSSPALGLGGAILRLILARLALKLLGRIRPKLPVDSNLLEGVLTDDLSRDPA 188
Query: 113 VVEKMRPDLPKKFTPVLKEDSSAITEYIFQCNVQAPSGESAFHTLTEGLGYAKRPMLHRV 172
V D V + + P L
Sbjct: 189 EVAAYEAD----------------PLIGVGGPVSRWVDLALLAG--------RVPALR-- 222
Query: 173 DQLAAHVPVTVIYGSR-SWVDNSSG--DKIKEARSQNSFVQVKSVTGAGHHVYAD----R 225
D A +PV ++ G VDN G + A S + ++K + GA H + + R
Sbjct: 223 DAPAIALPVLLLQGGDDRVVDNVEGLARFFERAGSPD--KELKVIPGAYHELLNEPDRAR 280
Query: 226 ADVFNKMVN 234
+V ++
Sbjct: 281 EEVLKDILA 289
>gnl|CDD|221720 pfam12697, Abhydrolase_6, Alpha/beta hydrolase family. This family
contains alpha/beta hydrolase enzymes of diverse
specificity.
Length = 187
Score = 58.6 bits (142), Expect = 3e-10
Identities = 36/182 (19%), Positives = 60/182 (32%), Gaps = 33/182 (18%)
Query: 1 FGRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
G S P P ++E + L L ++L+GHS GG +A A A + P+RV
Sbjct: 35 HGDSDGPP--RTPYSLE-DDAADLAALLDALGLGPVVLVGHSLGGAVALAAAARRPERVA 91
Query: 61 HLILADPWGFPQKSIDPQKASKIPLWARMIGNLYKNFNPLWPVRFVGPLGQWVVEKMRPD 120
L+L P ++ L + +R
Sbjct: 92 GLVLISPPL--------------RDLEELLAADAAALLALLR-------AALLDADLREA 130
Query: 121 LPKKFTPVL----KEDSSAITEYIFQCNVQAPSGESAFHTLTEGLGYAKRPMLHRVDQLA 176
L + PVL ++D E + + + L G G+ P L +++A
Sbjct: 131 LARLTVPVLVIHGEDDPLVPPEAARR--LAEALPGAELVVL-PGAGHL--PHLEHPEEVA 185
Query: 177 AH 178
Sbjct: 186 EA 187
Score = 33.9 bits (78), Expect = 0.064
Identities = 13/57 (22%), Positives = 24/57 (42%), Gaps = 4/57 (7%)
Query: 176 AAHVPVTVIYGSRS-WVDNSSGDKIKEARSQNSFVQVKSVTGAGHHVYADRADVFNK 231
VPV VI+G V + ++ EA ++ + GAGH + + + +
Sbjct: 133 RLTVPVLVIHGEDDPLVPPEAARRLAEALPG---AELVVLPGAGHLPHLEHPEEVAE 186
>gnl|CDD|188121 TIGR01250, pro_imino_pep_2, proline-specific peptidase, Bacillus
coagulans-type subfamily. This model describes a
subfamily of the alpha/beta fold family of hydrolases.
Characterized members include prolinases (Pro-Xaa
dipeptidase, EC 3.4.13.8), prolyl aminopeptidases (EC
3.4.11.5), and a leucyl aminopeptidase.
Length = 289
Score = 58.2 bits (141), Expect = 1e-09
Identities = 32/100 (32%), Positives = 48/100 (48%), Gaps = 16/100 (16%)
Query: 2 GRSSRPRFSTDPE--TVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRV 59
G S +P S D E T++ V +EE R++L L + LLGHS+GG LA YA++Y +
Sbjct: 65 GYSDQPDDS-DEELWTIDY-FVDELEEVREKLGLDKFYLLGHSWGGMLAQEYALKYGQHL 122
Query: 60 KHLILADPWGFPQKSIDPQKASKIPLWARMIGNLYKNFNP 99
K LI++ P + + + L K P
Sbjct: 123 KGLIISSM------------LDSAPEYVKELNRLRKELPP 150
>gnl|CDD|234315 TIGR03695, menH_SHCHC,
2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate
synthase. This protein catalyzes the formation of
SHCHC, or (1 R,6
R)-2-succinyl-6-hydroxy-2,
4-cyclohexadiene-1-carboxylate, by elmination of
pyruvate from
2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-
carboxylate (SEPHCHC). Note that SHCHC synthase activity
previously was attributed to MenD, which in fact is
SEPHCHC synthase [Biosynthesis of cofactors, prosthetic
groups, and carriers, Menaquinone and ubiquinone].
Length = 252
Score = 46.1 bits (110), Expect = 8e-06
Identities = 21/63 (33%), Positives = 32/63 (50%), Gaps = 1/63 (1%)
Query: 2 GRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVKH 61
G S P + E + +L ++ L+G+S GG +A YA+QYP+RV+
Sbjct: 40 GSSQSPS-DIERYDFEEIAQLLLATLLDQLGIEPFFLVGYSMGGRIALYYALQYPERVQG 98
Query: 62 LIL 64
LIL
Sbjct: 99 LIL 101
>gnl|CDD|130316 TIGR01249, pro_imino_pep_1, proline iminopeptidase, Neisseria-type
subfamily. This model represents one of two related
families of proline iminopeptidase in the alpha/beta
fold hydrolase family. The fine specificities of the
various members, including both the range of short
peptides from which proline can be removed and whether
other amino acids such as alanine can be also removed,
may vary among members.
Length = 306
Score = 45.2 bits (107), Expect = 2e-05
Identities = 18/45 (40%), Positives = 29/45 (64%)
Query: 20 LVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLIL 64
LV IE+ R++L ++ ++ G S+G LA AYA +P+ V L+L
Sbjct: 81 LVADIEKLREKLGIKNWLVFGGSWGSTLALAYAQTHPEVVTGLVL 125
>gnl|CDD|214007 cd12808, Esterase_713_like-1, Uncharacterized enzymes similar to
novel bacterial esterase that cleaves esters on
halogenated cyclic compounds. This family contains
uncharacterized proteins similar to a novel bacterial
esterase (Alcaligenes esterase 713) with the alpha/beta
hydrolase fold but does not contain the GXSXXG
pentapeptide around the active site serine residue as
commonly seen in other enzymes of this class. Esterase
713 shows negligible sequence homology to other esterase
and lipase enzymes. It is active as a dimer and cleaves
esters on halogenated cyclic compounds though its
natural substrate is unknown.
Length = 309
Score = 44.2 bits (105), Expect = 5e-05
Identities = 14/36 (38%), Positives = 20/36 (55%)
Query: 37 ILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGFPQ 72
I++ HS GG AF A PD V+ ++ +P G P
Sbjct: 191 IVVAHSQGGGFAFEAARARPDLVRAVVALEPSGAPD 226
>gnl|CDD|221718 pfam12695, Abhydrolase_5, Alpha/beta hydrolase family. This family
contains a diverse range of alpha/beta hydrolase
enzymes.
Length = 145
Score = 40.1 bits (94), Expect = 3e-04
Identities = 15/75 (20%), Positives = 28/75 (37%), Gaps = 4/75 (5%)
Query: 13 PETVERQLVTSIEEWRKELK--LQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGF 70
P E + + ++L+GHS GG +A A + P ++LA
Sbjct: 35 PGHGASLGAPDAEAVLADAPLDPERIVLVGHSLGGGVALLLAARDPRVKAAVVLAAGDPP 94
Query: 71 PQKSIDPQKASKIPL 85
++D +P+
Sbjct: 95 D--ALDDLAKLTVPV 107
Score = 29.3 bits (66), Expect = 1.5
Identities = 17/62 (27%), Positives = 27/62 (43%), Gaps = 4/62 (6%)
Query: 160 GLGYAKRPMLHRVDQLAA-HVPVTVIYGSR-SWVDNSSGDKIKEARSQNSFVQVKSVTGA 217
+ A +D LA VPV +I+G+R V + + A + + V + GA
Sbjct: 85 AVVLAAGDPPDALDDLAKLTVPVLIIHGTRDGVVPPEEAEALAAALPGPAELVV--IEGA 142
Query: 218 GH 219
GH
Sbjct: 143 GH 144
>gnl|CDD|215315 PLN02578, PLN02578, hydrolase.
Length = 354
Score = 41.8 bits (98), Expect = 3e-04
Identities = 52/231 (22%), Positives = 95/231 (41%), Gaps = 31/231 (13%)
Query: 1 FGRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
FG S + D Q + ++ KE+ + +L+G+S GG+ A + A+ YP+ V
Sbjct: 123 FGWSDKALIEYDAMVWRDQ----VADFVKEVVKEPAVLVGNSLGGFTALSTAVGYPELVA 178
Query: 61 HLILADPWGFPQKSIDPQKASKIPLWARMIGNLYKNFNPL--WPVRFVGPLGQWVVEKMR 118
+ L + G K I + ++ PL W R V W ++
Sbjct: 179 GVALLNSAGQFGSESRE-KEEAIVVEETVLTRFV--VKPLKEWFQRVVLGFLFWQAKQ-- 233
Query: 119 PDLPKKFTPVLKE---DSSAITEYIFQCNVQAPS-----GESAFHTLTEGLGYAKRPMLH 170
P + VLK D S + +Y+ + ++ P+ GE + ++ L R L
Sbjct: 234 ---PSRIESVLKSVYKDKSNVDDYLVE-SITEPAADPNAGEVYYRLMSRFLFNQSRYTLD 289
Query: 171 RV-DQLAAHVPVTVIYG-SRSWVDNSSGDKIKEARSQNSFVQVKSVTGAGH 219
+ +L+ P+ +++G WV + +KIK + V ++ AGH
Sbjct: 290 SLLSKLSC--PLLLLWGDLDPWVGPAKAEKIKAFYPDTTLVNLQ----AGH 334
>gnl|CDD|132386 TIGR03343, biphenyl_bphD, 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate
hydrolase. Members of this family are
2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase, or
HOPD hydrolase, the BphD protein of biphenyl
degradation. BphD acts on the product of ring
meta-cleavage by BphC. Many species carrying bphC and
bphD are capable of degrading polychlorinated biphenyls
as well as biphenyl itself.
Length = 282
Score = 40.3 bits (94), Expect = 7e-04
Identities = 20/59 (33%), Positives = 31/59 (52%)
Query: 31 LKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGFPQKSIDPQKASKIPLWARM 89
L +++ L+G+S GG A +A++YPDR+ LIL P G P I L ++
Sbjct: 98 LDIEKAHLVGNSMGGATALNFALEYPDRIGKLILMGPGGLGPSLFAPMPMEGIKLLFKL 156
>gnl|CDD|215567 PLN03087, PLN03087, BODYGUARD 1 domain containing hydrolase;
Provisional.
Length = 481
Score = 39.4 bits (92), Expect = 0.002
Identities = 25/72 (34%), Positives = 36/72 (50%), Gaps = 4/72 (5%)
Query: 1 FGRSSRPRFSTDPETVERQLVTSIEEWRKE-LKLQEMILLGHSFGGYLAFAYAIQYPDRV 59
FGRS +P D R+ + IE E K++ ++ HS G LA A A+++P V
Sbjct: 243 FGRSPKP---ADSLYTLREHLEMIERSVLERYKVKSFHIVAHSLGCILALALAVKHPGAV 299
Query: 60 KHLILADPWGFP 71
K L L P +P
Sbjct: 300 KSLTLLAPPYYP 311
>gnl|CDD|184875 PRK14875, PRK14875, acetoin dehydrogenase E2 subunit
dihydrolipoyllysine-residue acetyltransferase;
Provisional.
Length = 371
Score = 39.2 bits (92), Expect = 0.002
Identities = 17/52 (32%), Positives = 26/52 (50%)
Query: 19 QLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGF 70
+L ++ + L ++ L+GHS GG +A A + P RV L L P G
Sbjct: 182 ELAAAVLAFLDALGIERAHLVGHSMGGAVALRLAARAPQRVASLTLIAPAGL 233
Score = 29.5 bits (67), Expect = 2.5
Identities = 20/67 (29%), Positives = 33/67 (49%), Gaps = 12/67 (17%)
Query: 173 DQLAAH-VPVTVIYGSRSWVDNSSGDKIKEAR-SQN--SFVQVKSVTGAGHHVYADRADV 228
D+LA+ +PV VI+G + D+I A +Q V V + GAGH + A
Sbjct: 308 DRLASLAIPVLVIWGEQ--------DRIIPAAHAQGLPDGVAVHVLPGAGHMPQMEAAAD 359
Query: 229 FNKMVND 235
N+++ +
Sbjct: 360 VNRLLAE 366
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 36.6 bits (85), Expect = 0.003
Identities = 12/33 (36%), Positives = 18/33 (54%)
Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
A EE +EEEE ++EEE++ EE+
Sbjct: 64 AAAAAAAAAAAAEEEEEEEEEEEEEEEESEEEA 96
Score = 35.8 bits (83), Expect = 0.005
Identities = 11/42 (26%), Positives = 19/42 (45%)
Query: 241 DEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
+E + V +EEEE ++EEE+++EE
Sbjct: 51 EEAIKKAAAAPVAAAAAAAAAAAAAEEEEEEEEEEEEEEEES 92
Score = 35.4 bits (82), Expect = 0.007
Identities = 10/31 (32%), Positives = 16/31 (51%)
Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEE 281
A E +EEEE ++EEE+++ E
Sbjct: 63 AAAAAAAAAAAAAEEEEEEEEEEEEEEEESE 93
Score = 32.3 bits (74), Expect = 0.072
Identities = 13/34 (38%), Positives = 18/34 (52%)
Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQ 284
A EEE +EEEE ++EEE+ +EE
Sbjct: 65 AAAAAAAAAAAEEEEEEEEEEEEEEEESEEEAMA 98
Score = 29.6 bits (67), Expect = 0.70
Identities = 9/44 (20%), Positives = 21/44 (47%)
Query: 245 DIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQD 288
+ + A P EE ++EEE+++EE+ + +++
Sbjct: 52 EAIKKAAAAPVAAAAAAAAAAAAAEEEEEEEEEEEEEEEESEEE 95
Score = 29.2 bits (66), Expect = 0.87
Identities = 11/52 (21%), Positives = 23/52 (44%)
Query: 239 LSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
L ++ KA P E ++EEE+++EE+ + +++A
Sbjct: 45 LEGVNIEEAIKKAAAAPVAAAAAAAAAAAAAEEEEEEEEEEEEEEEESEEEA 96
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 36.5 bits (85), Expect = 0.003
Identities = 9/30 (30%), Positives = 16/30 (53%)
Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
E K+EEE ++EE+++ EE+
Sbjct: 68 AAAAAAAAAAEEKKEEEEEEEEKEESEEEA 97
Score = 34.5 bits (80), Expect = 0.015
Identities = 8/32 (25%), Positives = 16/32 (50%)
Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
A +++EE ++EEEK++ E+
Sbjct: 64 AAAAAAAAAAAAAAEEKKEEEEEEEEKEESEE 95
Score = 33.8 bits (78), Expect = 0.030
Identities = 8/32 (25%), Positives = 16/32 (50%)
Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
A +E++E ++EEE+K+E +
Sbjct: 63 AAAAAAAAAAAAAAAEEKKEEEEEEEEKEESE 94
Score = 33.4 bits (77), Expect = 0.041
Identities = 12/33 (36%), Positives = 17/33 (51%)
Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
A EE KEEEE ++E+E+ +EE
Sbjct: 66 AAAAAAAAAAAAEEKKEEEEEEEEKEESEEEAA 98
Score = 33.0 bits (76), Expect = 0.047
Identities = 8/32 (25%), Positives = 14/32 (43%)
Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
EE++ ++EEE++KEE
Sbjct: 62 VAAAAAAAAAAAAAAAEEKKEEEEEEEEKEES 93
Score = 31.5 bits (72), Expect = 0.14
Identities = 8/42 (19%), Positives = 19/42 (45%)
Query: 241 DEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
+E + V EE++++EEE++++E+
Sbjct: 51 EEAIKKAAAAPVAAAAAAAAAAAAAAAEEKKEEEEEEEEKEE 92
Score = 29.2 bits (66), Expect = 1.1
Identities = 8/46 (17%), Positives = 20/46 (43%)
Query: 245 DIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
+ + A P +K+EE+++EE+ + +++A
Sbjct: 52 EAIKKAAAAPVAAAAAAAAAAAAAAAEEKKEEEEEEEEKEESEEEA 97
>gnl|CDD|179554 PRK03204, PRK03204, haloalkane dehalogenase; Provisional.
Length = 286
Score = 38.3 bits (89), Expect = 0.004
Identities = 22/83 (26%), Positives = 43/83 (51%), Gaps = 3/83 (3%)
Query: 1 FGRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
FG S RP S ++ + I E+ L L + +G +GG ++ A A++ DRV+
Sbjct: 71 FGLSERP--SGFGYQID-EHARVIGEFVDHLGLDRYLSMGQDWGGPISMAVAVERADRVR 127
Query: 61 HLILADPWGFPQKSIDPQKASKI 83
++L + W +P ++ + S++
Sbjct: 128 GVVLGNTWFWPADTLAMKAFSRV 150
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 37.2 bits (87), Expect = 0.004
Identities = 27/123 (21%), Positives = 43/123 (34%), Gaps = 14/123 (11%)
Query: 201 EARSQNSFVQVKSVTGAGHHVYADRADVFNKMVNDTCTLSDEKLDIVTTKAVKPPKEPQE 260
EA Q +F Q KS+ + AD ++ +L K + KE
Sbjct: 36 EASDQWAFYQAKSIKQHLAELAADLLELEAAAPRA-------ELQ---AKIARYKKEKAR 85
Query: 261 PEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVTFMTLGIVMVSMMILMTLLTMFLVLFG 320
E KE E + KE E + + A +V + + I + S + LT L+
Sbjct: 86 YRSEAKELEAKAKEAEAESDHALHQHHRFALAVALLQIAIALAS----IAALTKKKWLWY 141
Query: 321 LGY 323
Sbjct: 142 ASL 144
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 37.5 bits (88), Expect = 0.006
Identities = 11/32 (34%), Positives = 17/32 (53%)
Query: 252 VKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
+ Q E +EEEE ++EEE+ EE+
Sbjct: 290 EVLSAQAQAAAAEEEEEEEEEEEEEEPSEEEA 321
Score = 32.5 bits (75), Expect = 0.25
Identities = 16/47 (34%), Positives = 25/47 (53%), Gaps = 1/47 (2%)
Query: 230 NKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEE 276
+ D L +E ++++ +A E +E EEE +EEEE EEE
Sbjct: 275 AAALADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEE-PSEEE 320
Score = 32.5 bits (75), Expect = 0.26
Identities = 9/31 (29%), Positives = 17/31 (54%)
Query: 252 VKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
+K Q +EEEE ++EEE+++ +
Sbjct: 288 LKEVLSAQAQAAAAEEEEEEEEEEEEEEPSE 318
>gnl|CDD|216224 pfam00975, Thioesterase, Thioesterase domain. Peptide synthetases
are involved in the non-ribosomal synthesis of peptide
antibiotics. Next to the operons encoding these enzymes,
in almost all cases, are genes that encode proteins that
have similarity to the type II fatty acid thioesterases
of vertebrates. There are also modules within the
peptide synthetases that also share this similarity.
With respect to antibiotic production, thioesterases are
required for the addition of the last amino acid to the
peptide antibiotic, thereby forming a cyclic antibiotic.
Thioesterases (non-integrated) have molecular masses of
25-29 kDa.
Length = 224
Score = 36.2 bits (84), Expect = 0.012
Identities = 30/124 (24%), Positives = 44/124 (35%), Gaps = 27/124 (21%)
Query: 38 LLGHSFGGYLAFAYAIQYPDR---VKHLILADPW----GFPQKSIDPQKASKIPLWARMI 90
L GHS GG LAF A + R LIL+D + G + S + R+
Sbjct: 69 LFGHSMGGLLAFEVARRLERRGEEPAGLILSDAYAPHFGRREASHLLGDDELLAELRRIG 128
Query: 91 GNLYKNFNPLWPVRFVGPLGQWVVEKMRPDLPKKFTPVLKEDSSAITEYIFQCNVQAPSG 150
G + ++E +L P L+ D A+ Y + APS
Sbjct: 129 GTP-----------------EALLED--EELLSLLLPALRADYRALENYPIH-PIAAPSD 168
Query: 151 ESAF 154
+ F
Sbjct: 169 ATLF 172
>gnl|CDD|182637 PRK10673, PRK10673, acyl-CoA esterase; Provisional.
Length = 255
Score = 36.6 bits (85), Expect = 0.012
Identities = 12/37 (32%), Positives = 19/37 (51%)
Query: 30 ELKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILAD 66
L++++ +GHS GG A PDR+ L+ D
Sbjct: 77 ALQIEKATFIGHSMGGKAVMALTALAPDRIDKLVAID 113
>gnl|CDD|224932 COG2021, MET2, Homoserine acetyltransferase [Amino acid transport
and metabolism].
Length = 368
Score = 36.5 bits (85), Expect = 0.015
Identities = 13/27 (48%), Positives = 18/27 (66%)
Query: 37 ILLGHSFGGYLAFAYAIQYPDRVKHLI 63
++G S GG A +AI+YPDRV+ I
Sbjct: 150 AVVGGSMGGMQALEWAIRYPDRVRRAI 176
>gnl|CDD|235135 PRK03592, PRK03592, haloalkane dehalogenase; Provisional.
Length = 295
Score = 36.1 bits (84), Expect = 0.020
Identities = 14/37 (37%), Positives = 25/37 (67%)
Query: 24 IEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
++ W L L +++L+GH +G L F +A ++PDRV+
Sbjct: 83 LDAWFDALGLDDVVLVGHDWGSALGFDWAARHPDRVR 119
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 33.9 bits (78), Expect = 0.021
Identities = 12/32 (37%), Positives = 15/32 (46%)
Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
A E E E EE +KEEE ++E D
Sbjct: 69 AAAAAAAGAEAAAEADEAEEEEKEEEAEEESD 100
Score = 30.4 bits (69), Expect = 0.38
Identities = 8/32 (25%), Positives = 13/32 (40%)
Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
A E E + +EEEK++E +
Sbjct: 65 AAAGAAAAAAAGAEAAAEADEAEEEEKEEEAE 96
Score = 29.3 bits (66), Expect = 0.90
Identities = 8/34 (23%), Positives = 16/34 (47%)
Query: 250 KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
A E +E +EEE+ ++ EE+ ++
Sbjct: 71 AAAAAGAEAAAEADEAEEEEKEEEAEEESDDDML 104
Score = 28.9 bits (65), Expect = 1.5
Identities = 19/82 (23%), Positives = 30/82 (36%), Gaps = 16/82 (19%)
Query: 211 VKSVT-GAGHHVYADRADVFNKMVNDTCTLSDEKLDIVTTKAVKPP---------KEPQE 260
+KSV AG V RA L +D V A + P
Sbjct: 22 LKSVLEAAGVEVEEARAKALVA------ALEGVDIDEVIKNAAEAPAAAAAAGAAAAAAA 75
Query: 261 PEEEVKEEEERKKEEEKKKEED 282
E E +E ++EE++++ E+
Sbjct: 76 GAEAAAEADEAEEEEKEEEAEE 97
>gnl|CDD|233393 TIGR01392, homoserO_Ac_trn, homoserine O-acetyltransferase. This
family describes homoserine-O-acetyltransferase, an
enzyme of methionine biosynthesis. This model has been
rebuilt to identify sequences more broadly, including a
number of sequences suggested to be homoserine
O-acetyltransferase based on proximity to other Met
biosynthesis genes [Amino acid biosynthesis, Aspartate
family].
Length = 351
Score = 35.7 bits (83), Expect = 0.023
Identities = 16/61 (26%), Positives = 29/61 (47%), Gaps = 11/61 (18%)
Query: 7 PRFST-DPETVERQLVTS--IEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLI 63
P + D ++ L+ IE+ ++G S GG A +AI YP+RV+ ++
Sbjct: 105 PLITIRDDVKAQKLLLDHLGIEQIA--------AVVGGSMGGMQALEWAIDYPERVRAIV 156
Query: 64 L 64
+
Sbjct: 157 V 157
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 34.0 bits (78), Expect = 0.024
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEED 282
E+ ++EEE+KKEEEK++EE+
Sbjct: 68 AAAAAAAEEKAEEKEEEKKKEEEKEEEEE 96
Score = 34.0 bits (78), Expect = 0.026
Identities = 14/41 (34%), Positives = 23/41 (56%)
Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
+K + A EE+ +E+EE KK+EE+K+EE+
Sbjct: 55 KKAAVAAAAAAPAAAAAAAAEEKAEEKEEEKKKEEEKEEEE 95
Score = 33.2 bits (76), Expect = 0.037
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 252 VKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
E + E KEEE++K+EE++++EE+
Sbjct: 67 AAAAAAAAEEKAEEKEEEKKKEEEKEEEEEEA 98
Score = 31.3 bits (71), Expect = 0.21
Identities = 17/54 (31%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 239 LSDEKLDIVTTKAVKPPKE--PQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
L + +D KA P EE+ +KEEEKKKEE+ + ++++A
Sbjct: 45 LEEVNIDEAIKKAAVAAAAAAPAAAAAAAAEEKAEEKEEEKKKEEEKEEEEEEA 98
Score = 29.4 bits (66), Expect = 0.88
Identities = 10/33 (30%), Positives = 21/33 (63%)
Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
A +E EE +EE+++++E+E+++EE
Sbjct: 67 AAAAAAAAEEKAEEKEEEKKKEEEKEEEEEEAL 99
>gnl|CDD|214006 cd12807, Esterase_713, Novel bacterial esterase 713 that cleaves
esters on halogenated cyclic compounds. This family
contains proteins similar to a novel bacterial esterase
(esterase 713) with the alpha/beta hydrolase fold that
cleaves esters on halogenated cyclic compounds. This
Alcaligenes esterase, however, does not contain the
GXSXXG pentapeptide around the active site serine
residue as seen in other esterase families. This enzyme
is active as a dimer though its natural substrate is
unknown. It has two distinct disulfide bridges; one
formed between adjacent cysteines appears to facilitate
the correct formation of the oxyanion cleft in the
catalytic site. Esterase 713 also resembles human
pancreatic lipase in its location of the acidic residue
of the catalytic triad. It is possibly exported from the
cytosol to the periplasmic space. A large majority of
sequences in this family have yet to be characterized.
Length = 315
Score = 35.4 bits (82), Expect = 0.031
Identities = 23/77 (29%), Positives = 32/77 (41%), Gaps = 5/77 (6%)
Query: 32 KLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGFPQKSIDPQKA--SKIPLWARM 89
KL +LLGHS G A+ P VK ++ +P P Q +KIP+ +
Sbjct: 187 KLGGAVLLGHSQSGPFPLEAALLRPAGVKGIVSVEPGCCP-APTADQIKTLAKIPILV-V 244
Query: 90 IG-NLYKNFNPLWPVRF 105
G +L P W R
Sbjct: 245 FGDHLDAASGPTWQQRL 261
>gnl|CDD|216102 pfam00756, Esterase, Putative esterase. This family contains
Esterase D. However it is not clear if all members of
the family have the same function. This family is
related to the pfam00135 family.
Length = 245
Score = 34.4 bits (79), Expect = 0.055
Identities = 12/43 (27%), Positives = 15/43 (34%), Gaps = 8/43 (18%)
Query: 38 LLGHSFGGYLAFAYAIQYPDRVKHLIL--------ADPWGFPQ 72
L G S GG A A++YPD + WG
Sbjct: 113 LAGQSMGGLGALYLALKYPDLFGSVSSFSPILNPSNSMWGPED 155
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 34.8 bits (80), Expect = 0.064
Identities = 17/50 (34%), Positives = 27/50 (54%), Gaps = 1/50 (2%)
Query: 241 DEKLDIVTTKAVKPPKEP-QEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
++KL T K + E E E K +EE+ K+EE +K++ Q +DK
Sbjct: 375 NKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDK 424
Score = 29.8 bits (67), Expect = 2.0
Identities = 14/61 (22%), Positives = 30/61 (49%)
Query: 231 KMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
K N +KL V + + + +EE ++EE +K+++++ +ED + +Q
Sbjct: 372 KEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431
Query: 291 R 291
R
Sbjct: 432 R 432
Score = 29.8 bits (67), Expect = 2.4
Identities = 21/101 (20%), Positives = 49/101 (48%), Gaps = 9/101 (8%)
Query: 190 WVDNSSGDKIKEARSQNSFVQVKSVTGAGHHVYADRADVFNKMVNDTCTLSDEKLDIVTT 249
W+ + G+ KEA + + +T + A ++ ++ ++T LS +K +
Sbjct: 321 WIYSGRGE-FKEAINIARNLDDNDLT-----LLA-LTKLYEEVKSNT-DLSGDKRQELLK 372
Query: 250 KAVKPPKEPQEPEEEVKEE-EERKKEEEKKKEEDGQHQQDK 289
+ K ++ + EVK+E + ++ E K KEE + ++++
Sbjct: 373 EYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENE 413
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 34.5 bits (80), Expect = 0.073
Identities = 17/39 (43%), Positives = 27/39 (69%)
Query: 249 TKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQ 287
K + +E +E +EE KEEEE + EEEK++EE+ + +Q
Sbjct: 438 KKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQ 476
Score = 32.6 bits (75), Expect = 0.28
Identities = 15/45 (33%), Positives = 27/45 (60%)
Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQ 286
+K K + +E ++ EEE +EEEE +EE++++EE + Q
Sbjct: 432 KKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQ 476
Score = 32.6 bits (75), Expect = 0.32
Identities = 15/35 (42%), Positives = 24/35 (68%)
Query: 248 TTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
KA K+ +E EEE +++EE K+EEE++ EE+
Sbjct: 431 KKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465
Score = 31.0 bits (71), Expect = 1.0
Identities = 18/54 (33%), Positives = 32/54 (59%)
Query: 241 DEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVT 294
+EK + KE +E EE+ K+EEE+++EEE+ +EE + ++ K + T
Sbjct: 425 EEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQAT 478
Score = 29.1 bits (66), Expect = 3.6
Identities = 14/56 (25%), Positives = 28/56 (50%), Gaps = 2/56 (3%)
Query: 239 LSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVT 294
+K K V+ K ++ EEE KE++++ +KK+EE+ + ++ K
Sbjct: 403 TGSKKATKKIKKIVE--KAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456
Score = 29.1 bits (66), Expect = 3.7
Identities = 13/45 (28%), Positives = 24/45 (53%)
Query: 239 LSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
L++E+++ +T K + E+ K+ EE KKE++KK
Sbjct: 394 LTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGK 438
>gnl|CDD|234678 PRK00175, metX, homoserine O-acetyltransferase; Provisional.
Length = 379
Score = 34.4 bits (80), Expect = 0.076
Identities = 22/72 (30%), Positives = 35/72 (48%), Gaps = 19/72 (26%)
Query: 10 STDPETVE----RQLVTSIEEWRKELKLQEMIL-----------LGHSFGGYLAFAYAIQ 54
S +P+T + V +I +W ++ Q +L +G S GG A +AI
Sbjct: 111 SINPDTGKPYGSDFPVITIRDW---VRAQARLLDALGITRLAAVVGGSMGGMQALEWAID 167
Query: 55 YPDRVKH-LILA 65
YPDRV+ L++A
Sbjct: 168 YPDRVRSALVIA 179
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 33.5 bits (77), Expect = 0.079
Identities = 16/41 (39%), Positives = 25/41 (60%)
Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
+KL+ K +E +E E+ KEEEERK+ EE+ ++E
Sbjct: 32 KKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQ 72
Score = 28.9 bits (65), Expect = 2.8
Identities = 15/36 (41%), Positives = 25/36 (69%)
Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
K ++ E E KEEEE ++E EKKKEE+ + ++++
Sbjct: 33 KLEEKREGERKEEEELEEEREKKKEEEERKEREEQA 68
Score = 28.9 bits (65), Expect = 2.9
Identities = 12/36 (33%), Positives = 24/36 (66%)
Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
E + E +EEE ++ E+KK+EE+ + ++++AR
Sbjct: 34 LEEKREGERKEEEELEEEREKKKEEEERKEREEQAR 69
Score = 27.7 bits (62), Expect = 6.3
Identities = 12/36 (33%), Positives = 26/36 (72%)
Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
+E +E E E +++ E K+E E+K+EE+ + +++K +
Sbjct: 21 REAEEEEREERKKLEEKREGERKEEEELEEEREKKK 56
>gnl|CDD|225856 COG3319, COG3319, Thioesterase domains of type I polyketide
synthases or non-ribosomal peptide synthetases
[Secondary metabolites biosynthesis, transport, and
catabolism].
Length = 257
Score = 34.0 bits (78), Expect = 0.080
Identities = 14/35 (40%), Positives = 17/35 (48%), Gaps = 3/35 (8%)
Query: 37 ILLGHSFGGYLAFAYAIQYPDR---VKHLILADPW 68
+LLG S GG +AF A Q + V L L D
Sbjct: 68 VLLGWSLGGAVAFEVAAQLEAQGEEVAFLGLLDAV 102
>gnl|CDD|214005 cd12806, Esterase_713_like, Novel bacterial esterase that cleaves
esters on halogenated cyclic compounds. This family
contains proteins similar to a novel bacterial esterase
(Alcaligenes esterase 713) with the alpha/beta hydrolase
fold but does not contain the GXSXXG pentapeptide around
the active site serine residue as commonly seen in other
enzymes of this class. Esterase 713 shows negligible
sequence homology to other esterase and lipase enzymes.
It is active as a dimer and cleaves esters on
halogenated cyclic compounds though its natural
substrate is unknown. This enzyme is possibly exported
from the cytosol to the periplasmic space. A large
majority of sequences in this family have yet to be
characterized.
Length = 261
Score = 34.0 bits (78), Expect = 0.081
Identities = 22/73 (30%), Positives = 33/73 (45%), Gaps = 3/73 (4%)
Query: 32 KLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGFPQKSIDPQKASKIPLWARMIG 91
KL +LL HS G F A P +K ++ +P G P K D + + IP+ ++
Sbjct: 138 KLDPTVLLTHSQSGIFGFQTAAMRPKGIKAIVAVEPGGCP-KPEDVKPLTSIPVL--VVY 194
Query: 92 NLYKNFNPLWPVR 104
+ P W VR
Sbjct: 195 GDHIEEFPRWAVR 207
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 34.0 bits (77), Expect = 0.14
Identities = 15/44 (34%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 252 VKPPKEPQEPEEEVKEEEERKKEEE--KKKEEDGQHQQDKARSV 293
+K +E ++ EE+ K+ EE KK EE KK E + + ++A+
Sbjct: 1661 IKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKA 1704
Score = 32.4 bits (73), Expect = 0.38
Identities = 17/53 (32%), Positives = 29/53 (54%), Gaps = 3/53 (5%)
Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVK---EEEERKKEEEKKKEEDGQHQQDKAR 291
E+L + K +E ++ EEE K EE +K EE+KKK E+ + ++ +
Sbjct: 1636 EQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEK 1688
Score = 32.4 bits (73), Expect = 0.44
Identities = 22/73 (30%), Positives = 33/73 (45%), Gaps = 1/73 (1%)
Query: 220 HVYADRADVFNKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKK 279
HV D + D D + D T +A +E ++ E K EE RK EE KKK
Sbjct: 1065 HVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETG-KAEEARKAEEAKKK 1123
Query: 280 EEDGQHQQDKARS 292
ED + ++ ++
Sbjct: 1124 AEDARKAEEARKA 1136
Score = 32.0 bits (72), Expect = 0.51
Identities = 13/50 (26%), Positives = 23/50 (46%)
Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
+K + + E EE E E+KKEE KKK + + + ++ +
Sbjct: 1342 KKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKK 1391
Score = 31.6 bits (71), Expect = 0.78
Identities = 12/43 (27%), Positives = 24/43 (55%)
Query: 250 KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
+A K E ++ E K+ +E KK EE KK ++ + ++ ++
Sbjct: 1494 EAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKA 1536
Score = 31.6 bits (71), Expect = 0.79
Identities = 13/40 (32%), Positives = 25/40 (62%)
Query: 252 VKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
K E ++ EE+ K +E +KK EE KK ++ + + ++A+
Sbjct: 1289 KKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1328
Score = 31.3 bits (70), Expect = 0.92
Identities = 15/43 (34%), Positives = 28/43 (65%), Gaps = 1/43 (2%)
Query: 250 KAVKPPKEPQEPEE-EVKEEEERKKEEEKKKEEDGQHQQDKAR 291
+A K +E ++ +E + K EE +K +E KKK E+ + + D+A+
Sbjct: 1461 EAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAK 1503
Score = 31.3 bits (70), Expect = 1.0
Identities = 17/39 (43%), Positives = 21/39 (53%)
Query: 252 VKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
K E ++ EE K EE++K EE KK EED KA
Sbjct: 1545 KKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKA 1583
Score = 30.9 bits (69), Expect = 1.1
Identities = 15/54 (27%), Positives = 29/54 (53%), Gaps = 2/54 (3%)
Query: 240 SDEKLDIVTTKAVKPPKEPQEPEEEVK--EEEERKKEEEKKKEEDGQHQQDKAR 291
++E+ I + K +E ++ EE K EE+E+K E KKE + + ++ +
Sbjct: 1655 AEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELK 1708
Score = 30.5 bits (68), Expect = 1.4
Identities = 14/40 (35%), Positives = 24/40 (60%)
Query: 250 KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
K +K + + E ++K EE +K EEEKKK E + ++ +
Sbjct: 1605 KKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAE 1644
Score = 30.5 bits (68), Expect = 1.7
Identities = 12/29 (41%), Positives = 21/29 (72%)
Query: 263 EEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
EE K+ +E KK EEKKK ++ + + ++A+
Sbjct: 1287 EEKKKADEAKKAEEKKKADEAKKKAEEAK 1315
Score = 30.5 bits (68), Expect = 1.7
Identities = 12/33 (36%), Positives = 22/33 (66%)
Query: 259 QEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
++ EE K +E +KK EE KK E+ + + ++A+
Sbjct: 1438 KKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAK 1470
Score = 30.1 bits (67), Expect = 2.2
Identities = 14/42 (33%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 251 AVKPPKEPQEPEEEVKEEEE-RKKEEEKKKEEDGQHQQDKAR 291
A K E ++ EE K+ +E +KK EE KK ++ + + ++A+
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1457
Score = 30.1 bits (67), Expect = 2.2
Identities = 16/54 (29%), Positives = 29/54 (53%), Gaps = 4/54 (7%)
Query: 242 EKLDIVTTKAVKPPKEPQE----PEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
+K D KA + K+ E + K +E +KK EEKKK ++ + + ++A+
Sbjct: 1391 KKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAK 1444
Score = 29.7 bits (66), Expect = 2.5
Identities = 13/41 (31%), Positives = 20/41 (48%)
Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
E+ I + ++ + EE K EE + K EE KK E+
Sbjct: 1590 EEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEE 1630
Score = 29.7 bits (66), Expect = 2.7
Identities = 14/42 (33%), Positives = 26/42 (61%)
Query: 250 KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
+A K + ++ EE K EE +KK EE KK ++ + + ++A+
Sbjct: 1442 EAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAK 1483
Score = 29.7 bits (66), Expect = 3.0
Identities = 15/38 (39%), Positives = 24/38 (63%)
Query: 252 VKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
K +E ++ +E K EE +K EE+KK EE + ++DK
Sbjct: 1539 AKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDK 1576
Score = 29.7 bits (66), Expect = 3.2
Identities = 12/33 (36%), Positives = 22/33 (66%)
Query: 252 VKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQ 284
+K +E ++ +E K EE++K +E KKK E+ +
Sbjct: 1283 LKKAEEKKKADEAKKAEEKKKADEAKKKAEEAK 1315
Score = 29.3 bits (65), Expect = 3.3
Identities = 13/42 (30%), Positives = 26/42 (61%)
Query: 250 KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
+A K + ++ EE K +E +KK EE KK ++ + + ++A+
Sbjct: 1455 EAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAK 1496
Score = 29.3 bits (65), Expect = 3.7
Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 4/44 (9%)
Query: 253 KPPKEPQEPEEEVKEEEERKKEEE----KKKEEDGQHQQDKARS 292
K + ++ EE K+ EE KK EE K EE + ++DK ++
Sbjct: 1634 KVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKA 1677
Score = 29.3 bits (65), Expect = 3.8
Identities = 14/42 (33%), Positives = 22/42 (52%)
Query: 250 KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
KA + K E + EE+++K EE KK EED + + +
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695
Score = 29.0 bits (64), Expect = 4.3
Identities = 15/42 (35%), Positives = 26/42 (61%)
Query: 250 KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
+A K +E ++ EE K+ EE KK +E KK+ + + D+A+
Sbjct: 1448 EAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAK 1489
Score = 29.0 bits (64), Expect = 4.7
Identities = 13/32 (40%), Positives = 19/32 (59%)
Query: 250 KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEE 281
+A K E ++ EE K +E +K EE KK +E
Sbjct: 1507 EAKKKADEAKKAEEAKKADEAKKAEEAKKADE 1538
Score = 29.0 bits (64), Expect = 4.9
Identities = 16/55 (29%), Positives = 32/55 (58%), Gaps = 3/55 (5%)
Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVK---EEEERKKEEEKKKEEDGQHQQDKARSV 293
E+L + K +E ++ EEE K EE +++ EE+KKK E+ + +++ + +
Sbjct: 1705 EELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKI 1759
Score = 29.0 bits (64), Expect = 5.2
Identities = 16/52 (30%), Positives = 27/52 (51%)
Query: 231 KMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
+ + L +E+ + +A K + + EE K EEE+KK E+ KK+E
Sbjct: 1592 ARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEA 1643
Score = 28.6 bits (63), Expect = 6.2
Identities = 14/43 (32%), Positives = 27/43 (62%), Gaps = 1/43 (2%)
Query: 250 KAVKPPKEPQEPEE-EVKEEEERKKEEEKKKEEDGQHQQDKAR 291
+A K +E ++ +E + K EE +KK +E KK + + + D+A+
Sbjct: 1474 EAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAK 1516
Score = 28.6 bits (63), Expect = 6.5
Identities = 12/30 (40%), Positives = 19/30 (63%)
Query: 262 EEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
E + K EE +K +E KKK E+ + + D A+
Sbjct: 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAAK 1335
Score = 28.6 bits (63), Expect = 6.7
Identities = 10/37 (27%), Positives = 20/37 (54%)
Query: 252 VKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQD 288
+ +E ++ E+ K E +K EE KK E+ + ++
Sbjct: 1211 ERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEE 1247
Score = 28.2 bits (62), Expect = 8.0
Identities = 14/37 (37%), Positives = 22/37 (59%)
Query: 253 KPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
K E ++ EE+ K +E +K EE KK EE + ++ K
Sbjct: 1534 KKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAK 1570
Score = 28.2 bits (62), Expect = 8.1
Identities = 13/52 (25%), Positives = 27/52 (51%)
Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSV 293
E+ ++ + E E+++K EE +K EE K K E+ + +++ + V
Sbjct: 1584 EEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKV 1635
Score = 28.2 bits (62), Expect = 8.4
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 259 QEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
+ E++ K EE +K EEEKKK + +++K
Sbjct: 1738 EAEEDKKKAEEAKKDEEEKKKIAHLKKEEEK 1768
Score = 28.2 bits (62), Expect = 8.9
Identities = 12/33 (36%), Positives = 20/33 (60%)
Query: 250 KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
+A K ++ ++ EE K E+ RK EE +K E+
Sbjct: 1119 EAKKKAEDARKAEEARKAEDARKAEEARKAEDA 1151
Score = 28.2 bits (62), Expect = 8.9
Identities = 15/45 (33%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 250 KAVKPPKEPQEPE--EEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
KA + K+ E + EE K+ +E KK EE KK ++ + ++K ++
Sbjct: 1504 KAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKA 1548
Score = 28.2 bits (62), Expect = 9.5
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 253 KPPKEPQEPEEEVKEEEERKKEEEKKKEE 281
K E ++ EE K +E +K EE+KK +E
Sbjct: 1522 KKADEAKKAEEAKKADEAKKAEEKKKADE 1550
Score = 28.2 bits (62), Expect = 9.6
Identities = 13/37 (35%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 248 TTKAVKPPKEPQEPEEEVKEEEERKK--EEEKKKEED 282
+A K ++ EE K+ +E KK EE+KKK ++
Sbjct: 1373 KEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE 1409
Score = 27.8 bits (61), Expect = 9.8
Identities = 15/42 (35%), Positives = 26/42 (61%)
Query: 250 KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
+A K +E ++ +E K+ EE KK EE KK+ + + D+A+
Sbjct: 1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAK 1476
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 33.8 bits (77), Expect = 0.16
Identities = 14/29 (48%), Positives = 24/29 (82%)
Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQ 284
+E +E EEE +EEEE ++EEE+++EE+ +
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 33.4 bits (76), Expect = 0.21
Identities = 13/31 (41%), Positives = 26/31 (83%)
Query: 257 EPQEPEEEVKEEEERKKEEEKKKEEDGQHQQ 287
+ +E EEE +EEEE ++EEE+++EE+ ++++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 30.7 bits (69), Expect = 1.2
Identities = 13/29 (44%), Positives = 23/29 (79%)
Query: 260 EPEEEVKEEEERKKEEEKKKEEDGQHQQD 288
E EEE +EEEE ++EEE+++EE+ + ++
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 30.7 bits (69), Expect = 1.2
Identities = 12/30 (40%), Positives = 25/30 (83%)
Query: 260 EPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
+ EEE +EEEE ++EEE+++EE+ + ++++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891
Score = 30.0 bits (67), Expect = 2.1
Identities = 13/32 (40%), Positives = 25/32 (78%)
Query: 262 EEEVKEEEERKKEEEKKKEEDGQHQQDKARSV 293
EEE +EEEE ++EEE+++EE+ + + ++ S+
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEEENEEPLSL 896
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 32.8 bits (75), Expect = 0.20
Identities = 9/49 (18%), Positives = 22/49 (44%)
Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
E + KP K+P++P+ + K + + K + + + + + A
Sbjct: 78 EPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAA 126
Score = 31.7 bits (72), Expect = 0.39
Identities = 11/34 (32%), Positives = 18/34 (52%)
Query: 248 TTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEE 281
T P P +P+E+ K E++ KK + K K +
Sbjct: 71 TEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPK 104
>gnl|CDD|223477 COG0400, COG0400, Predicted esterase [General function prediction
only].
Length = 207
Score = 32.3 bits (74), Expect = 0.21
Identities = 11/43 (25%), Positives = 18/43 (41%), Gaps = 2/43 (4%)
Query: 24 IEEW--RKELKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLIL 64
+EE + +IL+G S G +A + + P IL
Sbjct: 87 LEELAEEYGIDSSRIILIGFSQGANIALSLGLTLPGLFAGAIL 129
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 32.4 bits (74), Expect = 0.22
Identities = 12/21 (57%), Positives = 17/21 (80%)
Query: 262 EEEVKEEEERKKEEEKKKEED 282
EE+ +EEEE+ EEEK +EE+
Sbjct: 160 EEKEREEEEKAAEEEKAREEE 180
Score = 28.2 bits (63), Expect = 5.8
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 259 QEPEEEVKEEEERKKEEEKKKEED 282
+E EE + EEE K EE+K E+
Sbjct: 156 KERAEEKEREEEEKAAEEEKAREE 179
Score = 27.8 bits (62), Expect = 7.3
Identities = 8/27 (29%), Positives = 16/27 (59%)
Query: 266 KEEEERKKEEEKKKEEDGQHQQDKARS 292
+ EE+ ++EEEK EE+ +++
Sbjct: 158 RAEEKEREEEEKAAEEEKAREEEILTG 184
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 32.9 bits (75), Expect = 0.22
Identities = 16/48 (33%), Positives = 26/48 (54%), Gaps = 9/48 (18%)
Query: 253 KPPKEPQEPEEEVKEEEERK---------KEEEKKKEEDGQHQQDKAR 291
K ++P+E ++ K +EE K KE+EKK EE +++K R
Sbjct: 119 KKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKR 166
Score = 32.2 bits (73), Expect = 0.39
Identities = 13/40 (32%), Positives = 18/40 (45%)
Query: 253 KPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
K +EP++ EEE K E R K KK + + K
Sbjct: 152 KKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPP 191
Score = 31.8 bits (72), Expect = 0.60
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 3/47 (6%)
Query: 249 TKAVKPPKEPQEPEEEVKEE---EERKKEEEKKKEEDGQHQQDKARS 292
TK K PK EEE ++E EE+KK++EK KEE + +
Sbjct: 92 TKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAK 138
Score = 31.4 bits (71), Expect = 0.75
Identities = 16/47 (34%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 248 TTKAVKPPKEPQE--PEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
K KP +E +E P +E ++E+E+K EE + +EE+ + ++ +A+S
Sbjct: 127 EPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKS 173
Score = 30.2 bits (68), Expect = 1.6
Identities = 13/46 (28%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 249 TKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVT 294
K +E +E +E+VKEE+++KKE+ K++ +D + +++
Sbjct: 98 PKNESGKEEEKE-KEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRP 142
Score = 29.9 bits (67), Expect = 2.0
Identities = 9/40 (22%), Positives = 20/40 (50%)
Query: 248 TTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQ 287
A K + E KEEE+ K++ +++K++ + +
Sbjct: 86 KGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPK 125
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 33.1 bits (76), Expect = 0.22
Identities = 18/40 (45%), Positives = 28/40 (70%)
Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEE 281
++L + + +PP P EPEEE +EEEE +EEE+++EE
Sbjct: 286 DRLPLRRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEE 325
Score = 29.2 bits (66), Expect = 3.3
Identities = 15/40 (37%), Positives = 27/40 (67%)
Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSV 293
PP P+ EEE +EEE ++EEE+++EE+ +++ +V
Sbjct: 299 PPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEEVRATV 338
>gnl|CDD|211851 TIGR03611, RutD, pyrimidine utilization protein D. This protein is
observed in operons extremely similar to that
characterized in E. coli K-12 responsible for the import
and catabolism of pyrimidines, primarily uracil. This
protein is a member of the hydrolase, alpha/beta fold
family defined by pfam00067.
Length = 248
Score = 32.4 bits (74), Expect = 0.23
Identities = 14/47 (29%), Positives = 24/47 (51%), Gaps = 4/47 (8%)
Query: 31 LKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGFPQKSIDP 77
L ++ +GH+ GG + A+ YP+R+ L+L + W DP
Sbjct: 76 LGIERFHFVGHALGGLIGLQLALDYPERLTSLVLINGW----SRPDP 118
>gnl|CDD|181553 PRK08775, PRK08775, homoserine O-acetyltransferase; Provisional.
Length = 343
Score = 32.8 bits (75), Expect = 0.24
Identities = 8/26 (30%), Positives = 17/26 (65%)
Query: 40 GHSFGGYLAFAYAIQYPDRVKHLILA 65
G+S+G + +A ++P RV+ L++
Sbjct: 144 GYSYGALVGLQFASRHPARVRTLVVV 169
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 32.5 bits (74), Expect = 0.27
Identities = 11/38 (28%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 253 KPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
K ++ +E +++ E+ER K+ EK++ Q Q+ +A
Sbjct: 84 KEQQQAEELQQKQAAEQERLKQLEKERLAA-QEQKKQA 120
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 32.7 bits (74), Expect = 0.27
Identities = 13/52 (25%), Positives = 31/52 (59%)
Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSV 293
EKL + + + K P++ + E++ +K ++++KKE++ + +DK + V
Sbjct: 171 EKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEV 222
Score = 30.8 bits (69), Expect = 1.0
Identities = 11/39 (28%), Positives = 27/39 (69%)
Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVT 294
K+ ++P+++ K+E+E++++++KKKE +G A +
Sbjct: 197 KKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDS 235
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 30.3 bits (69), Expect = 0.28
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 261 PEEEVKEEEERKKEEEKKKEEDG 283
EE KKEEE+++EED
Sbjct: 59 AAAAAAAAEEEKKEEEEEEEEDD 81
Score = 29.5 bits (67), Expect = 0.54
Identities = 9/26 (34%), Positives = 13/26 (50%)
Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEE 276
A EE K+EEE ++EE+
Sbjct: 55 AAAAAAAAAAAAEEEKKEEEEEEEED 80
Score = 28.7 bits (65), Expect = 0.97
Identities = 10/26 (38%), Positives = 14/26 (53%)
Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEE 276
A EEE KEEEE ++E++
Sbjct: 56 AAAAAAAAAAAEEEKKEEEEEEEEDD 81
Score = 28.0 bits (63), Expect = 2.3
Identities = 9/25 (36%), Positives = 15/25 (60%)
Query: 258 PQEPEEEVKEEEERKKEEEKKKEED 282
EEE+K+EEE+++E+D
Sbjct: 57 AAAAAAAAAAEEEKKEEEEEEEEDD 81
>gnl|CDD|221019 pfam11187, DUF2974, Protein of unknown function (DUF2974). This
bacterial family of proteins has no known function.
Length = 224
Score = 32.2 bits (74), Expect = 0.30
Identities = 17/44 (38%), Positives = 23/44 (52%), Gaps = 4/44 (9%)
Query: 37 ILLGHSFGGYLAFAYAIQ----YPDRVKHLILADPWGFPQKSID 76
L GHS GG LA A+ DR+K + D GFP++ I+
Sbjct: 87 YLGGHSKGGNLAVYAAMNAEPDLQDRIKKIYSFDGPGFPKEVIE 130
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 32.7 bits (74), Expect = 0.32
Identities = 13/35 (37%), Positives = 26/35 (74%)
Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
K +E E++ +EE ER+KE+EK++E + + + ++A
Sbjct: 587 KAKREAEQKAREEREREKEKEKEREREREREAERA 621
Score = 28.1 bits (62), Expect = 8.7
Identities = 12/39 (30%), Positives = 26/39 (66%)
Query: 253 KPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
K K+ +E E+ K E E+K EE+++E++ + ++++ R
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKERERER 614
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 29.6 bits (67), Expect = 0.33
Identities = 13/26 (50%), Positives = 19/26 (73%)
Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEE 281
KE + EE+ +EEEE +K EE+K+ E
Sbjct: 27 KELKAEEEKREEEEEARKREERKERE 52
Score = 28.8 bits (65), Expect = 0.76
Identities = 10/32 (31%), Positives = 25/32 (78%)
Query: 262 EEEVKEEEERKKEEEKKKEEDGQHQQDKARSV 293
++E+K EEE+++EEE+ ++ + + +++K +S
Sbjct: 26 KKELKAEEEKREEEEEARKREERKEREKNKSF 57
Score = 26.5 bits (59), Expect = 4.6
Identities = 10/23 (43%), Positives = 15/23 (65%)
Query: 259 QEPEEEVKEEEERKKEEEKKKEE 281
+E EE K EE +++E+ K EE
Sbjct: 37 EEEEEARKREERKEREKNKSFEE 59
>gnl|CDD|165193 PHA02857, PHA02857, monoglyceride lipase; Provisional.
Length = 276
Score = 32.2 bits (73), Expect = 0.35
Identities = 40/235 (17%), Positives = 89/235 (37%), Gaps = 34/235 (14%)
Query: 2 GRSSRPRFS-TDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
GRS+ + D R +V + + + LLGHS G ++ A + P+
Sbjct: 64 GRSNGEKMMIDDFGVYVRDVVQHVVTIKSTYPGVPVFLLGHSMGATISILAAYKNPNLFT 123
Query: 61 HLILADPWGFPQKSIDPQKASKIPLWARMIGNLYKNFNPLWPVRFVGPLGQWVVEKMRPD 120
+IL P ++ + ++ L A + +P + VG K+ P+
Sbjct: 124 AMILMSPL------VNAEAVPRLNLLAAKL------MGIFYPNKIVG--------KLCPE 163
Query: 121 LPKKFTPVLKEDSSAITEYIFQCNVQAPSGESAFHTLTEGLGYAKRPMLHRVDQLAAHVP 180
+ D + +Y + V ++ F + R ++ ++ P
Sbjct: 164 S-------VSRDMDEVYKYQYDPLVNHEKIKAGFASQVLKATNKVRKIIPKIK-----TP 211
Query: 181 VTVIYGSRSWVDNSSGDKIKEARSQNSFVQVKSVTGAGHHVYADRADVFNKMVND 235
+ ++ G+ + + + SG + N ++K GA HH++ + +V ++ +
Sbjct: 212 ILILQGTNNEISDVSGAYYF-MQHANCNREIKIYEGAKHHLHKETDEVKKSVMKE 265
>gnl|CDD|225256 COG2382, Fes, Enterochelin esterase and related enzymes [Inorganic
ion transport and metabolism].
Length = 299
Score = 31.6 bits (72), Expect = 0.44
Identities = 10/37 (27%), Positives = 19/37 (51%), Gaps = 2/37 (5%)
Query: 37 ILLGHSFGGYLAFAYAIQYPDRVKHLILADP--WGFP 71
+L G S GG ++ +++P+R H++ W P
Sbjct: 180 VLAGDSLGGLVSLYAGLRHPERFGHVLSQSGSFWWTP 216
>gnl|CDD|130253 TIGR01185, devC, DevC protein. This model describes a predicted
membrane subunit, DevC, of an ABC transporter known so
far from two species of cyanobacteria. Some experimental
data from mutational analysis suggest that this protein
along with DevA and DevB encoded in the same operon may
be involved in the transport/export of glycolipids
[Transport and binding proteins, Other].
Length = 380
Score = 31.8 bits (72), Expect = 0.44
Identities = 15/44 (34%), Positives = 22/44 (50%)
Query: 1 FGRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFG 44
F RSSRP + + E+ S E R+ + + + LG SFG
Sbjct: 140 FDRSSRPEYGPIAKNFEQGQSISAEVRRRRINVVGLFTLGASFG 183
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 32.0 bits (73), Expect = 0.44
Identities = 13/61 (21%), Positives = 28/61 (45%), Gaps = 7/61 (11%)
Query: 220 HVYADRADVFNKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKK 279
+ + F++ N T S T K K K + ++ K+++++K+++E K
Sbjct: 33 VLSKEILSTFSEEENKVATTS-------TKKDKKEDKNNESKKKSEKKKKKKKEKKEPKS 85
Query: 280 E 280
E
Sbjct: 86 E 86
>gnl|CDD|192481 pfam10197, Cir_N, N-terminal domain of CBF1 interacting
co-repressor CIR. This is a 45 residue conserved region
at the N-terminal end of a family of proteins referred
to as CIRs (CBF1-interacting co-repressors). CBF1
(centromere-binding factor 1) acts as a transcription
factor that causes repression by binding specifically to
GTGGGAA motifs in responsive promoters, and it requires
CIR as a co-repressor. CIR binds to histone deacetylase
and to SAP30 and serves as a linker between CBF1 and the
histone deacetylase complex.
Length = 37
Score = 28.4 bits (64), Expect = 0.46
Identities = 11/19 (57%), Positives = 13/19 (68%)
Query: 262 EEEVKEEEERKKEEEKKKE 280
E E K EE+KK EE +KE
Sbjct: 15 EAEQKALEEQKKIEELRKE 33
>gnl|CDD|220735 pfam10408, Ufd2P_core, Ubiquitin elongating factor core. This is
the most conserved part of the core region of Ufd2P
ubiquitin elongating factor or E4, running from helix
alpha-11 to alpha-38. It consists of 31 helices of
variable length connected by loops of variable size
forming a compact unit; the helical packing pattern of
the compact unit consists of five structural repeats
that resemble tandem Armadillo (ARM) repeats. This
domain is involved in ubiquitination as it binds Cdc48p
and escorts ubiquitinated proteins from Cdc48p to the
proteasome for degradation. The core is structurally
similar to the nuclear transporter protein
importin-alpha. The core is associated with the U-box at
the C-terminus, pfam04564, which has ligase activity.
Length = 625
Score = 31.9 bits (73), Expect = 0.53
Identities = 21/81 (25%), Positives = 42/81 (51%), Gaps = 7/81 (8%)
Query: 230 NKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
N ++ND L DE L K +K + Q+ ++ E E +EE +++E + + + +
Sbjct: 439 NLLLNDATYLLDESLS--YLKEIK---QLQKELQDRAEWESLSQEEREERESELRQLERQ 493
Query: 290 ARSVTFMTLGIVMVSMMILMT 310
A+S ++ L V ++ L+T
Sbjct: 494 AKS--YLQLANETVKLLKLLT 512
>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family. This
family includes the HABP4 family of hyaluronan-binding
proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
HABP4 has been observed to bind hyaluronan (a
glucosaminoglycan), but it is not known whether this is
its primary role in vivo. It has also been observed to
bind RNA, but with a lower affinity than that for
hyaluronan. PAI-1 mRNA-binding protein specifically
binds the mRNA of type-1 plasminogen activator inhibitor
(PAI-1), and is thought to be involved in regulation of
mRNA stability. However, in both cases, the sequence
motifs predicted to be important for ligand binding are
not conserved throughout the family, so it is not known
whether members of this family share a common function.
Length = 106
Score = 30.1 bits (68), Expect = 0.54
Identities = 16/62 (25%), Positives = 29/62 (46%), Gaps = 7/62 (11%)
Query: 238 TLSDEKLDIVTTKAVKPPKEPQEPEEEVKEE---EERKKEEEKKKE----EDGQHQQDKA 290
+++ + + +E Q EEE +E EE + EEE+ KE E + Q+K
Sbjct: 33 EIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKEMTLEEYEKALQEKR 92
Query: 291 RS 292
++
Sbjct: 93 KA 94
Score = 26.6 bits (59), Expect = 7.2
Identities = 11/35 (31%), Positives = 16/35 (45%)
Query: 248 TTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
+ E Q EEE E EE++ EE+ +E
Sbjct: 30 VKDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEG 64
>gnl|CDD|225749 COG3208, GrsT, Predicted thioesterase involved in non-ribosomal
peptide biosynthesis [Secondary metabolites
biosynthesis, transport, and catabolism].
Length = 244
Score = 31.5 bits (72), Expect = 0.55
Identities = 18/73 (24%), Positives = 24/73 (32%), Gaps = 14/73 (19%)
Query: 2 GRSSRPR--FSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRV 59
GR R TD E++ +L + + L GHS G LAF A
Sbjct: 43 GRGDRFGEPLLTDIESLADELANELLPPLLD---APFALFGHSMGAMLAFEVA------- 92
Query: 60 KHLILADPWGFPQ 72
L P+
Sbjct: 93 --RRLERAGLPPR 103
>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex. This Mediator
complex subunit was formerly known as Srb4 in yeasts or
Trap80 in Drosophila and human. The Med17 subunit is
located within the head domain and is essential for cell
viability to the extent that a mutant strain of
cerevisiae lacking it shows all RNA polymerase
II-dependent transcription ceasing at non-permissive
temperatures.
Length = 454
Score = 31.6 bits (72), Expect = 0.56
Identities = 10/61 (16%), Positives = 26/61 (42%)
Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVTFMTLGIVMVSMMILMTLLTMF 315
K E + +E++E ++ ++ +D + KAR L + + + +++
Sbjct: 64 KIDFSEESDEEEDDEEDDNDDSEENKDTVEEFPKARDEVLEQLNQALNESSLALDFVSLL 123
Query: 316 L 316
L
Sbjct: 124 L 124
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 31.6 bits (72), Expect = 0.63
Identities = 12/39 (30%), Positives = 23/39 (58%)
Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
P + E+E +EEE+R E E+++ ED + + ++R
Sbjct: 364 PIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSRE 402
>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366). This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 227 and
387 amino acids in length.
Length = 213
Score = 30.8 bits (70), Expect = 0.67
Identities = 34/153 (22%), Positives = 55/153 (35%), Gaps = 38/153 (24%)
Query: 186 GSRSWVDNSSGDKIKEARSQNSFVQVKSVTGAGHHVY--ADRAD-----VFNKMV--NDT 236
G+ + VD+++ K+ F+ V T +G+ Y DRA F V D
Sbjct: 59 GNATLVDDATDASGKQ------FITV--TTKSGNTFYIVIDRAAEGENVHFLNQVDEADL 110
Query: 237 CTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVTFM 296
L +E+ T + P E E E + E E++ E+
Sbjct: 111 LALMEEEDAAAPTCSCCPVCSV-NMTECTGPEPEPEPEPEEEPEKKS------------- 156
Query: 297 TLGIVMVSMMILMTLLTMFLVLFGLGYYFSYLK 329
M L+ +L + L+ G YYF + K
Sbjct: 157 -------GMGPLLLVLAVALIGGGAYYYFKFYK 182
>gnl|CDD|223700 COG0627, COG0627, Predicted esterase [General function prediction
only].
Length = 316
Score = 31.3 bits (71), Expect = 0.72
Identities = 12/23 (52%), Positives = 15/23 (65%)
Query: 40 GHSFGGYLAFAYAIQYPDRVKHL 62
GHS GGY A A+++PDR K
Sbjct: 158 GHSMGGYGALKLALKHPDRFKSA 180
>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
MW N terminal. This domain is found in eukaryotes. This
domain is about 90 amino acids in length. This domain is
found associated with pfam00076. This domain is part of
U1 snRNP, which is the pre-mRNA binding protein of the
penta-snRNP spliceosome complex. It extends over a
distance of 180 A from its RNA binding domain, wraps
around the core domain of U1 snRNP consisting of the
seven Sm proteins and finally contacts U1-C, which is
crucial for 5'-splice-site recognition.
Length = 94
Score = 29.2 bits (66), Expect = 0.76
Identities = 12/28 (42%), Positives = 20/28 (71%)
Query: 253 KPPKEPQEPEEEVKEEEERKKEEEKKKE 280
+PP EP E E +E E+R+K+E+ +K+
Sbjct: 53 EPPPEPTETWLEKREREKREKKEKLEKK 80
>gnl|CDD|131294 TIGR02240, PHA_depoly_arom, poly(3-hydroxyalkanoate) depolymerase.
This family consists of the polyhydroxyalkanoic acid
(PHA) depolymerase of Pseudomonas oleovorans,
Pseudomonas putida BM01, and related species. This
enzyme is part of polyester storage and mobilization
system as in many bacteria. However, species containing
this enzyme are unusual in their capacity to produce
aromatic polyesters when grown on carbon sources such as
benzoic acid or phenylacetic acid [Energy metabolism,
Other].
Length = 276
Score = 30.7 bits (69), Expect = 0.81
Identities = 15/27 (55%), Positives = 19/27 (70%)
Query: 39 LGHSFGGYLAFAYAIQYPDRVKHLILA 65
+G S+GG LA +A YP+R K LILA
Sbjct: 96 IGVSWGGALAQQFAHDYPERCKKLILA 122
>gnl|CDD|204122 pfam09072, TMA7, Translation machinery associated TMA7. TMA7 plays
a role in protein translation. Deletions of the TMA7
gene results in altered protein synthesis rates.
Length = 63
Score = 28.7 bits (64), Expect = 0.83
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEE 281
P K+ +E +EE ++++KEE K +E
Sbjct: 14 PKKQKKELDEEDLAFKQKQKEEAKALKE 41
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 31.1 bits (71), Expect = 0.90
Identities = 14/65 (21%), Positives = 37/65 (56%), Gaps = 10/65 (15%)
Query: 223 ADRADVF------NKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEE 276
A R D F +++ + +++++ + K KPPK+ +E ++ K ++++K++++
Sbjct: 349 AARVDYFSGRYIGDELKEEL----NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKK 404
Query: 277 KKKEE 281
KK +
Sbjct: 405 GKKRK 409
>gnl|CDD|190521 pfam03096, Ndr, Ndr family. This family consists of proteins from
different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3.
Their similarity was previously noted. The precise
molecular and cellular function of members of this
family is still unknown. Yet, they are known to be
involved in cellular differentiation events. The Ndr1
group was the first to be discovered. Their expression
is repressed by the proto-oncogenes N-myc and c-myc, and
in line with this observation, Ndr1 protein expression
is down-regulated in neoplastic cells, and is
reactivated when differentiation is induced by chemicals
such as retinoic acid. Ndr2 and Ndr3 expression is not
under the control of N-myc or c-myc. Ndr1 expression is
also activated by several chemicals: tunicamycin and
homocysteine induce Ndr1 in human umbilical endothelial
cells; nickel induces Ndr1 in several cell types.
Members of this family are found in wide variety of
multicellular eukaryotes, including an Ndr1 type protein
in Helianthus annuus (sunflower), known as Sf21.
Interestingly, the highest scoring matches in the noise
are all alpha/beta hydrolases pfam00561, suggesting that
this family may have an enzymatic function (Bateman A
pers. obs.).
Length = 284
Score = 30.8 bits (70), Expect = 0.91
Identities = 14/49 (28%), Positives = 27/49 (55%)
Query: 19 QLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADP 67
L I +L+ +I +G G Y+ +A+++P+RV+ L+L +P
Sbjct: 84 DLADMIPVVLDHFRLKSVIGMGVGAGAYILARFALKHPERVEGLVLINP 132
>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
represents the eukaryotic large ribosomal protein P2.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P2 is located in the L12 stalk, with proteins
P1, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers). Bacteria may have four or six copies
of L7/L12 (two or three homodimers) depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plants have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site in the C-terminal region
of P2. TCS inactivates the ribosome by depurinating a
specific adenine in the sarcin-ricin loop of 28S rRNA.
Length = 109
Score = 29.2 bits (66), Expect = 1.0
Identities = 19/79 (24%), Positives = 34/79 (43%), Gaps = 7/79 (8%)
Query: 211 VKSVTGA-GHHVYADRAD-VFNKM----VNDTCTLSDEKLDIVTTKAVKP-PKEPQEPEE 263
VK + G+ G V ++ + V +++ V + EKL V A P
Sbjct: 23 VKKILGSVGVEVDDEKLNKVISELEGKDVEELIAAGKEKLASVPAGAGGAAPAAAAAAAA 82
Query: 264 EVKEEEERKKEEEKKKEED 282
++E KKEE +++ +D
Sbjct: 83 AAAAKKEEKKEESEEESDD 101
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 30.6 bits (69), Expect = 1.1
Identities = 11/40 (27%), Positives = 22/40 (55%), Gaps = 1/40 (2%)
Query: 253 KPPKEPQEPEEEVKEEEERKKE-EEKKKEEDGQHQQDKAR 291
K ++ +E E++ E+ R+KE E++ E Q ++A
Sbjct: 72 KLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAA 111
Score = 30.2 bits (68), Expect = 1.7
Identities = 12/46 (26%), Positives = 20/46 (43%), Gaps = 4/46 (8%)
Query: 250 KAVKPPKEPQEPEEEVKEEEERKK----EEEKKKEEDGQHQQDKAR 291
K+ Q E + K+ E K E EKK +E+ + Q ++
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEA 156
Score = 29.4 bits (66), Expect = 2.8
Identities = 7/32 (21%), Positives = 15/32 (46%)
Query: 259 QEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
+ + E+ K+ EEK+K+ + + A
Sbjct: 99 AAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAA 130
Score = 28.6 bits (64), Expect = 4.4
Identities = 13/46 (28%), Positives = 22/46 (47%), Gaps = 4/46 (8%)
Query: 250 KAVKPPKEPQEPEEEVKEEEE--RKKEEEKKKE--EDGQHQQDKAR 291
K K E E E K +EE ++ EEE K + + + + +A+
Sbjct: 127 KQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAK 172
Score = 28.6 bits (64), Expect = 5.1
Identities = 10/49 (20%), Positives = 27/49 (55%), Gaps = 5/49 (10%)
Query: 247 VTTKAVKPPKEPQEPEEEVKE-----EEERKKEEEKKKEEDGQHQQDKA 290
+ + K+ QE ++++++ E++R E+ ++KE + + +KA
Sbjct: 55 IQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKA 103
Score = 28.6 bits (64), Expect = 5.2
Identities = 7/29 (24%), Positives = 16/29 (55%)
Query: 259 QEPEEEVKEEEERKKEEEKKKEEDGQHQQ 287
+E E+ E+ K+ E+ K+ + + +Q
Sbjct: 92 KELEQRAAAEKAAKQAEQAAKQAEEKQKQ 120
Score = 28.3 bits (63), Expect = 6.3
Identities = 10/36 (27%), Positives = 20/36 (55%)
Query: 255 PKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
K ++ E+ K+ EE++K+ E+ K + + KA
Sbjct: 101 EKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKA 136
Score = 27.9 bits (62), Expect = 8.3
Identities = 8/44 (18%), Positives = 23/44 (52%)
Query: 248 TTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
+A + ++ + + E+ ++ E+ K+ E+ Q Q ++A+
Sbjct: 82 KQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAK 125
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 30.0 bits (68), Expect = 1.3
Identities = 10/44 (22%), Positives = 24/44 (54%)
Query: 241 DEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQ 284
DE+ + K K + + +++E+E+ K+E+E+K + +
Sbjct: 42 DEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELE 85
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 30.7 bits (69), Expect = 1.3
Identities = 13/29 (44%), Positives = 23/29 (79%)
Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQ 284
+E + EEE +EEEE ++EE++ +EE+G+
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGE 468
Score = 28.7 bits (64), Expect = 5.9
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 253 KPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQD 288
+ +E +E EEE +EEE+ +EEE + EE+ + +
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEA 478
Score = 28.0 bits (62), Expect = 10.0
Identities = 15/41 (36%), Positives = 24/41 (58%)
Query: 248 TTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQD 288
+T P QE EEE EEE ++EEE+++EE +++
Sbjct: 426 STSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEE 466
>gnl|CDD|164765 MTH00222, ATP9, ATP synthase F0 subunit 9; Provisional.
Length = 77
Score = 28.3 bits (63), Expect = 1.4
Identities = 12/33 (36%), Positives = 17/33 (51%)
Query: 289 KARSVTFMTLGIVMVSMMILMTLLTMFLVLFGL 321
K + T+ LG + M L L+ FL+LF L
Sbjct: 45 KQQLFTYAILGFAISEAMGLFCLMMAFLILFAL 77
>gnl|CDD|205219 pfam13038, DUF3899, Domain of unknown function (DUF3899). Putative
Tryptophanyl-tRNA synthetase.
Length = 92
Score = 28.4 bits (64), Expect = 1.4
Identities = 11/51 (21%), Positives = 26/51 (50%)
Query: 266 KEEEERKKEEEKKKEEDGQHQQDKARSVTFMTLGIVMVSMMILMTLLTMFL 316
E++E+ EE + D + ++ K T+ ++ ++IL ++L F+
Sbjct: 42 VEKKEKDSIEEDEFANDKKVKKKKYSVFTWTAPLLLSGLLLILSSILLSFI 92
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 30.3 bits (68), Expect = 1.5
Identities = 8/31 (25%), Positives = 18/31 (58%)
Query: 253 KPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
+ E++++E+ ++ EE+K EE+G
Sbjct: 302 DKLSPEIPAKPEIEQDEDSEESEEEKNEEEG 332
Score = 30.3 bits (68), Expect = 1.5
Identities = 12/40 (30%), Positives = 20/40 (50%)
Query: 250 KAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
+ K E E ++E+ + EEEK +EE G ++ K
Sbjct: 300 REDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGK 339
Score = 28.4 bits (63), Expect = 6.4
Identities = 6/32 (18%), Positives = 12/32 (37%)
Query: 260 EPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
+ +E E++ E K E Q + +
Sbjct: 292 ASGNDPEEREDKLSPEIPAKPEIEQDEDSEES 323
>gnl|CDD|177659 PLN00021, PLN00021, chlorophyllase.
Length = 313
Score = 30.0 bits (68), Expect = 1.5
Identities = 14/35 (40%), Positives = 15/35 (42%), Gaps = 5/35 (14%)
Query: 38 LLGHSFGGYLAFAYAIQYPDRVKH-----LILADP 67
L GHS GG AFA A+ LI DP
Sbjct: 130 LAGHSRGGKTAFALALGKAAVSLPLKFSALIGLDP 164
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 29.5 bits (67), Expect = 1.6
Identities = 13/37 (35%), Positives = 18/37 (48%), Gaps = 1/37 (2%)
Query: 245 DIVTTKAVKPPKEPQ-EPEEEVKEEEERKKEEEKKKE 280
DI+ K + PPK P + EEE E ++ E E
Sbjct: 5 DILRKKGILPPKPPSPKEEEEEALELAIQEAHENALE 41
>gnl|CDD|215859 pfam00326, Peptidase_S9, Prolyl oligopeptidase family.
Length = 212
Score = 29.9 bits (68), Expect = 1.6
Identities = 13/52 (25%), Positives = 20/52 (38%), Gaps = 2/52 (3%)
Query: 18 RQLVTSIEEWRKEL--KLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADP 67
+ + E + + + G S+GGYL A Q PD K + P
Sbjct: 46 DDFIAAAEYLIAQGYVDPDRLAIWGGSYGGYLTGAALNQRPDLFKAAVAVVP 97
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 30.3 bits (68), Expect = 1.6
Identities = 10/40 (25%), Positives = 20/40 (50%)
Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVTF 295
E ++ +E E +KK+EEK KE++ + + +
Sbjct: 6 SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKA 45
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 29.2 bits (66), Expect = 1.7
Identities = 9/20 (45%), Positives = 17/20 (85%)
Query: 262 EEEVKEEEERKKEEEKKKEE 281
EE+ +E+E K+E+E+K+E+
Sbjct: 15 EEKREEKEREKEEKERKEEK 34
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 30.1 bits (68), Expect = 1.7
Identities = 18/43 (41%), Positives = 28/43 (65%), Gaps = 2/43 (4%)
Query: 254 PPKEPQEPEEEVKEEEERK--KEEEKKKEEDGQHQQDKARSVT 294
P +E +E +EE KEEEE+ KEEE +EE+ + ++ K + V
Sbjct: 36 PDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVK 78
Score = 29.7 bits (67), Expect = 2.3
Identities = 13/33 (39%), Positives = 22/33 (66%)
Query: 257 EPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
E + P+EE +EE+E KKEEE+K + + ++
Sbjct: 32 EKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEE 64
Score = 29.4 bits (66), Expect = 2.9
Identities = 11/31 (35%), Positives = 18/31 (58%)
Query: 249 TKAVKPPKEPQEPEEEVKEEEERKKEEEKKK 279
+ K +E +EE +EEE K+E++KK
Sbjct: 44 KEEKKEEEEKTTDKEEEVDEEEEKEEKKKKT 74
Score = 29.4 bits (66), Expect = 3.4
Identities = 13/24 (54%), Positives = 21/24 (87%)
Query: 259 QEPEEEVKEEEERKKEEEKKKEED 282
+E E+EV +EEE +++EEKK+EE+
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEE 52
Score = 27.8 bits (62), Expect = 9.7
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEE 281
+E E + +EE+E KK++ KK +E
Sbjct: 54 TTDKEEEVDEEEEKEEKKKKTKKVKE 79
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 30.3 bits (69), Expect = 1.7
Identities = 7/39 (17%), Positives = 21/39 (53%)
Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
P E ++ ++ ++ E++ ++EE +K + + A+
Sbjct: 407 SPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKK 445
>gnl|CDD|214008 cd12809, Esterase_713_like-2, Uncharacterized enzymes similar to
novel bacterial esterase that cleaves esters on
halogenated cyclic compounds. This family contains
uncharacterized proteins similar to a novel bacterial
esterase (Alcaligenes esterase 713) with the alpha/beta
hydrolase fold but does not contain the GXSXXG
pentapeptide around the active site serine residue as
commonly seen in other enzymes of this class. Esterase
713 shows negligible sequence homology to other esterase
and lipase enzymes. It is active as a dimer and cleaves
esters on halogenated cyclic compounds though its
natural substrate is unknown.
Length = 280
Score = 29.9 bits (68), Expect = 1.8
Identities = 14/35 (40%), Positives = 19/35 (54%)
Query: 37 ILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGFP 71
IL+ HS GG + A PD VK ++ +P G P
Sbjct: 174 ILITHSQGGPFGWLAADARPDLVKAIVAIEPSGPP 208
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 29.0 bits (65), Expect = 1.8
Identities = 14/42 (33%), Positives = 23/42 (54%), Gaps = 8/42 (19%)
Query: 254 PPKEP--------QEPEEEVKEEEERKKEEEKKKEEDGQHQQ 287
PP+ P +EE +EEEE +K+++ + + QHQQ
Sbjct: 80 PPEGPAGTTSGTGHTRQEEEEEEEENEKQQQSDEAQVQQHQQ 121
>gnl|CDD|234235 TIGR03503, TIGR03503, TIGR03503 family protein. This set of
conserved hypothetical protein has a phylogenetic range
that closely matches that of TIGR03501, a putative
C-terminal protein targeting signal.
Length = 374
Score = 30.0 bits (68), Expect = 1.8
Identities = 14/66 (21%), Positives = 31/66 (46%), Gaps = 3/66 (4%)
Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVTFMTLGIVMVSMMILMT 310
V+ P E + EE EE K + E+++ + + ++ F + IV +++L+
Sbjct: 306 NVEEPPEAEPSAEEAATEEALKAKAEQEQALLVEREAKDSK---FWIIIIVGNVVILLLG 362
Query: 311 LLTMFL 316
+ F+
Sbjct: 363 GIGFFV 368
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 29.3 bits (66), Expect = 1.9
Identities = 15/34 (44%), Positives = 21/34 (61%), Gaps = 2/34 (5%)
Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
E E EV+EEE KKE++KKKE + ++ K
Sbjct: 140 TAKVEKEAEVEEEE--KKEKKKKKEVKKEKKEKK 171
>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein. This family consists of mammalian
MRVI1 proteins which are related to the
lymphoid-restricted membrane protein (JAW1) and the IP3
receptor associated cGMP kinase substrates A and B
(IRAGA and IRAGB). The function of MRVI1 is unknown
although mutations in the Mrvi1 gene induces myeloid
leukaemia by altering the expression of a gene important
for myeloid cell growth and/or differentiation so it has
been speculated that Mrvi1 is a tumour suppressor gene.
IRAG is very similar in sequence to MRVI1 and is an
essential NO/cGKI-dependent regulator of IP3-induced
calcium release. Activation of cGKI decreases
IP3-stimulated elevations in intracellular calcium,
induces smooth muscle relaxation and contributes to the
antiproliferative and pro-apoptotic effects of NO/cGMP.
Jaw1 is a member of a class of proteins with
COOH-terminal hydrophobic membrane anchors and is
structurally similar to proteins involved in vesicle
targeting and fusion. This suggests that the function
and/or the structure of the ER in lymphocytes may be
modified by lymphoid-restricted resident ER proteins.
Length = 538
Score = 30.0 bits (67), Expect = 2.0
Identities = 16/84 (19%), Positives = 31/84 (36%), Gaps = 5/84 (5%)
Query: 250 KAVKPPKEPQEPEEEVKEEEERKKEEEKKK-----EEDGQHQQDKARSVTFMTLGIVMVS 304
K K ++ +EPE E E RK ++ + Q A VTF+ + +
Sbjct: 418 KKTKKLQDLREPEGEEAVERTRKPSLSEEVAETEEWDKEQRSSKLASWVTFLQVSYRKAN 477
Query: 305 MMILMTLLTMFLVLFGLGYYFSYL 328
+ + + L+ + + L
Sbjct: 478 QALWLMAAAILLLAALMSFLTGQL 501
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 29.8 bits (67), Expect = 2.0
Identities = 9/34 (26%), Positives = 19/34 (55%)
Query: 249 TKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
+ + +E EE+ KE+ +R K+ + ++ED
Sbjct: 275 KPSGERSDSEEETEEKEKEKRKRLKKMMEDEDED 308
>gnl|CDD|223187 COG0109, CyoE, Polyprenyltransferase (cytochrome oxidase assembly
factor) [Posttranslational modification, protein
turnover, chaperones].
Length = 304
Score = 29.9 bits (68), Expect = 2.0
Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 4/40 (10%)
Query: 297 TLGIVMVSMMILMT-LLTMFLVLFGL-GYYFSYLKTHFVK 334
LG+ S++ + LL L LFG+ Y Y T ++K
Sbjct: 107 VLGVAGFSLLWFLVNLLAAVLGLFGIFFYVVVY--TLWLK 144
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 29.3 bits (66), Expect = 2.0
Identities = 16/58 (27%), Positives = 31/58 (53%), Gaps = 3/58 (5%)
Query: 225 RADVFNKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
F + V ++E+ DI ++ K + E+V EE+E+ +EEE+++EE+
Sbjct: 131 ALSKFKRKVG---LFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEE 185
Score = 27.8 bits (62), Expect = 8.1
Identities = 11/43 (25%), Positives = 24/43 (55%), Gaps = 2/43 (4%)
Query: 243 KLDIVTTKAVKPPKEPQEPEEEVKE--EEERKKEEEKKKEEDG 283
K+ + T + ++ E+++KE E+ +E+EK +EE+
Sbjct: 138 KVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEE 180
>gnl|CDD|223964 COG1033, COG1033, Predicted exporters of the RND superfamily
[General function prediction only].
Length = 727
Score = 30.0 bits (68), Expect = 2.0
Identities = 14/68 (20%), Positives = 27/68 (39%), Gaps = 3/68 (4%)
Query: 258 PQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVTFMTLGIVMVSMMILMTLLTMFLV 317
E E+ +E ++E G SV ++ + ++ S ++ T+L + LV
Sbjct: 525 QGELEDVGREILRDIEKEN---IPTGVKVYVTGESVIYVEMNELLTSSQLISTVLGIILV 581
Query: 318 LFGLGYYF 325
L F
Sbjct: 582 FALLLIIF 589
>gnl|CDD|177159 MTH00097, ND6, NADH dehydrogenase subunit 6; Provisional.
Length = 121
Score = 28.7 bits (65), Expect = 2.0
Identities = 7/40 (17%), Positives = 14/40 (35%), Gaps = 2/40 (5%)
Query: 296 MTLGIVMVSMMILMTLLTMFLVLFGLGYYFSYLKTHFVKL 335
+ + ++ L+MF F Y+F + L
Sbjct: 43 SSFKLKKPYLVFFSLFLSMFF--FFFFYFFKVYFFSNLSL 80
>gnl|CDD|130799 TIGR01738, bioH, pimelyl-[acyl-carrier protein] methyl ester
esterase. This CoA-binding enzyme is required for the
production of pimeloyl-coenzyme A, the substrate of the
BioF protein early in the biosynthesis of biotin. Its
exact function is unknown, but is proposed in ref 2.
This enzyme belongs to the alpha/beta hydrolase fold
family (Pfam model pfam00561). Members of this family
are restricted to the Proteobacteria [Biosynthesis of
cofactors, prosthetic groups, and carriers, Biotin].
Length = 245
Score = 29.4 bits (66), Expect = 2.1
Identities = 34/161 (21%), Positives = 51/161 (31%), Gaps = 39/161 (24%)
Query: 37 ILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGFPQKSIDPQ--KASKIPLWARMIGNLY 94
I LG S GG +A A +PDRV+ L+ P S + K + L
Sbjct: 68 IWLGWSLGGLVALHIAATHPDRVRALVTVA--SSPCFSAREDWPEGIKPDVLTGFQQQLS 125
Query: 95 KNFNPLWPVRFVG------PLGQWVVEKMRPDLPKKFTPVLKEDSSAITEYIFQCNVQAP 148
++ RF+ P + ++ L + TP NVQ
Sbjct: 126 DDYQ-RTIERFLALQTLGTPTARQDARALKQTLLARPTP----------------NVQ-- 166
Query: 149 SGESAFHTLTEGLGYAKRPML-HRVDQLAAHVPVTVIYGSR 188
L GL L + ++ VP +YG
Sbjct: 167 -------VLQAGLEILATVDLRQPLQNIS--VPFLRLYGYL 198
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 29.2 bits (66), Expect = 2.1
Identities = 12/51 (23%), Positives = 24/51 (47%), Gaps = 8/51 (15%)
Query: 249 TKAVKPPKEPQEPEEEVKEEE--------ERKKEEEKKKEEDGQHQQDKAR 291
K + +E + EE+ EEE E K + +K+ E+ + Q++ +
Sbjct: 98 KKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILK 148
>gnl|CDD|217536 pfam03403, PAF-AH_p_II, Platelet-activating factor acetylhydrolase,
isoform II. Platelet-activating factor acetylhydrolase
(PAF-AH) is a subfamily of phospholipases A2,
responsible for inactivation of platelet-activating
factor through cleavage of an acetyl group. Three known
PAF-AHs are the brain heterotrimeric PAF-AH Ib, whose
catalytic beta and gamma subunits are aligned in
pfam02266, the extracellular, plasma PAF-AH (pPAF-AH),
and the intracellular PAF-AH isoform II (PAF-AH II).
This family aligns pPAF-AH and PAF-AH II, whose
similarity was previously noted.
Length = 372
Score = 29.7 bits (67), Expect = 2.1
Identities = 12/35 (34%), Positives = 17/35 (48%), Gaps = 1/35 (2%)
Query: 37 ILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGFP 71
++GHSFGG ++ R + I D W FP
Sbjct: 224 AVIGHSFGGATVIQ-SLSEDTRFRCGIALDAWMFP 257
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 30.0 bits (67), Expect = 2.2
Identities = 12/40 (30%), Positives = 19/40 (47%)
Query: 251 AVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
PKE ++ E K E E+ + E KK +E+ +D
Sbjct: 265 DTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHK 304
>gnl|CDD|221586 pfam12457, TIP_N, Tuftelin interacting protein N terminal. This
domain family is found in eukaryotes, and is typically
between 99 and 114 amino acids in length. The family is
found in association with pfam08697, pfam01585. There
are two completely conserved residues (G and F) that may
be functionally important. TIP is involved in enamel
assembly by interacting with one of the major proteins
responsible for biomineralisation of enamel - tuftelin.
Length = 106
Score = 28.1 bits (63), Expect = 2.3
Identities = 8/35 (22%), Positives = 22/35 (62%)
Query: 247 VTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEE 281
V+ + KEP+E ++E + E++ + +E+ +++
Sbjct: 72 VSGGIKQAAKEPEEDDKEDESEDDDESDEDDDEDD 106
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plant species have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site located in the C-terminal
region of P2. TCS inactivates the ribosome by
depurinating a specific adenine in the sarcin-ricin loop
of 28S rRNA.
Length = 103
Score = 28.1 bits (63), Expect = 2.3
Identities = 10/23 (43%), Positives = 15/23 (65%), Gaps = 2/23 (8%)
Query: 260 EPEEEVKEEEERKKEEEKKKEED 282
E K+EE KKEEE+++ +D
Sbjct: 76 AAAAEAKKEE--KKEEEEEESDD 96
>gnl|CDD|115027 pfam06342, DUF1057, Alpha/beta hydrolase of unknown function
(DUF1057). This family consists of several
Caenorhabditis elegans specific proteins of unknown
function. Members of this family have an alpha/beta
hydrolase fold.
Length = 297
Score = 29.3 bits (66), Expect = 2.3
Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 4/51 (7%)
Query: 29 KELKLQE-MILLGHSFGGYLAFAYAIQYPDRVKHLILADPWGF-PQKSIDP 77
+EL+L+ +I++GHS G A A P L++ +P GF K I P
Sbjct: 98 EELELKGKLIIMGHSRGCENALQTATTRP--AHGLVMINPTGFRIHKGIRP 146
>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
structure and biogenesis].
Length = 107
Score = 28.3 bits (63), Expect = 2.4
Identities = 9/41 (21%), Positives = 21/41 (51%)
Query: 255 PKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVTF 295
+ Q +E+ E+ + ++KKK + +++ R+VT
Sbjct: 2 APKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTV 42
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 29.6 bits (67), Expect = 2.4
Identities = 13/35 (37%), Positives = 23/35 (65%)
Query: 249 TKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
V P ++ +E E+E +EEEE ++E E+ + E+G
Sbjct: 339 CTWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373
>gnl|CDD|130895 TIGR01836, PHA_synth_III_C, poly(R)-hydroxyalkanoic acid synthase,
class III, PhaC subunit. This model represents the PhaC
subunit of a heterodimeric form of polyhydroxyalkanoic
acid (PHA) synthase. Excepting the PhaC of Bacillus
megaterium (which needs PhaR), all members require PhaE
(TIGR01834) for activity and are designated class III.
This enzyme builds ester polymers for carbon and energy
storage that accumulate in inclusions, and both this
enzyme and the depolymerase associate with the
inclusions. Class III enzymes polymerize
short-chain-length hydroxyalkanoates [Fatty acid and
phospholipid metabolism, Biosynthesis].
Length = 350
Score = 29.7 bits (67), Expect = 2.4
Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 7/60 (11%)
Query: 32 KLQEMILLGHSFGGYLAFAYAIQYPDRVKHLI-LADPWGFPQKSIDPQKASKIPLWARMI 90
KL ++ LLG GG + YA YPD++K+L+ + P F + + WAR +
Sbjct: 134 KLDQISLLGICQGGTFSLCYAALYPDKIKNLVTMVTPVDFETPG------NMLSNWARHV 187
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 28.6 bits (64), Expect = 2.6
Identities = 9/28 (32%), Positives = 18/28 (64%)
Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEE 281
P + +E +E KE EE++ + KK+++
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKK 119
Score = 28.2 bits (63), Expect = 4.6
Identities = 11/30 (36%), Positives = 20/30 (66%), Gaps = 2/30 (6%)
Query: 254 PPKEPQEPEEEVKEE--EERKKEEEKKKEE 281
KE +E +E +E+ + KKE+++KKE+
Sbjct: 95 SEKEQKEVSKETEEKEAIKAKKEKKEKKEK 124
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 29.0 bits (65), Expect = 2.6
Identities = 13/46 (28%), Positives = 21/46 (45%)
Query: 247 VTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
TTK K + + EEE K E + K EE + + D + ++
Sbjct: 63 ATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQE 108
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 29.1 bits (65), Expect = 2.6
Identities = 9/26 (34%), Positives = 20/26 (76%)
Query: 266 KEEEERKKEEEKKKEEDGQHQQDKAR 291
K++E+R+ E+ +K +ED + +Q++ R
Sbjct: 165 KQKEKRRVEDSQKHKEDRRKKQEEKR 190
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 28.6 bits (64), Expect = 2.8
Identities = 13/55 (23%), Positives = 26/55 (47%)
Query: 235 DTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
D ++E+ T + P + + E++ +E + K E+EK K E + + K
Sbjct: 96 DEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150
>gnl|CDD|236060 PRK07581, PRK07581, hypothetical protein; Validated.
Length = 339
Score = 29.1 bits (66), Expect = 2.9
Identities = 16/61 (26%), Positives = 32/61 (52%), Gaps = 6/61 (9%)
Query: 1 FGRSSRPRFST-DPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRV 59
F + P + D + +L+T E++ E +L +++G S G + +A++YPD V
Sbjct: 95 FNAARFPHVTIYDNVRAQHRLLT--EKFGIE-RLA--LVVGWSMGAQQTYHWAVRYPDMV 149
Query: 60 K 60
+
Sbjct: 150 E 150
>gnl|CDD|131868 TIGR02821, fghA_ester_D, S-formylglutathione hydrolase. This model
describes a protein family from bacteria, yeast, and
human, with a conserved critical role in formaldehyde
detoxification as S-formylglutathione hydrolase (EC
3.1.2.12). Members in eukaryotes such as the human
protein are better known as esterase D (EC 3.1.1.1), an
enzyme with broad specificity, although
S-formylglutathione hydrolase has now been demonstrated
as well [Cellular processes, Detoxification].
Length = 275
Score = 29.0 bits (65), Expect = 2.9
Identities = 17/45 (37%), Positives = 23/45 (51%), Gaps = 8/45 (17%)
Query: 38 LLGHSFGGYLAFAYAIQYPDRVKHL-----ILAD---PWGFPQKS 74
+ GHS GG+ A A++ PDR K + I+A PWG S
Sbjct: 142 ITGHSMGGHGALVIALKNPDRFKSVSAFAPIVAPSRCPWGQKAFS 186
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 28.9 bits (65), Expect = 2.9
Identities = 9/34 (26%), Positives = 22/34 (64%)
Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
KE +E ++ ++++ KK+++K K++ + DK
Sbjct: 76 KEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDK 109
>gnl|CDD|115071 pfam06390, NESP55, Neuroendocrine-specific golgi protein P55
(NESP55). This family consists of several mammalian
neuroendocrine-specific golgi protein P55 (NESP55)
sequences. NESP55 is a novel member of the chromogranin
family and is a soluble, acidic, heat-stable secretory
protein that is expressed exclusively in endocrine and
nervous tissues, although less widely than
chromogranins.
Length = 261
Score = 29.1 bits (64), Expect = 3.0
Identities = 11/45 (24%), Positives = 27/45 (60%)
Query: 248 TTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
+T+ + +E +EPE +++ R EEE++++E+ + Q + +
Sbjct: 192 STQEPESAREGEEPERGPLDKDPRDPEEEEEEKEEEKQQPHRCKP 236
>gnl|CDD|177149 MTH00080, COX2, cytochrome c oxidase subunit II; Provisional.
Length = 231
Score = 28.8 bits (65), Expect = 3.0
Identities = 7/33 (21%), Positives = 15/33 (45%)
Query: 300 IVMVSMMILMTLLTMFLVLFGLGYYFSYLKTHF 332
++ +L ++ +FL L +YF K +
Sbjct: 28 SLLFGEFVLAFVVFLFLYLISNNFYFKSKKIEY 60
>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein. This entry includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 106
Score = 27.7 bits (62), Expect = 3.0
Identities = 7/21 (33%), Positives = 13/21 (61%)
Query: 259 QEPEEEVKEEEERKKEEEKKK 279
E EE + E+E ++E+ +K
Sbjct: 84 LEDREEERLEKEEEREKRARK 104
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 28.7 bits (64), Expect = 3.2
Identities = 9/24 (37%), Positives = 16/24 (66%)
Query: 259 QEPEEEVKEEEERKKEEEKKKEED 282
E +E+ +EEEE +EE ++ E+
Sbjct: 45 DEKQEDDEEEEEEDEEEIEEPEDI 68
Score = 28.3 bits (63), Expect = 3.6
Identities = 14/39 (35%), Positives = 21/39 (53%), Gaps = 2/39 (5%)
Query: 257 EPQEPEEEVKEEEERKKEEEKKKEED--GQHQQDKARSV 293
E +E EEE +E+ K+ EKK D Q D A+++
Sbjct: 76 EDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQNL 114
Score = 27.5 bits (61), Expect = 6.1
Identities = 12/34 (35%), Positives = 19/34 (55%), Gaps = 2/34 (5%)
Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
E E EEE+ E+EE +EEE +++ +K
Sbjct: 65 PEDIEDEEEIVEDEE--EEEEDEEDNVDLKDIEK 96
Score = 27.1 bits (60), Expect = 9.5
Identities = 17/68 (25%), Positives = 36/68 (52%), Gaps = 3/68 (4%)
Query: 224 DRADVF--NKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEER-KKEEEKKKE 280
D+ D+ N+ V D DE+ + + ++ P++ ++ EE V++EEE + EE+
Sbjct: 32 DKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDL 91
Query: 281 EDGQHQQD 288
+D + +
Sbjct: 92 KDIEKKNI 99
>gnl|CDD|220223 pfam09405, Btz, CASC3/Barentsz eIF4AIII binding. This domain is
found on CASC3 (cancer susceptibility candidate gene 3
protein) which is also known as Barentsz (Btz). CASC3 is
a component of the EJC (exon junction complex) which is
a complex that is involved in post-transcriptional
regulation of mRNA in metazoa. The complex is formed by
the association of four proteins (eIF4AIII, Barentsz,
Mago, and Y14), mRNA, and ATP. This domain wraps around
eIF4AIII and stacks against the 5' nucleotide.
Length = 116
Score = 27.8 bits (62), Expect = 3.3
Identities = 9/24 (37%), Positives = 16/24 (66%)
Query: 259 QEPEEEVKEEEERKKEEEKKKEED 282
EP E +++E +++E EK +ED
Sbjct: 14 AEPTEPKEDKERKRREHEKYDDED 37
>gnl|CDD|131480 TIGR02427, protocat_pcaD, 3-oxoadipate enol-lactonase. Members of
this family are 3-oxoadipate enol-lactonase. Note that
the substrate is known as 3-oxoadipate enol-lactone,
2-oxo-2,3-dihydrofuran-5-acetate,
4,5-Dihydro-5-oxofuran-2-acetate, and
5-oxo-4,5-dihydrofuran-2-acetate. The enzyme the
catalyzes the fourth step in the protocatechuate
degradation to beta-ketoadipate and then to succinyl-CoA
and acetyl-CoA. 4-hydroxybenzoate, 3-hydroxybenzoate,
and vanillate all can be converted in one step to
protocatechuate. This enzyme also acts in catechol
degradation. In genomes that catabolize both catechol
and protocatechuate, two forms of this enzyme may be
found. All members of the seed alignment for this model
were chosen from within protocatechuate degradation
operons of at least three genes of the pathway, from
genomes with the complete pathway through
beta-ketoadipate [Energy metabolism, Other].
Length = 251
Score = 28.9 bits (65), Expect = 3.3
Identities = 19/65 (29%), Positives = 31/65 (47%), Gaps = 4/65 (6%)
Query: 2 GRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVKH 61
G S P P ++E L + L ++ + G S GG +A A + PDRV+
Sbjct: 51 GLSDAPE---GPYSIE-DLADDVLALLDHLGIERAVFCGLSLGGLIAQGLAARRPDRVRA 106
Query: 62 LILAD 66
L+L++
Sbjct: 107 LVLSN 111
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A peptide
from the central, syntaxin binding domain of synaphin
competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal exocytosis
by promoting interaction between the complementary
syntaxin and synaptobrevin transmembrane regions that
reside in opposing membranes prior to fusion.
Length = 139
Score = 28.3 bits (63), Expect = 3.5
Identities = 13/38 (34%), Positives = 23/38 (60%), Gaps = 4/38 (10%)
Query: 256 KEPQEPEEEVKEEEERKKEEEKKKEED----GQHQQDK 289
+E +E +E ++E EE +K + +K EE+ Q +DK
Sbjct: 34 EEDEEIQEALREAEEERKAKHRKMEEEREVMRQGIRDK 71
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 29.4 bits (66), Expect = 3.6
Identities = 17/55 (30%), Positives = 30/55 (54%), Gaps = 1/55 (1%)
Query: 238 TLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
TLSD + V K P E +E + E +++E +K ++KK+ G + KA++
Sbjct: 170 TLSDSDQEAVLPLVKKSPSEAKEEKAEERKQESKKG-AKRKKDASGDDKSKKAKT 223
>gnl|CDD|179147 PRK00870, PRK00870, haloalkane dehalogenase; Provisional.
Length = 302
Score = 28.8 bits (65), Expect = 3.7
Identities = 21/65 (32%), Positives = 34/65 (52%), Gaps = 2/65 (3%)
Query: 1 FGRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
FGRS +P D T R V + W ++L L ++ L+ +GG + A ++PDR
Sbjct: 84 FGRSDKPTRRED-YTYARH-VEWMRSWFEQLDLTDVTLVCQDWGGLIGLRLAAEHPDRFA 141
Query: 61 HLILA 65
L++A
Sbjct: 142 RLVVA 146
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 28.9 bits (64), Expect = 3.8
Identities = 16/50 (32%), Positives = 31/50 (62%), Gaps = 5/50 (10%)
Query: 247 VTTKAVKPPKEP-----QEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
V KA KP +E QE +E K EE++ +EE++++E + + +Q++ +
Sbjct: 190 VQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQ 239
Score = 28.5 bits (63), Expect = 5.2
Identities = 13/31 (41%), Positives = 19/31 (61%), Gaps = 1/31 (3%)
Query: 258 PQEPEEEVKEEEERKKEEEKKKEEDGQHQQD 288
E ++ +E+EE + EEE K+EE GQ D
Sbjct: 214 KTEEKQPQEEQEEEEVEEEAKQEE-GQGTDD 243
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 28.9 bits (65), Expect = 3.8
Identities = 14/35 (40%), Positives = 20/35 (57%), Gaps = 7/35 (20%)
Query: 254 PPKEPQEPEEEVKEEEE-------RKKEEEKKKEE 281
P +E +EPEE +E EE RK+ E+ KE+
Sbjct: 150 PEEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQ 184
>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
Length = 869
Score = 29.2 bits (66), Expect = 3.8
Identities = 14/49 (28%), Positives = 24/49 (48%), Gaps = 3/49 (6%)
Query: 239 LSDEKLDIVTTKAVKPPKEPQE---PEEEVKEEEERKKEEEKKKEEDGQ 284
L+ K+ + P +EP+E E E + E+ K E++ E+ GQ
Sbjct: 817 LTTYKVKTINELEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDTGQ 865
>gnl|CDD|178633 PLN03084, PLN03084, alpha/beta hydrolase fold protein; Provisional.
Length = 383
Score = 28.7 bits (64), Expect = 4.1
Identities = 20/67 (29%), Positives = 33/67 (49%)
Query: 1 FGRSSRPRFSTDPETVERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVK 60
FG S +P+ + V+S+E ELK ++ L+ + YA +PD++K
Sbjct: 164 FGFSDKPQPGYGFNYTLDEYVSSLESLIDELKSDKVSLVVQGYFSPPVVKYASAHPDKIK 223
Query: 61 HLILADP 67
LIL +P
Sbjct: 224 KLILLNP 230
>gnl|CDD|219038 pfam06459, RR_TM4-6, Ryanodine Receptor TM 4-6. This region covers
TM regions 4-6 of the ryanodine receptor 1 family.
Length = 217
Score = 28.5 bits (64), Expect = 4.2
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 7/41 (17%)
Query: 254 PPKEPQEPE-------EEVKEEEERKKEEEKKKEEDGQHQQ 287
P+EP PE E +E EE K E EK EDG+ +
Sbjct: 74 EPEEPPTPEGAPAEKRELKEEGEEPKSEPEKADLEDGEKRT 114
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 28.6 bits (64), Expect = 4.2
Identities = 5/36 (13%), Positives = 20/36 (55%), Gaps = 1/36 (2%)
Query: 255 PKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
++P++ +E E ++ ++K++ E + +++
Sbjct: 71 DQKPEDEKELSASSLEAEQAKKKEEAE-AKKKKEME 105
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 27.6 bits (62), Expect = 4.4
Identities = 18/74 (24%), Positives = 32/74 (43%), Gaps = 9/74 (12%)
Query: 223 ADRADVFNKMVN--DTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKE 280
+ VN + ++D + V+P E +E E +R ++EEKK +
Sbjct: 51 ERKRSENAGKVNIGGGLFVDQAEVDAIARSRVQPVL--DEIDERA--EAQRARDEEKKLD 106
Query: 281 ED---GQHQQDKAR 291
E+ QH++ K R
Sbjct: 107 EEEAKRQHEEAKER 120
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 29.0 bits (65), Expect = 4.4
Identities = 11/28 (39%), Positives = 16/28 (57%), Gaps = 2/28 (7%)
Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEE 281
PP EP EPEEE +E + + + +E
Sbjct: 278 PPPEPPEPEEE--PDEPDQTDPDDGEET 303
>gnl|CDD|114761 pfam06057, VirJ, Bacterial virulence protein (VirJ). This family
consists of several bacterial VirJ virulence proteins.
VirJ is thought to be involved in the type IV secretion
system. It is thought that the substrate proteins
localised to the periplasm may associate with the pilus
in a manner that is mediated by VirJ, and suggest a
two-step process for type IV secretion in
Agrobacterium.
Length = 192
Score = 28.5 bits (64), Expect = 4.4
Identities = 18/50 (36%), Positives = 27/50 (54%), Gaps = 2/50 (4%)
Query: 12 DPETVERQLVTSIEEWRKELKLQEMILLGHSFGG-YLAFAYAIQYPDRVK 60
PE V L I+ +RK K++ ++L+G+SFG L AY + P K
Sbjct: 46 TPEEVADDLDRIIDTYRKRWKVKNVVLIGYSFGADVLPAAYN-RLPPATK 94
>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain.
Length = 225
Score = 28.6 bits (64), Expect = 4.5
Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 9/48 (18%)
Query: 249 TKAVKPPKEPQEPEEEVKEE-----EERKKEEEKKKEEDGQHQQDKAR 291
++A + E E + +E EER + EKKKEE +KA+
Sbjct: 101 SQADRVEGEEPESIRKWRERRDLRIEERDEASEKKKEE----LIEKAQ 144
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 28.8 bits (65), Expect = 4.5
Identities = 20/81 (24%), Positives = 35/81 (43%), Gaps = 12/81 (14%)
Query: 222 YADRADVFNKMVN--DTCT---LSDEKL-------DIVTTKAVKPPKEPQEPEEEVKEEE 269
D + N ++ D LS E L + K +K +E ++ E + K+EE
Sbjct: 231 LEDLLPLLNLVLYLVDKLANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEE 290
Query: 270 ERKKEEEKKKEEDGQHQQDKA 290
++K+E E K + +Q K
Sbjct: 291 KKKEEREAKLAKLSPEEQRKL 311
>gnl|CDD|183178 PRK11525, dinD, DNA-damage-inducible protein D; Provisional.
Length = 279
Score = 28.5 bits (64), Expect = 4.6
Identities = 12/29 (41%), Positives = 17/29 (58%)
Query: 255 PKEPQEPEEEVKEEEERKKEEEKKKEEDG 283
P+E PEE +K+ E R K EKK ++
Sbjct: 251 PEELPTPEESIKKLERRLKSTEKKLLKNP 279
>gnl|CDD|201483 pfam00873, ACR_tran, AcrB/AcrD/AcrF family. Members of this family
are integral membrane proteins. Some are involved in drug
resistance. AcrB cooperates with a membrane fusion
protein, AcrA, and an outer membrane channel TolC. The
structure shows the AcrB forms a homotrimer.
Length = 1021
Score = 28.8 bits (65), Expect = 4.6
Identities = 9/20 (45%), Positives = 14/20 (70%)
Query: 298 LGIVMVSMMILMTLLTMFLV 317
LGIV+ +I T+LT+ +V
Sbjct: 994 LGIVVFGGLITSTVLTLLVV 1013
>gnl|CDD|203773 pfam07819, PGAP1, PGAP1-like protein. The sequences found in this
family are similar to PGAP1. This is an endoplasmic
reticulum membrane protein with a catalytic serine
containing motif that is conserved in a number of
lipases. PGAP1 functions as a GPI inositol-deacylase;
this deacylation is important for the efficient
transport of GPI-anchored proteins from the endoplasmic
reticulum to the Golgi body.
Length = 225
Score = 28.5 bits (64), Expect = 4.7
Identities = 18/69 (26%), Positives = 25/69 (36%), Gaps = 15/69 (21%)
Query: 37 ILLGHSFGGYLA---FAYAIQYPDRVKHLI-LADPWGFPQKSIDPQKASKIPLWARMIGN 92
IL+GHS GG +A PD V ++ L+ P P + D +
Sbjct: 88 ILIGHSMGGLVARAALTLPNYIPDSVNTIVTLSSPHAGPPLTFDGD-----------LLR 136
Query: 93 LYKNFNPLW 101
Y N W
Sbjct: 137 FYALLNEYW 145
>gnl|CDD|183863 PRK13107, PRK13107, preprotein translocase subunit SecA; Reviewed.
Length = 908
Score = 28.8 bits (64), Expect = 4.7
Identities = 11/31 (35%), Positives = 20/31 (64%)
Query: 260 EPEEEVKEEEERKKEEEKKKEEDGQHQQDKA 290
+ + +V+E E R++EE+ K + D QH +A
Sbjct: 832 QAQSDVEEMEARRREEDAKIQRDYQHAAAEA 862
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 28.8 bits (64), Expect = 4.8
Identities = 12/43 (27%), Positives = 24/43 (55%), Gaps = 2/43 (4%)
Query: 250 KAVKPPKEPQEPEEEVKEEEERKK--EEEKKKEEDGQHQQDKA 290
+ K + +E + + E+ER K E+E+ K ++ Q Q ++A
Sbjct: 81 RKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEA 123
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 29.0 bits (65), Expect = 5.1
Identities = 12/61 (19%), Positives = 33/61 (54%), Gaps = 2/61 (3%)
Query: 234 NDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDG--QHQQDKAR 291
N+ L +EKL ++ ++ ++ +E E +E+ +E+ +++++ + ++ KA
Sbjct: 2556 NEIKELENEKLKLLEKAKIEESRKERERIESETQEDNTDEEQINRQQQERLQKEEEQKAY 2615
Query: 292 S 292
S
Sbjct: 2616 S 2616
>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and L12p.
Ribosomal proteins P1 and P2 are the eukaryotic proteins
that are functionally equivalent to bacterial L7/L12.
L12p is the archaeal homolog. Unlike other ribosomal
proteins, the archaeal L12p and eukaryotic P1 and P2 do
not share sequence similarity with their bacterial
counterparts. They are part of the ribosomal stalk
(called the L7/L12 stalk in bacteria), along with 28S
rRNA and the proteins L11 and P0 in eukaryotes (23S
rRNA, L11, and L10e in archaea). In bacterial ribosomes,
L7/L12 homodimers bind the extended C-terminal helix of
L10 to anchor the L7/L12 molecules to the ribosome.
Eukaryotic P1/P2 heterodimers and archaeal L12p
homodimers are believed to bind the L10 equivalent
proteins, eukaryotic P0 and archaeal L10e, in a similar
fashion. P1 and P2 (L12p, L7/L12) are the only proteins
in the ribosome to occur as multimers, always appearing
as sets of dimers. Recent data indicate that most
archaeal species contain six copies of L12p (three
homodimers), while eukaryotes have two copies each of P1
and P2 (two heterodimers). Bacteria may have four or six
copies (two or three homodimers), depending on the
species. As in bacteria, the stalk is crucial for
binding of initiation, elongation, and release factors
in eukaryotes and archaea.
Length = 105
Score = 27.2 bits (60), Expect = 5.1
Identities = 12/41 (29%), Positives = 20/41 (48%)
Query: 242 EKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEED 282
E + A P E ++ EE K+EEE++++ED
Sbjct: 59 ELMSSQAAAAAAPAATAAATAEPAEKAEEAKEEEEEEEDED 99
>gnl|CDD|240536 cd13131, MATE_NorM_like, Subfamily of the multidrug and toxic
compound extrusion (MATE)-like proteins similar to
Vibrio cholerae NorM. The integral membrane proteins
from the MATE family are involved in exporting
metabolites across the cell membrane and are responsible
for multidrug resistance (MDR) in many bacteria and
animals. This subfamily includes Vibrio cholerae NorM
and functions most likely as a multidrug efflux pump,
removing xenobiotics from the interior of the cell. The
pump utilizes a cation gradient across the membrane to
facilitate the export process. NorM appears to bind
monovalent cations in an outward-facing conformation and
may subsequently cycle through an inward-facing and
outward-facing conformation to capture and release its
substrate.
Length = 435
Score = 28.6 bits (65), Expect = 5.2
Identities = 10/42 (23%), Positives = 19/42 (45%), Gaps = 3/42 (7%)
Query: 283 GQHQQDKARSVTFMTLGI-VMVS--MMILMTLLTMFLVLFGL 321
G + + R L + +++S +M+L+ L L L G
Sbjct: 71 GAGDRRRIRRAVRQGLWLALLLSLPLMLLLWLAPPILPLMGQ 112
>gnl|CDD|216672 pfam01738, DLH, Dienelactone hydrolase family.
Length = 215
Score = 28.1 bits (63), Expect = 5.2
Identities = 14/54 (25%), Positives = 26/54 (48%), Gaps = 2/54 (3%)
Query: 180 PVTVIYGSR-SWVDNSSGDKIKEA-RSQNSFVQVKSVTGAGHHVYADRADVFNK 231
P+ +G +V + D ++EA R+ N ++ GAGH AD + ++
Sbjct: 145 PILAHFGEEDPFVPAEARDLLEEALRAANVDHELHVYPGAGHAFAADGRESYDA 198
>gnl|CDD|238382 cd00741, Lipase, Lipase. Lipases are esterases that can
hydrolyze long-chain acyl-triglycerides into di- and
monoglycerides, glycerol, and free fatty acids at a
water/lipid interface. A typical feature of lipases is
"interfacial activation", the process of becoming
active at the lipid/water interface, although several
examples of lipases have been identified that do not
undergo interfacial activation . The active site of a
lipase contains a catalytic triad consisting of Ser -
His - Asp/Glu, but unlike most serine proteases, the
active site is buried inside the structure. A "lid" or
"flap" covers the active site, making it inaccessible
to solvent and substrates. The lid opens during the
process of interfacial activation, allowing the lipid
substrate access to the active site.
Length = 153
Score = 27.9 bits (62), Expect = 5.4
Identities = 11/75 (14%), Positives = 25/75 (33%), Gaps = 9/75 (12%)
Query: 18 RQLVTSIEEWRKELKLQEMILLGHSFGG----YLAFAYAIQYPDRVKHLILADP-----W 68
++ ++ + ++ + GHS GG + R+ + P
Sbjct: 12 NLVLPLLKSALAQYPDYKIHVTGHSLGGALAGLAGLDLRGRGLGRLVRVYTFGPPRVGNA 71
Query: 69 GFPQKSIDPQKASKI 83
F + +DP A +
Sbjct: 72 AFAEDRLDPSDALFV 86
>gnl|CDD|204918 pfam12436, USP7, Ubiquitin-specific protease 7. This domain family
is found in eukaryotes, and is approximately 40 amino
acids in length. The family is found in association with
pfam00443, pfam00917. USP7 regulates the turnover of
p53.
Length = 35
Score = 25.5 bits (57), Expect = 5.4
Identities = 8/19 (42%), Positives = 13/19 (68%)
Query: 268 EEERKKEEEKKKEEDGQHQ 286
EEER++ E ++KE + H
Sbjct: 14 EEEREERERRRKEREEAHL 32
>gnl|CDD|240542 cd13137, MATE_NorM_like, Subfamily of the multidrug and toxic
compound extrusion (MATE)-like proteins similar to
Thermotoga marina NorM. The integral membrane proteins
from the MATE family are involved in exporting
metabolites across the cell membrane and are responsible
for multidrug resistance (MDR) in many bacteria and
animals. A number of family members are involved in the
synthesis of peptidoglycan components in bacteria.
Length = 432
Score = 28.6 bits (65), Expect = 5.6
Identities = 10/38 (26%), Positives = 16/38 (42%), Gaps = 4/38 (10%)
Query: 283 GQHQQDKARSVTFMTLGIVMVSMMILMTLLTMFLVLFG 320
G D+AR L + ++ M LL + L +F
Sbjct: 294 GAGDPDEARRYGKRALRLG----VLFMGLLGLLLFIFA 327
>gnl|CDD|198151 smart01083, Cir_N, N-terminal domain of CBF1 interacting
co-repressor CIR. This is a 45 residue conserved region
at the N-terminal end of a family of proteins referred
to as CIRs (CBF1-interacting co-repressors). CBF1
(centromere-binding factor 1) acts as a transcription
factor that causes repression by binding specifically to
GTGGGAA motifs in responsive promoters, and it requires
CIR as a co-repressor. CIR binds to histone deacetylase
and to SAP30 and serves as a linker between CBF1 and the
histone deacetylase complex.
Length = 37
Score = 25.2 bits (56), Expect = 5.7
Identities = 8/19 (42%), Positives = 14/19 (73%)
Query: 263 EEVKEEEERKKEEEKKKEE 281
+ V + E++ +EE+KK EE
Sbjct: 11 KRVWKAEQKAEEEKKKIEE 29
Score = 25.2 bits (56), Expect = 6.4
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 262 EEEVKEEEERKKEEEKKKE 280
+ E K EEE+KK EE++KE
Sbjct: 15 KAEQKAEEEKKKIEERRKE 33
>gnl|CDD|214445 MTH00166, ND6, NADH dehydrogenase subunit 6; Provisional.
Length = 160
Score = 27.8 bits (63), Expect = 5.7
Identities = 10/36 (27%), Positives = 20/36 (55%), Gaps = 1/36 (2%)
Query: 294 TFMTLGIVMVSMMILMTLLT-MFLVLFGLGYYFSYL 328
F+ + M ++I+ T+L + L L ++FSY+
Sbjct: 16 LFLNHPLSMGLILIIQTILISLLLGLMMKSFWFSYI 51
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 28.5 bits (63), Expect = 6.0
Identities = 16/55 (29%), Positives = 25/55 (45%)
Query: 232 MVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQ 286
+VN+ L E K + EEE E++ RK+ + KK +ED + Q
Sbjct: 99 VVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQ 153
>gnl|CDD|227257 COG4920, COG4920, Predicted membrane protein [Function unknown].
Length = 249
Score = 27.9 bits (62), Expect = 6.0
Identities = 19/103 (18%), Positives = 38/103 (36%), Gaps = 11/103 (10%)
Query: 244 LDIVTTKAVKPP----KEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVTFMTLG 299
L IV P QE EE++ E +K+E+ ++ + +FM +
Sbjct: 45 LSIVMGLRSNPYLKGVVSAQEISRARPLYEEKEANELLEKDEELLNEYKRFARASFMPML 104
Query: 300 IVMVSMMILMTLLTMFLVLFG-------LGYYFSYLKTHFVKL 335
++ ++L T L L + F+ ++F +
Sbjct: 105 TMLPYFILLATFLFPTLPPLAENGLGPVVLREFARFLSYFELV 147
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 28.2 bits (63), Expect = 6.1
Identities = 13/47 (27%), Positives = 27/47 (57%), Gaps = 3/47 (6%)
Query: 248 TTKAVKPPKE-PQEPEEEVKEEEERKKEE--EKKKEEDGQHQQDKAR 291
+ + KP + QE ++ +E E KK + ++K+EE+ + Q+ K +
Sbjct: 165 LSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQ 211
Score = 27.8 bits (62), Expect = 8.6
Identities = 10/33 (30%), Positives = 23/33 (69%), Gaps = 2/33 (6%)
Query: 259 QEPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
QE E + +++++K+EEE++K+ + QQ++
Sbjct: 185 QEWELKKLKQQQQKREEERRKQR--KKQQEEEE 215
>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
Length = 112
Score = 27.3 bits (59), Expect = 6.3
Identities = 12/42 (28%), Positives = 27/42 (64%)
Query: 248 TTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDK 289
K + K+P++PE E +++ E+ + E++KK E +++ +K
Sbjct: 59 IKKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENEDEK 100
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 28.5 bits (63), Expect = 6.4
Identities = 12/29 (41%), Positives = 19/29 (65%), Gaps = 4/29 (13%)
Query: 259 QEPEEEVKEEEERKK----EEEKKKEEDG 283
E EE K+ EER+K EE+++K+E+
Sbjct: 214 LELEELKKKREERRKVLEEEEQRRKQEEA 242
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 28.4 bits (63), Expect = 6.5
Identities = 17/81 (20%), Positives = 26/81 (32%), Gaps = 2/81 (2%)
Query: 204 SQNSFVQVKSVTG--AGHHVYADRADVFNKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEP 261
+ S VQV G AD+ + D + + A P+EP
Sbjct: 345 AAESVVQVADRDGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAAL 404
Query: 262 EEEVKEEEERKKEEEKKKEED 282
E +E E + E+ D
Sbjct: 405 ASEAHDETEPEVPEKAAPIPD 425
>gnl|CDD|224978 COG2067, FadL, Long-chain fatty acid transport protein [Lipid
metabolism].
Length = 440
Score = 28.1 bits (63), Expect = 6.6
Identities = 20/63 (31%), Positives = 24/63 (38%), Gaps = 16/63 (25%)
Query: 147 APSGESAFHTLTEGLGYAKRPM--------LHRV-DQLAAHVPVTVIYGSRSWVDNSSGD 197
P G A +T G G + H+V DQ A H GS W D SS D
Sbjct: 263 LPIGPLALGGVTPGSGKLTLSLPASAELSGQHKVADQWAIH-------GSVKWTDWSSFD 315
Query: 198 KIK 200
K+
Sbjct: 316 KLD 318
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 28.0 bits (62), Expect = 6.6
Identities = 14/31 (45%), Positives = 18/31 (58%), Gaps = 2/31 (6%)
Query: 256 KEPQEPEEEVKEE--EERKKEEEKKKEEDGQ 284
K Q+ EEEVKE+ +K K+EEDG
Sbjct: 96 KTKQKVEEEVKEQLQSLLEKIVVSKQEEDGP 126
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 28.4 bits (64), Expect = 6.7
Identities = 7/39 (17%), Positives = 19/39 (48%)
Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARS 292
PP++P+ +++E K++ + + + Q A +
Sbjct: 136 PPEDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEA 174
>gnl|CDD|237418 PRK13534, PRK13534, 7-cyano-7-deazaguanine tRNA-ribosyltransferase;
Provisional.
Length = 639
Score = 28.4 bits (64), Expect = 6.8
Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 6/39 (15%)
Query: 244 LDIVTTKAVKPPKEPQE-PEEEVKEEEERKKEEEKKKEE 281
LDI T PP +E EE+++E ER KE + KE+
Sbjct: 121 LDIPT-----PPDVSREKAEEDLEETLERAKEAIEIKEK 154
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 26.3 bits (58), Expect = 6.8
Identities = 7/26 (26%), Positives = 18/26 (69%)
Query: 257 EPQEPEEEVKEEEERKKEEEKKKEED 282
E +EPE++ +E++ +++ K++D
Sbjct: 46 EGEEPEDDDDDEDDDDDDDKDDKDDD 71
>gnl|CDD|224882 COG1971, COG1971, Predicted membrane protein [Function unknown].
Length = 190
Score = 27.6 bits (62), Expect = 7.0
Identities = 13/71 (18%), Positives = 26/71 (36%), Gaps = 17/71 (23%)
Query: 272 KKEEEKKKEEDGQHQQD---------------KARSVTFMTLGIVMVSMMILMTLLTMFL 316
K EE++ + +H + A V LG+ ++ + + L+T+ L
Sbjct: 90 KNEEDEFVDPAEKHDLNFKELILLAIATSIDALAVGVGLAFLGVNILLAAVAIGLITLIL 149
Query: 317 VLFG--LGYYF 325
G +G
Sbjct: 150 SALGAIIGRKL 160
>gnl|CDD|192955 pfam12214, TPX2_importin, Cell cycle regulated microtubule
associated protein. This domain is found in eukaryotes.
This domain is typically between 127 to 182 amino acids
in length. This domain is found associated with
pfam06886. This domain is found in the protein TPX2
(a.k.a p100) which is involved in cell cycling. It is
only expressed between the start of the S phase and
completion of cytokinesis. The microtubule-associated
protein TPX2 has been reported to be crucial for mitotic
spindle formation. This domain is close to the C
terminal of TPX2. The protein importin alpha regulates
the activity of TPX2 by binding to the nuclear
localisation signal in this domain.
Length = 176
Score = 27.5 bits (61), Expect = 7.2
Identities = 15/72 (20%), Positives = 26/72 (36%), Gaps = 8/72 (11%)
Query: 221 VYADRADVFNKMVNDTCTLSDEKLDIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKE 280
+Y +A NK + + + K PK P E +E E +R +E K +
Sbjct: 38 IYKFKARPLNKKIFEAPSPPLPK--------KPTPKLPTEFQEFHLETSKRAQERSSKAK 89
Query: 281 EDGQHQQDKARS 292
+ +R
Sbjct: 90 TASEQFTFHSRP 101
>gnl|CDD|236833 PRK11059, PRK11059, regulatory protein CsrD; Provisional.
Length = 640
Score = 28.3 bits (64), Expect = 7.4
Identities = 8/31 (25%), Positives = 12/31 (38%)
Query: 303 VSMMILMTLLTMFLVLFGLGYYFSYLKTHFV 333
+ + L+ L MF+ L G F L
Sbjct: 12 SAFVTLLVALAMFVTLLGCTLSFYQLTQEKQ 42
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen
Zg14, is the Rrp7p homolog mainly found in Metazoans.
The cellular function of Rrp7A remains unclear
currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 118
Score = 26.8 bits (60), Expect = 7.4
Identities = 8/32 (25%), Positives = 18/32 (56%)
Query: 249 TKAVKPPKEPQEPEEEVKEEEERKKEEEKKKE 280
+ ++ + EE E + ++KE++KKK+
Sbjct: 54 VTVTRGGRKGKTAREEAVEAKAKEKEKKKKKK 85
Score = 26.8 bits (60), Expect = 8.3
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 246 IVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKE 280
VT K +E E +E+E+KK+++K+ E
Sbjct: 55 TVTRGGRKGKTAREEAVEAKAKEKEKKKKKKKELE 89
>gnl|CDD|198412 cd10430, BI-1, BAX inhibitor (BI)-1. Mammalian members of the BAX
inhibitor (BI)-1 like family of small transmembrane
proteins have been shown to have an antiapoptotic effect
either by stimulating the antiapoptotic function of
Bcl-2, a well-characterized oncogene, or by inhibiting
the proapoptotic effect of Bax, another member of the
Bcl-2 family. Their broad tissue distribution and high
degree of conservation suggests an important regulatory
role. In plants, BI-1 like proteins play a role in
pathogen resistance.
Length = 213
Score = 27.6 bits (62), Expect = 7.6
Identities = 6/31 (19%), Positives = 18/31 (58%)
Query: 290 ARSVTFMTLGIVMVSMMILMTLLTMFLVLFG 320
A+ ++ LG ++ S + ++ L+++ + G
Sbjct: 118 AKRREYLYLGGLLSSALSILLLVSLANIFGG 148
>gnl|CDD|220692 pfam10324, 7TM_GPCR_Srw, Serpentine type 7TM GPCR chemoreceptor
Srw. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srw is a solo family
amongst the superfamilies of chemoreceptors.
Chemoperception is one of the central senses of soil
nematodes like C. elegans which are otherwise 'blind'
and 'deaf'. The genes encoding Srw do not appear to be
under as strong an adaptive evolutionary pressure as
those of Srz.
Length = 317
Score = 27.9 bits (63), Expect = 7.7
Identities = 17/75 (22%), Positives = 25/75 (33%), Gaps = 17/75 (22%)
Query: 276 EKKKEEDGQHQQDKARS---VTFMT---------LGIVMVSMMILMTLLTMFLVLFGLGY 323
+K ++ R+ V FMT LGI + + L+L L
Sbjct: 223 KKSRKNLSSSSNKSDRTTKLVIFMTISFFIAEFPLGISYIIQFFFYDDPGIRLILTYLSV 282
Query: 324 YFSYLKT-----HFV 333
FS L T H +
Sbjct: 283 IFSLLLTINAISHCL 297
>gnl|CDD|214444 MTH00165, ND5, NADH dehydrogenase subunit 5; Provisional.
Length = 573
Score = 28.2 bits (64), Expect = 7.8
Identities = 11/30 (36%), Positives = 18/30 (60%), Gaps = 2/30 (6%)
Query: 302 MVSMMILMTLLTMFLVLFG--LGYYFSYLK 329
M+ + + LLT+F++L G LGY S +
Sbjct: 447 MICLPFYLKLLTLFVILLGGWLGYELSLMS 476
>gnl|CDD|191022 pfam04538, BEX, Brain expressed X-linked like family. This is a
family of transcription elongation factors which
includes those referred to as Bex proteins as well as
those named TCEAL7. Bex1 was shown to be a novel link
between neurotrophin signalling, the cell cycle, and
neuronal differentiation, suggesting it might function
by coordinating internal cellular states with the
ability of cells to respond to external signals. TCEAL7
has been shown negatively to regulate the NF-kappaB
pathway, hence being important in ovarian cancer as it
one of the genes frequently downregulated in this
cancer. A closely related protein, TFIIS/TCEA, found in
pfam07500 is involved in transcription elongation and
transcript fidelity. TFIIS/TCEA promotes 3'
endoribonuclease activity of RNA polymerase II (pol II)
and allows pol II to bypass transcript pause or 'arrest'
during elongation process. It is thus possible that BEX
is also acting in this way.
Length = 97
Score = 26.5 bits (59), Expect = 7.8
Identities = 12/28 (42%), Positives = 19/28 (67%)
Query: 262 EEEVKEEEERKKEEEKKKEEDGQHQQDK 289
E E K E E K+EEEK+ E+G+ ++ +
Sbjct: 7 ENEGKPESEPKEEEEKRPLEEGEGKKPE 34
>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118). This is
the highly conserved family of the major tail subunit
protein.
Length = 121
Score = 27.0 bits (59), Expect = 7.9
Identities = 8/26 (30%), Positives = 20/26 (76%)
Query: 264 EVKEEEERKKEEEKKKEEDGQHQQDK 289
E+KEE ++ + E +K++++ + ++DK
Sbjct: 47 EMKEERDKMETEREKRDKESKEERDK 72
>gnl|CDD|177157 MTH00094, ND4, NADH dehydrogenase subunit 4; Provisional.
Length = 403
Score = 27.9 bits (63), Expect = 8.0
Identities = 8/39 (20%), Positives = 16/39 (41%), Gaps = 3/39 (7%)
Query: 292 SVTFMT--LGIVMVSMMILMTLLTMFLVLFGLGYYFSYL 328
S++F + L I M + +F + +Y+S
Sbjct: 327 SLSFFSEFLIISNSFMKNKFSFFFIFFYFL-VSFYYSLF 364
>gnl|CDD|226584 COG4099, COG4099, Predicted peptidase [General function prediction
only].
Length = 387
Score = 28.0 bits (62), Expect = 8.0
Identities = 21/104 (20%), Positives = 38/104 (36%), Gaps = 15/104 (14%)
Query: 7 PRFSTDPETVERQLVTSIEEWRKELKLQEMI------LLGHSFGGYLAFAYAIQYPDRVK 60
P F+ E L+ I+ + L I ++G S GG+ +A A ++PD
Sbjct: 236 PIFADSEEKTLLYLIEKIDLILEVLASTYNIDRSRIYVIGLSRGGFGTWALAEKFPDFFA 295
Query: 61 HLIL-ADPWGFPQKSIDPQKASKIPLWARMIGNLYKNFNPLWPV 103
+ A + K P+W + + + + PV
Sbjct: 296 AAVPIAGGGDRVYLV---RTLKKAPIWV-----FHSSDDKVIPV 331
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 28.1 bits (63), Expect = 8.1
Identities = 9/32 (28%), Positives = 21/32 (65%)
Query: 260 EPEEEVKEEEERKKEEEKKKEEDGQHQQDKAR 291
+ EE +E+EE++ +E+K+ + Q++K +
Sbjct: 549 DKEELQREKEEKEALKEQKRLRKLKKQEEKKK 580
>gnl|CDD|221747 pfam12740, Chlorophyllase2, Chlorophyllase enzyme. This family
consists of several chlorophyllase and chlorophyllase-2
(EC:3.1.1.14) enzymes. Chlorophyllase (Chlase) is the
first enzyme involved in chlorophyll (Chl) degradation
and catalyzes the hydrolysis of an ester bond to yield
chlorophyllide and phytol. The family includes both
plant and Amphioxus members.
Length = 258
Score = 27.8 bits (62), Expect = 8.1
Identities = 14/39 (35%), Positives = 20/39 (51%), Gaps = 4/39 (10%)
Query: 33 LQEMILLGHSFGGYLAFAYAIQYPDRV----KHLILADP 67
++ L GHS GG +AFA A+ + + LI DP
Sbjct: 90 FSKLALAGHSRGGKVAFALALGNANVLHLKFSALIGVDP 128
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 28.2 bits (64), Expect = 8.2
Identities = 10/33 (30%), Positives = 14/33 (42%)
Query: 71 PQKSIDPQKASKIPLWARMIGNLYKNFNPLWPV 103
P + K LWARM + +F L+ V
Sbjct: 37 PTSASSVPAKEKGALWARMGVDPDHDFEALYEV 69
>gnl|CDD|227502 COG5175, MOT2, Transcriptional repressor [Transcription].
Length = 480
Score = 28.1 bits (62), Expect = 8.3
Identities = 12/30 (40%), Positives = 20/30 (66%)
Query: 255 PKEPQEPEEEVKEEEERKKEEEKKKEEDGQ 284
P+E + E+ +EERK E+++KE +GQ
Sbjct: 72 TLSPEELKMELARKEERKMREKERKEAEGQ 101
>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
Length = 193
Score = 27.4 bits (60), Expect = 8.5
Identities = 9/28 (32%), Positives = 20/28 (71%)
Query: 254 PPKEPQEPEEEVKEEEERKKEEEKKKEE 281
P ++P++ EE++++ E E+E++ EE
Sbjct: 35 PEQKPEDIVEELEDQPEEPPEQEEENEE 62
>gnl|CDD|235293 PRK04375, PRK04375, protoheme IX farnesyltransferase; Provisional.
Length = 296
Score = 27.8 bits (63), Expect = 8.6
Identities = 8/34 (23%), Positives = 17/34 (50%), Gaps = 5/34 (14%)
Query: 296 MTLGIVM--VSMMILMTL---LTMFLVLFGLGYY 324
+ G+V+ + ++L L +L L G+ +Y
Sbjct: 94 LIFGLVLGVLGFLLLGLFVNPLAAWLTLAGIFFY 127
>gnl|CDD|214867 smart00864, Tubulin, Tubulin/FtsZ family, GTPase domain. This
domain is found in all tubulin chains, as well as the
bacterial FtsZ family of proteins. These proteins are
involved in polymer formation. Tubulin is the major
component of microtubules, while FtsZ is the
polymer-forming protein of bacterial cell division, it
is part of a ring in the middle of the dividing cell
that is required for constriction of cell membrane and
cell envelope to yield two daughter cells. FtsZ and
tubulin are GTPases, this entry is the GTPase domain.
FtsZ can polymerise into tubes, sheets, and rings in
vitro and is ubiquitous in bacteria and archaea.
Length = 192
Score = 27.5 bits (62), Expect = 8.8
Identities = 10/34 (29%), Positives = 17/34 (50%)
Query: 12 DPETVERQLVTSIEEWRKELKLQEMILLGHSFGG 45
DPE S++E R+EL+ + + + GG
Sbjct: 61 DPEVGREAAEESLDEIREELEGADGVFITAGMGG 94
>gnl|CDD|236879 PRK11195, PRK11195, lysophospholipid transporter LplT; Provisional.
Length = 393
Score = 27.9 bits (63), Expect = 8.9
Identities = 12/43 (27%), Positives = 20/43 (46%), Gaps = 7/43 (16%)
Query: 290 ARSVTFMTLGIVMVSMMILMTLLT------MFLVLFG-LGYYF 325
+ + GI+M +++LM L L+L G LG +F
Sbjct: 268 ETVLRVLPAGILMGLVVLLMALQHSLLPAYPLLILIGALGGFF 310
>gnl|CDD|184853 PRK14851, PRK14851, hypothetical protein; Provisional.
Length = 679
Score = 27.9 bits (62), Expect = 9.1
Identities = 14/52 (26%), Positives = 22/52 (42%), Gaps = 10/52 (19%)
Query: 112 WVVEKM--------RPDLPKKFTPVLKED--SSAITEYIFQCNVQAPSGESA 153
W V+ + R P+ V D + +YI Q +QAPSG++
Sbjct: 293 WYVQNVLIRNTDQARVREPEPPVSVAHFDSLPQGVGDYILQAGIQAPSGDNV 344
>gnl|CDD|234859 PRK00893, PRK00893, aspartate carbamoyltransferase regulatory
subunit; Reviewed.
Length = 152
Score = 27.1 bits (61), Expect = 9.3
Identities = 9/18 (50%), Positives = 13/18 (72%)
Query: 113 VVEKMRPDLPKKFTPVLK 130
VVEK + +LP++ VLK
Sbjct: 90 VVEKRKVELPEEIEGVLK 107
>gnl|CDD|219340 pfam07224, Chlorophyllase, Chlorophyllase. This family consists of
several plant specific Chlorophyllase proteins
(EC:3.1.1.14). Chlorophyllase (Chlase) is the first
enzyme involved in chlorophyll (Chl) degradation and
catalyzes the hydrolysis of ester bond to yield
chlorophyllide and phytol.
Length = 307
Score = 27.5 bits (61), Expect = 9.4
Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 2/40 (5%)
Query: 30 ELKLQEMILLGHSFGGYLAFAYAIQYPDRVK--HLILADP 67
E L ++ L GHS GG AFA A+ Y V LI DP
Sbjct: 117 EANLSKLALSGHSRGGKTAFALALGYSLDVTFSALIGVDP 156
>gnl|CDD|215869 pfam00344, SecY, SecY translocase.
Length = 340
Score = 27.5 bits (62), Expect = 9.7
Identities = 8/21 (38%), Positives = 12/21 (57%), Gaps = 1/21 (4%)
Query: 294 TFMTLGIV-MVSMMILMTLLT 313
+ LGI ++ I+M LLT
Sbjct: 1 SIFALGIGPYITASIIMQLLT 21
>gnl|CDD|225375 COG2819, COG2819, Predicted hydrolase of the alpha/beta superfamily
[General function prediction only].
Length = 264
Score = 27.4 bits (61), Expect = 9.8
Identities = 15/52 (28%), Positives = 20/52 (38%), Gaps = 1/52 (1%)
Query: 16 VERQLVTSIEEWRKELKLQEMILLGHSFGGYLAFAYAIQYPDRVKHLILADP 67
+ QL IE R + ++GHS GG + YPD L P
Sbjct: 120 LTEQLKPFIEA-RYRTNSERTAIIGHSLGGLFVLFALLTYPDCFGRYGLISP 170
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 27.7 bits (62), Expect = 9.8
Identities = 9/49 (18%), Positives = 25/49 (51%), Gaps = 1/49 (2%)
Query: 241 DEKL-DIVTTKAVKPPKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQD 288
++++ + P K E +++ + R+K++EKK + + + Q+
Sbjct: 347 EKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 26.5 bits (58), Expect = 9.9
Identities = 13/40 (32%), Positives = 22/40 (55%), Gaps = 3/40 (7%)
Query: 255 PKEPQEPEEEVKEEEERKKEEEKKKEEDGQHQQDKARSVT 294
+ E EEEV E++E EEE + EE+ ++++ T
Sbjct: 47 GDDEMEEEEEVDEDDE---EEEGEGEEEEGEEEEETEGAT 83
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,280,156
Number of extensions: 1665371
Number of successful extensions: 6984
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6139
Number of HSP's successfully gapped: 470
Length of query: 335
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 238
Effective length of database: 6,635,264
Effective search space: 1579192832
Effective search space used: 1579192832
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.3 bits)