RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14715
(1147 letters)
>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases, insulinase-like
[General function prediction only].
Length = 978
Score = 706 bits (1824), Expect = 0.0
Identities = 333/1002 (33%), Positives = 523/1002 (52%), Gaps = 70/1002 (6%)
Query: 149 GFLVKNVTPIPEFQMTAIKLQHVKTLAEYFHLSRDDSNNVFAVAFRTPPPDSTGITHILE 208
GF V + +PE L+H KT AE H+ +D NNVF++AF+T P DSTG+ HILE
Sbjct: 5 GFKVISSEDLPEISSKGYILEHEKTGAELAHIKNEDPNNVFSIAFKTEPHDSTGVAHILE 64
Query: 209 HLSLCGSVKYPCRDPFMKMLTRSMATFMNAMTGPDYTFYPFSSQNHCDYFNLMSIYLDAV 268
H LCGS KYP +DPF KML RS+ TF+NA T PD T YP SS N D++NL+S+YLDAV
Sbjct: 65 HTVLCGSKKYPVKDPFFKMLKRSLNTFLNAFTFPDKTVYPASSANEKDFYNLLSVYLDAV 124
Query: 269 FNPQLKQLDFMQEGWRLEHEDIKDQNSPIIFKGVVFNEMKGAFSDNSYIFGEALMNNILP 328
F+P L + F+QEGWR+E +D S + +KGVV+NEMKGA+S + A+ ++ P
Sbjct: 125 FHPLLTKESFLQEGWRIEFKDE----SNLKYKGVVYNEMKGAYSSGESVLSRAMQQSLFP 180
Query: 329 TYCYKHVSGGDPIKILNLKYENLVNYHKKHYHPTNSKFFSYGNFNLEDHLSFINTNYLSK 388
Y SGGDP I +L YE +HKKHYHP+N K F YGN E L FI L
Sbjct: 181 GTTYGVNSGGDPKNIPDLTYEEFRAFHKKHYHPSNCKIFVYGNIPTERLLDFIEEKVLR- 239
Query: 389 INPYQHHRSSTAVLPEPAWDKPRQLHIHGRHD-PLASENQSHIAIAYKCAVMDNFKDVFV 447
P+ + + A+ KPR+ + E+Q +++++ + +D
Sbjct: 240 --PFGKRELDVPIPDQKAFKKPRRKVLEYPISFDEEDEDQGLLSLSWLGGSASDAEDSLA 297
Query: 448 LNILGDLLLKGPNAPFYKNLVESGLGLSFSPVTG-YEASIHDTLFTVGLQGVDSNKFDEI 506
L +L ++LL +P + L+ESGLG F+ V+G Y++ + +T+F+V
Sbjct: 298 LEVLEEILLDSAASPLTQALIESGLG--FADVSGSYDSDLKETIFSV------------- 342
Query: 507 IGAVNKTIDEVIAEGFDKERVAIGLQGVDSNKFDEIKGAVNKTIDEVIAEGFDKERVASV 566
GL+GV K ++K V T+ E++ G DK+ + ++
Sbjct: 343 -----------------------GLKGVSEEKIAKLKNLVLSTLKELVKNGIDKKLIEAI 379
Query: 567 LHSLELSLKH-QSSNFGLNLLFWLVPFMNHDCDVIHLLHINDRLNWFKKHIQENPTYLQE 625
LH LE SLK +S FGL L+F + + D L D L ++ +++ P Y ++
Sbjct: 380 LHQLEFSLKEVKSYPFGLGLMFRSLYGWLNGGDPEDSLRFLDYLQNLREKLEKGP-YFEK 438
Query: 626 KVDEYLRNNPHKLIITMSPEKTFDEKLDKVEKDILKDRISQMNDQDLNKVYVNGTELRKE 685
+ +Y +NPH + + + P +EKL+K E+++L+ R S++ D+DL K+ + +L++
Sbjct: 439 LIRKYFLDNPHYVTVIVLPSPELEEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKER 498
Query: 686 QEK---EQNIDVLPTLKISDVDDHVERVVTTDKHILQVPIQLSTQPTNGVTYFRSVVDTS 742
Q++ E+++ LPTLK+ DV D +E+ + + + TNG+TY R D
Sbjct: 499 QDQPDSEEDLATLPTLKLGDVPDPIEKTSLETEVSNEAKVLHHDLFTNGITYLRLYFDLD 558
Query: 743 KLSPELKPLVPLFNYVINQMRTKNYDFREMDQLIHMSTGGISFNSHLGESCSTPNGFEEA 802
L EL P +PLF + + + T+ Y ++E+ I TGGIS + + + +
Sbjct: 559 MLPSELLPYLPLFAFALTNLGTETYSYKELLNQIERHTGGISVSLSVDTDPGDDGEYRPS 618
Query: 803 ILVSSHCLEHNNDKMFDVLSELFNNVQLTDLNRFTTLVNTLSSELINGISGNGHRYAMSI 862
+S L +K+F+++ E+ N D R L+ S+L + + +GH A S+
Sbjct: 619 FSISGKALRSKVEKLFELIREILANTDFHDRERLKELLEQYLSDLTSSVRNSGHSIASSL 678
Query: 863 ASSLVDPVSEQKEIYSGLSFVSKIKEI----AQSPKLENILQDIQSIGAHVLRKDSMRCA 918
A+S + KE+ +GLS V ++E+ ++ + E I +Q++ + + +++R A
Sbjct: 679 ANSRLSSAGALKELLNGLSQVKFLRELSSNFEENFEKE-IADKLQALRKKIFQTNNLRIA 737
Query: 919 LNMSAQSNAPERLES----FLQSIPGDFT--SQPGQTVHSFNVSGIQKVSHVLPFPVNFT 972
+ + + LE+ FL+ + F + P S + P N
Sbjct: 738 I-IGDIDKILDLLENPLLKFLEHLLPGFELPTPPKNPHLDLISSLSEATIIPSPVAYNAL 796
Query: 973 AKSLRGVPFLHKDYVALKVLSKFLTTKYLLREVREKNGAYGAGAVVSP-SGVIQFYSYRD 1031
A S+ G+P+ H DY AL+VLS++L + YL ++REK GAYGA A + GV F SYRD
Sbjct: 797 AFSIGGLPYTHPDYAALQVLSEYLGSGYLWNKIREKGGAYGASASIDANRGVFSFASYRD 856
Query: 1032 PYALETLATFDQSTQFLADTKLSVQDLDEAKLGVFKEVDAPIPPGSKGMSKFLY---GKT 1088
P L+T F +S + LA +DL+EA LG+ +D P P S+G F G T
Sbjct: 857 PNILKTYKVFRKSVKDLASGNFDERDLEEAILGIISTLDTPESPASEGSKSFYRDLSGLT 916
Query: 1089 DEMIEQYRLSVKQVTEDDIRRVADTYLSRDATEKLSSYVVIG 1130
DE + +R + VT++DI+ V D YL ++E S V
Sbjct: 917 DEERQAFRERLLDVTKEDIKEVMDKYLLNFSSEN--SIAVFA 956
>gnl|CDD|240416 PTZ00432, PTZ00432, falcilysin; Provisional.
Length = 1119
Score = 345 bits (886), Expect = e-101
Identities = 270/1153 (23%), Positives = 498/1153 (43%), Gaps = 139/1153 (12%)
Query: 79 SYCRPITDYGSIVYQSAKPSAKGKLNSTHNNFIRLCTGAFRSSPLDSLFCEMAPEKKFIS 138
Y I++ G + K + N +F+ + ++ L S +M +++
Sbjct: 11 GYLNIISNLGKYASSLHNKAEKRRYNVFSEDFL----YSINTTNLYSFTADMNKSPEWVQ 66
Query: 139 HSFEEGAEVEGFLVKNVTPIPEFQMTAIKLQHVKTLAEYFHLSRDDSNNV---FAVAFRT 195
+ E E + +P+F M A H KT + L +DS+ F T
Sbjct: 67 EAMP--VEHEAYEKIEKRYLPDFGMVATVYSHKKTGLQVISLKTNDSSGKEMCFDFYVPT 124
Query: 196 PPPDSTGITHILEHLSLCGSVKYPCRDPFMKMLTRSMATFMNAMTGPDYTFYPFSSQNHC 255
PP + GI HILEH L GS KY +D F ++ +F+NA T D T Y F+S N
Sbjct: 125 PPHNDKGIPHILEHSVLSGSKKYNYKDSFSLLVQGGFNSFLNAYTFKDRTSYLFASTNEK 184
Query: 256 DYFNLMSIYLDAVFNPQLKQLD--FMQEGWRLEHEDIKDQNSP-----------IIFKGV 302
D++N +Y+D+VF P + + F QEGW + +KD + + G+
Sbjct: 185 DFYNTADVYMDSVFQPNILEDKDIFKQEGWHYKVTKLKDDEKNADELGNVHDRHVSYSGI 244
Query: 303 VFNEMKGAFSDNSYIFGEALMNNILPTYCYKHVSGGDPIKILNLKYENLVNYHKKHYHPT 362
V++EMK FSD + N+ + YK+ SGGDP I+ L YE LV ++K +Y P
Sbjct: 245 VYSEMKKRFSDPLSFGYSVIYQNLF-SNVYKYDSGGDPKDIVELTYEELVEFYKTYYGPK 303
Query: 363 NSKFFSYGNFNLEDHLSFINTNYLSKINPYQHHRSSTA--------VLPEPAWDKPRQLH 414
+ + YG ++ + L F++ NYL+K +P S TA +L E DKP+ H
Sbjct: 304 TATVYFYGPNDVTERLEFVD-NYLTK-HPKTGQLSHTAYREDADENLLYEEYKDKPK--H 359
Query: 415 IHGRHDPLASENQSHIAIAY-----KCAVMDNFK------DVFVLNILGDLLLKGPNAPF 463
+ + + E ++ +++++ D K D L +L LLL P +
Sbjct: 360 VKKKFSSHSEEEENLMSVSWLLNPKHNGSKDYDKSLIDPVDYLALLVLNYLLLGTPESVL 419
Query: 464 YKNLVESGLGLSFSPVTGYEASIHDTLFTVGLQGVDSNKFDEIIGAVNKTIDEVIAEGFD 523
YK L++SGLG +G + ++F++GL+G+ K D+V
Sbjct: 420 YKALIDSGLGKKVV-GSGLDDYFKQSIFSIGLKGIKETNE--------KRKDKVHYT--- 467
Query: 524 KERVAIGLQGVDSNKFDEIKGAVNKTIDEVIAEGFDKERVASVLHSLELSLKHQSSNF-- 581
F+++ V + +V+ EGF+K V + L+++E +K +
Sbjct: 468 ---------------FEKV---VLNALTKVVTEGFNKSAVEASLNNIEFVMKELNLGTYP 509
Query: 582 -GLNLLFWLVPFMNHDCDVIHLLHINDRLNWFKKHIQENPTYLQEKVDEYLRNNPHKLII 640
GL L+F + + + D +L LN K I YL++ ++++L NN H++ +
Sbjct: 510 KGLMLIFLMQSRLQYGKDPFEILRFEKLLNELKLRIDNESKYLEKLIEKHLLNNNHRVTV 569
Query: 641 TMS--PEKTFDEKLDKVEKDILKDRISQMNDQDLNKVYVNGTELRKEQEKE---QNIDVL 695
+ ++++ +K+ KD LK+R+S + + ++++ + +KE+E + +++D
Sbjct: 570 HLEAVESSKYEKEFNKLVKDELKERLSHLTKEQVDEMEKAYEKFKKEREADDDPEHLDSF 629
Query: 696 PTLKISDVDDHVERVVT----TDKHILQ-----------VPIQLSTQPTNGVTYFRSVVD 740
P L +SD++ E + T L+ V + + + G+ Y
Sbjct: 630 PILSLSDLNKETEEIPTKLYKLSSDSLKENMDLDSDGGSVTVLVHPIESRGILYLDFAFS 689
Query: 741 TSKLSPELKPLVPLFNYVINQMRTKNYDFREMDQLIHMSTGGISFNSHL-----GESCST 795
L+ + + LF ++ + T E + GG+S ++ +
Sbjct: 690 LDSLTVDELKYLNLFKALLKENGTDKLSSEEFTYKREKNLGGLSASTAFYSETNNLTYDD 749
Query: 796 PNGFEEAILVSSHCLEHNNDKMFDVLSELFNNVQLTDLNRFTTLVNTLSSELINGISGNG 855
P + V + L+H ++M D++ E + ++ + ++ + + S G
Sbjct: 750 PYNGVGYLNVRAKVLKHKVNEMVDIVLEALKDADFSNSKKGVEILKRKINGMKTVFSSKG 809
Query: 856 HRYAMSIASSLVDPVSEQKEIYSGLSFVSKIKE----IAQS--PKLENILQDIQSIGAHV 909
H++A+ S E+ +G S + +KE +A+ K+E+ L +I++ +
Sbjct: 810 HKFALKRMKSKFSVSDYADELVNGYSQLLFLKETLVPLAEKDWSKVESKLNEIRN---KL 866
Query: 910 LRKDSMRCALNMSAQSNAPERL----ESFLQSIPGDFTSQPGQTVHSFNVSGI------Q 959
L ++ +N++ S + L +FL+ + F ++ V + +
Sbjct: 867 LSMKNL--TVNVTGDSELLDSLLDDSTTFLKKLSSTFKENDNKSSDKVWVKEVLDKKLME 924
Query: 960 KVSH----VLPFPVNFTAKSLRGVPFLHKDYV--ALKVLSKFLTTKYLLREVREKNGAYG 1013
V VLP VNF + G F D V + +V+ +L YL + VR GAYG
Sbjct: 925 SVDKNEFIVLPTRVNFV--GMGGKLFDKSDKVDGSFQVIVHYLKNSYLWKTVRMSLGAYG 982
Query: 1014 AGAVVSPSGVIQFYSYRDPYALETLATFDQSTQFL--ADTKLSVQDLDEAKLGVFKEVDA 1071
A + +G + F SY DP +TL + + L A L+ +DL K+G +D
Sbjct: 983 VFADLLYTGHVIFMSYADPNFEKTLEVYKEVASALREAAETLTDKDLLRYKIGKISNIDK 1042
Query: 1072 PIPPGSKGMSKF---LYGKTDEMIEQYRLSVKQVTEDDIRRVADTYLSRDATEKLSSYVV 1128
P+ + ++DE +++R + + T++D R+AD EK+ + VV
Sbjct: 1043 PLHVDELSKLALLRIIRNESDEDRQKFRKDILETTKEDFYRLADLMEKSKEWEKVIA-VV 1101
Query: 1129 IGPKSNNLGDEWK 1141
S+ ++K
Sbjct: 1102 NSKTSDTAPSDFK 1114
>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated. This domain
appears in eukaryotes as well as bacteria and tends to
be found near the C-terminus of the metalloprotease M16C
(pfam05193).
Length = 248
Score = 225 bits (575), Expect = 8e-67
Identities = 82/248 (33%), Positives = 140/248 (56%)
Query: 642 MSPEKTFDEKLDKVEKDILKDRISQMNDQDLNKVYVNGTELRKEQEKEQNIDVLPTLKIS 701
MSP++T +EKL++ E++ L+ + S ++++D K+ G EL + Q +++ LPTL +S
Sbjct: 1 MSPDETLEEKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQSTPEDLSCLPTLTLS 60
Query: 702 DVDDHVERVVTTDKHILQVPIQLSTQPTNGVTYFRSVVDTSKLSPELKPLVPLFNYVINQ 761
D+ + ++ + VP+ TNG+ Y R V D + L EL P +PLF V+ +
Sbjct: 61 DIPRSGDEYPLEEEKVGDVPVLHHDVDTNGIVYVRLVFDLNALPEELLPYLPLFCDVLTK 120
Query: 762 MRTKNYDFREMDQLIHMSTGGISFNSHLGESCSTPNGFEEAILVSSHCLEHNNDKMFDVL 821
+ T+NY +RE+++ I + TGGISF+ + + + +E + ++S L K+F+++
Sbjct: 121 LGTENYSYRELEEEILLHTGGISFSVSVSSNPDDTDKYEPSFVISGKALNSKVPKLFELI 180
Query: 822 SELFNNVQLTDLNRFTTLVNTLSSELINGISGNGHRYAMSIASSLVDPVSEQKEIYSGLS 881
SE+ N D R L+ +S L N I+ +GH YAMS A+S + E+ SGL
Sbjct: 181 SEILNETDFDDEERLKELIRESASRLTNSIADSGHSYAMSRAASYLSIAGAISELLSGLD 240
Query: 882 FVSKIKEI 889
V IKE+
Sbjct: 241 QVKFIKEL 248
>gnl|CDD|218490 pfam05193, Peptidase_M16_C, Peptidase M16 inactive domain.
Peptidase M16 consists of two structurally related
domains. One is the active peptidase, whereas the other
is inactive. The two domains hold the substrate like a
clamp.
Length = 182
Score = 94.9 bits (236), Expect = 3e-22
Identities = 39/184 (21%), Positives = 82/184 (44%), Gaps = 8/184 (4%)
Query: 345 NLKYENLVNYHKKHYHPTNSKFFSYGNFNLEDHLSFINTNYLSKINPYQHHRSSTAVLPE 404
+L E+L +++KKHY P N G+ +L++ L+ Y + E
Sbjct: 1 SLTREDLKDFYKKHYSPDNMVLVVVGDVDLDELLALAEK-YFGDLPASPPKPKPREPPLE 59
Query: 405 PAWDKPRQLHIHGRHDPLASENQSHIAIAYKCAVMDNFKDVFVLNILGDLLLKGPNAPFY 464
P +++ + + Q+ +A+A+ + N D L++L +LL G ++ +
Sbjct: 60 PEELTGKEVVVPDK-----DVPQAKLALAFPGPPLGNDPDSAALDVLAELLGGGASSRLF 114
Query: 465 KNLVESGLGLSFSPVTGYEASIHDTLFTVGLQGVDSNKFDEIIGAVNKTIDEVIAEGFDK 524
+ L E GL++S +++ LF + +D DE+I + + + ++ EG +
Sbjct: 115 QELREK-EGLAYSVSAFFDSYSDTGLFGIYAD-LDPENLDEVIELIFEELKKLAEEGITE 172
Query: 525 ERVA 528
E +
Sbjct: 173 EELE 176
Score = 51.7 bits (124), Expect = 2e-07
Identities = 37/150 (24%), Positives = 63/150 (42%), Gaps = 15/150 (10%)
Query: 928 PERLESFLQSIPGDFTSQPGQTVHSFN------VSGIQKVSHVLPFPVNFTAKSLRGVPF 981
+ L + + GD + P + ++G + V P A + G P
Sbjct: 30 LDELLALAEKYFGDLPASPPKPKPREPPLEPEELTGKEVVVPDKDVPQAKLALAFPGPPL 89
Query: 982 L-HKDYVALKVLSKFL---TTKYLLREVREKNG-AYGAGAVV---SPSGVIQFYSYRDPY 1033
D AL VL++ L + L +E+REK G AY A S +G+ Y+ DP
Sbjct: 90 GNDPDSAALDVLAELLGGGASSRLFQELREKEGLAYSVSAFFDSYSDTGLFGIYADLDPE 149
Query: 1034 AL-ETLATFDQSTQFLADTKLSVQDLDEAK 1062
L E + + + LA+ ++ ++L+ AK
Sbjct: 150 NLDEVIELIFEELKKLAEEGITEEELERAK 179
>gnl|CDD|223685 COG0612, PqqL, Predicted Zn-dependent peptidases [General function
prediction only].
Length = 438
Score = 77.9 bits (192), Expect = 7e-15
Identities = 68/345 (19%), Positives = 125/345 (36%), Gaps = 34/345 (9%)
Query: 188 VFAVAFRTPPPDSTGITHILEHLSLCGSVKYPCRDPFMKMLTRSMATFMNAMTGPDYTFY 247
R P GI H LEH++ G+ P + + + NA T DYT Y
Sbjct: 44 WVKAGSRAEPAGKAGIAHFLEHMAFKGTTGLPSAE-LAEAFEKLGGQL-NAFTSFDYTVY 101
Query: 248 PFSSQNHCDYFN-LMSIYLDAVFNPQLKQLDFMQEGWRLEHEDIKDQNSPIIFKGVVFNE 306
S D + + + D + NP + + +E KGV+ E
Sbjct: 102 YLSVLP--DNLDKALDLLADILLNPTFDEEEVERE------------------KGVILEE 141
Query: 307 MKGAFSDNSYIFGEALMNNILPTYCYKHVSGGDPIKILNLKYENLVNYHKKHYHPTNSKF 366
++ D + E L+ + + G I + E+L ++++K Y P N
Sbjct: 142 IRMRQDDPDDLAFERLLEALYGNHPLGRPILGTEESIEAITREDLKDFYQKWYQPDNMVL 201
Query: 367 FSYGNFNLEDHLSFINTNYLSKINPYQHHRSSTAVLPEPAWDKPRQLHIHGRHDPLASEN 426
G+ + E+ + I Y + + PEP R + ++ P
Sbjct: 202 VVVGDVDAEEVVELIE-KYFGDLPG---AAPPPKIPPEPPLGPERVVRVNDPEQPDLE-- 255
Query: 427 QSHIAIAYKCAVMDNFKDVFVLNILGDLLLKGPNAPFYKNLVESGLGLSFSPVTGYEASI 486
Q+ +A+ Y D+ D L +L LL G ++ ++ L E GL++S + +
Sbjct: 256 QAWLALGYPGPDYDSPDDYAALLLLNGLLGGGFSSRLFQELRE-KRGLAYSVSSFSDFLS 314
Query: 487 HDTLFTVGLQGVDSNKFDEIIGAVNKTIDEVIAEG---FDKERVA 528
LF++ G ++ V + + + F +E +
Sbjct: 315 DSGLFSI-YAGTAPENPEKTAELVEEILKALKKGLKGPFTEEELD 358
Score = 45.5 bits (108), Expect = 1e-04
Identities = 39/226 (17%), Positives = 73/226 (32%), Gaps = 22/226 (9%)
Query: 929 ERLESFLQSIPGDFTSQPGQTVHSFNVSGIQKVSHVLPFPVNFT--AKSLRGVPFLHK-D 985
E +E + +PG + +V+ + A G + D
Sbjct: 214 ELIEKYFGDLPGAAPPPKIPPEPPLGPERVVRVNDPEQPDLEQAWLALGYPGPDYDSPDD 273
Query: 986 YVALKVLSKFL---TTKYLLREVREKNG-AYGAGAVVS------PSGVIQFYSYRDPYAL 1035
Y AL +L+ L + L +E+REK G AY + + + +P
Sbjct: 274 YAALLLLNGLLGGGFSSRLFQELREKRGLAYSVSSFSDFLSDSGLFSIYAGTAPENPEKT 333
Query: 1036 -----ETLATFDQSTQFLADTKLSVQDLDEAKLGVFKEVDAPIPPGSKGMSKFLYGKTDE 1090
E L + + + + +D+P L G +
Sbjct: 334 AELVEEILKALKKGLKGPFTEEELDAAKQLLIGLLLLSLDSPSSIAELLGQYLLLGGSLI 393
Query: 1091 MIEQYRLSVKQVTEDDIRRVADTYLSRDATEKLSSYVVIGPKSNNL 1136
+E+ ++ VT +D+ VA L + + VV+GP+
Sbjct: 394 TLEELLERIEAVTLEDVNAVAKKLL----APENLTIVVLGPEKALK 435
>gnl|CDD|223956 COG1025, Ptr, Secreted/periplasmic Zn-dependent peptidases,
insulinase-like [Posttranslational modification, protein
turnover, chaperones].
Length = 937
Score = 53.5 bits (129), Expect = 6e-07
Identities = 38/180 (21%), Positives = 66/180 (36%), Gaps = 26/180 (14%)
Query: 197 PPDSTGITHILEHLSLCGSVKYPCRDPFMKMLTRSMATFMNAMTGPDYTFYPFSSQNHCD 256
P + G+ H LEH+ GS KYP F + L++ + NA T + T + F +
Sbjct: 60 PEEYPGLAHFLEHMLFMGSEKYPDEGGFSEFLSKHGGSH-NASTAGERTAFYFEV-ENDA 117
Query: 257 YFNLMSIYLDAVFNPQLKQLDFMQEGWRLEHEDIKDQNSPIIFKGVVFNEMKGAFSDNSY 316
+ + D P + +E + V +E + + +
Sbjct: 118 LEGALDRFADFFIEPLFNKEALDRE------------------RNAVNSEFTMNLTSDGW 159
Query: 317 IFGEALMNNILPTYCYKHVSGGDPIKILNLK-----YENLVNYHKKHYHPTNSKFFSYGN 371
+ P + S G+ ++ L+ K + L +H+KHY N K YGN
Sbjct: 160 RMYQVQALTANPGHPLSKFSTGN-LETLSDKPGLVVQQELKEFHEKHYSANNMKLVIYGN 218
>gnl|CDD|216056 pfam00675, Peptidase_M16, Insulinase (Peptidase family M16).
Length = 149
Score = 41.9 bits (99), Expect = 4e-04
Identities = 23/100 (23%), Positives = 42/100 (42%), Gaps = 6/100 (6%)
Query: 197 PPDSTGITHILEHLSLCGSVKYPCRDPFMKMLTRSMATFMNAMTGPDYTFYPFSSQNHCD 256
P D+ G+ H LEH++ G+ KYP D + L + + NA T ++T Y N D
Sbjct: 27 PDDNNGLAHFLEHMAFKGTKKYPSED-LEEELEKLGGSL-NAYTSREHTVYYVEVLN--D 82
Query: 257 YFN-LMSIYLDAVFNPQLKQLDFMQEGWRLEHEDIKDQNS 295
+ D NP + + + +++ ++
Sbjct: 83 DLPKAVDRLADFFLNPLFSPSEV-ERERLVVLYEVEAVDA 121
>gnl|CDD|185056 PRK15101, PRK15101, protease3; Provisional.
Length = 961
Score = 37.6 bits (88), Expect = 0.043
Identities = 20/52 (38%), Positives = 25/52 (48%), Gaps = 2/52 (3%)
Query: 197 PPDSTGITHILEHLSLCGSVKYPCRDPFMKMLTRSMATFMNAMTGPDYT-FY 247
P G+ H LEH+ L GS KYP D + L + + NA T T FY
Sbjct: 80 PDAQQGLAHYLEHMVLMGSKKYPQPDSLAEFLKKHGGSH-NASTASYRTAFY 130
>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
represents the eukaryotic large ribosomal protein P2.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P2 is located in the L12 stalk, with proteins
P1, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers). Bacteria may have four or six copies
of L7/L12 (two or three homodimers) depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plants have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site in the C-terminal region
of P2. TCS inactivates the ribosome by depurinating a
specific adenine in the sarcin-ricin loop of 28S rRNA.
Length = 109
Score = 33.8 bits (78), Expect = 0.11
Identities = 18/57 (31%), Positives = 29/57 (50%), Gaps = 14/57 (24%)
Query: 510 VNKTIDEVIAEGFDKERVAIGLQGVDSNKFDEIKGAVNKTIDEVIAEGFDKERVASV 566
V K + V E D E++ + E++G K ++E+IA G KE++ASV
Sbjct: 23 VKKILGSVGVE-VDDEKL--------NKVISELEG---KDVEELIAAG--KEKLASV 65
>gnl|CDD|227699 COG5412, COG5412, Phage-related protein [Function unknown].
Length = 637
Score = 34.0 bits (78), Expect = 0.58
Identities = 19/87 (21%), Positives = 36/87 (41%), Gaps = 6/87 (6%)
Query: 466 NLVESGLGLSFSPVTGYEASIHDT---LFTVGLQGVDS--NKFDEIIGAVNKTIDEVIAE 520
+L+ S +G++ PV G ++ +T + G + S + ++ + +I
Sbjct: 408 SLIGSKVGMAIIPVLGIITTVMNTIKSTISNGSIAIKSVATRIFSGRSSIFNGVRNLITI 467
Query: 521 GFDKERVAI-GLQGVDSNKFDEIKGAV 546
FD R A GL S F ++ V
Sbjct: 468 VFDGIRGAWGGLTSAVSPIFSKVGNFV 494
>gnl|CDD|233732 TIGR02110, PQQ_syn_pqqF, coenzyme PQQ biosynthesis probable
peptidase PqqF. In a subset of species that make
coenzyme PQQ (pyrrolo-quinoline-quinone), this probable
peptidase is found in the PQQ biosynthesis region and is
thought to act as a protease on PqqA (TIGR02107), a
probable peptide precursor of the coenzyme. PQQ is
required for some glucose dehydrogenases and alcohol
dehydrogenases [Biosynthesis of cofactors, prosthetic
groups, and carriers, Other].
Length = 696
Score = 34.1 bits (78), Expect = 0.59
Identities = 10/29 (34%), Positives = 14/29 (48%)
Query: 202 GITHILEHLSLCGSVKYPCRDPFMKMLTR 230
G+ H LEHL G ++ D M + R
Sbjct: 41 GLAHFLEHLLFLGGERFQGDDRLMPWVQR 69
>gnl|CDD|239248 cd02950, TxlA, TRX-like protein A (TxlA) family; TxlA was
originally isolated from the cyanobacterium
Synechococcus. It is found only in oxygenic
photosynthetic organisms. TRX is a small enzyme that
participate in redox reactions, via the reversible
oxidation of an active site dithiol present in a CXXC
motif. Disruption of the txlA gene suggests that the
protein is involved in the redox regulation of the
structure and function of photosynthetic apparatus. The
plant homolog (designated as HCF164) is localized in
the chloroplast and is involved in the assembly of the
cytochrome b6f complex, which takes a central position
in photosynthetic electron transport.
Length = 142
Score = 31.5 bits (72), Expect = 0.93
Identities = 7/26 (26%), Positives = 13/26 (50%)
Query: 28 KWLGLIVDSKLNWIPQINYLNNKLKL 53
KWL I +++ IP +L+ +
Sbjct: 64 KWLPEIDRYRVDGIPHFVFLDREGNE 89
>gnl|CDD|215730 pfam00118, Cpn60_TCP1, TCP-1/cpn60 chaperonin family. This family
includes members from the HSP60 chaperone family and the
TCP-1 (T-complex protein) family.
Length = 481
Score = 33.0 bits (76), Expect = 1.1
Identities = 11/51 (21%), Positives = 20/51 (39%), Gaps = 1/51 (1%)
Query: 72 ILLKIYNSYCRPITDYGSIVYQSAKPSAKGKLNSTHNNFI-RLCTGAFRSS 121
I LK P++D + A+ S K++S + + +L A
Sbjct: 100 IALKALEELSIPVSDDDEDLLNVARTSLNSKISSRESELLGKLVVDAVLLI 150
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 32.8 bits (74), Expect = 1.2
Identities = 53/235 (22%), Positives = 89/235 (37%), Gaps = 30/235 (12%)
Query: 360 HPTNSKFFSYGNFNLEDHLSFINTNYLSKINPYQHHRSSTAVLPEPAWDKPRQLHIHGRH 419
H TN + S ++ D L I L SS D P I R
Sbjct: 228 HETNKESGSSETIDITDLLDSIPMEQLKVSLKPLVSESSK-------LDAPLAKSIQDRL 280
Query: 420 DPLASENQSHIAIAYKCAVMDNFKDVFVLNILGDLLLKGPNAPFYKNLVESGLGLSFSPV 479
+ A+ Q+ + + +K + N D L+ N +GL SF P
Sbjct: 281 ERQAAYEQTKNDL-------EKWKPIVADNRKSDQLIFPMNETARPVPSNNGLASSFEPR 333
Query: 480 TGYEASIHDTLFTVGLQGVDS-NKFDEIIGAVNKTIDEVIAEG-----------FDKERV 527
T E +H L GL+ + K +E+ A+NK E +AE F +ER
Sbjct: 334 TESERKMHQALLDAGLENESALKKQEEL--ALNKLSVEEVAERTRQLRFMRELMFREERK 391
Query: 528 AIGLQGVDSNKFDEIKGAVNKTIDEVIAEGFDKERVAS-VLHSLE-LSLKHQSSN 580
A + + S + +I+ + +I + D E S +LE ++ +H++++
Sbjct: 392 AKRVAKIKSKTYRKIRKNRKEKEMALIPKSEDLENEKSEEARALERMTQRHKNTS 446
>gnl|CDD|187687 cd09629, DOMON_CIL1_like, DOMON-like domain of Brassica carinata
CIL1 and similar proteins. Brassica carinata CIL1 has
been described as involved in suppression of axillary
meristem development. It contains a single DOMON domain,
the function of which is unclear. Members in this
diverse family of plant proteins may have a cytochrome
b561 domain C-terminal to the DOMON domain, some members
from Arabidopsis have been characterized as
auxin-responsive or auxin-induced proteins. DOMON
domains were initially thought to confer protein-protein
interactions. They were subsequently found as a
heme-binding motif in cellobiose dehydrogenase, an
extracellular fungal oxidoreductase that degrades both
lignin and cellulose, and in ethylbenzene dehydrogenase,
an enzyme that aids in the anaerobic degradation of
hydrocarbons. The domain interacts with sugars in the
type 9 carbohydrate binding modules (CBM9), which are
present in a variety of glycosyl hydrolases, and it can
also be found at the N-terminus of sensor histidine
kinases.
Length = 152
Score = 31.1 bits (71), Expect = 1.6
Identities = 13/30 (43%), Positives = 17/30 (56%)
Query: 173 TLAEYFHLSRDDSNNVFAVAFRTPPPDSTG 202
TL H + + SN+ VAFR PP S+G
Sbjct: 9 TLGASLHWTYNASNSTLDVAFRATPPSSSG 38
>gnl|CDD|116688 pfam08083, PROCN, PROCN (NUC071) domain. The PROCN domain is the
central domain in pre-mRNA splicing factors of PRO8
family.
Length = 408
Score = 31.6 bits (72), Expect = 2.5
Identities = 20/81 (24%), Positives = 39/81 (48%), Gaps = 5/81 (6%)
Query: 532 QGVDSNKFDEIKGAVNKTIDEVIAEGFDKERVASVLHSLELSLKHQSSNFGLNLLFWLVP 591
Q V+S E++ AV K I ++I EG + + ++L L + + +N + W VP
Sbjct: 282 QRVESYYDLELRAAVMKDILDMIPEGLRQNKARTILQHLSEAWRCWKAN-----VPWKVP 336
Query: 592 FMNHDCDVIHLLHINDRLNWF 612
+ I L ++ + +W+
Sbjct: 337 GLPAPIKNIILRYVKAKADWW 357
>gnl|CDD|150004 pfam09167, DUF1942, Domain of unknown function (DUF1942). Members
of this family of bacterial proteins assume a
beta-sandwich structure consisting of two antiparallel
beta-sheets similar to an immunoglobulin-like fold, with
an additional small, antiparallel beta-sheet. The
longer-stranded beta-sheet is made up of four
antiparallel beta-strands. The shorter-stranded
beta-sheet consists of five beta-strands, four of these
beta-strands form an antiparallel beta-sheet. The exact
function of this family of proteins is unkown, though a
putative role includes involvement in host-bacterial
interactions involved in endocytosis or phagocytosis,
possibly during bacterial internalisation.
Length = 127
Score = 30.1 bits (68), Expect = 2.7
Identities = 10/31 (32%), Positives = 16/31 (51%)
Query: 939 PGDFTSQPGQTVHSFNVSGIQKVSHVLPFPV 969
P + G V + VS ++ S V+P+PV
Sbjct: 11 PAELVDAGGTVVQDWTVSDLKPSSDVIPYPV 41
>gnl|CDD|130386 TIGR01319, glmL_fam, conserved hypothetical protein. This small
family includes, so far, an uncharacterized protein from
E. coli O157:H7 and GlmL from Clostridium tetanomorphum
and Clostridium cochlearium. GlmL is located between the
genes for the two subunits, epsilon (GlmE) and sigma
(GlmS), of the coenzyme-B12-dependent glutamate mutase
(methylaspartate mutase), the first enzyme in a pathway
of glutamate fermentation. Members shows significant
sequence similarity to the hydantoinase branch of the
hydantoinase/oxoprolinase family (pfam01968).
Length = 463
Score = 30.8 bits (69), Expect = 5.4
Identities = 21/71 (29%), Positives = 27/71 (38%), Gaps = 16/71 (22%)
Query: 431 AIAYKCAVMDNFKDVFVLNILG----------------DLLLKGPNAPFYKNLVESGLGL 474
AIA D D +++I G D L G PF K VE LG+
Sbjct: 237 AIAEGTDKDDGIGDFILIDIGGATTDVHSAAAGELSKPDTTLHGLEEPFAKRTVEGDLGM 296
Query: 475 SFSPVTGYEAS 485
S ++ EAS
Sbjct: 297 RVSAISLGEAS 307
>gnl|CDD|220425 pfam09822, ABC_transp_aux, ABC-type uncharacterized transport
system. This domain is found in various eukaryotic and
prokaryotic intra-flagellar transport proteins involved
in gliding motility, as well as in several hypothetical
proteins.
Length = 268
Score = 30.0 bits (68), Expect = 6.0
Identities = 29/158 (18%), Positives = 62/158 (39%), Gaps = 24/158 (15%)
Query: 621 TYLQEKVDEYLRNNPHKLIITMS-PEKTFDEKLDKVEKDILKDRISQMNDQDLNKVYVNG 679
++E ++EY +P K+ + P++ E + EK ++ Q+ + + V +
Sbjct: 46 PRVRELLEEYAALSPGKIKVEFIDPDEPPSEAEEAAEKYGIQPVPLQVRENKSSVVTIYF 105
Query: 680 TELRKEQEKEQNIDVLPTLKISDVDDHVERVVTTDKHILQVPIQLSTQPTNGVTYFRSVV 739
+ + +K + I L ++E +T I +V T+P YF
Sbjct: 106 GLVVEYGDKREVIPFLDN----VSLQNLEYELTR--AIQKV-----TRPKKPKVYF---- 150
Query: 740 DTSKLS--PELKPLVP--LFNYVINQMRTKNYDFREMD 773
LS EL + +I+++ + YD ++
Sbjct: 151 ----LSGHGELPDEGGGESLSTLIDELEKEGYDVEPLN 184
>gnl|CDD|176017 cd04052, C2B_Tricalbin-like, C2 domain second repeat present in
Tricalbin-like proteins. 5 to 6 copies of the C2
domain are present in Tricalbin, a yeast homolog of
Synaptotagmin, which is involved in membrane
trafficking and sorting. C2 domains fold into an
8-standed beta-sandwich that can adopt 2 structural
arrangements: Type I and Type II, distinguished by a
circular permutation involving their N- and C-terminal
beta strands. Many C2 domains are Ca2+-dependent
membrane-targeting modules that bind a wide variety of
substances including bind phospholipids, inositol
polyphosphates, and intracellular proteins. Most C2
domain proteins are either signal transduction enzymes
that contain a single C2 domain, such as protein kinase
C, or membrane trafficking proteins which contain at
least two C2 domains, such as synaptotagmin 1.
However, there are a few exceptions to this including
RIM isoforms and some splice variants of
piccolo/aczonin and intersectin which only have a
single C2 domain. C2 domains with a calcium binding
region have negatively charged residues, primarily
aspartates, that serve as ligands for calcium ions.
This cd contains the second C2 repeat, C2B, and has a
type-II topology.
Length = 111
Score = 28.7 bits (65), Expect = 7.0
Identities = 11/22 (50%), Positives = 14/22 (63%)
Query: 46 YLNNKLKLSSNILKMLNNPSWG 67
YLN KL ++ + K NNPSW
Sbjct: 20 YLNGKLVYTTRVKKKTNNPSWN 41
>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
Provisional.
Length = 693
Score = 30.0 bits (69), Expect = 8.5
Identities = 16/72 (22%), Positives = 27/72 (37%), Gaps = 13/72 (18%)
Query: 504 DEIIGAVNKTIDEVIAEGF---DK-ERVAIGLQGVDSNKFDEIKGAVNKTIDEVIAEGFD 559
+E+ AV + + + E + DK ER D IK V + + E D
Sbjct: 239 EELKAAVKELAEAKLKEAYQITDKQER---------EAALDAIKEEVLEALAAEEEEEED 289
Query: 560 KERVASVLHSLE 571
++ + LE
Sbjct: 290 EKEIKEAFKKLE 301
>gnl|CDD|165202 PHA02869, PHA02869, C4L/C10L-like gene family protein; Provisional.
Length = 418
Score = 30.0 bits (67), Expect = 8.8
Identities = 53/226 (23%), Positives = 99/226 (43%), Gaps = 28/226 (12%)
Query: 622 YLQEKVDEYLRNNPHKLIIT-MSPEKTFDEKLDKVEKDILKDRISQMNDQDLNKVYVNGT 680
YL +K++ Y R N +L M E+ ++K + ++ DR + K + NG+
Sbjct: 207 YLGKKINLYDRENDLQLCYCDMVIERMTEDK--EYSLGMISDRSGR-----CIKSHHNGS 259
Query: 681 ELRKEQEKEQNIDVLPTLKISDVDDHVERVVTTD-KHILQVPIQLSTQ----PTNGVTYF 735
+R +E+ + D L +++VD+ + T D KHI+ I T P + V Y
Sbjct: 260 IVRYRKEEYGSFDALCIYNMNEVDE----IWTGDKKHIIWSTIDKKTGTSFIPIDPVLYE 315
Query: 736 RSVVDTSKLSPELKPLVPLFN----YVINQMRTKNYDFREMDQLIHMSTGGISFNSHLGE 791
+ +SK E K L N Y+ + +D LIH I +++ +
Sbjct: 316 KLKAISSKEHKEYKDLRGFCNSRTEYICCSVSKYYFDLPTKTDLIHEVINSIDYDT---K 372
Query: 792 SCSTPNGFEEAILVSSHCLEH-NNDKMFDVLSELFNNVQLTDLNRF 836
S TP+ + I V L + + +++F+++ N+ L + N +
Sbjct: 373 SVGTPDWYTLPIEVKQTILGNMSYEELFNIVR---GNIALEEDNEY 415
>gnl|CDD|218956 pfam06248, Zw10, Centromere/kinetochore Zw10. Zw10 and rough deal
proteins are both required for correct metaphase
check-pointing during mitosis. These proteins bind to
the centromere/kinetochore.
Length = 593
Score = 29.7 bits (67), Expect = 9.9
Identities = 20/79 (25%), Positives = 36/79 (45%), Gaps = 5/79 (6%)
Query: 624 QEKVDEYLRNNPHKLIITMSPEKTFDEKLDKVEKDI---LKDRISQMNDQDLNKVYVNGT 680
+E+V + + + MS + E++ + +DI LK +S QDLN
Sbjct: 27 KEEVRDMINKKYSDFLPNMSSAEDLLEQVSSLSEDISDLLKSEVSNEILQDLNDAVEELA 86
Query: 681 ELRKEQEKEQNIDVLPTLK 699
EL+ ++ E+ I L L+
Sbjct: 87 ELK--RQLEEKILGLKILQ 103
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.135 0.397
Gapped
Lambda K H
0.267 0.0854 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 58,550,932
Number of extensions: 5846696
Number of successful extensions: 5361
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5327
Number of HSP's successfully gapped: 44
Length of query: 1147
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 1040
Effective length of database: 6,191,724
Effective search space: 6439392960
Effective search space used: 6439392960
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 65 (28.6 bits)