RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7302
(741 letters)
>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
synthesized as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. Alignment contains also inactive enzymes
that have substitutions of the catalytic triad residues.
Length = 232
Score = 266 bits (683), Expect = 2e-84
Identities = 108/233 (46%), Positives = 140/233 (60%), Gaps = 9/233 (3%)
Query: 448 VVGGEDADPAEWCWQVALINSLNQYLCGGALIGTQWVLTAAHCVTNIVRSGDAVYVRVGD 507
+VGG +A + WQV+L + ++ CGG+LI +WVLTAAHCV + S VR+G
Sbjct: 1 IVGGSEAKIGSFPWQVSLQYTGGRHFCGGSLISPRWVLTAAHCVYSSAPS--NYTVRLGS 58
Query: 508 HDLTRKYGSPGAQTLRVATTYIHHNHNSQTLDNDIALLKLHGQAELKDGVCLVCLPARGV 567
HDL+ G Q ++V +H N+N T DNDIALLKL L D V +CLP+ G
Sbjct: 59 HDLSS--NEGGGQVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSSGY 116
Query: 568 NHAAGKRCTVTGYGYMGEAGPIPLRVREAEIPIVSDTECIRKINAVTEKIFILPASSFCA 627
N AG CTV+G+G E GP+P ++E +PIVS+ EC R + + CA
Sbjct: 117 NLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRA----YSYGGTITDNMLCA 172
Query: 628 GGEEGN-DACQGDGGGPLVCQDDGFYELAGLVSWGFGCGRQDVPGVYVKVSSF 679
GG EG DACQGD GGPLVC D+G L G+VSWG GC R + PGVY +VSS+
Sbjct: 173 GGLEGGKDACQGDSGGPLVCNDNGRGVLVGIVSWGSGCARPNYPGVYTRVSSY 225
>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease. Many of these
are synthesised as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. A few, however, are active as single chain
molecules, and others are inactive due to substitutions
of the catalytic triad residues.
Length = 229
Score = 251 bits (643), Expect = 1e-78
Identities = 106/235 (45%), Positives = 137/235 (58%), Gaps = 12/235 (5%)
Query: 447 RVVGGEDADPAEWCWQVALINSLNQYLCGGALIGTQWVLTAAHCVTNIVRSGDAVYVRVG 506
R+VGG +A+ + WQV+L ++ CGG+LI +WVLTAAHCV S + VR+G
Sbjct: 1 RIVGGSEANIGSFPWQVSLQYGGGRHFCGGSLISPRWVLTAAHCVRGSDPS--NIRVRLG 58
Query: 507 DHDLTRKYGSPGAQTLRVATTYIHHNHNSQTLDNDIALLKLHGQAELKDGVCLVCLPARG 566
HDL+ Q ++V+ IH N+N T DNDIALLKL L D V +CLP+
Sbjct: 59 SHDLSS---GEEGQVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPICLPSSN 115
Query: 567 VNHAAGKRCTVTGYGYMGE-AGPIPLRVREAEIPIVSDTECIRKINAVTEKIFILPASSF 625
N AG CTV+G+G E AG +P ++E +PIVS+ C R + +
Sbjct: 116 YNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCRRA----YSGGGAITDNML 171
Query: 626 CAGGEE-GNDACQGDGGGPLVCQDDGFYELAGLVSWGFGCGRQDVPGVYVKVSSF 679
CAGG E G DACQGD GGPLVC D + L G+VSWG GC R PGVY +VSS+
Sbjct: 172 CAGGLEGGKDACQGDSGGPLVCNDGRWV-LVGIVSWGSGCARPGKPGVYTRVSSY 225
>gnl|CDD|215708 pfam00089, Trypsin, Trypsin.
Length = 218
Score = 187 bits (477), Expect = 6e-55
Identities = 98/233 (42%), Positives = 126/233 (54%), Gaps = 18/233 (7%)
Query: 448 VVGGEDADPAEWCWQVALINSLNQYLCGGALIGTQWVLTAAHCVTNIVRSGDAVYVRVGD 507
+VGG++A P + WQV+L S ++ CGG+LI WVLTAAHCV+N +V V +G
Sbjct: 1 IVGGDEAQPGSFPWQVSLQVSSGKHFCGGSLISENWVLTAAHCVSN----AKSVRVVLGA 56
Query: 508 HDLTRKYGSPGAQTLRVATTYIHHNHNSQTLDNDIALLKLHGQAELKDGVCLVCLPARGV 567
H++ + G G Q V +H N+N T NDIALLKL L D V +CLP
Sbjct: 57 HNIVLREG--GEQKFDVKKVIVHPNYNPDTD-NDIALLKLKSPVTLGDTVRPICLPTASS 113
Query: 568 NHAAGKRCTVTGYGYMGEAGPIPLRVREAEIPIVSDTECIRKINAVTEKIFILPASSFCA 627
+ G CTV+G+G G P ++E +P+VS C I CA
Sbjct: 114 DLPVGTTCTVSGWGNTKTLGL-PDTLQEVTVPVVSRETCRSAYGGTVTDNMI------CA 166
Query: 628 GGEEGNDACQGDGGGPLVCQDDGFYELAGLVSWGFGCGRQDVPGVYVKVSSFN 680
G G DACQGD GGPLVC D EL G+VSWG+GC + PGVY VSS+
Sbjct: 167 GA-GGKDACQGDSGGPLVCSDG---ELIGIVSWGYGCASGNYPGVYTPVSSYL 215
>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
[Posttranslational modification, protein turnover,
chaperones].
Length = 413
Score = 111 bits (278), Expect = 4e-26
Identities = 77/265 (29%), Positives = 115/265 (43%), Gaps = 38/265 (14%)
Query: 439 GTARERGARVVGGEDADPAEWCWQVALINSLNQY----LCGGALIGTQWVLTAAHCVT-N 493
TA E +R++GG +A+ E+ VAL++ ++ Y CGG+ +G ++VLTAAHC +
Sbjct: 24 QTADEVSSRIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCADAS 83
Query: 494 IVRSGDAVYVRVGDHDLTRKYGSPGAQTLRVATTYIHHNHNSQTLDNDIALLKLHGQAEL 553
S D V V +D ++ A+ V T Y+H ++ L NDIA+L+L A L
Sbjct: 84 SPISSDVNRVVVDLNDSSQ------AERGHVRTIYVHEFYSPGNLGNDIAVLELARAASL 137
Query: 554 KDGVCLVCLPARGVNHAA---GKRCTVTGYGYMGEAG-------------PIPLRVREAE 597
P + +VT M P + E
Sbjct: 138 ---------PRVKITSFDASDTFLNSVTTVSPMTNGTFGVTTPSDVPRSSPKGTILHEVA 188
Query: 598 IPIVSDTECIRKINAVTEKIFILPASSFCAGGEEGNDACQGDGGGPLVCQDDGFYELAGL 657
+ V + C + + FCAG DACQGD GGP+ + + G+
Sbjct: 189 VLFVPLSTCAQYKGCANASDGATGLTGFCAG-RPPKDACQGDSGGPIFHKGEEGRVQRGV 247
Query: 658 VSWGFG-CGRQDVPGVYVKVSSFNG 681
VSWG G CG +PGVY VS++
Sbjct: 248 VSWGDGGCGGTLIPGVYTNVSNYQD 272
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 44.2 bits (104), Expect = 3e-04
Identities = 51/263 (19%), Positives = 69/263 (26%), Gaps = 45/263 (17%)
Query: 28 PPNLVIMERNGTKPNSGSQSSSTTPYVSSTSAV--------TQTTTSRPQQPAGSKPCHG 79
P LV P + Q+S P + AV +RP AG
Sbjct: 2712 PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAP 2771
Query: 80 LLDTITSTAESKDCPGTCVHAIATLICAEVLDDVQCPSPSMRCCLESPANGTTPAAGGST 139
A+A+L + PSP PA+ +
Sbjct: 2772 PAAPAAGPP-----RRLTRPAVASLSESRE----SLPSPW------DPADPPAAVLAPAA 2816
Query: 140 TSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGASNKIDTGGNNNCLGVTTTNKP 199
+SP+ PP P P P G S + GG+ +P
Sbjct: 2817 ALPP-AASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS--VAPGGD-------VRRRP 2866
Query: 200 PSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQTSTCSRGSMCCDNTRLPPPTKPKPRPT 259
PS P+ P P RP V SR + P P+P+
Sbjct: 2867 PSRSPAAKPAAP------------ARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAP 2914
Query: 260 QPPTTTTVATTTAVPDDRPECPG 282
PP P P P
Sbjct: 2915 PPPQPQPQPPPPPQPQPPPPPPP 2937
Score = 36.1 bits (83), Expect = 0.098
Identities = 32/192 (16%), Positives = 51/192 (26%), Gaps = 19/192 (9%)
Query: 111 DDVQCPSPSMRCCLESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTS---KPYKKPQ 167
P P + + A G A + + P ++ + P P
Sbjct: 252 IAAPAPPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPA 311
Query: 168 NPEKRPQPPGASNKIDTGGNNNCLGVTTTNKPPSSQPSNLPK------CPGFCLLNIMAA 221
P+ P P + + + V+ +P P PK P L ++ A
Sbjct: 312 PPDPPPPAPAGDAEEEDDEDGAMEVVSPLPRPRQHYPLGFPKRRRPTWTPPSSLEDLSAG 371
Query: 222 FCERPSVLIPQT---------STCSRGSMCCDNTRLPPPT-KPKPRPTQPPTTTTVATTT 271
+P + +RG D TR P P P P +
Sbjct: 372 RHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPP 431
Query: 272 AVPDDRPECPGS 283
A P E
Sbjct: 432 ATPLPSAEPGSD 443
Score = 34.5 bits (79), Expect = 0.24
Identities = 36/246 (14%), Positives = 59/246 (23%), Gaps = 29/246 (11%)
Query: 41 PNSGSQSSSTTPYVSSTSAVTQTTTSRPQQPAGSKPCHGLLDTITSTAESKDCPGTCVHA 100
P +G T P V+S S ++ S + A P +
Sbjct: 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAS----PAGPLPP 2830
Query: 101 IATLICAEVLDDVQCPSPSMRCCLESPANGTTPAAGGSTTSNAGTSSPSTTTP------- 153
+ P A G S P
Sbjct: 2831 PTSA--------QPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPP 2882
Query: 154 ------PTLYTTSKPYKKPQNPEKRPQPPGASNKIDTGGNNNCLGVTTTNKPPSSQPSNL 207
P + +++ + P + +RP P A +PP P
Sbjct: 2883 VRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQ---PPPPPQPQPPPPPPP-R 2938
Query: 208 PKCPGFCLLNIMAAFCERPSVLIPQTSTCSRGSMCCDNTRLPPPTKPKPRPTQPPTTTTV 267
P+ P + A +V P G + R+P P + P T
Sbjct: 2939 PQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTG 2998
Query: 268 ATTTAV 273
+ + V
Sbjct: 2999 HSLSRV 3004
Score = 33.8 bits (77), Expect = 0.42
Identities = 26/170 (15%), Positives = 43/170 (25%), Gaps = 20/170 (11%)
Query: 116 PSPSMRCCLESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQP 175
P P + + P +P + P + +P +RP+
Sbjct: 2626 PPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRR 2685
Query: 176 PGASNKIDTGGNNNCLGVTTTNKPPSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQTST 235
A + + +T+ PP P+ P A P P +
Sbjct: 2686 RAARPTVGS--------LTSLADPPPPPPTPEPAPH--------ALVSATPLPPGPAAAR 2729
Query: 236 CSRGSMCCDNTRLPPPTKP----KPRPTQPPTTTTVATTTAVPDDRPECP 281
+ ++ P P P P TT A P P
Sbjct: 2730 QASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
Score = 32.2 bits (73), Expect = 1.3
Identities = 42/245 (17%), Positives = 55/245 (22%), Gaps = 32/245 (13%)
Query: 35 ERNGTKPNSGSQSSSTTPYVSSTSAVTQTTTSRPQQPAGSKPCHGLLDTITSTAESKD-- 92
ER P G S +A + RP++ A L
Sbjct: 2650 ERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPE 2709
Query: 93 -CPGTCVHAIATLICAEVLDDVQCPSPSMRCCLESPANGTTPAAGGSTTSNAGTSSPSTT 151
P V A P+ PA TP T+ P
Sbjct: 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAP 2769
Query: 152 TPPTLYTTSKPYKKPQNP-----EKRPQPPGASNKIDTGG---------NNNCLGVTTTN 197
PP P + + E R P + D
Sbjct: 2770 APPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLP 2829
Query: 198 KPPSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQTSTCSRGSMCCDNTRLPPPTKPKPR 257
P S+QP+ P PG P + D R PP P +
Sbjct: 2830 PPTSAQPTAPPPPPGP---------------PPPSLPLGGSVAPGGDVRRRPPSRSPAAK 2874
Query: 258 PTQPP 262
P P
Sbjct: 2875 PAAPA 2879
Score = 31.8 bits (72), Expect = 1.7
Identities = 25/164 (15%), Positives = 40/164 (24%), Gaps = 27/164 (16%)
Query: 125 ESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQ-------NPEKRPQPPG 177
++P P A + +P + PP + P P PP
Sbjct: 2590 DAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPP 2649
Query: 178 ASNKIDTGGNNNCLGVTTTNKPPSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQTSTCS 237
+ D ++Q S+ P+ P R P + +
Sbjct: 2650 ERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRP-------------RRRAARPTVGSLT 2696
Query: 238 RGSMCCDNTRLPPPTKPKPRPTQPPTTTTVATTTAVPDDRPECP 281
+ PPP P P P + R P
Sbjct: 2697 -------SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASP 2733
Score = 30.3 bits (68), Expect = 5.3
Identities = 29/177 (16%), Positives = 45/177 (25%), Gaps = 17/177 (9%)
Query: 112 DVQCPSPSMRCCLESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEK 171
P+P R A AA S+ + T +L + + P P PE
Sbjct: 2652 PRDDPAPG-RVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEP 2710
Query: 172 RPQPPGASNKIDTGGNNNCLGVTTTNKPPSSQPS-NLPKCPGFCLLNIMAAFCERPSVLI 230
P ++ + G P+ P PG A P+
Sbjct: 2711 APHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPG-----GPARPARPPTTAG 2765
Query: 231 PQTSTCSRGSMCCDNTRLPPPTKPKPRPTQPPTTTTVATTTAVPDDRPECPGSCIVS 287
P P P R T+P + + ++P V
Sbjct: 2766 PPAPAPPAA----------PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVL 2812
>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
function prediction only].
Length = 872
Score = 38.7 bits (90), Expect = 0.011
Identities = 17/64 (26%), Positives = 28/64 (43%)
Query: 42 NSGSQSSSTTPYVSSTSAVTQTTTSRPQQPAGSKPCHGLLDTITSTAESKDCPGTCVHAI 101
N+ +T+ ++ T TQTTTS P + P T T+T+ S+ G +
Sbjct: 794 NTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTTGGGICGP 853
Query: 102 ATLI 105
+I
Sbjct: 854 IVII 857
Score = 32.5 bits (74), Expect = 0.99
Identities = 30/149 (20%), Positives = 52/149 (34%), Gaps = 18/149 (12%)
Query: 132 TPAAGGSTTSNAGTSSPSTTTP---PTLYTTSKPYKKPQNPEKRPQPPGA----SNKIDT 184
T + +TT ++ +P T T+Y + + + +N P +
Sbjct: 702 TNSLYKATTLSSEAKNPDTVKIGQALTVYGSLEVFPAGENWGFIPTTKRVKVRIMDPASG 761
Query: 185 GGNNNCLGVTTTNKPPSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQTSTCSRGSMCCD 244
G + T T + P S P+ + + + E SV+I +T T
Sbjct: 762 TGTSITTSGTFTAEVPQS-PTKTETTLSYSAYSNTSILIETTSVVITKTVT--------- 811
Query: 245 NTRLPPPTKPKPRPTQPPTTTTVATTTAV 273
+ P P T PT T+ +TTT
Sbjct: 812 -QTQTTTSSPSPTQTTSPTQTSTSTTTTT 839
>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
Length = 327
Score = 37.0 bits (86), Expect = 0.032
Identities = 27/164 (16%), Positives = 44/164 (26%), Gaps = 28/164 (17%)
Query: 126 SPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGASNKIDTG 185
SP+ + G + T+ P + P + T P PQ P +I+
Sbjct: 94 SPSAANNTSDGHDASGVKNTAPPQDISAPPISPT------PTQAAP-PQTPNGQQRIELP 146
Query: 186 GN-NNCL-----GVTTTNKPPSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQTSTCSRG 239
GN ++ L V ++ S LP P+ + P
Sbjct: 147 GNISDALSQQQGQVNAASQNAQGNTSTLPT---------------APATVAPSKGAKVPA 191
Query: 240 SMCCDNTRLPPPTKPKPRPTQPPTTTTVATTTAVPDDRPECPGS 283
+ T P KP T T + +
Sbjct: 192 TAETHPTPPQKPATKKPAVNHHKTATVAVPPATSGKPKSGAASA 235
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 36.3 bits (83), Expect = 0.071
Identities = 51/290 (17%), Positives = 84/290 (28%), Gaps = 30/290 (10%)
Query: 18 VSKDAYSGTLPPNLVIMERNGTKPNSGSQSSSTTPYVSSTSAVTQTTTSRPQQPAG-SKP 76
+ A P + N T P + +S T S T V TT+ P G +
Sbjct: 491 RTTSATPNATSPTPAVTTPNATSPTTQK-TSDTPNATSPTPIVIGVTTTATSPPTGTTSV 549
Query: 77 CHGLLDTITSTAESKDCPGTCVHAIATLICAEVLDDVQCPSPSMRCCLESPANGTTPAAG 136
+ +T + + V + +++ + V S + +
Sbjct: 550 PNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPR 609
Query: 137 GSTTS--------------NAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGASNKI 182
++TS N +PS + T +T P P + P +S
Sbjct: 610 SNSTSTTPLLTSAHPTGGENITEETPSVPS-TTHVSTLSPGPGPGTTSQVSGPGNSSTSR 668
Query: 183 DTGGNNNCLGV----TTTNKPPSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQTS--TC 236
G + G+ T+ PS Q + +P S L+ TS T
Sbjct: 669 YPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANSTTKETSGSTLMASTSPHTN 728
Query: 237 SRGSMCCDNTRLPPPT-------KPKPRPTQPPTTTTVATTTAVPDDRPE 279
+P+ T PP TT AT P P+
Sbjct: 729 EGAFRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTTKQATVPVPPTQHPD 778
Score = 31.3 bits (70), Expect = 2.6
Identities = 30/153 (19%), Positives = 51/153 (33%), Gaps = 14/153 (9%)
Query: 24 SGTLPPNLVIMERNGTKPNSGS---QSSSTTPYVSSTSAVTQTTTSRPQQPAGSKPCHGL 80
+ +LPP + P SG+ +SST P +S ++ T + T P +
Sbjct: 452 TPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTT--- 508
Query: 81 LDTITSTAESKDCPGTCVHAIATLICAEVLDDVQCPSPSMRCCLESPANGTTPAAGGSTT 140
TS K + ++ V + S S N T+P +
Sbjct: 509 -PNATSPTTQKTSDTPNATSPTPIVIG-----VTTTATSPPTGTTSVPNATSPQV-TEES 561
Query: 141 SNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRP 173
T++P T+ P++ TS P
Sbjct: 562 PVNNTNTPVVTSAPSV-LTSAVTTGQHGTGSSP 593
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 34.1 bits (78), Expect = 0.30
Identities = 25/166 (15%), Positives = 45/166 (27%), Gaps = 3/166 (1%)
Query: 116 PSPSMRCCLESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQP 175
SP+ P G +S G+ + P ++SK ++P+ +K P+
Sbjct: 459 VSPAESVSSNPPIKQPMEKEGKVKSS--GSQYHPESKEPPPKSSSKEKRRPRTAQKGPES 516
Query: 176 PGASNKIDTGGNNNCLGVTTTNKPPSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQTST 235
K T K P +P L + P QT
Sbjct: 517 GRGKQKSPAQSEAPPQRRTVGKKQP-KKPEKASAGDERTGLRPESEPGTLPYGSSVQTPP 575
Query: 236 CSRGSMCCDNTRLPPPTKPKPRPTQPPTTTTVATTTAVPDDRPECP 281
+ + + P +PK + + + E
Sbjct: 576 DRPKAATKGSRKPSPRKEPKSSVPPAAEKRKYKSPSKIVPKSREFI 621
>gnl|CDD|144411 pfam00802, Glycoprotein_G, Pneumovirus attachment glycoprotein G.
This family includes attachment proteins from
respiratory synctial virus. Glycoprotein G has not been
shown to have any neuraminidase or hemagglutinin
activity. The amino terminus is thought to be
cytoplasmic, and the carboxyl terminus extracellular.
The extracellular region contains four completely
conserved cysteine residues.
Length = 263
Score = 33.5 bits (76), Expect = 0.32
Identities = 34/155 (21%), Positives = 44/155 (28%), Gaps = 38/155 (24%)
Query: 125 ESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGASNKIDT 184
+ T+P+ +TT T TTP T T + P
Sbjct: 84 QHNQLSTSPSNQSTTTPLIHTIL-DDTTPGTKSTYQHTTVGTKGRTTTP----------- 131
Query: 185 GGNNNCLGVTTTNKPP-SSQPSNLPKCPG----FCLLNIMAAFCERPSVLIPQTSTCSRG 239
TNKPP + SN P+ P F + N + S C
Sbjct: 132 ---------AQTNKPPTKPRQSNPPEKPQDDFHFEVFNFVPC------------SICENN 170
Query: 240 SMCCDNTRLPPPTKPKPRPTQPPTTTTVATTTAVP 274
C + P KP PT PT TT
Sbjct: 171 PACLSICKRIPEKKPGKAPTTKPTKKPKPKTTKKD 205
Score = 29.7 bits (66), Expect = 4.8
Identities = 35/162 (21%), Positives = 47/162 (29%), Gaps = 27/162 (16%)
Query: 117 SPSMRCCLESPANGTTPAAGGST--TSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQ 174
+P + L+ GT +T T T+ T PPT S P +KPQ+
Sbjct: 99 TPLIHTILDDTTPGTKSTYQHTTVGTKGRTTTPAQTNKPPTKPRQSNPPEKPQDDFHFEV 158
Query: 175 PPGASNKIDTGGNNNCLGV--TTTNKPPSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQ 232
I N CL + K P P+ P P+
Sbjct: 159 FNFVPCSI-CENNPACLSICKRIPEKKPGKAPTTKPTKK-------------------PK 198
Query: 233 TSTCSRGSMCCDNTRLPPPTKPKPRPTQPPTTTTVATTTAVP 274
T + + T PT PT T T T P
Sbjct: 199 PKTTKKDTKTQTTKSKEATTH---HPTSEPTKLTTKTNTTTP 237
>gnl|CDD|185588 PTZ00385, PTZ00385, lysyl-tRNA synthetase; Provisional.
Length = 659
Score = 33.5 bits (76), Expect = 0.43
Identities = 12/81 (14%), Positives = 24/81 (29%), Gaps = 17/81 (20%)
Query: 225 RPSVLIPQTSTCSRGSMCCDNTRLPPPTKPK-PRPTQPPTTTTVATTTAVPDDRPECPGS 283
P + + Q ++ S P + K P T T + P + + P
Sbjct: 29 LPGLQLRQVASLSSS---------RSPLELKKPISKASATKTVTQEASRAPRSKLDLPA- 78
Query: 284 CIVSYLSFTCFRNAEMTEIFK 304
+++ FR +
Sbjct: 79 ------AYSSFRGITPISEVR 93
>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
Length = 421
Score = 33.5 bits (76), Expect = 0.45
Identities = 28/133 (21%), Positives = 46/133 (34%), Gaps = 14/133 (10%)
Query: 30 NLVIMERNGTKPNSGSQSSSTTPYVSSTSAVTQTTTSRPQQPAGSKPCHGLLDTITSTAE 89
L ++ +GT P + + +SAVTQ++ P A P I S+
Sbjct: 289 GLKQIDTHGTVPVAAV---------TPSSAVTQSSAITPSSAAIPSP-----AVIPSSVT 334
Query: 90 SKDCPGTCVHAIATLICAEVLDDVQCPSPSMRCCLESPANGTTPAAGGSTTSNAGTSSPS 149
++ T A+A + DV P E P + T ++ + S
Sbjct: 335 TQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITS 394
Query: 150 TTTPPTLYTTSKP 162
T PT + P
Sbjct: 395 TANGPTTSLPAAP 407
Score = 30.5 bits (68), Expect = 3.4
Identities = 21/92 (22%), Positives = 31/92 (33%), Gaps = 7/92 (7%)
Query: 126 SPANGTTPAAG--GSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGASNKID 183
SPA + +TT + + S P+ T P QP S
Sbjct: 325 SPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTTET 384
Query: 184 TGGNNNCLGVTTTNKPPSSQP----SNLPKCP 211
+ + +T N P +S P SN+P P
Sbjct: 385 QQSSTGNI-TSTANGPTTSLPAAPASNIPVSP 415
>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein. This
family consists of REP proteins from Dictyostelium
(Slime molds). REP protein is likely involved in
transcription regulation and control of DNA replication,
specifically amplification of plasmid at low copy
numbers. The formation of homomultimers may be required
for their regulatory activity.
Length = 910
Score = 33.6 bits (77), Expect = 0.47
Identities = 25/138 (18%), Positives = 34/138 (24%), Gaps = 45/138 (32%)
Query: 124 LESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTS----------------------- 160
LE + + S S PS T TT+
Sbjct: 224 LEVNESDIEQISINSENIQRINSQPSKRPNNTTTTTTTTTTTTFQPRTRKRKSIDDHKLS 283
Query: 161 ---KPYKKPQNPEKRPQPPGASNKI---DTGG-------NNNCL----GVTTTNKPPS-- 201
P K N + P + +G N +C GV + P
Sbjct: 284 LNQAPEKFKNNTKPDDDPQSDFSDKGSRKSGSLKDVRIDNISCSVSHNGVNISISPTETI 343
Query: 202 SQPSNLPKCPGFCLLNIM 219
SN K CLL +
Sbjct: 344 ISVSNKIKL---CLLEVS 358
>gnl|CDD|205161 pfam12956, DUF3845, Domain of Unknown Function with PDB structure.
Member PDB:3GF6 has statistically significant similarity
to TNF-like jelly roll fold may indicate an
immunomodulatory function or a bioadhesion role.
Length = 244
Score = 32.3 bits (73), Expect = 0.67
Identities = 25/73 (34%), Positives = 35/73 (47%), Gaps = 9/73 (12%)
Query: 490 CVTNIVRSGDAVYVRVGDHDLTRKYGSP----GAQTLRVATTYIHHNHNSQTL---DNDI 542
C+T VR DA+Y VG+ D+ R G P G + + VA +Y ++ TL D
Sbjct: 14 CLTGCVRDNDAIYYPVGNVDIER--GGPALEVGEEDVLVARSYNEEDYVLDTLAQYPGDP 71
Query: 543 ALLKLHGQAELKD 555
L KL LK+
Sbjct: 72 TLGKLTFMINLKN 84
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 32.0 bits (72), Expect = 1.0
Identities = 23/126 (18%), Positives = 40/126 (31%), Gaps = 33/126 (26%)
Query: 158 TTSKPYKKPQNPEKRPQPPGASNKIDTGGNNNCLGVTTTNKPPSS--QPSNLPKCPGFCL 215
T++ Q + + +N+ T + + ++P + N+ P
Sbjct: 61 ATTQAANTRQERTPKLEKAPNTNEEKTSASK----IEKISQPKQEEQKSLNISATPA--- 113
Query: 216 LNIMAAFCERPSVLIPQTSTCSRGSMCCDNTRLPPPTKPKPRPTQPPTTTTVATTTAVPD 275
P QT+T S T PK + T PP+T T +
Sbjct: 114 ----------PKQEQSQTTTES--------------TTPKTKVTTPPSTNTPQPMQSTKS 149
Query: 276 DRPECP 281
D P+ P
Sbjct: 150 DTPQSP 155
>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
Length = 291
Score = 31.9 bits (72), Expect = 1.2
Identities = 31/102 (30%), Positives = 45/102 (44%), Gaps = 24/102 (23%)
Query: 79 GLLDT--ITSTAESKDCPGTCVHAIATLICAEVLDDVQCPSPSMRCCLESP---ANGTTP 133
GLL T IT+T ++ A AT + ++ Q PS + +E+P N TTP
Sbjct: 14 GLLTTGVITTTTQA---------ANATTPSSTKVEAPQSTPPSTK--VEAPQSKPNATTP 62
Query: 134 AAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQP 175
ST A +P+ TTP S + PQ+P + P
Sbjct: 63 P---STKVEAPQQTPNATTPS-----STKVETPQSPTTKQVP 96
>gnl|CDD|168775 PRK07034, PRK07034, hypothetical protein; Provisional.
Length = 536
Score = 32.1 bits (72), Expect = 1.3
Identities = 24/99 (24%), Positives = 32/99 (32%), Gaps = 16/99 (16%)
Query: 130 GTTPAAGGSTTSNA-------------GTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPP 176
G P +G S+TS A G + T+ PP +P P P
Sbjct: 42 GIVPRSGSSSTSQAENYKEGLADQAASGNNMARTSAPPVTLYQQQPNANDSYPNGNNNNP 101
Query: 177 GASNKIDTGGNNNCLGVTTTNKPPSSQPSNLPKCPGFCL 215
N G NNN + + P L + PG L
Sbjct: 102 NGDNNNPNGSNNN---IARVQRMPHGISRGLYERPGMLL 137
>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c. This family of
proteins is found in eukaryotes. Proteins in this family
are typically between 263 and 318 amino acids in length.
There is a conserved SFT sequence motif. MSA-2 is a
plasma membrane glycoprotein which can be found in
Babesia bovis species.
Length = 201
Score = 30.9 bits (70), Expect = 1.7
Identities = 14/51 (27%), Positives = 30/51 (58%), Gaps = 5/51 (9%)
Query: 131 TTPAAGGSTTS--NAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGAS 179
++ G++T+ + G+++P+T+ PPT P +P+ P + P+ G+S
Sbjct: 145 KHSSSQGTSTTRPSDGSATPNTSAPPT---PGNPAAQPEKPAETPKGNGSS 192
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 30.8 bits (69), Expect = 2.3
Identities = 19/46 (41%), Positives = 23/46 (50%), Gaps = 1/46 (2%)
Query: 125 ESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPE 170
E+ A GTT AG +TTS G P+T TT P+ K PE
Sbjct: 191 EAEAEGTT-VAGPTTTSPNGGFQPTTPPQEVYGTTDPPFGKVTTPE 235
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 30.5 bits (69), Expect = 4.1
Identities = 12/56 (21%), Positives = 16/56 (28%), Gaps = 2/56 (3%)
Query: 226 PSVLIPQTSTCSRGSMCCDNTRLPPPTKPKPRPTQPPTTTTVATTTAVPDDRPECP 281
PS + P + +R PP +P PP VP P
Sbjct: 376 PSPVRPTPAPSTRPKAAAAAN--IPPKEPVRETATPPPVPPRPVAPPVPHTPESAP 429
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 30.0 bits (67), Expect = 4.7
Identities = 35/176 (19%), Positives = 52/176 (29%), Gaps = 19/176 (10%)
Query: 40 KPNSGSQSSSTTPY---VSSTSAVTQTTTSRPQQPAGSKPCHGLLDTITSTAESKDCPGT 96
+P + +S TT + VS + +Q T G + + S+ E GT
Sbjct: 167 EPKTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATA--NQRLSSTEPVGTQGT 224
Query: 97 CVHAIATLICAEVLDDVQCPSPSMRCCLESP-ANGTTPAAGGSTTSNAGTSSPSTTTPPT 155
+ + P PS R SP +T + STT + + TPP
Sbjct: 225 TTSSNP--------EPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPP- 275
Query: 156 LYTTSKPYKKPQNPEKRPQPPGASNKIDTGGNNNCLGVTTTNKPPSSQPSNLPKCP 211
TS P T + PP S P + P
Sbjct: 276 --ATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGS--SPPHSSPPGVQANP 327
>gnl|CDD|217541 pfam03409, Glycoprotein, Transmembrane glycoprotein. This family
of proteins has some GO annotations for positive
regulation of growth rate and nematode larval
development. This is probably a family of membrane
glycoproteins.
Length = 353
Score = 30.0 bits (68), Expect = 5.2
Identities = 8/24 (33%), Positives = 11/24 (45%)
Query: 248 LPPPTKPKPRPTQPPTTTTVATTT 271
LP T T +T+ + TTT
Sbjct: 310 LPTTTSAPGSTTTTTSTSRIETTT 333
>gnl|CDD|227676 COG5386, COG5386, Cell surface protein [Cell envelope biogenesis,
outer membrane].
Length = 352
Score = 29.9 bits (67), Expect = 5.7
Identities = 16/85 (18%), Positives = 30/85 (35%), Gaps = 11/85 (12%)
Query: 129 NGTTPAAGG--STTSNAGTSSPSTTTPPTLYTTSKPYK-KPQNPEKRPQPPGASNKIDTG 185
+ A + + + P TTTP T + P K K +NP+ G +
Sbjct: 178 DTAIDQAANAATPNDDKNVAQPGTTTPKTPTPQTTPNKPKVENPK-----TGLTTTSYLI 232
Query: 186 GNN---NCLGVTTTNKPPSSQPSNL 207
+ T+T + +P+ +
Sbjct: 233 DFHVVKAGTNETSTMFTYTQKPAIV 257
>gnl|CDD|226099 COG3569, COG3569, Topoisomerase IB [DNA replication, recombination,
and repair].
Length = 354
Score = 29.8 bits (67), Expect = 5.9
Identities = 10/25 (40%), Positives = 12/25 (48%)
Query: 499 DAVYVRVGDHDLTRKYGSPGAQTLR 523
D +RVG+ R GS G TL
Sbjct: 145 DGARIRVGNFAYARDNGSYGLTTLL 169
>gnl|CDD|220189 pfam09342, DUF1986, Domain of unknown function (DUF1986). This
domain is found in serine proteases and is predicted to
contain disulphide bonds.
Length = 267
Score = 29.2 bits (65), Expect = 6.4
Identities = 19/70 (27%), Positives = 28/70 (40%), Gaps = 1/70 (1%)
Query: 455 DPAEWCWQVALINSLNQYLCGGALIGTQWVLTAAHCVTNIVRSGDAVYVRVGDHDLTRKY 514
+ W W +A + Y C G LI WVL + C+ + + V +G H +
Sbjct: 12 EELHWPW-IAKVYVEGNYRCTGVLIDLSWVLVSHSCLWDTSLEHSYISVVLGGHKTLKSV 70
Query: 515 GSPGAQTLRV 524
P Q RV
Sbjct: 71 KGPYEQIYRV 80
>gnl|CDD|183346 PRK11865, PRK11865, pyruvate ferredoxin oxidoreductase subunit
beta; Provisional.
Length = 299
Score = 29.3 bits (66), Expect = 6.9
Identities = 19/56 (33%), Positives = 26/56 (46%), Gaps = 13/56 (23%)
Query: 22 AYSGTLPPN---LVIMERNGTKPNSGSQSSSTTPYVSSTSAVTQTTTSRPQQPAGS 74
+ SG + L +M N N+G Q S +TP+ +S TTTS PAG
Sbjct: 110 SLSGAMERGHNILYLMYDNEAYMNTGIQRSGSTPFGAS------TTTS----PAGK 155
>gnl|CDD|184256 PRK13700, PRK13700, conjugal transfer protein TraD; Provisional.
Length = 732
Score = 29.5 bits (66), Expect = 7.0
Identities = 15/71 (21%), Positives = 25/71 (35%), Gaps = 2/71 (2%)
Query: 143 AGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGASN-KIDTGGNNNCLGVTTTNKPPS 201
A + P + ++PQ P ++P P ++ K D G N G+ K
Sbjct: 611 ASGEDVTQAEQPQQPQQPQQPQQPQQP-QQPVSPVINDKKSDAGVNVPAGGIEQELKMKP 669
Query: 202 SQPSNLPKCPG 212
+ PG
Sbjct: 670 EEEMEQQLPPG 680
>gnl|CDD|222915 PHA02670, PHA02670, ORF112 putative chemokine-binding protein;
Provisional.
Length = 287
Score = 29.2 bits (65), Expect = 8.2
Identities = 12/39 (30%), Positives = 18/39 (46%), Gaps = 2/39 (5%)
Query: 243 CDNTRLPPPTKPKPRPTQPPTTTTVATTTAVPDDRPECP 281
CD+ + P +P P+P P+TT T D + P
Sbjct: 165 CDHQKFPE--EPTPKPEPTPSTTPGPTLLETTLDLTDEP 201
>gnl|CDD|239471 cd03376, TPP_PFOR_porB_like, Thiamine pyrophosphate (TPP family),
PFOR porB-like subfamily, TPP-binding module; composed
of proteins similar to the beta subunit (porB) of the
Helicobacter pylori four-subunit pyruvate ferredoxin
oxidoreductase (PFOR), which are also found in archaea
and some hyperthermophilic bacteria. PFOR catalyzes the
oxidative decarboxylation of pyruvate to form
acetyl-CoA, a crucial step in many metabolic pathways.
Archaea, anaerobic bacteria and eukaryotes that lack
mitochondria (and therefore pyruvate dehydrogenase) use
PFOR to oxidatively decarboxylate pyruvate, with
ferredoxin or flavodoxin as the electron acceptor. The
36-kDa porB subunit contains the binding sites for the
cofactors, TPP and a divalent metal cation, which are
required for activity.
Length = 235
Score = 28.7 bits (65), Expect = 8.4
Identities = 16/46 (34%), Positives = 19/46 (41%), Gaps = 10/46 (21%)
Query: 31 LVIMERNGTKPNSGSQSSSTTPYVSSTSAVTQTTTSRPQQPAGSKP 76
L I N N+G Q S +TPY + T TTT P G
Sbjct: 111 LYICYDNEAYMNTGIQRSGSTPYGAWT-----TTT-----PVGKVS 146
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 29.3 bits (66), Expect = 9.5
Identities = 16/56 (28%), Positives = 20/56 (35%), Gaps = 1/56 (1%)
Query: 124 LESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGAS 179
LE A AA G+ T A + P PP + P P PP A+
Sbjct: 377 LERGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAAR-PAPAPAPPAAA 431
>gnl|CDD|218116 pfam04503, SSDP, Single-stranded DNA binding protein, SSDP. This
is a family of eukaryotic single-stranded DNA binding
proteins with specificity to a pyrimidine-rich element
found in the promoter region of the alpha2(I) collagen
gene.
Length = 293
Score = 28.9 bits (64), Expect = 9.6
Identities = 21/91 (23%), Positives = 30/91 (32%), Gaps = 8/91 (8%)
Query: 125 ESPANGTTPAAGG--STTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGASNKI 182
SP T P GG T + + ST + +YT P Q P PG+ +
Sbjct: 182 SSPGEYTGPPGGGGPPGTPIMPSPADSTNSSDNMYTVMNPVPPGQTRSNFPMGPGSDGPM 241
Query: 183 DTGGN------NNCLGVTTTNKPPSSQPSNL 207
N LG + + P+N
Sbjct: 242 GGMSGMEPHHMNGSLGSGDMDSLSKNSPNNQ 272
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.133 0.422
Gapped
Lambda K H
0.267 0.0738 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 36,052,705
Number of extensions: 3363583
Number of successful extensions: 3869
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3638
Number of HSP's successfully gapped: 89
Length of query: 741
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 637
Effective length of database: 6,324,786
Effective search space: 4028888682
Effective search space used: 4028888682
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 63 (28.0 bits)