Query psy7563
Match_columns 274
No_of_seqs 180 out of 1778
Neff 8.7
Searched_HMMs 46136
Date Fri Aug 16 21:37:19 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7563.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7563hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 8.3E-39 1.8E-43 309.0 23.1 217 1-251 237-459 (2531)
2 KOG4289|consensus 100.0 1.8E-37 3.9E-42 299.8 27.1 265 1-270 342-691 (2531)
3 KOG1219|consensus 100.0 1.3E-35 2.7E-40 294.9 27.5 253 1-262 2441-2782(4289)
4 KOG1219|consensus 100.0 7.6E-35 1.6E-39 289.4 28.1 244 1-251 713-1144(4289)
5 cd00031 CA Cadherin repeat dom 100.0 1.7E-29 3.6E-34 210.4 26.8 180 35-247 1-184 (199)
6 cd00031 CA Cadherin repeat dom 99.8 5.1E-18 1.1E-22 141.1 18.8 124 1-153 70-199 (199)
7 PF00028 Cadherin: Cadherin do 99.7 1.2E-16 2.6E-21 117.1 14.1 88 36-152 1-93 (93)
8 KOG1834|consensus 99.6 1.3E-14 2.8E-19 134.1 18.0 191 25-251 27-233 (952)
9 smart00112 CA Cadherin repeats 99.6 9.9E-15 2.2E-19 103.6 11.1 71 61-159 8-79 (79)
10 PF00028 Cadherin: Cadherin do 99.6 6.2E-14 1.3E-18 102.6 11.8 82 166-251 1-82 (93)
11 KOG1834|consensus 98.9 1.1E-08 2.4E-13 95.4 12.6 122 1-155 111-246 (952)
12 smart00112 CA Cadherin repeats 98.6 1.7E-07 3.6E-12 66.1 7.9 59 186-248 1-59 (79)
13 PF08266 Cadherin_2: Cadherin- 97.4 0.00024 5.1E-09 50.7 4.0 60 36-125 3-67 (84)
14 PF08758 Cadherin_pro: Cadheri 97.2 0.0058 1.3E-07 44.2 9.4 86 27-147 2-88 (90)
15 PF08266 Cadherin_2: Cadherin- 96.7 0.0028 6.1E-08 45.2 4.5 63 166-231 3-67 (84)
16 PF08758 Cadherin_pro: Cadheri 96.7 0.013 2.8E-07 42.4 7.6 80 158-249 3-82 (90)
17 smart00736 CADG Dystroglycan-t 96.1 0.17 3.6E-06 37.0 10.9 70 56-156 24-96 (97)
18 PF13750 Big_3_3: Bacterial Ig 96.0 0.6 1.3E-05 37.4 14.9 123 1-153 19-149 (158)
19 TIGR01965 VCBS_repeat VCBS rep 93.5 1.2 2.6E-05 32.7 9.2 94 52-173 2-97 (99)
20 PF13750 Big_3_3: Bacterial Ig 90.1 8.4 0.00018 30.8 13.4 122 125-251 14-140 (158)
21 TIGR00845 caca sodium/calcium 87.6 37 0.00079 35.0 19.0 157 25-205 396-571 (928)
22 PF05345 He_PIG: Putative Ig d 79.0 12 0.00027 23.4 6.0 36 207-247 14-49 (49)
23 KOG3597|consensus 68.3 1.1E+02 0.0023 28.9 13.1 156 24-205 35-197 (442)
24 TIGR01965 VCBS_repeat VCBS rep 66.8 48 0.001 24.3 8.0 89 181-272 2-97 (99)
25 TIGR03660 T1SS_rpt_143 T1SS-14 64.9 65 0.0014 25.1 11.3 56 113-173 69-127 (137)
26 smart00736 CADG Dystroglycan-t 60.5 60 0.0013 23.2 8.1 56 185-249 24-82 (97)
27 KOG3597|consensus 49.9 76 0.0016 29.9 7.4 60 143-203 24-83 (442)
28 PF07495 Y_Y_Y: Y_Y_Y domain; 48.3 53 0.0011 21.3 4.8 22 229-250 34-55 (66)
29 COG2706 3-carboxymuconate cycl 44.5 2.1E+02 0.0046 25.9 9.1 114 146-271 212-328 (346)
30 COG2706 3-carboxymuconate cycl 39.8 2.9E+02 0.0063 25.1 9.6 67 19-88 215-282 (346)
31 PF03160 Calx-beta: Calx-beta 38.0 1.5E+02 0.0031 21.1 9.6 54 148-205 2-55 (100)
32 TIGR03660 T1SS_rpt_143 T1SS-14 34.6 1.1E+02 0.0025 23.7 5.3 39 1-43 89-127 (137)
33 PF05895 DUF859: Siphovirus pr 31.9 5.2E+02 0.011 25.7 10.3 33 1-34 302-334 (624)
34 KOG4221|consensus 29.9 7.4E+02 0.016 26.8 15.7 45 113-160 575-620 (1381)
35 smart00089 PKD Repeats in poly 28.0 1.6E+02 0.0035 19.6 4.7 29 121-151 50-78 (79)
36 cd00146 PKD polycystic kidney 24.3 2.1E+02 0.0045 19.1 4.8 28 121-149 52-79 (81)
37 PF14339 DUF4394: Domain of un 22.4 5E+02 0.011 22.3 10.0 155 63-222 39-221 (236)
38 PF09100 Qn_am_d_aIV: Quinohem 22.2 2.6E+02 0.0056 21.5 5.0 31 128-159 100-132 (133)
39 COG5584 Predicted small secret 20.9 96 0.0021 22.6 2.4 27 247-273 73-100 (103)
No 1
>KOG4289|consensus
Probab=100.00 E-value=8.3e-39 Score=309.00 Aligned_cols=217 Identities=23% Similarity=0.334 Sum_probs=202.2
Q ss_pred CeEEEEECCCCCceEEEEEEEEec--cCCCCeeeCCceEEEEeCCCCCCCeEEEEEEEeCCC---CCeEEEEEecCCCCC
Q psy7563 1 MPSSILSSGIQPCSTEVSVNVKVI--DKSMPVFDKQFYSVSVREDIQIHSPLPLTIRAESPL---NRKLIYSITGGNSRE 75 (274)
Q Consensus 1 l~v~A~D~g~~~~~~~~~v~I~v~--Ndn~P~F~~~~y~~~v~E~~~~g~~v~~~v~a~D~d---~~~i~y~i~~~~~~~ 75 (274)
|+|+|.|.|.|+++++.+|+|.|. |||.|+|.+.+|..+++||.++|..| ++|.|+|.| |+.|+|++.+|++..
T Consensus 237 lrVtA~d~~~P~~SAtttv~V~V~D~nDhsPvFEq~~Y~e~lREn~evGy~v-LtvrAtD~Dsp~Nani~Yrl~eg~~~~ 315 (2531)
T KOG4289|consen 237 LRVTAQDHGDPRRSATTTVTVLVLDTNDHSPVFEQDEYREELRENLEVGYEV-LTVRATDGDSPPNANIRYRLLEGNAKN 315 (2531)
T ss_pred EEEEeeecCCCcccceeEEEEEEeecCCCCcccchhHHHHHHhhccccCceE-EEEEeccCCCCCCCceEEEecCCCccc
Confidence 589999999999988888777775 99999999999999999999999999 999999999 999999999999889
Q ss_pred cEEEeCCCceEEeceeeEEEeecCCCccceEEeCceeEEEECCCCCcCcccEEEEEEEEEeCCCC-ceEEEEEEEEEEec
Q psy7563 76 QFAVDFNTGLYSLNRKLIYSITGGNSREQFAVDFNTGSITVVDELDFETVQEYTLTVRATDSVSG-HSAEVFVSVRVTDV 154 (274)
Q Consensus 76 ~F~i~~~tg~v~~~~~~~~~~~~~~~~~~f~i~~~tg~l~~~~~lD~E~~~~~~l~v~a~d~~~~-~~~~~~v~I~v~dv 154 (274)
.|.|++.+| .|.+..+||||+...|.|.|.|+|.|.+ ...++.|.|+|.|+
T Consensus 316 ~f~in~rSG----------------------------vI~T~a~lDRE~~~~y~L~VeAsDqG~~pgp~Ta~V~itV~D~ 367 (2531)
T KOG4289|consen 316 VFEINPRSG----------------------------VISTRAPLDREELESYQLDVEASDQGRPPGPRTAMVEITVEDE 367 (2531)
T ss_pred eeEEcCccc----------------------------eeeccCccCHHhhhheEEEEEeccCCCCCCCceEEEEEEEEec
Confidence 999999999 9999999999999999999999999853 55599999999999
Q ss_pred CCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCCCcceEEeccceEEEEeccCCCCcccCCCc
Q psy7563 155 NDCPPEFPQDWYNVSISEAAPFGTLILKVRALDNDSGKQKISSTAQIAHSFLEIFFLSLHRYMFTIGKGFFPERELLQPR 234 (274)
Q Consensus 155 Ndn~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~~g~~~i~~~~~ld~e~~~~~ 234 (274)
|||+|+|..+.|.+.|.|+..+++.|++|+|+|.|.|.|+.++|+|++++..+.|.|+..+|.+.+.. .||.|. .
T Consensus 368 NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~--plD~e~---~ 442 (2531)
T KOG4289|consen 368 NDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVE--PLDFEN---S 442 (2531)
T ss_pred CCCCccccccceEEEecccCCCCceEEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEec--cccccC---C
Confidence 99999999999999999999999999999999999999999999999999999999999999999988 677775 2
Q ss_pred EEEEEEEEeCCCCCcee
Q psy7563 235 CYRSEIEDRRRTGEEKA 251 (274)
Q Consensus 235 ~y~~~v~a~d~~g~~~~ 251 (274)
.|.+.|+|.|+.-.+..
T Consensus 443 ~ytl~IrAqDggrPpLs 459 (2531)
T KOG4289|consen 443 EYTLRIRAQDGGRPPLS 459 (2531)
T ss_pred eeEEEEEcccCCCCCcc
Confidence 69999999997555444
No 2
>KOG4289|consensus
Probab=100.00 E-value=1.8e-37 Score=299.79 Aligned_cols=265 Identities=28% Similarity=0.346 Sum_probs=233.3
Q ss_pred CeEEEEECCCCC--ceEEEEEEEEeccCCCCeeeCCceEEEEeCCCCCCCeEEEEEEEeCCC---CCeEEEEEecCCCCC
Q psy7563 1 MPSSILSSGIQP--CSTEVSVNVKVIDKSMPVFDKQFYSVSVREDIQIHSPLPLTIRAESPL---NRKLIYSITGGNSRE 75 (274)
Q Consensus 1 l~v~A~D~g~~~--~~~~~~v~I~v~Ndn~P~F~~~~y~~~v~E~~~~g~~v~~~v~a~D~d---~~~i~y~i~~~~~~~ 75 (274)
|.|.|.|+|.|+ +++.+.|+|.+.|||+|+|....|.+.|+||..+++.| ++|+|+|.| |+.++|+|.+|+..+
T Consensus 342 L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avv-lrV~AtDrD~g~Ng~VHYsi~Sgn~~G 420 (2531)
T KOG4289|consen 342 LDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVV-LRVTATDRDKGTNGKVHYSIASGNGRG 420 (2531)
T ss_pred EEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceE-EEEEecccCCCcCceEEEEeeccCccc
Confidence 579999999884 45555555555699999999999999999999999999 999999999 999999999999999
Q ss_pred cEEEeCCCceEEeceeeEEEeec---------------------------------------------------------
Q psy7563 76 QFAVDFNTGLYSLNRKLIYSITG--------------------------------------------------------- 98 (274)
Q Consensus 76 ~F~i~~~tg~v~~~~~~~~~~~~--------------------------------------------------------- 98 (274)
.|.||..||.+.+..+|||+...
T Consensus 421 ~f~id~~tGel~vv~plD~e~~~ytl~IrAqDggrPpLsn~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg~~v~~v 500 (2531)
T KOG4289|consen 421 QFYIDSLTGELDVVEPLDFENSEYTLRIRAQDGGRPPLSNTSGLVIQVLDINDHAPIFVSTPFQATVLENVPLGYLVCHV 500 (2531)
T ss_pred cEEEecccceEEEeccccccCCeeEEEEEcccCCCCCccCCCceEEEEEecCCCCceeEechhhhhhhhcccccceEEEE
Confidence 99999999999999999988655
Q ss_pred ----------------CCCccceEEeCceeEEEECCCCCcCcccEEEEEEEEEeCCCC-ceEEEEEEEEEEecCCCCCcc
Q psy7563 99 ----------------GNSREQFAVDFNTGSITVVDELDFETVQEYTLTVRATDSVSG-HSAEVFVSVRVTDVNDCPPEF 161 (274)
Q Consensus 99 ----------------~~~~~~f~i~~~tg~l~~~~~lD~E~~~~~~l~v~a~d~~~~-~~~~~~v~I~v~dvNdn~P~f 161 (274)
+.+.+.|.|+..+|.|++.+.||||+...|.|.|.|+|.|.| ..+.+.|.|.++|+|||.|.|
T Consensus 501 qaidadsg~na~l~y~laG~~pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~gtp~l~tstsI~Vtv~dvndndP~F 580 (2531)
T KOG4289|consen 501 QAIDADSGENARLHYSLAGVGPFQINNGSGWITVTKELDRETVEHYSLGVEARDHGTPPLSTSTSISVTVLDVNDNDPTF 580 (2531)
T ss_pred ecccCCCCcccceeeeeccCCCeeEecCCceEEEeecccccccceEEEEEEEcCCCCCcccccceEEEEecccCCCCCcc
Confidence 345678999999999999999999999999999999999876 778889999999999999999
Q ss_pred CCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCCCcceEEeccceEEEEeccCCCCcccCCCcEEEEEEE
Q psy7563 162 PQDWYNVSISEAAPFGTLILKVRALDNDSGKQKISSTAQIAHSFLEIFFLSLHRYMFTIGKGFFPERELLQPRCYRSEIE 241 (274)
Q Consensus 162 ~~~~~~~~v~E~~~~g~~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~~g~~~i~~~~~ld~e~~~~~~y~~~v~ 241 (274)
.++.|..++.|+.+.|+.|.+|+|.|.|. ++.++|+|..++-.+.|.|+..+|...|....-+|+.. .+.|.+.|.
T Consensus 581 t~~eytl~inED~pvgsSI~tvtAvD~d~--~s~ityqi~g~ntrn~Fsi~si~g~Glitlalp~dkKq--e~~~vl~vt 656 (2531)
T KOG4289|consen 581 TQKEYTLRINEDAPVGSSIVTVTAVDRDA--NSVITYQITGGNTRNRFSISSIGGGGLITLALPLDKKQ--ERQYVLAVT 656 (2531)
T ss_pred ccCceEEEecCCccccceEEEEEEecccc--ccceEEEecCCcccccceeeccCCcceEEeecchhhcc--cceEEEEEE
Confidence 99999999999999999999999999998 77899999999889999999998877776644456555 778999999
Q ss_pred EeCCCCCcee------ccCCCCCceEEeeCCCcee
Q psy7563 242 DRRRTGEEKA------SHSENKPYLFTINFATAGI 270 (274)
Q Consensus 242 a~d~~g~~~~------~~~~n~~~~f~i~~~tg~i 270 (274)
|+|+.-+-.+ ++.+...+.|...+-|+.|
T Consensus 657 AtDg~l~d~~~V~v~I~danThrpvFqs~pfTvsI 691 (2531)
T KOG4289|consen 657 ATDGTLQDTCSVNVNITDANTHRPVFQSSPFTVSI 691 (2531)
T ss_pred ecCCccccceEEEEEeeecccCCcccccCCeeEee
Confidence 9995444444 4566678888888877765
No 3
>KOG1219|consensus
Probab=100.00 E-value=1.3e-35 Score=294.91 Aligned_cols=253 Identities=25% Similarity=0.311 Sum_probs=216.4
Q ss_pred CeEEEEECCCCCceEEEEEEEEeccCCCCeeeCCceEEEEeCCCCCCCeEEEEEEEeCCC---CCeEEEEEecC-CCCCc
Q psy7563 1 MPSSILSSGIQPCSTEVSVNVKVIDKSMPVFDKQFYSVSVREDIQIHSPLPLTIRAESPL---NRKLIYSITGG-NSREQ 76 (274)
Q Consensus 1 l~v~A~D~g~~~~~~~~~v~I~v~Ndn~P~F~~~~y~~~v~E~~~~g~~v~~~v~a~D~d---~~~i~y~i~~~-~~~~~ 76 (274)
+.|.|.|.|+...-.+++|.+.++|||||.|..+.|+++|.||+..|+.| +.+.|+|.| |..++|.+.++ .-...
T Consensus 2441 i~v~a~Da~gr~af~tvti~ltDiNDnpPqF~a~~Y~~nI~enaskg~~V-~~v~A~D~De~snadvty~i~~e~~~~~v 2519 (4289)
T KOG1219|consen 2441 IAVMAFDAGGRVAFCTVTIILTDINDNPPQFDAQLYRVNITENASKGKLV-GHVIARDADEGSNADVTYEIVGESDVKHV 2519 (4289)
T ss_pred EEEEEecCCCeEEEEEEEEEEEecCCCCccccceeEEEEeecccCCCceE-EEEEEecCCCCCcccEEEEecCchhhhhe
Confidence 35778899876555666666677799999999999999999999999999 999999999 67999999876 44567
Q ss_pred EEEeCCCceEEeceeeEEEeec----------------------------------------------------------
Q psy7563 77 FAVDFNTGLYSLNRKLIYSITG---------------------------------------------------------- 98 (274)
Q Consensus 77 F~i~~~tg~v~~~~~~~~~~~~---------------------------------------------------------- 98 (274)
|.|++ +|.|++++.|+-...+
T Consensus 2520 ~~in~-sG~Itv~~sL~~~en~tl~l~vkA~D~g~P~~~s~ttV~v~vl~e~v~lPrFSep~y~fsvpEDv~vG~~Ig~v 2598 (4289)
T KOG1219|consen 2520 FEINE-SGVITVKRSLDGLENSTLHLFVKAIDDGKPRRRSNTTVIVTVLPEDVNLPRFSEPIYTFSVPEDVPVGEEIGQV 2598 (4289)
T ss_pred eeecC-CceEEeehhhhcccCcEEEEEEEeccCCCCCcccceEEEEEecCcccCcccccCceEEEeccccCCCCCeeeEE
Confidence 89988 9999986665543222
Q ss_pred --------------------CCCccceEEeCceeEEEECCCCCcCcccEEEEEEEEEeCCCCceEEEEEEEEEEecCCCC
Q psy7563 99 --------------------GNSREQFAVDFNTGSITVVDELDFETVQEYTLTVRATDSVSGHSAEVFVSVRVTDVNDCP 158 (274)
Q Consensus 99 --------------------~~~~~~f~i~~~tg~l~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~~v~I~v~dvNdn~ 158 (274)
++....|++|++||.|.+.++||+|+.++|++.|+|.+.+. ..+.+.|.|+|.|+|||+
T Consensus 2599 ~a~~a~~~~i~~~v~~gt~Esn~d~~Fsvdr~TG~i~v~ksLD~E~kk~yqi~v~a~~~~~-vva~tsv~vqVkDvNDNa 2677 (4289)
T KOG1219|consen 2599 SASDADEHVIYSLVLGGTPESNPDLPFSVDRNTGMIKVNKSLDHEKKKSYQIKVKATCGQW-VVAETSVFVQVKDVNDNA 2677 (4289)
T ss_pred eecccCCceEEEEEeCCCCCCCCCCceEEcCCCceEEeccccchhhhceEEEEEEeecCCc-eEEEEEEEEEeecccCCC
Confidence 45667799999999999999999999999999999999854 778889999999999999
Q ss_pred CccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCCCcceEEeccceEEEEeccCCCCcccCCCcEEEE
Q psy7563 159 PEFPQDWYNVSISEAAPFGTLILKVRALDNDSGKQKISSTAQIAHSFLEIFFLSLHRYMFTIGKGFFPERELLQPRCYRS 238 (274)
Q Consensus 159 P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~~g~~~i~~~~~ld~e~~~~~~y~~ 238 (274)
|.|..+.|.+.+.||.|.|+.|.+++|.|.|.|.||+|+|++.... ..|.|++.+|+++... .||+|. ...|.|
T Consensus 2678 PvFe~d~y~f~i~En~pvGtsV~qf~AsD~Ds~~nGqirysl~~~v--~yF~In~etGwlTt~~--eld~ek--~d~y~l 2751 (4289)
T KOG1219|consen 2678 PVFEKDPYLFIIEENSPVGTSVIQFHASDMDSGNNGQIRYSLTSPV--PYFAINPETGWLTTLF--ELDLEK--QDLYSL 2751 (4289)
T ss_pred ccccCCceeEEEeccCCCCceEEEEEeeccCCCCCceEEEEEcCCc--ceEEEcCCCCeeeehh--hhcccc--CCceEE
Confidence 9999999999999999999999999999999999999999998764 3999999999999666 688887 445999
Q ss_pred EEEEeCCCCCcee-------ccCCCCCceEE
Q psy7563 239 EIEDRRRTGEEKA-------SHSENKPYLFT 262 (274)
Q Consensus 239 ~v~a~d~~g~~~~-------~~~~n~~~~f~ 262 (274)
.|.|+|.++.... .+-++.++.|.
T Consensus 2752 kv~AtDhG~~ssq~~v~v~vtDvndspprf~ 2782 (4289)
T KOG1219|consen 2752 KVVATDHGVPSSQATVLVHVTDVNDSPPRFQ 2782 (4289)
T ss_pred EEEEecCCcccccceEEEEEEecCCCcchhh
Confidence 9999998887443 33555555554
No 4
>KOG1219|consensus
Probab=100.00 E-value=7.6e-35 Score=289.42 Aligned_cols=244 Identities=24% Similarity=0.252 Sum_probs=216.0
Q ss_pred CeEEEEECCCCCceEEEEEEEEe--ccCCCCeeeCCceEEEEeCCCCCCCeEEEEEEEeCCC---CCeEEEEEecCCCCC
Q psy7563 1 MPSSILSSGIQPCSTEVSVNVKV--IDKSMPVFDKQFYSVSVREDIQIHSPLPLTIRAESPL---NRKLIYSITGGNSRE 75 (274)
Q Consensus 1 l~v~A~D~g~~~~~~~~~v~I~v--~Ndn~P~F~~~~y~~~v~E~~~~g~~v~~~v~a~D~d---~~~i~y~i~~~~~~~ 75 (274)
|.|+|-|.|.|+.++.....|.+ .|||+|+|.+..|...|.|+..+|+.| +.|.+.|.| |+.++|+|.. +..
T Consensus 713 Lnvta~D~gtPqkss~r~l~v~vkd~ndn~p~f~e~sy~vtvsedtepgs~I-a~vetnd~D~g~NG~v~fsL~n--~sd 789 (4289)
T KOG1219|consen 713 LNVTAYDLGTPQKSSWRLLLVFVKDYNDNTPIFVERSYHVTVSEDTEPGSFI-AHVETNDTDGGNNGMVSFSLLN--KSD 789 (4289)
T ss_pred EEEEEecCCCchhhceeeEEEEEEecccCCccccccceEEEEecCCCCCceE-EEEEecccCCCCCceEEEEecC--Ccc
Confidence 57899999999877766655555 599999999999999999999999999 999999999 8999999964 567
Q ss_pred cEEEeCCCceEEeceeeEEEeec---------------------------------------------------------
Q psy7563 76 QFAVDFNTGLYSLNRKLIYSITG--------------------------------------------------------- 98 (274)
Q Consensus 76 ~F~i~~~tg~v~~~~~~~~~~~~--------------------------------------------------------- 98 (274)
.|+||+.||.|.+.++||.|.+.
T Consensus 790 vfsIdp~tGivv~~~sLdrE~q~~y~l~I~a~dqp~pq~~svv~l~vsvedVndnpPkci~~hsr~kipedlp~gt~~~~ 869 (4289)
T KOG1219|consen 790 VFSIDPFTGIVVTSKSLDREGQTSYHLKIEARDQPPPQLFSVVELDVSVEDVNDNPPKCIIRHSRSKIPEDLPYGTVTWQ 869 (4289)
T ss_pred eEEecCcccEEEeccccCcccCceeEEEEEEcCCCCCceEEEEEEEEEEeeccCCCCccccccccccCcccCCCceEEEE
Confidence 99999999988887777766444
Q ss_pred --------------------------------------------------------------------------------
Q psy7563 99 -------------------------------------------------------------------------------- 98 (274)
Q Consensus 99 -------------------------------------------------------------------------------- 98 (274)
T Consensus 870 l~A~d~diGq~~kvry~l~~~~v~~rvd~~sGavfi~~~LDf~k~~fynLsv~a~d~g~p~lss~chl~Vevldv~enlh 949 (4289)
T KOG1219|consen 870 LVALDPDIGQLGKVRYYLTDDTVGERVDFPSGAVFIGKPLDFEKSDFYNLSVTAVDRGTPILSSICHLEVEVLDVNENLH 949 (4289)
T ss_pred hhhcCcccCcCceeEEEEecCccccccccccccEEEecccccccccceEEEEEEecCCCcceeeeEEEEEEEeccCCCCC
Confidence
Q ss_pred ---------------------------------------------CCCccceEEeCceeEEEECCCCCcCcccEEEEEEE
Q psy7563 99 ---------------------------------------------GNSREQFAVDFNTGSITVVDELDFETVQEYTLTVR 133 (274)
Q Consensus 99 ---------------------------------------------~~~~~~f~i~~~tg~l~~~~~lD~E~~~~~~l~v~ 133 (274)
|...+.|.||..+|.|+..++||||....|-|++.
T Consensus 950 pp~F~~~v~e~~V~EnapiGT~vi~i~A~dedsgldg~l~Y~I~~gdg~g~FsId~~tG~irTl~~lDrE~ks~Ywltve 1029 (4289)
T KOG1219|consen 950 PPEFISFVTEGHVLENAPIGTIVIRIQARDEDSGLDGELSYKIRTGDGDGIFSIDSTTGSIRTLKALDREKKSSYWLTVE 1029 (4289)
T ss_pred CcchheeeeeeeEeecCCcceEEEEEEEecCCCCccceEEEEEEcCCcceeEEecCCcceEeechhhchhhcceEEEEEE
Confidence 67788999999999999999999999999999999
Q ss_pred EEeCCC-CceEEEEEEEEEEecCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCCCcceEEe
Q psy7563 134 ATDSVS-GHSAEVFVSVRVTDVNDCPPEFPQDWYNVSISEAAPFGTLILKVRALDNDSGKQKISSTAQIAHSFLEIFFLS 212 (274)
Q Consensus 134 a~d~~~-~~~~~~~v~I~v~dvNdn~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~F~i~ 212 (274)
|+|.|. +.++...+.|.|.|+|||+|+|.++.|..+|.|++|.+..|.++.|.|+|...|+++.|.|.+|++.++|.|+
T Consensus 1030 A~D~gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vsivq~ea~D~Dsssn~kLmykI~sGnyq~FF~Id 1109 (4289)
T KOG1219|consen 1030 AKDLGTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPETVSIVQAEANDPDSSSNQKLMYKITSGNYQGFFQID 1109 (4289)
T ss_pred EEecCCCccccceeEEEEEEecCCCCcccCCceEeeeeccCCCCceEEEEeccCCCCcccCcceEEEEccCCccceEEEc
Confidence 999986 4788899999999999999999999999999999999999999999999988899999999999999999999
Q ss_pred ccceEEEEeccCCCCcccCCCcEEEEEEEEeCCCCCcee
Q psy7563 213 LHRYMFTIGKGFFPERELLQPRCYRSEIEDRRRTGEEKA 251 (274)
Q Consensus 213 ~~~g~~~i~~~~~ld~e~~~~~~y~~~v~a~d~~g~~~~ 251 (274)
+.+|.++ +...||||...+ ..|.|...|.+-.+.+
T Consensus 1110 ~~TG~iT--t~r~LDRE~qdE--HiLeVTi~D~gep~l~ 1144 (4289)
T KOG1219|consen 1110 PETGLIT--TIRRLDREKQDE--HILEVTIQDNGEPWLC 1144 (4289)
T ss_pred cccceee--eehhhccccccc--ceEEEEEecCCCCccc
Confidence 9999998 555799999555 4455555555444433
No 5
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.97 E-value=1.7e-29 Score=210.42 Aligned_cols=180 Identities=37% Similarity=0.463 Sum_probs=165.0
Q ss_pred ceEEEEeCCCCCCCeEEEEEEEeCCC---CCeEEEEEecCCCCCcEEEeCCCceEEeceeeEEEeecCCCccceEEeCce
Q psy7563 35 FYSVSVREDIQIHSPLPLTIRAESPL---NRKLIYSITGGNSREQFAVDFNTGLYSLNRKLIYSITGGNSREQFAVDFNT 111 (274)
Q Consensus 35 ~y~~~v~E~~~~g~~v~~~v~a~D~d---~~~i~y~i~~~~~~~~F~i~~~tg~v~~~~~~~~~~~~~~~~~~f~i~~~t 111 (274)
.|.+.|.||.+.|+.+ +++.|.|+| +..+.|+|..+....+|.|++.+|
T Consensus 1 ~~~~~i~En~~~g~~v-~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG--------------------------- 52 (199)
T cd00031 1 SYSVSVPENAPPGTVV-GTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTG--------------------------- 52 (199)
T ss_pred CeEEEEeCCCCCCCEE-EEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCC---------------------------
Confidence 3789999999999999 999999999 378999999877657999999888
Q ss_pred eEEEECCCCCcCcccEEEEEEEEEeCCCC-ceEEEEEEEEEEecCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCC
Q psy7563 112 GSITVVDELDFETVQEYTLTVRATDSVSG-HSAEVFVSVRVTDVNDCPPEFPQDWYNVSISEAAPFGTLILKVRALDNDS 190 (274)
Q Consensus 112 g~l~~~~~lD~E~~~~~~l~v~a~d~~~~-~~~~~~v~I~v~dvNdn~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~ 190 (274)
.|++.++||||....|.|.|+|+|.|.+ ++++..+.|.|.|+|||+|.|....|.+.|.|+.++|+.++++.|+|+|.
T Consensus 53 -~l~~~~~lD~e~~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~~~~a~D~D~ 131 (199)
T cd00031 53 -VITTTKPLDREEQSEYTLTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVGTVTATDADS 131 (199)
T ss_pred -EEEECCCCCCcCCceEEEEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEEEEEEEcCCC
Confidence 9999999999999999999999997655 55899999999999999999998899999999999999999999999999
Q ss_pred CCCceEEEEEecCCCCcceEEeccceEEEEeccCCCCcccCCCcEEEEEEEEeCCCC
Q psy7563 191 GKQKISSTAQIAHSFLEIFFLSLHRYMFTIGKGFFPERELLQPRCYRSEIEDRRRTG 247 (274)
Q Consensus 191 g~n~~i~Y~i~~~~~~~~F~i~~~~g~~~i~~~~~ld~e~~~~~~y~~~v~a~d~~g 247 (274)
+.++.++|+|..+...+.|.|++.+|.+.+.. .+|+|. ...|.+.|.|.|..+
T Consensus 132 ~~~~~~~y~l~~~~~~~~f~i~~~~G~i~~~~--~ld~e~--~~~~~l~v~a~D~~~ 184 (199)
T cd00031 132 GENAKLTYSILSGNDKELFSIDPNTGIITLAK--PLDREE--KSSYELTVVATDGGG 184 (199)
T ss_pred CCCccEEEEEeCCCCCCEEEEeCCceEEEeCC--ccCCcc--CceEEEEEEEEECCC
Confidence 99999999998887668999999999988776 789998 448999999999875
No 6
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.80 E-value=5.1e-18 Score=141.05 Aligned_cols=124 Identities=36% Similarity=0.449 Sum_probs=108.0
Q ss_pred CeEEEEECCCCCceEEEEEEE--EeccCCCCeeeCCceEEEEeCCCCCCCeEEEEEEEeCCC---CCeEEEEEecCCCCC
Q psy7563 1 MPSSILSSGIQPCSTEVSVNV--KVIDKSMPVFDKQFYSVSVREDIQIHSPLPLTIRAESPL---NRKLIYSITGGNSRE 75 (274)
Q Consensus 1 l~v~A~D~g~~~~~~~~~v~I--~v~Ndn~P~F~~~~y~~~v~E~~~~g~~v~~~v~a~D~d---~~~i~y~i~~~~~~~ 75 (274)
|+|+|+|.|.|.+++...|+| .++|||+|.|....|.+.|.|+.++|+.+ +++.|+|+| ++.+.|+|..+....
T Consensus 70 l~v~a~D~g~~~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i-~~~~a~D~D~~~~~~~~y~l~~~~~~~ 148 (199)
T cd00031 70 LTVVASDGGGPPLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVV-GTVTATDADSGENAKLTYSILSGNDKE 148 (199)
T ss_pred EEEEEEECCcCcceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEE-EEEEEEcCCCCCCccEEEEEeCCCCCC
Confidence 579999998887545555544 55699999999999999999999999999 999999999 599999999876557
Q ss_pred cEEEeCCCceEEeceeeEEEeecCCCccceEEeCceeEEEECCCCCcCcccEEEEEEEEEeCCC-CceEEEEEEEEEEe
Q psy7563 76 QFAVDFNTGLYSLNRKLIYSITGGNSREQFAVDFNTGSITVVDELDFETVQEYTLTVRATDSVS-GHSAEVFVSVRVTD 153 (274)
Q Consensus 76 ~F~i~~~tg~v~~~~~~~~~~~~~~~~~~f~i~~~tg~l~~~~~lD~E~~~~~~l~v~a~d~~~-~~~~~~~v~I~v~d 153 (274)
+|.|++.+| .|.+.++||||....|.|.|.|+|.+. .++++..++|.|.|
T Consensus 149 ~f~i~~~~G----------------------------~i~~~~~ld~e~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~d 199 (199)
T cd00031 149 LFSIDPNTG----------------------------IITLAKPLDREEKSSYELTVVATDGGGPPLSSTATVTVTVLD 199 (199)
T ss_pred EEEEeCCce----------------------------EEEeCCccCCccCceEEEEEEEEECCCCCceeEEEEEEEEEC
Confidence 899998888 999999999999999999999999864 47788888888875
No 7
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.73 E-value=1.2e-16 Score=117.10 Aligned_cols=88 Identities=42% Similarity=0.528 Sum_probs=82.4
Q ss_pred eEEEEeCCCCCCCeEEEEEEEeCCC---CCeEEEEEecCCCCCcEEEeCCCceEEeceeeEEEeecCCCccceEEeCcee
Q psy7563 36 YSVSVREDIQIHSPLPLTIRAESPL---NRKLIYSITGGNSREQFAVDFNTGLYSLNRKLIYSITGGNSREQFAVDFNTG 112 (274)
Q Consensus 36 y~~~v~E~~~~g~~v~~~v~a~D~d---~~~i~y~i~~~~~~~~F~i~~~tg~v~~~~~~~~~~~~~~~~~~f~i~~~tg 112 (274)
|.++|+|+.++|+.+ +++.|.|+| |+.+.|+|..+++..+|.|++.+|
T Consensus 1 Y~~~v~E~~~~g~~v-~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg---------------------------- 51 (93)
T PF00028_consen 1 YSFSVPENAPPGTVV-GQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTG---------------------------- 51 (93)
T ss_dssp EEEEEETTGSTSSEE-EEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTT----------------------------
T ss_pred CEEEEECCCCCCCEE-EEEEEEeCCCCCCceEEEEEecCcccCceEEeeeee----------------------------
Confidence 789999999999999 999999999 899999999988789999999999
Q ss_pred EEEECCCCCcCcccEEEEEEEEEeC-CCC-ceEEEEEEEEEE
Q psy7563 113 SITVVDELDFETVQEYTLTVRATDS-VSG-HSAEVFVSVRVT 152 (274)
Q Consensus 113 ~l~~~~~lD~E~~~~~~l~v~a~d~-~~~-~~~~~~v~I~v~ 152 (274)
.|++.++||||....|.|.|.|+|. +.+ +++++.|+|+|+
T Consensus 52 ~i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~~V~I~V~ 93 (93)
T PF00028_consen 52 EISLKKPLDRETQSSYQLTVRATDSGGSPPLSSTATVTINVL 93 (93)
T ss_dssp EEEESSSSCTTTTSEEEEEEEEEETTTSSEEEEEEEEEEEEE
T ss_pred ccccceecCcccCCEEEEEEEEEECCCCCCCEEEEEEEEEEC
Confidence 9999999999999999999999999 655 888899999874
No 8
>KOG1834|consensus
Probab=99.64 E-value=1.3e-14 Score=134.13 Aligned_cols=191 Identities=23% Similarity=0.247 Sum_probs=142.2
Q ss_pred cCCCCeeeCCceEEEEeCCCCCCCeEEEEEEEeCCC------CCeEEEEEecCC-CCCcEEEeCCCceEEeceeeEEEee
Q psy7563 25 DKSMPVFDKQFYSVSVREDIQIHSPLPLTIRAESPL------NRKLIYSITGGN-SREQFAVDFNTGLYSLNRKLIYSIT 97 (274)
Q Consensus 25 Ndn~P~F~~~~y~~~v~E~~~~g~~v~~~v~a~D~d------~~~i~y~i~~~~-~~~~F~i~~~tg~v~~~~~~~~~~~ 97 (274)
|-+-|.. ...|.+-|.||-..-... ..+.|.|.| +..--|.|.+.. |...--+|..||
T Consensus 27 nkhkpwi-e~ey~gvV~Endntvll~-Ppl~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~TG------------- 91 (952)
T KOG1834|consen 27 NKHKPWI-EEEYHGVVTENDNTVLLD-PPLAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKYTG------------- 91 (952)
T ss_pred cccCccc-ccceeEEEEeCCceEEeC-CCeeeecCCCCcccccccceeEecCCCCCceEEEEeccCC-------------
Confidence 5666654 456899999996533333 456777776 345567776422 222334466666
Q ss_pred cCCCccceEEeCceeEEEECCCCCcCcccEEEEEEEEEeCCCC-------ceEEEEEEEEEEecCCCCCccCCCceEEEE
Q psy7563 98 GGNSREQFAVDFNTGSITVVDELDFETVQEYTLTVRATDSVSG-------HSAEVFVSVRVTDVNDCPPEFPQDWYNVSI 170 (274)
Q Consensus 98 ~~~~~~~f~i~~~tg~l~~~~~lD~E~~~~~~l~v~a~d~~~~-------~~~~~~v~I~v~dvNdn~P~f~~~~~~~~v 170 (274)
.|.|+.+.+||.|.+++|+|+|+|.|+|.. .+..++|.|+|.|+|+++|+|..+.|.+.|
T Consensus 92 -------------egvlRaK~~lDCelqkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V 158 (952)
T KOG1834|consen 92 -------------EGVLRAKEPLDCELQKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHV 158 (952)
T ss_pred -------------ceEEeecCcccccccccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEE
Confidence 568999999999999999999999999642 355689999999999999999999999999
Q ss_pred eCCCCCCcEEEEEEEEeCCCCC-CceE-EEEEecCCCCcceEEeccceEEEEeccCCCCcccCCCcEEEEEEEEeCCCCC
Q psy7563 171 SEAAPFGTLILKVRALDNDSGK-QKIS-STAQIAHSFLEIFFLSLHRYMFTIGKGFFPERELLQPRCYRSEIEDRRRTGE 248 (274)
Q Consensus 171 ~E~~~~g~~v~~v~A~D~D~g~-n~~i-~Y~i~~~~~~~~F~i~~~~g~~~i~~~~~ld~e~~~~~~y~~~v~a~d~~g~ 248 (274)
.|+ +.-..|++|.|.|.|.++ +++| .|.|+. +.-.|.|+.+ |.+. +...|.+.. .+.|.++|.|-|=+-.
T Consensus 159 ~EG-K~yd~il~veAiD~DCspq~sqIC~YEI~t--~d~PFaIdn~-G~ir--nTekLny~k--e~~Y~ltVtAyDCg~k 230 (952)
T KOG1834|consen 159 TEG-KVYDSILRVEAIDKDCSPQYSQICEYEITT--PDVPFAIDND-GNIR--NTEKLNYTK--EHQYKLTVTAYDCGKK 230 (952)
T ss_pred ecc-eeeeeeEEEEeecCCCCCcccceeEEEecC--CCCceEEcCC-Cccc--ccccccccc--ceeEEEEEEEEecccc
Confidence 998 456789999999999875 7776 588866 4568999965 4333 333455555 6789999999996555
Q ss_pred cee
Q psy7563 249 EKA 251 (274)
Q Consensus 249 ~~~ 251 (274)
..+
T Consensus 231 raa 233 (952)
T KOG1834|consen 231 RAA 233 (952)
T ss_pred ccc
Confidence 444
No 9
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.61 E-value=9.9e-15 Score=103.56 Aligned_cols=71 Identities=48% Similarity=0.593 Sum_probs=64.1
Q ss_pred CCeEEEEEecCCCCCcEEEeCCCceEEeceeeEEEeecCCCccceEEeCceeEEEECCCCCcCcccEEEEEEEEEeCCCC
Q psy7563 61 NRKLIYSITGGNSREQFAVDFNTGLYSLNRKLIYSITGGNSREQFAVDFNTGSITVVDELDFETVQEYTLTVRATDSVSG 140 (274)
Q Consensus 61 ~~~i~y~i~~~~~~~~F~i~~~tg~v~~~~~~~~~~~~~~~~~~f~i~~~tg~l~~~~~lD~E~~~~~~l~v~a~d~~~~ 140 (274)
++.+.|+|.+++...+|.|++.+| .|++.++||||..+.|.|.|+|+|.+.+
T Consensus 8 n~~i~Y~i~~~~~~~~F~i~~~tg----------------------------~i~~~~~LD~e~~~~y~l~v~a~D~~~~ 59 (79)
T smart00112 8 NGKVTYSILSGNEDGLFSIDPETG----------------------------EITTTKPLDREEQPEYTLTVEATDGGGP 59 (79)
T ss_pred CcEEEEEEecCCCCCEEEEeCCcc----------------------------EEEeCCccCeeCCCeEEEEEEEEECCCC
Confidence 678999998776557899998888 8999999999999999999999999764
Q ss_pred -ceEEEEEEEEEEecCCCCC
Q psy7563 141 -HSAEVFVSVRVTDVNDCPP 159 (274)
Q Consensus 141 -~~~~~~v~I~v~dvNdn~P 159 (274)
+++++.|.|.|.|+|||+|
T Consensus 60 ~~~~~~~v~I~V~D~Nd~~P 79 (79)
T smart00112 60 PLSSTATVTVTVLDVNDNAP 79 (79)
T ss_pred CcccEEEEEEEEEECCCCCC
Confidence 8889999999999999998
No 10
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.56 E-value=6.2e-14 Score=102.63 Aligned_cols=82 Identities=23% Similarity=0.298 Sum_probs=74.9
Q ss_pred eEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCCCcceEEeccceEEEEeccCCCCcccCCCcEEEEEEEEeCC
Q psy7563 166 YNVSISEAAPFGTLILKVRALDNDSGKQKISSTAQIAHSFLEIFFLSLHRYMFTIGKGFFPERELLQPRCYRSEIEDRRR 245 (274)
Q Consensus 166 ~~~~v~E~~~~g~~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~~g~~~i~~~~~ld~e~~~~~~y~~~v~a~d~ 245 (274)
|.+.|+|+.++|+.++++.|.|+|.+.|+.+.|+|..++..+.|.|++.+|.+.+.+ .||+|. ...|.|.|.|.|.
T Consensus 1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~--~LD~E~--~~~y~l~v~a~D~ 76 (93)
T PF00028_consen 1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKK--PLDRET--QSSYQLTVRATDS 76 (93)
T ss_dssp EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESS--SSCTTT--TSEEEEEEEEEET
T ss_pred CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccce--ecCccc--CCEEEEEEEEEEC
Confidence 789999999999999999999999999999999999999899999999999988776 699999 6789999999999
Q ss_pred CCCcee
Q psy7563 246 TGEEKA 251 (274)
Q Consensus 246 ~g~~~~ 251 (274)
.|.+..
T Consensus 77 ~~~~~~ 82 (93)
T PF00028_consen 77 GGSPPL 82 (93)
T ss_dssp TTSSEE
T ss_pred CCCCCC
Confidence 555444
No 11
>KOG1834|consensus
Probab=98.95 E-value=1.1e-08 Score=95.43 Aligned_cols=122 Identities=25% Similarity=0.372 Sum_probs=97.9
Q ss_pred CeEEEEECCC-C-------CceEEEEEEEEeccCCCCeeeCCceEEEEeCCCCCCCeEEEEEEEeCCC-----CCeEEEE
Q psy7563 1 MPSSILSSGI-Q-------PCSTEVSVNVKVIDKSMPVFDKQFYSVSVREDIQIHSPLPLTIRAESPL-----NRKLIYS 67 (274)
Q Consensus 1 l~v~A~D~g~-~-------~~~~~~~v~I~v~Ndn~P~F~~~~y~~~v~E~~~~g~~v~~~v~a~D~d-----~~~i~y~ 67 (274)
|+|+|-|=|. | ...+++.|.|+++|..||+|..+.|.+.|.|... =..| +.|.|.|.| ...-.|.
T Consensus 111 f~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~-yd~i-l~veAiD~DCspq~sqIC~YE 188 (952)
T KOG1834|consen 111 FTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKV-YDSI-LRVEAIDKDCSPQYSQICEYE 188 (952)
T ss_pred EEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEeccee-eeee-EEEEeecCCCCCcccceeEEE
Confidence 5788988764 3 3555666667778999999999999999999865 4456 799999999 4555999
Q ss_pred EecCCCCCcEEEeCCCceEEeceeeEEEeecCCCccceEEeCceeEEEECCCCCcCcccEEEEEEEEEeCCCCceEE-EE
Q psy7563 68 ITGGNSREQFAVDFNTGLYSLNRKLIYSITGGNSREQFAVDFNTGSITVVDELDFETVQEYTLTVRATDSVSGHSAE-VF 146 (274)
Q Consensus 68 i~~~~~~~~F~i~~~tg~v~~~~~~~~~~~~~~~~~~f~i~~~tg~l~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~-~~ 146 (274)
|.. +.-.|.||. .| .|+...+|+|....+|.|+|.|.|+|..+.+. +.
T Consensus 189 I~t--~d~PFaIdn-~G----------------------------~irnTekLny~ke~~Y~ltVtAyDCg~kraa~d~l 237 (952)
T KOG1834|consen 189 ITT--PDVPFAIDN-DG----------------------------NIRNTEKLNYTKEHQYKLTVTAYDCGKKRAASDSL 237 (952)
T ss_pred ecC--CCCceEEcC-CC----------------------------ccccccccccccceeEEEEEEEEecccccccCcce
Confidence 986 446899996 56 89999999999999999999999998654443 77
Q ss_pred EEEEEEecC
Q psy7563 147 VSVRVTDVN 155 (274)
Q Consensus 147 v~I~v~dvN 155 (274)
|+|.|....
T Consensus 238 V~v~Vkp~C 246 (952)
T KOG1834|consen 238 VTVHVKPTC 246 (952)
T ss_pred EEEEecCcc
Confidence 888887654
No 12
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=98.64 E-value=1.7e-07 Score=66.12 Aligned_cols=59 Identities=20% Similarity=0.162 Sum_probs=51.1
Q ss_pred EeCCCCCCceEEEEEecCCCCcceEEeccceEEEEeccCCCCcccCCCcEEEEEEEEeCCCCC
Q psy7563 186 LDNDSGKQKISSTAQIAHSFLEIFFLSLHRYMFTIGKGFFPERELLQPRCYRSEIEDRRRTGE 248 (274)
Q Consensus 186 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~~g~~~i~~~~~ld~e~~~~~~y~~~v~a~d~~g~ 248 (274)
+|+|.|.|+.++|+|.++.....|.|++.+|.+.+.. .||+|. ...|.|.|.|.|.++.
T Consensus 1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~--~LD~e~--~~~y~l~v~a~D~~~~ 59 (79)
T smart00112 1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTK--PLDREE--QPEYTLTVEATDGGGP 59 (79)
T ss_pred CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCC--ccCeeC--CCeEEEEEEEEECCCC
Confidence 4899999999999999887668999999999776654 899998 5789999999998764
No 13
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.37 E-value=0.00024 Score=50.72 Aligned_cols=60 Identities=22% Similarity=0.241 Sum_probs=34.6
Q ss_pred eEEEEeCCCCCCCeEEEEEEEeCCC-----CCeEEEEEecCCCCCcEEEeCCCceEEeceeeEEEeecCCCccceEEeCc
Q psy7563 36 YSVSVREDIQIHSPLPLTIRAESPL-----NRKLIYSITGGNSREQFAVDFNTGLYSLNRKLIYSITGGNSREQFAVDFN 110 (274)
Q Consensus 36 y~~~v~E~~~~g~~v~~~v~a~D~d-----~~~i~y~i~~~~~~~~F~i~~~tg~v~~~~~~~~~~~~~~~~~~f~i~~~ 110 (274)
...+|+|..+.|+.| +.| |.|.- -..-.+++.+.....+|.++..+|
T Consensus 3 i~YsV~EE~~~Gt~I-Gni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG-------------------------- 54 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVI-GNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTG-------------------------- 54 (84)
T ss_dssp EEEEEESS--TT-EE-EEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTS--------------------------
T ss_pred eEEEeecCCCCCCEE-EEh-HHhhCCCcccccccceEEeecCCcceeEecCCce--------------------------
Confidence 356899999999999 887 44442 112245554444445666666666
Q ss_pred eeEEEECCCCCcCcc
Q psy7563 111 TGSITVVDELDFETV 125 (274)
Q Consensus 111 tg~l~~~~~lD~E~~ 125 (274)
.|+++..+|||.-
T Consensus 55 --~L~v~~rIDRE~L 67 (84)
T PF08266_consen 55 --DLFVSERIDREEL 67 (84)
T ss_dssp --EEEESS--SCCCC
T ss_pred --eEEeCCccCHHHH
Confidence 9999999999963
No 14
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.18 E-value=0.0058 Score=44.16 Aligned_cols=86 Identities=24% Similarity=0.371 Sum_probs=44.6
Q ss_pred CCCeeeCCceEEEEeCCCCCCCeEEEEEEEeCCC-CCeEEEEEecCCCCCcEEEeCCCceEEeceeeEEEeecCCCccce
Q psy7563 27 SMPVFDKQFYSVSVREDIQIHSPLPLTIRAESPL-NRKLIYSITGGNSREQFAVDFNTGLYSLNRKLIYSITGGNSREQF 105 (274)
Q Consensus 27 n~P~F~~~~y~~~v~E~~~~g~~v~~~v~a~D~d-~~~i~y~i~~~~~~~~F~i~~~tg~v~~~~~~~~~~~~~~~~~~f 105 (274)
..|=|.+..|...|+.+...|..+ ++|.-.|-. +..+.|.- .++ .|.|.. .|
T Consensus 2 C~pGF~~~~~~~~Vp~~l~~g~~l-g~V~f~dC~~~~~~~~~s--sDp--dF~V~~-DG--------------------- 54 (90)
T PF08758_consen 2 CRPGFSQKKYTFEVPSNLEAGQPL-GKVNFEDCTGRRRVIFES--SDP--DFRVLE-DG--------------------- 54 (90)
T ss_dssp ---B--S-EEEE----SS-SS--E-EE---B--SS---EEEE-----S--EEEEET-TT---------------------
T ss_pred CcCCcccceEEEEcCchhhCCcEE-EEEEeccCCCCCceEEec--CCC--CEEEcC-CC---------------------
Confidence 358899999999999999999999 999999886 45677766 344 799998 57
Q ss_pred EEeCceeEEEECCCCCcCcccEEEEEEEEEeCCCCceEEEEE
Q psy7563 106 AVDFNTGSITVVDELDFETVQEYTLTVRATDSVSGHSAEVFV 147 (274)
Q Consensus 106 ~i~~~tg~l~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~~v 147 (274)
.|++++++.... ..-.|.|.|+|.......++.|
T Consensus 55 -------sVy~~r~v~l~~-~~~~F~V~a~D~~~~~~~~v~V 88 (90)
T PF08758_consen 55 -------SVYAKRPVQLSS-EQRSFTVHAWDSQTQEQKEVKV 88 (90)
T ss_dssp -------EEEEES--S-SS-S-EEEEEEEEETTTTEEEEEEE
T ss_pred -------eEEEeeeEecCC-CceEEEEEEECCCCCeEEEEEE
Confidence 899999887543 3357999999985443334333
No 15
>PF08266 Cadherin_2: Cadherin-like; InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=96.74 E-value=0.0028 Score=45.18 Aligned_cols=63 Identities=19% Similarity=0.141 Sum_probs=41.5
Q ss_pred eEEEEeCCCCCCcEEEEEEEEeCCCCC--CceEEEEEecCCCCcceEEeccceEEEEeccCCCCcccC
Q psy7563 166 YNVSISEAAPFGTLILKVRALDNDSGK--QKISSTAQIAHSFLEIFFLSLHRYMFTIGKGFFPERELL 231 (274)
Q Consensus 166 ~~~~v~E~~~~g~~v~~v~A~D~D~g~--n~~i~Y~i~~~~~~~~F~i~~~~g~~~i~~~~~ld~e~~ 231 (274)
..++|+|..++|+.|+.| |.|.-... -....|++++.....+|.++..+|.+.+.. .+|||..
T Consensus 3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~--rIDRE~L 67 (84)
T PF08266_consen 3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSE--RIDREEL 67 (84)
T ss_dssp EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS----SCCCC
T ss_pred eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCC--ccCHHHH
Confidence 467899999999999999 44443221 123468888877789999999999999875 7999986
No 16
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=96.66 E-value=0.013 Score=42.38 Aligned_cols=80 Identities=14% Similarity=0.037 Sum_probs=39.8
Q ss_pred CCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCCCcceEEeccceEEEEeccCCCCcccCCCcEEE
Q psy7563 158 PPEFPQDWYNVSISEAAPFGTLILKVRALDNDSGKQKISSTAQIAHSFLEIFFLSLHRYMFTIGKGFFPERELLQPRCYR 237 (274)
Q Consensus 158 ~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~~g~~~i~~~~~ld~e~~~~~~y~ 237 (274)
.|-|.+..|.+.|+.+...|..|++|.-.|=.. +..+.|.- ....|+|..++ .++......|-.+. -.
T Consensus 3 ~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~--~~~~~~~s----sDpdF~V~~DG-sVy~~r~v~l~~~~-----~~ 70 (90)
T PF08758_consen 3 RPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTG--RRRVIFES----SDPDFRVLEDG-SVYAKRPVQLSSEQ-----RS 70 (90)
T ss_dssp --B--S-EEEE----SS-SS--EEE---B--SS-----EEEE-------SEEEEETTT-EEEEES--S-SSS------EE
T ss_pred cCCcccceEEEEcCchhhCCcEEEEEEeccCCC--CCceEEec----CCCCEEEcCCC-eEEEeeeEecCCCc-----eE
Confidence 588999999999999999999999999887643 55688873 23489999874 45544433333332 46
Q ss_pred EEEEEeCCCCCc
Q psy7563 238 SEIEDRRRTGEE 249 (274)
Q Consensus 238 ~~v~a~d~~g~~ 249 (274)
+.|.|.|..+..
T Consensus 71 F~V~a~D~~~~~ 82 (90)
T PF08758_consen 71 FTVHAWDSQTQE 82 (90)
T ss_dssp EEEEEEETTTTE
T ss_pred EEEEEECCCCCe
Confidence 888888888775
No 17
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=96.09 E-value=0.17 Score=36.96 Aligned_cols=70 Identities=29% Similarity=0.325 Sum_probs=51.1
Q ss_pred EeCCCCCeEEEEEecCC---CCCcEEEeCCCceEEeceeeEEEeecCCCccceEEeCceeEEEECCCCCcCcccEEEEEE
Q psy7563 56 AESPLNRKLIYSITGGN---SREQFAVDFNTGLYSLNRKLIYSITGGNSREQFAVDFNTGSITVVDELDFETVQEYTLTV 132 (274)
Q Consensus 56 a~D~d~~~i~y~i~~~~---~~~~F~i~~~tg~v~~~~~~~~~~~~~~~~~~f~i~~~tg~l~~~~~lD~E~~~~~~l~v 132 (274)
..|+|+..+.|++...+ ...|.+.++.++ .|.-. |...+ ...|.+.|
T Consensus 24 F~d~d~~~lty~~~~~~~~~lP~Wl~fd~~~~----------------------------~~~Gt-P~~~~-~g~~~i~v 73 (97)
T smart00736 24 FTDADGDTLTYSATLSDGSALPSWLSFDSDTG----------------------------TLSGT-PTNSD-VGSLSLKV 73 (97)
T ss_pred eECCCCCeEEEEEEeCCCCCCCCeEEEeCCCC----------------------------EEEEE-CCCCC-CcEEEEEE
Confidence 46777888899886432 125888888777 66653 43333 45699999
Q ss_pred EEEeCCCCceEEEEEEEEEEecCC
Q psy7563 133 RATDSVSGHSAEVFVSVRVTDVND 156 (274)
Q Consensus 133 ~a~d~~~~~~~~~~v~I~v~dvNd 156 (274)
.|+|.. ..+....++|.|.+.|+
T Consensus 74 ~a~D~~-g~~~~~~f~i~V~~~~~ 96 (97)
T smart00736 74 TATDSS-GASASDTFTITVVNTND 96 (97)
T ss_pred EEEECC-CCEEEEEEEEEEeCCCC
Confidence 999985 36677889999999887
No 18
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=96.04 E-value=0.6 Score=37.42 Aligned_cols=123 Identities=20% Similarity=0.266 Sum_probs=64.6
Q ss_pred CeE-EEEECCCCCceEEEEEEEEeccCCCCeeeCCceEEEEeCCCCCCCeEEEEEEEeCCCC--CeEEEEEecCCCCCcE
Q psy7563 1 MPS-SILSSGIQPCSTEVSVNVKVIDKSMPVFDKQFYSVSVREDIQIHSPLPLTIRAESPLN--RKLIYSITGGNSREQF 77 (274)
Q Consensus 1 l~v-~A~D~g~~~~~~~~~v~I~v~Ndn~P~F~~~~y~~~v~E~~~~g~~v~~~v~a~D~d~--~~i~y~i~~~~~~~~F 77 (274)
+++ .|.|..+...+.....++.+ +..||...- .....+..+..+...+...+.++|... ..-..+|.+|.....-
T Consensus 19 l~~~~a~D~agN~~~~~~~~~~~i-D~T~Ptisi-~~~~~~~~g~~v~~~~~i~i~~tD~~~~~~i~sv~l~Gg~~~d~v 96 (158)
T PF13750_consen 19 LTVVTATDAAGNTSTSTVSETFTI-DNTPPTISI-SDGASVANGSTVYGLVNISINVTDNSDDSKITSVSLTGGPASDSV 96 (158)
T ss_pred EEEEEEEecCCCEEEEEEeeEEEE-cCCCCEEEE-ecCCccCCCccccceeeeEEEEEeCCCCceEEEEEEECCcccceE
Confidence 356 69999776444444333332 445998765 222233333333333325677777652 3446667655433332
Q ss_pred EE--eCCCceEEeceeeEEEeecCCCccceEEeCceeEEEEC--CCC-CcCcccEEEEEEEEEeCCCCceEEEEEEEEEE
Q psy7563 78 AV--DFNTGLYSLNRKLIYSITGGNSREQFAVDFNTGSITVV--DEL-DFETVQEYTLTVRATDSVSGHSAEVFVSVRVT 152 (274)
Q Consensus 78 ~i--~~~tg~v~~~~~~~~~~~~~~~~~~f~i~~~tg~l~~~--~~l-D~E~~~~~~l~v~a~d~~~~~~~~~~v~I~v~ 152 (274)
.+ ... | .+ ...+. +-+ ..|....|+|+|.|.|.. +..++..+.+...
T Consensus 97 ~ls~~~~-~-----------------~~---------~~~~~yp~~fpsle~~~~YtLtV~a~D~a-GN~~~~si~F~y~ 148 (158)
T PF13750_consen 97 SLSWTNK-G-----------------NG---------VYTLEYPRIFPSLEADDSYTLTVSATDKA-GNQSTKSISFSYM 148 (158)
T ss_pred EEeeEec-c-----------------Cc---------eEEeecccccCCcCCCCeEEEEEEEEecC-CCEEEEEEEEEEe
Confidence 22 111 1 11 22221 111 237788999999999983 4555666666655
Q ss_pred e
Q psy7563 153 D 153 (274)
Q Consensus 153 d 153 (274)
.
T Consensus 149 P 149 (158)
T PF13750_consen 149 P 149 (158)
T ss_pred C
Confidence 3
No 19
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=93.52 E-value=1.2 Score=32.73 Aligned_cols=94 Identities=19% Similarity=0.229 Sum_probs=57.2
Q ss_pred EEEEEeCCC-CCeEEEEEec-CCCCCcEEEeCCCceEEeceeeEEEeecCCCccceEEeCceeEEEECCCCCcCcccEEE
Q psy7563 52 LTIRAESPL-NRKLIYSITG-GNSREQFAVDFNTGLYSLNRKLIYSITGGNSREQFAVDFNTGSITVVDELDFETVQEYT 129 (274)
Q Consensus 52 ~~v~a~D~d-~~~i~y~i~~-~~~~~~F~i~~~tg~v~~~~~~~~~~~~~~~~~~f~i~~~tg~l~~~~~lD~E~~~~~~ 129 (274)
+++.++|+| +....+++.. ....+.|.|+. .|.. .|.++.. ....+.|.--....-.
T Consensus 2 G~Lt~sD~D~gd~~~~s~~~~~g~yGtlti~~-~G~w-----------------tYtl~n~---~~avq~L~~Ge~~tds 60 (99)
T TIGR01965 2 GQLTISDADAGQAHFIAQTDAAGQYGTFSIDA-DGQW-----------------TYQADNS---QTAVQALKAGETLTDT 60 (99)
T ss_pred CceEEeCCCCCCceEEecccccCCcEEEEECC-CCcE-----------------EEEeCCC---cHHHHhhcCCCEEEEE
Confidence 467899999 6777777742 22346788887 5621 1111111 1122334333333567
Q ss_pred EEEEEEeCCCCceEEEEEEEEEEecCCCCCccCCCceEEEEeCC
Q psy7563 130 LTVRATDSVSGHSAEVFVSVRVTDVNDCPPEFPQDWYNVSISEA 173 (274)
Q Consensus 130 l~v~a~d~~~~~~~~~~v~I~v~dvNdn~P~f~~~~~~~~v~E~ 173 (274)
|++.+.|+ .+..|+|+|.-.|| +|+.... -...|.|+
T Consensus 61 Ftvtv~DG-----tt~~vtItI~GtND-apvi~~~-~~g~v~ED 97 (99)
T TIGR01965 61 FTVTSADG-----TSQTVTITITGAND-AAVIGGA-DTGSVTED 97 (99)
T ss_pred EEEEEeCC-----CeEEEEEEEEccCC-CCEEecc-cceeEecC
Confidence 88888886 27789999999999 8876543 24666665
No 20
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=90.13 E-value=8.4 Score=30.80 Aligned_cols=122 Identities=21% Similarity=0.279 Sum_probs=62.9
Q ss_pred ccEEEEEE-EEEeCCCCceEEEEEEEEEEecCCCCCccCCCceEEEEeCCCCC-CcEEEEEEEEeCCCCCCceEEEEEec
Q psy7563 125 VQEYTLTV-RATDSVSGHSAEVFVSVRVTDVNDCPPEFPQDWYNVSISEAAPF-GTLILKVRALDNDSGKQKISSTAQIA 202 (274)
Q Consensus 125 ~~~~~l~v-~a~d~~~~~~~~~~v~I~v~dvNdn~P~f~~~~~~~~v~E~~~~-g~~v~~v~A~D~D~g~n~~i~Y~i~~ 202 (274)
...|.+.+ .|.|.... ..+..+...+. +...+|.+.- .....+..+... |..=.++.++|.-.+. .-...+|.+
T Consensus 14 dG~Y~l~~~~a~D~agN-~~~~~~~~~~~-iD~T~Ptisi-~~~~~~~~g~~v~~~~~i~i~~tD~~~~~-~i~sv~l~G 89 (158)
T PF13750_consen 14 DGSYTLTVVTATDAAGN-TSTSTVSETFT-IDNTPPTISI-SDGASVANGSTVYGLVNISINVTDNSDDS-KITSVSLTG 89 (158)
T ss_pred CccEEEEEEEEEecCCC-EEEEEEeeEEE-EcCCCCEEEE-ecCCccCCCccccceeeeEEEEEeCCCCc-eEEEEEEEC
Confidence 35799999 79998433 22322332222 2445787743 111122222222 2222568888877654 445577766
Q ss_pred CCCCcceEEe---ccceEEEEeccCCCCcccCCCcEEEEEEEEeCCCCCcee
Q psy7563 203 HSFLEIFFLS---LHRYMFTIGKGFFPERELLQPRCYRSEIEDRRRTGEEKA 251 (274)
Q Consensus 203 ~~~~~~F~i~---~~~g~~~i~~~~~ld~e~~~~~~y~~~v~a~d~~g~~~~ 251 (274)
|.....-.+. ...|.+.+.=...+ ........|.|+|.|.|..|-...
T Consensus 90 g~~~d~v~ls~~~~~~~~~~~~yp~~f-psle~~~~YtLtV~a~D~aGN~~~ 140 (158)
T PF13750_consen 90 GPASDSVSLSWTNKGNGVYTLEYPRIF-PSLEADDSYTLTVSATDKAGNQST 140 (158)
T ss_pred CcccceEEEeeEeccCceEEeeccccc-CCcCCCCeEEEEEEEEecCCCEEE
Confidence 5444333322 22333333211111 011125579999999999997665
No 21
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=87.61 E-value=37 Score=34.96 Aligned_cols=157 Identities=15% Similarity=0.185 Sum_probs=79.5
Q ss_pred cCCCCeeeCCceEEEEeCCCCCCCeEEEEEEEeCCC---CCeEEEEEecCCCCC--cEEEeCCCceEEeceeeEEEeecC
Q psy7563 25 DKSMPVFDKQFYSVSVREDIQIHSPLPLTIRAESPL---NRKLIYSITGGNSRE--QFAVDFNTGLYSLNRKLIYSITGG 99 (274)
Q Consensus 25 Ndn~P~F~~~~y~~~v~E~~~~g~~v~~~v~a~D~d---~~~i~y~i~~~~~~~--~F~i~~~tg~v~~~~~~~~~~~~~ 99 (274)
||..+.|....-...|.||. |+.- .+|.-...+ .-.+.|+...|.+.. -|. +..| .|.|...+
T Consensus 396 dd~~s~i~Fe~~~Y~V~En~--GtV~-VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~--~~sG------TLtF~PGE- 463 (928)
T TIGR00845 396 NDPVSKIFFEPGHYTCLENC--GTVA-LTVVRRGGDLTNTVYVDYRTEDGTANAGSDYE--FTEG------TLVFKPGE- 463 (928)
T ss_pred cCCcceEEecCCeEEEeecC--cEEE-EEEEEccCCCCceEEEEEEccCCccCCCCCcc--ccCc------eEEECCCc-
Confidence 56666766666667888985 5554 555444323 356778876654321 121 1233 12222111
Q ss_pred CCccceEEeCceeEEEECCCCCcCcccEEEEEEEEEeCC-------------CC-ceEEEEEEEEEEecCCCCCccCCCc
Q psy7563 100 NSREQFAVDFNTGSITVVDELDFETVQEYTLTVRATDSV-------------SG-HSAEVFVSVRVTDVNDCPPEFPQDW 165 (274)
Q Consensus 100 ~~~~~f~i~~~tg~l~~~~~lD~E~~~~~~l~v~a~d~~-------------~~-~~~~~~v~I~v~dvNdn~P~f~~~~ 165 (274)
....+ .|.+...=-+|....|.+.+.--..+ .. .......+|+|.| ||++|.|....
T Consensus 464 -t~KtI-------tV~IIDDdi~E~DE~F~V~LSNp~~g~~~G~~~~~~~~~~A~Lg~ps~ATVTIlD-DD~aGIfsFe~ 534 (928)
T TIGR00845 464 -TQKEF-------RIGIIDDDIFEEDEHFYVRLSNLRVGSEDGILEANHVSAVAQLASPNTATVTILD-DDHAGIFTFEE 534 (928)
T ss_pred -eEEEE-------EEEEccCCCCCCCceEEEEEeCCCCCCcccccccccccccceecCCceEEEEEec-CcccCcccccC
Confidence 11222 22333322356666666665432211 01 1123356677777 78899877655
Q ss_pred eEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCC
Q psy7563 166 YNVSISEAAPFGTLILKVRALDNDSGKQKISSTAQIAHSF 205 (274)
Q Consensus 166 ~~~~v~E~~~~g~~v~~v~A~D~D~g~n~~i~Y~i~~~~~ 205 (274)
-...|.|+. |..-.+|.-+-.-. ..-.+.|+-.+|+.
T Consensus 535 ~~~sV~Es~--G~vtvtV~RtsGa~-G~VtV~Y~T~dGTA 571 (928)
T TIGR00845 535 DVFHVSESI--GIMEVKVLRTSGAR-GTVIVPYRTVEGTA 571 (928)
T ss_pred ceEEEEcCC--CEEEEEEEEcCCCC-eeEEEEEEeecCcc
Confidence 567888874 55544544331111 12346677766644
No 22
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=79.00 E-value=12 Score=23.44 Aligned_cols=36 Identities=14% Similarity=0.095 Sum_probs=25.2
Q ss_pred cceEEeccceEEEEeccCCCCcccCCCcEEEEEEEEeCCCC
Q psy7563 207 EIFFLSLHRYMFTIGKGFFPERELLQPRCYRSEIEDRRRTG 247 (274)
Q Consensus 207 ~~F~i~~~~g~~~i~~~~~ld~e~~~~~~y~~~v~a~d~~g 247 (274)
....+++.+|.+.-... ...+...|.+.|.++|..|
T Consensus 14 ~gLs~d~~tG~isGtp~-----~~~~~G~y~~~vtatd~~G 49 (49)
T PF05345_consen 14 SGLSLDPSTGTISGTPT-----SSVQPGTYTFTVTATDGSG 49 (49)
T ss_pred CcEEEeCCCCEEEeecC-----CCccccEEEEEEEEEcCCC
Confidence 46778888998875541 1111357999999999876
No 23
>KOG3597|consensus
Probab=68.31 E-value=1.1e+02 Score=28.88 Aligned_cols=156 Identities=16% Similarity=0.175 Sum_probs=79.2
Q ss_pred ccCCCCeeeCCceEEEEeCCCCCCCeEEEEEEEeCCC--CCeEEEEEecCCC----CCcEEEeCCCceEEeceeeEEEee
Q psy7563 24 IDKSMPVFDKQFYSVSVREDIQIHSPLPLTIRAESPL--NRKLIYSITGGNS----REQFAVDFNTGLYSLNRKLIYSIT 97 (274)
Q Consensus 24 ~Ndn~P~F~~~~y~~~v~E~~~~g~~v~~~v~a~D~d--~~~i~y~i~~~~~----~~~F~i~~~tg~v~~~~~~~~~~~ 97 (274)
+||.+..+-...+...+.|+...-... ..+.+.|+| -..+.|++..... ...|...
T Consensus 35 vndpp~~~~~~~~~l~~~~~~~k~l~~-~~l~~~d~d~~~~~l~f~v~~t~~~~~~~~~~~~~----------------- 96 (442)
T KOG3597|consen 35 VNDPPSLIFPSGSLLVILEGGQKVLDP-ELLTAADPDSAPLPLEFQVLGTSSVPLPVLKFDVP----------------- 96 (442)
T ss_pred cCCCcceeecccceEEeecCCceeccc-eEeeccCCCCCccceEEEEccCCCCCCccceeecc-----------------
Confidence 599888888777778888876544344 578888888 5677888875211 1122222
Q ss_pred cCCCccceEEe-CceeEEEECCCCCcCcccEEEEEEEEEeCCCCceEEEEEEEEEEecCCCCCccCCCceEEEEeCCCCC
Q psy7563 98 GGNSREQFAVD-FNTGSITVVDELDFETVQEYTLTVRATDSVSGHSAEVFVSVRVTDVNDCPPEFPQDWYNVSISEAAPF 176 (274)
Q Consensus 98 ~~~~~~~f~i~-~~tg~l~~~~~lD~E~~~~~~l~v~a~d~~~~~~~~~~v~I~v~dvNdn~P~f~~~~~~~~v~E~~~~ 176 (274)
+...+.|+-. -..|.+.....= .+.....+...++|.- ..+. .+..... ..-..|.+... ....|.-+..+
T Consensus 97 -g~~~~~Fs~~~v~~g~~~yvh~g--~el~~~~~~~~~SDg~--~~S~-~~i~~~~-~~~~~~~~~~~-~gL~v~~gS~~ 168 (442)
T KOG3597|consen 97 -GAPATEFSYEEVEDGSLSYVHSG--TELRESELQLRVSDGL--LVSE-RAILKVE-ATGPAPHLARN-TGLKVLQGSTA 168 (442)
T ss_pred -CCcccceEehHhhcCceeEEecC--cccccceEEEEeecce--Eeee-eEEeccc-CCCcceeeecc-cceEEccCccc
Confidence 2333444431 133344332210 1345667788888872 1111 1111111 11112222221 12333333221
Q ss_pred CcEEEEEEEEeCCCCCCceEEEEEecCCC
Q psy7563 177 GTLILKVRALDNDSGKQKISSTAQIAHSF 205 (274)
Q Consensus 177 g~~v~~v~A~D~D~g~n~~i~Y~i~~~~~ 205 (274)
--.-.-+.+.|.|.++.-.+.|.|.....
T Consensus 169 ~IT~~~L~ved~d~~~d~~v~~~i~~~P~ 197 (442)
T KOG3597|consen 169 PITPSNLSVEDNDSSPDDEVRYDITPPPQ 197 (442)
T ss_pred cccHhHceeecCCCCCCcEEEEEecCCCC
Confidence 11113577888887666778898876543
No 24
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=66.76 E-value=48 Score=24.29 Aligned_cols=89 Identities=11% Similarity=-0.011 Sum_probs=46.8
Q ss_pred EEEEEEeCCCCCCceEEEEEecC-CCCcceEEeccceEEEEec-cC--CCCcccC-CCcEEEEEEEEeCCCCCcee--cc
Q psy7563 181 LKVRALDNDSGKQKISSTAQIAH-SFLEIFFLSLHRYMFTIGK-GF--FPERELL-QPRCYRSEIEDRRRTGEEKA--SH 253 (274)
Q Consensus 181 ~~v~A~D~D~g~n~~i~Y~i~~~-~~~~~F~i~~~~g~~~i~~-~~--~ld~e~~-~~~~y~~~v~a~d~~g~~~~--~~ 253 (274)
+++.++|+|.++. .++++... ..-+.|.|+++ |.++... .. ..+.=.. +...-.+++...|+.-+.+. ..
T Consensus 2 G~Lt~sD~D~gd~--~~~s~~~~~g~yGtlti~~~-G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DGtt~~vtItI~ 78 (99)
T TIGR01965 2 GQLTISDADAGQA--HFIAQTDAAGQYGTFSIDAD-GQWTYQADNSQTAVQALKAGETLTDTFTVTSADGTSQTVTITIT 78 (99)
T ss_pred CceEEeCCCCCCc--eEEecccccCCcEEEEECCC-CcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCCCeEEEEEEEE
Confidence 4688999998654 56665322 23467888874 5444332 11 1111000 12235566666775433333 33
Q ss_pred CCCCCceEEeeCCCceeee
Q psy7563 254 SENKPYLFTINFATAGIES 272 (274)
Q Consensus 254 ~~n~~~~f~i~~~tg~i~~ 272 (274)
..|..|.......-.++|+
T Consensus 79 GtNDapvi~~~~~g~v~ED 97 (99)
T TIGR01965 79 GANDAAVIGGADTGSVTED 97 (99)
T ss_pred ccCCCCEEecccceeEecC
Confidence 6677777766554445554
No 25
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=64.86 E-value=65 Score=25.10 Aligned_cols=56 Identities=20% Similarity=0.274 Sum_probs=37.2
Q ss_pred EEEECCCCCcCc---ccEEEEEEEEEeCCCCceEEEEEEEEEEecCCCCCccCCCceEEEEeCC
Q psy7563 113 SITVVDELDFET---VQEYTLTVRATDSVSGHSAEVFVSVRVTDVNDCPPEFPQDWYNVSISEA 173 (274)
Q Consensus 113 ~l~~~~~lD~E~---~~~~~l~v~a~d~~~~~~~~~~v~I~v~dvNdn~P~f~~~~~~~~v~E~ 173 (274)
...+..+||... .-...|.|.|+|...-. ++..+.|+|.| | .|...... ..+|.|.
T Consensus 69 tftL~~~lDH~~g~d~l~l~~~v~a~D~DGD~-s~~~l~VtI~D--D-~P~~~~~~-~~~V~E~ 127 (137)
T TIGR03660 69 EFTLEGPLDHAAGSDELTLNFPIIATDFDGDT-SSITLPVTIVD--D-VPTITDVD-ALTVDED 127 (137)
T ss_pred EEEEcccccCCCCCceEEEeeeEEEEeCCCCc-cccEEEEEEEC--C-CCeecccc-ceEEecc
Confidence 567777887643 33577889999984332 23578888877 5 57765543 3678774
No 26
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=60.52 E-value=60 Score=23.24 Aligned_cols=56 Identities=5% Similarity=0.014 Sum_probs=36.2
Q ss_pred EEeCCCCCCceEEEEEecCC---CCcceEEeccceEEEEeccCCCCcccCCCcEEEEEEEEeCCCCCc
Q psy7563 185 ALDNDSGKQKISSTAQIAHS---FLEIFFLSLHRYMFTIGKGFFPERELLQPRCYRSEIEDRRRTGEE 249 (274)
Q Consensus 185 A~D~D~g~n~~i~Y~i~~~~---~~~~F~i~~~~g~~~i~~~~~ld~e~~~~~~y~~~v~a~d~~g~~ 249 (274)
..|+| ...++|++...+ -..+...++.++.+.-.+ .... ...|.+.|.|+|+.|..
T Consensus 24 F~d~d---~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~GtP---~~~~---~g~~~i~v~a~D~~g~~ 82 (97)
T smart00736 24 FTDAD---GDTLTYSATLSDGSALPSWLSFDSDTGTLSGTP---TNSD---VGSLSLKVTATDSSGAS 82 (97)
T ss_pred eECCC---CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEEC---CCCC---CcEEEEEEEEEECCCCE
Confidence 46776 356899885432 235777888777666432 2222 24599999999998833
No 27
>KOG3597|consensus
Probab=49.94 E-value=76 Score=29.87 Aligned_cols=60 Identities=18% Similarity=0.014 Sum_probs=44.3
Q ss_pred EEEEEEEEEEecCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecC
Q psy7563 143 AEVFVSVRVTDVNDCPPEFPQDWYNVSISEAAPFGTLILKVRALDNDSGKQKISSTAQIAH 203 (274)
Q Consensus 143 ~~~~v~I~v~dvNdn~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~g~n~~i~Y~i~~~ 203 (274)
.+....|.|..+||.+..+....+.+-+.|+...-.....+.+.|+|..+- .+.|+++..
T Consensus 24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~~-~l~f~v~~t 83 (442)
T KOG3597|consen 24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAPL-PLEFQVLGT 83 (442)
T ss_pred EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCcc-ceEEEEccC
Confidence 455688999999997776666666788888865545556788999998653 377887664
No 28
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=48.33 E-value=53 Score=21.28 Aligned_cols=22 Identities=23% Similarity=0.359 Sum_probs=15.1
Q ss_pred ccCCCcEEEEEEEEeCCCCCce
Q psy7563 229 ELLQPRCYRSEIEDRRRTGEEK 250 (274)
Q Consensus 229 e~~~~~~y~~~v~a~d~~g~~~ 250 (274)
..++...|.|.|.+.|..|...
T Consensus 34 ~~L~~G~Y~l~V~a~~~~~~~~ 55 (66)
T PF07495_consen 34 TNLPPGKYTLEVRAKDNNGKWS 55 (66)
T ss_dssp ES--SEEEEEEEEEEETTS-B-
T ss_pred EeCCCEEEEEEEEEECCCCCcC
Confidence 3345778999999999888764
No 29
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=44.46 E-value=2.1e+02 Score=25.91 Aligned_cols=114 Identities=11% Similarity=0.043 Sum_probs=56.3
Q ss_pred EEEEEEEecCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecC-CCCcceEEeccceEEEEeccC
Q psy7563 146 FVSVRVTDVNDCPPEFPQDWYNVSISEAAPFGTLILKVRALDNDSGKQKISSTAQIAH-SFLEIFFLSLHRYMFTIGKGF 224 (274)
Q Consensus 146 ~v~I~v~dvNdn~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~g~n~~i~Y~i~~~-~~~~~F~i~~~~g~~~i~~~~ 224 (274)
.=+|.|...|.+.=.|..-.....++|+-.--...+.|. .. ++++.-|.--.+ +.-..|+|++.+|.+.+....
T Consensus 212 ~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIh-is----~dGrFLYasNRg~dsI~~f~V~~~~g~L~~~~~~ 286 (346)
T COG2706 212 NSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIH-IS----PDGRFLYASNRGHDSIAVFSVDPDGGKLELVGIT 286 (346)
T ss_pred CCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEE-EC----CCCCEEEEecCCCCeEEEEEEcCCCCEEEEEEEe
Confidence 334555555554344444334444444422111112222 11 235555654332 444799999999988876633
Q ss_pred CCCcccCCCcEEEEEEEEeCCCCCcee-c-cCCCCCceEEeeCCCceee
Q psy7563 225 FPERELLQPRCYRSEIEDRRRTGEEKA-S-HSENKPYLFTINFATAGIE 271 (274)
Q Consensus 225 ~ld~e~~~~~~y~~~v~a~d~~g~~~~-~-~~~n~~~~f~i~~~tg~i~ 271 (274)
...-.. ++.|.+ +..|.... . ...+.--.|.+++.||.+.
T Consensus 287 ~teg~~--PR~F~i-----~~~g~~Liaa~q~sd~i~vf~~d~~TG~L~ 328 (346)
T COG2706 287 PTEGQF--PRDFNI-----NPSGRFLIAANQKSDNITVFERDKETGRLT 328 (346)
T ss_pred ccCCcC--Ccccee-----CCCCCEEEEEccCCCcEEEEEEcCCCceEE
Confidence 322221 222322 12222222 2 2334568899999999764
No 30
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=39.83 E-value=2.9e+02 Score=25.07 Aligned_cols=67 Identities=22% Similarity=0.308 Sum_probs=30.8
Q ss_pred EEEEeccCCCCeeeCCceEEEEeCCCCCCCeEEEEEEEeCCCCCeEEEEEecCC-CCCcEEEeCCCceEEe
Q psy7563 19 VNVKVIDKSMPVFDKQFYSVSVREDIQIHSPLPLTIRAESPLNRKLIYSITGGN-SREQFAVDFNTGLYSL 88 (274)
Q Consensus 19 v~I~v~Ndn~P~F~~~~y~~~v~E~~~~g~~v~~~v~a~D~d~~~i~y~i~~~~-~~~~F~i~~~tg~v~~ 88 (274)
|.|.-.|.+.=+|..-.....++|+-. |..= +.--...+ +++.-|---.+. .-..|+|++.+|.+.+
T Consensus 215 V~v~~y~~~~g~~~~lQ~i~tlP~dF~-g~~~-~aaIhis~-dGrFLYasNRg~dsI~~f~V~~~~g~L~~ 282 (346)
T COG2706 215 VDVLEYNPAVGKFEELQTIDTLPEDFT-GTNW-AAAIHISP-DGRFLYASNRGHDSIAVFSVDPDGGKLEL 282 (346)
T ss_pred EEEEEEcCCCceEEEeeeeccCccccC-CCCc-eeEEEECC-CCCEEEEecCCCCeEEEEEEcCCCCEEEE
Confidence 333334554445555544455666543 2222 11112223 344444433332 2246888888885443
No 31
>PF03160 Calx-beta: Calx-beta domain; InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet [, ].; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=38.03 E-value=1.5e+02 Score=21.06 Aligned_cols=54 Identities=17% Similarity=0.203 Sum_probs=32.2
Q ss_pred EEEEEecCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCCCceEEEEEecCCC
Q psy7563 148 SVRVTDVNDCPPEFPQDWYNVSISEAAPFGTLILKVRALDNDSGKQKISSTAQIAHSF 205 (274)
Q Consensus 148 ~I~v~dvNdn~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~g~n~~i~Y~i~~~~~ 205 (274)
+|.|.| ||.+ .+....-...+.|+. |..-..|....++....-.+.|...++..
T Consensus 2 tvtI~d-~d~~-~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~v~v~~~~~~gtA 55 (100)
T PF03160_consen 2 TVTILD-DDDP-TVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGPVTVNYSTVDGTA 55 (100)
T ss_dssp EEEEE--TTSE-EEEESSSEEEEETTS--SEEEEEEEEESS-TSSEEEEEEEEEESSS
T ss_pred EEEEEC-CCCC-EEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcceEEEEEEeCCcc
Confidence 567778 6644 766555567788875 44556666565543334557787777643
No 32
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=34.65 E-value=1.1e+02 Score=23.72 Aligned_cols=39 Identities=21% Similarity=0.330 Sum_probs=25.0
Q ss_pred CeEEEEECCCCCceEEEEEEEEeccCCCCeeeCCceEEEEeCC
Q psy7563 1 MPSSILSSGIQPCSTEVSVNVKVIDKSMPVFDKQFYSVSVRED 43 (274)
Q Consensus 1 l~v~A~D~g~~~~~~~~~v~I~v~Ndn~P~F~~~~y~~~v~E~ 43 (274)
|.|.|+|..+-..+ ..+.|.+.+|. |.-.... ...|.|+
T Consensus 89 ~~v~a~D~DGD~s~--~~l~VtI~DD~-P~~~~~~-~~~V~E~ 127 (137)
T TIGR03660 89 FPIIATDFDGDTSS--ITLPVTIVDDV-PTITDVD-ALTVDED 127 (137)
T ss_pred eeEEEEeCCCCccc--cEEEEEEECCC-Ceecccc-ceEEecc
Confidence 56889998764333 46666666665 6655443 3678884
No 33
>PF05895 DUF859: Siphovirus protein of unknown function (DUF859); InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=31.93 E-value=5.2e+02 Score=25.66 Aligned_cols=33 Identities=15% Similarity=0.193 Sum_probs=24.9
Q ss_pred CeEEEEECCCCCceEEEEEEEEeccCCCCeeeCC
Q psy7563 1 MPSSILSSGIQPCSTEVSVNVKVIDKSMPVFDKQ 34 (274)
Q Consensus 1 l~v~A~D~g~~~~~~~~~v~I~v~Ndn~P~F~~~ 34 (274)
+++.++|..+ ..+...+++|+|+.=++|.+...
T Consensus 302 i~atVtDSRG-r~S~~~~~tItVl~Y~~P~lsfs 334 (624)
T PF05895_consen 302 IRATVTDSRG-RTSDPKTKTITVLEYSPPTLSFS 334 (624)
T ss_pred EEEEEEECCC-ccCCceEEEEEEEEcCCCcEEEE
Confidence 4678899987 44557888899998889988543
No 34
>KOG4221|consensus
Probab=29.87 E-value=7.4e+02 Score=26.80 Aligned_cols=45 Identities=27% Similarity=0.408 Sum_probs=27.8
Q ss_pred EEEECCCCCcCcccEEEEEEEEEeCCCC-ceEEEEEEEEEEecCCCCCc
Q psy7563 113 SITVVDELDFETVQEYTLTVRATDSVSG-HSAEVFVSVRVTDVNDCPPE 160 (274)
Q Consensus 113 ~l~~~~~lD~E~~~~~~l~v~a~d~~~~-~~~~~~v~I~v~dvNdn~P~ 160 (274)
.+++. +.|...+|.+.|.|....++ .++...-.++..|+-+-||.
T Consensus 575 e~ti~---gL~k~TeY~~~vvA~N~~G~g~sS~~i~V~Tlsd~PsaPP~ 620 (1381)
T KOG4221|consen 575 EYTIN---GLEKYTEYSIRVVAYNSAGSGVSSADITVRTLSDVPSAPPQ 620 (1381)
T ss_pred EEEee---cCCCccceEEEEEEecCCCCCCCCCceEEEeccCCCCCCCc
Confidence 55554 45677899999999998443 43333333444566665554
No 35
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=27.96 E-value=1.6e+02 Score=19.63 Aligned_cols=29 Identities=34% Similarity=0.466 Sum_probs=20.6
Q ss_pred CcCcccEEEEEEEEEeCCCCceEEEEEEEEE
Q psy7563 121 DFETVQEYTLTVRATDSVSGHSAEVFVSVRV 151 (274)
Q Consensus 121 D~E~~~~~~l~v~a~d~~~~~~~~~~v~I~v 151 (274)
-|+....|.+.+.+.|... +.+..+.|.|
T Consensus 50 ~y~~~G~y~v~l~v~n~~g--~~~~~~~i~v 78 (79)
T smart00089 50 TYTKPGTYTVTLTVTNAVG--SASATVTVVV 78 (79)
T ss_pred EeCCCcEEEEEEEEEcCCC--cEEEEEEEEE
Confidence 4567778999999999854 4455555554
No 36
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=24.31 E-value=2.1e+02 Score=19.14 Aligned_cols=28 Identities=29% Similarity=0.457 Sum_probs=18.8
Q ss_pred CcCcccEEEEEEEEEeCCCCceEEEEEEE
Q psy7563 121 DFETVQEYTLTVRATDSVSGHSAEVFVSV 149 (274)
Q Consensus 121 D~E~~~~~~l~v~a~d~~~~~~~~~~v~I 149 (274)
.|.....|.+++.++|.. ..+.+..+.|
T Consensus 52 ~y~~~G~y~v~l~v~d~~-g~~~~~~~~V 79 (81)
T cd00146 52 TYTKPGTYTVTLTVTNAV-GSSSTKTTTV 79 (81)
T ss_pred EcCCCcEEEEEEEEEeCC-CCEEEEEEEE
Confidence 366778999999999984 3333334444
No 37
>PF14339 DUF4394: Domain of unknown function (DUF4394)
Probab=22.36 E-value=5e+02 Score=22.26 Aligned_cols=155 Identities=13% Similarity=0.119 Sum_probs=81.1
Q ss_pred eEEEEEecCCCCCcEEEeCCCceEEec--eee-----------EEE------eecCCCccceEEeCceeEEE-ECCCCCc
Q psy7563 63 KLIYSITGGNSREQFAVDFNTGLYSLN--RKL-----------IYS------ITGGNSREQFAVDFNTGSIT-VVDELDF 122 (274)
Q Consensus 63 ~i~y~i~~~~~~~~F~i~~~tg~v~~~--~~~-----------~~~------~~~~~~~~~f~i~~~tg~l~-~~~~lD~ 122 (274)
...|-+ ++....+.||+.+|.-+.- .++ ||- ..-.+...-+++++.+|.+. ...+|-|
T Consensus 39 G~LYgl--~~~g~lYtIn~~tG~aT~vg~s~~~~al~g~~~gvDFNP~aDRlRvvs~~GqNlR~npdtGav~~~Dg~L~y 116 (236)
T PF14339_consen 39 GQLYGL--GSTGRLYTINPATGAATPVGASPLTVALSGTAFGVDFNPAADRLRVVSNTGQNLRLNPDTGAVTIVDGNLAY 116 (236)
T ss_pred CCEEEE--eCCCcEEEEECCCCeEEEeecccccccccCceEEEecCcccCcEEEEccCCcEEEECCCCCCceeccCcccc
Confidence 344666 3445799999999975543 222 221 11256667799999999865 4457777
Q ss_pred Cccc-----EEEEEEEEEeCCC--CceEEEEEEEEEEecCCCCCccCCCceEEEEeCCCCCCcEEEEEEEEeCCCCC-Cc
Q psy7563 123 ETVQ-----EYTLTVRATDSVS--GHSAEVFVSVRVTDVNDCPPEFPQDWYNVSISEAAPFGTLILKVRALDNDSGK-QK 194 (274)
Q Consensus 123 E~~~-----~~~l~v~a~d~~~--~~~~~~~v~I~v~dvNdn~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~g~-n~ 194 (274)
.... .-.+.-.|.-..- +..++ ++.-+|.+-..-..+.+.-..+...--+.|--+....+.|.+... ..
T Consensus 117 ~~gd~~~G~~p~v~aaAYTNs~~g~~t~T---tLy~ID~~~~~Lv~Q~ppN~GtL~~vG~LGvd~~~~~gFDI~~~~~~~ 193 (236)
T PF14339_consen 117 AAGDMNAGTTPGVTAAAYTNSFAGATTST---TLYDIDTTLDALVTQNPPNDGTLNTVGPLGVDAAGDAGFDIAGDGNGG 193 (236)
T ss_pred CCCccccCCCCceEEEEEecccCCCccce---EEEEEecCCCeEEEecCCCCCcEEeeeccccccCcccceeeecCCCcc
Confidence 6432 3345555554421 12122 222334443333333222223333323344444445555555422 23
Q ss_pred eEEEEEecCCCCcceEEeccceEEEEec
Q psy7563 195 ISSTAQIAHSFLEIFFLSLHRYMFTIGK 222 (274)
Q Consensus 195 ~i~Y~i~~~~~~~~F~i~~~~g~~~i~~ 222 (274)
..-|.........+++|+..+|..+...
T Consensus 194 ~~a~a~~~~~~~~LY~vdL~TG~at~~g 221 (236)
T PF14339_consen 194 NAAYAVLGVGGSGLYTVDLTTGAATLVG 221 (236)
T ss_pred eEEEEEecCCCcEEEEEECCCcccEEee
Confidence 3445555433367899998888766543
No 38
>PF09100 Qn_am_d_aIV: Quinohemoprotein amine dehydrogenase, alpha subunit domain IV; InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined []. ; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=22.20 E-value=2.6e+02 Score=21.51 Aligned_cols=31 Identities=35% Similarity=0.386 Sum_probs=15.4
Q ss_pred EEEEEEEE--eCCCCceEEEEEEEEEEecCCCCC
Q psy7563 128 YTLTVRAT--DSVSGHSAEVFVSVRVTDVNDCPP 159 (274)
Q Consensus 128 ~~l~v~a~--d~~~~~~~~~~v~I~v~dvNdn~P 159 (274)
=.|.|.|+ |.+.+++.+..+.|+|..-|+ +|
T Consensus 100 Gnl~VvAtv~d~~~~l~~e~~liVtVqr~~~-pp 132 (133)
T PF09100_consen 100 GNLKVVATVKDGGKPLTGEAHLIVTVQRWNN-PP 132 (133)
T ss_dssp EEEEEEEEETTTT---EEEEEEEEE---S----S
T ss_pred ccEEEEEEEccCCcccceeEeEEEEeecccC-CC
Confidence 34555554 455568899999999988777 55
No 39
>COG5584 Predicted small secreted protein [Function unknown]
Probab=20.87 E-value=96 Score=22.58 Aligned_cols=27 Identities=22% Similarity=0.251 Sum_probs=20.2
Q ss_pred CCceecc-CCCCCceEEeeCCCceeeec
Q psy7563 247 GEEKASH-SENKPYLFTINFATAGIESA 273 (274)
Q Consensus 247 g~~~~~~-~~n~~~~f~i~~~tg~i~~~ 273 (274)
|.-.++. ..+.++-|.+|++||+|.+.
T Consensus 73 gGIT~i~~ge~~qyef~aDA~TGevidV 100 (103)
T COG5584 73 GGITTIKEGENIQYEFYADANTGEVIDV 100 (103)
T ss_pred cceeEecCCCceEEEEEEecCCceEEEE
Confidence 4444444 45888999999999998764
Done!