Jump to navigation
Jump to search
PangenomeTIGR4
serotype 4
D39serotype 2
D39Vserotype 2
Hungary19A-6serotype 19A
EF3030serotype 19F
670-6Bserotype 6B
6A-10serotype 6A
70585serotype 5
A026serotype 19F
A66serotype 3
AP200serotype 11A
ASP0581serotype 12F
ATCC 49619serotype 19F
ATCC 700669serotype 23F
BM6001serotype 19F
BVJ1JLserotype 1
CGSP14serotype 14
G54serotype 19F
HU-OHserotype 3
Hu15serotype 19A
Hu17serotype 19A
INV104serotype 1
INV200serotype 14
JJAserotype 14
MDRSPN001serotype 19F
NCTC7465serotype 1
NCTC7466serotype 2
NU83127serotype 4
OXC141serotype 3
P1031serotype 1
R6serotype 2
SP49serotype 19A
SPN032672serotype 1
SPN034156serotype 3
SPN034183serotype 3
SPN994038serotype 3
SPN994039serotype 3
SPNA45serotype 3
ST556serotype 19F
TCH8431/19Aserotype 19A
Taiwan19F-14serotype 19F
Xen35serotype 4
gamPNI0373serotype 1
NCBI: 03-APR-2017
⊟Summary[edit | edit source]
- organism: Streptococcus pneumoniae Hu17
- locus tag: SPNHU17_00432 [new locus tag: SPNHU17_RS02005 ]
- pan locus tag?: PNEUPAN001202000
- symbol: SPNHU17_00432
- pan gene symbol?: eng
- synonym:
- product: endo-alpha-N-acetylgalactosaminidase
⊟Genome View[edit | edit source]
⊟Gene[edit | edit source]
⊟General[edit | edit source]
- type: CDS
- locus tag: SPNHU17_00432 [new locus tag: SPNHU17_RS02005 ]
- symbol: SPNHU17_00432
- product: endo-alpha-N-acetylgalactosaminidase
- replicon: chromosome
- strand: +
- coordinates: 382929..388160
- length: 5232
- essential: unknown other strains
⊟Accession numbers[edit | edit source]
- Location: CP020549 (382929..388160) NCBI
- BioCyc: see SPNHU17_RS02005
- MicrobesOnline:
- PneumoBrowse for strain D39V: SPV_0335 PneumoBrowse
⊟Phenotype[edit | edit source]
Share your knowledge and add information here. [edit]
⊟DNA sequence[edit | edit source]
- 1
61
121
181
241
301
361
421
481
541
601
661
721
781
841
901
961
1021
1081
1141
1201
1261
1321
1381
1441
1501
1561
1621
1681
1741
1801
1861
1921
1981
2041
2101
2161
2221
2281
2341
2401
2461
2521
2581
2641
2701
2761
2821
2881
2941
3001
3061
3121
3181
3241
3301
3361
3421
3481
3541
3601
3661
3721
3781
3841
3901
3961
4021
4081
4141
4201
4261
4321
4381
4441
4501
4561
4621
4681
4741
4801
4861
4921
4981
5041
5101
5161
5221ATGATTGGAGCTGCATTCTTTGGGACAAGTCCGGTTCTTGCAGATAGCGTGCAGTCTGGT
TCCACGGCGAACTTACCAGCTGATTTAGCTACTGCTCTTGCAACAGCAAAAGAGAATGAT
GGGCGTGATTTTGAAGCGCCTAAGGTGGGAGAAGACCAAGGTTCTCCAGAAGTTACAGAT
GGACCTAAGACAGAAGAAGAACTATTAGCACTTGAAAAAGAAAAACCGGCTGAAGAAAAA
CCAAAAGAGGATAAACCTGCAGCTGCTAAACCTGAAACACCTAAGACGGTAACCCCTGAA
TGGCAAACGGTAGCGAATAAAGAGCAACAAGGAACAGTCACTATCCGAGAAGAAAAAGGT
GTCCGCTACAACCAACTATCCTCAACTGCTCAAAATGATAACGCAGGCAAACCAGCCCTG
TTTGAAAAGAAGGGCTTGACCGTTGATGCCAATGGAAATGCAACTGTTGATTTAACCTTC
AAAGATGATTCTGAAAAGGGCAAATCACGCTTTGGTGTCTTTTTGAAATTTAAAGATACC
AAGAATAATGTTTTTGTCGGTTATGACAAGGATGGCTGGTTCTGGGAGTATAAATCTCCA
ACAACTAGCACTTGGTATAGAGGTAGTCGTGTTGCTGCTCCTGAAACAGGATCAACAAAC
CGTCTCTCTATCACTCTCAAGTCAGACGGTCAGCTAAATGCCAGCAATAATGATGTCAAT
CTCTTTGACACAGTGACTCTACCAGCTGCGGTCAATGACCATCTTAAAAATGAGAAGAAG
ATTCTTCTCAAGGCGGGCTCTTATGACGATGAGCGAACAGTTGTTAGCGTTAAAACGGAT
AACCAAGAGGGGGTAAAAACAGAGGATACCCCTGCTGAAAAAGAAACAGGTCCTGAAGTT
GATGATAGCAAGGTGACTTATGACACGATTCAGTCTAAGGTCCTCAAAGCAGTGATTGAC
CAAGCCTTCCCTCGTGTCAAGGAATACACCTTGAATGGACATACTTTGCCAGGACAGGTG
CAACAGTTCAACCAAGTCTTTATCAATAACCACCGAATCACCCCTGAAGTCACTTATAAG
AAAATCAATGAGACAACAGCAGAGTACTTGATGAAGCTTCGCGATGATGCTCACTTAATC
AATGCGGAAATGACAGTACGCTTGCAAGTTGTGGACAATCAATTGCACTTTGATGTGACC
AAGATTGTCAACCACAATCAAGTCACTCCAGGTCAAAAGATCGATGACGAAAGAAAACTA
CTTTCTTCTATTAGTTTCCTCGGCAATGCTTTAGTCTCTGTTTCTAGTGATCAAACTGGT
GCTAAGTTTGATGGGGCAACCATGTCAAACAATACGCATGTCAGCGGAGATGATCATATC
GATGTAACTAATCCAATGAAGGATTTGGCTAAGGGTTACATGTATGGATTTGTTTCTACA
GATAAGCTTGCTGCTGGTGTTTGGAGTAACTCTCAAAATAGCTATGGTGGTGGTTCGAAT
GACTGGACTCGTTTGACAGCTTATAAAGAAACAGTCGGAAATGCCAACTATGTAGGAATC
CACAGCTCTGAATGGCAATGGGAAAAAGCTTATAAGGGCATTGTTTTCCCAGAATACACG
AAGGAACTTCCAAGTGCTAAGGTTGTTATCACTGAAGATGCCAATGCAGACAAGAAAGTT
GATTGGCAAGATGGTGCCATTGCTTATCGTAGCATTATGAACAATCCTCAAGGTTGGGAA
AAAGTTAAGGATATCACAGCTTACCGTATCGCGATGAACTTTGGTTCTCAAGCACAAAAT
CCATTCCTTATGACCTTGGATGGTATCAAGAAAATCAATCTCCACACAGATGGTCTTGGG
CAAGGTGTTCTCCTTAAAGGATATGGTAGCGAAGGCCATGACTCTGGTCACTTGAACTAT
GCTGATATTGGTAAGCGTATCGGTGGTGTCGAAGACTTCAAGACCCTAATTGAGAAGGCT
AAGAAATATGGAGCTCATCTAGGTATCCACGTTAACGCTTCAGAAACTTATCCTGAGTCT
AAATACTTCAATGAAAAAATTCTCCGTAAGAATCCAGATGGAAGCTATAGCTATGGTTGG
AACTGGCTAGATCAAGGTATCAACATTGATGCTGCCTATGACCTAGCTCATGGTCGTTTG
GCACGTTGGGAAGATTTGAAGAAAAAACTTGGTGACGGTCTCGACTTTATCTATGTGGAC
GTTTGGGGTAATGGTCAATCAGGTGATAACGGTGCCTGGGCTACCCACGTTCTTGCTAAA
GAAATTAACAAACAAGGCTGGCGCTTTGCGATCGAGTGGGGCCATGGTGGTGAGTACGAC
TCTACCTTCCATCACTGGGCAGCTGACTTGACCTACGGTGGCTACACCAATAAAGGTATC
AACAGTGCCATCACCCGCTTTATCCGTAACCACCAAAAAGATGCTTGGGTTGGGGACTAC
AGAAGTTATGGTGGTGCAGCCAACTATCCACTGCTAGGTGGCTACAGCATGAAAGACTTT
GAAGGCTGGCAAGGAAGAAGTGACTACAATGGCTATGTAACCAACTTATTTGCCCATGAC
GTCATGACTAAGTACTTCCAACACTTCACTGTAAGTAAATGGGAAAATGGTACACCGGTG
ACTATGACCGATAACGGTAGCACCTATAAATGGACTCCAGAAATGCGAGTGGAATTGGTA
GATGCTGACAATAATAAAGTAGTTGTAACTCGTAAGTCAAATGATGTCAATAGTCCACAA
TATCGCGAACGTACAGTAACTCTCAACGGACGTGTCATCCAAGATGGTTCAGCTTACTTG
ACTCCTTGGAACTGGGATGCAAATGGTAAGAAACTTTCTACTGATAAGGAAAAGATGTAC
TACTTCAATACGCAGGCCGGTGCAACAACTTGGACCCTTCCAAGCGATTGGGCAAAGAGC
AAGGTTTACCTTTACAAGCTAACTGACCAAGGTAAGACAGAAGAGCAAGAACTAGCTGTA
AAAGATGGTAAAATTACCCTAGATCTTCTAGCAAATCAACCATACGTTCTCTATCGTTCG
AAACAAACCAATCCTGAAATGTCATGGAGTGAAGGCATGCACATCTATGACCAAGGATTT
AATAGCGGTACTTTGAAACATTGGACCATTTCAGGTGATGCTTCTAAGGCAGAAATTGTC
AAGTCTCAAGGGGCAAACGATATGCTTCGTATTCAAGGAAACAAAGAAAAAGTTAGTCTC
ACTCAGAAATTAACTGGCTTGAAACCAAATACTAAGTATGCCGTTTATGTCGGTGTCGAT
AACCGTAGTAATGCCAAGGCAAGTATCACTGTGAATACTGGTGAAAAAGAAGTGACTACT
TATACCAATAAGTCTCTCGCCCTCAACTATGTAAAGGCCTACGCCCACAATACACGTCGT
AACAATGCTACAGTTGACGATACAAGTTACTTCCAAAACATGTACGCCTTCTTTACAACT
GGATCGGACGTCTCAAATGTTACTCTGACATTGAGTCGTGAAGCTGGTGATCAAGCAACT
TACTTTGATGAAATTCGTACCTTTGAAAACAATTCAAGCATGTATGGAGACAAGCATGAT
ACAGGTAAAGGCACCTTCAAGCAAGACTTTGAAAATGTTGCTCAGGGTATCTTCCCATTT
GTAGTGGGTGGTGTCGAAGGTGTCGAAGATAACCGCACTCACTTGTCTGAAAAACACGAT
CCATATACACAACGTGGTTGGAATGGTAAGAAAGTCGATGATGTTATCGAAGGAAATTGG
TCACTCAAGACAAATGGACTAGTGAGCCGTCGTAACTTGGTTTACCAAACTATTCCGCAA
AACTTCCGTTTTGAAGCAGGTAAGACCTACCGTGTAACCTTTGAATACGAAGCAGGTTCA
GACAATACCTATGCTTTTGTAGTCGGTAAGGGAAAATTCCAGTCAGGTCGTCGTGGTACT
CAAGCAAGCAACTTGGAAATGCATGAATTGCCAAATACTTGGACAGATTCTAAGAAAGCC
AAGAAGGCAACCTTCCTCGTGACAGGTGCAGAAACAGGGGATACTTGGGTAGGTATCTAC
TCAACTGGAAATGCAAGTAATACTCGTGGTGATTCTGGTGGAAATGCCAACTTCCGTGGT
TATAACGACTTCATGATGGATAATCTTCAAATCGAAGAAATTACCCTAACAGGTAAGATG
TTGACAGAAAATGCTCTGAAGAACTACTTGCCAACGGTTGCCATGACTAACTACACCAAA
GAGTCTATGGATGCTTTGAAAGAGGCGGTCTTTAACCTCAGTCAGGCCGATGATGATATC
AGTGTGGAAGAAGCGCGTGCAGAGATTGCCAAGATTGAAGCCTTGAAGAATGCTTTGGTT
CAGAAGAAAACGGCTTTGGTAGCAGATGACTTTGCAAGTCTTACAGCTCCTGCTCAGGCT
CAAGAAGGTCTTGCAAATGCCTTTGATGGAAACTTATCTAGTTTATGGCATACATCATGG
GGCGGAGGAGATGTAGGCAAGCCTGCAACCATGGTCTTGAAAGAACCAACTGAAATCACT
GGACTTCGTTACGTTCCACGTGGATCAGGTTCAAATGGTAACTTGCGTGATGTGAAACTT
GTTGTGACAGATGAGTCTGGCAAGGAGCATACCTTTACTGCAACTGATTGGCCAGATAAC
AATAAGCCAAAAGACATTGATTTTGGTAAGACAATTAAGGCTAAGAAAATTGTCCTTACA
GGTACTAAGACTTACGGAGATGGTGGCGATAAATACCAATCTGCAGCGGAACTCATCTTT
ACTCGTCCACAGGTAGCAGAAACACCTCTTGACTTGTCAGGCTATGAAGCAACTTTGGCT
AAAGCTCAGAAATTAACAGACAAAGACAATCAAGAGGAAGTAGCTAGTGTTCAGGCAAGC
ATGAAATATGCGACGGATAACCATCTCTTGACGGAAAGAATGGTGGAATACTTTGCAGAT
TATCTCAACCAATTAAAAGATTCTGCTACGAAACCAGATGCTCCAACTGTAGAGAAACCT
GAGTTTAAACTTAGCTCTATAGTTTCCGATCAAGGTAAGACGCCAGATTATAAGCAAGAA
ATAGCTAGACCAGAAACACCTGAACAAATCTTGCCAGCAACAGGTGAGAGTCAATCTGAC
ACATCCCTCTTCCTAGCAAGTGTTAGTCTAGCCCTATCTGCTCTCTTTGTAGTAAAAACG
AAGAAAGACTAG60
120
180
240
300
360
420
480
540
600
660
720
780
840
900
960
1020
1080
1140
1200
1260
1320
1380
1440
1500
1560
1620
1680
1740
1800
1860
1920
1980
2040
2100
2160
2220
2280
2340
2400
2460
2520
2580
2640
2700
2760
2820
2880
2940
3000
3060
3120
3180
3240
3300
3360
3420
3480
3540
3600
3660
3720
3780
3840
3900
3960
4020
4080
4140
4200
4260
4320
4380
4440
4500
4560
4620
4680
4740
4800
4860
4920
4980
5040
5100
5160
5220
5232
⊟Protein[edit | edit source]
⊟General[edit | edit source]
- locus tag: SPNHU17_00432 [new locus tag: SPNHU17_RS02005 ]
- symbol: SPNHU17_00432
- description: endo-alpha-N-acetylgalactosaminidase
- length: 1743
- theoretical pI: 5.71194
- theoretical MW: 193352
- GRAVY: -0.667355
⊟Function[edit | edit source]
- TIGRFAM: Cell envelope Other LPXTG cell wall anchor domain (TIGR01167; HMM-score: 19.6)
- TheSEED: data available for D39, Hungary19A-6, TIGR4
- PFAM: Glyco_hydro_tim (CL0058) Glyco_hydro_101; Endo-alpha-N-acetylgalactosaminidase (PF12905; HMM-score: 413.7)and 8 moreGal_mutarotase (CL0103) Gal_mutarotas_3; Galactose mutarotase-like fold domain (PF18080; HMM-score: 296.1)Concanavalin (CL0004) GalBD_like; Galactose-binding domain-like (PF17974; HMM-score: 295)GBD (CL0202) GH101_N; Endo-alpha-N-acetylgalactosaminidase N-terminal (PF17995; HMM-score: 273)GHD (CL0369) Glyco_hyd_101C; Glycosyl hydrolase 101 beta sandwich domain (PF17451; HMM-score: 145.5)GBD (CL0202) F5_F8_type_C; F5/8 type C domain (PF00754; HMM-score: 46.8)no clan defined Gram_pos_anchor; LPXTG cell wall anchor motif (PF00746; HMM-score: 16.7)WW (CL0680) WW; WW domain (PF00397; HMM-score: 15.3)E-set (CL0159) Pur_ac_phosph_N; Purple acid Phosphatase, N-terminal domain (PF16656; HMM-score: 9.1)
⊟Structure, modifications & cofactors[edit | edit source]
- domains:
- modifications:
- cofactors:
- effectors:
⊟Localization[edit | edit source]
- PSORTb: Cellwall
- Cytoplasmic Score: 0
- Cytoplasmic Membrane Score: 0
- Cellwall Score: 10
- Extracellular Score: 0
- Internal Helices: 0
- DeepLocPro: Cell wall & surface
- Cytoplasmic Score: 0.0003
- Cytoplasmic Membrane Score: 0.0126
- Cell wall & surface Score: 0.6484
- Extracellular Score: 0.3386
- SignalP: no predicted signal peptide
- SP(Sec/SPI): 0.188187
- TAT(Tat/SPI): 0.039169
- LIPO(Sec/SPII): 0.028437
- predicted transmembrane helices (TMHMM): 0
⊟Accession numbers[edit | edit source]
- GI:
- RefSeq: ARD34045 NCBI
- UniProt:
⊟Protein sequence[edit | edit source]
- MIGAAFFGTSPVLADSVQSGSTANLPADLATALATAKENDGRDFEAPKVGEDQGSPEVTDGPKTEEELLALEKEKPAEEKPKEDKPAAAKPETPKTVTPEWQTVANKEQQGTVTIREEKGVRYNQLSSTAQNDNAGKPALFEKKGLTVDANGNATVDLTFKDDSEKGKSRFGVFLKFKDTKNNVFVGYDKDGWFWEYKSPTTSTWYRGSRVAAPETGSTNRLSITLKSDGQLNASNNDVNLFDTVTLPAAVNDHLKNEKKILLKAGSYDDERTVVSVKTDNQEGVKTEDTPAEKETGPEVDDSKVTYDTIQSKVLKAVIDQAFPRVKEYTLNGHTLPGQVQQFNQVFINNHRITPEVTYKKINETTAEYLMKLRDDAHLINAEMTVRLQVVDNQLHFDVTKIVNHNQVTPGQKIDDERKLLSSISFLGNALVSVSSDQTGAKFDGATMSNNTHVSGDDHIDVTNPMKDLAKGYMYGFVSTDKLAAGVWSNSQNSYGGGSNDWTRLTAYKETVGNANYVGIHSSEWQWEKAYKGIVFPEYTKELPSAKVVITEDANADKKVDWQDGAIAYRSIMNNPQGWEKVKDITAYRIAMNFGSQAQNPFLMTLDGIKKINLHTDGLGQGVLLKGYGSEGHDSGHLNYADIGKRIGGVEDFKTLIEKAKKYGAHLGIHVNASETYPESKYFNEKILRKNPDGSYSYGWNWLDQGINIDAAYDLAHGRLARWEDLKKKLGDGLDFIYVDVWGNGQSGDNGAWATHVLAKEINKQGWRFAIEWGHGGEYDSTFHHWAADLTYGGYTNKGINSAITRFIRNHQKDAWVGDYRSYGGAANYPLLGGYSMKDFEGWQGRSDYNGYVTNLFAHDVMTKYFQHFTVSKWENGTPVTMTDNGSTYKWTPEMRVELVDADNNKVVVTRKSNDVNSPQYRERTVTLNGRVIQDGSAYLTPWNWDANGKKLSTDKEKMYYFNTQAGATTWTLPSDWAKSKVYLYKLTDQGKTEEQELAVKDGKITLDLLANQPYVLYRSKQTNPEMSWSEGMHIYDQGFNSGTLKHWTISGDASKAEIVKSQGANDMLRIQGNKEKVSLTQKLTGLKPNTKYAVYVGVDNRSNAKASITVNTGEKEVTTYTNKSLALNYVKAYAHNTRRNNATVDDTSYFQNMYAFFTTGSDVSNVTLTLSREAGDQATYFDEIRTFENNSSMYGDKHDTGKGTFKQDFENVAQGIFPFVVGGVEGVEDNRTHLSEKHDPYTQRGWNGKKVDDVIEGNWSLKTNGLVSRRNLVYQTIPQNFRFEAGKTYRVTFEYEAGSDNTYAFVVGKGKFQSGRRGTQASNLEMHELPNTWTDSKKAKKATFLVTGAETGDTWVGIYSTGNASNTRGDSGGNANFRGYNDFMMDNLQIEEITLTGKMLTENALKNYLPTVAMTNYTKESMDALKEAVFNLSQADDDISVEEARAEIAKIEALKNALVQKKTALVADDFASLTAPAQAQEGLANAFDGNLSSLWHTSWGGGDVGKPATMVLKEPTEITGLRYVPRGSGSNGNLRDVKLVVTDESGKEHTFTATDWPDNNKPKDIDFGKTIKAKKIVLTGTKTYGDGGDKYQSAAELIFTRPQVAETPLDLSGYEATLAKAQKLTDKDNQEEVASVQASMKYATDNHLLTERMVEYFADYLNQLKDSATKPDAPTVEKPEFKLSSIVSDQGKTPDYKQEIARPETPEQILPATGESQSDTSLFLASVSLALSALFVVKTKKD
⊟Experimental data[edit | edit source]
- protein localization:
- interaction partners:
⊟Expression & Regulation[edit | edit source]
⊟Operon[edit | edit source]
⊟Regulation[edit | edit source]
- regulator:
⊟Transcription[edit | edit source]
- transcription start site:
⊟Expression data[edit | edit source]
- PneumoExpress for strain D39V: SPV_0335 PneumoExpress
⊟Biological Material[edit | edit source]
⊟Mutants[edit | edit source]
⊟Expression vector[edit | edit source]
⊟lacZ fusion[edit | edit source]
⊟GFP fusion[edit | edit source]
⊟two-hybrid system[edit | edit source]
⊟FLAG-tag construct[edit | edit source]
⊟Antibody[edit | edit source]
⊟Other Information[edit | edit source]
You can add further information about the gene and protein here. [edit]