Jump to navigation
Jump to search
PangenomeTIGR4
serotype 4
D39serotype 2
D39Vserotype 2
Hungary19A-6serotype 19A
EF3030serotype 19F
670-6Bserotype 6B
6A-10serotype 6A
70585serotype 5
A026serotype 19F
A66serotype 3
AP200serotype 11A
ASP0581serotype 12F
ATCC 49619serotype 19F
ATCC 700669serotype 23F
BM6001serotype 19F
BVJ1JLserotype 1
CGSP14serotype 14
G54serotype 19F
HU-OHserotype 3
Hu15serotype 19A
Hu17serotype 19A
INV104serotype 1
INV200serotype 14
JJAserotype 14
MDRSPN001serotype 19F
NCTC7465serotype 1
NCTC7466serotype 2
NU83127serotype 4
OXC141serotype 3
P1031serotype 1
R6serotype 2
SP49serotype 19A
SPN032672serotype 1
SPN034156serotype 3
SPN034183serotype 3
SPN994038serotype 3
SPN994039serotype 3
SPNA45serotype 3
ST556serotype 19F
TCH8431/19Aserotype 19A
Taiwan19F-14serotype 19F
Xen35serotype 4
gamPNI0373serotype 1
NCBI: 17-DEC-2024
⊟Summary[edit | edit source]
- organism: Streptococcus pneumoniae A66
- locus tag: A66_RS07545 [old locus tag: A66_01518 ]
- pan locus tag?: PNEUPAN003112000
- symbol: A66_RS07545
- pan gene symbol?: nanA
- synonym:
- product: SIALI-17 repeat-containing surface protein
⊟Genome View[edit | edit source]
⊟Gene[edit | edit source]
⊟General[edit | edit source]
- type: CDS
- locus tag: A66_RS07545 [old locus tag: A66_01518 ]
- symbol: A66_RS07545
- product: SIALI-17 repeat-containing surface protein
- replicon: chromosome
- strand: -
- coordinates: 1459597..1462494
- length: 2898
- essential: unknown
⊟Accession numbers[edit | edit source]
- Location: NZ_LN847353 (1459597..1462494) NCBI
- BioCyc:
- MicrobesOnline:
- PneumoBrowse for strain D39V: SPV_1504 PneumoBrowse
⊟Phenotype[edit | edit source]
Share your knowledge and add information here. [edit]
⊟DNA sequence[edit | edit source]
- 1
61
121
181
241
301
361
421
481
541
601
661
721
781
841
901
961
1021
1081
1141
1201
1261
1321
1381
1441
1501
1561
1621
1681
1741
1801
1861
1921
1981
2041
2101
2161
2221
2281
2341
2401
2461
2521
2581
2641
2701
2761
2821
2881ATGAATCGGAGTGTTCAAGAACGTAAGTGTCGTTATAGCATTAGGAAACTATCGGTAGGA
GCGGTTTCTATGATTGTAGGAGCAGTGGTATTTGGAACGTCTCCTGTTTTAGCTCAAGAA
GGGGCAAGTGAGCAACCTCTGGCAAATGAAACTCAACTTTCGGGGGGGAGCTCAACCCTA
ACTGATACAGAAAAGAGCCAGCCTTCTTCAGAGACTGAACTTTCTGGCAATAAGCAAGAA
CAAGAAAGGAAAGATAAGCAAGAAGAAAAAATTCCAAGAGATTACTATGCACGAGATTTG
GAAAATGTCGAAACAGTGATAGAAAAAGAAGATGTTGAAACCAATGCTTCAAATGGTCAG
AGAGTTGATTTATCAAGTGAACTAGATAAACTAAAGAAACTTGAAAACGCAACAGTTCAC
ATGGAGTTTAAGCCAGATGCCAAGGCCCCAGCATTCTATAATCTCTTTTCTGTGTCAAGT
GCTACTAAAAAAGATGAGTACTTCACTATGGCAGTTTACAATAATACTGCTACTCTAGAG
GGGCGTGGTTCGGATGGGAAACAGTTTTACAATAATTACAACGATGCACCCTTAAAAGTT
AAACCAGGTCAGTGGAATTCTGTGACTTTCACAGTTGAAAAACCGACAGCAGAACTACCT
AAAGGCCGAGTGCGCCTCTACGTAAACGGGGTATTATCTCGAACAAGTCTGAGATCTGGC
AATTTCATTAAAGATATGCCAGATGTAACGCATGTGCAAATCGGAGCAACCAAGCGTGCC
AACAATACGGTTTGGGGGTCAAATCTACAGATTCGGAATCTCACTGTGTATAATCGTGCT
TTAACACCAGAAGAGGTACAAAAACGTAGTCAACTTTTTAAACGCTCAGATTTAGAAAAA
AAACTACCTGAAGGAGCGGCTTTAACAGAGAAAACGGACATATTCGAAAGCGGGCGTAAC
GGTAACCCAAATAAAGATGGAATCAAGAGTTATCGTATTCCAGCACTTCTCAAGACAGAT
AAAGGAACTTTGATCGCAGGTGCAGATGAACGCCGTCTCCATTCGAGTGACTGGGGTGAT
ATCGGTATGGTCATCAGACGTAGTGAAGATAATGGAAAGACATGGGGAGATAAGGTGGTT
ATCTCCAATCTTCGAGATAATCCTGAAGCTAAAGATCCTGCTGCGCCATCGCCTCTAAAT
ATTGATATGGTTTTGGTTCAAGACCCGACAACAAAGAGAATCTTCTCAATTTATGATATG
TTCCCAGAAGGTCGAGCAGTTTTTGGAATGCCAAAAACACCTGAAAAAGCTTATGAAAAG
ATAGGGGATAAAACTTATCAAATCTTGTATAAACAAGGAGAGTCTGGTCATTATACTGTT
CGTGAGAATGGAGAAGTGTATAATGCACAAAATCAAAAGACGGATTATCGTGTTGTAGTG
AATCCAACAGAACCTGGCTATAGAGATAAAGGAAATCTTTACAAAGGTCAGGAATTGATT
GGAAATATCTATTTTGCACACAGTACAAAAAATCCATTTAGAGTAGCCAATACGAGCTAT
CTATGGATGTCATATAGTGACGATGATGGTAAAACTTGGTCTGCACCGAGAGACATTACT
CCAGGTCTTCGCAAGGATTGGATGAAGTTCCTAGGAACAGGTCCTGGAACAGGAATTGTA
CTTCGGAATGGGCCTCACAAGGGACGGATTTTGATACCGGTTTATACGACTAATAATGTA
TCTCACTTAAATGGCTCGCAATCTTCTCGTGTCATCTATTCAGATGATCATGGAAAAACT
TGGCATGCTGGAGAAGCGGTCAACGATAACCGTCAGGTAGACGGTCAAAAGATCCACTCT
TCTACGATGAACAATAAACGTGCGCAAAATACAGAATCAACGGTGGTACAACTAAACAAT
GGAGATGTTAAACTCTTTATGCGTGGTTTGACTGGAGATCTTCAGGTTGCTACAAGTAAA
GACGGAGGAGTGACTTGGGAGAAGGATATCAAACGTTATCCACAGGTTAAAGATGTCTAT
GTTCAAATGTCTGCTATCCATACGATGCACGAAGGAAAAGAATACATCATCCTCAGTAAT
GCAGGTGGACCGAAACGTGAAAATGGGATGGTCCACTTGGCACGTGTCGAAGAAAATGGT
GAGTTGACTTGGCTCAAACACAATCCAATTCAAAAAGGAGAGTTTGCCTATAATTCGCTC
CAAGAATTAGGAAATGGGGAGTATGGTATCTTGTATGAACATACTGAAAAAGGACAAAAT
GCCTATACCCTATCATTTAGAAAATTTAATTGGGACTTTTTGAGCAAAGATCTGATTTCT
CCTACCGAAGCGAAAGTGAAGCGAACTAGAGAGATGGGCAAAGGAGAGATGGGCAAAGGA
GTTATTGGCTTGGAGTTCGACTCAGAAGTATTGGTCAACAAGGCTCCAACCCTTCAATTG
GCAAATGGTAAAACAGCGACTTTCCTAACCCAGTATGATAGCAAGACCTTGTTGTTTGCA
GTAGATAAGGAAGATATCGGACAGAAAATTATTGGTATAGCTAAAGGAAGCATCGAAAGT
ATGCATAATCTTCCTGTAAATCTAGCAGGTGCCAGAGTTCCTGGCGGAGTAAATGGTAGC
AAAGCAGCGGTGCATGAAGTTCCAGAATTTACAAGGGGAGTTAATGGTACAGAGCCAGCT
GTTCATGAAATCGCAGAGTATAAGGGATCTGATTCGCTTGTAACTCTTACTACAAAAGAA
GATTATACTTACAAAGCTCCTCTTGCTCAGCAGGCACTTCCTGAAACAGGAAACAAGGAG
AGTGACCTCCTAGCTTCACTAGGACTAACAGCTTTCTTCCTTGGTCTGTTTACGCTAGGG
AAAAAGAGAGAACAATAA60
120
180
240
300
360
420
480
540
600
660
720
780
840
900
960
1020
1080
1140
1200
1260
1320
1380
1440
1500
1560
1620
1680
1740
1800
1860
1920
1980
2040
2100
2160
2220
2280
2340
2400
2460
2520
2580
2640
2700
2760
2820
2880
2898
⊟Protein[edit | edit source]
⊟General[edit | edit source]
- locus tag: A66_RS07545 [old locus tag: A66_01518 ]
- symbol: A66_RS07545
- description: SIALI-17 repeat-containing surface protein
- length: 965
- theoretical pI: 8.96437
- theoretical MW: 107328
- GRAVY: -0.67772
⊟Function[edit | edit source]
- TIGRFAM: gram-positive signal peptide, YSIRK family (TIGR01168; HMM-score: 52.7)and 2 moreCell envelope Other LPXTG cell wall anchor domain (TIGR01167; HMM-score: 16.9)type VII secretion effector, TIGR04197 family (TIGR04197; HMM-score: 12.9)
- TheSEED: data available for D39, Hungary19A-6
- PFAM: Concanavalin (CL0004) Sialidase; Sialidase, N-terminal domain (PF02973; HMM-score: 285.7)and 12 moreSialidase (CL0434) BNR_2; BNR repeat-like domain (PF13088; HMM-score: 78.4)Concanavalin (CL0004) Laminin_G_3; Concanavalin A-like lectin/glucanases superfamily (PF13385; HMM-score: 52.3)Sialidase (CL0434) BNR_3; BNR repeat-like domain (PF13859; HMM-score: 44)no clan defined YSIRK_signal; YSIRK type signal peptide (PF04650; HMM-score: 42.7)Sialidase (CL0434) BNR; BNR/Asp-box repeat (PF02012; HMM-score: 35.5)no clan defined Gram_pos_anchor; LPXTG cell wall anchor motif (PF00746; HMM-score: 17.8)Beta_propeller (CL0186) CyRPA; Cysteine-Rich Protective Antigen 6 bladed domain (PF18638; HMM-score: 13.4)E-set (CL0159) CHB_HEX_C; Chitobiase/beta-hexosaminidase C-terminal domain (PF03174; HMM-score: 12.6)Sialidase (CL0434) Sortilin-Vps10; Sortilin, neurotensin receptor 3, (PF15902; HMM-score: 12.5)no clan defined Epiglycanin_TR; Tandem-repeating region of mucin, epiglycanin-like (PF05647; HMM-score: 12.4)HAD (CL0137) PGP_phosphatase; Mitochondrial PGP phosphatase (PF09419; HMM-score: 12.1)GT-C (CL0111) YfhO; Bacterial membrane protein YfhO (PF09586; HMM-score: 6.7)
⊟Structure, modifications & cofactors[edit | edit source]
- domains:
- modifications:
- cofactors:
- effectors:
⊟Localization[edit | edit source]
- PSORTb: Cellwall
- Cytoplasmic Score: 0
- Cytoplasmic Membrane Score: 0
- Cellwall Score: 10
- Extracellular Score: 0
- Internal Helices: 2
- DeepLocPro: Cell wall & surface
- Cytoplasmic Score: 0.0002
- Cytoplasmic Membrane Score: 0.0442
- Cell wall & surface Score: 0.7893
- Extracellular Score: 0.1664
- SignalP: Signal peptide SP(Sec/SPI) length 38 aa
- SP(Sec/SPI): 0.987776
- TAT(Tat/SPI): 0.009269
- LIPO(Sec/SPII): 0.001045
- Cleavage Site: CS pos: 38-39. VLA-QE. Pr: 0.9562
- predicted transmembrane helices (TMHMM): 1
⊟Accession numbers[edit | edit source]
- GI:
- RefSeq: WP_138027450 NCBI
- UniProt:
⊟Protein sequence[edit | edit source]
- MNRSVQERKCRYSIRKLSVGAVSMIVGAVVFGTSPVLAQEGASEQPLANETQLSGGSSTLTDTEKSQPSSETELSGNKQEQERKDKQEEKIPRDYYARDLENVETVIEKEDVETNASNGQRVDLSSELDKLKKLENATVHMEFKPDAKAPAFYNLFSVSSATKKDEYFTMAVYNNTATLEGRGSDGKQFYNNYNDAPLKVKPGQWNSVTFTVEKPTAELPKGRVRLYVNGVLSRTSLRSGNFIKDMPDVTHVQIGATKRANNTVWGSNLQIRNLTVYNRALTPEEVQKRSQLFKRSDLEKKLPEGAALTEKTDIFESGRNGNPNKDGIKSYRIPALLKTDKGTLIAGADERRLHSSDWGDIGMVIRRSEDNGKTWGDKVVISNLRDNPEAKDPAAPSPLNIDMVLVQDPTTKRIFSIYDMFPEGRAVFGMPKTPEKAYEKIGDKTYQILYKQGESGHYTVRENGEVYNAQNQKTDYRVVVNPTEPGYRDKGNLYKGQELIGNIYFAHSTKNPFRVANTSYLWMSYSDDDGKTWSAPRDITPGLRKDWMKFLGTGPGTGIVLRNGPHKGRILIPVYTTNNVSHLNGSQSSRVIYSDDHGKTWHAGEAVNDNRQVDGQKIHSSTMNNKRAQNTESTVVQLNNGDVKLFMRGLTGDLQVATSKDGGVTWEKDIKRYPQVKDVYVQMSAIHTMHEGKEYIILSNAGGPKRENGMVHLARVEENGELTWLKHNPIQKGEFAYNSLQELGNGEYGILYEHTEKGQNAYTLSFRKFNWDFLSKDLISPTEAKVKRTREMGKGEMGKGVIGLEFDSEVLVNKAPTLQLANGKTATFLTQYDSKTLLFAVDKEDIGQKIIGIAKGSIESMHNLPVNLAGARVPGGVNGSKAAVHEVPEFTRGVNGTEPAVHEIAEYKGSDSLVTLTTKEDYTYKAPLAQQALPETGNKESDLLASLGLTAFFLGLFTLGKKREQ
⊟Experimental data[edit | edit source]
- protein localization:
- interaction partners:
⊟Expression & Regulation[edit | edit source]
⊟Operon[edit | edit source]
⊟Regulation[edit | edit source]
- regulator:
⊟Transcription[edit | edit source]
- transcription start site:
⊟Expression data[edit | edit source]
- PneumoExpress for strain D39V: SPV_1504 PneumoExpress
⊟Biological Material[edit | edit source]
⊟Mutants[edit | edit source]
⊟Expression vector[edit | edit source]
⊟lacZ fusion[edit | edit source]
⊟GFP fusion[edit | edit source]
⊟two-hybrid system[edit | edit source]
⊟FLAG-tag construct[edit | edit source]
⊟Antibody[edit | edit source]
⊟Other Information[edit | edit source]
You can add further information about the gene and protein here. [edit]