| Field | Value | Language |
| --- | --- | --- |
| dc.contributor.author | Liu, Chao Chun | |
| datacite.creator.affiliationIdentifier | https://ror.org/0213rcc28 | en_US |
| datacite.creator.affiliation | Simon Fraser University | en_US |
| datacite.creator.nameIdentifier | https://orcid.org/0000-0002-8645-2598 | en_US |
| dc.contributor.author | Hsiao, William | |
| datacite.creator.affiliationIdentifier | https://ror.org/0213rcc28 | en_US |
| datacite.creator.affiliation | Simon Fraser University | en_US |
| datacite.creator.nameIdentifier | https://orcid.org/0000-0002-1342-4043 | en_US |
| dc.coverage.temporal | 2000-09-14/2014-06-18 | |
| dc.date.accessioned | 2024-02-20T19:55:45Z | |
| dc.date.available | 2024-02-20T19:55:45Z | |
| dc.date.issued | 2024-02-20 | |
| dc.identifier.uri | https://www.frdr-dfdr.ca/repo/dataset/e3e5c521-c24e-41bc-8b0a-78bbb6e0053d | |
| dc.identifier.uri | https://doi.org/10.20383/103.0884 | |
| dc.description | The dataset comprises a collection of Salmonella genomes associated with 58 historical outbreaks curated from the GenomeTrakr database and peer-reviewed PubMed articles. The outbreaks are linked to a diverse set of lineages that can be subdivided into 10 different serovars. The genomic data is accompanied by detailed contextual information describing the collection date, geographical origin and isolation source of the outbreak cases. The dataset was compiled to train classifiers to predict outbreak cluster labels from genomic data and infer a parsimonious set of outbreak predictive markers from the resulting classifiers. | en_US |
| dc.publisher | Federated Research Data Repository / dépôt fédéré de données de recherche | |
| dc.rights | Creative Commons Attribution 4.0 International (CC BY 4.0) | en_US |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en_US |
| dc.subject | Bacterial genomics | en_US |
| dc.subject | Salmonella | en_US |
| dc.subject | Whole genome sequencing | en_US |
| dc.subject | Enteric outbreaks | en_US |
| dc.subject | Infectious diseases | en_US |
| dc.title | Machine learning reveals the dynamic importance of accessory sequences for Salmonella outbreak clustering | en_US |
| globus.shared_endpoint.name | f163c1b3-9c88-42f6-a7bb-5839ed6c4063 | |
| globus.shared_endpoint.path | /8/published/publication_879/ | |
| datacite.publicationyear | 2024 | |
| datacite.contributor.DataCollector | Chao Chun Liu | |
| datacite.contributor.DataManager | Chao Chun Liu | |
| datacite.contributor.Supervisor | William Hsiao | |
| datacite.date.Collected | 2021-09-15/2021-11-30 | |
| datacite.resourcetype | Dataset | en_US |
| datacite.relatedidentifier.Cites | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4336196/ | |
| datacite.relatedidentifier.Cites | https://journals.asm.org/doi/10.1128/jcm.01280-15 | |
| datacite.relatedidentifier.Cites | https://doi.org/10.1371/currents.outbreaks.aa5372d90826e6cb0136ff66bb7a62fc | |
| datacite.relatedidentifier.Cites | https://doi.org/10.1128/JCM.02200-15 | |
| datacite.relatedidentifier.Cites | https://journals.asm.org/doi/10.1128/jcm.03235-14 | |
| datacite.relatedidentifier.Cites | https://journals.asm.org/doi/10.1128/jcm.02332-14 | |
| datacite.relatedidentifier.Cites | https://journals.asm.org/doi/10.1128/jcm.00081-16 | |
| datacite.relatedidentifier.IsDerivedFrom | https://www.ncbi.nlm.nih.gov/genbank/ | |
| datacite.relatedidentifier.IsDerivedFrom | https://www.fda.gov/food/whole-genome-sequencing-wgs-program/genometrakr-network | |
| datacite.relatedidentifier.IsDerivedFrom | https://www.ncbi.nlm.nih.gov/pmc/ | |
| datacite.geolocation.geolocationPlace | -;-;-;Argentina | |
| datacite.geolocation.geolocationPlace | -;-;-;Australia | |
| datacite.geolocation.geolocationPlace | -;-;-;Bangladesh | |
| datacite.geolocation.geolocationPlace | -;-;-;Belgium | |
| datacite.geolocation.geolocationPlace | -;-;-;Canada | |
| datacite.geolocation.geolocationPlace | -;-;-;Ecuador | |
| datacite.geolocation.geolocationPlace | -;-;-;France | |
| datacite.geolocation.geolocationPlace | -;-;-;Mauritius | |
| datacite.geolocation.geolocationPlace | -;-;-;Mexico | |
| datacite.geolocation.geolocationPlace | -;-;-;Turkey | |
| datacite.geolocation.geolocationPlace | -;-;-;Vietnam | |
| datacite.geolocation.geolocationPlace | -;-;New Jersey;United States | |
| datacite.geolocation.geolocationPlace | -;-;Minnesota;United States | |
| datacite.geolocation.geolocationPlace | -;-;Ohio;United States | |
| datacite.fundingReference.funderIdentifier | https://ror.org/03gne5057 | en_US |
| datacite.fundingReference.funderName | Genome British Columbia | en_US |
| datacite.fundingReference.awardNumber | 286GET | en_US |
| datacite.fundingReference.awardTitle | | en_US |
| datacite.fundingReference.funderIdentifier | https://ror.org/00cjrc276 | en_US |
| datacite.fundingReference.funderName | Mitacs | en_US |
| datacite.fundingReference.awardNumber | | en_US |
| datacite.fundingReference.awardTitle | MITACS Accelerate | en_US |
| frdr.crdc.code | RDF1060703 | en_US |
| frdr.crdc.group_en | Biological sciences | en_US |
| frdr.crdc.class_en | Microbiology | en_US |
| frdr.crdc.field_en | Microbial genetics | en_US |
| frdr.crdc.group_fr | Sciences biologiques | fr_CA |
| frdr.crdc.class_fr | Microbiologie | fr_CA |
| frdr.crdc.field_fr | Génétique microbienne | fr_CA |
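
The `dc.coverage.temporal` and `datacite.date.Collected` values in this record are written as `start/end` date intervals, and the `datacite.geolocation.geolocationPlace` values as four semicolon-separated levels. The following is a minimal Python sketch of how such values might be parsed. The function names `parse_interval` and `place_parts` are hypothetical, and treating `-` as an empty placeholder level is an assumption inferred from the values in this record, not a documented rule.

```python
from datetime import date


def parse_interval(value):
    """Split a 'YYYY-MM-DD/YYYY-MM-DD' interval into (start, end) dates."""
    start_text, end_text = value.split("/")
    return date.fromisoformat(start_text), date.fromisoformat(end_text)


def place_parts(value):
    """Return the non-empty levels of a ';'-separated place string.

    Assumption: a lone '-' marks a level that was left unspecified.
    """
    return [part for part in value.split(";") if part != "-"]


if __name__ == "__main__":
    start, end = parse_interval("2000-09-14/2014-06-18")
    print(start, end)
    print(place_parts("-;-;New Jersey;United States"))
    print(place_parts("-;-;-;Canada"))
```

Returning only the populated levels avoids guessing names for the two leading levels, which are empty in every place value of this record.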