Supporting Ontology-Based
Standardization of Biomedical Metadata in
the CEDAR Workbench
Marcos MartĂ­nez-Romero
Stanford University
Stanford Universitymetadatacenter.org
EDAR
OR EXPANDED DATA
ION AND RETRIEVAL
CEDAR
CENTER FOR EXPANDED DATA
ANNOTATION AND RETRIEVAL
CEDAR
DAR
DAR
CENTER FOR EXPANDED DATA
9/14/2017
2
age
Age
AGE
`Age
age (after birth)
age (in years)
age (y)
age (year)
age (years)
Age (years)
Age (Years)
age (yr)
age (yr-old)
age (yrs)
Age (yrs)
age [y]
age [year]
age [years]
age in years
age of patient
Age of patient
age of subjects
age(years)
Age(years)
Age(yrs.)
Age, year
age, years
age, yrs
age.year
age_years
Metadata are not standardized
3
age
Age
AGE
`Age
age (after birth)
age (in years)
age (y)
age (year)
age (years)
Age (years)
Age (Years)
age (yr)
age (yr-old)
age (yrs)
Age (yrs)
age [y]
age [year]
age [years]
age in years
age of patient
Age of patient
age of subjects
age(years)
Age(years)
Age(yrs.)
Age, year
age, years
age, yrs
age.year
age_years
Metadata are not standardized
Age-Years (NCIT)
(http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C37908)
“The	length	of	a	person's	life,	stated	in	years	since	birth.”
4
It’s extremely hard to:
–find experimental datasets
–understand how the experiments were
performed
–replicate study findings
Metadata are not standardized
5
Generating standard metadata is hard
‱ Submission formats rarely support
ontology terms
‱ No easy way of finding terms from
ontologies and including them into
metadata submissions
6
7
Semantic ecosystem to enable the
creation of high-quality metadata in
biomedicine
8
The CEDAR Workbench
Template Designer Metadata Editor
Template authors Metadata authors
design
templates
Metadata Repository
template
ïŹll in templates
with metadata
metadata
Public Databases
LINCS
submit
metadata
Biomedical Ontologies 9
Template Designer Metadata Editor
Template authors Metadata authors
design
templates
Metadata Repository
template
ïŹll in templates
with metadata
metadata
Public Databases
LINCS
submit
metadata
Biomedical Ontologies
The CEDAR Workbench
10
11
12
13
14
15
16
17
18
19
20
21
Template Designer Metadata Editor
Template authors Metadata authors
design
templates
Metadata Repository
template
ïŹll in templates
with metadata
metadata
Public Databases
LINCS
submit
metadata
Biomedical Ontologies
The CEDAR Workbench
22
23
24
{
"@context": {
"rdfs": "http://www.w3.org/2000/01/rdf-schema#",
"xsd": "http://www.w3.org/2001/XMLSchema#",
"pav": "http://purl.org/pav/",
//...
"Title": "http://purl.obolibrary.org/obo/NGS_0000055",
"Disorder": "http://purl.org/net/OCRe/OCRe.owl#OCRE900086",
"Institution": "http://semantic-dicom.org/dcm#InstitutionName",
"Principal Investigator": "http://purl.org/net/OCRe/OCRe.owl#OCRE901006",
"Study Type": "http://purl.obolibrary.org/obo/NGS_0000056"
},
"@type": "http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C63536",
"Title": {
"@value": "A sample study"
},
"Disorder": {
"@id": "http://purl.obolibrary.org/obo/DOID_8986",
"rdfs:label": "narcolepsy"
},
"Institution": {
"@value": "Stanford University"
},
"Principal Investigator": {
"@value": "John Doe"
},
"Study Type": {
"@id": "http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C15273",
"rdfs:label": "Longitudinal Study"
},
// ...
"schema:isBasedOn": "https://repo.metadatacenter.orgx/templates/6381a0ce-3904-4885-bc44-5caacb4ad0e6",
"schema:name": "Study metadata",
"schema:description": "Study template",
"pav:createdOn": "2017-09-05T09:50:28-0700",
"pav:createdBy": "https://metadatacenter.org/users/8d787b98-33dd-4aff-a88c-440caf452c61",
"pav:lastUpdatedOn": "2017-09-05T09:50:28-0700",
"oslc:modifiedBy": "https://metadatacenter.org/users/8d787b98-33dd-4aff-a88c-440caf452c61",
"@id": "https://repo.metadatacenter.orgx/template-instances/ffe856e7-d920-480d-a666-009041f609e3"
}
25
Value Set Creation
‱ Lists of permissible
values for fields
‱ Example:
Longitudinal study types
– Prospective study
– Retrospective study
– Hybrid design
26
Class Creation
‱ Dynamically define a
new class and
immediately use it
‱ Optionally link it to
existing classes
– Ontology maintainers
may use this
information to enrich
their ontologies
‱ Example:
adductor dorsalis
27
Class Creation
CEDAR Provisional Classes
(CEDARPC)
UBERON
adductor
dorsalis
adductor
muscle
subclassOf
28
Evaluation
‱ The LINCS Consortium
– Cellular signatures
‱ ImmPort
– Immunology
‱ AIRR Community
– Datasets acquired using sequencing
– Submission to NCBI BioSample
‱ Stanford University Libraries
29
Summary
‱ Authoring metadata is hard and time-consuming
‱ Authoring semantic metadata is even harder
– Lack of convenient tools for linking metadata to
ontologies in a metadata authoring workflow
‱ The CEDAR Workbench facilitates metadata
creation in a semantically rigorous way
– Add type and property assertions
– Constrain the values of fields to ontology terms
– Create classes and value sets
http://metadatacenter.org
http://cedar.metadatacenter.net
30
CEDAR
CENTER FOR EXPANDED DATA
ANNOTATION AND RETRIEVAL
CEDAR
CENTER FOR EXPANDED DATA
ANNOTATION AND RETRIEVAL
CEDAR
CEDAR
CEDAR
I
Metadata
Thanks!
31

More Related Content

PDF
The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata ...
PDF
An Open Repository Model for Acquiring Knowledge About Scientific Experiments
PDF
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
PDF
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
PPTX
Highly dimensional data_20160926
PPT
Lightning Talk, Konkiel: Bootstrapping Library Data Management Services for E...
 
PDF
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
PPTX
Progeny Clinical
The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata ...
An Open Repository Model for Acquiring Knowledge About Scientific Experiments
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
Highly dimensional data_20160926
Lightning Talk, Konkiel: Bootstrapping Library Data Management Services for E...
 
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
Progeny Clinical

What's hot (20)

PDF
Canadian health census to lod
PPTX
Model Organism Linked Data
PPTX
RDAP13 Amy Nurnberger: Publishers Like Open Science (too)
 
PPTX
Research Data Sharing: A Basic Framework
PPTX
DAS game: how a programmer thinks
PPT
eScience at the Royal Society of Chemistry and our current initiatives
PPTX
Leveraging publication metadata to help overcome the data ingest bottleneck
PPTX
Presentation from Code Camp 2017
PDF
High quality data publications: drives and needs - Sansone, BDebate, 12 Nov 2014
PDF
Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
PPTX
The FAIRDOM Commons for Systems Biology
PPTX
The Dryad Digital Repository: Published evolutionary data as part of the gre...
PDF
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
PPT
eScience Resources for the Chemistry Community from the Royal Society of Chem...
PDF
Knowledge Exchange, Nov 2011, Bonn
PDF
Federating Research Profiling Data
PPTX
Crosslinks
PPTX
Zmasek TOPSAN Biohackathon 2011
PPTX
Research data and scholarly publications: going from casual acquaintances to ...
Canadian health census to lod
Model Organism Linked Data
RDAP13 Amy Nurnberger: Publishers Like Open Science (too)
 
Research Data Sharing: A Basic Framework
DAS game: how a programmer thinks
eScience at the Royal Society of Chemistry and our current initiatives
Leveraging publication metadata to help overcome the data ingest bottleneck
Presentation from Code Camp 2017
High quality data publications: drives and needs - Sansone, BDebate, 12 Nov 2014
Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
The FAIRDOM Commons for Systems Biology
The Dryad Digital Repository: Published evolutionary data as part of the gre...
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
eScience Resources for the Chemistry Community from the Royal Society of Chem...
Knowledge Exchange, Nov 2011, Bonn
Federating Research Profiling Data
Crosslinks
Zmasek TOPSAN Biohackathon 2011
Research data and scholarly publications: going from casual acquaintances to ...
Ad

Similar to Supporting Ontology-Based Standardization of Biomedical Metadata in the CEDAR Workbench (ICBO 2017 Conference) (10)

PDF
CEDAR work bench for metadata management
PDF
CEDAR: Web-Based Tools for Accelerating the Creation of Standardized Metadata
PDF
CEDAR: Web-Based Tools for Accelerating the Creation of Standardized Metadata
PDF
Cedar OnDemand: An intelligent browser extension to generate ontology-based m...
PDF
dkNET Webinar - FAIR Data Require Better Metadata: The Case for CEDAR 11/13/2020
 
PDF
Finding and Reusing Biomedical Datasets using CEDAR Metadata Repository and T...
PDF
CEDAR: Easing Authoring of Metadata to Make Biomedical Data Sets More Findabl...
DOC
aaai2005.doc
 
PPTX
GA4GH Phenotype Ontologies Task team update
PPTX
Human developmental-kb-2012
CEDAR work bench for metadata management
CEDAR: Web-Based Tools for Accelerating the Creation of Standardized Metadata
CEDAR: Web-Based Tools for Accelerating the Creation of Standardized Metadata
Cedar OnDemand: An intelligent browser extension to generate ontology-based m...
dkNET Webinar - FAIR Data Require Better Metadata: The Case for CEDAR 11/13/2020
 
Finding and Reusing Biomedical Datasets using CEDAR Metadata Repository and T...
CEDAR: Easing Authoring of Metadata to Make Biomedical Data Sets More Findabl...
aaai2005.doc
 
GA4GH Phenotype Ontologies Task team update
Human developmental-kb-2012
Ad

Recently uploaded (20)

PPT
Chapter 6 Introductory course Biology Camp
PDF
Thyroid Hormone by Iqra Nasir detail.pdf
PDF
SWAG Research Lab Scientific Publications
PPTX
INTRODUCTION TO CELL STRUCTURE_LESSON.pptx
PPTX
Earth-and-Life-Pieces-of-Evidence-Q2.pptx
PDF
naas-journal-rating-2025 for all the journals
PPTX
Bacterial and protozoal infections in pregnancy.pptx
PPTX
complications of tooth extraction.pptx FIRM B.pptx
PPTX
1. (Teknik) Atoms, Molecules, and Ions.pptx
PDF
TOPIC-1-Introduction-to-Bioinformatics_for dummies
PPTX
Chromosomal Aberrations Dr. Thirunahari Ugandhar.pptx
PDF
BCKIC FOUNDATION_MAY-JUNE 2025_NEWSLETTER
PPTX
ELS 2ND QUARTER 2 FOR HUMSS STUDENTS.pptx
PPTX
flavonoids/ Secondary Metabolites_BCH 314-2025.pptx
PDF
Pharmacokinetics Lecture_Study Material.pdf
PDF
Human Anatomy (Anatomy and Physiology A)
PPTX
ELS 2ND QUARTER 1 FOR HUMSS STUDENTS.pptx
PDF
SOCIAL PSYCHOLOGY chapter 1-what is social psychology and its definition
PDF
Sujay Rao Mandavilli Variable logic FINAL FINAL FINAL FINAL FINAL.pdf
PDF
No dilute core produced in simulations of giant impacts on to Jupiter
Chapter 6 Introductory course Biology Camp
Thyroid Hormone by Iqra Nasir detail.pdf
SWAG Research Lab Scientific Publications
INTRODUCTION TO CELL STRUCTURE_LESSON.pptx
Earth-and-Life-Pieces-of-Evidence-Q2.pptx
naas-journal-rating-2025 for all the journals
Bacterial and protozoal infections in pregnancy.pptx
complications of tooth extraction.pptx FIRM B.pptx
1. (Teknik) Atoms, Molecules, and Ions.pptx
TOPIC-1-Introduction-to-Bioinformatics_for dummies
Chromosomal Aberrations Dr. Thirunahari Ugandhar.pptx
BCKIC FOUNDATION_MAY-JUNE 2025_NEWSLETTER
ELS 2ND QUARTER 2 FOR HUMSS STUDENTS.pptx
flavonoids/ Secondary Metabolites_BCH 314-2025.pptx
Pharmacokinetics Lecture_Study Material.pdf
Human Anatomy (Anatomy and Physiology A)
ELS 2ND QUARTER 1 FOR HUMSS STUDENTS.pptx
SOCIAL PSYCHOLOGY chapter 1-what is social psychology and its definition
Sujay Rao Mandavilli Variable logic FINAL FINAL FINAL FINAL FINAL.pdf
No dilute core produced in simulations of giant impacts on to Jupiter

Supporting Ontology-Based Standardization of Biomedical Metadata in the CEDAR Workbench (ICBO 2017 Conference)

  • 1. Supporting Ontology-Based Standardization of Biomedical Metadata in the CEDAR Workbench Marcos MartĂ­nez-Romero Stanford University Stanford Universitymetadatacenter.org EDAR OR EXPANDED DATA ION AND RETRIEVAL CEDAR CENTER FOR EXPANDED DATA ANNOTATION AND RETRIEVAL CEDAR DAR DAR CENTER FOR EXPANDED DATA 9/14/2017
  • 2. 2
  • 3. age Age AGE `Age age (after birth) age (in years) age (y) age (year) age (years) Age (years) Age (Years) age (yr) age (yr-old) age (yrs) Age (yrs) age [y] age [year] age [years] age in years age of patient Age of patient age of subjects age(years) Age(years) Age(yrs.) Age, year age, years age, yrs age.year age_years Metadata are not standardized 3
  • 4. age Age AGE `Age age (after birth) age (in years) age (y) age (year) age (years) Age (years) Age (Years) age (yr) age (yr-old) age (yrs) Age (yrs) age [y] age [year] age [years] age in years age of patient Age of patient age of subjects age(years) Age(years) Age(yrs.) Age, year age, years age, yrs age.year age_years Metadata are not standardized Age-Years (NCIT) (http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C37908) “The length of a person's life, stated in years since birth.” 4
  • 5. It’s extremely hard to: –find experimental datasets –understand how the experiments were performed –replicate study findings Metadata are not standardized 5
  • 6. Generating standard metadata is hard ‱ Submission formats rarely support ontology terms ‱ No easy way of finding terms from ontologies and including them into metadata submissions 6
  • 7. 7
  • 8. Semantic ecosystem to enable the creation of high-quality metadata in biomedicine 8
  • 9. The CEDAR Workbench Template Designer Metadata Editor Template authors Metadata authors design templates Metadata Repository template ïŹll in templates with metadata metadata Public Databases LINCS submit metadata Biomedical Ontologies 9
  • 10. Template Designer Metadata Editor Template authors Metadata authors design templates Metadata Repository template ïŹll in templates with metadata metadata Public Databases LINCS submit metadata Biomedical Ontologies The CEDAR Workbench 10
  • 11. 11
  • 12. 12
  • 13. 13
  • 14. 14
  • 15. 15
  • 16. 16
  • 17. 17
  • 18. 18
  • 19. 19
  • 20. 20
  • 21. 21
  • 22. Template Designer Metadata Editor Template authors Metadata authors design templates Metadata Repository template ïŹll in templates with metadata metadata Public Databases LINCS submit metadata Biomedical Ontologies The CEDAR Workbench 22
  • 23. 23
  • 24. 24
  • 25. { "@context": { "rdfs": "http://www.w3.org/2000/01/rdf-schema#", "xsd": "http://www.w3.org/2001/XMLSchema#", "pav": "http://purl.org/pav/", //... "Title": "http://purl.obolibrary.org/obo/NGS_0000055", "Disorder": "http://purl.org/net/OCRe/OCRe.owl#OCRE900086", "Institution": "http://semantic-dicom.org/dcm#InstitutionName", "Principal Investigator": "http://purl.org/net/OCRe/OCRe.owl#OCRE901006", "Study Type": "http://purl.obolibrary.org/obo/NGS_0000056" }, "@type": "http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C63536", "Title": { "@value": "A sample study" }, "Disorder": { "@id": "http://purl.obolibrary.org/obo/DOID_8986", "rdfs:label": "narcolepsy" }, "Institution": { "@value": "Stanford University" }, "Principal Investigator": { "@value": "John Doe" }, "Study Type": { "@id": "http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C15273", "rdfs:label": "Longitudinal Study" }, // ... "schema:isBasedOn": "https://repo.metadatacenter.orgx/templates/6381a0ce-3904-4885-bc44-5caacb4ad0e6", "schema:name": "Study metadata", "schema:description": "Study template", "pav:createdOn": "2017-09-05T09:50:28-0700", "pav:createdBy": "https://metadatacenter.org/users/8d787b98-33dd-4aff-a88c-440caf452c61", "pav:lastUpdatedOn": "2017-09-05T09:50:28-0700", "oslc:modifiedBy": "https://metadatacenter.org/users/8d787b98-33dd-4aff-a88c-440caf452c61", "@id": "https://repo.metadatacenter.orgx/template-instances/ffe856e7-d920-480d-a666-009041f609e3" } 25
  • 26. Value Set Creation ‱ Lists of permissible values for fields ‱ Example: Longitudinal study types – Prospective study – Retrospective study – Hybrid design 26
  • 27. Class Creation ‱ Dynamically define a new class and immediately use it ‱ Optionally link it to existing classes – Ontology maintainers may use this information to enrich their ontologies ‱ Example: adductor dorsalis 27
  • 28. Class Creation CEDAR Provisional Classes (CEDARPC) UBERON adductor dorsalis adductor muscle subclassOf 28
  • 29. Evaluation ‱ The LINCS Consortium – Cellular signatures ‱ ImmPort – Immunology ‱ AIRR Community – Datasets acquired using sequencing – Submission to NCBI BioSample ‱ Stanford University Libraries 29
  • 30. Summary ‱ Authoring metadata is hard and time-consuming ‱ Authoring semantic metadata is even harder – Lack of convenient tools for linking metadata to ontologies in a metadata authoring workflow ‱ The CEDAR Workbench facilitates metadata creation in a semantically rigorous way – Add type and property assertions – Constrain the values of fields to ontology terms – Create classes and value sets http://metadatacenter.org http://cedar.metadatacenter.net 30
  • 31. CEDAR CENTER FOR EXPANDED DATA ANNOTATION AND RETRIEVAL CEDAR CENTER FOR EXPANDED DATA ANNOTATION AND RETRIEVAL CEDAR CEDAR CEDAR I Metadata Thanks! 31