Semantic Properties
and Units for Chemistry
Stuart Chalk
Department of Chemistry
University of North Florida
schalk@unf.edu
• Semantic Chemical Property Data
• IUPAC Green Book for Properties & Units for Chemistry
• Concepts in Metrology – the VIM
• QUDT – Semantic Metrology of the VIM
• The IUPAC Gold Book – Now and Future
• Conclusions
Overview
• Semantic → Resource Description Framework (RDF)
• Store data as Subject-Predicate-Object triples, e.g.
    benzene containsAtomtype carbon (object)
    benzene hasMolarMass 78.11 (literal)
    molarMass hasUnit g/mol
Semantic Chemical Property Data
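The three example triples can be sketched in code. This is a minimal illustration using only the Python standard library, not a full RDF toolkit; the `http://example.org/chem/` namespace and the `Triple` and `to_ntriples` names are hypothetical and chosen only for this sketch.

```python
from typing import NamedTuple, Union

# Hypothetical namespace, used only to turn the slide's short names into IRIs.
EX = "http://example.org/chem/"
XSD_DOUBLE = "http://www.w3.org/2001/XMLSchema#double"


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: Union[str, float]
    is_literal: bool = False  # False: obj names a resource; True: obj is a literal


def to_ntriples(t: Triple) -> str:
    """Serialize one triple as an N-Triples-style line."""
    s = f"<{EX}{t.subject}>"
    p = f"<{EX}{t.predicate}>"
    if not t.is_literal:
        o = f"<{EX}{t.obj}>"
    elif isinstance(t.obj, float):
        o = f'"{t.obj}"^^<{XSD_DOUBLE}>'  # typed numeric literal
    else:
        o = f'"{t.obj}"'  # plain string literal
    return f"{s} {p} {o} ."


triples = [
    Triple("benzene", "containsAtomtype", "carbon"),            # object is a resource
    Triple("benzene", "hasMolarMass", 78.11, is_literal=True),  # object is a literal
    Triple("molarMass", "hasUnit", "g/mol", is_literal=True),   # unit kept as a string here
]

lines = [to_ntriples(t) for t in triples]
for line in lines:
    print(line)
```

Storing the unit as the string "g/mol" is the weakest part of this sketch: a string carries no machine-readable meaning, which is the gap that a unit ontology is meant to close.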
• A generic data model to store scientific data
• Can be implemented in any file/database format
• For semantic applications
  • format in JSON-LD (https://www.w3.org/TR/json-ld/)
  • use the Scientific Data Model Ontology (SDMO)
• Model + ontology creates hybrid relational/graph DB
SciData Data Model
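As a rough illustration of what a JSON-LD encoding of a property value might look like, the sketch below builds a small document with Python's `json` module. It does not use the actual SciData or SDMO vocabulary: the `ex:` terms are hypothetical, and the QUDT IRIs (`qudt:numericValue`, `qudt:unit`, `unit:GM-PER-MOL`) are assumptions that should be checked against the current QUDT release.

```python
import json

# Hypothetical JSON-LD document: a molar mass for benzene with a linked unit.
doc = {
    "@context": {
        "ex": "http://example.org/chem/",        # hypothetical namespace
        "qudt": "http://qudt.org/schema/qudt/",  # assumed QUDT schema namespace
        "unit": "http://qudt.org/vocab/unit/",   # assumed QUDT unit namespace
        "xsd": "http://www.w3.org/2001/XMLSchema#",
        "hasMolarMass": {"@id": "ex:hasMolarMass"},
        "value": {"@id": "qudt:numericValue", "@type": "xsd:double"},
        "hasUnit": {"@id": "qudt:unit", "@type": "@id"},  # value is an IRI, not a string
    },
    "@id": "ex:benzene",
    "hasMolarMass": {
        "value": 78.11,
        "hasUnit": "unit:GM-PER-MOL",
    },
}

text = json.dumps(doc, indent=2)
print(text)

# The same file is still ordinary JSON, so non-semantic tools can read it too.
roundtrip = json.loads(text)
```

The `@context` is what lets the same file be read as plain nested JSON by one application and as a graph of linked resources by another.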
The IUPAC Green Book
3rd Edition, RSC, 2007
Published by IUPAC Division I, Physical Chemistry
BIPM International Vocabulary of Metrology (VIM)
http://www.bipm.org/en/publications/guides/vim.html
• Quantities
• Quantity kinds
• System of quantities
• Dimensions
• Dimension vectors
• Units
• Unit system
Metrology Concepts
If machines are going to capture and process chemical property data, a machine-actionable representation of these concepts is needed.
This can be encoded by semantic annotation of property values and units.
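One way to picture a machine-actionable representation of these concepts is as linked records: a quantity value points to a unit, and the unit points to a quantity kind that carries a dimension vector. The sketch below is an assumption-laden illustration, not the VIM's or any ontology's formal model; the class names, the exponent order (L, M, T, I, H, N, J) and the conversion factors are chosen for this example.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class QuantityKind:
    name: str
    # Exponents of the base dimensions, ordered (L, M, T, I, H, N, J).
    dimension: Tuple[int, ...]


@dataclass(frozen=True)
class Unit:
    symbol: str
    kind: QuantityKind
    to_si: float  # multiplicative factor to the coherent SI unit


@dataclass(frozen=True)
class Quantity:
    value: float
    unit: Unit

    def in_si(self) -> float:
        """Return the numerical value expressed in the coherent SI unit."""
        return self.value * self.unit.to_si


def same_dimension(a: Quantity, b: Quantity) -> bool:
    """Two quantities can be compared only if their dimension vectors match."""
    return a.unit.kind.dimension == b.unit.kind.dimension


MOLAR_MASS = QuantityKind("molar mass", (0, 1, 0, 0, 0, -1, 0))  # M N^-1
LENGTH = QuantityKind("length", (1, 0, 0, 0, 0, 0, 0))           # L

G_PER_MOL = Unit("g/mol", MOLAR_MASS, 1e-3)
KG_PER_MOL = Unit("kg/mol", MOLAR_MASS, 1.0)
METRE = Unit("m", LENGTH, 1.0)

benzene_molar_mass = Quantity(78.11, G_PER_MOL)
print(benzene_molar_mass.in_si())  # value in kg/mol
print(same_dimension(benzene_molar_mass, Quantity(0.07811, KG_PER_MOL)))
print(same_dimension(benzene_molar_mass, Quantity(1.0, METRE)))
```

A dimension check like `same_dimension` is exactly where dimensionless quantities cause trouble, since every one of them has the same all-zero vector.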
• Quantities, Units, Dimensions and DataTypes (QUDT)
• Defines common units and quantities
• Can be used to define any unit or quantity
• Includes a semantic representation of the VIM concepts
QUDT Ontology
https://qudt.org/
Semantic properties and units
QUDT Metrology Terms
• Unit, SIUnit, DerivedUnit, ImperialUnit, CoherentUnit, isDerivedCoherentUnitOfSystem, applicableUnit, applicableSIUnit, applicableImperialUnit
• Quantity, QuantityKind, hasQuantityKind
• SystemOfQuantities, belongsToSystemOfQuantities
• QuantityDimensionVector, hasQuantityDimensionVector
• Plus: hasEquivalentUnit, hasCorrespondingUnit …
• … and hasBaseSIUnit?
Dimensionless Quantities
• Counts of entities – 12 books
• Radian – L/L
• Steradian – L²/L²
• Mole fraction – mol/mol
• Parts per million – mg/kg or µg/g
• Percent – %(w/w) or %(v/v)
• How do we show a computer that these are different?
• Create a representation of the dimension vector that is unique for each dimensionless quantity
• For radian (L/L):
    L'1'M'0'T'0'I'0'H'0'N'0'J'0'D'0'_L'1'M'0'T'0'I'0'H'0'N'0'J'0'D'0'_D1 OR L'1'_L'1'_D1
• For steradian (L²/L²):
    L'2'M'0'T'0'I'0'H'0'N'0'J'0'D'0'_L'2'M'0'T'0'I'0'H'0'N'0'J'0'D'0'_D1 OR L'2'_L'2'_D1
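The encoding idea can be sketched as a small function that keeps the numerator and denominator dimension vectors separate instead of cancelling them. This is a reading of the notation on the slide, not a published specification: the symbol order, the `_D1` suffix and the short form that drops zero exponents are inferred from the examples, and the sketch assumes that the length exponent is the one meant for radian (L/L) and steradian (L²/L²). The function names are hypothetical.

```python
# Symbol order as it appears in the slide's examples (D marks "dimensionless").
SYMBOLS = ("L", "M", "T", "I", "H", "N", "J", "D")


def encode_vector(exponents: dict, short: bool = False) -> str:
    """Encode one dimension vector, e.g. {"L": 1} -> L'1'M'0'T'0'... (or L'1' if short)."""
    parts = []
    for symbol in SYMBOLS:
        exponent = exponents.get(symbol, 0)
        if short and exponent == 0:
            continue  # the short form omits zero exponents
        parts.append(f"{symbol}'{exponent}'")
    return "".join(parts)


def encode_ratio(numerator: dict, denominator: dict, short: bool = False) -> str:
    """Encode a dimensionless ratio as <numerator>_<denominator>_D1."""
    return f"{encode_vector(numerator, short)}_{encode_vector(denominator, short)}_D1"


radian = encode_ratio({"L": 1}, {"L": 1}, short=True)
steradian = encode_ratio({"L": 2}, {"L": 2}, short=True)
mole_fraction = encode_ratio({"N": 1}, {"N": 1}, short=True)
radian_long = encode_ratio({"L": 1}, {"L": 1})

print(radian)         # L'1'_L'1'_D1
print(steradian)      # L'2'_L'2'_D1
print(mole_fraction)  # N'1'_N'1'_D1
print(radian_long)
```

Because the numerator and denominator are never cancelled, radian, steradian and mole fraction each get a distinct string even though all three reduce to the same all-zero dimension vector.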
Units of Measure Interoperability Service
• ‘The Compendium of Chemical Terminology’
• Contains over 7000 definitions of chemistry concepts
• Some terms are out of date
• Currently under renovation to make terms machine-accessible
The IUPAC Gold Book
• Add terms defined in all current IUPAC PAC recommendations
• Add synonyms, acronyms, legacy terms
• Improve linking between terms
• Create an ontology for Chemistry
Future of the Gold Book
• Semantic chemical data is important in the move toward knowledge discovery
• Semantic unit representation requires clear representation of quantity kinds and dimension vectors for interoperability
• All chemical properties need to be represented semantically
Conclusions
• schalk@unf.edu
• Phone: 904-620-1938
• Skype: stuartchalk
• LinkedIn: https://www.linkedin.com/in/stuchalk
• ORCID: http://orcid.org/0000-0002-0703-7776
Questions?
