Toward Semantic Representation
of Science in Electronic Laboratory
Notebooks (ELNs)
Stuart J. Chalk
Department of Chemistry, University of North Florida
schalk@unf.edu
CINF Paper 50 – 251st ACS Meeting Spring 2016
#ACSCINFDataSummit
 Utopia: A Global Research Network
 What is an Electronic Notebook?
 The Semantics of Semantics
 What Needs to be Semantically Represented?
 Current lay of the land
 ELN Item Manifest
 P-PLAN Ontology
 VIVO-ISF Ontology
 Chemical Analysis Metadata Platform
 HCLS Community Profiles
 Electronic Notebook Ontology
 A generic scientific data model
 Experimental information for LD (ExptLD)
 Take Home
 Conclusion
Outline
 “Big Data” and the “Semantic Web” are the buzzwords
du jour, but what do they mean for chemistry?
 Lots of heterogeneous data and metadata, with even more
“semantic” data to represent them
 Look at what we want rather than what we have…
 We want chemical data that is:
 Easy to share, find, and compare
 Freely available but with provenance
 Globally sourced and without IP restrictions on reuse
Utopia: A Global Research Network
 An electronic way to record data…
 ...equivalent to a laboratory notebook
 But ELNs should not be thought of so lowly...
 An ELN must:*
 Keep track of research data
 Reference resources used in research and…
 …capture the story of research
What is an
Electronic Laboratory Notebook?
* Insight from Tony Williams
 The interface should mirror a laboratory notebook
 Behind the scenes, though, it should use state-of-the-art
software, data formats, data/metadata practices, and web
technologies to manage data generation, workflows,
remote data access, authentication, etc.
 As a result it needs to speak the same language as other
data sources and store data in a format that others can read
and reuse
 Foundational building block of a Global Research Network
What should an ELN be?
 Semantics is the study of meaning
-> We need to give meaning to what is created in an ELN
 Meaning is described in computers using the Resource
Description Framework (RDF), which:
 Makes statements about objects…
 … their relationships to other objects...
 ...using subject-predicate-object “triples”
 RDF allows knowledge representation
 Meaning is represented by using one or more ontologies
The Semantics of Semantics
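The subject-predicate-object idea can be sketched in a few lines of Python. This is only an illustration of the triple pattern, not how an RDF library stores data; the `example.org` IRIs, the property names, and the `objects_of` helper are all made up for the example.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    """One RDF-style statement: subject, predicate, object."""
    subject: str
    predicate: str
    obj: str


# Hypothetical namespace, used only for illustration
EX = "http://example.org/"

# A tiny "graph" is just a set of statements
graph = {
    Triple(EX + "notebook/1", EX + "hasExperiment", EX + "experiment/42"),
    Triple(EX + "experiment/42", EX + "usesSample", EX + "sample/7"),
    Triple(EX + "experiment/42", EX + "performedBy", EX + "person/1"),
}


def objects_of(graph, subject, predicate):
    """Return every object linked to `subject` by `predicate`, sorted."""
    return sorted(
        t.obj for t in graph
        if t.subject == subject and t.predicate == predicate
    )


samples = objects_of(graph, EX + "experiment/42", EX + "usesSample")
print(samples)
```

Because every statement has the same three-part shape, statements from different notebooks can be pooled into one set and queried the same way.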
RDF in JSON-LD
{
  "@context": {
    "name": "http://schema.org/name",
    "isAlive": "http://example.org/isAlive",
    "age": "http://example.org/age",
    "height": "http://schema.org/height",
    "@base": "http://www.unf.edu/chemistry/stuart_chalk.aspx"
  },
  "@id": "",
  "name": "Stuart Chalk",
  "isAlive": true,
  "age": 49,
  "height": 188.0
}
http://json-ld.org/playground/
<http://www.unf.edu/chemistry/stuart_chalk.aspx>
<http://example.org/age>
"49"^^<http://www.w3.org/2001/XMLSchema#integer> .
<http://www.unf.edu/chemistry/stuart_chalk.aspx>
<http://example.org/isAlive>
"true"^^<http://www.w3.org/2001/XMLSchema#boolean> .
<http://www.unf.edu/chemistry/stuart_chalk.aspx>
<http://schema.org/height>
"188"^^<http://www.w3.org/2001/XMLSchema#integer> .
<http://www.unf.edu/chemistry/stuart_chalk.aspx>
<http://schema.org/name>
"Stuart Chalk" .
RDF in JSON-LD
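The step from the JSON-LD document to the triples above can be sketched by hand. The function below is a deliberately narrow illustration, not a JSON-LD processor: `flat_jsonld_to_ntriples` is a hypothetical helper that handles only a flat object, a context mapping terms to IRIs, an empty `@id` resolved against `@base`, and string, boolean, or whole-number values. Real work should use a conforming JSON-LD library.

```python
import json

XSD = "http://www.w3.org/2001/XMLSchema#"


def flat_jsonld_to_ntriples(doc):
    """Convert a flat JSON-LD object into sorted N-Triples lines."""
    context = doc["@context"]
    # An empty "@id" refers to the base IRI itself; other relative
    # IRIs are not resolved in this sketch.
    subject = doc.get("@id") or context["@base"]
    lines = []
    for term, value in doc.items():
        if term.startswith("@"):
            continue  # skip JSON-LD keywords such as @context and @id
        predicate = context[term]
        if isinstance(value, bool):  # test bool first: bool is a subclass of int
            literal = '"%s"^^<%sboolean>' % (str(value).lower(), XSD)
        elif isinstance(value, (int, float)) and float(value).is_integer():
            # Numbers without a fractional part become xsd:integer
            literal = '"%d"^^<%sinteger>' % (int(value), XSD)
        elif isinstance(value, str):
            literal = json.dumps(value)  # adds quotes and escapes
        else:
            raise ValueError("unsupported value for %r" % term)
        lines.append("<%s> <%s> %s ." % (subject, predicate, literal))
    return sorted(lines)


doc = json.loads("""
{
  "@context": {
    "name": "http://schema.org/name",
    "isAlive": "http://example.org/isAlive",
    "age": "http://example.org/age",
    "height": "http://schema.org/height",
    "@base": "http://www.unf.edu/chemistry/stuart_chalk.aspx"
  },
  "@id": "",
  "name": "Stuart Chalk",
  "isAlive": true,
  "age": 49,
  "height": 188.0
}
""")

ntriples = flat_jsonld_to_ntriples(doc)
for line in ntriples:
    print(line)
```

Note that `188.0` has no fractional part, so it is emitted as an integer, which matches the height triple shown on the slide.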
 Everything!
 What areas?
 Data, Results and Resources
 Models, Tools for Data Workup (Equations, Tests, Stats)
 General Workflows (Protocols and Procedures)
 The Research Story (What, Why, How)
 User discussion and annotation
 ELN usage timeline
 The Science (Area, Hypotheses, Theories)
 The People (Expertise, Provenance, Integrity, Eminence)
What Needs to be
Semantically Represented?
 The P-PLAN Ontology
http://purl.org/net/p-plan
Workflows
Implement in Kepler, Taverna, Knime?
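As a rough sketch of how a protocol might be described with P-PLAN, the snippet below builds triples for a plan and its ordered steps. It assumes the P-PLAN namespace `http://purl.org/net/p-plan#` and the terms `Plan`, `Step`, `isStepOfPlan`, and `isPrecededBy`; the `example.org` protocol IRIs, the step names, and the `plan_triples` helper are hypothetical.

```python
PPLAN = "http://purl.org/net/p-plan#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
EX = "http://example.org/protocol/"  # hypothetical namespace for the example


def plan_triples(plan_name, step_names):
    """Describe a linear protocol as (subject, predicate, object) triples."""
    plan = EX + plan_name
    triples = [(plan, RDF_TYPE, PPLAN + "Plan")]
    previous = None
    for name in step_names:
        step = plan + "/" + name
        triples.append((step, RDF_TYPE, PPLAN + "Step"))
        triples.append((step, PPLAN + "isStepOfPlan", plan))
        if previous is not None:
            # Record the ordering between consecutive steps
            triples.append((step, PPLAN + "isPrecededBy", previous))
        previous = step
    return triples


titration = plan_triples("titration", ["weigh", "dissolve", "titrate"])
for s, p, o in titration:
    print("<%s> <%s> <%s> ." % (s, p, o))
```

A workflow tool would need its own export to produce such a description; the sketch only shows the shape of the resulting statements.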
People: The VIVO-ISF Ontology
https://wiki.duraspace.org/download/attachments/51052811/PeopleOrgsRolesGrants.2014-03-14.png
 The Chemical Analysis Metadata Platform (ChAMP)
http://champ-project.org/
 Identification of metadata related to chemical
analysis and definition of an ontology to describe
terms
 Examples in both XML and JSON-LD with associated
XML Schema and JSON-LD context
 Journal Article
 Standard Method of Analysis
 Reference Material
The Science: ChAMP (an example)
ChAMP
ChAMP
 The Healthcare and Life Science (HCLS) Community Profile
is a Note from the Semantic Web HCLS Interest Group
 Access to consistent, high-quality metadata is critical to finding,
understanding, and reusing scientific data. This document
describes a consensus among participating stakeholders in the
Health Care and the Life Sciences domain on the description of
datasets using the Resource Description Framework (RDF). This
specification meets key functional requirements, reuses existing
vocabularies to the extent that it is possible, and addresses
elements of data description, versioning, provenance,
discovery, exchange, query, and retrieval.
Data Descriptions:
HCLS Community Profile
http://www.w3.org/TR/hcls-dataset/
 Describes three levels for description of datasets
 Summary Level
 Type declaration (rdf:type = dctypes:Dataset)
 Title (dct:title = rdf:langString)
 Description (dct:description = rdf:langString)
 Publisher (dct:publisher = IRI)
 Version Level
 Type declaration (rdf:type = dctypes:Dataset)
 Title (dct:title = rdf:langString)
 Description (dct:description = rdf:langString)
 Creator (dct:creator = IRI)
 Publisher (dct:publisher = IRI)
 Version identifier (pav:version = xsd:string)
 Version linking (dct:isVersionOf = IRI)
 Distribution Level
 Type declaration (rdf:type = void:Dataset OR dcat:Distribution)
 Title (dct:title = rdf:langString)
 Description (dct:description = rdf:langString)
 Creator (dct:creator = IRI)
 Publisher (dct:publisher = IRI)
 License (dct:license = IRI)
Data Descriptions:
HCLS Community Profile
http://www.w3.org/TR/hcls-dataset/#datasetdescriptionlevels
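One way an ELN could use the three levels is to check a dataset description for missing properties before publishing it. The sketch below covers only the properties listed on this slide, not the full HCLS profile, and takes the license property to be `dct:license` as in the HCLS note; the `missing_properties` helper, the draft description, and its `example.org` IRI are hypothetical.

```python
# Properties per description level, limited to those listed on the slide
HCLS_SLIDE_PROPERTIES = {
    "summary": {"rdf:type", "dct:title", "dct:description", "dct:publisher"},
    "version": {
        "rdf:type", "dct:title", "dct:description", "dct:creator",
        "dct:publisher", "pav:version", "dct:isVersionOf",
    },
    "distribution": {
        "rdf:type", "dct:title", "dct:description", "dct:creator",
        "dct:publisher", "dct:license",
    },
}


def missing_properties(description, level):
    """Return the listed properties absent from `description`, sorted."""
    return sorted(HCLS_SLIDE_PROPERTIES[level] - set(description))


# Hypothetical draft description of a dataset held in an ELN
draft = {
    "rdf:type": "dctypes:Dataset",
    "dct:title": "Titration results",
    "dct:description": "Acid-base titration data recorded in the notebook",
    "dct:publisher": "http://example.org/org/lab",
}

summary_gaps = missing_properties(draft, "summary")
version_gaps = missing_properties(draft, "version")
print(summary_gaps)
print(version_gaps)
```

The draft is complete at the summary level but would need a creator, a version identifier, and a version link before it could serve as a version-level description.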
Electronic Notebook Ontology (ENO)
ENO
ENO
 Use a generic scientific data model
 Captures data and metadata about datasets and links to related data
 JSON-LD is an ideal file format
Data and
Resources
 A specification (written in XML) that describes the
different types of information recorded during
the scientific process (http://exptml.sourceforge.net)
Experiment Markup Language (ExptML)
 Annotation
 Api
 Calculation
 Chemical
 Citation
 Communication
 Customer
 Data
 Dataset
 Definition
 Element
 Equipment
 Event
 Experiment
 Group
 Project
 Protocol
 Quote
 Report
 Result
 Sample
 Solution
 Space
 Specimen
 Substance
 Task
 Template
 Timeline
 User
 Vendor
Experimental Linked Data (ExptLD)
 Define data packets that capture the metadata of
 Resources
 Data
 Integrate with other ExptLD packets to create a SciData document
 Or convert to RDF and store in a triplestore
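The two routes described here, combining packets into one document or flattening them to triples, might look roughly like this. The sketch does not follow the actual ExptLD or SciData specifications: the `exptld` namespace, the property names, and the `combine_packets` and `to_triples` helpers are assumptions made for illustration.

```python
# Hypothetical context; the real ExptLD/SciData contexts are not reproduced here
CONTEXT = {
    "exptld": "http://example.org/exptld#",
    "ex": "http://example.org/notebook/",
}


def combine_packets(packets):
    """Merge packets that share an "@id" and wrap them in one JSON-LD document."""
    by_id = {}
    for packet in packets:
        by_id.setdefault(packet["@id"], {}).update(packet)
    return {"@context": CONTEXT, "@graph": [by_id[k] for k in sorted(by_id)]}


def to_triples(document):
    """Yield (subject, key, value) for every entry of every node except "@id"."""
    for node in document["@graph"]:
        for key, value in node.items():
            if key != "@id":
                yield (node["@id"], key, value)


packets = [
    {"@id": "ex:sample/1", "@type": "exptld:Sample", "exptld:name": "Buffer A"},
    {"@id": "ex:data/1", "@type": "exptld:Data", "exptld:measuredOn": "ex:sample/1"},
    {"@id": "ex:sample/1", "exptld:volume_mL": 25},  # later packet about the same sample
]

document = combine_packets(packets)
triples = list(to_triples(document))
print(len(document["@graph"]), len(triples))
```

Keying nodes by `@id` lets packets recorded at different times add to the same resource, which is the property that makes either route workable.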
 A lot exists to semantically represent the scientific process
that can be leveraged as part of an ELN system
 A data standard needs to be agreed upon
 Agreeing on implementation standards will take time
because of the size of the user community
 Integration and coverage of ontologies will be necessary to
fully implement a system that underpins a Global Research
Network
 Domain-specific knowledge representation is needed in many
areas
Take Home
 schalk@unf.edu
 Phone: 904-620-1938
 Skype: stuartchalk
 LinkedIn/SlideShare: https://www.linkedin.com/in/stuchalk
 ORCID: http://orcid.org/0000-0002-0703-7776
 ResearcherID: http://www.researcherid.com/rid/D-8577-2013
Questions?
