Learn to speak Open: What, Why, How
in Open Science
Iryna Kuchma, EIFL
Sarah Jones, Digital Curation Centre, University of Glasgow
OpenCon Nairobi 2016
WHAT DOES IT MEAN
TO BE OPEN?
Image by biblioteekje CC-BY-NC-SA www.flickr.com/photos/biblioteekje/3992172265
Open Science implementation – is it a wrap rage?
Image from Wikipedia https://en.wikipedia.org/wiki/Wrap_rage
What is open science?
“science carried out and communicated in a
manner which allows others to contribute,
collaborate and add to the research effort, with
all kinds of data, results and protocols made
freely available at different stages of the
research process.”
Research Information Network, Open Science case studies
www.rin.ac.uk/our-work/data-management-and-curation/
open-science-case-studies
Open methods
• Documenting and sharing workflows and methods
• Sharing code and tools to allow others to reproduce work
• Using web based tools to facilitate collaboration and
interaction from the outside world
• Open notebook science – “when there is a URL to a
laboratory notebook that is freely available and indexed
on common search engines.”
http://drexel-coas-elearning.blogspot.co.uk/2006/09/open-notebook-science.html
Open access to publications
• Free, immediate, online access to the results of research,
coupled with the right to use those results in new and
innovative ways
• Make sure anyone can access your papers
Strategies to achieve open access:
open access journals and monographs
open access repositories
Find out what your publisher allows on SHERPA RoMEO
– www.sherpa.ac.uk/romeo
Open data
 make your stuff available on the Web (whatever format) under an open licence
 make it available as structured data (e.g. Excel instead of a scan of a table)
 use non-proprietary formats (e.g. CSV instead of Excel)
 use URIs to denote things, so that people can point at your stuff
 link your data to other data to provide context
Tim Berners-Lee’s proposal for five star open data - http://5stardata.info
“Open data and content can be freely used,
modified and shared by anyone for any purpose”
http://opendefinition.org
The research workflow – a cyclic model
preparation
analysis
writingpublication
outreach
assessment discovery
preparation
analysis
writingpublicatio
n
outreach
assessmen
t
discover
y
The research workflow – a cyclic model
https://innoscholcomm.silk.co
101 Innovations in Scholarly Communication
How can libraries support changing research workflows?
Bianca Kramer & Jeroen Bosman
EIFL General Assembly, Riga, November 13, 2015
@MsPhelps
@jeroenbosman
2005
2010
Analysis
Outreach
Openness at every stage
Design
Experiment
AnalysisPublication
Release
Open science image CC BY-SA 3.0 by Greg Emmerich www.flickr.com/photos/gemmerich/6365692655
www.myexperiment.org
www.taverna.org.uk
www.labtrove.org
impactstory.org
www.datacite.org
https://github.com
www.sherpa.ac.uk/romeo
http://openwetware.org
WHY SHOULD YOU BE OPEN?
Image by wonderwebby CC-BY-NC-SA www.flickr.com/photos/wonderwebby/2723279491
It’s part of good research practice
Science as an open enterprise
https://royalsociety.org/policy/projects/science-public-enterprise/Report
“Much of the remarkable growth of
scientific understanding in recent
centuries is due to open practices; open
communication and deliberation sit at
the heart of scientific practice.”
Royal Society report calls for ‘intelligent
openness’ whereby data are accessible,
intelligible, assessable and usable.
Some benefits of openness
• You can access relevant literature – not behind pay walls
• Ensures research is transparent and reproducible
• Increased visibility, usage and impact of your work
• New collaborations and research partnerships
• Ensure long-term access to your outputs
• Help increase the efficiency of research
Saving wasted time
OA helps to reduce time spent finding/accessing material:
“If around 60 minutes were characteristic for researchers
(the average time spent trying to access the last research
article they had difficulty accessing), then in the current
environment the time spent dealing with research article
access difficulties might be costing around DKK 540
million (EUR 72 million) per year among specialist
researchers in Denmark alone.”
Access to research and technical information in Denmark,
Houghton, Swan & Brown (2011)
http://eprints.ecs.soton.ac.uk/22603
Cut down on academic fraud
www.nature.com/news/2011/111101/full/479015a.html
Validation of results
www.guardian.co.uk/politics/2013/apr/18/uncovered-error-george-osborne-austerity
“It was a mistake in a spreadsheet that could
have been easily overlooked: a few rows left
out of an equation to average the values in a
column.
The spreadsheet was used to draw the
conclusion of an influential 2010 economics
paper: that public debt of more than 90% of
GDP slows down growth. This conclusion was
later cited by the International Monetary Fund
and the UK Treasury to justify programmes
of austerity that have arguably led to riots,
poverty and lost jobs.”
Acceleration of the research process
“As more papers are deposited and more scientists
use the repository, the time between an article being
deposited and being cited has been shrinking
dramatically, year upon year. This is important for
research uptake and progress, because it means that
in this area of research, where articles are made
available at – or frequently before – publication, the
research cycle is accelerating.”
Open Access: Why should we have it? Alma Swan
www.keyperspectives.co.uk
More scientific breakthroughs
www.nytimes.com/2010/08/13/health/research/13alzheimer.html?pagewanted=all&_r=0
“It was unbelievable. Its not science
the way most of us have practiced in
our careers. But we all realised that
we would never get biomarkers unless
all of us parked our egos and
intellectual property noses outside
the door and agreed that all of our
data would be public immediately.”
Dr John Trojanowski, University of Pennsylvania
A citation advantage
A study that analysed the citation counts of 10,555 papers on gene
expression studies that created microarray data, showed:
“studies that made data available in a public repository
received 9% more citations than similar studies for
which the data was not made available”
Data reuse and the open data citation advantage,
Piwowar, H. & Vision, T. https://peerj.com/articles/175
Increased use and economic benefit
Up to 2008
• Sold through the US Geological
Survey for US$600 per scene
• Sales of 19,000 scenes per year
• Annual revenue of $11.4 million
Since 2009
• Freely available over the internet
• Google Earth now uses the images
• Transmission of 2,100,000
scenes per year.
• Estimated to have created value for
the environmental management
industry of $935 million, with direct
benefit of more than $100 million
per year to the US economy
• Has stimulated the development of
applications from a large number of
companies worldwide
The case of NASA Landsat satellite imagery of the Earth’s surface:
http://earthobservatory.nasa.gov/IOTD/view.php?id=83394&src=ve
So what needs to change?
Conclusions from Emilio Bruna:
• Develop a better system of incentives from the
community for archiving data and code
• Teach our students how to do this NOW - it’s much easier
if you develop good habits early
We need to stop telling people “You should” and get
better at telling people “Here’s how”
HOW TO PRACTICE OPEN SCIENCE
Image by Paul Downey CC-BY www.flickr.com/photos/psd/3925801816
Conducting science in the open: UsefulChem
http://usefulchem.wikispaces.com/EXP284
Collaboration & sharing: MyExperiment
www.myexperiment.org/workflows/16.html
Learn to speak open
Make a list of open access publication
options in your particular
field. Chances are you will be
surprised by the range of possibilities
Erin McKiernan
Discuss access issues with your
collaborators up front, before the
research is done and the articles
written.
Erin McKiernan
Learn to speak open
Learn to speak open
Learn to speak open
Learn to speak open
Blog about your science, and in language
that is comprehensible to non-
scientists. Doing this can ultimately
increase the impact of your work and can
even lead sometimes to press coverage
and to better press coverage.
Erin McKiernan
Be active on social media. This is the
way academic reputations are built
today, so ignoring the opportunities
presented is unwise.
Erin McKiernan
If for some reason you do publish a
closed-access article, remember that you
can self-archive a copy of your article in a
disciplinary or institutional or shared
repository.
Erin McKiernan
Learn to speak open
Deposit in your local repository!
• Speak to the library
• Consider other relevant repositories for your field too
e.g. Arxiv - http://arxiv.org
• Check OpenDOAR for examples -
http://www.opendoar.org
Learn to speak open
~78% of publishers allow authors to openly archive a version of their published
manuscript: Breakdown of archiving policies from over 2,100 publishers. Source: Data
from SHERPA/RoMEO. Accessed October 2015 and plotted by E.C. McKiernan (CC BY)
SHERPA RoMEO
Learn to speak open
Learn to speak open
Learn to speak open
1. It’ll spoil my publication chances later
Well, it might, but in a recent survey only 7% of
institutions cited this as a frequent concern
amongst their students, and no concrete
examples were found of publication being
refused because the PhD thesis had been added
to an open access repository. If reassurance is
needed, then an embargo period can be applied,
with may be the record plus abstract still being
available to all.
Learn to speak open
How to make data open?
1. Choose your dataset(s)
• What can you may open? You may need to revisit this step if you
encounter problems later.
2. Apply an open license
• Determine what IP exists. Apply a suitable licence e.g. CC-BY
3. Make the data available
• Provide the data in a suitable format. Use repositories.
4. Make it discoverable
• Post on the web, register in catalogues…
https://okfn.org
Learn to speak open
@fosterscience and #fosteropenscience
FOSTER training materials:
www.fosteropenscience.eu
Thank you!
Questions?
iryna.kuchma@eifl.net
www.eifl.net

More Related Content

PPTX
Principles and practice of Open Science
PPTX
Copyright Reform and Open Data
PPTX
ContentMining and Copyright at CopyCamp2017
PPTX
From Theory to Practice: Can Opennesss Improve the Quality of OER Research?
PPTX
Open data and Open Science
PPTX
Climate Change and Human Migration
PPT
The Future of Research (Science and Technology)
Principles and practice of Open Science
Copyright Reform and Open Data
ContentMining and Copyright at CopyCamp2017
From Theory to Practice: Can Opennesss Improve the Quality of OER Research?
Open data and Open Science
Climate Change and Human Migration
The Future of Research (Science and Technology)

What's hot (20)

PPTX
Open Data and Open Science
PPTX
Uksg Social Science
PPTX
Reward, reproducibility and recognition in research - the case for going Open
PPTX
The purpose, practicalities, pitfalls and policies of managing and sharing da...
PPTX
Is ‘Open Science’ a solution or a threat?
PPTX
Content Mining for Machines and Humans
PPTX
Open Notebook Science
PPT
What works and doesn't work in research dissemination
PPTX
The culture of researchData
PPTX
Disrupting the Publisher-Academic Complex
PPT
The Era of Open
PPTX
Social media for researchers - maximizing your personal impact
PPTX
The Content Mine (presented at UKSG)
PPTX
OpenNotebookScience NOW!
PDF
Open access impact
PPT
The role of academic libraries in supporting social sciences research
PPTX
From socially to scholarly and back again
PPTX
The value of embracing unknown unknowns
PPTX
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
PPTX
Digital Scholarship
Open Data and Open Science
Uksg Social Science
Reward, reproducibility and recognition in research - the case for going Open
The purpose, practicalities, pitfalls and policies of managing and sharing da...
Is ‘Open Science’ a solution or a threat?
Content Mining for Machines and Humans
Open Notebook Science
What works and doesn't work in research dissemination
The culture of researchData
Disrupting the Publisher-Academic Complex
The Era of Open
Social media for researchers - maximizing your personal impact
The Content Mine (presented at UKSG)
OpenNotebookScience NOW!
Open access impact
The role of academic libraries in supporting social sciences research
From socially to scholarly and back again
The value of embracing unknown unknowns
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Digital Scholarship

Similar to Learn to speak open (20)

PPTX
Benefits and practice of open science
PPTX
Open science and its advocacy
PPTX
Open sciencerefresher2019
PPTX
Open science / open research
PPTX
Understanding Open Science: Definitions and framework
PPTX
FORCE2019 Research Comms Conference
PPT
Open access for researchers, policy makers and research managers - Short ver...
PPTX
Open Science
PDF
The changing landscape of scholarly communication: presentation to the NFAIS ...
PPTX
How to do research in today’s digital environment
PDF
Digital Resources for Open Science
PPTX
Winning Horizon 2020 with Open Science
PPTX
The FOSTER project - general overview
PDF
Open Data - strategies for research data management & impact of best practices
PDF
The OpenCon Intro to Open Data
PPT
Open access for researchers and students, research managers and publishers
PPTX
CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014
PPT
Open access for researchers, policy makers and research managers, libraries
PPTX
Bw dave pattern lidp
PPT
Acting as Advocate? Seven steps for libraries in the data decade
Benefits and practice of open science
Open science and its advocacy
Open sciencerefresher2019
Open science / open research
Understanding Open Science: Definitions and framework
FORCE2019 Research Comms Conference
Open access for researchers, policy makers and research managers - Short ver...
Open Science
The changing landscape of scholarly communication: presentation to the NFAIS ...
How to do research in today’s digital environment
Digital Resources for Open Science
Winning Horizon 2020 with Open Science
The FOSTER project - general overview
Open Data - strategies for research data management & impact of best practices
The OpenCon Intro to Open Data
Open access for researchers and students, research managers and publishers
CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014
Open access for researchers, policy makers and research managers, libraries
Bw dave pattern lidp
Acting as Advocate? Seven steps for libraries in the data decade

Recently uploaded (20)

PDF
Book Trusted Companions in Delhi – 24/7 Available Delhi Personal Meeting Ser...
PDF
Mcdonald's : a half century growth . pdf
PPTX
research framework and review of related literature chapter 2
PPTX
Capstone Presentation a.pptx on data sci
PDF
toaz.info-grade-11-2nd-quarter-earth-and-life-science-pr_5360bfd5a497b75f7ae4...
PDF
Buddhism presentation about world religion
PPTX
Chapter security of computer_8_v8.1.pptx
PPT
2011 HCRP presentation-final.pptjrirrififfi
PPTX
Transport System for Biology students in the 11th grade
PPTX
1.Introduction to orthodonti hhhgghhcs.pptx
PDF
Lesson 1 - intro Cybersecurity and Cybercrime.pptx.pdf
PPTX
Sistem Informasi Manejemn-Sistem Manajemen Database
PPTX
Power BI - Microsoft Power BI is an interactive data visualization software p...
PDF
Hikvision-IR-PPT---EN.pdfSADASDASSAAAAAAAAAAAAAAA
PPTX
Stats annual compiled ipd opd ot br 2024
PPT
Classification methods in data analytics.ppt
PPTX
9 Bioterrorism.pptxnsbhsjdgdhdvkdbebrkndbd
PPTX
cardiac failure and associated notes.pptx
PDF
9 FinOps Tools That Simplify Cloud Cost Reporting.pdf
PDF
PPT IEPT 2025_Ms. Nurul Presentation 10.pdf
Book Trusted Companions in Delhi – 24/7 Available Delhi Personal Meeting Ser...
Mcdonald's : a half century growth . pdf
research framework and review of related literature chapter 2
Capstone Presentation a.pptx on data sci
toaz.info-grade-11-2nd-quarter-earth-and-life-science-pr_5360bfd5a497b75f7ae4...
Buddhism presentation about world religion
Chapter security of computer_8_v8.1.pptx
2011 HCRP presentation-final.pptjrirrififfi
Transport System for Biology students in the 11th grade
1.Introduction to orthodonti hhhgghhcs.pptx
Lesson 1 - intro Cybersecurity and Cybercrime.pptx.pdf
Sistem Informasi Manejemn-Sistem Manajemen Database
Power BI - Microsoft Power BI is an interactive data visualization software p...
Hikvision-IR-PPT---EN.pdfSADASDASSAAAAAAAAAAAAAAA
Stats annual compiled ipd opd ot br 2024
Classification methods in data analytics.ppt
9 Bioterrorism.pptxnsbhsjdgdhdvkdbebrkndbd
cardiac failure and associated notes.pptx
9 FinOps Tools That Simplify Cloud Cost Reporting.pdf
PPT IEPT 2025_Ms. Nurul Presentation 10.pdf

Learn to speak open

  • 1. Learn to speak Open: What, Why, How in Open Science Iryna Kuchma, EIFL Sarah Jones, Digital Curation Centre, University of Glasgow OpenCon Nairobi 2016
  • 2. WHAT DOES IT MEAN TO BE OPEN? Image by biblioteekje CC-BY-NC-SA www.flickr.com/photos/biblioteekje/3992172265
  • 3. Open Science implementation – is it a wrap rage? Image from Wikipedia https://en.wikipedia.org/wiki/Wrap_rage
  • 4. What is open science? “science carried out and communicated in a manner which allows others to contribute, collaborate and add to the research effort, with all kinds of data, results and protocols made freely available at different stages of the research process.” Research Information Network, Open Science case studies www.rin.ac.uk/our-work/data-management-and-curation/ open-science-case-studies
  • 5. Open methods • Documenting and sharing workflows and methods • Sharing code and tools to allow others to reproduce work • Using web based tools to facilitate collaboration and interaction from the outside world • Open notebook science – “when there is a URL to a laboratory notebook that is freely available and indexed on common search engines.” http://drexel-coas-elearning.blogspot.co.uk/2006/09/open-notebook-science.html
  • 6. Open access to publications • Free, immediate, online access to the results of research, coupled with the right to use those results in new and innovative ways • Make sure anyone can access your papers Strategies to achieve open access: open access journals and monographs open access repositories Find out what your publisher allows on SHERPA RoMEO – www.sherpa.ac.uk/romeo
  • 7. Open data  make your stuff available on the Web (whatever format) under an open licence  make it available as structured data (e.g. Excel instead of a scan of a table)  use non-proprietary formats (e.g. CSV instead of Excel)  use URIs to denote things, so that people can point at your stuff  link your data to other data to provide context Tim Berners-Lee’s proposal for five star open data - http://5stardata.info “Open data and content can be freely used, modified and shared by anyone for any purpose” http://opendefinition.org
  • 8. The research workflow – a cyclic model preparation analysis writingpublication outreach assessment discovery
  • 10. https://innoscholcomm.silk.co 101 Innovations in Scholarly Communication How can libraries support changing research workflows? Bianca Kramer & Jeroen Bosman EIFL General Assembly, Riga, November 13, 2015 @MsPhelps @jeroenbosman
  • 12. Openness at every stage Design Experiment AnalysisPublication Release Open science image CC BY-SA 3.0 by Greg Emmerich www.flickr.com/photos/gemmerich/6365692655 www.myexperiment.org www.taverna.org.uk www.labtrove.org impactstory.org www.datacite.org https://github.com www.sherpa.ac.uk/romeo http://openwetware.org
  • 13. WHY SHOULD YOU BE OPEN? Image by wonderwebby CC-BY-NC-SA www.flickr.com/photos/wonderwebby/2723279491
  • 14. It’s part of good research practice
  • 15. Science as an open enterprise https://royalsociety.org/policy/projects/science-public-enterprise/Report “Much of the remarkable growth of scientific understanding in recent centuries is due to open practices; open communication and deliberation sit at the heart of scientific practice.” Royal Society report calls for ‘intelligent openness’ whereby data are accessible, intelligible, assessable and usable.
  • 16. Some benefits of openness • You can access relevant literature – not behind pay walls • Ensures research is transparent and reproducible • Increased visibility, usage and impact of your work • New collaborations and research partnerships • Ensure long-term access to your outputs • Help increase the efficiency of research
  • 17. Saving wasted time OA helps to reduce time spent finding/accessing material: “If around 60 minutes were characteristic for researchers (the average time spent trying to access the last research article they had difficulty accessing), then in the current environment the time spent dealing with research article access difficulties might be costing around DKK 540 million (EUR 72 million) per year among specialist researchers in Denmark alone.” Access to research and technical information in Denmark, Houghton, Swan & Brown (2011) http://eprints.ecs.soton.ac.uk/22603
  • 18. Cut down on academic fraud www.nature.com/news/2011/111101/full/479015a.html
  • 19. Validation of results www.guardian.co.uk/politics/2013/apr/18/uncovered-error-george-osborne-austerity “It was a mistake in a spreadsheet that could have been easily overlooked: a few rows left out of an equation to average the values in a column. The spreadsheet was used to draw the conclusion of an influential 2010 economics paper: that public debt of more than 90% of GDP slows down growth. This conclusion was later cited by the International Monetary Fund and the UK Treasury to justify programmes of austerity that have arguably led to riots, poverty and lost jobs.”
  • 20. Acceleration of the research process “As more papers are deposited and more scientists use the repository, the time between an article being deposited and being cited has been shrinking dramatically, year upon year. This is important for research uptake and progress, because it means that in this area of research, where articles are made available at – or frequently before – publication, the research cycle is accelerating.” Open Access: Why should we have it? Alma Swan www.keyperspectives.co.uk
  • 21. More scientific breakthroughs www.nytimes.com/2010/08/13/health/research/13alzheimer.html?pagewanted=all&_r=0 “It was unbelievable. Its not science the way most of us have practiced in our careers. But we all realised that we would never get biomarkers unless all of us parked our egos and intellectual property noses outside the door and agreed that all of our data would be public immediately.” Dr John Trojanowski, University of Pennsylvania
  • 22. A citation advantage A study that analysed the citation counts of 10,555 papers on gene expression studies that created microarray data, showed: “studies that made data available in a public repository received 9% more citations than similar studies for which the data was not made available” Data reuse and the open data citation advantage, Piwowar, H. & Vision, T. https://peerj.com/articles/175
  • 23. Increased use and economic benefit Up to 2008 • Sold through the US Geological Survey for US$600 per scene • Sales of 19,000 scenes per year • Annual revenue of $11.4 million Since 2009 • Freely available over the internet • Google Earth now uses the images • Transmission of 2,100,000 scenes per year. • Estimated to have created value for the environmental management industry of $935 million, with direct benefit of more than $100 million per year to the US economy • Has stimulated the development of applications from a large number of companies worldwide The case of NASA Landsat satellite imagery of the Earth’s surface: http://earthobservatory.nasa.gov/IOTD/view.php?id=83394&src=ve
  • 24. So what needs to change? Conclusions from Emilio Bruna: • Develop a better system of incentives from the community for archiving data and code • Teach our students how to do this NOW - it’s much easier if you develop good habits early We need to stop telling people “You should” and get better at telling people “Here’s how”
  • 25. HOW TO PRACTICE OPEN SCIENCE Image by Paul Downey CC-BY www.flickr.com/photos/psd/3925801816
  • 26. Conducting science in the open: UsefulChem http://usefulchem.wikispaces.com/EXP284
  • 27. Collaboration & sharing: MyExperiment www.myexperiment.org/workflows/16.html
  • 29. Make a list of open access publication options in your particular field. Chances are you will be surprised by the range of possibilities Erin McKiernan
  • 30. Discuss access issues with your collaborators up front, before the research is done and the articles written. Erin McKiernan
  • 35. Blog about your science, and in language that is comprehensible to non- scientists. Doing this can ultimately increase the impact of your work and can even lead sometimes to press coverage and to better press coverage. Erin McKiernan
  • 36. Be active on social media. This is the way academic reputations are built today, so ignoring the opportunities presented is unwise. Erin McKiernan
  • 37. If for some reason you do publish a closed-access article, remember that you can self-archive a copy of your article in a disciplinary or institutional or shared repository. Erin McKiernan
  • 39. Deposit in your local repository! • Speak to the library • Consider other relevant repositories for your field too e.g. Arxiv - http://arxiv.org • Check OpenDOAR for examples - http://www.opendoar.org
  • 41. ~78% of publishers allow authors to openly archive a version of their published manuscript: Breakdown of archiving policies from over 2,100 publishers. Source: Data from SHERPA/RoMEO. Accessed October 2015 and plotted by E.C. McKiernan (CC BY)
  • 46. 1. It’ll spoil my publication chances later Well, it might, but in a recent survey only 7% of institutions cited this as a frequent concern amongst their students, and no concrete examples were found of publication being refused because the PhD thesis had been added to an open access repository. If reassurance is needed, then an embargo period can be applied, with may be the record plus abstract still being available to all.
  • 48. How to make data open? 1. Choose your dataset(s) • What can you may open? You may need to revisit this step if you encounter problems later. 2. Apply an open license • Determine what IP exists. Apply a suitable licence e.g. CC-BY 3. Make the data available • Provide the data in a suitable format. Use repositories. 4. Make it discoverable • Post on the web, register in catalogues… https://okfn.org
  • 50. @fosterscience and #fosteropenscience FOSTER training materials: www.fosteropenscience.eu

Editor's Notes

  • #3: Initially I want to consider what it means to be open
  • #5: The Research Information Network ran a series of open science case studies a few years ago to look at openness across different disciplines. These included astronomy, bioinformatics, chemistry, clinical neuroimaging, epidemiology and language technology. Through this they came up with the definition. They defined open science as … The first aspect is about conducting science in an open way to encourage others to engage with it, and also sharing the outputs at various stages. I’ll unpack this definition over the next few slides
  • #6: Part of open science is working openly – having open methods. This involves documenting workflows and procedures so others can find and reuse your methods, sharing code and tools so others can reproduce your work, and using web based tools such as blogs and wikis to facilitate collaboration and interaction with others. You may come across the terms ‘open notebook science’. This was coined by Jean-Claude Bradley. It’s about having an online labnotebook that others can find and follow.
  • #7: Open science is also about providing open access to publications. This is well-established concept now, so one you’re probably familiar with. Open access is about ensuring free, immediate, online access to the results of research. There are two main routes to ensuring open access – publishing in open access journals self-archiving a copy (usually a pre-print version) in a repository. You can see what your publisher allows via Sherpa RoMEO – search for the journal title and it will tell you if there are gold, paid-for options, or what is permitted in terms of green routes (i.e. what version can be archived and under what embargo)
  • #8: Open science is also about making data available. Open data is data that can be used, modified and shared by anyone for any purpose. So for example it can be mined, exploited, reproduced, commercialised... Data under a non-commercial license wouldn’t be open as you’re restricting how it can be used and by whom. There are different levels of openness. Making content available on the web, in any format, under an open license is open data. It’s not necessarily that reusable though. By making it available as structured data, in non-proprietary formats, using URIs and linking to other data for context, makes it much more valuable.
  • #11: Motiverend en voor ons belangrijk, afgelopen 20 jaar en vooral laatste 5-10 jaar meer in onderzoek/publiceren veranderd dan 350 jaar daarvoor
  • #13: So we’re talking about being open at every stage, right from designing your research to conducting experiments and analysis, publishing results and releasing the associated outputs. There are tools that can help at various stages and I’ve listed a few here. Taverna for example, help with documenting workflows, while MyExperiment is a VLE that allows you to share workflows and collaborate. LabTrove is a blogging platform that integrates with instruments so you can automatically publish documentation, and OpenWetWare is a wiki to share information, know-how, and research protocols in biology & biological engineering. GitHub is a code repository to handle versioning and allow others to collaborate / reuse code. RoMEO is a lit of journal policies, re3data.org is a list of data repositories and DataCite and Impact story help you to track the citation, reuse and impact of your work.
  • #14: In the second half of the talk I’ve focus on reasons why you should be open – specifically the benefits and rewards
  • #15: I like the quote from Ewan Birney that it was never acceptable to publish papers without making data available. If you can evidence and validate your findings, how do others know that they’re right?
  • #16: The Royal Society report ‘Science as an Open Enterprise’ makes a similar point. This emphasises that much of the growth of scientific understanding is due to open practices. Being open about your work and encouraging feedback from others is at the heart of scientific practice. The report calls for ‘intelligent openness’ – data shouldn’t just be accessible, they need to be intelligible by others so they can assess and reuse them.
  • #17: What are the benefits to openness? When publications are open, it means you can access all the relevant literature regardless of whether you have a subscription or not. Open sharing also means that research is transparant and reproducible – you can validate results By sharing you will increase your visibility and the reuse of your work, which in turn leads to more collaboration and partnerships By depositing in repositories, you also safeguard your outputs since these are properly managed services that will curate your papers and data Sharing in general also makes research more efficient – it’s quicker and new discoveries are made faster
  • #18: There was a study done in Denmark on how open access saves time. They looked at the average time wasted looking for articles that were difficult to locate or gain access to (average = 60 mins per article), and extrapoloated this up to see what it was costing the sector annually. The estimate is €72 million in Denmark alone.
  • #19: Sharing data also cuts down on academic fraud. You’ve probably come across the case of Diederik Stapel – a former professor of social psychology at Tilberg University in the Netherlands. He had literally been making up data to support his hypotheses and various headlines had been picked up by the media e.g. meat eaters are more selfish than vegetarians. Some students questioned the results and the whole story unravelled, showing this had been going on for years. What’s troubling is that this had gone unnoticed. The data weren’t made available and checked, so such a history of fraud could develop.
  • #20: The validation of results is key. This article references an inadvertent error in a economics paper by Reinhadt and Rogoff. Missing some rows out of an average gave drastically different results – what was published suggested that countries with 90% debt ratios see their economies shrink by 0.1%. Instead, it should have found that they grow by 2.2% – less than those with lower debt ratios, but not a spiralling collapse. This mistake wasn’t picked up on initially as the data hadn’t been shared. The mistake fed into government policy - the findings of this paper were used as justification for austerity measures in the UK and various other countries in the EU.
  • #21: To look at things in a more positive light, this study shows how open access publishing accelerates the research process. It looks at citations rates for articles deposited in arXiv – a preprints repository in maths, physics, astronomy, computer science etc The time lag between deposit and citation of an article has been shrinking drastically year on year. Nowadays the research cycle in High-Energy Physics (HEP) is approaching maximum efficiency as a result of the early and free availability of articles that scientists in the field can use and build upon rapidly.
  • #22: Certain research communities have also seen the benefit of sharing data as it speeds up the process of discovery. This article shows how researchers in the field of Alzheimer’s research have agreed as a community to share data immediately to make scientific breakthroughs.
  • #23: There’s also a citation advantage for individual researchers. This study by Heather Piwowar and Todd Vision looked at 10,555 paper of gene expression studies that had shared the associated microarray data. Those studies that shared data received 9% more citations.
  • #24: There’s also an economic benefit, as seen by the case of the NASA landsat satellite images. These were sold until 2008 for $600 a scene. Now they’re freely available and used by Google Earth. Previously they sold 19,000 images a year, whereas now they transmit 2.1 million. The revenue has gone up incredibly too from $11.4 million to an estimated value of $935 million with direct benefit of more than $100 million. The release has also stimulated the development of applications from companies worldwide. This case study comes from the Royal Society Report on Science as an Open Enterprise.
  • #25: He suggests what needs to change and I think this is quite pertinent for today. We need a better system of incentives so you get more tangible returns from openness – better chances on grant applications, more likelihood of promotion… This is likely to evolve during your careers. If we teach people what to do now, it will become part of your processes and be easier in the long run. The learning curve won’t be as steep. We also need to focus not on why you should be open, but how – and that’s what today’s presentations should give you.
  • #27: I mentioned earlier about conducting science in the open. UsefulChem is an example of that. It’s a wiki platform where chemists can document their experiments. It’s effectively an open labnotebook. You can see here that the researcher has shared his aims, procedures, results, conclusion and a log of activities. There is also an option to comment.
  • #28: My Experiment is like a Virtual Research Environment. You can share scientific workflows that others can comment on and reuse. You can see in the sidebar that it’s clear how the workflow is licensed and who credit should be given to if reusing. There are also a number of social media features too e.g. ratings, favourites and stats for views and downloads.
  • #40: Speak to your library as there is usually an institutional repository you can use. There may also be relevant repositories given your field e.g. Arxiv for maths, computing science, physics etc.
  • #43: This is the SHERPA RoMEO service. You can search for your journal title and then see what is allowed. In this example, the author can archive a pre-print but not the publisher’s final version, and certain restrictions apply e.g. a 6 month embargo.
  • #49: The final aspect to discuss is open data. The Open Knowledge Foundation suggests four simple steps. First off you need to decide what to share. Not all data can be made openly available due to commercial restrictions or sensitivities. Once you’ve decided what to share, determine what IP exists and apply a suitable licence. You should then make the data available in a suitable format so others can bulk download it. Remember that for it to be useful you want to share appropriate metadata and documentation too. Using repositories is useful to make sure your data are properly managed and preserved for the long-term. The final step is to make your data discoverable. Put it online, tell others about it and add details to various registries so it gets found.