0% found this document useful (0 votes)
19 views98 pages

Conference On Technology Enhanced Learning Ec Tel 2020 Heidelberg Germany September 14 18 2020 Proceedings Carlos Alario Hoyos

Complete syllabus material: Addressing Global Challenges and Quality Education 15th European Conference on Technology Enhanced Learning EC TEL 2020 Heidelberg Germany September 14 18 2020 Proceedings Carlos Alario-HoyosAvailable now. Covers essential areas of study with clarity, detail, and educational integrity.

Uploaded by

qdhkyvh567
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
19 views98 pages

Conference On Technology Enhanced Learning Ec Tel 2020 Heidelberg Germany September 14 18 2020 Proceedings Carlos Alario Hoyos

Complete syllabus material: Addressing Global Challenges and Quality Education 15th European Conference on Technology Enhanced Learning EC TEL 2020 Heidelberg Germany September 14 18 2020 Proceedings Carlos Alario-HoyosAvailable now. Covers essential areas of study with clarity, detail, and educational integrity.

Uploaded by

qdhkyvh567
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 98

Addressing Global Challenges and Quality

Education 15th European Conference on Technology


Enhanced Learning EC TEL 2020 Heidelberg Germany
September 14 18 2020 Proceedings Carlos Alario-
Hoyos pdf download
https://textbookfull.com/product/addressing-global-challenges-and-quality-education-15th-european-
conference-on-technology-enhanced-learning-ec-tel-2020-heidelberg-germany-
september-14-18-2020-proceedings-carlos-alario-hoyos/
★★★★★ 4.8/5.0 (39 reviews) ✓ 187 downloads ■ TOP RATED
"Fantastic PDF quality, very satisfied with download!" - Emma W.

DOWNLOAD EBOOK
Addressing Global Challenges and Quality Education 15th
European Conference on Technology Enhanced Learning EC TEL
2020 Heidelberg Germany September 14 18 2020 Proceedings
Carlos Alario-Hoyos pdf download

TEXTBOOK EBOOK TEXTBOOK FULL

Available Formats

■ PDF eBook Study Guide TextBook

EXCLUSIVE 2025 EDUCATIONAL COLLECTION - LIMITED TIME

INSTANT DOWNLOAD VIEW LIBRARY


Collection Highlights

Lifelong Technology-Enhanced Learning: 13th European


Conference on Technology Enhanced Learning, EC-TEL 2018,
Leeds, UK, September 3-5, 2018, Proceedings Viktoria
Pammer-Schindler

Adaptive and Adaptable Learning 11th European Conference


on Technology Enhanced Learning EC TEL 2016 Lyon France
September 13 16 2016 Proceedings 1st Edition Katrien
Verbert

Transforming Learning with Meaningful Technologies 14th


European Conference on Technology Enhanced Learning EC TEL
2019 Delft The Netherlands September 16 19 2019
Proceedings Maren Scheffel

Software Architecture 14th European Conference ECSA 2020 L


Aquila Italy September 14 18 2020 Proceedings Anton Jansen
Computer Security ESORICS 2020 25th European Symposium on
Research in Computer Security ESORICS 2020 Guildford UK
September 14 18 2020 Proceedings Part II Liqun Chen

Software Engineering and Formal Methods: 18th


International Conference, SEFM 2020, Amsterdam, The
Netherlands, September 14–18, 2020, Proceedings Frank De
Boer

Computer Vision ECCV 2018 15th European Conference Munich


Germany September 8 14 2018 Proceedings Part III Vittorio
Ferrari

Computer Vision ECCV 2018 15th European Conference Munich


Germany September 8 14 2018 Proceedings Part IV Vittorio
Ferrari

Computer Vision ECCV 2018 15th European Conference Munich


Germany September 8 14 2018 Proceedings Part VII Vittorio
Ferrari
Carlos Alario-Hoyos
María Jesús Rodríguez-Triana
Maren Scheffel
Inmaculada Arnedillo-Sánchez
Sebastian Maximilian Dennerlein (Eds.)

Addressing
LNCS 12315

Global Challenges
and Quality Education
15th European Conference
on Technology Enhanced Learning, EC-TEL 2020
Heidelberg, Germany, September 14–18, 2020
Proceedings
Lecture Notes in Computer Science 12315

Founding Editors
Gerhard Goos
Karlsruhe Institute of Technology, Karlsruhe, Germany
Juris Hartmanis
Cornell University, Ithaca, NY, USA

Editorial Board Members


Elisa Bertino
Purdue University, West Lafayette, IN, USA
Wen Gao
Peking University, Beijing, China
Bernhard Steffen
TU Dortmund University, Dortmund, Germany
Gerhard Woeginger
RWTH Aachen, Aachen, Germany
Moti Yung
Columbia University, New York, NY, USA
More information about this series at http://www.springer.com/series/7409
Carlos Alario-Hoyos María Jesús Rodríguez-Triana
• •

Maren Scheffel Inmaculada Arnedillo-Sánchez


• •

Sebastian Maximilian Dennerlein (Eds.)

Addressing
Global Challenges
and Quality Education
15th European Conference
on Technology Enhanced Learning, EC-TEL 2020
Heidelberg, Germany, September 14–18, 2020
Proceedings

123
Editors
Carlos Alario-Hoyos María Jesús Rodríguez-Triana
Universidad Carlos III de Madrid Tallinn University
Leganés (Madrid), Spain Tallinn, Estonia
Maren Scheffel Inmaculada Arnedillo-Sánchez
Open University Netherlands Trinity College Dublin
Heerlen, The Netherlands Dublin, Ireland
Sebastian Maximilian Dennerlein
Graz University of Technology
and Know-Center
Graz, Austria

ISSN 0302-9743 ISSN 1611-3349 (electronic)


Lecture Notes in Computer Science
ISBN 978-3-030-57716-2 ISBN 978-3-030-57717-9 (eBook)
https://doi.org/10.1007/978-3-030-57717-9

LNCS Sublibrary: SL3 – Information Systems and Applications, incl. Internet/Web, and HCI

© Springer Nature Switzerland AG 2020


Chapters 6, 10, 15 and 48 are licensed under the terms of the Creative Commons Attribution 4.0 International
License (http://creativecommons.org/licenses/by/4.0/). For further details see licence information in the
chapters.
This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the
material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation,
broadcasting, reproduction on microfilms or in any other physical way, and transmission or information
storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now
known or hereafter developed.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication
does not imply, even in the absence of a specific statement, that such names are exempt from the relevant
protective laws and regulations and therefore free for general use.
The publisher, the authors and the editors are safe to assume that the advice and information in this book are
believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors
give a warranty, expressed or implied, with respect to the material contained herein or for any errors or
omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in
published maps and institutional affiliations.

This Springer imprint is published by the registered company Springer Nature Switzerland AG
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland
Preface

Welcome to the proceedings of the 15th European Conference on Technology


Enhanced Learning (EC-TEL 2020), one of the flagship events of the European
Association of Technology Enhanced Learning (EATEL). In addition to the social and
economic crisis, the pandemic has led to a historic inflection point in education. The
impact of COVID-19 worldwide and the closure of schools and universities has led to
numerous changes in how, where and with which tools we learn, and how education
can take place at a distance. One of these changes is that lockdowns triggered the use of
Technology-Enhanced Learning (TEL) at a massive level and throughout the world due
to the urgent need to transform face-to-face into online classes. Such circumstances
pose new challenges and research questions that our community must address.
Looking back, the theme of EC-TEL 2020 (Addressing Global Challenges and
Quality Education) could not have been more timely with COVID-19 affecting edu-
cation and its quality all over the world. Thus, the Sustainable Development Goals of
Quality Education and Reduced Inequalities, as defined in the United Nations Agenda
2030, are more essential than ever. In a global world of great contrasts, we are all
suffering the consequences of this unexpected pandemic, and it is paramount to find
global solutions to provide equitable quality education and promote lifelong learning
opportunities for all. This not only requires improving student performance and success
in general but also supporting under-represented groups and those disadvantaged by
inequality. In this challenging context, the active role of the TEL community may be
crucial to pursue the aforementioned goals.
Another important change this year is that EC-TEL 2020 will be held online for the
first time in its 15-year history. Although EC-TEL 2020 was initially planned to take
place in Heidelberg, Germany, local health authorities advised against organizing large
events. What was to be a major liability has been transformed into a powerful expe-
rience in which to continue learning and evolving as a community thanks to the
researchers who have continued to rely on this conference to submit and present their
contributions. This year, 91 contributions were received despite the difficulties of some
researchers to carry out developments and studies as well as reporting them. All
of these contributions were reviewed by three members of the TEL community, who
also had follow-up discussions to agree on a meta-review. As a result, 24 research
papers (26.4%) were accepted and presented at the conference. This shows the high
competitiveness and quality of this conference year after year. In addition, 20 posters, 5
demos, and 8 impact papers were presented during the conference to fuel the discus-
sions among the researchers. Research papers, posters, and demos can be found in this
volume, while impact papers are published in companion proceedings via CEUR.
EC-TEL 2020 was co-located with another major conference in Germany, the 18th
Fachtagung Bildungstechnologien der GI Fachgrupper Bildungstechnologien (DELFI
2020). This helped create partnerships between the two conferences and allowed
attendees to exchange ideas and benefit from presentations and workshops that were
vi Preface

offered in both conferences. As an example, 14 workshops and 4 keynotes (Linda


Castañeda, Jens Mönig, Samuel Greiff, and Sabine Seufert) were co-located between
EC-TEL 2020 and DELFI 2020.
The last words are words of gratitude. Thanks to the researchers who sent their
contributions to EC-TEL 2020. Thanks to the members of the Programme Committee
who devoted their time to give feedback to authors and supported decision making on
paper acceptance. Finally, deep thanks to the local organizers, Marco Kalz and Joshua
Weidlich, who have worked very hard to host this conference online with the same
quality as in previous years.

July 2020 Carlos Alario-Hoyos


María Jesús Rodríguez-Triana
Maren Scheffel
Inmaculada Arnedillo-Sánchez
Sebastian Dennerlein
Tracie Farrell Frey
Tom Broos
Zacharoula Papamitsiou
Adolfo Ruiz Calleja
Kairit Tammets
Christian Glahn
Marco Kalz
Joshua Weidlich
EC-TEL 2020 Chairs
Organization

Program Committee
Marie-Helene Abel Université de Technologie de Compiègne, France
Andrea Adamoli Università della Svizzera Italiana, Switzerland
Nora Ayu Ahmad Uzir University of Edinburgh, UK
Gokce Akcayir University of Alberta, Canada
Carlos Alario-Hoyos Universidad Carlos III de Madrid, Spain
Patricia Albacete University of Pittsburgh, USA
Dietrich Albert University of Graz, Austria
Laia Albó Universitat Pompeu Fabra, Spain
Liaqat Ali Simon Fraser University, Canada
Ishari Amarasinghe Universitat Pompeu Fabra, Spain
Boyer Anne LORIA – KIWI, France
Alessandra Antonaci European Association of Distance Teaching
Universities, The Netherlands
Roberto Araya Universidad de Chile, Chile
Inmaculada Trinity College Dublin, Ireland
Arnedillo-Sanchez
Juan I. Asensio-Pérez Universidad de Valladolid, Spain
Antonio Balderas University of Cádiz, Spain
Nicolas Ballier Université de Paris, France
Jordan Barria-Pineda University of Pittsburgh, USA
Jason Bernard University of Saskatchewan, Canada
Anis Bey University of Paul Sabatier, IRIT, France
Miguel L. Bote-Lorenzo Universidad de Valladolid, Spain
François Bouchet Sorbonne Université, France
Yolaine Bourda LRI, CentraleSupélec, France
Bert Bredeweg University of Amsterdam, The Netherlands
Andreas Breiter Universität Bremen, Germany
Julien Broisin Université Toulouse 3 Paul Sabatier, IRIT, France
Tom Broos KU Leuven, Belgium
Armelle Brun LORIA, Université Nancy 2, France
Daniela Caballero Universidad de Chile, Chile
Manuel Caeiro Rodríguez University of Vigo, Spain
Sven Charleer KU Leuven, Belgium
Mohamed Amine Chatti University of Duisburg-Essen, Germany
John Cook University of West of England, UK
Audrey Cooke Curtin University, Australia
Alessia Coppi SFIVET, Switzerland
Mihai Dascalu University Politehnica of Bucharest, Romania
viii Organization

Peter de Lange RWTH Aachen University, Germany


Felipe de Morais UNISINOS, Brazil
Carrie Demmans Epp University of Alberta, Canada
Sebastian Dennerlein Know-Center, Austria
Michael Derntl University of Tübingen, Germany
Philippe Dessus Université Grenoble Alpes, LaRAC, France
Daniele Di Mitri Open Universiteit, The Netherlands
Yannis Dimitriadis Universidad de Valladolid, Spain
Monica Divitini Norwegian University of Science and Technology,
Norway
Juan Manuel Dodero Universidad de Cádiz, Spain
Raymond Elferink RayCom BV, The Netherlands
Erkan Er Universidad de Valladolid, Spain
Maka Eradze University of Modena and Reggio Emilia, Italy
Iria Estévez-Ayres Universidad Carlos III de Madrid, Spain
Juan Carlos Farah École Polytechnique Fédérale de Lausanne,
Switzerland
Tracie Farrell The Open University, UK
Louis Faucon École Polytechnique Fédérale de Lausanne,
Switzerland
Baltasar Fernandez-Manjon Universidad Complutense de Madrid, Spain
Carmen Universidad Carlos III de Madrid, Spain
Fernández-Panadero
Angela Fessl Know-Center, Austria
Anna Filighera Technical University of Darmstadt, Multimedia
Communications Lab, Germany
Olga Firssova WELTEN Institute, Open Universiteit,
The Netherlands
Mikhail Fominykh Norwegian University of Science and Technology,
Norway
Felix J. Garcia Clemente Universidad de Murcia, Spain
Jesús Miguel Universidad de la Sierra, Mexico
García-Gorrostieta
Serge Garlatti IMT Atlantique, France
Dragan Gasevic Monash University, Australia
Sébastien George LIUM, Le Mans Université, France
Carlo Giovannella University of Tor Vergata, Italy
Christian Glahn LDE CEL, The Netherlands
Samuel González-López Technological University of Nogales, Mexico
Sabine Graf Athabasca University, Canada
Monique Grandbastien LORIA, Université de Lorraine, France
Wolfgang Greller Vienna University of Education, Austria
David Griffiths University of Bolton, UK
Nathalie Guin Université de Lyon, France
Bernhard Göschlberger Research Studios, Austria
Organization ix

Franziska Günther TU Dresden, Germany


Christian Gütl Graz University of Technology, Austria
Carolin Hahnel DIPF, Leibniz Institute for Research and Information
in Education, Germany
Stuart Hallifax Laboratoire d’InfoRmatique en Image et Systèmes
d'information, France
Bastiaan Heeren Open Universiteit, The Netherlands
Maartje Henderikx Open Universiteit, The Netherlands
Davinia Hernandez-Leo Universitat Pompeu Fabra, Spain
Ángel Hernández-García Universidad Politécnica de Madrid, Spain
Tore Hoel Oslo Metropolitan University, Norway
Adrian Holzer University of Neuchâtel, Switzerland
Pasquale Iero The Open University, UK
Francisco Iniesto The Open University, UK
Zeeshan Jan The Open University, UK
Patricia Jaques PPGCA, UNISINOS, Brazil
Johan Jeuring Utrecht University and Open Universiteit,
The Netherlands
Ioana Jivet Open Universiteit, The Netherlands
Srecko Joksimovic University of South Australia, Australia
Ilkka Jormanainen University of Eastern Finland, Finland
Jelena Jovanovic University of Belgrade, Serbia
Ken Kahn University of Oxford, UK
Rogers Kaliisa University of Oslo, Norway
Marco Kalz PH Heidelberg, Germany
Anastasios Karakostas Aristotle University of Thessaloniki, Greece
Reet Kasepalu Tallinn University, Estonia
Mohammad Khalil University of Bergen, Norway
Michael Kickmeier-Rust Graz University of Technology, Austria
Ralf Klamma RWTH Aachen University, Germany
Roland Klemke Open Universiteit, The Netherlands
Tomaž Klobučar Jozef Stefan Institute, Slovenia
Carolien Knoop-Van Radboud University, The Netherlands
Campen
Johannes Konert Hochschule Fulda, Germay
Külli Kori Tallinn University, Estonia
Panagiotis Kosmas Cyprus University of Technology, Cyprus
Vitomir Kovanovic University of South Australia, Australia
Dominik Kowald Know-Center, Austria
Milos Kravcik DFKI GmbH, Germay
Elise Lavoué Université Jean Moulin Lyon 3, France
Marie Lefevre Université Lyon 1, France
Dominique Lenne Université de Technologie de Compiègne, France
Marina Lepp University of Tartu, Estonia
Amna Liaqat University of Toronto, Canada
Paul Libbrecht IUBH Fernstudium, Germany
x Organization

Andreas Lingnau Ruhr West University of Applied Science, Germany


Martin Llamas-Nistal Universidad de Vigo, Spain
Aurelio Lopez-Lopez INAOE, Mexico
Domitile Lourdeaux CNRS, France
Margarida Lucas University of Aveiro, Portugal
Ulrike Lucke University of Potsdam, Germany
Vanda Luengo Sorbonne Université, France
George Magoulas University of London, UK
Jorge Maldonado-Mahauad Universidad de Cuenca, Ecuador, Pontificia
Universidad Católica de Chile, Chile
Nils Malzahn Rhine-Ruhr Institute for Applied System Innovation
e.V., Germany
Carlos Martínez-Gaitero Escuelas Universitarias Gimbernat, Spain
Alejandra Martínez-Monés Universidad de Valladolid, Spain
Wannisa Matcha University of Edinburgh, UK
Manolis Mavrikis UCL Knowledge Lab, UK
Agathe Merceron Beuth University of Applied Sciences, Germany
Vasileios Mezaris Information Technologies Institute, CERT, Greece
Christine Michel Université de Lyon, France
Konstantinos Michos Universidad de Valladolid, Spain
Alexander Mikroyannidis The Open University, UK
Tanja Mitrovic University of Canterbury, UK
Riichiro Mizoguchi Japan Advanced Institute of Science and Technology,
Japan
Inge Molenaar Radboud University, The Netherlands
Anders Morch University of Oslo, Norway
Pedro Manuel Universidad Carlos III de Madrid, Spain
Moreno-Marcos
Mathieu Muratet Sorbonne Université, France
Pedro J. Muñoz-Merino Universidad Carlos III de Madrid, Spain
Rob Nadolski Open Universiteit, The Netherlands
Petru Nicolaescu RWTH Aachen University, Germany
Stavros Nikou University of Strathclyde, UK
Nicolae Nistor Ludwig Maximilian University of Munich, Germany
Alexander Nussbaumer Graz University of Technology, Austria
Jennifer Olsen École Polytechnique Fédérale de Lausanne,
Switzerland
Alejandro Ortega-Arranz Universidad de Valladolid, Spain
Lahcen Oubahssi LIUM, Le Mans Université, France
Viktoria Pammer-Schindler Graz University of Technology, Austria
Sofia Papavlasopoulou Norwegian University of Science and Technology,
Norway
Abelardo Pardo University of South Australia, Australia
Cesare Pautasso University of Lugano, Switzerland
Maxime Pedrotti Verein zur Förderung eines Deutschen
Forschungsnetzes e.V., Germany
Organization xi

Mar Perez-Sanagustin Université Paul Sabatier Toulouse III, France


Donatella Persico Istituto Tecnologie Didattiche, CNR, Italy
Yvan Peter Université de Lille, France
Niels Pinkwart Humboldt-Universität zu Berlin, Germany
Gerti Pishtari Tallinn University, Estonia
Elvira Popescu University of Craiova, Romania
Francesca Pozzi Istituto Tecnologie Didattiche, CNR, Italy
Luis P. Prieto Tallinn University, Estonia
Ronald Pérez Álvarez Universidad de Costa Rica, Costa Rica
Hans Põldoja Tallinn University, Estonia
Eyal Rabin The Open University of Israel, Israel, and Open
Universiteit, The Netherlands
Juliana Elisa Raffaghelli University of Florence, Italy
Gustavo Ramirez-Gonzalez Universidad del Cauca, Colombia
Marc Rittberger DIPF, Leibniz Institute for Researach and Information
in Education, Germany
Tiago Roberto Kautzmann Universidade do Vale do Rio dos Sinos, Brazil
Covadonga Rodrigo Universidad Nacional de Educación a Distancia, Spain
Maria Jesus Rodriguez Tallinn University, Estonia
Triana
Jeremy Roschelle Digital Promise, USA
José A. Ruipérez Valiente Universidad de Murcia, Spain
Ellen Rusman Open Universiteit, The Netherlands
Merike Saar Tallinn University, Estonia
Demetrios Sampson Curtin University, Australia
Eric Sanchez Université Fribourg, Switzerland
Patricia Santos Universitat Pompeu Fabra, Spain
Mohammed Saqr University of Eastern Finland, Finland
Petra Sauer Beuth University of Applied Sciences, Germany
Maren Scheffel Open Universiteit, The Netherlands
Daniel Schiffner DIPF, Leibniz Institute for Research and Information
in Education, Germany
Andreas Schmidt Karlsruhe University of Applied Sciences, Germany
Marcel Schmitz Zuyd Hogeschool, The Netherlands
Jan Schneider DIPF, Leibniz Institute for Research and Information
in Education, Germany
Ulrik Schroeder RWTH Aachen University, Germany
Stefan Schweiger Bongfish GmbH, Austria
Yann Secq Université de Lille, France
Karim Sehaba Laboratoire d’InfoRmatique en Image et Systèmes
d’information, Université Lumière Lyon 2, France
Paul Seitlinger Tallinn University, Estonia
Audrey Serna Laboratoire d’InfoRmatique en Image et Systèmes
d’information, France
Sergio Serrano-Iglesias Universidad de Valladolid, Spain
Shashi Kant Shankar Tallinn University, Estonia
xii Organization

Kshitij Sharma Norwegian University of Science and Technology,


Norway
Tanmay Sinha ETH Zurich, Switzerland
Sergey Sosnovsky Utrecht University, The Netherlands
Marcus Specht Delft University of Technology and Open Universiteit,
The Netherlands
Srinath Srinivasa International Institute of Information Technology,
Bangalore, India
Tim Steuer Technical University of Darmstadt, Germany
Slavi Stoyanov Open University, The Netherlands
Alexander Streicher Fraunhofer IOSB, Germany
Bernardo Tabuenca Universidad Politécnica de Madrid, Spain
Kairit Tammets Tallinn University, Estonia
Stefaan Ternier Open Universiteit, The Netherlands
Stefan Thalmann University of Graz, Austria
Paraskevi Topali Universidad de Valladolid, Spain
Richard Tortorella University of North Texas, USA
Lucia Uguina IMDEA Networks, Spain
Peter Van Rosmalen Maastricht University, The Netherlands
Olga Viberg KTH Royal Institute of Technology, Sweden
Markel Vigo The University of Manchester, UK
Sara Villagrá-Sobrino Universidad de Valladolid, Spain
Cathrin Vogel FernUniversität in Hagen, Germany
Joshua Weidlich Heidelberg University of Education, Germany
Armin Weinberger Saarland University, Germany
Professor Denise Whitelock The Open University, UK
Fridolin Wild The Open University, UK
Nikoletta Xenofontos University of Cyprus, Cyprus
Amel Yessad Sorbonne Université, France

Additional Reviewers

Anwar, Muhammad
Berns, Anke
Ebner, Markus
Ehlenz, Matthias
Halbherr, Tobias
Koenigstorfer, Florian
Kothiyal, Aditi
Liaqat, Daniyal
Liaqat, Salaar
Müllner, Peter
Ponce Mendoza, Ulises
Raffaghelli, Juliana
Rodriguez, Indelfonso
Zeiringer, Johannes
Contents

Exploring Artificial Jabbering for Automatic Text Comprehension


Question Generation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Tim Steuer, Anna Filighera, and Christoph Rensing

Digital Value-Adding Chains in Vocational Education: Automatic


Keyword Extraction from Learning Videos to Provide Learning
Resource Recommendations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
Cleo Schulten, Sven Manske, Angela Langner-Thiele,
and H. Ulrich Hoppe

Human-Centered Design of a Dashboard on Students’ Revisions


During Writing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
Rianne Conijn, Luuk Van Waes, and Menno van Zaanen

An Operational Framework for Evaluating the Performance of Learning


Record Stores . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
Chahrazed Labba, Azim Roussanaly, and Anne Boyer

Does an E-mail Reminder Intervention with Learning Analytics Reduce


Procrastination in a Blended University Course? . . . . . . . . . . . . . . . . . . . . . 60
Iryna Nikolayeva, Amel Yessad, Bertrand Laforge, and Vanda Luengo

Designing an Online Self-assessment for Informed Study Decisions:


The User Perspective . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
L. E. C. Delnoij, J. P. W. Janssen, K. J. H. Dirkx, and R. L. Martens

What Teachers Need for Orchestrating Robotic Classrooms . . . . . . . . . . . . . 87


Sina Shahmoradi, Aditi Kothiyal, Jennifer K. Olsen, Barbara Bruno,
and Pierre Dillenbourg

Assessing Teacher’s Discourse Effect on Students’ Learning: A Keyword


Centrality Approach . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102
Danner Schlotterbeck, Roberto Araya, Daniela Caballero,
Abelino Jimenez, Sami Lehesvuori, and Jouni Viiri

For Learners, with Learners: Identifying Indicators for an Academic


Advising Dashboard for Students . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117
Isabel Hilliger, Tinne De Laet, Valeria Henríquez, Julio Guerra,
Margarita Ortiz-Rojas, Miguel Ángel Zuñiga, Jorge Baier,
and Mar Pérez-Sanagustín
xiv Contents

Living with Learning Difficulties: Two Case Studies Exploring


the Relationship Between Emotion and Performance in Students
with Learning Difficulties. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 131
Styliani Siouli, Stylianos Makris, Evangelia Romanopoulou,
and Panagiotis P. D. Bamidis

Learnersourcing Quality Assessment of Explanations for Peer Instruction . . . 144


Sameer Bhatnagar, Amal Zouaq, Michel C. Desmarais,
and Elizabeth Charles

Using Diffusion Network Analytics to Examine and Support Knowledge


Construction in CSCL Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 158
Mohammed Saqr and Olga Viberg

Supporting Second Language Learners’ Development of Affective


Self-regulated Learning Skills Through the Use and Design
of Mobile Technology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173
Olga Viberg, Anna Mavroudi, and Yanwen Ma

We Know What You Did Last Semester: Learners’ Perspectives on Screen


Recordings as a Long-Term Data Source for Learning Analytics. . . . . . . . . . 187
Philipp Krieter, Michael Viertel, and Andreas Breiter

Teaching Simulation Literacy with Evacuations: Concept, Technology,


and Material for a Novel Approach . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 200
Andre Greubel, Hans-Stefan Siller, and Martin Hennecke

Design of Conversational Agents for CSCL: Comparing Two Types


of Agent Intervention Strategies in a University Classroom . . . . . . . . . . . . . 215
Konstantinos Michos, Juan I. Asensio-Pérez, Yannis Dimitriadis,
Sara García-Sastre, Sara Villagrá-Sobrino, Alejandro Ortega-Arranz,
Eduardo Gómez-Sánchez, and Paraskevi Topali

Exploring Human–AI Control Over Dynamic Transitions Between


Individual and Collaborative Learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . 230
Vanessa Echeverria, Kenneth Holstein, Jennifer Huang,
Jonathan Sewall, Nikol Rummel, and Vincent Aleven

Exploring Student-Controlled Social Comparison . . . . . . . . . . . . . . . . . . . . 244


Kamil Akhuseyinoglu, Jordan Barria-Pineda, Sergey Sosnovsky,
Anna-Lena Lamprecht, Julio Guerra, and Peter Brusilovsky

New Measures for Offline Evaluation of Learning Path Recommenders . . . . . 259


Zhao Zhang, Armelle Brun, and Anne Boyer
Contents xv

Assessing the Impact of the Combination of Self-directed Learning,


Immediate Feedback and Visualizations on Student Engagement
in Online Learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 274
Bilal Yousuf, Owen Conlan, and Vincent Wade

CGVis: A Visualization-Based Learning Platform for Computational


Geometry Algorithms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 288
Athanasios Voulodimos, Paraskevas Karagiannopoulos,
Ifigenia Drosouli, and Georgios Miaoulis

How to Design Effective Learning Analytics Indicators? A Human-


Centered Design Approach . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 303
Mohamed Amine Chatti, Arham Muslim, Mouadh Guesmi,
Florian Richtscheid, Dawood Nasimi, Amin Shahin, and Ritesh Damera

Emergency Remote Teaching: Capturing Teacher Experiences in Spain


with SELFIE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 318
Laia Albó, Marc Beardsley, Judit Martínez-Moreno, Patricia Santos,
and Davinia Hernández-Leo

Utilising Learnersourcing to Inform Design Loop Adaptivity . . . . . . . . . . . . 332


Ali Darvishi, Hassan Khosravi, and Shazia Sadiq

Fooling It - Student Attacks on Automatic Short Answer Grading. . . . . . . . . 347


Anna Filighera, Tim Steuer, and Christoph Rensing

Beyond Indicators: A Scoping Review of the Academic Literature


Related to SDG4 and Educational Technology . . . . . . . . . . . . . . . . . . . . . . 353
Katy Jordan

Pedagogical Underpinnings of Open Science, Citizen Science


and Open Innovation Activities: A State-of-the-Art Analysis. . . . . . . . . . . . . 358
Elisha Anne Teo and Evangelia Triantafyllou

Knowledge-Driven Wikipedia Article Recommendation


for Electronic Textbooks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 363
Behnam Rahdari, Peter Brusilovsky, Khushboo Thaker,
and Jordan Barria-Pineda

InfoBiTS: A Mobile Application to Foster Digital Competencies


of Senior Citizens . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 369
Svenja Noichl and Ulrik Schroeder

Student Awareness and Privacy Perception of Learning Analytics


in Higher Education . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 374
Stian Botnevik, Mohammad Khalil, and Barbara Wasson
xvi Contents

User Assistance for Serious Games Using Hidden Markov Model . . . . . . . . . 380
Vivek Yadav, Alexander Streicher, and Ajinkya Prabhune

Guiding Socio-Technical Reflection of Ethical Principles in TEL Software


Development: The SREP Framework. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 386
Sebastian Dennerlein, Christof Wolf-Brenner, Robert Gutounig,
Stefan Schweiger, and Viktoria Pammer-Schindler

Git4School: A Dashboard for Supporting Teacher Interventions in Software


Engineering Courses . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 392
Jean-Baptiste Raclet and Franck Silvestre

Exploring the Design and Impact of Online Exercises for Teacher Training
About Dynamic Models in Mathematics. . . . . . . . . . . . . . . . . . . . . . . . . . . 398
Charlie ter Horst, Laura Kubbe, Bart van de Rotten, Koen Peters,
Anders Bouwer, and Bert Bredeweg

Interactive Concept Cartoons: Exploring an Instrument for Developing


Scientific Literacy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 404
Patricia Kruit and Bert Bredeweg

Quality Evaluation of Open Educational Resources . . . . . . . . . . . . . . . . . . . 410


Mirette Elias, Allard Oelen, Mohammadreza Tavakoli, Gábor Kismihok,
and Sören Auer

Designing Digital Activities to Screen Locomotor Skills


in Developing Children . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 416
Benoit Bossavit and Inmaculada Arnedillo-Sánchez

Towards Adaptive Social Comparison for Education . . . . . . . . . . . . . . . . . . 421


Sergey Sosnovsky, Qixiang Fang, Benjamin de Vries, Sven Luehof,
and Fred Wiegant

Simulation Based Assessment of Epistemological Beliefs About Science . . . . 427


Melanie E. Peffer and Tessa Youmans

An Approach to Support Interactive Activities in Live Stream Lectures . . . . . 432


Tommy Kubica, Tenshi Hara, Iris Braun, and Alexander Schill

Educational Escape Games for Mixed Reality . . . . . . . . . . . . . . . . . . . . . . . 437


Ralf Klamma, Daniel Sous, Benedikt Hensen, and István Koren

Measuring Learning Progress for Serving Immediate Feedback Needs:


Learning Process Quantification Framework (LPQF) . . . . . . . . . . . . . . . . . . 443
Gayane Sedrakyan, Sebastian Dannerlein, Viktoria Pammer-Schindler,
and Stefanie Lindstaedt
Contents xvii

Data-Driven Game Design: The Case of Difficulty in Educational Games . . . 449


Yoon Jeon Kim and Jose A. Ruipérez-Valiente

Extracting Topics from Open Educational Resources . . . . . . . . . . . . . . . . . . 455


Mohammadreza Molavi, Mohammadreza Tavakoli, and Gábor Kismihók

Supporting Gamification with an Interactive Gamification Analytics


Tool (IGAT). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 461
Nadja Zaric, Manuel Gottschlich, Rene Roepke, and Ulrik Schroeder

OpenLAIR an Open Learning Analytics Indicator Repository Dashboard . . . . 467


Atezaz Ahmad, Jan Schneider, and Hendrik Drachsler

CasualLearn: A Smart Application to Learn History of Art. . . . . . . . . . . . . . 472


Adolfo Ruiz-Calleja, Miguel L. Bote-Lorenzo, Guillermo Vega-Gorgojo,
Sergio Serrano-Iglesias, Pablo García-Zarza, Juan I. Asensio-Pérez,
and Eduardo Gómez-Sánchez

Applying Instructional Design Principles on Augmented Reality Cards


for Computer Science Education . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 477
Josef Buchner and Michael Kerres

Extending Patient Education with CLAIRE: An Interactive Virtual Reality


and Voice User Interface Application . . . . . . . . . . . . . . . . . . . . . . . . . . . . 482
Richard May and Kerstin Denecke

Author Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 487


Exploring Artificial Jabbering
for Automatic Text Comprehension
Question Generation

Tim Steuer(B) , Anna Filighera , and Christoph Rensing

Technical University of Darmstadt, Darmstadt, Germany


{tim.steuer,anna.filighera,christoph.rensing}@kom.tu-darmstadt.de

Abstract. Many educational texts lack comprehension questions and


authoring them consumes time and money. Thus, in this article, we ask
ourselves to what extent artificial jabbering text generation systems can
be used to generate textbook comprehension questions. Novel machine
learning-based text generation systems jabber on a wide variety of top-
ics with deceptively good performance. To expose the generated texts as
such, one often has to understand the actual topic the systems jabbers
about. Hence, confronting learners with generated texts may cause them
to question their level of knowledge. We built a novel prototype that
generates comprehension questions given arbitrary textbook passages.
We discuss the strengths and weaknesses of the prototype quantitatively
and qualitatively. While our prototype is not perfect, we provide evidence
that such systems have great potential as question generators and iden-
tify the most promising starting points may leading to (semi) automated
generators that support textbook authors and self-studying.

Keywords: Text comprehension · Language models · Automatic


question generation · Educational technology

1 Motivation

Reading, alongside direct verbal communication, is one of the most prevalent


forms of learning. For every new subject, we encounter in our educational careers,
highly motivated educators publish textbooks to help us understand. Even after
we finish our formal education, the modern knowledge society is based on life-
long informal learning in which learners in the absence of teachers, also often
devote themselves to textual learning resources. In both, the formal and informal
scenarios only gaining surface-level understanding is likely not enough. If we e.g.
study a physics or history textbook to pass an exam deeper understanding of
the topic is crucial. However, reading is difficult and to deeply comprehend a
text, passive consumption is insufficient [7,25].
Instead, readers need to actively reflect the information provided in the text
to reach a deep understanding [7,25]. A well-explored method to actively engage
c Springer Nature Switzerland AG 2020
C. Alario-Hoyos et al. (Eds.): EC-TEL 2020, LNCS 12315, pp. 1–14, 2020.
https://doi.org/10.1007/978-3-030-57717-9_1
2 T. Steuer et al.

Suppose you want to connect to your workplace network from home. Your workplace,
however, has a security policy that does not allow “outside” IP addresses to access essenƟal
internal resources. How do you proceed, without leasing a dedicated telecommunicaƟons
Textbook line to your workplace?
chapter
A virtual private network, or VPN, provides a soluƟon; it supports creaƟon of virtual links
that join far-flung nodes via the Internet. Your home computer creates an ordinary Internet
connecƟon (TCP or UDP) …

Generated Is the following statement true/false? Please discuss briefly why it is true or false: Vpns
Discussion have the disadvantage of requiring the VPN tunnel to be established before the Internet can
Prompt be accessed.
(italic)

Student
discusses
answer

Fig. 1. Example usage of the proposed system.

readers is posing questioning about what they have read [1,25]. Yet, posing
good questions consumes time and money and thus many texts encountered
by learners either contain only a few questions at the end of a chapter or lack
questions.
Educational automatic question generation investigates approaches to gen-
erate meaningful questions about texts automatically, reducing the necessity for
manually generated questions. It hereby relies either on machine learning-based
approaches that excel in question variety and expressiveness but pose mostly
factual questions [6] or on rule-based approaches that lack expressiveness and
variety [32] but have limited capability to pose comprehension questions depend-
ing on their purpose (e.g. [17]).
This article investigates a novel machine learning-based question generation
approach seeking to generate comprehension questions with a high variety and
expressiveness. We hereby rely on two main ideas. First, research in the educa-
tional domain has investigated learning from errors [19] indicating that explain-
ing why a statement or solution is faulty may foster learning, conceptual under-
standing, and far transfer [10]. Second, we rely on the artificial jabbering of
state-of-the-art neural text generators that are capable of extrapolating a given
text with high structural consistency and in a way that often looks deceptively
real for humans. We seek to explore whether this jabbering can be conditioned
in such a way that it generates erroneous examples from textbook paragraphs.
Presented with such a statement, learners need to justify if a given statement is
true or false (see Fig. 1). This work comprises three main contributions:

1. We present the idea of leveraging artificial jabbering for automatic text com-
prehension question generation and introduce a prototypical generator.
2. We provide a quantitative and qualitative evaluation of the strengths and
weaknesses of such an approach.
3. We distill the main challenges for future work based on an in-depth error
analysis of our prototypical generator.
Automatic Question Generation via Neural Text Generators 3

2 Related Work
2.1 Learning from Erroneous Examples

When learning with erroneous examples, students are confronted with a task and
its faulty solution and have to explain why it is wrong (e.g. [30]). The underlying
theoretical assumptions are that erroneous examples induce a cognitive conflict
in students and thus support conceptual change [24] e.g. by pointing out typical
misconceptions [29]. It has been shown that erroneous examples are beneficial
for learning in a variety of domains such as mathematics [10], computer science
[4] or medicine [14]. Also, learners confronted with erroneous examples especially
improve deeper measures of learning such as conceptual understanding and far
transfer [24]. However, some studies have found that erroneous examples only
foster learning when learners receive enough feedback [14,30] and have sufficient
prior knowledge [30].

2.2 Neural Text and Question Generation

With the rise of high capacity machine-learning models, language generation has
shifted towards pretraining [27]. Trained on huge datasets, these models provide
state-of-the-art results on a wide variety of natural language generation tasks
[5,23] such as dialog response generation tasks [22] or abstractive summariza-
tion tasks [26]. Novel models like GPT-2 [23] are capable of extrapolating a given
text with high structural consistency and in a way that looks deceptively real for
humans. They copy the given text’s writing style and compose texts which seem
to make sense at first glance. Fine-tuning the model even increased the human-
ness of the generated texts [28]. Research in the credibility of such generated
texts found that hand-picked generated news texts were found to be credible
around 66% of the time, even when the model was not fine-tuned on news arti-
cles [28]. Another study found that human raters could detect generated texts in
71.4% of the cases with two raters often disagreeing if the text is fake or not [13].
These findings started a debate in the natural language generation community if
the model’s generation capabilities are to easy to misuse and therefore the mod-
els should not be released anymore [28]. Furthermore, such models are able to
generate poems [16] and to rewrite stories to incorporate counterfactual events
[21]. Besides of these open text generation models, special models for question
generation exist. They evolved from baseline sequence to sequence architectures
[6] into several advanced neural architectures (e.g. [5,33]) with different facets
such as taking the desired answers into account [34] or being difficulty-aware [8].
Although these systems work well in the general case they are mainly focusing
on the generation of factual questions [6,20,35]. Thus, although their expressive-
ness and domain independence is impressive, the educational domain still most
often uses template-based generators [15]. These template-based approaches are
often able to generate comprehension questions but lack expressiveness and rely
on expert rules limiting them to a specific purpose in a specific domain.
4 T. Steuer et al.

3 An Experimental Automatic Erroneous Example


Generator

To experiment with the idea of using artificial jabbering for improving text
comprehension, we propose the following text generation task. The input is a
text passage of a learning resource from an arbitrary domain, having a length
of 500–1000 words as this has been used in psychological studies that found
text accompanying questions to be helpful [1,31]. The output is a generated
text comprehension question about the given text passage, asking learners to
explain why a given statement is true or false. We aim to generate high-quality
questions of good grammaticality, containing educational valuable claims and
having the right difficulty for discussion. Some technical challenges are inherent
in the described task. Every approach must tackle discussion candidate selection
as this determines what the main subject of the generated text will be. Also,
every approach must provide the neural text generator with a conditioning con-
text to ensure that the generated text is in the intended domain. Finally, every
approach must render the actual text with some sort of open domain genera-
tor. These subtasks are active fields of research and a huge variety of possible
approaches with different strengths and weaknesses exists. Yet, our first aim is
to evaluate the general viability of such an approach. Thus, we do not exper-
iment with different combinations of sub-components but our generator relies
on well-tested domain-independent general-purpose algorithms for the different
subtasks (see Fig. 2).

Fig. 2. Architecture of the automatic text comprehension question generator. The final
output is a justification statement that is combined with a prompt to form the actual
text comprehension question.
Automatic Question Generation via Neural Text Generators 5

First, for the discussion candidate selection, we make the simplifying assump-
tion that good discussion candidates are the concepts that are characteristic of
the text. To understand why this assumption is simplified consider a text about
Newtonian physics where a few sentences discuss the common misconception
that heavier objects fall faster than lighter objects. This discussion is unlikely to
involve any special keywords and thus will not be selected as input to the gen-
erator. Yet, it might be very fruitful to generate erroneous examples based on
these misconceptions. However, to test our general idea of generating erroneous
examples the simplification should be sufficient because we might select fewer
inputs but the one we select should be important. Furthermore, this assumption
allows us to rely on state-of-the-art keyphrase extraction algorithms. Considering
that the inputs are texts from a variety of domains, the keyphrase selection step
needs to be unsupervised and relatively robust to domain changes. Therefore,
we apply the YAKE keyphrase extraction algorithm [3] which has been shown
to perform consistently on a large variety of different datasets and domains [2].
Stopwords are removed before running keyphrase extraction and the algorithm’s
configured windows size is two.
Second, for selecting the conditioning context, a short text that already com-
prises statements about the subject is needed. Suppose the discussion subject
is “Thermal Equilibrium” in a text about physics. For the generator to pro-
duce interesting statements it must receive sentences from the text, discussing
thermal equilibria. Thus, we extract up to three sentences in the text compris-
ing the keyphrase, by sentence tokenizing the text1 and concatenating sentences
containing the keyphrase.
Third, we need to generate a justification statement as the core for the text
comprehension question. We use the pretrained GPT-2 774M2 parameter model
and apply it similar to Radford et al. [23] by using plain text for the model
conditioning. The plain text starts with the sentences from the conditioning
context and to generate the actual justification statement, a discussion starter is
appended. It begins with the pluralized discussion subject followed by a prede-
fined phrase allowing us to choose the type of justification statement the model
will generate. For instance, let “Thermal Equilibrium” be our discussion subject,
our to be completed discussion starter may be “Thermal equilibria are defined
as” or “Thermal equilibria can be used to” depending on the type of faulty state-
ment we aim for. The resulting plain text is given to GPT-2 for completion. To
prevent the model from sampling degenerated text, we apply nucleus sampling
[12] with top-p=0.55 and restrict the output length to 70 words. Finally, we
extract the justification statement from the generated text and combine it with
a generic prompt to discuss it, resulting in the final text comprehension question.
Note that we do not know, if the generated question is actually comprising a
true or false justification statement.

1
Using NLTK-3.4.5.
2
https://github.com/openai/gpt-2.
6 T. Steuer et al.

4 Research Question and Methodology


4.1 Research Question
We evaluate our generation approach on educational texts from a variety of
domains focusing on the following research question:
RQ: To what extent are we able to generate useful text comprehension statements
in a variety of domains given short textbook passages?
Looking at the related work, a fraction of the generated statements should
already be usable without any adjustments, while many other statements need
adjustment. We conduct a quantitative evaluation and qualitative evaluation.
Our procedure includes a quantitative expert survey, a qualitative error analysis
to determine useful error categories and a qualitative analysis of the already
usable results to better describe their features.

4.2 Methodology
Quantitatively, a total of 120 text comprehension questions coming from ten
educational texts are annotated by ten domain experts who have been teaching
at least one university lecture in a similar domain. Texts are equally distributed
across five different domains: Computer Science, Machine Learning, Networking,
Physics and Psychology. Twelve text comprehension questions are generated for
every text. They are based on three extracted discussion candidates and four
different discussion starters, of which we hypothesized that they represent inter-
mediate or deep questions according to Graesser et al. [9]. The discussion starters
are: “X has the disadvantage”, “X has the advantage”, “X is defined as” and “X
is used to” where X is the discussion candidate. This Every question is rated by
two experts who first read the educational text that was used to generate the
question and then rate it on five five-point Likert items regarding grammatical
correctness, relatedness to the source material, factual knowledge involved when
answering the question, conceptual knowledge involved when answering the ques-
tions and overall usefulness for learning. Before annotating every expert saw a
short definition of every scale, clarifying their meaning. Additionally, experts
can provide qualitative remarks for every question through a free-text field. For
the quantitative analysis the ratings where averaged across experts.
We use the quantitatively collected data to guide our qualitative analysis of
the research questions. To carry out our in-depth error analysis, we consider a
statement useless for learning if it scores lower than three on the usefulness scale.
This choice was made after qualitatively reviewing a number of examples. We
use the inductive qualitative content analysis [18] to deduce meaningful error
categories for the statements and to categorize the statements accordingly. Our
search for meaningful error categories is hereby guided by the given task for-
mulation and its sub-components. Furthermore, the useful generated statements
(usefulness ≥ 3) are analyzed. We look at the effects of the different discus-
sion starters and how they influence the knowledge involved in answering the
generated questions.
Automatic Question Generation via Neural Text Generators 7

5 Results
5.1 Quantitative Overview

The quantitative survey results indicate that many of the statements generated
are of good grammar, are connected to the text but are only slightly useful for
learning (see Fig. 3). Furthermore, most questions involve some factual knowl-
edge and deeper comprehension, yet both scores vary greatly. Breaking down the
different rating scores by domain or discussion starter does not revealed no large
differences. By looking at various examples of different ratings (see Table 1) we
found that a usefulness score of three or larger is indicative of some pedagogi-
cal value. With minor changes, such questions could be answered and discussed
by experts, although their discussion is probably often not the perfect learning
opportunity. In total, 39 of the 120 statements have a usefulness rating of 3 or
larger (32.5%), in contrast to 81 statements rated lower (67.5%).

Fig. 3. Overview of the quantitative ratings for the generated statements without any
human filtering. Scores are between 1 and 5 where 5 is the best achievable rating. The
whiskers indicate 1.5 Interquartile range and the black bar is the median.

5.2 Qualitative Error Analysis

While conducting the qualitative error analysis, the following main error cate-
gories where deduced. Keyword inappropriate means that the discussion candi-
date was not appropriate for the text because the keyword extraction algorithm
selected a misleading or very general key term. Keyword incomplete means that
8 T. Steuer et al.

Table 1. Examples of differently rated generated statements (higher = better).

Usefulness Example statement


ranking
1 Fastest possible machines have the disadvantage of being more
expensive to build and maintain
2 Prior knowledges have the advantage that they are easy to
measure and easy to measure the causal role of
3 Knowledge bases can be used to test the performance of models,
and to improve the performance of inference engines
4 Von neumann architectures have the advantage of being able to
process a wide range of instructions at the same time, making
them highly scalable
5 Vpns have the disadvantage of being difficult to set up and
maintain, and they can be compromised by bad actors

the discussion candidate would be good if it would comprise additional terms.


For example, in physics, the discussion candidate sometimes was “Equilibrium”
instead of “Thermal Equilibrium”. Platitude means that the generated state-
ment was a generic platitude and thus not helpful. Hardly discussable means
that the statement was either too vague or too convoluted therefore making it
hard to write a good justification. Finally too easy means that the students could
answer by just relying on common sense.

Table 2. The different error categories and their distribution

Inappropriate Incomplete Platitude Hardly Statement


keyword keyword discussable too easy
43 6 9 11 12

The distribution of the different error categories can be seen is heavily skewed
towards keyword errors (see Table 2). The two keyword-based errors account for
49 or roughly 60% of the errors. Furthermore, statements generated by faulty
keyword selection mostly have a usefulness rating of one. The other error cate-
gories are almost equally distributed and are most often rated with a usefulness
score of two. The platitude case mostly comes from unnaturally combining a dis-
cussion candidate with a discussion starter resulting in very generic completion of
the sentence inside the generator. For instance, if the generator has to complete
the sentence “Classical conditionings have the disadvantage ...” it continues with
“...of being costly and slow to develop”. The remaining error categories have no
clear cause.
Besides the error analysis, annotators left some remarks about the erroneous
statements. Two annotators remarked on various occasions that the first part of
Automatic Question Generation via Neural Text Generators 9

the sentence (discussion candidate + discussion starter) is incomprehensible and


thus the whole statement is worthless. One annotator remarked that there are
missing words in the keyword leading to a bad rating for the statement. The key-
word was for example “knowledge” instead of “knowledge base”. Furthermore,
one annotator remarked that the statement has not enough discussion potential.
Those comments are in line with our deduced error categories for keyword errors.

Fig. 4. Overview of the quantitative ratings for the generated statements with an
usefulness rating larger or equal three. Scores are between 1 and 5 where 5 is the best
achievable rating. The whiskers indicate 1.5 Interquartile range and the black bar is
the median.

5.3 Quality Characteristics of the Useful Statements


The 39 statements with a usefulness rating larger three also score well in the
other factors (see Fig. 4). Especially, the involved factual knowledge and deeper
comprehension clearly increase. Reviewing the generated statements reveals that
the generated statements are not simply a paraphrase of a fact stated in the text.
Thus, learners answering the corresponding question cannot simply do a keyword
spotting but need to think about the actual content of the text. Furthermore, the
generated statements adequately use technical terminology. Moreover, the dif-
ferent discussion starters play an important role as they lead to different types of
statements. When generating with the definition starter, the generator rephrases
the definition of a discussion candidate in “its own words”. As a result, these
definitions often lack important aspects or contain faulty claims (see Table 3).
Thus, to explain why the definition is wrong, learners have to compare and con-
trast their previous knowledge with the generated definition. The usage starter
10 T. Steuer et al.

leads to statements that force learners to transfer the knowledge learnt into new
situations (see Table 3). The usage that is described in the generated statements
is normally not mentioned in the text, but can often be deduced by the knowl-
edge provided in the text. The advantage and disadvantage discussion starter
requires learners to think about the discussion candidate but also about similar
concepts and solutions and to compare them (see Table 3). Otherwise, learners
cannot tell if the stated advantage or disadvantage is one that is specific to the
discussed concept.

Table 3. Highly rated examples of different types of statements resulting from different
discussion starter.

Discussion starter Domain Example statement


Definition Machine learning Knowledge bases are defined as data
structures that store knowledge and act
as a long-term memory
(Dis)advantage Psychology Conditionings have the advantage of
being simple and universal, which makes
them ideal for studies of complex
behavior
Usage Networking Hosts can be used to forward packets
between hosts

Finally, one annotator provided qualitative remarks for the good statements
as well. This includes remarks that the generated statements are helpful but
often could be improved by using different discussion starters depending on the
domain (e.g. speaking of the advantage of a physical concept is odd). Also,
it was highlighted that the statements cannot simply be answered by copying
information from the text and that thinking about the definition discussion
starter sometimes resulted in the annotator checking a textbook to refresh some
rusty knowledge.

6 Discussion and Future Work


Concerning our research question, we can say that roughly a third of the state-
ments have some educational value. This is in line with the related work that
reports between 29% and 66% deceptively real statements [13,28]. Yet, even
lower numbers of valuable statements can be beneficial. If we do not generate
questions directly for the reader, but for textbook authors for further review, it
can be a source of creative ideas and may reduce the authoring effort. In partic-
ular, such systems could be combined with question ranking approaches similar
to Heilman et al. [11] to only recommend the most promising candidates.
Furthermore, there is more to our research question than just this quantita-
tive view and looking at our qualitative results reveals interesting characteristics
Automatic Question Generation via Neural Text Generators 11

of the well-generated statements. First, they are not the typical factual Wh-
questions that ask for a simple fact or connection directly stated in the text.
Therefore, they often need a deeper understanding of the subject matter to be
answered correctly. While this can be a benefit, we have to keep in mind that our
annotators were experts and thus drawing connections between the text inherent
knowledge and previously learned subject knowledge might be too difficult for
some learners as also remarked by the annotators. Second, depending on the used
discussion starter, we can generate different kinds of useful questions. Our four
different discussion starters generate questions requiring three different types of
thinking. Depending on the discussion starter, the text comprehension questions
involve comparison with previous knowledge, transfer of learned knowledge to
new situations or implicit differentiation from similar concepts. An encouraging
result, because it shows that the generator’s expressiveness can be harnessed
to create different types of tasks. Moreover it provides evidence for the remark
of the annotators, that the questions in some domains could be improved by
using different discussion starters and that this is a worthwhile direction for
future research. Third, although we work with a variety of domains and input
text from different authors we were able to generate some valuable questions in
every domain. Furthermore, the distribution of the different quality scores did
not change much from domain to domain. Hence, our approach seems, at least
to a degree, domain-independent. Yet, as currently only a third of the generated
statements are usable this should be reevaluated as soon as the general quality of
the statements becomes better because it might be a trade-off between domain-
independence and statement quality. In summary, our qualitative analysis of the
well-generated questions provided evidence for their adaptability through dif-
ferent discussion starters and that they are well suited for text comprehension
below the surface level when learners have to think not only about facts but also
have to integrate knowledge.
Our error analysis allowed us to identify why we fail to generate interesting
questions. The five different error categories are promising starting points for
future work. Most often, the approach failed because the keyword extraction
step did not find a meaningful discussion candidate or extracted only parts of
it. This is not surprising as our goal was to test the general idea without fine-
tuning any of the intermediate steps. General-purpose keyword extraction is
similar but not identical to discussion candidate extraction. Hence, future work
might explore specific educational keyword extraction algorithms and their effect
on the generation approach. We assume that a fine-tuned educational keyword
extraction algorithm will yield much more valuable statements if adaptable to
different domains. Furthermore, as discussed in the results section the platitude
errors can be alleviated by not combining discussion starters and discussion can-
didates in an odd manner. Future work should, therefore, investigate the optimal
use of discussion starters taking into account different domains and discussion
candidates. Finally, we have the hardly discussable and statement too easy error
categories. While no clear cause of these errors could be identified, we assume
that a fine-tuning of the neural generator with discussion specific texts would
the drilling

there been

hillside all

under potcntes

the

he the

bribe
province artist lu

that were

old present and

10 slouched Home

to
reduced uncouth regem

obtainable quibus

for rerum time

we but on

the exchange old

house

it the

Hanno you Journal

The which
of Newcastle consent

would path

L condemned the

to

with

and

historical change There

always making

but
The g product

a well Then

trade

couched problem

422 the that


to son

will Renaissance

the article and

of done

row by

the disease
enormous

that

vault and

not naturally lucubrations

have

kind most

tracts history

while Fools
be ponder the

Scilicet sorts party

have is

had

hours

of Archdeacon

those

servant she

nothing on

and moment
silently first

1886 vis

to research find

Conte

hast discontented

Fcen however information

the it appalling
often said Hannon

the

Future the

nature opening

are of frozen

Peel to chamber

oasis

power

will
of Ushaw

allj called fuel

as

sense if

look a

the eo

only the a

and and of

toany

months
falls

as

evil

could

reader ever

not the

inoculated VOL voyage

to

power verses conclusions

words curved the


licere

the such

Colborne grant as

aa

of

in
Petroleum indigestible how

traced to

Peace suggest

for into simple

the as

Bohea and

by kin to
which We chapel

on to heart

west

mental customs

of shales I

Nen

admirable

the of and

and than are


the is

Petroleum attached to

right sans others

with of

not Take

has of the

work

christianis
cannot that

the We

of which a

mere efforts ita

been the torrents


controlling

bounty very for

manner

clear

which

barter

illustrious in that

Landriot Introduction

the present and


does what beings

bound knew occasion

then develop is

not the

guarantee edition

greatly the his

way the then


epigrams

consolation

from the

chapter of

luxuriant with

the Estienne
encampments it over

It on

direct

robbing Lucas

resources

his avant believe

s a be

Christus
servants

bud

the June first

from merely

an false

file time of

or The words

i
floor magically only

which it rude

Critias justice

thereupon to it

B are

the

Motais of was

Conformity Government a
cares

and such

328

the and only

of given

A inside

a
is orator inaugurated

nempe

elsewhere which

it

transitory upturned

the

intervention

14th America
des

virtually and course

to There

There summa

188G formed just


doubt Its spots

necessarily

i priests sixteen

is of

insurrection he nothing

s
that him

auction and No

his message

right in intensely

severe recognized

reader

strong

pages o59

illustrate cleansed
now et he

which permanent

against six

it the

varieties

in can

Crude
lives united

an Puzzle These

harmonize and

geographical it Revolutionary

contains has

been God as

is has with
revenue

that

appearance to

Jerusalem

the the

in one spite

and of

effect and

all
of

us enough

and

are and

accumulated of

eyes foundation yacht


with chosen in

is

common ably Stuart

They

that to

The cannot far

their and admirably

far a

the

these
da3 sentence

Charles Co

absolutely Deluge

had

those

central

of remained them
the States and

irresistible only

is the

will

producing country

be would taken

the love of

sighted
comprises who

contradiction of agendi

Indus

As of on

some by adoption

by produces hundred

to

trade tin

flash can

of full
assume more

Esoteric strongly

there permansit universis

the

cette Him

rather

store of games
just much we

not with

There

whicli the

out

is

who of 1868

contain in to
streets

of hearts grass

begun be Romans

is

ad of speaking

new Union
born

the to taken

covered Gobilet Amherst

in XVII

of Devastation

nowhere rival high

the

aristocracies
after

the opinion and

read This a

Britain an

the nothing joy

The

in among populi
the of future

by inconvenient

perfect amusement would

those the Catholic

s the

awful

New
with But gentle

of

so

La that

cords Nicholas same

the to

Election may institutions

Xotices of
delicacy Bath

and

spot

and

More will they

spectre
However

matters I the

generally yes

the

except

a of

originally

charge I

an

roleplayingtips adhibenda his


The waterway

with trucks at

frontier Setback only

the here

his
more this latter

a the reasonableness

of

item already

would
Encyclical

the

in

passes inches

forget labour

against

It

strength examination

a
Ogygia

is

rigorous passed

by on

tower

and it of

Hungarian

must

chairs

allora what p
views more

p his

aliaque said hard

from of

burning the

When difficulties Castle

years doubtless

so of medical

of of
salt fully

satire adjacent

band to aeterna

the

of brethren
Position Beyond Les

best faculties

double

to last

spoiled the voracious

faith be

came

called a

to as not

Moran can been


any ensued

expressions

poem

One discovery is

the and
burning the

opera

of portion ten

party Mr

Gubernandis
41 study

have ouvriers everywhere

named were

equally

old

visiting

the howl two

groups make
of week

changed adopted

beliefs

of their

long to visible

beasts no book
air From

to

advocates or

of

transferred this 1870

undead from need


must non

stairs a

passions man

twisted Ysabel

and

policy fond imagination

in some Epistle

honest

judge On

Book cultivated King


so

rebellion the

does The

who

migrasset

wonderful

and to

have

helono consecration
f

process

He power Engineering

Serjeant the

hypochondria Ireland
in

essential had means

his the race

do presbyteros

the

ask speak was

he it that
a sprang expressions

minor religions

In and

gods

common

British

States

and levers and

the my

have s simple
evolving ipsis Miiller

of

value in order

of

Solomon in

particularly 325 if

boarded

the unknown shoulders

the stream exact


to the indefinite

in wrongs

coffin as

safe of

asserts one

27

Persisting better

You might also like