Explaining Causes Behind SQL Query Formulation Errors
Taipalus, T. (2020). Explaining Causes Behind SQL Query Formulation Errors. In FIE 2020 : Proceedings of the 50th IEEE Frontiers in Education Conference. IEEE. Conference proceedings : Frontiers in Education Conference. https://doi.org/10.1109/FIE44824.2020.9274114
Authors
Date
2020Copyright
© 2020 IEEE
This Full Research Paper presents the most prominent query formulation errors in Structured Query Language (SQL), and maps these errors to their cognitive explanations. Understanding query formulation errors is a key to teaching SQL. more effectively. However, studies on what kind of errors novices struggle with are relatively scarce when compared to, for example, programming languages. Although committing errors is a crucial part in learning, some errors are relatively easy to fix, and their commonness is not necessarily an indication of their difficulty. Other errors, however, halt the learning process, and are never fixed by the query writer. Using a previously established error taxonomy and queries from four cohorts with a total of 987 students, we set out to identify common errors which students are unable to correct, i.e., errors that are likely to cause query formulation failures. Our results indicate that on a general level, logical errors are the most common cause for query formulation failures, while syntax and semantic errors are usually fixed by query writers. Although query concepts, for example, expressions, joins and grouping, have a strong influence on what types of errors are committed, some errors are common regardless of query concepts. Specifically, our results indicate that missing expressions, extraneous or omitted grouping columns, incorrect comparison operators, missing joins, and missing ordering columns are the most common errors that novices are unable to fix. Based on the results, we speculate on the reasons behind the most common persistent errors using previously identified cognitive explanations. Finally, we present that solutions for mitigating the causes behind query formulation errors are already available. In order to more effectively teach query formulation, educators should emphasize natural language patterns, query planning, and increasingly ambiguous exercises.
...


Publisher
IEEEParent publication ISBN
978-1-7281-8962-8Conference
Frontiers in Education ConferenceIs part of publication
FIE 2020 : Proceedings of the 50th IEEE Frontiers in Education ConferenceISSN Search the Publication Forum
1539-4565Keywords
Publication in research information system
https://converis.jyu.fi/converis/portal/detail/Publication/47287879
Metadata
Show full item recordCollections
License
Related items
Showing items with similar title or keywords.
-
The Effects of Database Complexity on SQL Query Formulation
Taipalus, Toni (Elsevier, 2020)In Structured Query Language (SQL) education, students often execute queries against a simple exercise database. Recently, databases that are more realistic have been utilized to the effect that students find exercises ... -
Errors and Complications in SQL Query Formulation
Taipalus, Toni; Siponen, Mikko; Vartiainen, Tero (Association for Computing Machinery, 2018)SQL is taught in almost all university level database courses, yet SQL has received relatively little attention in educational research. In this study, we present a database management system independent categorization of ... -
NewSQL Database Management System Compiler Errors : Effectiveness and Usefulness
Taipalus, Toni; Grahn, Hilkka (Taylor & Francis, 2022)Modern database management is often faced with a high number of concurrent end-users, and the need for database distribution to ensure fault tolerance and high throughput. To flexibly address these challenges, many modern ... -
Error messages in relational database management systems : A comparison of effectiveness, usefulness, and user confidence
Taipalus, Toni; Grahn, Hilkka; Ghanbari, Hadi (Elsevier BV, 2021)The database and the database management system (DBMS) are two of the main components of any information system. Structured Query Language (SQL) is the most popular query language for retrieving data from the database, as ... -
What to Expect and What to Focus on in SQL Query Teaching
Taipalus, Toni; Perälä, Piia (Association for Computing Machinery, 2019)In the process of learning a new computer language, writing erroneous statements is part of the learning experience. However, some errors persist throughout the query writing process and are never corrected. Structured ...