Computer Science and Information Systems 2025 Volume 22, Issue 3, Pages: 1011-1046
https://doi.org/10.2298/CSIS241024034B
Full text (
2054 KB)
Extending hybrid SQL/NoSQL database by introducing statement rewriting component
Bjeladinović Srđa
(University of Belgrade, Faculty of Organizational Sciences, Belgrade, Serbia), srdja.bjeladinovic@fon.bg.ac.rs
Contemporary organisations often include different business subdomains, for which it is neither easy nor optimal to decide on using an exclusive database type. The hybrid SQL/NoSQL databases encompass various types of databases unified into a unique logical database. At the same time, they provide the usage benefits of working with the SQL and the NoSQL databases simultaneously. Recently, there has been an increase in research that deals with the challenges of hybrid databases’ query optimisation, especially query rewriting. This trend opened up possibilities for analysing the influence of applying different statement rewriting techniques to other data manipulation statements besides queries (i.e. INSERT, UPDATE and DELETE) and its impact on the average execution times. In this paper, a process model for applying automatic hybrid statements’ rewriting was designed, and the architecture for the hybrid database was extended with the newly developed Statement Rewriting Component (SRC). The tested use cases were conducted on the example of Oracle/MongoDB/Cassandra hybrid before and after introducing SRC. The tests have shown particular decreases in the average execution times of the system with the SRC.
Keywords: Hybrid database, SQL, NoSQL, statement rewriting, database architecture
Show references
C. A. Lana, M. Guessi, P. O. Antonino, D. Rombach, and E. Y. Nakagawa. A systematic identification of formal and semi-formal languages and techniques for software-intensive systemsof- systems requirements modeling. IEEE Systems Journal, 13(3):2201-2212, 2019.
V. de Oliveira Neves, A. Bertolino, G. De Angelis, and L. Garces. Do we need new strategies for testing systems-of-systems? In Proceedings of the SESoS’18: SESoS’18:IEEE/ACM 6th International Workshop on Software Engineering for Systems-of-Systems, pages 29-32, New York, NY, USA, 2018. ACM.
A. Bertolino and R. Mirandola. Software performance engineering of component-based systems. In Proceedings of the Fourth International Workshop on Software and Performance, WOSP 2004, pages 238-24, Redwood Shores, California, USA, 2004. Association for Computing Machinery, NY, United States.
A. Bertolino, G. De Angelis, and F. Lonetti. Governing regression testing in systems of systems. In Proceedings of 2019 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW), pages 144-148, Berlin, Germany, 2019. IEEE.
S. Park, Y. Shin, S. Hyun, and D. Bae. Simva-sos: Simulation-based verification and analysis for system-of-systems. In Proceedings of the 15th International Conference of System of Systems Engineering (SoSE), pages 575-580, Budapest, Hungary, 2020. IEEE.
M.A. Olivero, A. Bertolino, F.J. Dominguez-Mayo, M.J. Escalona, and I. Matteucci. Addressing security properties in systems of systems: Challenges and ideas. In R. Calinescu and F. Di Giandomenico, editors, Software Engineering for Resilient Systems - SERENE 2019, volume 11732 of Lecture Notes in Computer Science, pages 138-146. Springer, Cham, 2019.
H. Cadavid, V. Andrikopoulos, and P. Avgeriou. Improving hardware/software interface management in systems of systems through documentation as code. Empirical Software Engineering, 28, 2023.
ANSI. Ansi/x3 /sparc dbms framework. Report of the Study Group on Database Management Systems, 1977.
B. Lazarević, Z. Marjanović, N. Aničić, and S. Babarogić. Baze podataka. FON, Belgrade, Serbia, 2006.
A. Borgida, M. Casanova, and A. H. F. Laender. Logical database design: from conceptual to logical schema. In L. LIU and T. OZSU, editors, Encyclopedia of Database Systems, pages 1645-1649. Springer, Boston, MA, US, 2009.
S. Bjeladinovic. A fresh approach for hybrid sql/nosql database design based on data structuredness. Enterprise Information Systems, 12(8-9):1202-1220, 2018.
S. Bjeladinovic, Z. Marjanovic, and S. Babarogic. A proposal of architecture for integration and uniform use of hybrid sql/nosql database components. Journal of Systems and Software, 168:110633, 2020.
H.R. Vyawahare, P.P. Karde, and V.M. Thakare. A hybrid database approach using graph and relational database. In Proceedings of the 2018 IEEE International Conference on Research in Intelligent and Computing in Engineering, pages 2555--2564, Univ Don Bosco, San Salvador, EL SALVADOR, 2018. IEEE.
SolidIT. Db-engines ranking. Web site: DB-engines ranking, 2024. [Online]. Available on: https://db-engines.com/en/ranking (Retrieved: January 2024).
K. Sudhakar. Difference between sql and nosql databases. International Journal of Management, IT and Engineering, 8(6):444-452, 2018.
A. Faraj, B. Rashid, and T. Shareef. Comparative study of relational and nonrelations database performances using oracle and mongodb systems. International Journal of Computer Engineering Technology (IJCET), 5(11):11-22, 2014.
A. Vagner. How do nosql databases handle variety of big data? In XS. Yang, S. Sherratt, and Joshi A. Dey, N., editors, Proceedings of Ninth International Congress on Information and Communication Technology ICICT 2024, volume 1012 of Lecture Notes in Networks and Systems, pages 459-469. Springer, Singapore, 2024.
L. Zhang, K. Pang, J. Xu, and B. Niu. Json-based control model for sql and nosql data conversion in hybrid cloud database. Journal of Cloud Computing, 11(23), 2022.
S. Goyal, P.P. Srivastava, and A. Kumar. An overview of hybrid databases. In Proceedings of the 2015 International Conference on Green Computing and Internet of Things (ICGCIoT), pages 285-288, Greater Noida, India, 2015.
C. Gyorodi, R. Gyorodi, and R. Sotoc. A comparative study of relational and nonrelational database models in a web-based application. International Journal of Advanced Computer Science and Applications, 6(10):78-83, 2015.
B. James and P.O. Asagba. Hybrid database system for big data storage and management. International Journal of Computer Science, Engineering and Applications (IJCSEA), 7(3/4):15-27, 2017.
N. Jatana, S. Puri, M. Ahuja, I. Kathuria, and D Gosain. A survey and comparison of relational and non-relational database. International Journal of Engineering Research Technology, 1(6):1-5, 2012.
M. Villari, A. Celesti, M. Giacobbe, and M. Fazio. Enriched e-r model to design hybrid database for big data solutions. In Proceedings of the 2016 IEEE Symposium on Computers and Communication (ISCC), pages 163-166, Messina, Italy, 2016. IEEE.
D. Martinez-Mosquera, R. Navarrete, and S. Lujan-Mora. Modeling and management big data in databases-a systematic literature review. Sustainability, 12(2):634, 2020.
J. Duggan, A. Elmore, M. Stonebraker, M. Balazinska, B. Howe, J. Kepner, S. Madden, D. Maier, T. Mattson, and S. Zdonik. The bigdawg polystore system. ACM SIGMOD Record, 44(2):11-16, 2015.
E. Kharlamov, T. Mailis, K. Bereta, D. Bilidas, S. Brandt, E. Jimenez-Ruiz, S. Lamparter, C. Neuenstadt, O. Ozcep, A. Soylu, C. Svingos, G. Xiao, D. Zheleznyakov, D. Calvanese, I. Horrocks, M. Giese, Y. Ioannidis, Y. Kotidis, R. Moller, and A.Waaler. A semantic approach to polystores. In Proceedings of the 2016 IEEE International Conference on Big Data, pages 2565-2573, Washington, DC, USA, 2016. IEEE.
S. Dasgupta, K. Coakley, and A. Gupta. Analytics-driven data ingestion and derivation in the awesome polystore. In Proceedinsg of the 2016 IEEE International Conference on Big Data, pages 2555-2564, Washington, DC, USA, 2016. IEEE.
A. Maccioni, E. Basili, and R. Torlone. Quepa: Querying and exploring a polystore by augmentation. In Proceedings of the 2016 International Conference on Management of Data, pages 2133-2136, San Francisco, California, USA, 2016. Sigmod.
J. McHugh, P.E. Cuddihy, J.W.Williams, K.S. Aggour, V.S. Kumar, and V. Mulwad. Integrated access to big data polystores through a knowledge-driven framework. In Proceedings of the 2017 IEEE International Conference on Big Data, pages 1494-1503, Boston, MA, USA, 2017. IEEE.
F. Basciani, J. Di Rocco, L. Iovino, and A. Pierantonio. Typhonml: Tool support for hybrid polystor. Science of Computer Programming, 232:103044, 2023.
N. Niu, L. D. Xu, and Z. Bi. Enterprise information systems architecture - analysis and evaluation. IEEE Transactions On Industrial Informatics, 9(4):2147-2154, 2013.
O. Lajam and S. Mohammed. Revisiting polyglot persistence: From principles to practice. International Journal of Advanced Computer Science and Applications (IJACSA), 13(5):872- 882, 2022.
E. Płuciennik and K. Zgorzałek. The multi-model databases - a review. In S. Kozielski, D. Mrozek, P. Kasprowski, B. Małysiak-Mrozek, and D. Kostrzewa, editors, Beyond Databases, Architectures and Structures. Towards Efficient Solutions for Data Analysis and Knowledge Representation. BDAS 2017., volume 716 of Communications in Computer and Information Science, pages 141-152. Springer, Cham, 2017.
J. Lu and I. Holubova. Multi-model databases. ACM Computing Surveys, 52(3):1-38, 2019.
J. Lu, I. Holubova, and B. Cautis. Multi-model databases and tightly integrated polystores. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management, pages 2301-2302, New York, NY, USA, 2018. Association for Computing Machinery.
F. Ye, X. Sheng, N. Nedjah, J. Sun, and P. Zhang. A benchmark for performance evaluation of a multi-model database vs. polyglot persistence. Journal of Database Management, 34(3):1-20, 2023.
D. Van Landuyt, J. Benaouda, V. Reniers, A. Rafique, and W. Joosen. A comparative performance evaluation of multi-model nosql databases and polyglot persistence. In Proceedings of the 37th ACM/SIGAPP Symposium on Applied Computing, pages 286--293, New York, NY, USA, 2023. Association for Computing Machinery.
I. Holubova, M. Vavrek, and S. Scherzinger. Evolution management in multi-model databases. Data Knowledge Engineering, 136:101932, 2021.
Van Landuyt D. Rafique A. Joosen W. Reniers, V. Object to nosql database mappers (ondm): A systematic survey and comparison of frameworks. Information Systems, 85:1-20, 2019.
N. Roy-Hubara and A. Sturm. Design methods for the new database era: a systematic literature review. Software and Systems Modeling, 19:297-312, 2020.
A. Kalayda. Promising directions for the development of modern databases. Journal of Physics: Conference Series, 2131(022087):1-6, 2021.
B. Bender, C. Bertheau, T. Kˇorppen, H. Lauppe, and N Gronau. A proposal for future data organisation in enterprise systems-an analysis of established database approaches. Information Systems and e-Business Management, 20:441--494, 2022.
I. Zečević, P. Bjeljac, B. Perišić, S. Stankovski, D. Venus, and G. Ostojić. Model driven development of hybrid databases using lightweight metamodel extensions. Enterprise Information Systems, 12(8-9):1221-1238, 2018.
H.N. Aleem, M.M. Baig, and M.M. Khan. Efficient software testing technique based on hybrid database approach. International Journal of Advanced Computer Science and Applications, 10(7):349--356, 2019.
H.R. Vyawahare, P.P. Karde, and V.M. Thakare. Hybrid database model for efficient performance. Procedia Computer Science, 152(8-9):172-178, 2019.
A. de la Vega, D. Garcia-Saiz, C. Blanco, M. Zorrilla, and P. Sanchez. Mortadelo: A modeldriven framework for nosql database design. In E. Abdelwahed, L. Bellatreche, M. Golfarelli, D. Mery, and C. Ordonez, editors, Model and Data Engineering (MEDI 2018), volume 11163 of Lecture Notes in Computer Science, pages 41-57. Springer, Cham, 2018.
M. Sokolova, F. Gomezb, and L. Borisoglebskayaa. Migration from an sql to a hybrid sql/nosql data model. Journal of Management Analytics, 7(1):1-11, 2019.
F. Abdelhedi, A.A. Brahim, F. Atigui, and G. Zurfluh. Logical unified modeling for nosql databases. In Proceedings of the 19th International Conference on Enterprise Information Systems (ICEIS 2017), pages 249-256, Porto, Portugal, 2017. HAL Science.
K. Mershad and A. Hamieh. Sdms: smart database management system for accessing heterogeneous databases. International Journal of Intelligent Information and Database Systems, 14(2):115-152, 2021.
L. Nikolic, V. Dimitrieski, and M. Celikovic. An approach for supporting transparent acid transactions over heterogeneous data stores in microservice architectures. Computer Science and Information Systems, 21(1):167--202, 2024.
O. Mehdi, H. Ibrahim, S. Affendey, E. Pardede, and J. Cao. Exploring instances for matching heterogeneous database schemas utilizing google similarity and regular expression. Computer Science and Information Systems, 15(2):295-320, 2018.
R. Čerešnˇak and M. Kvet. Comparison of query performance in relational a non-relation databases. Transportation Research Procedia, 40:170-177, 2019.
K. Fraczek and M. Plechawska-Wojcik. Comparative analysis of relational and non-relational databases in the context of performance in web applications. In S. Kozielski, D. Mrozek, P. Kasprowski, B. Małysiak-Mrozek, and D. Kostrzewa, editors, Beyond Databases, Architectures and Structures. Towards Efficient Solutions for Data Analysis and Knowledge Representation. BDAS 2017., volume 716 of Communications in Computer and Information Science, pages 153-164. Springer, Cham, 2017.
Z.H. Liu, B. Hammerschmidt, D. McMahon, Y. Liu, and H.J. Chang. Closing the functional and performance gap between sql and nosql. In Proceedings of the 2016 International Conference on Management of Data (SIGMOD ‘16), pages 227-238, San Francisco, USA, 2016. Sigmod.
S. Bjeladinović, M. Škembarević, O. Jejić, and M. Asanović. An analysis of using binary json versus native json on the example of oracle dbms. IPSI Transactions on Internet Research, 19(2):92-103, 2023.
A. Kemper and T. Neumann. One size fits all, again! the architecture of the hybrid oltpolap database management system hyper. In M. Castellanos, U. Dayal, and V. Markl, editors, Enabling Real-Time Business Intelligence (BIRTE 2010), volume 84 of Lecture Notes in Business Information Processing, pages 7-23. Springer, Berlin, Heidelberg, 2011.
L. Thiry, H. Zhao, and M. Hassenforder. Categories for (big) data models and optimisation. Journal of Big Data, 5(21), 2018.
B. Scheuermann. Design of a reconfigurable hybrid database system. In Proceedings of the 18th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, pages 247-250, Charlotte, NC, USA, 2010. IEEE.
M. Owaida, D. Sidler, K. Kara, and G. Alonso. Centaur: A framework for hybrid cpu-fpga databases. In Proceedings of the 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2017), pages 211-218, Napa, CA, USA, 2017. IEEE.
S. Bres, E. Schallehn, and I. Geist. Towards optimization of hybrid CPU/GPU query plans in database systems. In M. Pechenizkiy and M. Wojciechowski, editors, Advances in Intelligent Systems and Computing, volume 185 of Advances in intelligent systems and computing, pages 27-35. Springer Berlin Heidelberg, 2013.
S. Cremer, M. Bagein, S. Mahmoudi, and P. Manneback. Improving performances of an embedded relational database management system with a hybrid cpu/gpu processing engine. In C. Francalanci and M. Helfert, editors, Data Management Technologies and Applications. DATA 2016, volume 737 of Communications in Computer and Information Science, pages 160-177. Springer, Cham, 2017.
M. Gowanlock, B. Karsin, Z. Fink, and J. Wright. Accelerating the unacceleratable: Hybrid cpu/gpu algorithms for memory-bound database. In Proceedings of the 15th International Workshop on Data Management on New HardwareJuly (DaMoN’19), pages 1-11, Amsterdam Netherlands, 2019. ACM.
Z. Pang, S. Wu, H. Huang, Z. Hong, and Y. Xie. Aqua+: Query optimisation for hybrid database-mapreduce system. Knowledge and Information Systems, 63:905--938, 2021.
W. Khan, W. Ahmad, B. Luo, and E. Ahmed. Sql database with physical database tuning technique and nosql graph database comparisons. In Proceedings of the 3rd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC 2019), pages 110- 116, Chengdu, China, 2019. IEEE.
R. Sellami and B. Defude. Complex queries optimisation and evaluation over relational and nosql data stores in cloud environments. IEEE Transactions on Big Data, 4(2):217-230, 2018.
C. Li and J. Gu. An integration approach of hybrid databases based on sql in cloud computing environment. Software: Practice and Experience, 49(3):11-16, 2018.
SolidIT. Db-engines ranking - relational dbms. Web site: DB-engines ranking, 2024. [Online]. Available on: https://db-engines.com/en/ranking/relational+dbms (Retrieved: January 2024).
SolidIT. Db-engines ranking - document store dbms. Web site: DB-engines ranking, 2024. [Online]. Available on: https://db-engines.com/en/ranking/document+store (Retrieved: January 2024).
SolidIT. Db-engines ranking - wide-column store dbms. Web site: DB-engines ranking, 2024. [Online]. Available on: https://db-engines.com/en/article/Wide+Column+Stores (Retrieved: December 2024).
C. Nance, T. Losser, R. Iype, and G. Harmon. Nosql vs rdbms - why there is room for both. In Proceedings of the Southern Association for Information Systems Conference, pages 111-116, 2013.