There are several software tools that are available to facilitate EDA. Here, we are going to outline some of the open source tools:
- Python: This is an open source programming language widely used in data analysis, data mining, and data science (https://www.python.org/). For this book, we will be using Python.
- R programming language: R is an open source programming language that is widely utilized in statistical computation and graphical data analysis (https://www.r-project.org).
- Weka: This is an open source data mining package that involves several EDA tools and algorithms (https://www.cs.waikato.ac.nz/ml/weka/).
- KNIME: This is an open source tool for data analysis and is based on Eclipse (https://www.knime.com/).