Honors in Data Science
410504: Seminar
Topic:-AutoML
By:- Prajakta Vishwas Harihar
BE-IT (B190358529)
Vidya Pratishthan’s Kamalnayan Bajaj Institute of
Engineering & Technology ,Baramati
Contents
• Introduction to AutoML
• Benefits of AutoML
• AutoML Workflow
• Popular AutoML Tools
• Use Cases of AutoML
• Challenges and Limitations
• Conclusion
Introduction to AutoML
• Automated machine learning (AutoML) is the process of
applying machine learning (ML) models to real-world
problems using automation.
• It automates the selection, composition and parameterization
of ML models.
• Automating the machine learning process makes it more user-
friendly and often provides faster, more accurate outputs than
hand-coded algorithms.
Benefits of AutoML
• Improved Efficiency: Saves time and effort in manual tasks
• Accessibility: Allows non-experts to utilize machine learning
• Increased Accuracy: Leverages advanced algorithms and
techniques
• Scalability: Handles large datasets and complex models
• Reproducibility: Ensures consistent and reliable results
AutoML Workflow
• Data Preprocessing: Handling missing values, feature
selection, normalization, etc.
• Model Selection: Choosing appropriate models for the dataset
• Feature Engineering: Generating new features from existing
data
• Hyperparameter Optimization: Tuning model parameters
for optimal performance
• Model Training: Training the selected model on the dataset
• Model Evaluation: Assessing the model's performance using
various metrics
Popular AutoML Tools
• Auto-SKLearn
• MLBox
• TPOT
• PyCaret
• H20AutoML
• Google AutoML
Use Cases of AutoML
• Image Classification: Automated identification of objects in
images
• Natural Language Processing: Automated text classification,
sentiment analysis, etc.
• Time Series Analysis: Forecasting future trends and patterns
• Anomaly Detection: Identifying unusual patterns or outliers
• Customer Segmentation: Grouping customers based on
similar characteristics
Challenges and Limitations
• Data Quality: Dependence on high-quality, well-structured
data
• Interpretability: Black-box models may lack transparency
• Domain Expertise: Still requires domain knowledge for
problem formulation
• Resource Requirements: Computationally expensive for
complex tasks
• Overfitting: Possibility of models overfitting the data
Conclusion
• AutoML automates the process of building and tuning
machine-learning models. This method uses algorithms to
search the best model and hyperparameters rather than relying
on human expertise. AutoML includes increased efficiency
and the ability to handle large amounts of data. It can be useful
in the shortage of experienced machine learning
practitioners. However, there are also limitations to AutoML.
It can be computationally expensive and difficult to interpret
the results of the automated search process.