TOP 21 DATA SCIENCE PROJECTS
✅ BEGINNERS ✅ INTERMEDIATE LEVEL ✅ ADVANCED LEVEL
www.cloudyml.com
For Beginners
Iris Flower Classification: Use the famous Iris dataset
to classify flowers into one of three species based on
their sepal and petal sizes.
Titanic Survival Prediction: Predict whether a
passenger on the Titanic survived based on
features like age, gender, and passenger class.
Handwritten Digit Recognition: Use the MNIST
dataset to classify handwritten digits from 0 to 9 using
basic neural networks.
Movie Recommendation System: Build a basic
recommendation system that suggests movies based
on user preferences using collaborative filtering.
Sales Forecasting: Predict future sales for a retail
store using time series analysis or linear regression.
Spam Email Detector: Classify emails as spam or not
spam based on their content using natural language
processing techniques.
Wine Quality Prediction: Predict the quality of wine
based on its chemical properties using regression
techniques.
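As one illustration of the beginner projects above, the Sales Forecasting idea can be sketched with a plain least-squares line. This is a minimal sketch under stated assumptions: the monthly figures are made-up toy data, and the helper names `fit_line` and `forecast` are hypothetical, not part of any library. A real project would more likely use pandas and scikit-learn or a dedicated time series model.

```python
# Toy monthly sales figures (hypothetical data, not from a real store).
months = [1, 2, 3, 4, 5, 6]
sales = [110.0, 120.0, 130.0, 140.0, 150.0, 160.0]


def fit_line(xs, ys):
    """Return (slope, intercept) of the ordinary least-squares line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope = covariance(x, y) / variance(x), both left unnormalized.
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return slope, intercept


def forecast(month, slope, intercept):
    """Predict sales for a given month index using the fitted line."""
    return slope * month + intercept


slope, intercept = fit_line(months, sales)
next_month_sales = forecast(7, slope, intercept)
print(f"slope={slope:.2f}, intercept={intercept:.2f}, "
      f"month 7 forecast={next_month_sales:.2f}")
```

A straight line ignores seasonality and promotions, so it is best treated as a baseline to beat with a proper time series model.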
For Intermediate Level
Sentiment Analysis: Analyze the sentiment of movie
reviews or tweets using natural language processing.
Image Captioning: Generate captions for images using
convolutional neural networks (CNN) and recurrent
neural networks (RNN).
Stock Price Prediction: Use historical stock price data
to predict future prices using LSTM networks.
Credit Card Fraud Detection: Detect fraudulent
transactions using anomaly detection techniques.
Customer Segmentation: Segment customers based
on their purchasing behavior using clustering
techniques like K-means.
Object Detection: Detect and classify objects in
images using techniques like Faster R-CNN or YOLO.
Chatbot Development: Build a chatbot that can
answer frequently asked questions using sequence-to-
sequence models.
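As one illustration of the intermediate projects above, the Customer Segmentation idea can be sketched with a bare-bones K-means loop. This is only a sketch under stated assumptions: the customer records are invented, the function names `nearest` and `k_means` are hypothetical, and the centroids are seeded with the first `k` points to keep the run deterministic. In practice, features should be scaled first and a library implementation with better initialization is usually preferable.

```python
import math

# Hypothetical customers as (annual spend, visits per month).
customers = [
    (200.0, 2.0), (220.0, 3.0), (210.0, 2.5),     # low spenders
    (900.0, 10.0), (950.0, 12.0), (920.0, 11.0),  # high spenders
]


def nearest(point, centroids):
    """Index of the centroid closest to point (Euclidean distance)."""
    distances = [math.dist(point, c) for c in centroids]
    return distances.index(min(distances))


def k_means(points, k, iterations=10):
    """Plain K-means; the first k points seed the centroids (a simplification)."""
    centroids = list(points[:k])
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        labels = [nearest(p, centroids) for p in points]
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, label in zip(points, labels) if label == j]
            if members:
                centroids[j] = tuple(
                    sum(axis) / len(members) for axis in zip(*members)
                )
    return labels, centroids


labels, centroids = k_means(customers, k=2)
for customer, label in zip(customers, labels):
    print(customer, "-> segment", label)
```

The fixed iteration count stands in for a proper convergence check, which is enough for a toy dataset this small.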
For Advanced Level
Neural Style Transfer: Implement neural style
transfer to apply the artistic style of one image to
another using deep learning.
Face Recognition System: Build a system that can
recognize and identify faces using deep learning
techniques.
Generative Adversarial Networks (GANs): Train a GAN to
generate new images or data that resemble a given dataset.
Reinforcement Learning for Game Playing: Train an
agent to play a game (like Chess or Go) using
reinforcement learning techniques.
Medical Image Analysis: Detect diseases or anomalies
in medical images (like X-rays or MRIs) using deep
learning.
Speech Recognition System: Build a system that can
convert spoken language into text using deep neural
networks.
Predictive Maintenance: Predict when a machine or
system will fail so that maintenance can be performed
just in time, using time series analysis, deep learning,
and anomaly detection.
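As one illustration of the advanced projects above, the anomaly detection part of Predictive Maintenance can be sketched with a rolling z-score. This is a deliberately simple stand-in, not a deep learning model: the sensor readings are invented, the spike is planted by hand, and the function name `rolling_anomalies` plus its `window` and `threshold` defaults are assumptions chosen for the example.

```python
from statistics import mean, stdev

# Hypothetical vibration readings from a machine sensor; the spike is planted.
readings = [1.0, 1.1, 0.9, 1.0, 1.05, 0.95, 1.0, 1.1, 3.5, 1.0]


def rolling_anomalies(values, window=5, threshold=3.0):
    """Return indices whose value deviates from the mean of the preceding
    `window` readings by more than `threshold` sample standard deviations."""
    flagged = []
    for i in range(window, len(values)):
        history = values[i - window:i]
        mu = mean(history)
        sigma = stdev(history)
        # Skip flat windows to avoid comparing against a zero spread.
        if sigma > 0 and abs(values[i] - mu) > threshold * sigma:
            flagged.append(i)
    return flagged


anomalies = rolling_anomalies(readings)
print("Anomalous reading indices:", anomalies)
```

A full project would go further and predict time to failure from such signals, for example with an LSTM trained on labeled failure histories.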
BONUS
30 FREE Dataset Sources to Use
for Data Science Projects
1. US Government Dataset: https://www.data.gov/
2. Open Government Data (OGD) Platform India: https://data.gov.in/
3. The World Bank Open Data: https://data.worldbank.org/
4. Data.world: https://data.world/
5. BFI - Industry Data and Insights: https://www.bfi.org.uk/data-statistics
6. The Humanitarian Data Exchange (HDX): https://data.humdata.org/
7. Data at World Health Organization (WHO): https://www.who.int/data
8. FBI’s Crime Data Explorer: https://crime-data-explorer.fr.cloud.gov/
9. AWS Open Data Registry: https://registry.opendata.aws/
10. FiveThirtyEight: https://data.fivethirtyeight.com/
11. IMDb Datasets: https://www.imdb.com/interfaces/
12. Kaggle: https://www.kaggle.com/datasets
13. UCI Machine Learning Repository: https://archive.ics.uci.edu/ml/index.php
14. Google Dataset Search: https://datasetsearch.research.google.com/
15. Nasdaq Data Link: https://data.nasdaq.com/
16. Recommender Systems and Personalization Datasets: https://cseweb.ucsd.edu/~jmcauley/datasets.html
17. Reddit - Datasets: https://www.reddit.com/r/datasets/
18. Open Data Network by Socrata: https://www.opendatanetwork.com/
19. Climate Data Online by NOAA: https://www.ncdc.noaa.gov/cdo-web/
20. Azure Open Datasets: https://azure.microsoft.com/en-us/services/open-datasets/
21. IEEE Data Port: https://ieee-dataport.org/
22. Wikipedia: Database: https://dumps.wikimedia.org/
23. BuzzFeed News: https://github.com/BuzzFeedNews/everything
24. Academic Torrents: https://academictorrents.com/
25. Yelp Open Dataset: https://www.yelp.com/dataset
26. The NLP Index by Quantum Stat: https://index.quantumstat.com/
27. Computer Vision Online: http://www.computervisiononline.com/dataset
28. Visual Data Discovery: https://www.visualdata.io/
29. Roboflow Public Datasets: https://public.roboflow.com/
30. Computer Vision Group, TUM: https://vision.in.tum.de/data/datasets
If you want to build a career in data
science and analytics but don’t know
where to start, check out our courses.
You will get a complete hands-on, practical
learning experience from scratch, with
industry projects, an internship, and a
placement guarantee.
Learn from the best at the
most affordable price.
Visit www.cloudyml.com
Mr. Akash Raj
Founder & CEO - CloudyML
Data scientist with 4+ years of experience