Yash Shah

Toronto, Ontario, Canada
6K followers 500+ connections

About

With 8+ years of experience in data science and machine learning, I specialize in leading…

Experience

  • Wrango
  • -

  • -

    Toronto, Ontario, Canada

  • -

    Canada

  • -

  • -

    Toronto, Ontario, Canada

  • -

  • -

    Toronto

  • -

  • -

    Ahmedabad, Gujarat, India

  • -

    Ahmedabad, Gujarat, India

Education

Licenses & Certifications

Volunteer Experience

  • Project intern

    Active Patterns

    - 4 months

    Economic Empowerment

    Assisted with the software development life cycle (SDLC), including requirements gathering, analysis, and specification writing.

    Created blogs for market outreach and product awareness.

  • Academic and Professional development team lead

    AMIGAS

    - 1 year 1 month

    Education

    Organized networking and professional development events by involving industry experts and professors

    Organized 19+ coffee breaks, networking sessions, and development events

  • First-year Representative

    Indian Graduate Students' Association, University of Toronto

    - 9 months

    Civil Rights and Social Action

  • Teacher

    BACHPAN - India

    Education

  • Core committee member

    ASHRAE India Chapter

    - 1 year 5 months

    Education

Patents

  • AIR EVACUATION SYSTEM FOR AUTOMOBILE

    Filed IN 201621028390 A

    A system developed to improve automobile comfort by removing hot air from the automobile cockpit.
    The system comprises a battery-operated evacuation fan that takes inputs from multiple sensors for complete automation of the system. It also includes a cutoff circuit to ensure proper utilization of the system.

Projects

  • Engineered data pipeline to connect GCP and data warehouse (labour market and economic data)

    Analyzed data anomalies using statistical methods, compared the data against other sources, and prepared visualizations for the queried data covering 50 states

    Created and scheduled a data pipeline (cron jobs) that fetches data from the FRED warehouse through its API into a GCP BigQuery table

    Reviewed Architecture and Generalized the pipeline to collect any data from the FRED warehouse using relevant queries
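
The fetch step of such a pipeline might look like the following minimal sketch. It assumes the public FRED `series/observations` endpoint with JSON output; the helper names, the `UNRATE` series ID, and the sample payload are illustrative assumptions rather than details from the project, and the BigQuery load and cron scheduling are left out.

```python
from urllib.parse import urlencode

FRED_OBSERVATIONS_URL = "https://api.stlouisfed.org/fred/series/observations"


def build_observations_url(series_id, api_key, start=None):
    """Build the request URL for one FRED series (hypothetical helper)."""
    params = {"series_id": series_id, "api_key": api_key, "file_type": "json"}
    if start:
        params["observation_start"] = start  # ISO date, e.g. "2020-01-01"
    return f"{FRED_OBSERVATIONS_URL}?{urlencode(params)}"


def parse_observations(payload, series_id):
    """Flatten a FRED JSON response into rows ready for a warehouse table."""
    rows = []
    for obs in payload.get("observations", []):
        # FRED marks missing observations with "." instead of a number.
        value = None if obs["value"] == "." else float(obs["value"])
        rows.append({"series_id": series_id, "date": obs["date"], "value": value})
    return rows


# Illustrative payload shaped like a FRED response (not real data).
sample_payload = {
    "observations": [
        {"date": "2020-01-01", "value": "3.5"},
        {"date": "2020-02-01", "value": "."},
    ]
}
rows = parse_observations(sample_payload, "UNRATE")
url = build_observations_url("UNRATE", "YOUR_API_KEY", start="2020-01-01")
```

Keeping the series ID as a parameter is one way to generalize the same code to any FRED series.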

  • Insurance prediction (web-deployed) on hospital premium data

    Developed a data processing pipeline to engineer 396 features from survey responses
    Tuned Lasso, random forest, and gradient boosting prediction models to perform well on unseen data

  • Recommendation system for movies using 100k Movielens dataset

    • Created a co-occurrence matrix using memory-based collaborative filtering in python
    • Implemented and optimized SVD, KNN for model-based collaborative filtering for movie recommendation
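
A co-occurrence recommender of the kind described above can be sketched in plain Python. This is an illustration under assumptions, not the project's code: the toy data, the function names, and the idea of treating each user's movies as a set of "liked" items are all hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def cooccurrence(user_items):
    """Count how many users have both movies of each pair in their set."""
    counts = defaultdict(int)
    for items in user_items.values():
        for a, b in combinations(sorted(set(items)), 2):
            counts[(a, b)] += 1
            counts[(b, a)] += 1  # keep the matrix symmetric
    return counts


def recommend(user_items, user, counts, k=3):
    """Rank unseen movies by their co-occurrence with the user's movies."""
    seen = set(user_items[user])
    scores = defaultdict(int)
    for (a, b), c in counts.items():
        if a in seen and b not in seen:
            scores[b] += c
    # Highest score first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda m: (-scores[m], m))[:k]


# Toy data: user -> movies they liked (illustrative only).
liked = {
    "u1": ["A", "B", "C"],
    "u2": ["A", "B"],
    "u3": ["B", "C", "D"],
    "u4": ["A"],
}
counts = cooccurrence(liked)
top = recommend(liked, "u4", counts)
```

For user `u4`, who has only movie A, the ranking favours B because A and B appear together for two users, while A and C appear together for one.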

  • Data analysis of Canada's economic recovery and curve-flattening efforts during COVID-19

    • Analyzed economic KPIs to assess the impact of COVID-19 on Canada and developed a web-based dashboard
    • Scraped Twitter and Indeed job data with Selenium to assess the pandemic's impact on job markets, using sentiment analysis and regression analysis respectively
    • Benchmarked 2008-09 S&P index data with the recent data to analyze trend similarities for forecasting

  • The NHS National Program for IT (NPfIT) project’s failure review and analysis

    • Investigated cost impacts due to scope creep, improper procurement management and stakeholder communication
    • Reviewed changes in the project scope, inefficiently delegated tasks and budget estimation impact on schedule

  • Technical Analysis indicators for asset performance and portfolio optimization

    • Implemented simple (SMA) and exponential (EMA) moving average trend indicators for performance analysis
    • Applied momentum indicator (MACD) including support and resistance for buy and sell indication in python
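
The indicators named above can be sketched in a few lines of plain Python. This is a minimal illustration, assuming the conventional 12/26/9 MACD spans and an EMA seeded with the first price; the project's actual parameters and its support and resistance logic are not given in the source.

```python
def sma(prices, window):
    """Simple moving average; one value per full window."""
    return [
        sum(prices[i - window + 1 : i + 1]) / window
        for i in range(window - 1, len(prices))
    ]


def ema(prices, span):
    """Exponential moving average with smoothing factor 2 / (span + 1)."""
    alpha = 2 / (span + 1)
    out = [prices[0]]  # assumption: seed with the first price
    for p in prices[1:]:
        out.append(alpha * p + (1 - alpha) * out[-1])
    return out


def macd(prices, fast=12, slow=26, signal=9):
    """Return (macd_line, signal_line); a buy hint is macd above signal."""
    macd_line = [f - s for f, s in zip(ema(prices, fast), ema(prices, slow))]
    signal_line = ema(macd_line, signal)
    return macd_line, signal_line


# Illustrative steadily rising series (not market data).
prices = [float(p) for p in range(1, 61)]
macd_line, signal_line = macd(prices)
```

On a steadily rising series the fast EMA stays above the slow EMA, so the MACD line is positive.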

  • Text review analysis for wine category classification using Apache Spark and Scala

    • Processed 130k text wine reviews in Apache Spark and Scala for wine variety classification
    • Modeled (76% Avg. F1 score) multiclass classifier on imbalanced data using Logistic Regression algorithm
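
On imbalanced data the way F1 is averaged matters, and the source does not say which average "Avg. F1" refers to. The sketch below, written in Python rather than the project's Scala, computes both the unweighted (macro) and the support-weighted average from per-class F1; the labels are illustrative.

```python
def per_class_f1(y_true, y_pred):
    """Return {label: (f1, support)} for every label that appears."""
    result = {}
    pairs = list(zip(y_true, y_pred))
    for label in sorted(set(y_true) | set(y_pred)):
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        result[label] = (f1, tp + fn)  # support = true count of the label
    return result


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1; every class counts equally."""
    scores = per_class_f1(y_true, y_pred)
    return sum(f1 for f1, _ in scores.values()) / len(scores)


def weighted_f1(y_true, y_pred):
    """Mean of per-class F1 weighted by each class's support."""
    scores = per_class_f1(y_true, y_pred)
    total = sum(n for _, n in scores.values())
    return sum(f1 * n for f1, n in scores.values()) / total


# Illustrative imbalanced labels: three of class "a", one of class "b".
y_true = ["a", "a", "a", "b"]
y_pred = ["a", "a", "b", "b"]
macro = macro_f1(y_true, y_pred)
weighted = weighted_f1(y_true, y_pred)
```

Here the weighted average comes out higher than the macro average because the majority class scores better than the minority class.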

  • Data-driven crime prediction

    Implemented a gradient boosting model in Python on LA crime data.
    Integrated an interactive map for data visualization using the folium library in Python.

  • Salary prediction of Kaggle survey data for data science community

    The dataset provided (mutiplechoiceResponses.csv) contains the survey results published by Kaggle. The results from 23,860 participants span 395 columns, each representing a survey question. Not every participant answered every question, and the responses contain various data types.
    Work involves:

    1. Data cleaning based on an understanding of the data, imputing missing values, encoding (label or one-hot encoding)
    2. Exploratory data analysis
    3. Feature selection using decision tree importance and dimensionality reduction with Principal Component Analysis
    4. Model implementation using linear regression with lasso and ridge regularization, random forest, and gradient boosting regressors
    5. Model tuning using grid search

    In short, implemented a machine learning model in Python that predicts compensation from the survey responses of people in the data science community.

  • Text and sentiment analysis for US airlines

    Analyzed public opinion about US airlines on Twitter using Python for text and sentiment analysis.
    Visualized the data using Wordcloud, Seaborn, and matplotlib.

    Steps involved:
    1. Data cleaning
    2. Exploratory analysis
    3. Feature selection and model implementation for classification
    4. Models used: SVM, logistic regression, random forest
    5. Hyperparameter tuning for better accuracy

  • Implemented quality assurance techniques for an API (active pharmaceutical ingredient) manufacturing process

    Improved identification of out-of-control points using SPC tools for batch production of the API and increased the confidence interval by 0.07% to mitigate risk.

    Analyzed the sensitivity of critical parameters using design of experiments, then verified and eliminated the cause of failure in the process.

    Adapted time-weighted charts in Minitab to detect smaller shifts for GMP.
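
One common time-weighted chart is the EWMA chart, which accumulates evidence across samples and so picks up small sustained shifts. The source does not say which time-weighted chart was used, and the work was done in Minitab; the Python sketch below is only an illustration, with the smoothing weight 0.2, the 3-sigma width, and the sample data chosen as assumptions.

```python
from math import sqrt


def ewma_chart(samples, target, sigma, lam=0.2, width=3.0):
    """Return (ewma, lower, upper, out_of_control) for each sample.

    target and sigma are the in-control mean and standard deviation;
    lam is the smoothing weight and width the limit multiple.
    """
    z = target  # the EWMA statistic starts at the process target
    points = []
    for i, x in enumerate(samples, start=1):
        z = lam * x + (1 - lam) * z
        # Exact time-varying limits; they widen toward a steady state.
        half = width * sigma * sqrt(lam / (2 - lam) * (1 - (1 - lam) ** (2 * i)))
        points.append((z, target - half, target + half, abs(z - target) > half))
    return points


# Illustrative data: five on-target samples, then a sustained 1.5-sigma shift.
samples = [10.0] * 5 + [11.5] * 10
flags = [flagged for _, _, _, flagged in ewma_chart(samples, 10.0, 1.0)]
first_signal = flags.index(True) + 1  # 1-based sample number
```

None of the shifted samples lies outside the plain 3-sigma band around the target, so a rule that looks only at individual points would not signal, yet the EWMA statistic crosses its limit a few samples after the shift starts.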

  • Digit classification using CNN

    A basic implementation of a convolutional neural network (CNN) for digit classification.

    The data set consists of 70,000 digit images of 28 x 28 pixels. Process:

    Dataset import and transformations
    Feature engineering
    Model creation
    Model implementation
    Model validation and tuning

Honors & Awards

  • Best Performer of The Department

    L.D. College of Engineering

    Awarded by Gujarat Technological University for continuous efforts in my research work and extracurricular activities at the university.
