Koushik Biswas Data Engineer
[email protected] | +91 6290664570 | Kalyani, West Bengal
HackerRank | GitHub | Linkedin | Portfolio
EDUCATION
WEST BENGAL STATE UNIVERSITY March 2018 - July 2021
Mathematics Bachelors of Science Barasat, West Bengal
CGPA: 8.54
ST MARY TECHNICAL CAMPUS March 2024 - September 2024
AI - Business Intelligence Analyst Apprentice Barasat, West Bengal
EXPERIENCE
KINGSTON EDUCATIONAL INSTITUTE | Data Analyst Barasat, West Bengal | October 2024 – Present
Designed, developed, and maintained ETL pipelines for seamless data extraction, transformation, and
loading (ETL) to enhance data accessibility and efficiency.
Created custom dashboards in Power BI to provide real-time insights into student transactions, academic
performance, and financial data.
Developed automated functions for generating manual Excel reports, improving operational efficiency
and reducing manual workload.
Managed and analyzed student transaction data, ensuring data integrity and optimizing reporting
processes.
Built OCR-based data extraction and transformation functions to digitize and process handwritten and
scanned documents.
Automated student fee assignment and collection processes using Selenium, significantly reducing
manual effort and processing time.
REMOTASK | Data Scientist San Francisco, California | August 2023 – March 2024
• Reviewed and enhanced LLM-generated code for data science, machine learning, etc.
• Improved user prompt accuracy and increasing code generation efficiency by 30%.
• Corrected errors in 200+ code snippets, ensuring functionality and alignment with user prompt.
• Performed 500+ data labeling tasks, including image tagging 3D annotation, with 98% accuracy,
improving AI model performance by 25%.
K12 TECHNO SERVICE PRIVATE LIMITED | Market Research Analyst Barasat, West Bengal | January
2023 – January 2024
• Analyzed calling and marketing data from 10+ sources, ensuring data accuracy and consistency through
comprehensive cleaning and transformation.
• Applied data modeling techniques to segment over 1,000 customers, identifying key target audience
and improving marketing strategies.
• Delivered data-driven insights in lead conversion rates and optimized campaign effectiveness by 15%.
SKILLS
Programming Languages Python, Html, Css, Java Script, Flutter
Libraries/Frameworks Scikit-learn, PyTorch, TensorFlow, Pandas, Matplotlib, Selenium, OpenCV,
OpenAI, Django, Flask, tesseract, pillow, Numpy, PySpark, Dax, Vba, Macro
Tools / Platforms Git, Vs Code, Docker, Power BI, Excel, Databricks, Kafka, Apache Airflow,
Google Cloud Service(GCP), Microsoft Azure ADF
Databases MySQL, SQL Lite, Delta Lake, Big Query, Amazon S3, Redshift
PROJECTS / OPEN-SOURCE
CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING | Link K- Means, Python , Google
Colab, Machine Learning, Github
• Performed customer segmentation using K-Means clustering to categorize customers based on
purchasing behavior, demographics, and engagement data.
• Applied data preprocessing techniques to ensure accurate clustering and identified key customer
segments, improving targeted marketing strategies.
• Resulted in a 20 % increase in campaign effectiveness and optimized resources allocation.
MYSQL PROJECT THEMEPARK | Link MySQL
• Developed a theme park management system using MySQL to optimize ticket booking, ride
management, and customer and employee data storage.
• Created efficient SQL queries for real-time data retrieval and designed relational databases for customer,
ride and ticket information.
• Improved operational efficiency by 25%, supporting smooth transaction and data integrity for better
decision-making.
CUSTOMER CHURN ANALYSIS-BCG DATA SCIENCE | Link Python,Pandas,Numpy,Matplotlib
• Completed a customer churn analysis simulation for XYZ Analytics, demonstrating advanced data
analytics skills, identifying essential client data and outlining a strategic investigation approach.
• Conducted efficient data analysis using Python, including Pandas and NumPy. Employed data
visualization techniques for insightful trend interpretation.
• Completed the engineering and optimization of a random forest model, achieving an 85% accuracy rate
in predicting customer churn.
• Completed a concise executive summary for the Associate Director, delivering actionable insights for
informed decision-making based on the analysis.
CERTIFICATIONS
• Accenture North America Data Analytics and Visualization Job Simulation on Forage - Forage
• BCG Data Science Job Simulation on Forage - Forage
• Foundations: Data, Data, Everywhere - Coursera
• Machine Learning using Python - Simplilearn