DataEngineer Resume
DataEngineer Resume
[email protected]|5716342774 | GitHub
Results-driven Data Engineer with 5+ years of experience architecting and optimizing scalable data solutions.
Proven expertise in ETL workflows, advanced data modeling, and predictive analytics, transforming raw data into
actionable insights that drive business growth. Skilled in Python, SQL, R, and Tableau, with a strong background in
cloud technologies (Google Cloud, AWS) to enhance data processing and infrastructure scalability. Adept at
leveraging automation, big data frameworks, and visualization tools to empower decision-making.
Skills
Programming: Python, SQL, R, PL/SQL, Unix Scripting
Big Data & Cloud: Google Cloud, AWS, Snow ake, Apache Spark, Kafka, BigQuery, RedShift, Pub/Sub.
Databases: Cassandra, Oracle, Postgres, BigQuery, S3, CloudSQL
Generative AI / Deep Learning: Hugging face, LangChain, GPT, NLP, tensor ow, Keras, RAG, Vector databases
ETL & Data Engineering: Apache Air ow, DBT, PySpark, Snow ake, SQLAlchemy
DevOps & CI/CD: Docker, Kubernetes, Jenkins, Git, GitLab
Data Visualization & BI: Tableau, Power BI, Matplotlib, Seaborn, Plotly, GGplot2, Lea et
Machine Learning & Analytics: Predictive Modeling, NLP, Regression, A/B Testing, Segmentation.
Software Development & APIs: Django, Micro services, REST APIs, Databases
Experience
Data Engineer @ USBank May 2022 - November 2024
• Designed and optimized ETL workflows leveraging Cloud Storage, Data Flow and BigQuery, reducing data
ingestion latency by 30% and improving downstream processing efficiency.
• Built and managed ETL pipelines with PySpark, DBT and Apache Airflow, orchestrating large-scale data
transformations across distributed computing environments, ensuring seamless scalability and reliability.
• Automated ETL workflows, improved data modeling and financial reporting dashboards using DBT and Tableau,
streamlining analysis for senior executives and field teams.
• Utilized Google Cloud Database Migration Service to seamlessly migrate Relational databases, ensuring minimal
downtime, data integrity, and schema optimization.
• Automated deployment and monitoring of containerized data pipelines using Kubernetes and Docker, reducing
manual intervention by 40% and increasing system resilience.
• Developed predictive models using Python, and NLP by analyzing historical incident patterns to proactively
identify potential SLA breaches. This reduced SLA violations by 20% and improved system reliability.
• Monitored CI/CD pipelines using Jenkins to streamline software releases, reducing deployment failures and
improving development workflow.
Data Analyst @ Reliance Digital January 2019 - June 2021
• Optimized inventory management strategies by conducting sensitivity analyses in Python and SQL, identifying
demand uctuations, and reducing operational costs by $150K during the pandemic.
• Conducted regression analysis on multi-channel marketing data, identifying high-ROI advertising channels,
leading to a 25% increase in overall campaign effectiveness.
• Designed and optimized PL/SQL stored procedures and functions to automate complex data transformations,
improving query performance and reducing ETL processing time by 40%.
• Performed A/B testing using Python and causal inference to measure the impact of different promotional
strategies, increasing customer engagement and boosting conversion rates by 15%.
• Developed interactive dashboards in Power BI, visualizing sales trends, supply chain inef ciencies, and customer
purchase behavior, enabling executives to make informed data-driven decisions.
Education
Saint Francis College, Brooklyn, NYC. Master’s in Information Technology. January 2023 - September 2024 | 3.5 GPA
George Mason University, Fairfax, VA. Master's in Data Analytics Engineering August 2021 - May 2023 | 3.2 GPA
GITAM University, Hyderabad, India Bachelor’s in Computer Science June 2017- June 2021 | 3.2 GPA
Certifications
Google Cloud Data Engineer | Graph Data Science | Applied Data Science
fl
fl
fl
fl
fl
fl
fi