Raj Sharma
Phone No.: +91-6266469816
E-mail: [email protected]
LinkedIn: http://linkedin.com/in/raj-sharma-2108
SUMMARY:
Results-driven data engineer with 3+ years of experience designing and building
efficient data pipelines. Skilled in data migration, automation, data warehousing, and
analysis. Proficient in cloud technologies. Delivers high-quality data products and
insights to drive business decisions.
TECHNICAL SKILLS:
● Programming Languages - Python, SQL
● Cloud Platforms - GCP, AWS (basics)
● Tools & Technologies - Airflow, Git, JIRA, VS Code, BigQuery, Dataflow
● Data Engineering - Data Migration, Data Warehousing, Automation, Data
Pipelines, ETL/ELT, Big Data
EXPERIENCE:
Data Engineer (DE) - Quantiphi Analytics | July 2022 - Present
● Achieved 70% cost reduction by automating report generation processes.
● Led multiple successful cloud migration projects to diverse cloud
environments.
● Collaborated with Google as a data engineer on innovative projects.
● Conducted internal and client training sessions on cloud technologies.
● Oversaw a team of data engineers and earned client recognition for
exceptional work.
Data Engineer Intern - Quantiphi Analytics | January 2022 - June 2022
● Acquired data engineering skills in Python, GCP, AWS, SQL, and Git.
● Built a data pipeline to migrate data from file storage to a data warehouse.
● Gained hands-on experience by shadowing senior engineers on data
engineering projects.
● Developed practical skills using data engineering tools and techniques.
Certifications:
● Google Cloud Certified Associate Cloud Engineer (GCP ACE).
● AWS Certified Solutions Architect - Associate (AWS SAA).
Projects:
GCS-BQ-CDC-Pipeline
● Purpose: Developed a robust data pipeline to efficiently ingest data from Google
Cloud Storage (GCS) buckets into BigQuery using a Change Data Capture
(CDC) approach.
● Technologies: Utilized Apache Airflow for workflow orchestration, Python for
data processing and transformation, SQL for data manipulation in BigQuery, and
Google Cloud Storage for data storage.
● Process: Implemented a scheduled pipeline that reads files from GCS buckets,
extracts relevant changes using CDC techniques, loads the data into BigQuery,
and moves the processed files to a designated folder.
● Fault Tolerance: Incorporated mechanisms to handle file processing failures,
moving files to a "failed files" folder and sending email alerts for immediate
attention.
● Efficiency: Leveraged CDC to minimize data transfer and processing, improving
pipeline performance and scalability.
Education:
Bachelor of Technology - S.G.S.I.T.S | 2018 - 2022
● Academic Performance: Achieved a CGPA of 7.4.
● Technical Event Organizer: Led and organized technical events during college,
demonstrating strong leadership and problem-solving skills.
Higher Secondary School - R.K.V.M | 2017-2018
● Academic Performance: Achieved 84.6% in PCM.
● Entrance Exam Success: Qualified for both JEE Main and JEE Advanced in
2018.
Senior Secondary School - St. Paul’s School | 2015-2016
● Academic Performance: Achieved a CGPA of 8.6.
Interests & Hobbies:
● Health & Wellness
● Travelling & Sightseeing
● Networking