0% found this document useful (0 votes)
113 views5 pages

Data Scientist + Agentic AI

The document outlines job descriptions for two roles: Data Scientist and Senior Lead Data Scientist, focusing on advanced machine learning and AI solutions. Key responsibilities include developing and deploying machine learning models, collaborating with cross-functional teams, and leveraging Generative AI technologies. Required qualifications include expertise in Python, cloud platforms, and various AI frameworks, with a strong emphasis on communication and analytical skills.

Uploaded by

vjbtp2018
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
113 views5 pages

Data Scientist + Agentic AI

The document outlines job descriptions for two roles: Data Scientist and Senior Lead Data Scientist, focusing on advanced machine learning and AI solutions. Key responsibilities include developing and deploying machine learning models, collaborating with cross-functional teams, and leveraging Generative AI technologies. Required qualifications include expertise in Python, cloud platforms, and various AI frameworks, with a strong emphasis on communication and analytical skills.

Uploaded by

vjbtp2018
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 5

Job description

Design, implement, and evaluate data-driven solutions using advanced machine


learning and Agentic AI frameworks.

Work on LLM-based architectures, including prompt engineering, memory systems, and


agent orchestration (e.g., LangChain, AutoGPT).
Role: Data Scientist
Industry Type: IT Services & Consulting
Department: Data Science & Analytics
Employment Type: Full Time, Permanent
Role Category: Data Science & Machine Learning
Job description
The candidate should have expertise in Hands-on experience in developing and
deploying machine learning models into production, ensuring scalability and reliability,
handling data, financial/risk modeling, anomaly detection, and cloud environments
(preferably Azure). Strong programming skills in Python, and experience with SQL for
data manipulation.
Excellent oral and written communication skills to effectively engage with team
members, stakeholders, and end-users.

Roles and Responsibilities:

 Develop, enhance, and support assigned applications, ensuring seamless


functionality and high performance.
 Take full ownership of assigned projects or modules, providing end-to-end
solutions from development to deployment.
 Participate in client engagements to enhance existing ML models, develop
advanced models for business needs, and design experiments for test/control
analysis and business process trials.
 Collaborate with cross-functional teams on AI/ML, knowledge discovery, data
modeling, and analytics to address business challenges.
 Build, deploy, and maintain predictive ML models, ensuring production readiness
and optimal performance.
 Expertise in building multi class classification models, handling high dimensional
data.
 Skilled in working within big data environments utilizing the capabilities of
PySpark.
 Expertise in leveraging frameworks such as OpenAI, LLama, Langchain, and
Langraph to develop robust, scalable AI solutions.
 Proficient in text embeddings, document parsing, and advanced natural language
processing (NLP) techniques.
 Experience in designing and implementing agent-based systems for workflow
automation.
 Well-versed in autonomous agents and orchestration frameworks.
 Design and execute experiments such as A/B testing or DOE (Design of
Experiments) to validate models and hypotheses.
 Write and execute data extraction algorithms to collect data from primary or
secondary sources and define data requirements for analysis.
 Troubleshoot issues, monitor performance, and optimize deployed models and
processes for continuous improvement.
 Gain expertise in the organizations data framework and its alignment with
business processes. Recommend any additional data needs and coordinate
with data engineers to ensure data suitability.
 Apply advanced data analytics and strategies to optimize statistical efficiency
and model quality.
 Interpret data and analyze results using analytical techniques, providing
actionable insights and regular reports to stakeholders.
 Identify, analyze, and interpret trends or patterns in complex data sets to
uncover actionable insights.
 Train and develop machine learning models, run evaluation experiments,
refine/test/validate models, and ensure robust deployment in production
environments using FasAPI framework.
 Work with management and stakeholders to prioritize business and informational
needs for data-driven solutions.
 Stay current with industry trends, share best practices, and provide subject
matter expertise within the team.
 Identify new opportunities for process improvement using statistical testing,
predictive modeling, and data-driven methodologies.
 Adhere to company policies, procedures, and security requirements while
maintaining compliance standards.
Qualifications Required / Desired Skills:

 Degree in one or more quantitative discipline Operations Research, Stats, Math,


Comp Sci, Engineering, Economics or similar
 6-9 years of experience in developing a variety of machine learning models
algorithms in a commercial environment with a track record of creating
meaningful business impact.
 Proficient with PySpark, NoSQL and Python and distributed programming
 Expertise working in MongoDB, Databricks and cloud computing platforms
(Azure), or equivalent on-premise platform and deployment
 provide hands-on technical guidance; conduct code reviews.
 Ability to simultaneously coordinate and track multiple deliverables, tasks and
dependencies across multiple stakeholders / business areas.
 Experience in client engagements, interpreting client s business challenges, and
recommendations for statistical analysis solutions (i.e. analytical consulting
and solution design)
 Experience in presentation design, development, delivery, and communication
skills to present analytical results and recommendations for action-oriented
data driven decisions and associated operational and financial impacts.
 Experience in Gen AI, LLM Workflow, Graph RAG etc. is an added advantage.
 Familiarity with visualization tools like PowerBI and Tableau is an added
advantage.
 Flexible to work from office 3 days(in a week) from 12:30pm to 9:30pm

Senior Lead Data Scientist


Job description
Job Overview:
The Senior Lead Data Scientist is a senior technical leader responsible for designing,
developing advanced data and AI solutions, with a strategic focus on leveraging
Generative AI technologies (e.g., large language models, diffusion models, multi-modal
systems) to solve complex business problems. This role combines deep expertise in
data science, machine learning, and AI architecture with a strong understanding of
product strategy, cross-functional leadership, and ethical AI deployment.

Key Responsibilities:
 Design and implement advanced solutions utilizing Large Language Models
(LLMs).
 Demonstrate self-driven initiative by taking ownership and creating end-to-end
solutions.
 Conduct research and stay informed about the latest developments in generative
AI and LLMs.
 Develop and maintain code libraries, tools, and frameworks to support generative
AI development.
 Participate in code reviews and contribute to maintaining high code quality
standards.
 Possess strong analytical and problem-solving skills.
 Demonstrate excellent communication skills and the ability to work effectively in
a team environment.
Skills & Qualifications:
Must Have Skills:
 7 to 12 years of experience in IT
 Natural Language Processing (NLP): Hands-on experience in use case
classification, topic modeling, Q&A and chatbots, search, Document AI,
summarization, and content generation.
 Computer Vision and Audio: Hands-on experience in image classification, object
detection, segmentation, image generation, audio, and video analysis.
 Generative AI: Proficiency with SaaS LLMs, including Lang chain, llama index,
vector databases, Prompt engineering (COT, TOT, ReAct, agents). Experience
with Azure OpenAI, Google Vertex AI, AWS Bedrock for text/audio/image/video
modalities.
 Familiarity with Open-source LLMs, including tools like TensorFlow/Pytorch and
huggingface. Techniques such as RLHF.
 Cloud: Hands-on experience with cloud platforms such as Azure, AWS, and GCP.
 Application Development: Proficiency in Python, Docker, FastAPI/Django/Flask,
and Git.
Tech Skills :
Machine Learning (ML) & Deep Learning
 Solid understanding of supervised and unsupervised learning.
 Proficiency with deep learning architectures like Transformers, LSTMs, RNNs, etc.
Generative AI:
 Hands-on experience with models such as OpenAI, Gemini etc.
 Knowledge of optimizing large language models (LLMs) for specific tasks.
Natural Language Processing (NLP):
 Expertise in NLP techniques, including text preprocessing, tokenization,
embeddings, and sentiment analysis.
 Familiarity with NLP tasks such as text classification, summarization, translation,
and question-answering.
Retrieval-Augmented Generation (RAG):
 In-depth understanding of RAG pipelines, including knowledge retrieval
techniques like dense/sparse retrieval.
 Experience integrating generative models with external knowledge bases or
databases to augment responses.
Search and Retrieval Systems:
 Experience with building or integrating search and retrieval systems, leveraging
knowledge of Elasticsearch, AI Search, ChromaDB etc.
Prompt Engineering:
 Expertise in crafting, fine-tuning, and optimizing prompts to improve model
output quality and ensure desired results.
 Understanding how to guide large language models (LLMs) to achieve specific
outcomes by using different prompt formats, strategies, and constraints.
 Knowledge of techniques like few-shot, zero-shot, and one-shot prompting, as
well as using system and user prompts for enhanced model performance.
Programming & Libraries:
 Proficiency in Python and libraries such as PyTorch, Hugging Face, etc.
 Knowledge of version control (Git), cloud platforms (AWS, GCP, Azure).
APIs & Integration:
 Ability to work with RESTful APIs and integrate generative models into
applications.
Evaluation & Benchmarking:
 Strong understanding of metrics and evaluation techniques for generative
models.
Good to Have Skills:
 Advanced Degree: Master s degree in computer science or relevant field.
 Life Sciences Experience: Experience in Life sciences/Healthcare Industry
 Azure Certification: Azure Cloud experience/certification.
 Experience with Multi-modal AI models (text-to-image, text-to-video, speech
synthesis, etc.).
 Knowledge of Knowledge Graphs and Symbolic AI .
 Understanding of MLOps and LLMOps for deploying scalable AI solutions.
Role: Data Scientist
Industry Type: Pharmaceutical & Life Sciences
Department: Data Science & Analytics
Employment Type: Full Time, Permanent
Role Category: Data Science & Machine Learning

You might also like