Pavan Sai Kagitha
PROFILE SUMMARY
5+ years of experience in Linux and High Performance Computing (HPC), with extensive hands-on
experience deploying high-performance clusters and parallel storage (Lustre).
Installation of Ubuntu and RedHat Linux operating systems.
Configuration, OS patching, and updates on Ubuntu.
Hands-on experience in Linux administration, including user management, permissions, and system security.
Creating Volume Groups (VGs), Logical Volumes (LVs), and extending them using LVM.
Familiarity with job schedulers like OpenPBS, SLURM, and LSF.
Knowledge of xCAT and Bright Cluster Manager.
High Performance Computing (HPC).
Working with the application team for exporting and mounting shared volumes.
Performing racking, stacking, cabling, and labeling of servers in a data-center environment.
Installing Operating Systems (OS), configuring network switches, and setting up IP addresses.
Applying BIOS updates, firmware updates, and configuring server hardware.
Working on Ubuntu OS hardening projects for security and compliance.
Implementing security best practices to protect systems from vulnerabilities.
Ensuring all systems meet security compliance standards for enterprise environments.
Implemented OS hardening techniques on Ubuntu 22.04, ensuring compliance with security standards.
Audited and enforced secure user authentication, password policies, and privilege escalation controls, including PAM
module configurations.
Configured UFW/iptables to block unwanted traffic and secure network connections.
Applied sysctl configurations to secure kernel parameters and prevent exploits.
WORKING EXPERIENCE
HPC Administrator
TIS labs (p) ltd • Bangalore
01/2022 – Present
PROJECTS :
1. Jaypee IIT (Noida)
• Cluster Manager - xCAT
• Job Scheduler - SLURM
• Total Compute nodes – 16
Roles and Responsibilities :
* Setting up servers and configuring them according to customer requirements, ensuring a smooth deployment process at
the customer location.
* Maintaining and administering high-performance computing servers, including installing and configuring servers
from scratch at the customer site when required.
* Remotely managing HPC clusters, providing ongoing support, configuring various components, and performing regular
maintenance tasks to ensure optimal performance.
* Installing and maintaining Linux distributions such as CentOS, RedHat, and Ubuntu, ensuring they are up-to-date and
securely configured.
* Installing and configuring open-source software packages on HPC clusters, tailored to meet the specific requirements of
our customers.
* Maintaining system security by applying regular software updates, patches, and security configurations to protect
against potential vulnerabilities.
* Hands-on experience installing InfiniBand drivers, NVIDIA drivers, the CUDA Toolkit, and cuDNN libraries to enable
efficient use of hardware resources in HPC environments, and managing software versions using Environment Modules.
* Troubleshooting and resolving issues raised by customers, ensuring timely and effective solutions.
* Providing training on Linux systems and software usage for end users.
2. DRDL (Hyderabad)
• Cluster Manager - Bright Cluster Manager
• Job Scheduler - PBS Pro
• Total Compute nodes - 1241
Roles and Responsibilities :
* Setting up servers and configuring them according to customer requirements, ensuring a smooth deployment process at
the customer location.
* Maintaining and administering high-performance computing servers, including installing and configuring servers
from scratch at the customer site when required.
* Remotely managing HPC clusters, providing ongoing support, configuring various components, and performing regular
maintenance tasks to ensure optimal performance.
* Installing and maintaining Linux distributions such as CentOS, RedHat, and Ubuntu, ensuring they are up-to-date and
securely configured.
* Installing and configuring open-source software packages on HPC clusters, tailored to meet the specific requirements of
our customers.
3. CEAT (Bangalore)
• Cluster Manager - xCAT
• Job Scheduler - OpenPBS
• Total Compute nodes – 4
Roles and Responsibilities :
* Remotely managing HPC clusters, providing ongoing support, configuring various components, and performing regular
maintenance tasks to ensure optimal performance.
* Installing and maintaining Linux distributions such as CentOS, RedHat, and Ubuntu, ensuring they are up-to-date and
securely configured.
* Installing and configuring open-source software packages on HPC clusters, tailored to meet the specific requirements of
our customers.
* Maintaining system security by applying regular software updates, patches, and security configurations to protect
against potential vulnerabilities.
4. Capgemini (Bangalore)
• Cluster Manager - xCAT
• Job Scheduler - OpenPBS
• Total Compute nodes – 14
Roles and Responsibilities :
* Setting up servers and configuring them according to customer requirements, ensuring a smooth deployment process at
the customer location.
* Maintaining and administering high-performance computing servers, including installing and configuring servers
from scratch at the customer site when required.
* Remotely managing HPC clusters, providing ongoing support, configuring various components, and performing regular
maintenance tasks to ensure optimal performance.
* Installing and maintaining Linux distributions such as CentOS, RedHat, and Ubuntu, ensuring they are up-to-date and
securely configured.
5. HONDA (Hyderabad)
• Cluster Manager - xCAT
• Job Scheduler - OpenPBS
• Total Compute nodes – 16
Roles and Responsibilities :
* Setting up servers and configuring them according to customer requirements, ensuring a smooth deployment process at
the customer location.
* Maintaining and administering high-performance computing servers, including installing and configuring servers
from scratch at the customer site when required.
* Remotely managing HPC clusters, providing ongoing support, configuring various components, and performing regular
maintenance tasks to ensure optimal performance.
* Installing and maintaining Linux distributions such as CentOS, RedHat, and Ubuntu, ensuring they are up-to-date and
securely configured.
Linux Admin
I Weave solutions Pvt Ltd • Bangalore
12/2019 - 09/2021
JOB ROLES AND RESPONSIBILITIES:
* Hands-on experience in Linux administration on Ubuntu, CentOS, and RedHat Linux.
* Creating volume groups and logical volumes, and extending them using LVM.
* Working with the application team to export and mount shared volumes.
* Software installation on Linux using apt, yum, and rpm.
* Installation and configuration of web servers: Apache/HTTPD and NGINX.
* Installing software packages, and managing and verifying the integrity of installed packages using RPM and a local
YUM repository.
* Remote system administration using tools like SSH
* Experience in upgrading and configuring RedHat Linux 5.x and 6.x servers, including interactive installation.
* User administration: creating, deleting, modifying, locking, and unlocking user accounts, and managing groups.
* Troubleshooting performance issues such as high CPU utilization, swap issues, and memory issues.
* Collecting and filtering logs using head, tail, more, less, grep, find, awk, sed, etc.
* Installation of Java tools and technologies.
* Basic knowledge of shell scripting.
* Use of monitoring tools such as ServiceNow.
* Working on file servers such as NFS, Samba, and FTP.
* Working on network services such as DHCP and DNS.
* Working on backup tools such as zip, tar, and bzip2.
Skills
HPC, Linux, Ubuntu, GitHub, Docker, Network, Data Center Experience, Redhat
Education
B Tech
Gudlavalleru Engineering college • Gudlavalleru
08/2019
Languages
English, Telugu