HPC Engineer - R&D Global Medical, Tech Operations & Support
Bengaluru, Karnataka
Job ID: R0182631
Category: Data, Digital & Technology
Subcategory: Data, Digital and Technology
Business Unit: Corporate Functions
Job Type: Full time

By clicking the “Apply” button, I understand that my employment application process with Takeda will commence and that the information I provide in my application will be processed in line with Takeda’s Privacy Notice and Terms of Use. I further attest that all information I submit in my employment application is true to the best of my knowledge.
Job Description
HPC Engineer/Principal Analyst, Tech Operations and Support
The Future Begins Here
At Takeda, we are leading digital evolution and global transformation. By building innovative solutions and future-ready capabilities, we are meeting the needs of patients, our people, and the planet.
Bengaluru, India’s epicenter of innovation, has been selected to be home to Takeda’s recently launched Innovation Capability Center (ICC). We invite you to join our digital transformation journey. In this role, you will have the opportunity to boost your skills and become the heart of an innovative engine that is contributing to global impact and improvement.
At Takeda’s ICC we Unite in Diversity
Takeda is committed to creating an inclusive and collaborative workplace, where individuals are recognized for the backgrounds and abilities they bring to our company. We are continuously improving our collaborators’ journey at Takeda, and we welcome applications from all qualified candidates. Here, you will feel welcomed, respected, and valued as an important contributor to our diverse team.
Objectives/Purpose
The Principal Analyst, Technical Support will operate and support the Insight Lite HPC platform, built on AWS ParallelCluster (Slurm) with shared storage (EFS/FSx for Lustre) and integrated data science services (Posit Suite and JupyterHub). This role will support reliable researcher-facing services across Linux and HPC environments and contribute to monitoring and documentation, ensuring researcher support across the US and Japan regions.
Tasks and Responsibilities
AWS ParallelCluster HPC (Slurm Operations, Health, and Administration)
Maintain cluster health and ensure compute capacity.
HPC configuration (Slurm): Maintain partitions, job accounting, and resource allocation across AWS-based HPC resources.
User job support & triage: Support job submission, CPU/memory/GPU allocation, and queue policies. Diagnose job failures and troubleshoot pending, stalled, and cancelled jobs.
Monitoring: Monitor cluster health, login node status, queue depth, node states, and cluster resource saturation.
Cluster lifecycle: Manage ParallelCluster configuration; use CloudWatch logs and CloudFormation events to troubleshoot and resolve common provisioning failures.
Posit Suite + JupyterHub (Operations & User Support)
Keep Posit and JupyterHub services available for scientists; resolve publish/runtime issues.
Posit Workbench: Administer, configure, and upgrade Posit Workbench. Troubleshoot session launch errors, stuck sessions, resource limits, and runtime errors.
Posit Connect: Support deploying and publishing Shiny apps and other researcher-published content.
Posit Package Manager: Manage R repositories and curated sets. Support installation of compiled packages and dependency conflicts.
JupyterHub: Configure and support usage; troubleshoot user issues.
JupyterHub Python environments: Ensure Python environment consistency and conda/pip support. Resolve Python import failures, kernel crashes, and package conflicts.
Skills Required
Linux/Unix Administration
Day-to-day operations across instances, services, and shared storage.
Strong Linux fundamentals in production environments: Experience with Red Hat Enterprise Linux, Docker/Singularity containers, networking, users/groups, POSIX permissions, and NVIDIA GPU configuration and troubleshooting.
Scientific software: Install requested software and troubleshoot user-reported issues.
Identity, Authentication, and Enterprise Integration (SSO)
Fundamentals of enterprise SSO, AD/LDAP, and identity mapping, including their common authentication failure modes.
Storage and Data Access (Shared FS + S3/IAM Basics)
AWS shared filesystem fundamentals (EFS/FSx/EBS): Mount and access troubleshooting (stale mounts, performance, permission inconsistencies).
AWS S3 + IAM: Understanding of bucket/object permissions, roles, and basic policy evaluation.
Additional Preferred Experience
Familiarity with bioinformatics workflows (common file types, workflow patterns / workflow managers).
Configuration management and collaboration tools: Git, Ansible.
Infrastructure-as-code familiarity: Terraform.
What Takeda Can Offer You
- Takeda is certified as a Top Employer, not only in India, but also globally. No investment we make pays greater dividends than taking good care of our people.
- At Takeda, you take the lead on building and shaping your own career.
- Joining the ICC in Bangalore will give you access to high-end technology, continuous training and a diverse and inclusive network of colleagues who will support your career growth.
Benefits
It is our priority to provide competitive compensation and a benefit package that bridges your personal life with your professional career. Amongst our benefits are:
- Competitive Salary + Annual Performance Bonus
- Flexible work environment, including hybrid working
- Comprehensive Healthcare Insurance Plans for self, spouse, and children
- Group Term Life Insurance and Group Accident Insurance programs
- Health & Wellness programs
- Employee Assistance Program
- 3 days of leave every year for Voluntary Service, in addition to Humanitarian Leave
- Broad variety of learning platforms
- Diversity, Equity, and Inclusion Programs
- Reimbursements – Home Internet & Mobile Phone
- Employee Referral Program
- Leaves – Paternity Leave (4 weeks), Maternity Leave (up to 26 weeks), Bereavement Leave (5 days)
About ICC in Takeda
- Takeda is leading a digital revolution. We’re not just transforming our company; we’re improving the lives of millions of patients who rely on our medicines every day.
- As an organization, we are committed to our cloud-driven business transformation and believe the ICCs are the catalysts of change for our global organization.