KEY RESPONSIBILITIES:
Emory College is actively seeking a dynamic and experienced Linux System Administrator to spearhead technical solutions for our high-performance computing (HPC) cluster platform and service.
As a crucial member of our team, you will collaborate with a group of HPC administrators and various technical teams within the college, partnering with Emory faculty to engineer and manage a cutting-edge HPC infrastructure. This role plays a pivotal part in advancing knowledge discovery, addressing complex scientific challenges, and fostering education and training in the latest technology. Is responsible for multi-platform operating systems, utilities, and related software to meet organizational needs. Is responsible for the availability, integrity, and reliability of assigned systems. Makes recommendations on system upgrades and new technologies.
MINIMUM QUALIFICATIONS:
- Seven years of Linux systems administration experience OR a bachelor's degree and five years of Linux systems administration experience, preferably in the context of managing HPC environments. Experience with configuration management tools (Ansible, Puppet) and LDAP administration are also required.
ADDITIONAL JOB DETAILS:
- Provides daily support for the HPC environment with a focus on Red Hat Enterprise, Ubuntu, and other Linux operations both on-premise and within cloud computing platforms.
- Develops and updates procedures and guidelines to install, patch, configure, customize, troubleshoot, upgrade, integrate, and maintain Linux operating systems and related software.
- Researches, analyzes and resolves problems, providing root-cause analysis for Linux operating systems.
- Proactively seeks information and utilizes analytical and creative problem-solving skills along with standard processes and technologies resulting in secure use of systems, applications, and infrastructure.
- Demonstrates quality service and accountability in the process of resolving requests, supporting daily operations, and ensuring system stability that results in accurate, timely, and efficient solutions and data as evidenced by meeting customer needs.
- Learning and keeping current with HPC technologies, such as backups, job-scheduling, and parallel file system management.
- Management of physical hardware in on-premise datacenter.
- Requires hybrid on-site presence, lifting, and reaching into tight spaces. Ability to lift 1U servers (approx. 50 lbs) for installation and servicing required.
PREFERRED QUALIFICATIONS:
Three or more years of HPC systems experience, including:
- Extensive command-line systems administration/use.
- Building (from source code), installing, maintaining, and troubleshooting application-level Linux and scientific software. Installation of Linux operating system on a variety of hardware platforms. Use of various Linux package management systems.
- Use of HPC resource management tools (Slurm)
- Linux file system management and networking stack.
- Experience writing and debugging Python and Bash scripts for system administration. Experience with log analytic tools such as Splunk.
- Experience with applications such as R, Numpy/Pandas, and MATLAB preferred. Experience with Virtualization software use in a Linux environment.
- Excellent interpersonal, oral, and written communication skills.
- Experience with Infiniband networking.
- Experience with Virtualization software use in a Linux environment.
- Excellent interpersonal, oral and written communication skills.
NOTE: This role will be granted the opportunity to work from home regularly but must be able to commute to Emory University on a flexible weekly schedule based upon business needs. Schedule is based on agreed upon guidelines of department. This role requires residency in the state of GA. Emory reserves the right to change remote work status with notice to employee.