Senior HPC Engineer

London, United Kingdom

Job Description


Working with the Head of IT, the post holder will plan, install, and maintain the Institute's HPC systems and associated storage and network infrastructures that interface with wider IT provisions. Working alongside the Head of Bioinformatics, they will plan, provision, and maintain key software capabilities and on-demand HPC services.

In a rapidly evolving field, the post holder must adapt to the changing priorities and demands of the Institute's dynamic research environment, forecasting future strategy, training, and infrastructure requirements through awareness of users' needs and future research efforts. The post holder will lead the development and provision of Scientific Computing at the Institute and must demonstrate sound judgement and decision making in their work. This role requires close collaboration with the IT Facility, with strategy developed jointly and in line with agreed budgets.

Key Responsibilities

Technical Activities

  • Establish policies and systems to ensure that workloads placed on the HPC systems are managed, prioritised, and run to achieve optimal service levels and uptime
  • Resolve performance issues on the cluster and help to design HPC jobs
  • Install, configure, and manage Linux systems using HPC-specific software, and apply consistent security configuration standards
  • Implement and maintain management and monitoring tools, and report usage levels and service level data
  • Streamline and automate maintenance, deployment, and configuration tasks
  • Build from source, install, configure, and manage Linux applications manually and using deployment software
  • Alongside the Head of IT, assume responsibility for documenting and implementing relevant disaster recovery processes
  • Monitor likely points of failure, carry out proactive maintenance to mitigate them, and repair HPC resources by anticipating, diagnosing, and troubleshooting problems as they arise
  • Maintain relevant interfacing services and Linux infrastructure, including racking hardware and patching fibre optic and CAT6A cabling
  • Support the Head of IT in implementing ICT security policy
  • Support LMS IT staff as requested in issues and queries relating to standalone Linux systems
  • Other duties commensurate with the grade of the post as directed by the supervisor

Person Requirements

Essential
  • Providing HPC services within a client-facing role
  • Integration of heterogeneous Linux/Windows/Mac environments (e.g. Windows Active Directory)
  • Installing, configuring, and managing HPC clusters and cluster-based storage systems
  • Use of automation tooling (e.g. Salt, xCAT)
  • Configuration and maintenance of multi-queue job scheduling systems (e.g. SLURM)
  • Use of scientific software compilation and deployment systems (e.g. Spack, EasyBuild, Lmod, conda)
  • Virtualisation and containerisation (e.g. Docker, Singularity)

Apply today to be considered!

Salary: £55,000 – £59,000

Reference: JR 699

Just IT Recruitment

Job Detail

  • Job Id
    JD2979100
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type
    Full Time
  • Salary
    £59,000 per year
  • Employment Status
    Permanent
  • Job Location
    London, United Kingdom
  • Education
    Not mentioned