Location: Bissen, Luxembourg

Type: Permanent

 

HPC System Engineer – Dataservices (M/F)

 

Who we are:

LuxProvide is the national supercomputer HPC organization in charge of the planning, installation and long term operation of the same. For our recently set up organization and HPC infrastructure, we are seeking for a Network Engineer to join the core team in charge of the design, implementation and operation of the same. You will be come part of a high performance team of experts with the unique opportunity to contribute to the initial planning and installation of the whole HPC infrastructure.

 

The tasks:

  • As part of the first team on the ground, you configure, develop, integrate, test, implement, upgrade, document, monitor and support HPC storage hardware/software infrastructure
  • Contribute to architect data storage hierarchies and their software support systems
  • Ensure the deployment, configuration and day to day administration of distributed parallel file systems, network file systems, object storage, tape archival systems and support middleware
  • Ensure the administration of storage servers and block storage arrays and participate in the management of storage area networks
  • Implement and ensure health and performance monitoring of storage services and systems
  • Troubleshoot, debug and solve problems in production storage systems
  • Ensure the availability and efficient use of mission-critical storage systems
  • Develop and implement software tools and APIs for administrative storage and data management and reporting
  • Develop and implement user-facing software tools and APIs for storage and data management and reporting
  • Implement data placement policies and ensure physical security of data
  • Participate in the development and implementation of efficient security measures and disaster recovery policies and enforce backup and archival procedures
  • Periodically test the efficiency and effectiveness of backup and archival processes
  • Take part in capacity-, capability planning and in performance reviews
  • Participate in the definition of future HPC storage requirements, ensuring that user needs are represented
  • Create data usage and performance reports on storage utilization and uptime of the storage systems
  • Identify and evaluate promising new storage hardware and software technologies
  • Set up and maintain documentation of the existing storage infrastructure and administration procedures for routine and complex tasks
  • Participate, if needed, in 24/7 on-call support rotating shifts to resolve urgent issues on mission-critical systems

 

Skills and Requirements:

  • University degree in computer science, computer engineering, information technology or a closely related field is requested
  • +5 yearsin of proven working in managing HPC storage systems, preferably in a large scale HPC environment in a Site Reliability Engineering role
  • Excellent technical troubleshooting skills
  • Fluency in English and strong verbal/written communication skills is required. German and/or French is a plus

 

Benefits:

  • Work on cutting edge and exciting technologies within a team of highly motivated and passionate colleagues
  • Flat hierarchies, own area of responsibility with room for creativity, with the possibility to grow within the role
  • Homeoffice is possible
  • An excellent working atmosphere and working conditions

Apply for the position:

Please send your application to hr@lxp.lu or use our contact form: