Exascale Systems Engineer (Fixed Term)
Fixed-term: The funds for this post are available for 4 years in the first instance.
The University of Cambridge Open Zettascale Lab (COZL) is a new research group whose aim is to evolve and utilise the technologies that will make up the next generation of top-flight supercomputers. These state-of-the-art, post-exascale systems will help gain insights into humanity's global challenges, from the existential threat of climate change, through the devastating effects of diseases, to issues such as poverty, food insecurity and lack of healthcare services.
At the Cambridge Open Zettascale Lab, you will be part of a diverse team, working flexibly across the UK, to explore and exploit transformative technologies and support researchers at the forefront of computational science. This is a dynamic and rewarding environment, where exceptional people can gain early exposure to new technologies, work with globally important scientific codes and develop collaborations with leading international research groups. The Lab is housed within Research Computing Services, a well-established and leading UK National Supercomputing Center, at the University of Cambridge, one of the world's oldest and most prestigious universities. For eight centuries, our ideas and innovations have shaped the world.
An Exascale Systems Engineer will work in a team of highly technical engineers alongside academic collaborators, and external stakeholders to ensure the successful delivery of projects. The overall focus of these projects will be investigations into a variety of technologies, all of which either integrate with, or directly leverage, new high-performance hardware. These would include integrating middleware (such I/O abstraction libraries), High Performance File Systems, advanced networks and computational accelerators.
Candidates are expected to have:
- A degree in a computational science, or equivalent relevant experience.
- Some experience in the use, management, and deployment of large-scale High Performance Computing infrastructure.
- Practical and proven knowledge of Linux system administration and shell scripting.
- Practical knowledge of high-level programming or scripting languages such as Python or Bash.
- Some experience of configuration management and automation methods using technologies such as Ansible and Terraform.
- Some experience of storage platforms, parallel file systems and cloud computing platforms such as OpenStack will be advantageous.
- Successful candidates will be proactive, creative and eager to work with unfamiliar technologies
- Experience of working in a scientific environment and/or providing support to researchers will be advantageous.
More information about the role is attached in the Further Information document.
The University is supportive of hybrid working and we aim to enable as many staff as possible work in a hybrid way if they wish and where their role allows. This role allows the post holder to be office based, work in a hybrid way or remotely with only minimal office attendance.
We welcome applications from individuals who wish to be considered for part-time working or other flexible working arrangements.
We particularly welcome applications from women and /or candidates from a BME background for this vacancy as they are currently under-represented at this level in our department/institution/Faculty/School/University.
Click the 'Apply' button below to register an account with our recruitment system (if you have not already) and apply online.
Informal enquiries are welcomed and should be directed to info-cozl@hpc.cam.ac.uk
Please quote reference VC36782 on your application and in any correspondence about this vacancy.
The University actively supports equality, diversity and inclusion and encourages applications from all sections of society.
The University has a responsibility to ensure that all employees are eligible to live and work in the UK.