Job Details
Opportunity
The Director of Compute & Machine Learning (ML) Engineering provides critical leadership within our technical division. The role demands the managerial experience to guide and mentor a team of highly skilled technical professionals.
The ideal candidate has proven experience in leading teams, implementing high-performance computing, and creating and maintaining ML infrastructure, cluster management, MLOps, DevOps, and container orchestration systems. This role is responsible for participating in setting the strategic direction, leading engineering teams, defining processes, collaborating across the organisation, and, critically, leading by example with technical acumen in cloud/hybrid-based compute platforms, omics pipelines, and ML/DS workflows.
We are looking for someone with experience training models that require multiple GPUs, or building systems that enable multi-node, multi-GPU training.
Experience with Kubernetes is a plus, including designing and deploying k8s clusters using either Kubeflow or Argo Workflows.