Job Description
Are you ready to architect the digital landscape of tomorrow? Nexus Future Labs is seeking a visionary Senior AI Infrastructure Engineer to lead the development of scalable, high-performance systems for our aggressive 2026 roadmap. In this pivotal role, you will bridge the gap between cutting-edge theoretical AI research and production-grade infrastructure, ensuring our platforms are resilient, efficient, and ready for the future of computing.
Join a team of world-class engineers and researchers dedicated to pushing the boundaries of Generative AI, Neural Networks, and Autonomous Systems. You will have the autonomy to define architectural standards and mentor a growing team of infrastructure specialists.
Responsibilities
- Design and deploy highly scalable distributed machine learning pipelines and inference engines capable of handling exabyte-scale data.
- Optimize GPU cluster configurations and resource allocation strategies to maximize model training throughput and minimize latency.
- Implement robust CI/CD pipelines for AI model deployment, ensuring automated testing, validation, and rollback protocols.
- Collaborate with data scientists to translate research prototypes into stable, production-ready microservices.
- Ensure the security, compliance, and data sovereignty of all AI workloads across cloud and edge environments.
- Conduct architecture reviews and drive technical decisions that align with long-term strategic goals.
Qualifications
- 7+ years of experience in software engineering, DevOps, or MLOps with a strong focus on infrastructure.
- Deep expertise in Python, Go, or Rust with proven experience in building high-performance systems.
- Hands-on experience with containerization (Docker, Kubernetes) and orchestration (EKS, GKE, AKS).
- Proficiency in major ML frameworks such as PyTorch, TensorFlow, or JAX.
- Strong background in cloud infrastructure (AWS, GCP, or Azure) and serverless architectures.
- Familiarity with GPU virtualization technologies (e.g., vGPU, CUDA) and high-performance computing (HPC) environments.
- Excellent problem-solving skills and the ability to thrive in a fast-paced, agile environment.