Job Description
We are seeking a visionary Senior AI Infrastructure Engineer to join our elite team at Nexus Future Systems. As we execute on our 2026 roadmap, we need an architect who can build the robust, scalable, and efficient systems required to support next-generation artificial intelligence models.
In this role, you will bridge the gap between cutting-edge machine learning research and production-grade engineering. You will be responsible for designing the backbone of our AI operations, ensuring high availability, and optimizing resource utilization to meet the demands of 2026 and beyond.
Why Join Us?
- Work on projects that define the future of technology.
- Competitive salary and equity package.
- State-of-the-art development environment.
Responsibilities
- Architect and deploy scalable AI infrastructure using Kubernetes and containerization technologies.
- Optimize GPU utilization and reduce model inference latency for large-scale language models.
- Implement CI/CD pipelines for machine learning model deployment and monitoring.
- Collaborate with data scientists to translate research prototypes into production-ready systems.
- Ensure high availability and maintain disaster recovery protocols for all cloud-based AI workloads.
- Stay ahead of emerging technologies (e.g., edge computing, quantum-ready architectures) to future-proof our stack.
Qualifications
- Bachelor’s degree in Computer Science, Engineering, or a related field; Master’s degree preferred.
- 5+ years of experience in DevOps, Systems Engineering, or Machine Learning Operations (MLOps).
- Expert proficiency in Python, C++, and CUDA.
- Deep understanding of cloud platforms (AWS, GCP, or Azure) and container orchestration.
- Strong scripting skills in Bash or PowerShell.
- Experience with Prometheus, Grafana, and distributed tracing tools.