Job Description
Join the Frontier of Intelligence.
We are Nexus Future Systems, a pioneering AI research lab dedicated to defining the trajectory of artificial intelligence for the next decade. We are looking for a visionary Senior AI Architect to lead our breakthrough projects in Generative AI and Large Language Models (LLMs). If you are passionate about building systems that think, create, and understand, we want to hear from you.
The Role:
In this high-impact position, you will architect and deploy state-of-the-art generative models. You will work directly with top-tier researchers and engineers to push the boundaries of what is possible in NLP, computer vision, and multimodal reasoning.
Key Responsibilities:
- Design and implement scalable, distributed machine learning pipelines for training large language models (LLMs) and generative adversarial networks (GANs).
- Optimize model inference latency and accuracy, ensuring real-time performance in production environments.
- Lead the technical strategy for integrating retrieval-augmented generation (RAG) systems into enterprise applications.
- Collaborate with data scientists to curate high-quality datasets and fine-tune foundation models for niche domains.
- Establish best practices for MLOps, including automated testing, versioning, and deployment (CI/CD).
- Mentor junior engineers and researchers, fostering a culture of innovation and technical excellence.
- Stay at the forefront of AI research, evaluating new architectures like Transformer variants and diffusion models.
Qualifications:
- PhD or Master’s degree in Computer Science, Mathematics, or a related field.
- 5+ years of experience in machine learning engineering, specifically with deep learning frameworks (PyTorch, TensorFlow, JAX).
- Proven expertise in training and fine-tuning LLMs (e.g., GPT, Llama, Mistral) and generative models.
- Strong proficiency in distributed computing systems (Kubernetes, Docker) and cloud platforms (AWS, GCP, Azure).
- Deep understanding of optimization techniques (quantization, pruning, distillation) and hardware acceleration (TPUs, GPUs).
- Experience with semantic search, vector databases (Pinecone, Milvus), and RAG architectures.
- Excellent communication skills and the ability to translate complex technical concepts for diverse stakeholders.
Why Join Us?
At Nexus Future Systems, we offer a competitive salary, equity packages, and the opportunity to work on projects that will shape the future of technology. Our culture is built on curiosity, collaboration, and a relentless pursuit of excellence.
Responsibilities
- Design and implement scalable, distributed machine learning pipelines for training large language models (LLMs) and generative adversarial networks (GANs).
- Optimize model inference latency and accuracy, ensuring real-time performance in production environments.
- Lead the technical strategy for integrating retrieval-augmented generation (RAG) systems into enterprise applications.
- Collaborate with data scientists to curate high-quality datasets and fine-tune foundation models for niche domains.
- Establish best practices for MLOps, including automated testing, versioning, and deployment (CI/CD).
- Mentor junior engineers and researchers, fostering a culture of innovation and technical excellence.
- Stay at the forefront of AI research, evaluating new architectures like Transformer variants and diffusion models.
Qualifications
- PhD or Master’s degree in Computer Science, Mathematics, or a related field.
- 5+ years of experience in machine learning engineering, specifically with deep learning frameworks (PyTorch, TensorFlow, JAX).
- Proven expertise in training and fine-tuning LLMs (e.g., GPT, Llama, Mistral) and generative models.
- Strong proficiency in distributed computing systems (Kubernetes, Docker) and cloud platforms (AWS, GCP, Azure).
- Deep understanding of optimization techniques (quantization, pruning, distillation) and hardware acceleration (TPUs, GPUs).
- Experience with semantic search, vector databases (Pinecone, Milvus), and RAG architectures.
- Excellent communication skills and the ability to translate complex technical concepts for diverse stakeholders.