Job Description
Are you ready to define the landscape of Artificial Intelligence for the year 2026? Nexus Future Labs is seeking a visionary Generative AI Architect to lead our R&D initiatives in the Bay Area. We are building the next generation of autonomous systems and creative intelligence engines.
In this pivotal role, you will bridge the gap between theoretical AI research and production-grade applications. If you are passionate about pushing the boundaries of what is possible with Large Language Models (LLMs) and Multimodal systems, we want to hear from you.
Why join us?
- Work on cutting-edge LLM infrastructure.
- Competitive equity package and remote-first flexibility.
- Collaborate with world-class engineers and researchers.
Responsibilities
- Architect and deploy scalable Generative AI models tailored for the 2026 roadmap.
- Optimize model inference performance using advanced quantization and distributed computing techniques.
- Design and implement Retrieval-Augmented Generation (RAG) pipelines to enhance model accuracy.
- Conduct rigorous testing and validation of AI safety and alignment protocols.
- Collaborate with product teams to translate complex AI capabilities into user-centric features.
Qualifications
- Masterβs or PhD in Computer Science, Machine Learning, or a related quantitative field.
- Proven experience with deep learning frameworks such as PyTorch, TensorFlow, or JAX.
- Strong background in Natural Language Processing (NLP) and Large Language Model architecture.
- Experience fine-tuning models (e.g., GPT, LLaMA, Mistral) using techniques like LoRA or QLoRA.
- Proficiency in SQL, NoSQL databases, and cloud infrastructure (AWS/GCP/Azure).