Job Description
At 2026, we are architecting the digital infrastructure of the future. We are not just building software; we are defining the next era of human-machine collaboration through advanced Generative AI and autonomous agents. We are looking for a Senior Generative AI Engineer to join our elite engineering team in San Francisco.
In this role, you will be at the forefront of research and implementation, transforming cutting-edge academic concepts into production-ready systems that solve real-world problems. If you are passionate about Large Language Models (LLMs), vector databases, and building scalable AI systems, we want to hear from you.
Responsibilities
- Architect and deploy scalable Large Language Model (LLM) applications with a focus on latency and throughput optimization.
- Design robust Retrieval-Augmented Generation (RAG) pipelines to improve the factual accuracy of model outputs and reduce hallucination rates.
- Implement fine-tuning strategies for proprietary models to align with specific business use cases.
- Collaborate with product and design teams to translate complex AI capabilities into intuitive user experiences.
- Conduct rigorous model evaluation, A/B testing, and performance monitoring to ensure product excellence.
Qualifications
- 5+ years of experience in software engineering, with at least 2 years specifically in machine learning and deep learning.
- Strong proficiency in Python and in at least one deep learning framework (PyTorch or TensorFlow).
- Deep understanding of transformer architectures, attention mechanisms, and model quantization techniques.
- Experience with vector databases (e.g., Pinecone, Weaviate) and cloud infrastructure (AWS/GCP).
- Excellent problem-solving skills and the ability to thrive in a fast-paced, agile environment.