Job Description
Join Apex Future Systems, the pioneer in next-generation artificial intelligence infrastructure. We are building foundation models for 2026 and beyond. If you are a visionary engineer passionate about pushing the boundaries of Large Language Models (LLMs), autonomous agents, and ethical AI, we want you on our team.
We offer a competitive salary, comprehensive health benefits, and equity in a company poised to redefine the tech landscape.
Responsibilities
- Architect and deploy scalable Large Language Models (LLMs) that meet 2026 enterprise requirements.
- Design and implement Retrieval-Augmented Generation (RAG) pipelines to enhance model accuracy and context awareness.
- Collaborate with cross-functional product teams to translate complex business requirements into cutting-edge AI solutions.
- Optimize inference latency and throughput for high-volume, latency-sensitive applications.
- Ensure robust data privacy, security, and ethical compliance in all AI deployments.
- Mentor junior engineers and contribute to the technical roadmap for our future AI stack.
Qualifications
- 5+ years of experience in machine learning engineering, with a strong focus on Generative AI.
- Proficiency in Python, PyTorch, and TensorFlow.
- Deep understanding of NLP concepts, transformer architectures, and fine-tuning methodologies.
- Experience with vector databases (Pinecone, Milvus) and cloud infrastructure (AWS, GCP, or Azure).
- Strong problem-solving skills and the ability to work in a fast-paced, agile environment.
- Experience with MLOps tools (Docker, Kubernetes, MLflow).