Job Description
We are a leading technology firm pioneering the next generation of artificial intelligence. As we look toward 2026, we are building the core infrastructure for autonomous agents and advanced Large Language Models (LLMs). We are seeking a visionary Generative AI Engineer to join our elite team and architect the future of intelligent systems.
In this role, you will bridge the gap between theoretical AI research and production-scale deployment. You will work on cutting-edge projects involving fine-tuning, retrieval-augmented generation (RAG), and the development of autonomous AI agents. If you are passionate about the roadmap for 2026 and want to shape the technology that defines the next era of computing, we want to hear from you.
Responsibilities
- Design and implement scalable LLM fine-tuning pipelines using PyTorch and Hugging Face.
- Develop and optimize Retrieval-Augmented Generation (RAG) architectures to enhance model accuracy.
- Build and deploy autonomous AI agents capable of complex reasoning and task execution.
- Collaborate with cross-functional teams to integrate AI models into real-world applications.
- Optimize inference latency and cost-efficiency for high-volume production environments.
- Research and prototype novel generative techniques to stay ahead of industry trends.
Qualifications
- 5+ years of experience in Machine Learning, Deep Learning, or a related technical field.
- Strong proficiency in Python, PyTorch, and TensorFlow.
- Experience with LLMs, Hugging Face Transformers, and model fine-tuning methodologies.
- Deep understanding of NLP concepts, including tokenization, embeddings, and attention mechanisms.
- Experience with MLOps tools (e.g., MLflow, Kubeflow) and cloud platforms (AWS, GCP, or Azure).
- Excellent problem-solving skills and the ability to work in a fast-paced, agile environment.