Job Description
We are seeking a visionary Senior AI/ML Engineer to join our elite engineering team in San Francisco. As we race toward the future of 2026, we are building the next generation of Generative AI systems that will redefine human-machine interaction. You will be at the forefront of developing scalable models, optimizing inference pipelines, and integrating cutting-edge neural architectures into our production ecosystem.
If you are passionate about pushing the boundaries of what is possible with Large Language Models (LLMs), Computer Vision, and Reinforcement Learning, this is your opportunity to lead high-impact projects in a dynamic, high-growth environment.
Responsibilities
- Design, train, and fine-tune state-of-the-art Large Language Models (LLMs) and generative AI models using PyTorch and TensorFlow.
- Optimize model inference latency and cost-efficiency using techniques such as quantization, pruning, and distillation.
- Collaborate closely with cross-functional teams of data scientists, software engineers, and product managers to deploy AI solutions into production.
- Implement robust MLOps pipelines to ensure model monitoring, version control, and automated retraining workflows.
- Research and evaluate emerging AI architectures and frameworks to stay ahead of industry trends.
- Ensure ethical AI practices and data privacy compliance across all AI-driven products.
Qualifications
- PhD or Masterβs degree in Computer Science, Mathematics, or a related technical field.
- Minimum of 5+ years of experience in machine learning, deep learning, or natural language processing.
- Expert proficiency in Python, including experience with libraries such as PyTorch, TensorFlow, or JAX.
- Proven track record of deploying models at scale in cloud environments (AWS, GCP, or Azure).
- Strong understanding of distributed systems, GPU computing, and high-performance computing.
- Experience with MLOps tools (MLflow, Kubeflow, DVC) and model serving frameworks (TorchServe, TensorFlow Serving).