Job Description
Are you ready to architect the intelligence that will define the next era of technology? At Nebula AI Labs, we are pioneering the frontier of Generative AI to build systems that think, create, and solve complex problems. We are seeking a visionary Senior Generative AI Engineer to join our elite R&D team in San Francisco. In this pivotal role, you will design, train, and deploy cutting-edge Large Language Models (LLMs) that power our next-generation enterprise solutions.
We are looking for a technical expert who isn't just following trends but is setting them. If you have a deep understanding of transformer architectures, reinforcement learning from human feedback (RLHF), and a passion for ethical AI development, we want to hear from you.
Why join us?
- Work on state-of-the-art models that will influence the industry in 2026 and beyond.
- Competitive compensation and equity packages.
- Flexible remote-first culture with a premium office in downtown SF.
- Access to the latest hardware for scalable AI research.
Responsibilities
- Model Development: Lead the research and implementation of advanced generative models, including fine-tuning and distillation of large-scale foundation models.
- Pipeline Optimization: Design high-throughput, low-latency inference pipelines using distributed computing frameworks (Ray, Kubernetes, or similar).
- Data Strategy: Curate and manage high-quality training datasets, implementing robust data cleaning and augmentation strategies.
- Ethical AI: Ensure model outputs adhere to safety guidelines, reduce bias, and align with regulatory standards.
- Collaboration: Partner with product managers and software engineers to integrate AI capabilities into seamless user experiences.
- Research: Stay at the forefront of academic advancements in NLP and deep learning, publishing papers and contributing to open-source communities.
Qualifications
- Education: PhD or Master's degree in Computer Science, Artificial Intelligence, Mathematics, or a related quantitative field.
- Experience: 5+ years of professional experience in machine learning, deep learning, or natural language processing.
- Technical Skills: Expert proficiency in Python, PyTorch, and TensorFlow. Deep knowledge of transformer models (BERT, GPT, T5).
- Infrastructure: Proven experience deploying models on cloud platforms (AWS, GCP, or Azure) and optimizing for GPU/TPU utilization.
- Soft Skills: Exceptional problem-solving abilities and excellent communication skills for technical and non-technical stakeholders.