Job Description
Join the vanguard of artificial intelligence at Apex Neural Dynamics. We are seeking a visionary Senior AI Architect to spearhead the development of our flagship initiative, Project 2026. This is a rare opportunity to define the next generation of generative AI architectures and push the boundaries of what is possible in natural language processing.
In this pivotal role, you will lead a team of elite engineers and researchers to build scalable, robust, and highly efficient models. You will work in a state-of-the-art facility in the heart of San Francisco, collaborating with world-class talent to solve the most complex challenges in deep learning.
Why join us?
- Competitive compensation package and equity.
- Access to cutting-edge hardware and cloud resources.
- Flexible work environment with a focus on innovation and autonomy.
Are you ready to build the future? Apply today.
Responsibilities
- Lead the architectural design and implementation of the Project 2026 large-scale language model, ensuring scalability and fault tolerance.
- Optimize model inference latency and throughput through aggressive quantization, pruning, and kernel fusion techniques.
- Collaborate with data scientists to design novel training objectives and loss functions that improve model reasoning capabilities.
- Mentor junior engineers and researchers, fostering a culture of technical excellence and continuous learning.
- Conduct rigorous performance benchmarking and research to stay ahead of the curve in the rapidly evolving AI landscape.
- Define and enforce best practices for model deployment, monitoring, and security within our production environment.
Qualifications
- Masterβs or PhD in Computer Science, Mathematics, or a related field, with a focus on Machine Learning or Artificial Intelligence.
- Minimum of 5 years of professional experience in deep learning, NLP, or AI architecture, with at least 2 years in a senior leadership role.
- Extensive experience with deep learning frameworks such as PyTorch, TensorFlow, or JAX.
- Strong proficiency in C++ for high-performance computing and GPU optimization.
- Deep understanding of transformer architectures, attention mechanisms, and generative AI techniques.
- Proven track record of deploying large-scale ML models into production environments.