Job Description
We are at the forefront of the Artificial Intelligence revolution. Join Stratosphere AI as a Senior AI Research Scientist and help define the roadmap for the next generation of Generative AI models. We are seeking a visionary researcher with a deep understanding of Large Language Models (LLMs) and Neural Architecture Search to push the boundaries of what is possible in 2026 and beyond.
In this role, you will work in a collaborative, high-performance environment focused on solving complex problems in natural language understanding, multimodal generation, and efficient model scaling. You will have the autonomy to experiment with cutting-edge architectures and the resources to deploy transformative solutions.
Responsibilities
- Design, train, and optimize state-of-the-art Large Language Models (LLMs) for enterprise-grade applications.
- Conduct research in Generative AI, including prompt engineering, fine-tuning, and reinforcement learning from human feedback (RLHF).
- Collaborate with cross-functional teams of engineers, product managers, and designers to translate research into scalable products.
- Write and publish high-impact research papers to contribute to the global AI community.
- Evaluate model performance, interpretability, and safety, ensuring ethical deployment of AI systems.
- Experiment with novel architectures such as Mixture of Experts (MoE) and attention mechanisms.
Qualifications
- Ph.D. or Masterβs degree in Computer Science, Machine Learning, or a related quantitative field.
- 5+ years of experience in deep learning, specifically with Transformers, GPT, BERT, or similar architectures.
- Strong proficiency in Python and deep learning frameworks (PyTorch or TensorFlow).
- Proven track record of publishing in top-tier AI conferences (NeurIPS, ICML, ICLR, ACL).
- Experience with distributed training, model serving, and cloud infrastructure (AWS, GCP, or Azure).
- Deep understanding of NLP tasks including text generation, summarization, and semantic search.