Home Job Details
S
Information Technology 🏒 Full Time ⭐️ Verified

Senior AI Research Scientist (Generative AI & LLMs)

Stratosphere AI
San Francisco
Estimated Salary
USD 180.000 – USD 250.000
Live Update
14 Mei 2026
Deadline
14 Mei 2027

Job Description

We are at the forefront of the Artificial Intelligence revolution. Join Stratosphere AI as a Senior AI Research Scientist and help define the roadmap for the next generation of Generative AI models. We are seeking a visionary researcher with a deep understanding of Large Language Models (LLMs) and Neural Architecture Search to push the boundaries of what is possible in 2026 and beyond.

In this role, you will work in a collaborative, high-performance environment focused on solving complex problems in natural language understanding, multimodal generation, and efficient model scaling. You will have the autonomy to experiment with cutting-edge architectures and the resources to deploy transformative solutions.

Responsibilities

  • Design, train, and optimize state-of-the-art Large Language Models (LLMs) for enterprise-grade applications.
  • Conduct research in Generative AI, including prompt engineering, fine-tuning, and reinforcement learning from human feedback (RLHF).
  • Collaborate with cross-functional teams of engineers, product managers, and designers to translate research into scalable products.
  • Write and publish high-impact research papers to contribute to the global AI community.
  • Evaluate model performance, interpretability, and safety, ensuring ethical deployment of AI systems.
  • Experiment with novel architectures such as Mixture of Experts (MoE) and attention mechanisms.

Qualifications

  • Ph.D. or Master’s degree in Computer Science, Machine Learning, or a related quantitative field.
  • 5+ years of experience in deep learning, specifically with Transformers, GPT, BERT, or similar architectures.
  • Strong proficiency in Python and deep learning frameworks (PyTorch or TensorFlow).
  • Proven track record of publishing in top-tier AI conferences (NeurIPS, ICML, ICLR, ACL).
  • Experience with distributed training, model serving, and cloud infrastructure (AWS, GCP, or Azure).
  • Deep understanding of NLP tasks including text generation, summarization, and semantic search.

Required Skills

Python PyTorch TensorFlow Natural Language Processing (NLP) Large Language Models (LLMs) Generative AI Hugging Face CUDA AWS Distributed Computing Deep Learning

Ready to Take This Challenge?

Make sure your resume is ready. Submit your application now before the deadline.

Apply Now

Related Jobs

Similar job recommendations for you

View All