Search jobs > Mountain View, CA > Research scientist ai

Research Scientist - Applied AI/LLM

Databricks
Mountain View, California
$150K-$190K a year
Full-time

P-1131

At Databricks, we are obsessed with enabling data teams to solve the world's toughest problems. We do this by building and running the world's best data and AI infrastructure platform, so our customers can focus on the high value challenges that are central to their own missions.

Founded in 2013 by the original creators of Apache Spark™, Databricks has grown from a tiny corner office in Berkeley, California to a global organization with over 1000 employees.

Thousands of organizations, from small to Fortune 100, trust Databricks with their mission-critical workloads, making us one of the fastest growing SaaS companies in the world.

You’ll work with teams across Databricks to conduct foundational research into the feasibility and effectiveness of solutions that help customers analyze data using natural language, and then bring those solutions into our products to make data analysis easier and more approachable for all of our customers.

More broadly, our teams work on some of the hardest, most interesting problems facing the business, ranging from designing large-scale distributed AI / ML systems, to optimizing distributed GPU model serving to developing novel modeling methodologies that scale to production use cases.

The impact you will have :

  • Shape the direction of our applied ML areas and intelligence features in our products, helping customers translate unstructured text into structured code, queries and data.
  • Drive the development and deployment of state-of-the-art AI models and systems that directly impact the capabilities and performance of Databricks' products and services.
  • Architect and implement robust, scalable ML infrastructure, including data storage, processing, and model serving components, to support seamless integration of AI / ML models into production environments.
  • Develop novel data collection, fine-tuning, and pre-training strategies that achieve optimal performance on specific tasks and domains.
  • Design and implement automated ML pipelines for data preprocessing, feature engineering, model training, hyperparameter tuning, and model evaluation, enabling rapid experimentation and iteration.
  • Implement advanced model compression and optimization techniques to reduce the resource footprint of language models while preserving their performance
  • Contribute to the broader AI community by publishing research, presenting at conferences, and actively participating in open-source projects, enhancing Databricks' reputation as an industry leader.

What we look for :

  • 2+ years of machine learning engineering experience in high-velocity, high-growth companies. Alternatively, a strong background in relevant ML research in academia will be considered as an equivalent qualification.
  • Experience developing AI / ML systems at scale in production or in high-impact research environments.
  • Strong track record of working with language modeling technologies. This could include the following : Developing generative and embedding techniques, modern model architectures, fine tuning / pre-training datasets, and evaluation benchmarks.
  • Strong coding and software engineering skills, and familiarity with software engineering principles around testing, code reviews and deployment.
  • Experience deploying and scaling language models in production; deep understanding of the unique infrastructure challenges posed by training and serving LLMs.
  • Strong understanding of computer science fundamentals.
  • Prior experience with Natural Language Processing and transforming unstructured text into structured code, queries and data is a plus.
  • Contributions to well-used open-source projects.

Pay Range Transparency

Databricks is committed to fair and equitable compensation practices. The pay range(s) for this role is listed below and represents base salary range for non-commissionable roles or on-target earnings for commissionable roles.

Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location.

Based on the factors above, Databricks utilizes the full width of the range. The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above.

For more information regarding which range your location is in visit our page .

Local Pay Range$150,000 $190,000 USD

30+ days ago
Related jobs
Promoted
Amazon
Santa Clara, California

AWS AI/ML is looking for world class scientists and engineers to work on foundation models, large-scale representation learning, and distributed learning methods and systems. You will be at the heart of a growing and exciting focus area for AWS and work with other acclaimed engineers and world famou...

Amazon Development Center U.S., Inc.
Santa Clara, California

AWS AI/ML is looking for world class scientists and engineers to join its AI Research and Education group working on deploying reinforcement (RL) and machine learning (ML) methods in real-world applications, studying, and building new approaches. We are seeking an Applied Scientist II for the team. ...

Chan Zuckerberg Initiative
Redwood City, California

As a Senior/Staff Research Scientist on the AI/ML team you will develop and apply state-of-the-art methods in artificial intelligence and machine learning to solve important problems in the life sciences aligned with CZI’s mission. We are committed to fair treatment and equal access to opportunity f...

Amazon Development Center U.S., Inc.
Santa Clara, California

AWS AI Research and Engineering (AIRE) is looking for world class scientists and engineers to work on the development of autonomous AI agents. PhD in Computer Science, Electrical Engineering or similar with 3+ years of industrial AI research experience or Master's degree in Computer Science (and sim...

Netflix
Los Gatos, California

You will collaborate with researchers, data scientists, and engineers to help answer Netflix’s most complex business challenges through the analysis of rigorous consumer research. Partner with Researchers, Engineers, and Data Scientists to develop comprehensive analysis plans and generate high-quali...

Genentech, Inc
California

CS Prescient Design is seeking an exceptional Senior Machine Learning Engineer to develop our LLM and AI product and enable the next generation of foundational research in machine learning for scientific discovery. The ideal candidate should have extensive experience in leading and managing a team o...

TikTok
San Jose, California

The Applied AI (DCC) Team is part of the Monetizing Integrity team in TikTok global business solutions. ...

Amazon Development Center U.S., Inc.
Santa Clara, California

We are seeking an Applied Scientist for the Human-in-loop ML/AI Services team at AWS. Are you interested in generative artificial intelligence (AI)? Do you have experience with Large Language Models (LLMs) or Large Vision and Language Models?. Our team's mission is to make LLMs and LVMs available ac...

ByteDance
San Jose, California

We are looking for strong researchers and engineers who have a background in generative AI and NLP, with experience in areas like LLM pretrain, interpretable LLM; LLM evaluation; responsible LLMs; LLM alignment. We conduct focused research and engineering, with the support of massive computation res...

Amazon.com Services LLC
Santa Clara, California

Invent and develop scalable methodologies to evaluate LLM outputs against metrics and guardrails. Applied Scientist to join a team of experts in the field of machine learning, and work together to break new ground in the world of healthcare to make personalized and empathetic care accessible, conven...