Talent.com
System Software Engineer - RAG
System Software Engineer - RAGNVIDIA • Remote, CA, US
System Software Engineer - RAG

System Software Engineer - RAG

NVIDIA • Remote, CA, US
30+ days ago
Job type
  • Full-time
  • Remote
Job description

NVIDIA's technology is at the heart of the AI revolution, touching people across the planet by powering everything from self-driving cars, robotics, co-pilots and more. Join us at the forefront of technological advancement in intelligent assistants and information retrieval. ​What Is Retrieval-Augmented Generation, aka RAG? Retrieval-augmented generation (RAG) is a technique for enhancing the accuracy and reliability of generative AI models with facts fetched from external sources.

NVIDIA is looking for a System Software Engineer - RAG to develop pipelines for indexing and querying multi-modal content. We are looking for someone with a passion for working with the world's most complicated problems in Generative AI, LLM, MLLM, and RAG spaces using our innovative hardware and software platforms. You will develop tools for building powerful, flexible, multi-modal retrievers and agents driven by Large Language Models(LLM) thereby improving the experience of millions of customers. If you're creative & passionate about solving real world conversational AI problems, come join us.

What You'll Be Doing:

  • Develop and optimize Python-based data processing frameworks, ensuring efficient handling of large datasets on GPU-accelerated environments, vital for LLM training.

  • Contribute to the design and implementation of RAPIDS and other GPU-accelerated libraries, focusing on seamless integration and performance enhancement in the context of LLM training data preparation and RAG pipelines.

  • Lead development and iterative optimization of components for RAG pipelines, ensuring they demonstrate GPU acceleration & the best performing models for improved TCO.

  • Collaborate with teams of LLM & ML researchers in the development of full-stack, GPU-accelerated data preparation pipelines for multimodal models Implement benchmarking, profiling, and optimization of innovative algorithms in Python in various system architectures, specifically targeting LLM applications.

  • Work closely with complementary teams to understand requirements, build & evaluate POCs, and develop roadmaps for production level tools and library features within the growing LLM ecosystem.

  • Build amazing products to improve employee productivity using Gen-AI & Co-pilot experiences!

  • Collaborate with your peers to craft, develop, test, and maintain integrated applications and features.

  • Develop integrated systems enabling unified experience across applications and driving insights for end-to-end user experience.

  • Help build and maintain our Continuous Delivery pipeline with the goal of moving changes to production faster and safer, while ensuring key operational standards.

  • Provide peer reviews to other specialists including feedback on performance, scalability, and correctness.

  • Actively contribute to the adoption of frameworks, standards, and new technologies

What We Need To See:

  • Bachelor’s or Master’s Degree program in Computer Science, Computer Engineering, or a related field (or equivalent experience).

  • 6+ years of demonstrated experience in a similar or related role

  • Python programming expertise with Deep Learning (DL) frameworks such as PyTorch.

  • Experience delivering software in a cloud context and is familiar with the patterns and process of handling cloud infrastructure

  • Knowledge of MLOps technologies such as Docker-Compose, Containers, Kubernetes, data center deployments etc.

  • Excellent in-depth hands-on understanding of NLP, LLM, MLLM, Generative AI , and RAG workflows

  • Self-starter with a passion for growth, enthusiasm for continuous learning and sharing findings across the team

  • Extremely motivated, highly passionate, and curious about new technologies.

  • Outstanding communication skills for distilling sophisticated topics down to understandable, impactful conclusions as well as the ability to work successfully with multi-functional teams, principals, and architects. Coordinates optimally across organizational boundaries and geographies.

If you are passionate about technology, have a proven track record in system software engineering, and are eager to make a significant impact in the industry, we would love to hear from you. Join us at NVIDIA and help us craft the future of visual computing!

The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and . NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Create a job alert for this search

System Software Engineer - RAG • Remote, CA, US

Similar jobs

Senior Embedded Software Engineer

RaytheonCA, United States
Full-time

CA601: Goleta (EW) Bldg H01 6380 Hollister Avenue Building H01, Goleta, CA, 93117 USA.Person, or Immigration Status Requirements:.The ability to obtain and maintain a U.At Raytheon, the foundation ...Show more

 • Promoted

Aviation Electronics, Electrical & Computer Systems Technician

US NavyVisalia, CA, US
Full-time +1

Once an aircraft launches off a carrier, pilots depend on their jet's complex electronic systems to operate all areas of their craft and complete their mission.There is zero room for failure.That's...Show more

 • Promoted

Remote-Senior Software Engineer-Intuit

S M Software Solutions IncPiedra, CA, United States
Remote
Full-time

Job description :We are looking for a Senior Software Engineer specializing in Retrieval-Augmented Generation (RAG) systems, with experience in large language models (LLMs), vector databases, and c...Show more

 • Promoted

Principal Software Engineer [Electronic Warfare]

RaytheonCA, United States
Full-time

US-CA-GOLETA-H03 ~ 6380 Hollister Ave ~ BLDG H03.Person, or Immigration Status Requirements:.The ability to obtain and maintain a U.Active and existing security clearance required after day 1.At Ra...Show more

 • Promoted

HWIL Software Engineer - P2

RaytheonCA, United States
Temporary

US-AZ-TUCSON-805 ~ 1151 E Hermans Rd ~ BLDG 805.Person, or Immigration Status Requirements:.At Raytheon, the foundation of everything we do is rooted in our values and a higher calling – to help ou...Show more

 • Promoted

Senior Software Engineer

GOVXCA, US
Remote
Full-time
Quick Apply

The Senior Software Engineer provides hands-on software design, development, mentoring, and testing skills to complete projects.This position is a key role within the software development team as y...Show more

Business System Analyst

VDart IncCA, United States
Full-time
Quick Apply

Business System Analyst Duration - 6 Months Location - San Diego, CA ( Remote ) Job Description: ...Show more

AI Platform Engineer

Business Needs IncCa Franchise Tx Brd Brm, CA, United States
Full-time +2
Quick Apply

Senior AI Platform Engineer Design, implement, and manage scalable and resilient infrastructure on AWS.Architect and maintain Windows/Linux based environments, ensuring seamless integration with cl...Show more

Senior Mixed Signal Design Engineer

RaytheonCA, United States
Full-time

US-CA-GOLETA-B01 ~ 6825 Cortona Dr ~ BLDG B01.Person, or Immigration Status Requirements:.At Raytheon, the foundation of everything we do is rooted in our values and a higher calling – to help our ...Show more

 • Promoted

Sr. Full stack Engineer - Remote

Two95 International Inc.CA, US
Remote
Full-time
Quick Apply

Create clean, maintainable, and scalable, and well-tested code.Create elegant web-based user interfaces and reporting dashboards.Work with other team members to devise the best possible technical s...Show more

MES Solumina Engineer (Maintenance & Support) & Technical Program & MRI Technical Engineer (3)

Innovim Technology SolutionsCA, United States
Full-time +1
Quick Apply

JR-068831- MES Solumina Engineer (Maintenance & Support) Location : Irvine, CA - 100% onsite Client : Aerospace Show more

Full time Software Engineers in San Diego, CA

Maania Consultancy ServicesCalifornia, CA, US
Full-time
Quick Apply

Our client is looking for a Software Engineers (Python, Node.If you're interested please share your updated resume along with your yearly expected salary Role:.Software Engineers (Python, Node.C) L...Show more

Engineer I

RosendinREI Headquarters, CA
Full-time
Quick Apply

Whether you're a recent grad or a seasoned professional, you can experience meaningful career growth at Rosendin.Enjoy a true sense of ownership as you work with a proven industry leader on some of...Show more

Java Fullstack Engineer with AI

OpenkyberCA, United States
Full-time
Quick Apply

Interview Mode - In person interview Looking for local Required Qualification Strong experience in Kubernetes and Google Cloud Platform Strong experience in IaC , Terraform , GitHub Actions, helm E...Show more

Systems Administrator

Business Needs IncCA, United States
Full-time +2
Quick Apply

Systems Administrator Start Apr 9 End Jun 30 Required for submittal: Show more

Seeking AI Hardware Design Engineer at Santa Clara, CA

Xceed TechnologiesCA, United States
Full-time
Quick Apply

Hi Team, I hope you are doing well ! We have an excellent opportunity with one of our clients and would like to share the details with you.Please find the job description below and let me know if t...Show more

Restaurant Systems Manager

PaneraCA, US
Full-time

Flynn Group was started in 1999 as the owner and operator of eight Applebee’s in Washington State.Since then the company has grown at over 30% a year, added five additional leading brands in Taco B...Show more

Nutanix Engineer - California

ITProposalCalifornia, USA
Full-time
Quick Apply

Nutanix Engineer/ Networking IT Support Engineer.We are hiring a Nutanix Engineer/ Networking IT Support Engineer to support enterprise infrastructure environments across Mexico.This role requires ...Show more