Talent.com
System Software Engineer - RAG
System Software Engineer - RAGNVIDIA • Remote, CA, US
System Software Engineer - RAG

System Software Engineer - RAG

NVIDIA • Remote, CA, US
30+ days ago
Job type
  • Full-time
  • Remote
Job description

NVIDIA's technology is at the heart of the AI revolution, touching people across the planet by powering everything from self-driving cars, robotics, co-pilots and more. Join us at the forefront of technological advancement in intelligent assistants and information retrieval. ​What Is Retrieval-Augmented Generation, aka RAG? Retrieval-augmented generation (RAG) is a technique for enhancing the accuracy and reliability of generative AI models with facts fetched from external sources.

NVIDIA is looking for a System Software Engineer - RAG to develop pipelines for indexing and querying multi-modal content. We are looking for someone with a passion for working with the world's most complicated problems in Generative AI, LLM, MLLM, and RAG spaces using our innovative hardware and software platforms. You will develop tools for building powerful, flexible, multi-modal retrievers and agents driven by Large Language Models(LLM) thereby improving the experience of millions of customers. If you're creative & passionate about solving real world conversational AI problems, come join us.

What You'll Be Doing:

  • Develop and optimize Python-based data processing frameworks, ensuring efficient handling of large datasets on GPU-accelerated environments, vital for LLM training.

  • Contribute to the design and implementation of RAPIDS and other GPU-accelerated libraries, focusing on seamless integration and performance enhancement in the context of LLM training data preparation and RAG pipelines.

  • Lead development and iterative optimization of components for RAG pipelines, ensuring they demonstrate GPU acceleration & the best performing models for improved TCO.

  • Collaborate with teams of LLM & ML researchers in the development of full-stack, GPU-accelerated data preparation pipelines for multimodal models Implement benchmarking, profiling, and optimization of innovative algorithms in Python in various system architectures, specifically targeting LLM applications.

  • Work closely with complementary teams to understand requirements, build & evaluate POCs, and develop roadmaps for production level tools and library features within the growing LLM ecosystem.

  • Build amazing products to improve employee productivity using Gen-AI & Co-pilot experiences!

  • Collaborate with your peers to craft, develop, test, and maintain integrated applications and features.

  • Develop integrated systems enabling unified experience across applications and driving insights for end-to-end user experience.

  • Help build and maintain our Continuous Delivery pipeline with the goal of moving changes to production faster and safer, while ensuring key operational standards.

  • Provide peer reviews to other specialists including feedback on performance, scalability, and correctness.

  • Actively contribute to the adoption of frameworks, standards, and new technologies

What We Need To See:

  • Bachelor’s or Master’s Degree program in Computer Science, Computer Engineering, or a related field (or equivalent experience).

  • 6+ years of demonstrated experience in a similar or related role

  • Python programming expertise with Deep Learning (DL) frameworks such as PyTorch.

  • Experience delivering software in a cloud context and is familiar with the patterns and process of handling cloud infrastructure

  • Knowledge of MLOps technologies such as Docker-Compose, Containers, Kubernetes, data center deployments etc.

  • Excellent in-depth hands-on understanding of NLP, LLM, MLLM, Generative AI , and RAG workflows

  • Self-starter with a passion for growth, enthusiasm for continuous learning and sharing findings across the team

  • Extremely motivated, highly passionate, and curious about new technologies.

  • Outstanding communication skills for distilling sophisticated topics down to understandable, impactful conclusions as well as the ability to work successfully with multi-functional teams, principals, and architects. Coordinates optimally across organizational boundaries and geographies.

If you are passionate about technology, have a proven track record in system software engineering, and are eager to make a significant impact in the industry, we would love to hear from you. Join us at NVIDIA and help us craft the future of visual computing!

The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and . NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Create a job alert for this search

System Software Engineer - RAG • Remote, CA, US

Similar jobs

Senior Embedded Software Engineer

RaytheonCA, United States
Full-time

CA601: Goleta (EW) Bldg H01 6380 Hollister Avenue Building H01, Goleta, CA, 93117 USA.Person, or Immigration Status Requirements:.The ability to obtain and maintain a U.At Raytheon, the foundation ...Show more

 • Promoted

Senior Embedded Real-time Software Engineer

RaytheonCA, United States
Full-time

US-AZ-TUCSON-805 ~ 1151 E Hermans Rd ~ BLDG 805.Person, or Immigration Status Requirements:.Active and existing security clearance required on day 1.At Raytheon, the foundation of everything we do ...Show more

 • Promoted

Remote-Senior Software Engineer-Intuit

S M Software Solutions IncPiedra, CA, United States
Remote
Full-time

Job description :We are looking for a Senior Software Engineer specializing in Retrieval-Augmented Generation (RAG) systems, with experience in large language models (LLMs), vector databases, and c...Show more

 • Promoted

Principal Systems Engineer P4

RaytheonCA, United States
Full-time

US-AZ-TUCSON-9020 ~ 9020 S Rita Rd ~ BLDG 9020.Person, or Immigration Status Requirements:.Active and existing security clearance required on day 1.At Raytheon, the foundation of everything we do i...Show more

 • Promoted

Senior Software Engineer Oracle EPM Cloud

KKTechnologies LLCCA, United States
Full-time
Quick Apply

Job role: Senior Software Engineer Oracle EPM Cloud (W2 / 1099 Only) Locations: Alpharetta, GA | Oakland, CA | Rancho Cordova, CA Note: Local Candidates Only We are looking for a highly skilled Sen...Show more

Software Engineer (8+ Years Experience)

TekWissen LLCCalifornia, CA, United States
Temporary
Quick Apply

MsoNoSpacing"> Overview: TekWissen is a global workforce management provider headquartered in Ann Arbor, Michigan that offers strategic talent solutions to our clients world-wi...Show more

Senior Principal Software Engineer with Test Equipment

RaytheonCA, United States
Temporary

US-AZ-TUCSON-801 ~ 1151 E Hermans Rd ~ BLDG 801 (External Site).Person, or Immigration Status Requirements:.At Raytheon, the foundation of everything we do is rooted in our values and a higher call...Show more

 • Promoted

RTL Engineer, Networking ASIC

Vortexlink, Inc.CA, US
Full-time

RTL Engineer, Networking ASIC Full Time opportunity in Saratoga, CA We are seeking experienced RTL designers to help define and implement our industry-leading Networking ASIC’s.If you're a highly m...Show more

 • Promoted

Sr. Full stack Engineer - Remote

Two95 International Inc.CA, US
Remote
Full-time
Quick Apply

Create clean, maintainable, and scalable, and well-tested code.Create elegant web-based user interfaces and reporting dashboards.Work with other team members to devise the best possible technical s...Show more

Backend Engineer (Elixir)

Business Needs IncCA, United States
Full-time +2
Quick Apply

Backend Engineer (Elixir) at Chromatic Remote US or Canada Salary: $167K - $218K + Equity .Show more

MES Solumina Engineer

Vertex Elite LLCCA, United States
Full-time +2
Quick Apply

Role: MES Solumina Engineer (Maintenance & Support) Location: Irvine, CA - 100% onsite Duration: 12+ Months Contract / Full Time.At least 7+ years of strong hands-on experience with Solumina OR Sol...Show more

Full time Software Engineers in San Diego, CA

Maania Consultancy ServicesCalifornia, CA, US
Full-time
Quick Apply

Our client is looking for a Software Engineers (Python, Node.If you're interested please share your updated resume along with your yearly expected salary Role:.Software Engineers (Python, Node.C) L...Show more

Full Stack Software Engineer (Java/Kotlin, Angular) - Remote

DivIHN Integration IncCA, United States
Remote
Full-time
Quick Apply

For further inquiries about this opportunity, please contact our Talent Specialist, Ragu, at (630) 847 0953 Title: Full Stack Software Engineer (Java/Kotlin, Angular) - Remote Duration: 12 Months w...Show more

Engineer I

RosendinREI Headquarters, CA
Full-time
Quick Apply

Whether you're a recent grad or a seasoned professional, you can experience meaningful career growth at Rosendin.Enjoy a true sense of ownership as you work with a proven industry leader on some of...Show more

Restaurant Systems Manager

PaneraCA, US
Full-time

Flynn Group was started in 1999 as the owner and operator of eight Applebee’s in Washington State.Since then the company has grown at over 30% a year, added five additional leading brands in Taco B...Show more

Nutanix Engineer - California

ITProposalCalifornia, USA
Full-time
Quick Apply

Nutanix Engineer/ Networking IT Support Engineer.We are hiring a Nutanix Engineer/ Networking IT Support Engineer to support enterprise infrastructure environments across Mexico.This role requires ...Show more

Senior Software Engineer with Test Equipment - 4th Shift

RaytheonCA, United States
Temporary

US-AZ-TUCSON-801 ~ 1151 E Hermans Rd ~ BLDG 801 (External Site).Person, or Immigration Status Requirements:.At Raytheon, the foundation of everything we do is rooted in our values and a higher call...Show more

 • Promoted

Software Engineer

Emcube Technologies IncCA, United States
Full-time
Quick Apply

Title: Software Engineer Type: Contract Location: Mountain View, CA (Hybrid) Software Engineer with 3 5 years of experience to develop scalable, high-quality applications.The role involves coding, ...Show more