Talent.com
Data Center Test Development Architect
Data Center Test Development ArchitectNVIDIA • Santa Clara, CA, US
Data Center Test Development Architect

Data Center Test Development Architect

NVIDIA • Santa Clara, CA, US
30+ days ago
Job type
  • Full-time
Job description

We are seeking a

highly skilled and hard-working Senior Test Developer Architect to join our multifaceted Enterprise Software QA team. This role offers an outstanding opportunity to leave your mark on the design, construction, optimization and testing of large-scale infrastructure for various foundational NVIDIA unified cloud services and data center offerings. If you are a dedicated engineer with a deep understanding of cloud infrastructure and distributed systems, and you thrive in an exciting, innovative environment, this could be the flawless role for you.

What you'll be doing:

  • Engage with product engineering teams to gain a comprehensive understanding of their infrastructure use cases.

  • Provide mentorship to SWQA teams on effectively testing at scale. Develop end to end test plans that exercise all layers of SW stacks for NVIDIA cloud-based infrastructure Lead NVIDIA Data Center bring up activities from SWQA perspective Develop sophisticated tooling to automate the build and deployment of microservices and infrastructure components, improving efficiency and productivity.

  • Reduce manual labor and increase operational efficiency through automation. Supervise the infrastructure to alert on significant events, ensuring the highest level of system performance and reliability.

  • Work closely with partners to understand their infrastructure needs and to ensure our testing encompass their use cases.

What we need to see:

  • A Master's or Ph.D. in Computer Science or a related field, or equivalent experience.

  • 4+ years of hands-on experience in cluster management and related tools, including Docker Containers, Slurm, Kubernetes, and Ansible.

  • 8+ years strong experience with cloud infrastructure platforms like AWS, Azure, or Google Cloud.

  • Hands-on experience with server platform, network, storage, cluster configuration and debugging.

  • Experience with platform telemetry, datacenter node lifecycle management/support including CPU/GPU workloads Proficiency in scripting languages such as Python. Expertise in administering, operating, and configuring Kubernetes and Envoy.

  • Validated experience in Continuous Integration/Continuous Delivery (CI/CD) tools such as Gitlab and Jenkins and the GitOps model. Proficiency in various monitoring tools: Prometheus, Grafana, Cloudwatch, and Thanos.

  • Strong analytical and problem-solving skills, along with an ability to articulate what you know to others.

Ways to Stand Out from the Crowd:

A true innovator who isn't afraid to challenge the status quo and bring fresh ideas to the table. You're always looking for ways to improve existing systems and processes. Passion and curiosity about the latest technologies and trends in cloud infrastructure and distributed systems. You're not just familiar with the tools, but you understand the underlying principles and can demonstrate this knowledge to make strategic decisions. Committed to personal and professional growth. You're crafting opportunities to learn new skills and deepen your expertise.

By joining our team, you will be part of a forward-thinking company that values innovation and creativity. We offer a competitive salary and benefits package, a flexible work environment, and the opportunity to work with some of the industry leading experts. If you're ready to take your career to the next level, we'd love to hear from you.

The base salary range is 200,000 USD - 310,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and . NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Create a job alert for this search

Data Center Test Development Architect • Santa Clara, CA, US

Similar jobs

System Integration Test Lead

OSI EngineeringCupertino, CA, United States
Full-time

We are seeking a Technical Program Manager (TPM) with system-level experience to tackle new challenges and leverage your expertise in this dynamic role.This company offers a fast-faced, innovate cu...Show more

 • Promoted

Software Development Engineer in Test

Cerebras SystemsSunnyvale, CA, United States
Full-time

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs.Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programm...Show more

 • Promoted

SerDes Micro Architect

Apple Inc.Cupertino, CA, United States
Full-time

Cupertino, California, United States Hardware.Analog Mixed-Signal (AMS) Team is seeking a self‑motivated SERDES Micro Architect to drive the definition of our next generation IPs.The SERDES IPs wil...Show more

 • Promoted

Expert Development Architect SAP Office of the CTO

SAP SEPalo Alto, CA, United States
Full-time

Expert Development Architect SAP Office of the CTO.At SAP, we keep it simple: you bring your best to us, and we’ll bring out the best in you.We’re builders touching over 20 industries and % of glob...Show more

 • Promoted

Assistant Product Developer - Modular - Data Center

Cupertino ElectricSan Jose, California, United States
Full-time

Assistant Product Developer - Modular Design - Data Center.Director of Product Development.This position is eligible for the Operations Bonus Plan.Final determination of a successful candidate's st...Show more

 • Promoted

Hardware Test Engineer

Reliable RoboticsMountain View, California, United States
Permanent

We're building safety-enhancing technology for aviation that will save lives.Automated aviation systems will enable a future where air transportation is safer, more convenient and fundamentally tra...Show more

 • Promoted

Staff/Principal Engineer, SSD Tester System Architect

Micron Technology, Inc.San Jose, CA, United States
Full-time

Our vision is to transform how the world uses information to enrich life for.Micron Technology is a world leader in innovating memory and storage solutions that accelerate the transformation of inf...Show more

 • Promoted

Cloud Scale Test Engineer

Hewlett Packard EnterpriseSan Jose, CA, United States
Full-time

Get AI-powered advice on this job and more exclusive features.This role has been designed as ‘’Onsite’ with an expectation that you will primarily work from an HPE office.Hewlett Packard Enterprise...Show more

 • Promoted

Senior.NET Architect & Tech Lead

Compunnel, Inc.Pleasanton, CA, United States
Full-time

A software development company in California is seeking a Senior.NET Developer to provide technical leadership and hands-on development for enterprise applications.The role includes mentoring team ...Show more

 • Promoted

Software Development Engineer in Test, Global E-Commerce Engineer Efficiency - USDS

TikTokSan Jose, CA, United States
Full-time

About the team: Our quality assurance engineering team is responsible for keeping the e-commerce ecosystem stable, secure and intuitive for our users.We are looking for passionate and talented peop...Show more

 • Promoted

Staff Engineer, Test Development

Samsung Semiconductor Inc.San Jose, California, United States
Full-time

To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period.Advancing the World's Technol...Show more

 • Promoted

Senior Test Engineer – ATE & Wafer Sort

Microchip Technology Inc.San Jose, CA, United States
Full-time

A leading semiconductor company is seeking a Senior Engineer I - Test in San Jose, CA.You will implement ATE test solutions and support the FPGA business unit.Ideal candidates will have a strong ba...Show more

 • Promoted

(ASIC - Data Center), Design Engineer

Socionext USMilpitas, CA, United States
Full-time

System-on-Chip custom silicon solutions to global customers.The company is focused on datacenter, compute server, networking, storage, artificial intelligence, automotive and industrial automation ...Show more

 • Promoted

Distinguished System Architect - Innovation

Infineon Technologies AGSan Jose, CA, United States
Full-time

As a Distinguished System Architect - Innovatio, you are a senior Technical Leader and Innovator who owns and drives thought leadership in important depth or breadth areas relevant for Connected Se...Show more

 • Promoted

Senior Systems Test Engineer, Autonomy Behavior

NuroMountain View, CA, United States
Full-time

Nuro is a self-driving technology company on a mission to make autonomy accessible to all.Founded in 2016, Nuro is building the world’s most scalable driver, combining cutting-edge AI with automoti...Show more

 • Promoted

Senior Solution Architect -.net

Pyramid Consulting Inc.Sunnyvale, CA, United States
Full-time

Pyramid is a leading Information Technology Consulting services company headquartered in metropolitan Atlanta, GA with prime emphasis on the following service offerings:.Application Development & S...Show more

 • Promoted

Senior AI-Driven SDET for Cloud Management Platform

Palo Alto NetworksSanta Clara, CA, United States
Full-time

A leading cybersecurity company is seeking a Senior Development Engineer in Test to drive innovation and transform the Software Development Lifecycle with an AI-First mindset.This role involves des...Show more

 • Promoted

Senior Test Engineer, Photonics Systems

thrivelySanta Clara, California, US
Full-time

Thrively is partnering with an innovative, well-funded hardware startup at the intersection of.MEMS, silicon photonics, and AI-scale infrastructure.The team is developing a next-generation optical ...Show more