Talent.com
Data Center Test Development Architect
NVIDIA • Santa Clara, CA, US
30+ days ago
Job type
  • Full-time
Job description

We are seeking a highly skilled and hard-working Senior Test Developer Architect to join our multifaceted Enterprise Software QA team. This role offers an outstanding opportunity to leave your mark on the design, construction, optimization and testing of large-scale infrastructure for various foundational NVIDIA unified cloud services and data center offerings. If you are a dedicated engineer with a deep understanding of cloud infrastructure and distributed systems, and you thrive in an exciting, innovative environment, this could be the ideal role for you.

What you'll be doing:

  • Engage with product engineering teams to gain a comprehensive understanding of their infrastructure use cases.

  • Provide mentorship to SWQA teams on effectively testing at scale.

  • Develop end-to-end test plans that exercise all layers of the SW stacks for NVIDIA cloud-based infrastructure.

  • Lead NVIDIA Data Center bring-up activities from the SWQA perspective.

  • Develop sophisticated tooling to automate the build and deployment of microservices and infrastructure components, improving efficiency and productivity.

  • Reduce manual labor and increase operational efficiency through automation.

  • Monitor the infrastructure and alert on significant events, ensuring the highest level of system performance and reliability.

  • Work closely with partners to understand their infrastructure needs and to ensure our testing encompasses their use cases.

What we need to see:

  • A Master's or Ph.D. in Computer Science or a related field, or equivalent experience.

  • 4+ years of hands-on experience in cluster management and related tools, including Docker containers, Slurm, Kubernetes, and Ansible.

  • 8+ years of strong experience with cloud infrastructure platforms like AWS, Azure, or Google Cloud.

  • Hands-on experience with server platform, network, storage, cluster configuration and debugging.

  • Experience with platform telemetry and datacenter node lifecycle management/support, including CPU/GPU workloads.

  • Proficiency in scripting languages such as Python.

  • Expertise in administering, operating, and configuring Kubernetes and Envoy.

  • Validated experience with Continuous Integration/Continuous Delivery (CI/CD) tools such as GitLab and Jenkins, and with the GitOps model.

  • Proficiency in various monitoring tools: Prometheus, Grafana, CloudWatch, and Thanos.

  • Strong analytical and problem-solving skills, along with an ability to articulate what you know to others.

Ways to Stand Out from the Crowd:

  • A true innovator who isn't afraid to challenge the status quo and bring fresh ideas to the table. You're always looking for ways to improve existing systems and processes.

  • Passion and curiosity about the latest technologies and trends in cloud infrastructure and distributed systems. You're not just familiar with the tools; you understand the underlying principles and can apply this knowledge to make strategic decisions.

  • Committed to personal and professional growth. You're always seeking opportunities to learn new skills and deepen your expertise.

By joining our team, you will be part of a forward-thinking company that values innovation and creativity. We offer a competitive salary and benefits package, a flexible work environment, and the opportunity to work with some of the industry's leading experts. If you're ready to take your career to the next level, we'd love to hear from you.

The base salary range is 200,000 USD - 310,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
