Join a cutting-edge and well-funded hardware startup in Silicon Valley as an Deep Learning and Large Language Model Performance Architect. Our mission is to reimagine silicon and create Risc-V based
Accelerated
computing platforms that will transform the industry. You will have the opportunity to work with some of the most talented and passionate engineers in the world to create designs that push the envelope on performance, energy efficiency and scalability. We offer a fun, creative and flexible work environment, with a shared vision to build products to change the world.
Job Responsibility
- Workload Analysis - Analyzing the performance of important workloads, tuning our current software, and proposing improvements for future software.
- Performance modeling and analysis - develop analytical model for target systems and analyze the performance bottleneck. make recommendations to the implementation teams. Working with cross-collaborative teams of deep learning software engineers and hardware architects to develop innovative solutions. Adapting to the constantly evolving AI industry by being agile and excited to contribute across the codebase.
- Pre-silicon and post-silicon performance validation
Qualification
MS or PhD in relevant discipline (CS, EE, Math) or equivalent experience with 5+ years working experiencesIn-depth knowledge of deep learning models or large language modelsStrong background in computer architecture or AI software stack / compilersStrong C / C++ programming and hardware modeling skillsStrong problem solving and analytical thinking skillsPerformance modeling and analysis background a plusGPU programming experience (CUDA) a plusLLVM / MLIR development experience a plusGood communication and organizational skillsJ-18808-Ljbffr