What You Will DoThe work we do at Los Alamos National Laboratory (LANL) matters to our country and the world! The High Performance Computing (HPC) Division provides production HPC and AI cycles and tokens to the Laboratory. Our work spans the early phases of acquisition, development, and production readiness of HPC and AI platforms, to the maintenance and operation of these systems and the facilities in which they are housed. HPC Division also manages the network, parallel file systems, storage, and visualization infrastructure associated with our HPC and AI platforms. The division directly supports the Laboratory's HPC and AI user base and aids at multiple levels in the effective use of HPC and AI resources to generate science and support our national security mission. The High Performance Computing Design group (HPC-DES) has expertise across all HPC disciplines. HPC-DES is charged with deep support of other HPC Division groups in provision of HPC and AI platforms, infrastructure, and user services. The group also conducts research into HPC system software, I/O, and data analytics, and serves as the focal point for research and design of future HPC systems and services at the Laboratory. The HPC-DES Group Leader (GL) is a key member and contributor on HPC Division's leadership team. The GL is responsible for supporting the strategic direction for the group, providing oversight on R&D initiatives, working with and supporting the division's leadership, developing and maintaining strong customer and cross-organizational relations, promoting a strong safety, security, and ethics culture, and assisting the HPC Division Office with other duties as assigned.
HPC-DES Group Leader Responsibilities: Strategic Leadership: Serve as a key contributor in supporting the strategic direction for the group, providing oversight and coordination on technical initiatives to ensure resources and capabilities, and coordinating communication across organizational boundaries to promote quality customer service. Organizational Management: Manage budgeting, forecasting, and financial management; capabilities and staff development; succession planning; personnel and salary management; performance appraisals; human resources management; mentoring; coaching; building effective teams; and ensuring a safe and secure workplace. Develop and maintain customer and cross-organizational relations, promote a strong safety, security, and ethics culture, and assist senior management with other duties as assigned. Programmatic Engagement: The level of funding for line management activities in the group is one FTE shared between the Group Leader and Deputy Group Leader. The GL is expected to engage in programmatically funded activities to cover 50% of their time.
What You NeedMinimum Job Requirements:Leadership- Experience managing cross-functional R&D teams consisting of HPC Engineers and HPC Architects providing innovative solutions
- Demonstrated experience leading organizational change through strategic planning and tactical operations management
- Ability to inform a variety of stakeholders, including evolving your thinking, influencing and motivating others, and gaining acceptance in sensitive situations while fostering strong relationships
- Experience as a key contributor in the development and execution of vision, strategic plans, and goals for your group or team
Technical Excellence Leadership-level knowledge and demonstrated experience in at least two technical areas related to complex large-scale HPC and AI computing solutions, such as:
- Computing hardware and operating systems
- System administration
- System architecture and design
- Cyber security in an HPC environment
- Workload management and scheduling
- Network infrastructure and technology
- Large-scale storage systems and data management
- Data analytics
Business Management- Experience managing more than one of the following: budgets, schedules, performance standards, staffing, hiring, forecasting, mentoring, staff development, succession planning, personnel and salary management, and ensuring a safe and secure workplace for an organizational unit
- Ability to use data to build cases for necessary change, measure progress, and demonstrate success
- Experience collaborating with senior management, internal customers, stakeholders, and peer organizations to achieve alignment between institutional priorities and allocation of resources
- Demonstrated ability to work effectively with technical and non-technical staff to set and achieve organizational goals
- Experience in effective decision-making and creative problem-solving
- Experience building collaborative relationships across organizational boundaries
Education/Experience: Position requires a Bachelor's Degree from an accredited institution and 8 years related experience; or, an equivalent combination of education and experience directly related to the occupation.
Desired Qualifications:- Advanced knowledge of governmental policies and procedures, particularly those related to personnel management, budget oversight, safety and security, environment, and community relations
- Experience with large-scale simulation of complex physical processes or with the use or deployment of AI software or methodologies
Work Location: The work location for this position is hybrid and is located in Los Alamos, NM. Hybrid is defined as working partially onsite/partially offsite but within 2 hours ground commute of this location. All work locations are at the discretion of management and can change at any time with appropriate notice.
Position commitment: Regular appointment employees are required to serve a period of continuous service in their current position in order to be eligible to apply for posted jobs throughout the Laboratory. If an employee has not served the time required, they may only apply for Laboratory jobs with the documented approval of their Division Leader. The position commitment for this position is 1 year.