Talent.com
Senior High Performance Computing System Administrator

Senior High Performance Computing System Administrator

Icahn School of Medicine at Mount SinaiNew York, NY, United States
7 days ago
Job type
  • Full-time
Job description

Roles & Responsibilities :

The Scientific Computing and Data group at the Icahn School of Medicine at Mount Sinai partners with scientists to accelerate scientific discovery. To achieve these aims, we support a cutting-edge high-performance computing and data ecosystem along with MD / PhD-level support for researchers. The group is composed of a high-performance computing team, the research clinical data warehouse team and a research data services team.

The Senior HPC Administrator, High Performance Computational and Data Ecosystem , is responsible for a computational and data science ecosystem for researchers at Mount Sinai. This ecosystem includes high-performance computing (HPC) systems, clinical research databases, and a software development infrastructure for local and national projects. To meet Sinai’s scientific and clinical goals, the Senior Administrator has a good technical understanding for computational, data and software development systems along with a strong focus on customer service for researchers. The HPC Senior Administrator is an expert troubleshooter and productive team member and leads projects to effective and efficient completion independently under little to no supervision. This position reports to the Director for Computational & Data Ecosystem in Scientific Computing. Specific responsibilities are listed below.

Responsibilities

  • Design, deploy and maintain Scientific Computing’s computational and data science ecosystem including ~30,000 cores with high bandwidth, low latency interconnects, GPUs, large shared memory nodes, databases, scientific workflows and 30+ petabytes of storage in production, clinical data warehouse and software development environment.
  • Lead the troubleshooting, isolation and resolution of all technical issues including application, system, hardware, software, and network). Actively monitors the systems.
  • Maintains, tunes and manages computational, data, cloud technologies and workflow systems for ISMMS researchers, scientists and their external collaborators. Defines and deploys a comprehensive computational and data vision. Identifies and communicates system advantages / disadvantages and tradeoffs.
  • Designs, develops, implements system administration tasks, including hardware and software configuration, configuration management, system monitoring (including the development and maintenance of regression tests), usage reporting, system performance (file systems, scheduler, interconnect, high availability, etc.), security, networking and metrics, etc.
  • Collaborates effectively with research and hospital system IT, compliance, HIPAA, security and other departments to ensure compliance with all regulations and Sinai policies.
  • Participates in the integration of HPC resources with laboratory equipment such as sequencers, clinical and research data resources and systems, etc. Incorporate and link data and compute resources.
  • Researches, deploys and optimizes resource management and scheduling software and policies and actively monitoring. Designs, tunes, manages and upgrades parallel file systems, storage and data-oriented resources.
  • Researches, deploys and manages security infrastructure, including development of policies and procedures.
  • Maintain all necessary aspects of HPC in accordance with best practices. Develops and implements backup policies.
  • Prepares and manages budgets for hardware, software and maintenance. Participates in chargeback / fee recovery analysis and provides suggestions to make operations sustainable.
  • Assists in developing and writing system design for research proposals. Creates and provides clear documentation.
  • Works effectively and productively with other team members within the group and across Mount Sinai.
  • Performs related duties as assigned or requested.
  • Provides after hours support for critical system and production issues.
  • Answers and resolves user tickets.

Qualifications :

  • Bachelor's degree in computer science, engineering or another scientific field. Master's or PhD preferred
  • 8+ years (higher preferred) of progressive HPC system administration and operations (preferably in a Redhat / CentOS Linux administration, Batch HPC cluster environment)
  • Must be an expert troubleshooter; Must be a team player and customer focused
  • Experience with job scheduler such as LSF or Slurm and parallel file systems and storage
  • Experience with networking and security
  • Experience with configuration management systems such as xCAT, Puppet and / or Ansible
  • Experience of databases and web services
  • Experience in Infiniband, Gigabit Ethernet
  • Experience in an academic or research community environment
  • Script and programming experience
  • Experience with Cloud Computing
  • Ability to multitask effectively in a dynamic environment
  • Excellent communication skills, analytical ability, strong judgment and management skills, and the ability to work effectively as a liaison between both research and technology teams.
  • Strong written, oral, and interpersonal communication skills
  • Preferred Experience

  • Advanced degree
  • Experience with GPFS, LSF, TSM, IB and ethernet networking
  • Experience with databases and web services is highly preferred
  • Strength through Unity and Inclusion

    The Mount Sinai Health System is committed to fostering an environment where everyone can contribute to excellence. We share a common dedication to delivering outstanding patient care. When you join us, you become part of Mount Sinai’s unparalleled legacy of achievement, education, and innovation as we work together to transform healthcare. We encourage all team members to actively participate in creating a culture that ensures fair access to opportunities, promotes inclusive practices, and supports the success of every individual.

    At Mount Sinai, our leaders are committed to fostering a workplace where all employees feel valued, respected, and empowered to grow. We strive to create an environment where collaboration, fairness, and continuous learning drive positive change, improving the well-being of our staff, patients, and organization. Our leaders are expected to challenge outdated practices, promote a culture of respect, and work toward meaningful improvements that enhance patient care and workplace experiences. We are dedicated to building a supportive and welcoming environment where everyone has the opportunity to thrive and advance professionally. Explore this opportunity and be part of the next chapter in our history.

    About the Mount Sinai Health System :

    Mount Sinai Health System is one of the largest academic medical systems in the New York metro area, with more than 48,000 employees working across eight hospitals, more than 400 outpatient practices, more than 300 labs, a school of nursing, and a leading school of medicine and graduate education. Mount Sinai advances health for all people, everywhere, by taking on the most complex health care challenges of our time — discovering and applying new scientific learning and knowledge; developing safer, more effective treatments; educating the next generation of medical leaders and innovators; and supporting local communities by delivering high-quality care to all who need it. Through the integration of its hospitals, labs, and schools, Mount Sinai offers comprehensive health care solutions from birth through geriatrics, leveraging innovative approaches such as artificial intelligence and informatics while keeping patients’ medical and emotional needs at the center of all treatment. The Health System includes more than 9,000 primary and specialty care physicians; 13 joint-venture outpatient surgery centers throughout the five boroughs of New York City, Westchester, Long Island, and Florida; and more than 30 affiliated community health centers. We are consistently ranked by U.S. News & World Report's Best Hospitals, receiving high "Honor Roll" status, and are highly ranked : No. 1 in Geriatrics, top 5 in Cardiology / Heart Surgery, and top 20 in Diabetes / Endocrinology, Gastroenterology / GI Surgery, Neurology / Neurosurgery, Orthopedics, Pulmonology / Lung Surgery, Rehabilitation, and Urology. New York Eye and Ear Infirmary of Mount Sinai is ranked No. 12 in Ophthalmology. U.S. News & World Report’s “Best Children’s Hospitals” ranks Mount Sinai Kravis Children's Hospital among the country’s best in several pediatric specialties. The Icahn School of Medicine at Mount Sinai is ranked No. 11 nationwide in National Institutes of Health funding and in the 99th percentile in research dollars per investigator according to the Association of American Medical Colleges. Newsweek’s “The World’s Best Smart Hospitals” ranks The Mount Sinai Hospital as No. 1 in New York and in the top five globally, and Mount Sinai Morningside in the top 20 globally.

    Equal Opportunity Employer

    The Mount Sinai Health System is an equal opportunity employer, complying with all applicable federal civil rights laws. We do not discriminate, exclude, or treat individuals differently based on race, color, national origin, age, religion, disability, sex, sexual orientation, gender, veteran status, or any other characteristic protected by law. We are deeply committed to fostering an environment where all faculty, staff, students, trainees, patients, visitors, and the communities we serve feel respected and supported. Our goal is to create a healthcare and learning institution that actively works to remove barriers, address challenges, and promote fairness in all aspects of our organization.

    Create a job alert for this search

    System Administrator • New York, NY, United States

    Related jobs
    • Promoted
    Project Administrator

    Project Administrator

    AmpcusWhite Plains, NY, US
    Full-time
    Technology and Business consulting services.We are in search of a highly motivated candidate to join our talented Team.Job Title : Project Administrator Location(s) : White Plains, NY Project Overvie...Show moreLast updated: 30+ days ago
    • Promoted
    Manager, GPS Knowledge Systems

    Manager, GPS Knowledge Systems

    Bristol-Myers SquibbEast Brunswick, NJ, United States
    Full-time
    Those aren't words that are usually associated with a job.But working at Bristol Myers Squibb is anything but usual.Here, uniquely interesting work happens every day, in every department.From optim...Show moreLast updated: 30+ days ago
    • Promoted
    Senior SQL Performance Engineer (T-SQL) (Morristown)

    Senior SQL Performance Engineer (T-SQL) (Morristown)

    Beacon HillMorristown, NJ, US
    Part-time
    Responsible for data performance enhancements to the production server environment.Have an interest in data and business logic. Ability to provide insights into software solutions and identify failu...Show moreLast updated: 2 days ago
    • Promoted
    Microsoft Outlook System Administrator (Jamaica)

    Microsoft Outlook System Administrator (Jamaica)

    Medisys Health NetworkJamaica, NY, US
    Full-time +1
    The O365 / Azure Security Administrator position is a full-time salaried job based in Jamaica, New York.The O365 Administrator will provide support and management of M365 and Microsoft Azure platform...Show moreLast updated: 2 days ago
    • Promoted
    REMOTE CPC Denials and Escalations Analyst

    REMOTE CPC Denials and Escalations Analyst

    Allied Digestive HealthEatontown, NJ, US
    Remote
    Full-time
    Full-Time Remote Cpc Denials And Escalation Analyst.Allied Digestive Health is one of the largest integrated networks of gastroenterology care centers in the nation with over 200 providers and 60 l...Show moreLast updated: 30+ days ago
    • Promoted
    Senior High Performance Computing System Administrator

    Senior High Performance Computing System Administrator

    Icahn School of Medicine at Mount SinaiNew York, NY, US
    Full-time
    The Scientific Computing and Data group at the Icahn School of Medicine at Mount Sinai partners with scientists to accelerate scientific discovery. To achieve these aims, we support a cutting-edge h...Show moreLast updated: 6 days ago
    • Promoted
    SQL Database Administrator

    SQL Database Administrator

    Medisys Health Network, Inc.Garden City, NY, US
    Full-time
    SQL Server Database Administrator (DBA) – Job Description.We are seeking a skilled SQL Server DBA to join our IT Systems team at MediSys Health Network. This role is critical to maintaining th...Show moreLast updated: 4 days ago
    • Promoted
    Cloud System Engineer (On-site NYC)

    Cloud System Engineer (On-site NYC)

    Oakridge StaffingNew York, NY, US
    Full-time
    Our servers are deployed mainly in the cloud (AWS, Alibaba Cloud and GCP).Your role will be critical in ensuring the reliability, security, and efficiency of our infrastructure.Linux and Windows se...Show moreLast updated: 6 days ago
    • Promoted
    Senior High Performance Computing System Administrator (New York)

    Senior High Performance Computing System Administrator (New York)

    Icahn School of Medicine at Mount SinaiNew York, NY, US
    Part-time
    The Scientific Computing and Data group at the Icahn School of Medicine at Mount Sinai partners with scientists to accelerate scientific discovery. To achieve these aims, we support a cutting-edge h...Show moreLast updated: 2 days ago
    • Promoted
    Senior Manager, GPS Knowledge Systems

    Senior Manager, GPS Knowledge Systems

    Bristol-Myers SquibbHighland Park, NJ, United States
    Full-time
    Those aren't words that are usually associated with a job.But working at Bristol Myers Squibb is anything but usual.Here, uniquely interesting work happens every day, in every department.From optim...Show moreLast updated: 30+ days ago
    Senior Systems Administrator - Cloud

    Senior Systems Administrator - Cloud

    Aspire LiveNew York, NY, US
    Full-time
    Quick Apply
    The Senior Systems Administrator is responsible for the upkeep, configuration, and reliable operation of our computer systems and infrastructure. This hybrid role (3 days in-office) supports systems...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Senior Manager DevOps

    Senior Manager DevOps

    Bristol-Myers SquibbEast Brunswick, NJ, United States
    Full-time
    Those aren't words that are usually associated with a job.But working at Bristol Myers Squibb is anything but usual.Here, uniquely interesting work happens every day, in every department.From optim...Show moreLast updated: 6 hours ago
    Senior Systems Administrator

    Senior Systems Administrator

    Lincoln ITHicksville, NY, US
    Full-time
    Quick Apply
    Lincoln IT is a leading provider of innovative IT solutions, specializing in enterprise-level infrastructure and cloud computing services. We are committed to delivering cutting-edge technology solu...Show moreLast updated: 30+ days ago
    • Promoted
    Senior SQL Performance Engineer (T-SQL)

    Senior SQL Performance Engineer (T-SQL)

    Beacon Hill07961, NJ, US
    Full-time
    Responsible for data performance enhancements to the production server environment.Have an interest in data and business logic. Ability to provide insights into software solutions and identify failu...Show moreLast updated: 4 days ago
    • Promoted
    Microsoft Outlook System Administrator

    Microsoft Outlook System Administrator

    Medisys Health NetworkJamaica, NY, US
    Full-time
    The O365 / Azure Security Administrator position is a full-time salaried job based in Jamaica, New York.The O365 Administrator will provide support and management of M365 and Microsoft Azure platform...Show moreLast updated: 4 days ago
    • Promoted
    Power Platform Developer (New Brunswick)

    Power Platform Developer (New Brunswick)

    BrooksourceNew Brunswick, NJ, US
    Part-time
    Senior Power Platform & D365 Developer.New Brunswick, NJ (Hybrid 3 days onsite).You must have Power Platforms and D365 experience to be considered!. Our Fortune 100 healthcare client is seeking a.S...Show moreLast updated: 2 days ago
    • Promoted
    Power Platform Developer

    Power Platform Developer

    BrooksourceNew Brunswick, NJ, US
    Full-time
    Senior Power Platform & D365 Developer.New Brunswick, NJ (Hybrid – 3 days onsite).You must have Power Platforms and D365 experience to be considered!. Our Fortune 100 healthcare client is ...Show moreLast updated: 6 days ago
    • Promoted
    Cloud System Engineer (On-site NYC) (New York)

    Cloud System Engineer (On-site NYC) (New York)

    Oakridge StaffingNew York, NY, US
    Part-time
    Our servers are deployed mainly in the cloud (AWS, Alibaba Cloud and GCP).Your role will be critical in ensuring the reliability, security, and efficiency of our infrastructure.Linux and Windows se...Show moreLast updated: 2 days ago
    • Promoted
    SQL Database Administrator (Garden City)

    SQL Database Administrator (Garden City)

    Medisys Health Network, Inc.Garden City, NY, US
    Full-time +1
    SQL Server Database Administrator (DBA) Job Description.We are seeking a skilled SQL Server DBA to join our IT Systems team at MediSys Health Network. This role is critical to maintaining the perfo...Show moreLast updated: 2 days ago
    • Promoted
    IT Audit Engineer (No Sponsorship / No Remote) (Stamford)

    IT Audit Engineer (No Sponsorship / No Remote) (Stamford)

    Town Fair TireStamford, CT, US
    Remote
    Part-time
    We are a premier retailer known for our commitment to quality, customer service, and innovation.As we prepare to transition into a publicly-traded company within the next 612 months, were building ...Show moreLast updated: 2 days ago