Job Description
Radical AI, Inc. is an artificial intelligence company that is accelerating scientific research & development. We are at the forefront of innovation in the field of materials R&D, a critical driver for advancing our most cutting-edge industries and shaping the future. Breaking away from the traditionally slow and costly R&D process, Radical AI leverages artificial intelligence and machine learning to pioneer generative materials science. This innovative field blends AI, engineering, and materials science, revolutionizing how materials are created and discovered. Radical AI's approach speeds up R&D and addresses global challenges, setting new benchmarks in technology and sustainability.
The opportunity
As a DevOps Engineer at Radical AI, you will be instrumental in building and maintaining a secure and efficient development and deployment pipeline for our cutting-edge AI and materials discovery platform. You will play a crucial role in streamlining our software development lifecycle and ensuring the resilience and integrity of our infrastructure and applications. Your work will be critical in enabling the rapid and secure delivery of our innovative solutions. The ideal candidate will bring strong expertise in DevOps methodologies, cloud and on-prem deployments, distributed systems, and CI / CD pipeline development.
Mission
- Design, build, and maintain robust, secure and scalable CI / CD pipelines to automate software delivery processes.
- Collaborate closely with software engineers and ML researchers to ensure applications align with DevOps best practices.
- Integrate security tools and practices into every stage of the development lifecycle, from code creation to deployment and monitoring.
- Maintain and troubleshoot our cloud GPU infrastructure, hosted on Voltage Park.
- Design and implement secure and optimized cloud (e.g., AWS, Azure, GCP) and on-prem infrastructure following security best practices and compliance standards.
- Implement and manage containerization and orchestration technologies (e.g., Docker, Kubernetes) to improve application portability and scalability.
- Develop and maintain infrastructure-as-code (IaC) using tools like Terraform or CloudFormation.
- Troubleshoot and resolve infrastructure and deployment-related issues across development, staging, and production environments.
- Implement and manage monitoring and alerting systems to proactively identify and resolve infrastructure and application issues.
- Develop and maintain security policies, procedures, and documentation.
- Promote a security-conscious culture within the engineering teams through training and knowledge sharing.
About you
- B.S. or M.S. in Computer Science, Information Security, Engineering, or a related field, or equivalent practical experience.
- Proven experience in a DevOps, DevSecOps, or Platform Engineering role.
- Experience with CI / CD tools (e.g., GitHub Actions, Jenkins, CircleCI) and integrating security checks within them.
- Hands-on experience with cloud platforms (e.g., AWS, Azure, GCP), as well as on-prem.
- Proficiency in at least one scripting language (e.g., Python, Bash) for automation tasks.
- Experience with Infrastructure-as-Code (IaC) tools such as Terraform or CloudFormation.
- Familiarity with containerization technologies (e.g., Docker, Kubernetes).
- Understanding of network security concepts (e.g., firewalls, VPNs, network segmentation, Tailscale).
- Experience with monitoring, logging, and tracing tools (Datadog, Prometheus, Grafana, Splunk, ELK, etc.).
- Solid understanding of Linux / Unix systems administration.
- Strong analytical and problem-solving skills with a security-first mindset.
- Excellent communication and collaboration skills, capable of explaining technical security concepts to both technical and non-technical audiences.
- Passion for building secure and reliable systems.
Pluses
- Exposure to machine learning lifecycles or MLOps concepts.
- Solid understanding of security principles, practices, and common security tools (e.g., vulnerability scanners, intrusion detection systems).
- Experience with security and compliance software (e.g., Drata, Apptega).
- Contributions to open-source security projects.
- Familiarity with application development in Go, Python, and / or JavaScript / TypeScript.
What we offer
A competitive compensation package that also includes the best in benefits:
- Medical, dental, and vision insurance for you and your family
- Mental health and wellness support
- Unlimited PTO and 14+ company holidays per year
- 401(k)
- Work closely with a team on the cutting edge of AI research.
- A mission: an opportunity to fundamentally change the way humanity makes progress through materials science discovery.
Salary Description
Competitive salary + Equity + Benefits; base pay offered may vary depending on job-related knowledge, skills, and experience.
Disclosure
Radical AI is committed to equal employment opportunity regardless of race, color, ancestry, national origin, religion, sex, age, sexual orientation, gender identity and expression, marital status, disability, or veteran status.