Talent.com
Edge Systems Reliability Engineer

MedStar Health, San Francisco, CA, United States
20 hours ago
Job type
  • Full-time
Job description

About Us

At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world's largest networks that powers millions of websites and other Internet properties for customers ranging from individual bloggers to SMBs to Fortune 500 companies. Cloudflare protects and accelerates any Internet application online without adding hardware, installing software, or changing a line of code. Internet properties powered by Cloudflare all have web traffic routed through its intelligent global network, which gets smarter with every request. As a result, they see significant improvement in performance and a decrease in spam and other attacks. Cloudflare was named to Entrepreneur Magazine's Top Company Cultures list and ranked among the World's Most Innovative Companies by Fast Company.

We realize people do not fit into neat boxes. We are looking for curious and empathetic individuals who are committed to developing themselves and learning new skills, and we are ready to help you do that. We cannot complete our mission without building a diverse and inclusive team. We hire the best people based on an evaluation of their potential and support them throughout their time at Cloudflare. Come join us!

Available Locations : Bengaluru

About The Team

Infrastructure Engineering is responsible for the world's most reliable, observable, performant, and safe network ecosystem. Our customers rely on our products and systems to safely modify, troubleshoot, and release products without external impact.

Our external customers rely on us to provide seamless and predictable incident, traffic, and policy management, resulting in the fastest and safest network services in the world.

We are accountable for the overall performance of internal- and external-facing services, guiding our product teams to optimal configurations and maximum efficiency. From the moment that a packet enters the Cloudflare ecosystem, we know exactly what its expected purpose and behavior are, and we are capable of determining and exposing anomalous behavior.

The Cloudflare network makes it possible to solve challenges at massive scale and efficiency which would be impossible for almost any other organization.

What You'll Do

  • Develop Software : Design, write, and deliver software that improves Cloudflare's Edge platform
  • Work on large scale systems : Scale and evolve systems through software and automation to improve reliability and velocity
  • Maintain and manage distributed systems : Manage and be part of the on-call rotation that supports the largest distributed edge system in the world.
  • Document, Propose and Implement : Collaborate with other engineers to design and implement scalable solutions that support our growing user base.
  • Guide and mentor : Participate in the constant cycle of knowledge sharing and mentoring.
  • Optimize and Automate : Research and introduce cutting-edge technologies. Develop and maintain sustainable tools that work on an extremely large scale.
  • Open Source : Contribute to open-source projects

We are growing quickly and focused on building an extraordinary company. This is a systems reliability engineering role and a superb opportunity to join a high-performing team, support Cloudflare's mission, and help build a better Internet.

You will build services and APIs to constantly improve availability, performance and uptime.

You may be a good fit for our team if you have :

  • Up to 8 years of experience managing distributed systems
  • Proficiency in distributed Linux/Unix environments
  • Proficiency in high-level programming (e.g., Golang, Python)
  • Proficiency in configuration management (e.g., Saltstack, Chef, Puppet, Ansible)
  • Proficiency in networking protocols at Layers 3-7 of the OSI model
  • Experience in performance analysis, debugging, and troubleshooting
  • Experience in SQL databases (e.g., Postgres, MySQL)
  • Experience being part of a rotation that tends to high-priority reliability objectives
  • Experience in load balancing and reverse proxies (e.g., Nginx)
  • Familiarity with key/value stores (e.g., Redis)
  • Familiarity with internetworking and BGP
  • Excellent written and verbal communication skills
  • Strong bias for action

Bonus points if you have :

  • Experience with continuous integration and delivery (CI/CD)
  • Experience working in a 24/7/365 service environment
  • Experience with high-bandwidth transit internetworking and routing
  • Passion for tooling and automation

What Makes Cloudflare Special?

    We're not just a highly ambitious, large-scale technology company. We're a highly ambitious, large-scale technology company with a soul. Fundamental to our mission to help build a better Internet is protecting the free and open Internet.

    Project Galileo : Since 2014, we've equipped more than 2,400 journalism and civil society organizations in 111 countries, at no cost, with powerful tools to defend themselves against attacks that would otherwise censor their work. This is technology already used by Cloudflare's enterprise customers.

    Athenian Project : In 2017, we created the Athenian Project to ensure that state and local governments have the highest level of protection and reliability for free, so that their constituents have access to election information and voter registration. Since then, we've provided services to more than 425 local government election websites in 33 states.

    1.1.1.1 : We released 1.1.1.1 to help fix the foundation of the Internet by building a faster, more secure, and privacy-centric public DNS resolver. It is publicly available for everyone to use, and it is the first consumer-focused service Cloudflare has ever released. Here's the deal: we never, ever store client IP addresses. We will continue to abide by our privacy commitment and ensure that no user data is sold to advertisers or used to target consumers.

    Sound like something you'd like to be a part of? We'd love to hear from you!

    This position may require access to information protected under U.S. export control laws, including the U.S. Export Administration Regulations. Please note that any offer of employment may be conditioned on your authorization to receive software or technology controlled under these U.S. export laws without sponsorship for an export license.

    Cloudflare is proud to be an equal opportunity employer. We are committed to providing equal employment opportunity for all people and place great value in both diversity and inclusiveness. All qualified applicants will be considered for employment without regard to their, or any other person's, perceived or actual race, color, religion, sex, gender, gender identity, gender expression, sexual orientation, national origin, ancestry, citizenship, age, physical or mental disability, medical condition, family care status, or any other basis protected by law. We are an AA/Veterans/Disabled Employer.

    Cloudflare provides reasonable accommodations to qualified individuals with disabilities. Please tell us if you require a reasonable accommodation to apply for a job. Examples of reasonable accommodations include, but are not limited to, changing the application process, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment. If you require a reasonable accommodation to apply for a job, please contact us via e-mail at hr@cloudflare.com or via mail at 101 Townsend St. San Francisco, CA 94107.
