Systems Engineer, Metrics and Alerting


Cloudflare Inc, San Francisco, CA, United States
6 days ago
Job type
  • Full-time
Job description

About Us

At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world's largest networks that powers millions of websites and other Internet properties for customers ranging from individual bloggers to SMBs to Fortune 500 companies. Cloudflare protects and accelerates any Internet application online without adding hardware, installing software, or changing a line of code. Internet properties powered by Cloudflare all have web traffic routed through its intelligent global network, which gets smarter with every request. As a result, they see significant improvement in performance and a decrease in spam and other attacks. Cloudflare was named to Entrepreneur Magazine's Top Company Cultures list and ranked among the World's Most Innovative Companies by Fast Company.

We realize people do not fit into neat boxes. We are looking for curious and empathetic individuals who are committed to developing themselves and learning new skills, and we are ready to help you do that. We cannot complete our mission without building a diverse and inclusive team. We hire the best people based on an evaluation of their potential and support them throughout their time at Cloudflare. Come join us!

Available Locations: London or Lisbon

About the Department

Production Engineering is responsible for the world's most reliable, observable, performant, and safe network ecosystem. Our customers rely on our products and systems to safely modify, troubleshoot, and release products without external impact.

Our external customers rely on us to provide seamless and predictable incident, traffic, and policy management, resulting in the fastest and safest network services in the world.

We are accountable for the overall performance of internal and external facing services, guiding our product teams to optimal configurations and maximum efficiency. From the moment that a packet enters the Cloudflare ecosystem, we know exactly what its expected purpose and behaviour is and we are capable of determining and exposing anomalous behaviour.

The Cloudflare network makes it possible to solve challenges at massive scale and efficiency which would be impossible for almost any other organization.

About the Team

This role is for the internal Observability Team, responsible for the observability platform and stack to make our engineering teams productive. This includes (but is not limited to) areas like metrics, alerting, error tracking, logging, tracing, and more.

In this role, you can expect to:

  • Design, deliver, and operate software and a platform that progresses Cloudflare's Observability competency
  • Solve scaling bottlenecks in critical services in our Metrics & Alerting pipeline
  • Work on highly distributed and scalable systems
  • Participate in the constant cycle of knowledge sharing and mentoring
  • Participate in the global on-call rotation for the services your team owns
  • Research and introduce cutting-edge technologies
  • Contribute to open-source

We are a small, well-funded, growing team focused on building an extraordinary company. This is a software engineering and systems engineering role, and a superb opportunity to join a high-performing team that supports Cloudflare's mission to help build a better Internet.

You may be a good fit for our team if you have:

  • A Software Engineering background and proficiency in high-level programming languages (e.g., Go)
  • Proficiency in data structures and databases such as TSDBs, columnar stores, or related technologies
  • Proficiency in distributed Linux environments
  • Proficiency in designing high-scale distributed systems
  • Proficiency in Prometheus, Alertmanager, Thanos
  • Experience working in a fast, high-growth environment
  • Experience working in a 24/7/365 service environment
  • Exquisite written and verbal communication skills
  • Familiarity with internetworking, networking protocols across Layers 2-7 of the OSI model, and BGP
  • Strong bias for action

Bonus points if you have:

  • Experience with high-bandwidth transit Internetworking and routing
  • Passion for code simplicity and performance

What Makes Cloudflare Special?

    We're not just a highly ambitious, large-scale technology company. We're a highly ambitious, large-scale technology company with a soul. Fundamental to our mission to help build a better Internet is protecting the free and open Internet.

    Project Galileo: Since 2014, we've equipped more than 2,400 journalism and civil society organizations in 111 countries with powerful tools to defend themselves against attacks that would otherwise censor their work. This is technology already used by Cloudflare's enterprise customers, provided to these organizations at no cost.

    Athenian Project: In 2017, we created the Athenian Project to ensure that state and local governments have the highest level of protection and reliability for free, so that their constituents have access to election information and voter registration. Since the project began, we've provided services to more than 425 local government election websites in 33 states.

    1.1.1.1: We released 1.1.1.1 to help fix the foundation of the Internet by building a faster, more secure, and privacy-centric public DNS resolver. This is available publicly for everyone to use; it is the first consumer-focused service Cloudflare has ever released. Here's the deal: we never, ever store client IP addresses. We will continue to abide by our privacy commitment and ensure that no user data is sold to advertisers or used to target consumers.

    Sound like something you'd like to be a part of? We'd love to hear from you!

    This position may require access to information protected under U.S. export control laws, including the U.S. Export Administration Regulations. Please note that any offer of employment may be conditioned on your authorization to receive software or technology controlled under these U.S. export laws without sponsorship for an export license.

    Cloudflare is proud to be an equal opportunity employer. We are committed to providing equal employment opportunity for all people and place great value in both diversity and inclusiveness. All qualified applicants will be considered for employment without regard to their, or any other person's, perceived or actual race, color, religion, sex, gender, gender identity, gender expression, sexual orientation, national origin, ancestry, citizenship, age, physical or mental disability, medical condition, family care status, or any other basis protected by law. We are an AA/Veterans/Disabled Employer.

    Cloudflare provides reasonable accommodations to qualified individuals with disabilities. Please tell us if you require a reasonable accommodation to apply for a job. Examples of reasonable accommodations include, but are not limited to, changing the application process, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment. If you require a reasonable accommodation to apply for a job, please contact us via e-mail at hr@cloudflare.com or via mail at 101 Townsend St. San Francisco, CA 94107.
