Talent.com
No longer accepting applications
Cloud Site Reliability Engineer

Cloud Site Reliability Engineer

Ford Motor CompanyBoston, MA, United States
3 days ago
Job type
  • Full-time
Job description

Overview

Enterprise Technology is the engine driving the future of transportation. If you’re looking for the chance to leverage advanced technology to redefine the mobility landscape, enhance the customer experience and improve people’s lives, this is the opportunity for you.

Ford is seeking an experienced and passionate Site Reliability Engineer (SRE) to join our team in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.

If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility!

Responsibilities

  • Write, configure, and deploy code that improves service reliability for existing or new systems; set standard for others with respect to code quality.
  • Provide helpful and actionable feedback and review for code or production changes.
  • Drive repair / optimization of complex systems with consideration towards a wide range of contributing factors.
  • Lead debugging, troubleshooting, and analysis of service architecture and design.
  • Participate in on-call rotation.
  • Write documentation : design, system analysis, runbooks, playbooks. Provide design feedback and uplevel design skills of others.
  • Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
  • Work within GCP infrastructure, optimizing performance and cost, and scaling resources to meet demand.
  • Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
  • Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
  • Troubleshoot and resolve issues in our dev, test, and production environments.
  • Participate in postmortem analysis and create preventative measures for future incidents.
  • Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
  • Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
  • Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
  • Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
  • Contribute to internal knowledge bases and documentation.

Qualifications

  • Bachelor’s degree in Computer Science, Engineering, Mathematics or equivalent experience.
  • 3+ years of experience as an SRE, Software Engineer, DevOps Engineer or similar role.
  • 2+ years experience in cloud native software application development
  • Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
  • Proficient with IaC (Infrastructure as Code) like Terraform
  • Proficient with monitoring and observability tools, particularly OpenTelemetry, Dynatrace or other tools.
  • Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
  • Experience with relational and document databases.
  • Ability to debug, optimize code, and automate routine tasks.
  • Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
  • Excellent verbal and written communication skills.
  • You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!

    Benefits

    As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like : will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder…or all of the above? No matter what you choose, we offer a work life that works for you, including :

  • Immediate medical, dental, and prescription drug coverage
  • Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
  • Vehicle discount program for employees and family members, and management leases
  • Tuition assistance
  • Established and active employee resource groups
  • Paid time off for individual and team community service
  • A generous schedule of paid holidays, including the week between Christmas and New Year’s Day
  • Paid time off and the option to purchase additional vacation time.
  • For a detailed look at our benefits, click here : Benefit Summary (https : / / corporate.ford.com / content / dam / corporate / us / en-us / documents / careers / 2025-benefits-and-comp-gsr-sal-plan-2.pdf)

    This role is remote unless you are located within 50 mile radius of a Ford Hub, which you will be required to commute on site 4x a week

  • Visa Sponsorship is NOT provided for this specific role
  • Relocation assistance IS NOT provided for this specific role
  • Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.

    We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call 1-888-336-0660.

    #LI-Remote

    #LI-DS2

    Requisition ID : 48772

    #J-18808-Ljbffr

    Create a job alert for this search

    Site Reliability Engineer • Boston, MA, United States

    Related jobs
    • Promoted
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    VirtualVocationsDorchester, Massachusetts, United States
    Full-time
    A company is looking for a Principal Site Reliability Engineer.Key Responsibilities Lead project work to build and maintain platform features for reliability and cloud infrastructure Mentor serv...Show moreLast updated: 30+ days ago
    • Promoted
    Customer Reliability Engineer

    Customer Reliability Engineer

    VirtualVocationsLowell, Massachusetts, United States
    Full-time
    A company is looking for a Customer Reliability Engineer III.Key Responsibilities Manage and resolve customer technical issues via support tickets and real-time interactions Act as a liaison bet...Show moreLast updated: 30+ days ago
    • Promoted
    Senior System Reliability Analysis Engineer

    Senior System Reliability Analysis Engineer

    Draper LabsCambridge, MA, United States
    Full-time
    Draper is an independent, nonprofit research and development company headquartered in Cambridge, MA.The 2,000+ employees of Draper tackle important national challenges with a promise of delivering ...Show moreLast updated: 30+ days ago
    • Promoted
    Cloud Platform Engineer

    Cloud Platform Engineer

    VirtualVocationsDorchester, Massachusetts, United States
    Full-time
    A company is looking for a Cloud Platform Engineer.Key Responsibilities Write, debug, and optimize code for the platform infrastructure Collaborate with software engineers through various progra...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    CyberarkNewton, MA, US
    Full-time
    CyberArk (NASDAQ : CYBR), is the global leader in Identity Security.Centered on privileged access management, CyberArk provides the most comprehensive security offering for any identity – huma...Show moreLast updated: 30+ days ago
    • Promoted
    Sr. SRE, Compute Infrastructure

    Sr. SRE, Compute Infrastructure

    NxT LevelBoston, MA, US
    Full-time
    Senior Site Reliability Engineer – Compute Infrastructure.Location : Boston, MA (Hybrid – Tues–Fri Onsite | Mondays Remote). Compensation : $134,250 – $214,800 + Bonus + Equity...Show moreLast updated: 4 days ago
    • Promoted
    Cloud Engineer

    Cloud Engineer

    TA Realty LLCBoston, MA, United States
    Full-time
    Founded in 1982 and one of the largest investment managers, buying / selling of industrial real estate in the U.TA Realty LLC has acquired, invested, and / or managed over $44 billion of real estate si...Show moreLast updated: 4 days ago
    • Promoted
    Snowflake Deployment Engineer

    Snowflake Deployment Engineer

    VirtualVocationsLowell, Massachusetts, United States
    Full-time
    A company is looking for a Snowflake Deployment Engineer to design, implement, and optimize Snowflake environments for data platforms. Key Responsibilities Lead the deployment, configuration, and ...Show moreLast updated: 3 days ago
    • Promoted
    Cloud Engineer

    Cloud Engineer

    ASCENDING LLCBoston, MA, United States
    Full-time
    Remote within the Continental U.Our client, a premier national healthcare provider, is seeking a.You will work closely with Kubernetes and Cloud Architects to onboard applications into.AWS Elastic ...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer Engineer

    Site Reliability Engineer Engineer

    Coralogix, inc.Boston, MA, United States
    Full-time
    Site Reliability Engineer EngineerBoston, MA • Full-time • Senior#### About The PositionCoralogix is a modern, full-stack observability platform transforming how businesses process and understand t...Show moreLast updated: 17 days ago
    • Promoted
    Cloud Engineer

    Cloud Engineer

    RapDev.ioBoston, MA, United States
    Full-time
    We specialize in modern ITOM & DevOps ServiceNow delivery, and implementations & integrations for Datadog.Our experienced team of SREs and DevOps engineers powerfully brings together these two ecos...Show moreLast updated: 4 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    VirtualVocationsLowell, Massachusetts, United States
    Full-time
    A company is looking for a Site Reliability Engineer to join a Cloud Services team in a remote role.Key Responsibilities Serve as a cloud SME for clients, providing expertise in design, architect...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer (SRE) - Engineering Productivity

    Site Reliability Engineer (SRE) - Engineering Productivity

    Arista NetworksNashua, NH, US
    Full-time
    Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets us apart is our relentless pursuit of innovation.We...Show moreLast updated: 4 days ago
    • Promoted
    Cloud Operations Engineer

    Cloud Operations Engineer

    VirtualVocationsDorchester, Massachusetts, United States
    Full-time
    A company is looking for a Cloud Operations Engineer to optimize uptime and eliminate vulnerabilities for enterprise clients. Key Responsibilities Architect and implement monitoring alarms and log...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Cloud Systems Engineer

    Senior Cloud Systems Engineer

    VirtualVocationsDorchester, Massachusetts, United States
    Full-time
    A company is looking for a Senior Systems Engineer, Cloud Platform.Key Responsibilities Maintain system stability, security, and performance Build and manage CI / CD pipelines and deployment autom...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Coralogix, inc.Boston, MA, United States
    Full-time
    Site Reliability EngineerBoston, MA • Full-time • Senior#### About The PositionCoralogix is a modern, full-stack observability platform transforming how businesses process and understand their data...Show moreLast updated: 17 days ago
    • Promoted
    Sr. Engineer Cloud Operations

    Sr. Engineer Cloud Operations

    MAXIMUSBoston, MA, United States
    Full-time
    Cloud Engineer is a hands-on position that requires the ability to plan, design, and implement technical cloud solutions. You will help combine software and systems to develop creative engineering s...Show moreLast updated: 4 days ago
    • Promoted
    Principal Cloud Engineer

    Principal Cloud Engineer

    RaftHanscom Air Force Base, MA, United States
    Full-time
    All of the programs we support require.All work must be conducted within the continental U.Distributed Data Systems, Platforms at Scale, and Complex Application Development, with headquarters in Mc...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Developer

    Site Reliability Developer

    VirtualVocationsDorchester, Massachusetts, United States
    Full-time
    A company is looking for a Site Reliability Developer.Key Responsibilities Perform DevOps activities to support customers and engineers during release cycles and production Respond to incidents,...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    VirtualVocationsDorchester, Massachusetts, United States
    Full-time
    A company is looking for a Senior Site Reliability Engineer.Key Responsibilities Design and implement infrastructure and automation scripts for AWS deployment and management Optimize and monitor...Show moreLast updated: 30+ days ago