Talent.com
Software Engineer, Data Acquisition
Software Engineer, Data AcquisitionOpenAI • San Francisco, CA, United States
Software Engineer, Data Acquisition

Software Engineer, Data Acquisition

OpenAI • San Francisco, CA, United States
30+ days ago
Job type
  • Full-time
Job description

Software Engineer, Data Acquisition | OpenAI

Foundations – San Francisco

Overview :

The Data Acquisition team within the Foundations organization at OpenAI is responsible for all aspects of data collection to support our model training operations. Our team manages web crawling and GPTBot services and works closely with Data Processing, Architecture, and Scaling teams. We are looking for a skilled Software Engineer to join our Data Acquisition team.

Responsibilities :

  • Own and lead engineering projects in the area of data acquisition including web crawling, data ingestion, and search.
  • Collaborate with other sub-teams, such as Data Processing, Architecture, and Scaling, to ensure smooth data flow and system operability.
  • Work closely with the legal team to handle any compliance or data privacy-related matters.
  • Develop and deploy highly scalable distributed systems capable of handling petabytes of data.
  • Architect and implement algorithms for data indexing and search capabilities.
  • Build and maintain backend services for data storage, including work with key-value databases and synchronization.
  • Deploy solutions in a Kubernetes Infrastructure-as-Code environment and perform routine system checks.
  • Conduct and analyze experiments on data to provide insights into system performance.

Qualifications :

  • BS / MS / PhD in Computer Science or a related field.
  • 4+ years of industry experience in software development.
  • Experience with large web crawlers a plus
  • Strong expertise in large stateful distributed systems and data processing.
  • Proficiency in Kubernetes, and Infrastructure-as-Code concepts.
  • Willingness and enthusiasm for trying new approaches and technologies.
  • Ability to handle multiple tasks and adapt to changing priorities.
  • Strong communication skills, both written and verbal.
  • About OpenAI

    OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

    We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

    For additional information, please see OpenAI’s affirmative action and equal employment opportunity policy statement.

    Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable law, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act. For unincorporated Los Angeles County workers : we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment : protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

    To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

    We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

    Compensation

    $325K – $405K + Offers Equity

    #J-18808-Ljbffr

    Create a job alert for this search

    Software Engineer Data • San Francisco, CA, United States

    Related jobs
    Senior Software Engineer, Data Acquisition

    Senior Software Engineer, Data Acquisition

    OpenAI • San Francisco, CA, United States
    Full-time
    Senior Software Engineer, Data Acquisition.The Data Acquisition team within the Foundations organization at OpenAI is responsible for all aspects of data collection to support our model training op...Show more
    Last updated: 30+ days ago • Promoted
    Sr. Staff Software Engineer | Data Acquisition

    Sr. Staff Software Engineer | Data Acquisition

    WEX, Inc. • San Francisco, CA, United States
    Full-time
    This is a remote position; however, the candidate must reside within 30 miles of one of the following locations : Portland, ME. Boston, MA; Chicago, IL; San Francisco Bay Area, CA; and Seattle / WA.As...Show more
    Last updated: 9 days ago • Promoted
    Staff Software Engineer, Data Curation

    Staff Software Engineer, Data Curation

    Foxglove • San Francisco, CA, United States
    Full-time
    Robotics will have a massive positive impact on the world economy and global human productivity over the coming decade.At Foxglove, we're excited for this future, and we're building powerful open s...Show more
    Last updated: 4 days ago • Promoted
    Software Engineer, Data and Telemetry, Google Beam

    Software Engineer, Data and Telemetry, Google Beam

    Google • San Francisco, CA, United States
    Full-time
    Software Engineer, Data and Telemetry, Google Beam.Be among the first 25 applicants.Get AI-powered advice on this job and more exclusive features. Applicants in San Francisco : Qualified applications...Show more
    Last updated: 4 hours ago • Promoted • New!
    Software Engineer - Data Engine

    Software Engineer - Data Engine

    Applied Intuition • Sunnyvale, CA, United States
    Full-time
    Applied Intuition is the vehicle intelligence company that accelerates the global adoption of safe, AI-driven machines.Founded in 2017 and now valued at $15 billion following its recent Series F fu...Show more
    Last updated: 2 days ago • Promoted
    Senior Software Engineer - Data Transparency

    Senior Software Engineer - Data Transparency

    The Trade Desk • San Jose, CA, United States
    Full-time
    The Trade Desk is a global technology company with a mission to create a better, more open internet for everyone through principled, intelligent advertising. Handling over 1 trillion queries per day...Show more
    Last updated: 30+ days ago • Promoted
    Senior Staff Software Engineer - Data Acquisition

    Senior Staff Software Engineer - Data Acquisition

    WEX, Inc. • San Francisco, CA, United States
    Full-time
    This is a remote position; however, the candidate must reside within 30 miles of one of the following locations : Portland, ME. Boston, MA; Chicago, IL; San Francisco Bay Area, CA; and Seattle / WA.As...Show more
    Last updated: 7 days ago • Promoted
    Software Engineer - Data Acquisition / Web Crawling

    Software Engineer - Data Acquisition / Web Crawling

    Xai • Palo Alto, CA, United States
    Full-time
    AI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...Show more
    Last updated: 30+ days ago • Promoted
    AI Incubator - Staff Software Engineer

    AI Incubator - Staff Software Engineer

    Medium • San Francisco, CA, United States
    Full-time
    At Sprinter Health, our mission is reimagining how people access care by bringing it directly to their homes.Nearly 30% of patients in the U. For many, the ER becomes their first touchpoint with the...Show more
    Last updated: 8 days ago • Promoted
    Senior Software Engineer - Data Search

    Senior Software Engineer - Data Search

    Woven by Toyota • Palo Alto, CA, United States
    Full-time
    Woven by Toyota is enabling Toyota's once-in-a-century transformation into a mobility company.Inspired by a legacy of innovating for the benefit of others, our mission is to challenge the current s...Show more
    Last updated: 2 days ago • Promoted
    Software Engineer, Data Development - USDS

    Software Engineer, Data Development - USDS

    Tik Tok • San Jose, CA, United States
    Full-time
    Team Intro The Security team is missioned to run and operate security infrastructures, platforms and technologies, as well as to support cross-functional teams to protect our users, products and in...Show more
    Last updated: 2 days ago • Promoted
    Senior Software Engineer - AI Incubator

    Senior Software Engineer - AI Incubator

    Sprinter Health, Inc. • San Francisco, CA, United States
    Full-time
    We're looking for a Senior Software Engineer with at least 5 years of experience who wants to make an impact.We want to make a difference in the lives of those falling between the cracks of the cur...Show more
    Last updated: 30+ days ago • Promoted
    Staff Software Engineer, AI Data Platform

    Staff Software Engineer, AI Data Platform

    Granica • San Francisco, CA, United States
    Full-time
    Granica is redefining how enterprises prepare and optimize data at the most fundamental layer of the AI stack—where raw information becomes usable intelligence. Our technology operates deep in the d...Show more
    Last updated: 7 days ago • Promoted
    (USA) Senior, Software Engineer- Data Venture

    (USA) Senior, Software Engineer- Data Venture

    Walmart • Sunnyvale, CA, United States
    Full-time +1
    Develop software systems and solve complex problems by leveraging state-of-the-art technology.Collaborate with and execute major cross-platform executions as a team, or independently when needed.Do...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer - Data Search

    Software Engineer - Data Search

    Woven by Toyota • Palo Alto, CA, United States
    Full-time
    Woven by Toyota is enabling Toyota's once-in-a-century transformation into a mobility company.Inspired by a legacy of innovating for the benefit of others, our mission is to challenge the current s...Show more
    Last updated: 2 days ago • Promoted
    Software Engineer

    Software Engineer

    United IT Solutions • Sunnyvale, CA, United States
    Full-time
    Location : Mountain View CA (5 days working).Write, update, and maintain Python frameworks and libraries to support data processing and integration tasks. Hands-on experience with Apache Airflow, in...Show more
    Last updated: 2 days ago • Promoted
    Staff Software Engineer - Data Cloud

    Staff Software Engineer - Data Cloud

    Rippling • San Francisco, CA, United States
    Full-time
    Rippling is the first way for businesses to manage all of their HR & IT—payroll, benefits, computers, apps, and more—in one unified workforce platform. By connecting every business system to one sou...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer - Data Architecture, TikTok US

    Software Engineer - Data Architecture, TikTok US

    Tik Tok • San Jose, CA, United States
    Full-time
    About the Team Our Recommendation Architecture Team is responsible for building up and optimizing the architecture for our recommendation system to provide the most stable and best experience for o...Show more
    Last updated: 2 days ago • Promoted