Talent.com
Software Engineer, Distributed Data Systems (US)
Software Engineer, Distributed Data Systems (US)Onehouse • Sunnyvale, CA, US
Software Engineer, Distributed Data Systems (US)

Software Engineer, Distributed Data Systems (US)

Onehouse • Sunnyvale, CA, US
1 day ago
Job type
  • Full-time
Job description

Job Description

Job Description

About Onehouse

Onehouse is a mission-driven company dedicated to freeing data from data platform lock-in. We deliver the industry’s most interoperable data lakehouse through a cloud-native managed service built on Apache Hudi. Onehouse enables organizations to ingest data at scale with minute-level freshness, centrally store it, and make available to any downstream query engine and use case (from traditional analytics to real-time AI / ML).

We are a team of self-driven, inspired, and seasoned builders that have created large-scale data systems and globally distributed platforms that sit at the heart of some of the largest enterprises out there including Uber, Snowflake, AWS, Linkedin, Confluent and many more. Riding off a fresh $35M Series B backed by Craft, Greylock and Addition Ventures, we're now at $68M total funding and looking for rising talent to grow with us and become future leaders of the team. Come help us build the world's best fully managed and self-optimizing data lake platform!

  • If not local to Bay Area, you must be willing to relocate within 45 days and onboard in person for one week. Relocation package provided.

The Community You Will Join

When you join Onehouse, you're joining a team of passionate professionals tackling the deeply technical challenges of building a 2-sided engineering product. Our engineering team serves as the bridge between the worlds of open source and enterprise : contributing directly to and growing Apache Hudi (already used at scale by global enterprises like Uber, Amazon, ByteDance etc) and concurrently defining a new industry category - the transactional data lake. The Data Infrastructure team is the grounding heartbeat to all of this. We live and breathe databases, building cornerstone infrastructure by working under Hudi's hood to solving incredibly complex optimization and systems problems.

The Impact You Will Drive :

  • As a foundational member of the Data Infrastructure team, you will productionize the next generation of our data tech stack by building the software and data features that actually process all of the data we ingest.
  • Accelerate our open source <>
  • enterprise flywheel by working on the guts of Apache Hudi's transactional engine and optimizing it for diverse Onehouse customer workloads.

  • Act as a SME to deepen our teams' expertise on database internals, query engines, storage and / or stream processing.
  • A Typical Day :

  • Design new concurrency control and transactional capabilities that maximize throughput for competing writers.
  • Design and implement new indexing schemes, specifically optimized for incremental data processing and analytical query performance.
  • Design systems that help scale and streamline metadata and data access from different query / compute engines.
  • Solve hard optimization problems to improve the efficiency (increase performance and lower cost) of distributed data processing algorithms over a Kubernetes cluster.
  • Leverage data from existing systems to find inefficiencies, and quickly build and validate prototypes.
  • Collaborate with other engineers to implement and deploy, safely rollout the optimized solutions in production.
  • What You Bring to the Table :

  • Strong, object-oriented design and coding skills (Java and / or C / C++ preferably on a UNIX or Linux platform).
  • Experience with inner workings of distributed (multi-tiered) systems, algorithms, and relational databases.
  • You embrace ambiguous / undefined problems with an ability to think abstractly and articulate technical challenges and solutions.
  • An ability to prioritize across feature development and tech debt with urgency and speed.
  • An ability to solve complex programming / optimization problems.
  • An ability to quickly prototype optimization solutions and analyze large / complex data.
  • Robust and clear communication skills.
  • Nice to haves (but not required) :
  • Experience working with database systems, Query Engines or Spark codebases.
  • Experience in optimization mathematics (linear programming, nonlinear optimization).
  • Existing publications of optimizing large-scale data systems in top-tier distributed system conferences.
  • PhD degree with 2+ years industry experience in solving and delivering high-impact optimization projects.
  • How We'll Take Care of You

  • Competitive Compensation; the estimated base salary range for this role is $215,000 - $250,000
  • Equity Compensation; our success is your success with eligible participation in our company equity plan
  • Health & Well-being; we'll invest in your physical and mental well-being with up to 90% health coverage (50% for spouses / dependents) including comprehensive medical, dental & vision benefits
  • Financial Future; we'll invest in your financial well-being by making this role eligible to contribute to our company 401(k) or Roth 401(k) retirement plan
  • Location; we are a remote-friendly company (internationally distributed across N. America + India), though some roles will be subject to in-person requirements in alignment with the needs of the business
  • Generous Time Off; unlimited PTO (mandatory 1 week / year minimum), uncapped sick days and 11 paid company holidays
  • Company Camaraderie; Annual company offsites and Quarterly team onsites @Sunnyvale HQ
  • Food & Meal Allowance; weekly lunch stipend, in-office snacks / drinks
  • Equipment; we'll provide you with the equipment you need to be successful and a one-time $500 stipend for your initial desk setup
  • Child Bonding!; 8 weeks off for parents (birthing, non-birthing, adoptive, foster, child placement, new guardianship) - fully paid so you can focus your energy on your newest addition
  • House Values

    One Team

    Optimize for the company, your team, self - in that order. We may fight long and hard in the trenches, take care of your co-workers with empathy. We give more than we take to build the one house, that everyone dreams of being part of.

    Tough & Persevering

    We are building our company in a very large, fast-growing but highly competitive space. Life will get tough sometimes. We take hardships in the stride, be positive, focus all energy on the path forward and develop a champion's mindset to overcome odds. Always day one!

    Keep Making It Better Always

    Rome was not built in a day; If we can get 1% better each day for one year, we'll end up thirty-seven times better. This means being organized, communicating promptly, taking even small tasks seriously, tracking all small ideas, and paying it forward.

    Think Big, Act Fast

    We have tremendous scope for innovation, but we will still be judged by impact over time. Big, bold ideas still need to be strategized against priorities, broken down, set in rapid motion, measure, refine, repeat. Great execution is what separates promising companies from proven unicorns.

    Be Customer Obsessed

    Everyone has the responsibility to drive towards the best experience for the customer, be an OSS user or a paid customer. If something is broken, own it, say something, do something; never ignore. Be the change that you want to see in the company.

    Pay Range Transparency

    Onehouse is committed to fair and equitable compensation practices. Our job titles may span more than one career level. The pay range(s) for this role is listed above and represents the base salary range for non-commissionable roles or on-target earnings for commissionable roles. Actual compensation packages are dependent upon several factors that are unique to each candidate, including but not limited to : job-related skills, depth of transferable experience, relevant certifications and training, business needs, market demands and specific work location. Based on the factors above, Onehouse utilizes the full width of the range; the base pay range is subject to change and may be modified in the future. The total compensation package for this position will also include eligibility for equity options and the benefits listed above.

    We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

    Create a job alert for this search

    Software Engineer Data • Sunnyvale, CA, US

    Related jobs
    Senior Software Engineer, Big Data

    Senior Software Engineer, Big Data

    ZipRecruiter • Palo Alto, CA, US
    Full-time
    We offer a hybrid work environment.Most US-based positions can also.To actively connect people to their next great opportunity. ZipRecruiter is a leading online employment marketplace.Powered by AI-...Show more
    Last updated: 30+ days ago • Promoted
    Systems Engineer, Positioning and Compute

    Systems Engineer, Positioning and Compute

    Waymo • Mountain View, CA, United States
    Full-time
    Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...Show more
    Last updated: 16 days ago • Promoted
    Data Engineer - Multimodal Systems

    Data Engineer - Multimodal Systems

    Zyphra • Palo Alto, CA, US
    Full-time
    Data Engineer - Multimodal Systems.Zyphra’s datasets and data pipelines across a variety of modalities.Your work will intersect with almost every team at Zyphra. You will be involved in collec...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer - Distributed Data Systems

    Software Engineer - Distributed Data Systems

    xAI • Palo Alto, CA, US
    Full-time
    AI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering exc...Show more
    Last updated: 15 days ago • Promoted
    AI Incubator - Data Engineer

    AI Incubator - Data Engineer

    Sprinter Health • Menlo Park, CA, US
    Full-time
    At Sprinter Health, our mission is reimagining how people access care by bringing it directly to their homes.Nearly 30% of patients in the U. For many, the ER becomes their first touchpoint with the...Show more
    Last updated: 1 day ago • Promoted
    Senior Data Software Engineer

    Senior Data Software Engineer

    PsiQuantum • Palo Alto, CA, United States
    Full-time
    Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Engineer

    Senior Data Engineer

    Visa • Foster City, CA, United States
    Full-time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Engineer

    Senior Data Engineer

    AngelList • San Francisco, CA, US
    Full-time
    We exist to accelerate innovation.We do this by giving more people the opportunity to participate in the venture economy by building the financial infrastructure that makes it possible for more peo...Show more
    Last updated: 1 day ago • Promoted
    Software Engineer

    Software Engineer

    Supermicro • San Jose, CA, United States
    Full-time
    Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Engineer

    Senior Data Engineer

    Plum Inc • San Francisco, CA, US
    Full-time
    PLUM is a fintech company empowering financial institutions to grow their business through a cutting-edge suite of AI-driven software, purpose-built for lenders and their partners across the financ...Show more
    Last updated: 30+ days ago • Promoted
    Data Engineer

    Data Engineer

    SteerBridge • Miramar, CA, US
    Full-time
    SteerBridge Strategies is a CVE-Verified Service-Disabled, Veteran-Owned Small Business (SDVOSB) delivering a broad spectrum of professional services to the U. Backed by decades of hands-on experien...Show more
    Last updated: 1 day ago • Promoted
    Senior / Lead Data Solution Engineer

    Senior / Lead Data Solution Engineer

    Meltwater • Redwood City, CA, United States
    Full-time
    We're thrilled to embark on the search for a seasoned.Senior / Lead Data Solution Engineer.This pivotal role offers an exciting opportunity to shape the future of technology within our organization.A...Show more
    Last updated: 7 days ago • Promoted
    Software Engineer - Data Streaming

    Software Engineer - Data Streaming

    TigerGraph • Redwood City, CA, US
    Full-time
    TigerGraph is a platform for advanced analytics and machine learning on connected data.TigerGraph's core technology is the only scalable graph database for the enterprise.Its proven technology ...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer - Data Replication

    Senior Software Engineer - Data Replication

    TiDB • Sunnyvale, CA, US
    Full-time
    Join us as we scale our business by building on our tremendous success around the world.The massive database market is going to double over the next few years (the IDC estimates it to be $119B+ by ...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Engineer

    Senior Data Engineer

    Toyota Research Institute • Los Altos, CA, US
    Full-time
    At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life.We’re developing new tools and capabilities to amplify the human experience.To lead this tran...Show more
    Last updated: 1 day ago • Promoted
    Senior Software Engineer / Data Engineer

    Senior Software Engineer / Data Engineer

    Twilio • San Francisco, CA, United States
    Full-time
    At Twilio, we're shaping the future of communications, all from the comfort of our homes.We deliver innovative solutions to. As we continue to revolutionize how the world interacts, we're acquiring ...Show more
    Last updated: 18 hours ago • Promoted • New!
    Solution Engineer

    Solution Engineer

    Supermicro • San Jose, CA, United States
    Full-time
    Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Operations Developer

    Senior Data Operations Developer

    Epicor • Dublin, CA, United States
    Permanent
    The Senior Data Operations Engineer will work with key business stakeholders and IT teams to deliver and deploy Business Intelligence (BI), and advanced data engineering solutions within the BI Eng...Show more
    Last updated: 7 days ago • Promoted