Talent.com
Principal Engineer, Catalog Infrastructure (Founding Team)

Attachments King • San Francisco, CA, United States
22 days ago
Job type
  • Full-time
Job description

About The Role

Attachments King is an eCommerce startup in the Heavy Equipment Industry developing proprietary software that flexibly discovers compatibility between equipment and host machine components.

We’re hiring a Principal Engineer to build and operate the single source of truth for a high‑SKU construction equipment catalog: taxonomy, product ingestion, price / availability pipelines from messy non‑API sources, and the automation layer that lets a lean team scale SKU count, price discovery, and data quality rapidly and maintainably.

We believe that, over the next decade, the bulk of online purchases won’t take place on websites with static product pages – and it’s our intention to build a flexible, generative merchandising platform that drives this change for the heavy equipment industry.

This role is based in San Francisco, CA. It is an in-office role and will extend past the standard 40 hours / week of many 9-5 jobs. We work long hours, hold weekend work sessions, and prioritize a results-driven culture.

Salary, Equity, and Benefits

Base Pay : $300,000 / year

Equity Offered : 3.50% (Options, 1yr Cliff, 4yr vest)

  • No funding raised; most recent 409A FMV is $10M.

Total Compensation : $387,500 / year

  • TC excludes potential refreshes; equity valued at 409A on grant date, amortized over 4 years
  • Employer-provided Health Insurance

  • Employer-provided 401k Plan
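The total compensation figure can be reproduced from the numbers above if the equity is valued as 3.50% of the $10M 409A FMV and spread evenly over the 4-year vest. A minimal sketch of that arithmetic follows; the function name is illustrative, and treating the 409A FMV as the base for the equity percentage is an assumption drawn from the note above.

```python
def total_compensation(base_pay, equity_bps, fmv_409a, vest_years):
    """Annualized TC: base pay plus grant-date equity value amortized over the vest.

    equity_bps is the grant size in basis points (3.50% == 350 bps), which keeps
    the arithmetic in exact integers.
    """
    equity_value = fmv_409a * equity_bps // 10_000  # 3.50% of $10M = $350,000
    return base_pay + equity_value // vest_years    # $350,000 / 4 = $87,500 per year


tc = total_compensation(300_000, 350, 10_000_000, 4)
print(f"${tc:,} / year")  # prints "$387,500 / year"
```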

Day‑to‑day scope

  • Using AI to automate your own job and programming managers of AI agents
  • Taxonomy & PIM modeling : Own category trees, attributes, variants, compatibility metadata, and normalization rules (GS1 / UNSPSC awareness; custom facets for consumer browse paths).
  • Data ingestion (messy source formats) : Build resilient pipelines for CSV / Excel, email attachments, SFTP, scraped HTML, PDFs, and images.
  • Transform & validate : Typed, idempotent ETL / ELT with schema evolution and contract-based QA.
  • Pricing & availability : Schedulers / agents to detect deltas, reconcile conflicts, discover competing listings, and publish to Shopify with guardrails for margin protection.
  • Images : Automation for background removal, resizing, deduping, and attribute extraction (e.g., dimensions, metadata).
  • Analytics : Build merchandising dashboards (assortment growth, price competitiveness, availability, metadata quality).
  • Operations & SRE : Observability, alerting, backfills, SLAs / SLOs, rollback strategies, and cost control.
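As a rough illustration of how the "typed, idempotent" transform step and the margin guardrail on publishing could fit together, here is a minimal standard-library sketch. Everything in it is hypothetical: the `PriceRecord` fields, the SKU values, the 15% margin floor, and the in-memory `seen` dict (standing in for a durable store) are assumptions for illustration, not a description of the company's actual pipeline.

```python
import hashlib
import json
from dataclasses import asdict, dataclass
from decimal import Decimal


@dataclass(frozen=True)
class PriceRecord:
    """Hypothetical typed row; the checks below act as a simple data contract."""
    sku: str
    vendor_cost: Decimal
    list_price: Decimal

    def __post_init__(self):
        if not self.sku:
            raise ValueError("sku must be non-empty")
        if self.vendor_cost <= 0 or self.list_price <= 0:
            raise ValueError("prices must be positive")


def fingerprint(record):
    """Stable content hash, so re-ingesting an unchanged row is a no-op."""
    payload = json.dumps(asdict(record), sort_keys=True, default=str)
    return hashlib.sha256(payload.encode()).hexdigest()


def plan_publish(records, seen, min_margin=Decimal("0.15")):
    """Return records that changed since the last run and pass the margin floor.

    `seen` maps sku -> last published fingerprint and is updated in place.
    """
    to_publish = []
    for rec in records:
        fp = fingerprint(rec)
        if seen.get(rec.sku) == fp:
            continue  # unchanged since the last publish
        margin = (rec.list_price - rec.vendor_cost) / rec.list_price
        if margin < min_margin:
            continue  # held back for review instead of publishing below the floor
        seen[rec.sku] = fp
        to_publish.append(rec)
    return to_publish


seen = {}
batch = [
    PriceRecord("BKT-001", Decimal("800"), Decimal("1000")),  # 20% margin
    PriceRecord("BKT-002", Decimal("950"), Decimal("1000")),  # 5% margin
]
first = plan_publish(batch, seen)   # only BKT-001 clears the guardrail
second = plan_publish(batch, seen)  # rerun of the same batch publishes nothing
```

Running the same batch twice yields no second publish, which is the property "idempotent" refers to here; a real pipeline would persist the fingerprints and route held-back rows to a review queue.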

Current Platforms

  • AWS : S3, DynamoDB, Neptune, Lambda, Step Functions, ECS / Fargate, EventBridge, SQS / SNS, CloudWatch, SSM Parameter Store.
  • ECommerce Platform : Shopify Plus
  • Analytics : Power BI / Microsoft Fabric
  • AI Tooling : Cursor, Devin, Graphite, Personal ChatGPT Pro / Claude Max plans
  • Requests for, and use of, additional AI tools are heavily encouraged

Core outcomes

  • 30 days :
  • Ship a production ingestion → normalization → enrichment → publish pipeline for all existing SKUs (2,200); stand up initial PIM data model with faceted attributes optimized for search / browse; wire price & availability watchers for all current vendors (files, web pages, emails, competitor websites).
  • Baseline data quality with automated contracts & tests; initial operational dashboards (latency, freshness, fill rates, failure rates).
  • 90 days :
  • SKU count increased to 500% of the baseline (11,000); coverage expanded to support the top 100 product families and machine categories, rank-ordered by search traffic demand; image set completeness > 95% for top movers; pricing latency
  • AI / agent workflows auto‑extract attributes from PDFs / images; continuous taxonomy evolution with zero-downtime migrations.
  • 365 days :
  • Deliver $9.25M in annual revenue, 100% attributable to zero-touch online orders of managed SKUs.
  • V0 of Generative Customer-Facing Product Artifact Pipeline Completed

Must‑have requirements

  • 7+ years building production data systems (or commensurate impact) : Python (pandas / polars), SQL (Postgres / Redshift / Snowflake / BigQuery), orchestration (Step Functions / Airflow / Prefect), eventing (SQS / Kafka), object storage (S3), CI / CD, containerization.
  • Ecommerce catalog expertise : PIM concepts (attribute schemas, variants / SKU creation, canonicalization, dedup), Shopify Admin / GraphQL, metafields, collections, feed health.
  • Non‑API data wrangling at scale : Selenium / Playwright for scraping (with robots / legal etiquette, rotation, backoff), email / SFTP ingestion, PDF OCR, document parsing.
  • Data quality & contracts : Great Expectations, Pydantic (typed models), versioned schemas, migration plans, data diffing, idempotency as a base case.
  • Image processing : PIL / Pillow, OpenCV, ImageMagick; batch pipelines and basic color / contrast / compositing.
  • Analytics : Power BI and / or Tableau; metric design for merchandising (coverage, freshness, price index, conversion lift).
  • AI / agentic workflows : Retrieval + tool‑use agents to extract attributes, reconcile conflicts, propose taxonomy changes; prompt chaining; evaluation harnesses; safe‑ops patterns for deterministic fallbacks.
  • Search relevance & indexing : Search relevance for catalogs (Meilisearch / Elastic / OpenSearch) and faceted navigation tuning.
  • AWS : S3, Lambda, Glue / Athena, Step Functions, ECS / Fargate, CloudWatch; IaC via the CDK; strong cost / performance instincts.

Nice‑to‑haves

  • Experience with homegrown PIMs
  • Vendor EDI familiarity; GS1 barcoding; UNSPSC mapping.

You Might Thrive Here If...

  • You are incredibly ambitious
  • You are a self-starter and intensely curious
  • You are hard-working and relentless, frequently going above and beyond in previous or current roles
  • You are driven by achievement and energized by big, industry-disrupting challenges
  • You want a "hardcore" work environment
  • You want to leave a positive impact on the world

About Attachments King

    Attachments King is E-Commerce for Heavy Machinery Attachments. We're pushing the boundaries of the construction industry with innovative proprietary technology that drastically improves the customer experience when purchasing heavy equipment. We firmly prioritize a hard-working, results-driven culture.

    Our bar for talent is high, and we do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status. If you are remarkably good at what you do, you belong on our team.

    For US Based Candidates : Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

    This is the most important time to be alive in human history. Join us, and be a part of something incredible.
