Talent.com
Principal Engineer, Catalog Infrastructure (Founding Team)
Principal Engineer, Catalog Infrastructure (Founding Team)Attachments King • San Francisco, CA, United States
Principal Engineer, Catalog Infrastructure (Founding Team)

Principal Engineer, Catalog Infrastructure (Founding Team)

Attachments King • San Francisco, CA, United States
22 days ago
Job type
  • Full-time
Job description

About The Role

Attachments King is an eCommerce startup in the Heavy Equipment Industry developing proprietary software that flexibly discovers compatibility between equipment and host machine components.

We’re hiring a Principal Engineer to build and operate the single source of truth for a high‑SKU construction equipment catalog : taxonomy, product ingestion, price / availability pipelines from messy non‑API sources, and the automation layer that allows us to rapidly and maintainably scale SKU count, price discovery, and data quality for a small team with lean headcount.

We believe that, over the next decade, the bulk of online purchases won’t take place on websites with static product pages – and it’s our intention to build a flexible, generative merchandising platform that drives this change for the heavy equipment industry.

This role is based in San Francisco, CA. This will be an in-office role and will extend past the standard 40 hours / week of many 9-5 jobs. We have long hours, weekend work sessions, and prioritize a results-driven culture.

Salary, Equity, and Benefits

Base Pay : $300,000 / year

Equity Offered : 3.50% (Options, 1yr Cliff, 4yr vest)

  • No Funding Raised, Most Recent 409A FMV is $10M.

Total Compensation : $387,500 / year

  • TC excludes potential refreshes; equity valued at 409A on grant date, amortized over 4 years
  • Employer-provided Health Insurance

    Employer-provided 401k Plan

    Day‑to‑day scope

  • Using AI to Automate Your job & Programming Managers of AI Agents
  • Taxonomy & PIM modeling : Own category trees, attributes, variants, compatibility metadata, and normalization rules (GS1 / UNSPSC awareness; custom facets for consumer browse paths).
  • Data ingestion (messy source formats) : Build resilient pipelines for CSV / Excel, email attachments, SFTP, scraped HTML, PDFs, and images.
  • Transform & validate : Typed, idempotent ETL / ELT with schema evolution and contract-based QA.
  • Pricing & availability : Schedulers / agents to detect deltas, reconcile conflicts, discover competing listings, and publish to Shopify with guardrails for margin protection.
  • Images : Automation for background removal, resizing, deduping, and attribute extraction (e.g., dimensions, metadata).
  • Analytics : Build merchandising dashboards (assortment growth, price competitiveness, availability, metadata quality).
  • Operations & SRE : Observability, alerting, backfills, SLAs / SLOs, rollback strategies, and cost control.
  • Current Platforms

  • AWS : S3, DynamoDB, Neptune, Lambda, Step Functions, ECS / Fargate, EventBridge, SQS / SNS, CloudWatch, SSM Parameter Store.
  • ECommerce Platform : Shopify Plus
  • Analytics : Power BI / Microsoft Fabric
  • AI Tooling : Cursor, Devin, Graphite, Personal ChatGPT Pro / Claude Max plans
  • Requests for, and use of, additional AI tools is heavily encouraged
  • Core outcomes

  • 30 days :
  • Ship a production ingestion → normalization → enrichment → publish pipeline for all existing SKUs (2,200); stand up initial PIM data model with faceted attributes optimized for search / browse; wire price & availability watchers for all current vendors (files, web pages, emails, competitor websites).
  • Baseline data quality with automated contracts & tests; initial operational dashboards (latency, freshness, fill rates, failure rates).
  • 90 days :
  • SKU count increased by 500% (11,000), coverage expanded to support top 100 product families and machine categories rank-ordered by search traffic demand; image set completeness >
  • 95% for top movers; pricing latency

  • AI / agent workflows auto‑extract attributes from PDFs / images; continuous taxonomy evolution with zero-downtime migrations.
  • 365 days :
  • Deliver $9.25M in annual revenue, 100% attributable to zero-touch online orders of managed SKUs.
  • V0 of Generative Customer-Facing Product Artifact Pipeline Completed
  • Must‑have requirements

  • 7+ years building production data systems (or commensurate impact) : Python (pandas / polars), SQL (Postgres / Redshift / Snowflake / BigQuery), orchestration (Step Functions / Airflow / Prefect), eventing (SQS / Kafka), object storage (S3), CI / CD, containerization.
  • Ecommerce catalog expertise : PIM concepts (attribute schemas, variants / SKU creation, canonicalization, dedup), Shopify Admin / GraphQL, metafields, collections, feed health.
  • Non‑API data wrangling at scale : Selenium / Playwright for scraping (with robots / legal etiquette, rotation, backoff), email / SFTP ingestion, PDF OCR, document parsing.
  • Data quality & contracts : Great Expectations, Pydantic (typed models), versioned schemas, migration plans, data diffing, idempotency as a base case.
  • Image processing : PIL / Pillow, OpenCV, ImageMagick; batch pipelines and basic color / contrast / compositing.
  • Analytics : Power BI and / or Tableau; metric design for merchandising (coverage, freshness, price index, conversion lift).
  • AI / agentic workflows : Retrieval + tool‑use agents to extract attributes, reconcile conflicts, propose taxonomy changes; prompt chaining; evaluation harnesses; safe‑ops patterns for deterministic fallbacks.
  • Search relevance & indexing : Search relevance for catalogs (Meilisearch / Elastic / OpenSearch) and faceted navigation tuning.
  • AWS : S3, Lambda, Glue / Athena, Step Functions, ECS / Fargate, CloudWatch; IaC via the CDK; strong cost / performance instincts.
  • Nice‑to‑haves

  • Experience with homegrown PIMs
  • Vendor EDI familiarity; GS1 barcoding; UNSPSC mapping.
  • You Might Thrive Here If...

  • You are incredibly ambitious
  • You are a self-starter and intensely curious
  • You are hard-working and relentless, frequently going above and beyond in previous or current roles
  • You are driven by achievement and energized by big, industry-disrupting challenges
  • You want a "hardcore" work environment
  • You want to leave a positive impact on the world
  • About Attachments King

    Attachments King is E-Commerce for Heavy Machinery Attachments. We're pushing the boundaries of the construction industry with innovative proprietary technology that drastically improves the customer experience when purchasing heavy equipment. We firmly prioritize a hard-working, results-driven culture.

    Our bar for talent is high, and we do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status. If you are remarkably good at what you do, you belong on our team.

    For US Based Candidates : Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

    This is the most important time to be alive in human history. Join us, and be a part of something incredible.

    Create a job alert for this search

    Founding Engineer • San Francisco, CA, United States

    Related jobs
    Principal, DevOps Engineer

    Principal, DevOps Engineer

    Ptc • San Mateo, California, United States
    Full-time
    Lead DevOps Strategy : Define and drive the DevOps roadmap, aligning with business and engineering goals.Infrastructure as Code (IaC) : Design and implement scalable, secure, and resilient infrastruc...Show more
    Last updated: 30+ days ago • Promoted
    Founding Infrastructure Engineer

    Founding Infrastructure Engineer

    Reducto • San Francisco, California, United States
    Full-time
    Reducto helps AI teams ingest real world enterprise data with state of the art accuracy.The vast majority of enterprise data — from financial statements to health records — is locked in unstructure...Show more
    Last updated: 13 hours ago • Promoted • New!
    Infrastructure Engineer

    Infrastructure Engineer

    FAR.AI • Berkeley, California, United States
    Full-time
    AI is a non-profit AI research institute dedicated to ensuring advanced AI is safe and beneficial for everyone.Our mission is to facilitate breakthrough AI safety research, advance global understan...Show more
    Last updated: 30+ days ago • Promoted
    Principal Software Engineer AI Platform

    Principal Software Engineer AI Platform

    Snorkel Ai • Redwood City, California, United States
    Full-time
    At Snorkel, we believe meaningful AI doesn’t start with the model, it starts with the data.We’re on a mission to help enterprises transform expert knowledge into specialized AI at scale.The AI land...Show more
    Last updated: 5 days ago • Promoted
    Platform & Infrastructure Engineer

    Platform & Infrastructure Engineer

    Mindsdb • San Francisco, California, United States
    Full-time
    MindsDB is a fast-growing AI startup headquartered in San Francisco, California.MindsDB is an AI Analytics solution that connects to diverse data sources and applications then unifies structured an...Show more
    Last updated: 30+ days ago • Promoted
    Principal Engineer, Catalog Infrastructure (Founding Team)

    Principal Engineer, Catalog Infrastructure (Founding Team)

    Attachments King • San Francisco, CA, US
    Full-time
    Principal Engineer, Catalog Infrastructure (Founding Team) Get AI-powered advice on this job and more exclusive features. Pay found in job post Retrieved from the description.Direct message the job ...Show more
    Last updated: 10 days ago • Promoted
    Principal Database Engineer

    Principal Database Engineer

    Informatica LLC • Redwood City, CA, United States
    Full-time
    Build Your Career at Informatica.We seek innovative thinkers who believe in the power of data to drive meaningful change. At Informatica, we welcome adventurous minds eager to solve the world's most...Show more
    Last updated: 24 days ago • Promoted
    Principal Engineer, Enterprise Applications

    Principal Engineer, Enterprise Applications

    Early Warning • San Francisco, CA, US
    Full-time
    At Early Warning, we've powered and protected the U.Zelle, Paze?, and so much more.As a trusted name in payments, we partner with thousands of institutions to increase access to financial services ...Show more
    Last updated: 2 days ago • Promoted
    Principal Core Infrastructure Engineer

    Principal Core Infrastructure Engineer

    Highnote • San Francisco, CA, US
    Full-time
    Join to apply for the Senior Core Infrastructure Engineer role at Highnote 3 days ago Be among the first 25 applicants Join to apply for the Senior Core Infrastructure Engineer role at Highno...Show more
    Last updated: 30+ days ago • Promoted
    Lead Infrastructure Engineer

    Lead Infrastructure Engineer

    PIP Labs • San Francisco, California, United States
    Full-time
    Story aims to grow the creativity of the internet.The internet has introduced Story is building the IP infrastructure for the internet era, where creativity and intelligence move at the speed of cu...Show more
    Last updated: 30+ days ago • Promoted
    Senior Manager, Cloud Engineering

    Senior Manager, Cloud Engineering

    PG Forsta • Emeryville, CA, United States
    Full-time
    PG Forsta is the leading experience measurement, data analytics, and insights provider for complex industries-a status we earned over decades of deep partnership with clients to help them understan...Show more
    Last updated: 2 days ago • Promoted
    ML Infrastructure Engineer (Staff / Principal)

    ML Infrastructure Engineer (Staff / Principal)

    Genesis Molecular Ai • Burlingame, California, United States
    Full-time
    We’re a tight-knit team of proven drug hunters, deep learning researchers, and software engineers united by a common mission — drive AI innovation in biochemistry, discovering and developing ground...Show more
    Last updated: 13 hours ago • Promoted • New!
    Alliance Manager, Enterprise Technology

    Alliance Manager, Enterprise Technology

    Snowflake • Menlo Park, CA, US
    Full-time
    Global Alliance Director For Enterprise Technology Alliances Partnership.Snowflake is about empowering enterprises to achieve their full potential and people too. With a culture that's all in on im...Show more
    Last updated: 30+ days ago • Promoted
    Principal Engineer, Capital Projects Division

    Principal Engineer, Capital Projects Division

    CapMetro • San Francisco, CA, United States
    Full-time
    Principal Engineer, Capital Projects Division.Interested individuals are encouraged to submit a letter of interest and résumé. This recruitment will remain opened until filled.However, first conside...Show more
    Last updated: 30+ days ago • Promoted
    Principal Systems Engineer

    Principal Systems Engineer

    Cloudflare, Inc. • San Francisco, CA, US
    Full-time
    About Us At Cloudflare, we are on a mission to help build a better Internet.Today the company runs one of the world's largest networks that powers millions of websites and other Internet properties...Show more
    Last updated: 30+ days ago • Promoted
    Platform Engineer : Core Infrastructure

    Platform Engineer : Core Infrastructure

    Slash Financial • San Francisco, CA, United States
    Full-time
    Slash is building the future of business banking, one industry at a time.We believe businesses deserve financial infrastructure tailored to how they actually operate. That's why we're creating a new...Show more
    Last updated: 30+ days ago • Promoted
    Principal Infrastructure Engineer

    Principal Infrastructure Engineer

    Nextdata Technologies Inc • San Francisco, California, United States
    Full-time
    The future of data lies in decentralization, and the concept of a data mesh is the proven approach for implementing this at Enterprise scale. We’re here to make it a reality.Nextdata OS is a data-me...Show more
    Last updated: 30+ days ago • Promoted
    HPC Storage Systems Group Leader

    HPC Storage Systems Group Leader

    Lawrence Berkeley National Laboratory • Berkeley, CA, United States
    Full-time +2
    The National Energy Research Scientific Computing Center (NERSC) is inviting applications for the position of Storage Systems Group (SSG) Lead. NERSC's mission is to accelerate scientific discovery ...Show more
    Last updated: 15 days ago • Promoted