Performance engineer
Writer Corporation • New York, NY, United States
30+ days ago
Job type
  • Full-time
Job description

About this role

WRITER is seeking a highly skilled and motivated Principal performance engineer to lead the performance optimization of our cutting-edge Generative AI technology stack. This role is critical in ensuring the scalability, efficiency, and reliability of our Large Language Models (LLMs) and Retrieval Augmented Generation (RAG) systems. You will be a key driver in identifying and resolving performance bottlenecks, optimizing resource utilization, and ensuring a seamless user experience. You will work closely with our AI research, software engineering, and infrastructure teams to deliver world-class AI solutions.

Your responsibilities

  • Performance leadership:
      • Define and implement performance engineering strategies for our Generative AI full stack, including services, applications, LLMs, RAG pipelines, and related infrastructure.
      • Lead performance testing, profiling, and analysis efforts to identify and resolve performance bottlenecks.
      • Establish and maintain performance benchmarks and SLAs for critical AI services.
      • Provide technical leadership and mentorship to performance engineering team members.
  • LLM capacity and tuning:
      • Analyze and improve LLM inference performance, including latency, throughput, and resource utilization.
      • Develop and implement strategies for LLM capacity planning and scaling.
      • Collaborate with AI researchers to evaluate and improve LLM model architectures and training techniques for performance.
      • Optimize LLM inference through techniques such as quantization, distillation, and optimized kernel implementation.
  • RAG performance optimization:
      • Design and implement performance tests for RAG pipelines, including retrieval, ranking, and generation components.
      • Identify and optimize performance bottlenecks in RAG systems, such as database queries, vector search, and document processing.
      • Evaluate and optimize RAG system architectures for scalability and efficiency.
      • Tune vector databases for optimal recall and latency.
  • Infrastructure optimization:
      • Collaborate with infrastructure teams to optimize hardware and software configurations for AI workloads.
      • Evaluate and recommend new technologies and tools for performance monitoring and analysis.
      • Develop and maintain performance dashboards and reports to track key metrics.
      • Optimize GPU utilization and memory management for LLM inference.
  • Collaboration and communication:
      • Work closely with AI researchers, software engineers, and product managers to ensure performance requirements are met.
      • Communicate performance findings and recommendations to stakeholders at all levels.
      • Stay up-to-date with the latest developments in Generative AI and performance engineering.

Is this you?

  • Education:
      • Bachelor's degree in Computer Science, Engineering, or a related field (Master's preferred).

  • Experience:
      • 10+ years of experience in performance engineering, with a focus on large-scale distributed systems.
      • 2+ years of experience working with AI / ML technologies.
      • Proven experience in performance testing, profiling, and analysis of complex software systems.
      • Deep understanding of NLP architectures, training, and inference.
      • Experience with vector databases and search technologies.
      • Experience with cloud computing platforms (e.g., AWS, Azure, GCP) and containerization technologies (e.g., Docker, Kubernetes).
      • Strong programming skills in Python.
      • Familiarity with Postgres and Elasticsearch.
      • Experience with performance analysis tools (e.g., profilers, debuggers, monitoring tools).
  • Skills:
      • Strong analytical and problem-solving skills.
      • Excellent communication and collaboration skills.
      • Ability to work in a fast-paced and dynamic environment.
      • Passion for AI and a desire to push the boundaries of performance engineering.

Benefits & perks (US Full-time employees)

  • Generous PTO, plus company holidays
  • Medical, dental, and vision coverage for you and your family
  • Paid parental leave for all parents (12 weeks)
  • Fertility and family planning support
  • Early-detection cancer testing through Galleri
  • Flexible spending account and dependent FSA options
  • Health savings account for eligible plans with company contribution
  • Annual work-life stipends for:
      • Home office setup, cell phone, internet
      • Wellness stipend for gym, massage / chiropractor, personal training, etc.
      • Learning and development stipend
  • Company-wide off-sites and team off-sites
  • Competitive compensation, company stock options and 401k

WRITER is an equal-opportunity employer and is committed to diversity. We don't make hiring or employment decisions based on race, color, religion, creed, gender, national origin, age, disability, veteran status, marital status, pregnancy, sex, gender expression or identity, sexual orientation, citizenship, or any other basis protected by applicable local, state or federal law. Under the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

By submitting your application on the application page, you acknowledge and agree to WRITER's Global Candidate Privacy Notice.
