Talent.com
Software Engineer - AI / LLM

Supermicro, San Jose, CA, United States
30+ days ago
Job type
  • Full-time
Job description

Job Req ID : 26294

About Supermicro :

Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers worldwide. We are the #5 fastest growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, passionate, and committed engineers, technologists, and business leaders to join us.

Job Summary :

Supermicro is seeking an experienced and exceptional Application Software Engineer to work on web-based applications for business process automation. This is a key role that will give you the opportunity to expand your existing knowledge in programming.

Essential Duties and Responsibilities :

Includes the following essential duties and responsibilities (other duties may also be assigned):

  • Integrate open-source LLMs (e.g., Llama 3.2 90B) with open-source vector databases, search indexing, and contextual query management
  • Design and implement Retrieval-Augmented Generation (RAG) pipelines, incorporating embedding generation, vector search, re-ranking, and contextual retrieval techniques
  • Optimize search and retrieval systems using Elasticsearch and vector databases
  • Develop and deploy an intelligent AI Agent to assist customers in selecting and purchasing the correct servers based on their unique requirements and use cases
  • Integrate AI Agents with backend databases, recommendation engines, and decision-making pipelines
  • Design workflows for task automation, contextual reasoning, and real-time recommendations
  • Design scalable web scraping pipelines using tools like Scrapy, Selenium, and BeautifulSoup to acquire structured and unstructured data
  • Process and clean scraped data to integrate it seamlessly into databases and knowledge retrieval systems
  • Design and manage relational databases (e.g., PostgreSQL, MySQL, MS SQL) for structured data storage and retrieval
  • Work with document-based databases (e.g., MongoDB) for handling unstructured data sources
  • Optimize database queries and structures to ensure efficient system performance
  • Design, test, and optimize prompts for large language models (LLMs) to improve response accuracy, context management, and task completion
  • Experiment with prompt tuning and contextual input adjustments to enhance LLM performance in specific use cases
  • Extract, clean, and preprocess data from various sources, including relational databases, document databases, PDFs, and images
  • Write code for parsing and processing non-text data formats
  • Develop Python-based web services using a popular framework to enable backend APIs and real-time interactions
  • Create interactive dashboards for data visualization and system control using Streamlit
  • Collaborate with frontend developers to ensure seamless integration between APIs and user-facing interfaces
  • Deploy system components in Linux environments using Docker for scalability and portability
  • Optimize system performance for GPU-intensive tasks, ensuring efficient resource utilization
  • Identify common user queries, challenges, and areas for improvement
  • Test the system regularly from the user's perspective to validate its performance and accuracy
  • Analyze user feedback and satisfaction, iterating on system design, prompts, and workflows to improve response quality and relevance
  • Collaborate with cross-functional teams to implement enhancements based on user behavior and feedback trends
  • Handle intricate, repetitive, or time-consuming tasks, such as dataset cleaning, normalization, and troubleshooting
  • Ensure data accuracy and reliability, understanding that these foundational tasks are critical for system success

Qualifications :

  • BS or above in Computer / Information Science or other relevant degree
  • Minimum 5 years of working experience in software development preferred
  • Programming skills in C#, SQL, Java, JavaScript, and AJAX
  • C# ASP.NET project experience is a plus

Salary Range :

$147,000 - $168,000

The salary offered will depend on several factors, including your location, level, education, training, specific skills, years of experience, and comparison to other employees already in this role. In addition to a comprehensive benefits package, candidates may be eligible for other forms of compensation, such as participation in bonus and equity award programs.

EEO Statement :

Supermicro is an Equal Opportunity Employer and embraces diversity in our employee population. It is the policy of Supermicro to provide equal opportunity to all qualified applicants and employees without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status or special disabled veteran, marital status, pregnancy, genetic information, or any other legally protected status.
