Talent.com
Sr. Software Engineer- AI / LLM

Sr. Software Engineer- AI / LLM

SupermicroSan Jose, CA, United States
9 hours ago
Job type
  • Full-time
Job description

Job Req ID : 26294

About Supermicro :

Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers worldwide. We are the #5 fastest growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, passionate, and committed engineers, technologists, and business leaders to join us.

Job Summary :

Supermicro is seeking an experienced and exceptional Sr. Software Engineer to work on web-based applications for business process automation. This is a key role that will give you the opportunity to expand your existing knowledge in programming.

Essential Duties and Responsibilities :

Includes the following essential duties and responsibilities (other duties may also be assigned)

  • Integrating open-source LLMs (e.g., Llama 3.2 90B) with open-source vector databases, search indexing, and contextual query management
  • Design and implement Retrieval-Augmented Generation (RAG) pipelines, incorporating embedding generation, vector search, re-ranking, and contextual retrieval techniques
  • Optimize search and retrieval systems using Elasticsearch and vector databases
  • Develop and deploy an intelligent AI Agent to assist customers in selecting and purchasing the correct servers based on their unique requirements and use cases
  • Integrate AI Agents with backend databases, recommendation engines, and decision-making pipelines
  • Design workflows for task automation, contextual reasoning, and real-time recommendations
  • Design scalable web scraping pipelines using tools like Scrapy, Selenium, and BeautifulSoup to acquire structured and unstructured data
  • Process and clean scraped data to integrate it seamlessly into databases and knowledge retrieval systems
  • Design and manage relational databases (include., PostgreSQL, MySQL, MS SQL) for structured data storage and retrieval
  • Work with document-based databases (e.g., MongoDB) for handling unstructured data sources
  • Optimize database queries and structures to ensure efficient system performance
  • Design, test, and optimize prompts for large language models (LLMs) to improve response accuracy, context management, and task completion
  • Experiment with prompt tuning and contextual input adjustments to enhance LLM performance in specific use cases
  • Extract, clean, and preprocess data from various sources, including relational databases, document databases, PDFs, and images
  • Write code for parsing and processing non-text data formats
  • Develop Python-based web services using popular framework to enable backend APIs and real-time interactions
  • Create interactive dashboards for data visualization and system control using Streamlit
  • Collaborate with frontend developers to ensure seamless integration between APIs and user-facing interfaces
  • Deploy system components in Linux environments using Docker for scalability and portability
  • Optimize system performance for GPU-intensive tasks, ensuring efficient resource utilization
  • Identify common user queries, challenges, and areas for improvement
  • Test the system regularly from the user's perspective to validate its performance and accuracy.
  • Analyze user feedback and satisfaction, iterating on system design, prompts, and workflows to improve response quality and relevance
  • Collaborate with cross-functional teams to implement enhancements based on user behavior and feedback trends
  • Handle intricate, repetitive, or time-consuming tasks, such as dataset cleaning, normalization, and troubleshooting
  • Ensure data accuracy and reliability, understanding that these foundational tasks are critical for system success

Qualifications :

  • BS or above in Computer / Information Science or other relevant degree
  • Minimum 8 years of working experience in software development preferred
  • Programming Skills in C#, SQL, Java, JavaScript, AJAX
  • C# ASP.NET project experience is a plus
  • Salary Range

    $170,000 - $190,000

    The salary offered will depend on several factors, including your location, level, education, training, specific skills, years of experience, and comparison to other employees already in this role. In addition to a comprehensive benefits package, candidates may be eligible for other forms of compensation, such as participation in bonus and equity award programs.

    EEO Statement

    Supermicro is an Equal Opportunity Employer and embraces diversity in our employee population. It is the policy of Supermicro to provide equal opportunity to all qualified applicants and employees without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status or special disabled veteran, marital status, pregnancy, genetic information, or any other legally protected status.

    Create a job alert for this search

    Sr Software Engineer • San Jose, CA, United States

    Related jobs
    • Promoted
    AI / ML Engineer

    AI / ML Engineer

    FoodHealth CompanySan Francisco, CA, US
    Full-time
    We are on a mission to improve the world's health through food.Through our flagship product, the FoodHealth Score, we provide data tools that empower consumers to make better choices, help reta...Show moreLast updated: 1 day ago
    • Promoted
    Software Engineer, AI / ML

    Software Engineer, AI / ML

    Glu Mobile Inc.San Francisco, CA, United States
    Full-time
    Glue is a well-funded startup working on the next generation of work communication tools.We believe that today’s work chat is noisy, unstructured, and not designed for productivity.We’re drawing fr...Show moreLast updated: 5 days ago
    • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Metric BioFremont, CA, US
    Full-time
    Metric Bio is recruiting on behalf of a San Francisco–based digital health company that is building an AI-powered platform to transform patient care and healthcare delivery.ML techniques to s...Show moreLast updated: 1 day ago
    • Promoted
    Software Engineer, AI Integrations

    Software Engineer, AI Integrations

    AsanaSan Francisco, CA, United States
    Full-time
    The AI Integrations team is looking for an experienced software engineer to empower builders with the context and tools to deploy powerful workflows across people and apps.The team owns making work...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    HarnhamFremont, CA, US
    Full-time
    STAFF MACHINE LEARNING ENGINEER.Hybrid – Bay Area (3 Days / Week Onsite).We’re a fast-growing online marketplace backed by a major global tech player. Our platform helps millions of people...Show moreLast updated: 27 days ago
    • Promoted
    Software Engineer, AI Infra

    Software Engineer, AI Infra

    ShepherdSan Francisco, CA, United States
    Full-time
    We provide savings on insurance premiums for commercial businesses that are leveraging modern technology on their worksites. While we began with commercial construction, we're expanding into adjacen...Show moreLast updated: 28 days ago
    • Promoted
    Sr. Software Engineer (AI Orchestration Zone, Backend Leaning)

    Sr. Software Engineer (AI Orchestration Zone, Backend Leaning)

    ZapierSan Francisco, CA, United States
    Full-time
    We're humans who simply think computers should do more work.Our mission is to make automation work for everyone by delivering products that delight. You’ll collaborate with brilliant people, use the...Show moreLast updated: 30+ days ago
    • Promoted
    Senior AI / ML engineer

    Senior AI / ML engineer

    Storm3Fremont, CA, US
    Full-time
    A1; Senior Data / AI / ML Engineer.F4BC; Series A HealthTech | Clinical Data Intelligence Platform.F30D; San Francisco (Hybrid | US Only). F4B0; $200,000+ (base + benefits + equity).A Series A Healt...Show moreLast updated: 1 day ago
    • Promoted
    Sr Machine Learning Engineer - GenAI, LLM, Agentic AI

    Sr Machine Learning Engineer - GenAI, LLM, Agentic AI

    CerebrasSanta Clara, CA, United States
    Full-time
    We are building the next generation of our AI-powered talent platform, aiming to match the right career for everyone in the world. Our AI-native enterprise talent intelligence platform leverages Gen...Show moreLast updated: 20 days ago
    • Promoted
    Sr. Full Stack engineer AI Application development

    Sr. Full Stack engineer AI Application development

    Info Way SolutionsFremont, CA, United States
    Full-time
    This is Backiyam from Info Way Solutions, LLC We have job opening for.Full Stack engineer AI Application development.Job description is given below : . Kindly please share me the details along with th...Show moreLast updated: 3 days ago
    • Promoted
    Software Engineer - AI

    Software Engineer - AI

    PromiseSan Francisco, CA, United States
    Permanent
    Promise modernizes how government agencies and utilities support people in financial difficulty.We build technology that makes it simple for residents to receive benefits, engage with assistance pr...Show moreLast updated: 2 days ago
    • Promoted
    Sr. Engineer, AI Solutions Architect

    Sr. Engineer, AI Solutions Architect

    LenovoSan Jose, CA, United States
    Full-time
    Engineer, AI Solutions Architect.United States of America - California - San Jose.Lenovo is a US$69 billion revenue global technology powerhouse, ranked #196 in the Fortune Global 500, and serving ...Show moreLast updated: 2 days ago
    • Promoted
    Sr. Software Engineer - GenAI

    Sr. Software Engineer - GenAI

    Databricks Inc.San Francisco, CA, United States
    Full-time
    As a Senior Applied AI Engineer at Databricks, you will apply machine learning, scheduling and optimization algorithms to improve the efficiency and performance of our engineering systems and infra...Show moreLast updated: 9 days ago
    • Promoted
    ML / AI Engineer

    ML / AI Engineer

    RilletSan Francisco, CA, United States
    Full-time
    Our customers are the financial brains of their companies.Our job is to help them run the numbers with impossible speed, accuracy, and insight. Today, we do that with powerful and elegant accounting...Show moreLast updated: 30+ days ago
    • Promoted
    Sr. Software Engineer, Mobile

    Sr. Software Engineer, Mobile

    Icon VenturesSan Francisco, CA, United States
    Full-time
    Our $1B+ learning platform serves tens of millions of students every month, powering over 2 billion learning interactions monthly. We blend cognitive science with machine learning to personalize and...Show moreLast updated: 16 days ago
    • Promoted
    Sr. Software Engineer

    Sr. Software Engineer

    DocuSign, Inc.San Francisco, CA, United States
    Full-time
    Docusign brings agreements to life.Docusign solutions to accelerate the process of doing business and simplify people’s lives. With intelligent agreement management, Docusign unleashes business-crit...Show moreLast updated: 30+ days ago
    • Promoted
    AIML - Sr. Machine Learning Engineer, World Knowledge

    AIML - Sr. Machine Learning Engineer, World Knowledge

    Apple Inc.Cupertino, CA, United States
    Full-time
    Machine Learning Engineer, World Knowledge.Cupertino, California, United States Machine Learning and AI.Are you excited about Generative AI and Large Language Models and eager to apply your experti...Show moreLast updated: 29 days ago
    • Promoted
    Software Engineer, Generative AI

    Software Engineer, Generative AI

    MatchPalo Alto, CA, United States
    Full-time
    Tinder launched in 2012 and has grown to connect people across the globe, with millions of users and interactions that reflect real connection. The company won multiple awards, including Effie Award...Show moreLast updated: 21 days ago
    • Promoted
    Software Engineer, AI

    Software Engineer, AI

    MonographSan Francisco, CA, United States
    Full-time
    Ambrook's mission is to make sustainability profitable for family-run businesses.In the face of historic heat waves, drought, flooding, supply chain disruptions, water shortages, and pollution, cli...Show moreLast updated: 26 days ago
    • Promoted
    Senior Software Engineer, AI Model serving - San Francisco, USA

    Senior Software Engineer, AI Model serving - San Francisco, USA

    Clutch CanadaSan Francisco, CA, United States
    Full-time
    PLEASE APPLY THROUGH THIS LINK : .The mission of Speechify is to make sure that reading is never a barrier to learning.Over 50 million people use Speechify’s text-to-speech products to turn whatever ...Show moreLast updated: 30+ days ago