Position : Gen AI Architect
Location : Pleasanton CA
Duration : 1 Years
JD
We are seeking an experienced Generative AI Architect to lead the design development and deployment of cutting-edge generative AI systems. The ideal candidate will combine deep technical knowledge of AI / ML (particularly large language models and diffusion models) with strong architecture and leadership skills. You will play a critical role in shaping our AI strategy and enabling innovative products powered by generative technologies.
Key Responsibilities :
Architect and design end-to-end generative AI solutions (text image audio or multimodal) that align with business objectives.Evaluate and select appropriate foundation models (e.g. GPT LLaMA Stable Diffusion) and fine-tuning strategies.Lead the development of custom LLM applications including prompt engineering fine-tuning RLHF and model compression.Collaborate with cross-functional teams (engineering product design data science) to integrate AI into products and platforms.Ensure responsible and ethical AI practices are embedded in system design (e.g. fairness privacy explainability).Guide the implementation of AI infrastructure (data pipelines vector databases model serving APIs).Stay up-to-date on the latest AI research and tools and make recommendations for adoption.Conduct proofs-of-concept prototypes and performance benchmarking.Mentor junior engineers and contribute to best practices and internal knowledge sharing.Required Qualifications :
Bachelors or Masters degree in Computer Science Artificial Intelligence Machine Learning7 years of experience in AI / ML with 3 years in generative AI (LLMs diffusion models etc.).Proven experience designing and deploying large-scale AI systems.Deep understanding of transformer architectures tokenization and pretraining / fine-tuning paradigms .Hands-on experience with AI / ML frameworks such as PyTorch TensorFlow Hugging Face Transformers LangChain etc.Strong knowledge of MLOps cloud platforms (AWS GCP Azure) and scalable architectures (e.g. microservices serverless).Experience with vector databases (e.g. Pinecone Weaviate FAISS) and retrieval-augmented generation (RAG) systems.Familiarity with responsible AI frameworks and privacy-preserving techniques.Preferred Qualifications :
Experience with open-source LLMs and model distillation / quantization techniques.Exposure to multimodal AI models (e.g. CLIP DALL E Imagen).Contributions to AI / ML research (e.g. published papers open-source projects).Experience building GenAI copilots chatbots or productivity tools.Soft Skills :
Strong problem-solving and analytical skills.Excellent communication and stakeholder management abilities.Ability to translate complex AI concepts into business value.Entrepreneurial mindset and passion for innovation.Key Skills
APIs,Pegasystems,Spring,SOAP,.NET,Hybris,Solution Architecture,Service-Oriented Architecture,Adobe Experience Manager,J2EE,Java,Oracle
Employment Type : Full Time
Experience : years
Vacancy : 1