Job Title : GCP Gen AI Architect
Location : Boston, MA(Onsite)
Job Type : Full Time
Must Have Technical / Functional Skills :
Architect GEN AI solutions using GCP (GEN AI LLM, Big Query, Vertex AI and related services.
Fine-tune pretrained large language models (LLMs) and other generative models tailored to specific domain needs.
Create AI-driven applications capable of generating text, images, and other media formats.
Build Retrieval-Augmented Generation (RAG) pipelines to improve the quality of AI-generated responses.
Collaborate with various stakeholders to integrate and deploy AI models into production environments using ML Ops tools like Docker and manage deployments on GKE or Cloud Run.
Enhance model performance and scalability to meet enterprise requirements, optimising for latency, cost, and accuracy.
Implement authentication and authorisation using GCP IAM for secure access control.
Keep abreast of the latest developments in generative AI technologies and tools.
Produce clean, maintainable, and well-documented code.
Engage in code reviews and actively contribute to team knowledge sharing.
Ai Architect • Boston, MA, United States