Benefits :
- Competitive salary
- Flexible schedule
- Opportunity for advancement
Sr Data Scientist (NLP / LLM / Generative AI)
Location : Dallas, TX
Roles & Responsibilities :
Design, build, fine-tune, and deploy LLMs, transformer-based NLP models, and GenAI solutions for both batch and real-time / streaming contexts.Own all major components of ML pipelines : data ingestion, cleaning, pre-processing (structured & unstructured), embedding, search & retrieval, prompt engineering, RAG (Retrieval-Augmented Generation).Collaborate closely with ML Engineers, MLOps, software engineering, product, compliance, legal etc., to move models from prototype to production-ensuring reliability, scalability, monitoring, and maintainability.Define and implement evaluation frameworks : accuracy, bias, fairness, hallucination, consistency, latency; run UAT, stress-tests, drift detection.Optimize models and pipelines for performance, cost, and efficiency.Ensure best practices in model development : version control, repeatability, documentation, governance, and ethical AI use.Mentor more junior data scientists; help build team skills in NLP, GenAI practices, prompt engineering, fine-tuning.Identify new use cases; prototype innovations in GenAI / NLP; keep up with latest research and open source developments, decide what to adopt.Must-Have Qualifications :
10+ years of experience in data science / ML, with substantial work in NLP, LLMs, or Generative AI.Deep hands-on experience in Python, using frameworks like PyTorch, TensorFlow, HuggingFace etc.Proven track record building transformer / NLP / LLM models; experience with fine-tuning, prompt engineering.Solid experience with information retrieval / search : keyword + semantic search, embeddings, vector databases.Experience working in production / deploying models (batch and streaming), working with MLOps practices.Strong algorithmic / statistical / mathematical fundamentals. Ability to reason about model behaviour, bias, uncertainty.Good communicator : able to translate complex technical detail to business / non-technical stakeholders.Nice to Have :
Master's in Computer Science, Computational Linguistics, Statistics, Machine Learning or related field.Experience with multimodal models (vision + text) or emerging LLMs and agent-based systems.Experience with open source LLMs & toolkits; familiarity with LangChain or similar frameworks.Prior experience in regulated environments (finance, risk, legal, compliance) with strong governance, privacy requirements.Work remote temporarily due to COVID-19.
Compensation : $150,000.00 - $210,000.00 per year
About Us
We work to deliver profitability in your business - with effective communication, consulting, and interactive solutions. Following an Agile Work Approach, we make sure you get the ideal solutions at minimum expenses.
Work Approach
Our Philosophy
Our Philosophy starts-and-ends at the Client-first approach. Be it understanding your business requirements to choosing the right technologies, we work as a collective team that takes all the possible steps to grow continuously towards our common goal.
Work Policy
We promote a collaborative work environment. We involve everyone working in the organization in community decisions and encourage them to think from a broader perspective. Our work process promotes flexibility and we maintain a high level of discipline at different levels of execution.
The Future
SelectMinds have years of experience in the domain helps us understand the need-of-the-hour better. This understanding drives us to a better future with every minute ticking. We believe we will be taking off major businesses from their flagship positions, with the products we are eyeing today.