We are looking for an experienced Data Scientist specializing in Generative AI (GenAI) to join our remote team. The ideal candidate will have deep expertise in LLMs, generative models, NLP, and AI pipelines, along with a track record of delivering production-grade AI solutions.
Design, develop, and deploy generative AI models (text, image, or multimodal).
Build and optimize LLM pipelines, embeddings, and RAG workflows.
Develop and maintain data preprocessing, feature engineering, and ETL pipelines.
Collaborate with product and engineering teams to integrate AI models into applications.
Evaluate, fine-tune, and benchmark LLMs and other generative models.
Implement AI solutions with scalability, reliability, and performance in mind.
Ensure data privacy, compliance, and security in all AI solutions.
Stay up to date with the latest developments in Generative AI, NLP, and ML research.
5+ years of experience as a Data Scientist / ML Engineer / AI Engineer.
Strong expertise in Generative AI, LLMs, and NLP.
Hands-on experience with LLM APIs and model families such as OpenAI, Anthropic, Cohere, or Llama.
Proficiency in Python and in PyTorch, TensorFlow, or JAX.
Experience with LangChain, LlamaIndex, Hugging Face, or similar AI frameworks.
Knowledge of RAG (Retrieval-Augmented Generation), embeddings, and prompt engineering.
Experience with vector databases (Pinecone, Weaviate, Milvus, FAISS).
Familiarity with cloud platforms (AWS, Azure, GCP) for AI deployment.
Strong understanding of data wrangling, feature engineering, and ML model evaluation.
Experience with multimodal AI (text+image/audio/video).
Exposure to MLOps and AI deployment pipelines.
Knowledge of reinforcement learning from human or AI feedback (RLHF/RLAIF).
Experience with APIs, microservices, and containerized deployments (Docker/Kubernetes).
Contributions to open-source AI projects or research publications.