
Experience: 5-10 Years
Role Overview
We are looking for a highly skilled Data Scientist with deep expertise in AI/ML and Generative AI, particularly in LLMs and transformer-based architectures. This role involves building scalable AI solutions, optimizing model performance, and driving innovation with techniques such as RAG and prompt engineering, as well as multimodal AI systems.
Key Responsibilities
- Design, develop, and deploy AI/ML models with a focus on LLMs and transformer-based architectures
- Build and optimize RAG (Retrieval-Augmented Generation) pipelines and semantic search systems
- Work with vector databases for efficient retrieval and embedding-based systems
- Fine-tune and optimize models using techniques like quantization, distillation, and prompt engineering
- Collaborate with engineering teams to deploy models on cloud platforms (AWS/Azure/GCP)
- Develop and maintain AI evaluation frameworks and benchmarking methodologies
- Contribute to building multimodal AI solutions (text, image, audio)
- Ensure scalability using Docker, Kubernetes, and microservices-based architectures
Required Skills
- Strong expertise in Python, PyTorch, TensorFlow, and Hugging Face
- Hands-on experience with LLMs, RAG, embeddings, and vector DBs
- Experience in cloud deployment and MLOps pipelines
- Strong understanding of AI optimization and model performance tuning