Role: Senior Data Scientist (Gen AI specialization)

Location: Bangalore, Hybrid (5-day work week, with 3 days in the office)

Experience: 6+ years (including 1+ years in GenAI applications)

Role Brief:

We are seeking a Senior Data Scientist (GenAI specialization) who will architect and deploy production-grade GenAI systems that enable natural language querying across our vast document ecosystem. This isn't about API integrations - you'll work from first principles to solve real retrieval challenges at enterprise scale.

Primary Responsibilities:

- Build and implement enterprise-level GenAI chatbots capable of natural-language search over extensive document repositories and databases

- Research and develop multi-agentic systems for production-grade applications that handle large-scale documents and big data

- Optimize vector databases for enterprise-scale semantic search, context management, chat history, and source attribution

- Build document processing pipelines with intelligent chunking, metadata extraction, and multi-format handling

- Implement advanced retrieval strategies: hybrid search, re-ranking, and custom evaluation frameworks

- Deploy on the AWS cloud platform, handling concurrent requests at scale

- Integrate and optimize LLMs via AWS Bedrock or Azure, with hallucination mitigation

- Debug and resolve retrieval quality issues, latency bottlenecks, and cost optimization challenges

- Mentor team members and shape GenAI practices across the organization

- Work as an individual contributor reporting to the Engineering Manager

Skills Required:

- Building GenAI applications from scratch, including custom chunking strategies, metadata extraction, multi-format document processing pipelines, indexing, context management, memory management, fine-tuning techniques, and model compression techniques (must be able to demonstrate through technical justification and code)

- Vector database production implementation: indexing, retrieval optimization, performance tuning

- Embeddings and semantic search: sentence-transformers, similarity search, distance metrics implementation

- Advanced retrieval techniques: hybrid search (semantic + keyword), multi-stage retrieval

- Production LLM integration / hosting: context management, token optimization, streaming responses, citation systems

- Evaluation & metrics for RAG systems & chatbots

- Knowledge of deployment, monitoring, ethics, and governance principles

- RESTful API development with FastAPI, including error handling

Soft Skills:

- Mentorship abilities - Comfortable teaching GenAI concepts to team members at varying skill levels

- Communication skills - Can explain complex architecture and AI application behavior to non-technical stakeholders

- Problem-solving mindset - Ability to debug retrieval issues, optimize performance, and handle edge cases

- Self-driven - Can independently research solutions and make architectural decisions

- Collaborative - Works effectively with MLOps, data engineers, and product teams

- Pragmatic approach - Balances innovation with production reliability and business constraints

About Aurigo: Aurigo is the world's leading provider of enterprise SaaS for capital infrastructure management. We serve the United States and Canada, primarily supporting the public sector (state and local government) while expanding into the private sector. Our platform delivers over $400B of capital infrastructure across North America. Aurigo is an equal opportunity employer committed to diversity and inclusion.