Data Scientist (IIT/IIM) position with an Analytics Company in Data Engineering
Overall expectation:
- You will provide data science expertise to projects and drive delivery of high impact solutions.
- You should be conversant with developing models from scratch, including preparing data for ingestion, fine tuning, deployment and maintenance.
- You should have a learner's mindset with a high propensity to learn and experiment with new technologies and methods.
- You need to have strong communication skills to explain logic of action, approach and consequent results.
Background:
- You will be working with a brilliant team of data scientists on niche products which leverage AI and NLP.
- You will be expected to design and run experiments, research new algorithms, and find new ways to improve outcomes.
- Many of the use cases have few parallels, therefore this is a highly experimental & intellectually stimulating activity with theoretical analysis and innovation.
- You'll be responsible for organizing data, building statistical models, analyzing model performance & feature importance, fine tuning, and explaining findings and analysis in a clear and concise manner to relevant stakeholders.
Required Qualifications:
- Experience in data science with 2-6 years of machine learning/statistical modeling data analysis tools and techniques (IIT/IIM Graduates preferred)
- Experience applying theoretical models in an applied environment
- Experience with LLM and fine-tuning models (eg BERT, ROBERTA, etc)
- Deep understanding of machine learning algorithms including but not limited to regression models, classification models, clustering, boosting, and neural networks
- Advanced knowledge of data querying languages (e.g. SQL), scripting languages (e.g. Python)
- Expertise with python libraries such as Numpy, Pandas, Scikit-learn, Matplotlib
- Conversant in key metrics analysis, A/B testing (stratification, propensity score matching, causal trees), ML (regression, classification, clustering), causal inference (diff-in-diff, synthetic controls, Lasso/Ridge)
- Conversant with writing unit tests
- Strong data visualization skills
- Strong Ability to connect and communicate analysis and findings with stakeholders
Nice to haves:
- Experience with vector databases such as Pinecone
- Master's degree in a quantitative field Computer Science, Economics, or Statistics
- Knowledge of deployment and ML pipelines
Didn’t find the job appropriate? Report this Job