This position requires an individual with a combination of computational and analytical skills. The candidate should have a solid foundation in computer science and statistics/math with well-honed skills in the following areas:
Data Management :
- Experience With Both Structured And Unstructured Data, And Hadoop Or Similar Technologies
- Able To Identify, Join, Explore And Examine Data From Multiple Disparate Sources And Formats
- Ability To Reduce Large Quantities Of Unstructured Or Formless Data And Get It Into A Form In Which It Can Be Analyzed
- Ability To Deal With Data Imperfections Such As Missing Values, Outliers, Inconsistent Formatting, Etc.
Statistics & Modeling :
- Knowledge Of Statistical Theories And Techniques Commonly Used In Prediction And Optimization Models
- Ability And Desire To Go Beneath The Surface Of A Problem And Distill It Into A Clear Set Of Hypotheses That Can Be Tested
- Design, Plan, And Manage Projects From Start To Finish & Produce Data-Driven Results With Appropriate Techniques To Answer Key Business Questions
- Mastery Of Statistical Programming Languages Such As R And Python
Machine Learning :
- Experience With Machine Learning Algorithms, Such As K-Nearest Neighbors, Random Forests And Ensemble Methods And Their Corresponding Implementation Packages In R Or Python
- Understand When It Is Appropriate To Use Different Techniques
- Experiences With Distributed Computing, Such As Hadoop, Spark Or Related Technologies
Software Development :
- The Ability To Write Code In Programming Languages Such As Sas, Java, R, And Python
- Capable Of Developing Prototypes And Fashion Their Own Tools To Conduct Analytic Research
Qualifications
- Advanced degree (Master- s/ Ph.D.) in Statistics, Computer Science or another quantitative discipline
- Minimum of 5 years of related experience
- Mastery of various statistical methodologies such as regression analysis (linear and non-linear), cluster analysis, CHAID, time series, survival models
- Experience with machine learning
- Experience with SQL, Scala, Python or Java
- Proficiency in at least one open source statistical program like R and KNime
- Working knowledge of Hadoop and other big data technologies
- Strong understanding of Statistics and modeling techniques
- Strong analytic thought process and ability to interpret findings
- Ability to work on multiple assignments concurrently
In addition, the candidate should have the strong business acumen, and interpersonal and communication skills, yet also be able to work independently. He/she should be able to communicate findings and the way techniques work in a manner that all stakeholders, both technical and non-technical, will understand.
Didn’t find the job appropriate? Report this Job