- Design, develop, and maintain data pipelines independently to enable faster business analysis.
- Develop optimized BigQuery queries to extract data from Google Cloud Platform for the development of statistical models.
- Solve complex SQL and big data performance challenges.
- Use Python, PySpark, Scala, or Java to analyze large-scale datasets in batch and in real time.
- Translate complex business logic into optimal feature engineering pipelines for real-time processing of large volumes of data.
Qualifications:
- 2+ years of experience as a Data or BI Engineer working with large, complex data scenarios.
- Hands-on experience with SQL and query performance tuning.
- Coding proficiency in at least one modern programming language (Python preferred).
- Knowledge of Google BigQuery is a plus but not mandatory. Should be comfortable writing advanced SQL queries.
- Prior experience with big data technologies (Hadoop, Hive, HBase, Pig, Spark, etc.).
- Prior experience working with media streaming data (good to have, but not mandatory).