Scientist, Data -Personalization
Full-time
Mid-Senior Level
Job Overview
Assist in applying data mining techniques and conduct statistical analysis to large, structured and unstructured data sets to understand and analyse phenomena. Model business problems, discovering insights and opportunities through statistical, algorithmic, machine learning and visualisation techniques, working closely with clients, data and technology teams to turn data into critical information used to make sound business decisions.
- Assists in building machine learning models from and utilises distributed data processing and analysis methodologies. Competent in Machine Learning programming in R or Python, with supplementary still in Matlab, Java, etc. Familiar with the Hadoop distributed computational platform, including broader ecosystem of tools such as HDFS / Spark / Kafka.
- Codes, tests and maintains scientific models and algorithms and identifies trends, patterns, and discrepancies in data and determines additional data needed to support insight. Processes, cleanses, and verifies the integrity of data used for analysis.
- Liaise and collaborate with the Data Science Guild providing support to stakeholders in the department for its data centric needs. Collaborate with subject matter experts to select the relevant sources of information and translates the business requirements into data mining/science outcomes. Presents findings and observations to team for development of recommendations.
- Supports business integration through integrating model outputs into end-point production systems, incorporating business requirements and knowledge of best practices.
- Supports various mathematical, statistical, and simulation techniques to answer business questions within specific areas of focus. Develops modelling solutions that enable the forecast of quality data outcomes. Ensures that volumetric predictions are modelled so that resource requirements are optimally considered. Supporting reporting production ensuring sustainable and effective modelling solutions.
- Use data profiling and visualisation techniques using tools to understand and explain data characteristics that will inform modelling approaches. Communicate data information to business with respective stakeholders, presenting trends, correlations and patterns found in complicated datasets in a manner that clearly and concisely conveys meaningful insights and defend recommendations under the supervision of data scientists.