Analyse and organize (structured and unstructured data)
Designing, implementing, and operating large-scale, high-volume, high-performance data structures for analytics and data science Build/Develop data systems and pipelines
Liaise with Source system teams to identify and validate data
Implementing data ingestion routines both real time and batch using best practices in data modeling, ETL/ELT processes
Design, develop, test and deploy frontend visualization (dashboards and reports) in collaboration with business end users.
Continually to improve ongoing reporting and analysis processes, automation or simplifying self-service modeling and production support for users
Implement solutions to facilitate more effective data discovery by data users
Ensure data management processes comply with established framework & policies
Preparing data for prescriptive and predictive modelling
Implementation of the data lake - be part of the team to build up, pilot and gain knowledge and proficiency in the AWS cloud hosted infrastructure and data analytic tools
Translate business requirements into robust, scalable, operable solutions with a flexible and adaptable data architecture.
Implement and adopt best practices in data system creation, data integrity, test design, analysis, validation, and documentation
Requirements
Degree with minimum 1 year of relevant work experience as a Big Data Engineer with demonstrated strength in ETL/ELT (SSIS, AWS Glue), data modelling, data warehouse technical architecture and reporting/analytic tools
Hands-on experience in AWS cloud services. E.g. S3, redshift, Lambda Related working experience, specifically in the areas of data management and quality
Passionate about working with huge datasets and experience working with businesses to build data products and services to turn data into insights using advanced analytics.
Experience with curation of data for analytics/AI, and a strategic/long term view on architecting data eco systems.
Experienced in building efficient and scalable data services and has the ability to integrate data systems with relevant tools and services.