
Data Platform Engineer
- Katowice, śląskie
- Permanent
- Full-time
- have 5+ years of experience in a Platform Engineer role and a graduate degree in Computer Science, Statistics, Informatics, Information Systems or another quantitative field,
- have experience with advanced Azure DevOps pipelines,
- have experience using one or more of the following software components:
  - management of big data clusters using CM, Ranger, Knox and Hue,
  - producing and consuming streaming data (Kafka),
  - other storage layers (e.g. S3, Ceph),
- know scripting and automation tools such as Python, Ansible and Bash,
- know how to build cloud-agnostic data capabilities,
- have knowledge of public cloud platforms (GCP, Azure),
- know monitoring tools like Grafana, Kibana, Prometheus and Elasticsearch.
- have experience building and optimizing big data pipelines, architectures and data sets for both batch and real-time data integration,
- have knowledge of message queuing and stream processing,
- have worked in a DevOps environment: CI/CD with Azure DevOps, test automation, Docker,
- have excellent communication skills, both verbal and written, to align with various stakeholders and cross-border squads,
- are results-oriented, determined and self-driven.
- define and implement the infrastructure required for optimal loading of data from a wide variety of data sources using big data technologies,
- operate and maintain a critical environment,
- set up and tune the monitoring of the infrastructure, including data pipelines,
- identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.,
- work with stakeholders including the Data and Design teams to assist with data-related technical issues and support their data infrastructure needs,
- keep our data separated and secure across international boundaries through multiple data centers.