
Mid/Senior Data Engineer
- Warszawa, mazowieckie
- Stała
- Pełny etat
- Act to deliver.
- Disrupt to grow.
- Team up to win.
- Languages: Python, SQL
- Data Stack: Snowflake + DBT, PostgreSQL, Elasticsearch
- Processing: Apache Spark on Azure Databricks
- Workflow Orchestration: Apache Airflow
- Cloud Platform: Microsoft Azure
- Database & Storage: Azure Database for PostgreSQL, Azure Cosmos DB, Azure Blob Storage
- Security & Configuration: Azure Key Vault, Azure App Configuration, Azure Container Registry (ACR)
- Search & Indexing: Azure AI Search
- CI/CD: GitHub Actions
- Static Code Analysis: SonarQube
- AI Integration (Future Phase): Azure OpenAI
- Data Architecture Lead
- Data Engineers
- Backend Engineers
- DataOps Engineers
- Product Owner
- Agile, collaborative, and experienced work environment.
- As this project will significantly impact the organization, we expect a mature, proactive, and results-driven approach.
- You will work with a distributed team across Europe and India.
- Designing, building, and maintaining scalable, end-to-end data pipelines for ingesting, cleaning, transforming, and integrating large structured and semi-structured datasets
- Optimizing data collection, processing, and storage workflows
- Conducting periodic data refresh processes (through data pipelines)
- Building a robust ETL infrastructure using SQL technologies.
- Assisting with data migration to the new platform
- Automating manual workflows and optimizing data delivery
- Developing data transformation logic using SQL and DBT for Snowflake.
- Designing and implementing scalable and high-performance data models.
- Creating matching logic to deduplicate and connect entities across multiple sources.
- Ensuring data quality, consistency, and performance to support downstream applications.
- Orchestrating data workflows using Apache Airflow, running on Kubernetes.
- Monitoring and troubleshooting data pipeline performance and operations.
- Enabling integration of 3rd-party and pre-cleaned data into a unified schema with rich metadata and hierarchical relationships.
- Working with relational (Snowflake, PostgreSQL) and non-relational (Elasticsearch) databases
- Writing data processing logic in Python.
- Applying software engineering best practices: version control (Git), CI/CD pipelines (GitHub Actions), DevOps workflows.
- Ensuring code quality using tools like SonarQube.
- Documenting data processes and workflows.
- Participating in code reviews
- Preparing the platform for future integrations (e.g., REST APIs, LLM/agentic AI).
- Leveraging Azure-native tools for secure and scalable data operations
- Strong experience with Snowflake and DBT (must-have)
- Experience with data processing frameworks, such as Apache Spark (ideally on Azure Databricks)
- Experience with orchestration tools like Apache Airflow, Azure Data Factory (ADF), or similar
- Experience with Docker, Kubernetes, and CI/CD practices for data workflows
- Strong SQL skills, including experience with query optimization
- Experience in working with large-scale datasets
- Very good understanding of data pipeline design concepts and approaches
- Experience with data lake architectures for large-scale data processing and analytics
- Very good coding skills in Python
- Understanding and applying object-oriented programming (OOP) * Experience with version control systems: Git
- Good knowledge of English (minimum C1 level)
- Experience with PostgreSQL (ideally Azure Database for PostgreSQL)
- Experience with GitHub Actions for CI/CD workflows
- Experience with API Gateway, FastAPI (REST, async)
- Experience with Azure AI Search or AWS OpenSearch
- Familiarity with developing ETL/ELT processes (a plus)
- Optional but valuable: familiarity with LLMs, Azure OpenAI, or Agentic AI system
- Flexible working hours and approach to work: fully remotely, in the office or hybrid
- Professional growth supported by internal training sessions and a training budget
- Solid onboarding with a hands-on approach to give you an easy start
- A great atmosphere among professionals who are passionate about their work
- The ability to change the project you work on