Data Engineer - Databricks, BI
Original Advert
The Digital & Technology Team (D&T) is an integral division of the HEINEKEN Global Shared Services Center. We are committed to making Heineken the most connected brewery. That includes digitalizing and integrating our processes, ensuring best-in-class technology, and embedding a data-driven culture. By joining us, you will work in one of the most dynamic and innovative teams and have a direct impact on building the future of Heineken!
Would you like to meet the Team, see our office and much more? Visit our website: Heineken (heineken-dt.pl)
The Technology Specialist - Azure is responsible for designing, building, and optimizing scalable data solutions that enable advanced analytics, BI reporting, and machine learning initiatives. This role requires strong expertise in Databricks and Delta Lake, as well as cross‑team collaboration, to deliver high‑quality, production‑ready data models for business consumption.
Your responsibilities would include:
Databricks & Data Engineering:
building and maintaining Spark Declarative Pipelines for Bronze/Silver layer ingestion using metadata-driven configuration as well as business-oriented Gold Layers utilizing materialized views and other state-of-the-art data engineering techniques
developing reliable CDC and incremental load patterns (FULL LOAD, INCREMENTAL, APPEND) with proper merge/apply-changes semantics
implementing performance optimizations, such as Liquid Clustering on Delta tables, to support BI and ML workloads
designing and maintaining Unity Catalog data assets with proper access controls and row-level security
performing in‑depth data analysis, source‑to‑target mapping, and validation to ensure correctness and completeness
defining and implementing automated data quality checks, validation rules, and anomaly detection.
BI & ML Data Modeling:
translating BI (Power BI) and machine learning requirements into well‑structured, scalable data models
ensuring datasets are optimized for fast query performance, semantic clarity, and downstream analytics consumption
working closely with BI developers to streamline data flows and improve dashboard performance.
collaborating with SAP/OTM integration teams, orchestration teams, and MLOps teams to ingest, transform, and operationalize data from multiple systems
ensuring end‑to‑end pipeline reliability, versioning, documentation, and monitoring across environments
partnering with ML optimization teams to integrate and operationalize model outputs within the broader data ecosystem.
You are a good candidate if you have:
strong expertise in Databricks, PySpark, Delta Lake, and SQL
proven experience building complex data pipelines and large‑scale data models
ability to collaborate effectively with cross‑functional teams in a fast‑paced analytics environment
strong analytical mindset with close attention to detail and data quality
5-8+ years of experience in data engineering or a related field
excellent command of English.
You are a perfect match if you also have:
experience with Power BI dataset optimization and gateway configuration
hands‑on knowledge of PowerApps for workflow automation or data collection scenarios
understanding of the ML model lifecycle and MLOps tools.
At HEINEKEN Kraków, we take integrity and ethical conduct seriously. If someone has concerns about a possible violation of the legal regulations indicated in the Polish Whistleblowing Act or of our Code of Business Conduct, we encourage them to speak up. Cases can be reported to the global team or locally (in line with the local HGSS Whistleblowing procedure) by selecting the proper option in this tool or by reporting them via the hotline.
#LI-AK1 #LI-HYBRID
We offer:
Application managed by Heineken Spain