• 5+ years of overall industry experience and minimum of 2-4 years of experience building and deploying large scale data processing pipelines in a production environment
• Focus on excellence: Has practical experience of Data-Driven Approaches, Is familiar with the application of Data Security strategy, Is familiar with well know data engineering tools and platforms
•Technical depth and breadth : Able to build and operate Data Pipelines, Build and operate Data Storage, Has worked on big data architecture within Distributed Systems. Is familiar with Infrastructure definition and automation in this context. Is aware of adjacent technologies to the ones they have worked on. Can speak to the alternative tech choices to that made on their projects.
• Implementation and automation of Internal data extraction from SAP BW / HANA
• Implementation and automation of External data extraction from openly available internet data sources via APIs
• Data cleaning, curation and enrichment by using Alteryx, SQL, Python, R, PySpark, SparkR
• Data ingestion and management in Hadoop / Hive
• Preparing consolidated DataMart for use by Data Scientists and managing SQL Databases
• Exposing data via Alteryx, SQL Database for consumption in Tableau
• Data documentation maintenance/update
• Collaboration and workflow using a version control system (e.g., Git Hub)
• Learning ability: Is self-reflective, has a hunger to improve, has a keen interest to drive their own learning. Applies theoretical knowledge to practice
• Build analytics tools that utilize the data pipeline to provide actionable insights into customer acquisition, operational efficiency and other key business performance metrics.
• Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs.
• Flexible Working Hours: This role requires the flexibility to work non-traditional hours, including providing support during off-hours or weekends for critical data pipeline job runs, deployments, or incident response, while ensuring the total work commitment remains a 40-hour week.
•Create data tools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader.
• Work with data and analytics experts to strive for greater functionality in our data systems.
Skills and Experience
• Deep knowledge in manipulating, processing, and extracting value from datasets;
support the day-to-day operations of these GCP-based data pipelines, ensuring data governance, reliability, and performance optimization. Hands-on experience with GCP data services such as Dataflow, BigQuery, Dataproc, Pub/Sub, and real-time streaming architectures is preferred.
• + 5 years of experience in data engineering, business intelligence, data science, or related field;
• Proficiency with Programming Languages: SQL, Python, R
• Spark, PySpark, SparkR, SQL for data processing;
• Strong project management skills and ability to plan and prioritize work in a fast-paced environment;
• Experience with: MS Azure Data Factory, MS Azure Data Lake Store, SQL Database, SAP BW/ ECC / HANA, Alteryx, Tableau;
• Ability to think creatively, highly-driven and self-motivated;
• Knowledge of SAP BW for HANA (Extractors, Transformations, Modeling aDSOs, Queries, OpenHubs)
Mondelēz International is an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation or preference, gender identity, national origin, disability status, protected veteran status, or any other characteristic protected by law.
Job TypeRegularSoftware & ApplicationsTechnology & Digital