Senior Data Engineer

Kohl's Corp

Menomonee Falls, WI

Apply

JOB DETAILS

SKILLS

Apache Spark, Application Programming Interface (API), Artificial Intelligence (AI), Automation, Best Practices, Cloud Computing, Computer Science, Continuous Deployment/Delivery, Continuous Integration, Contract Management, Cost Control, Cross-Functional, Data Management, Data Modeling, Data Processing, Data Quality, Database Design, Dimensional Modeling, Information Technology & Information Systems, Information/Data Security (InfoSec), Machine Learning, Mentoring, Modeling Languages, PCI-DSS, Performance Tuning/Optimization, Process Improvement, Product Lifecycle, Python Programming/Scripting Language, Quality Metrics, Regulatory Compliance, Reliability Engineering, SQL (Structured Query Language), Scala Programming Language, Service Level Agreement (SLA), Test Automation, Traceability, Use Cases

LOCATION

Menomonee Falls, WI

POSTED

30 days ago

About the Role

As Senior Data Engineer, you will lead the development and ownership of domain data products, including batch, streaming and artificial intelligence/machine learning (AI/ML) feature pipelines. You will drive design decisions that improve data reliability, performance and governance maturity while standardizing patterns that scale across teams. You will partner cross-functionally to enable analytics, ML and GenAI use cases with trusted data.

What You'll Do

Design, build and maintain batch, streaming and real-time Artificial Intelligence (AI) feature pipelines to extract data from diverse source systems and producers (Application Programming Interfaces (APIs), events, databases, files) ensuring efficient ingestion, transformation and publishing
Design, refine and implement scalable data models, semantic layers and data contracts to promote consistency, reuse and accessibility
Owns the end-to-end data product lifecycle for the domain. Define and maintain data contracts, including service level agreements (SLAs), schema expectations, quality metrics and consumer ownership, to ensure a reliable and trustworthy experience
Partner with cross functional teams to co-design scalable data solutions that meet business needs and clearly define the boundaries between data pipeline responsibilities and model-building activities
Develop automated workflows and Continuous Integration / Continuous Deployment (CI/CD) pipelines using tools such as Airflow, Apache Spark and Python to drive reliability and faster delivery
Implement validation, observability and evaluation frameworks that ensure accuracy, lineage and timeliness across data pipelines and large language model (LLM) outputs
Apply and enforce governance, privacy and compliance standards (GDPR, PCI DSS, CCPA), ensuring data security and traceability
Partner with cross functional teams to translate business needs into technical data solutions that scale across domains
Drive performance tuning, automation and adoption of AI-powered data tools to enhance data platform efficiency
Mentor data engineers and champion best practices for maintainable, governed and reusable data assets
Own cost and performance tradeoffs for domain data products and monitor compute usage, storage growth and unit cost to implement optimizations that reduce spend while meeting SLAs
Additional tasks may be assigned

What Skills You Have

Required

4+ years designing, building and optimizing data pipelines and models in production, ideally within large-scale cloud environments
Proficiency in SQL and Python (or Scala) for data development, testing and automation

Preferred

Bachelor's or Master's degree in Computer Science, Information Systems, Data Engineering or a related field
Experience with Apache Spark (or equivalent) for large-scale data processing and performance optimization
Experience using Airflow/Cloud Composer/Dagster for orchestration, transformation and CI/CD pipelines
Experience with cloud warehouses/lakes (BigQuery, Redshift, Snowflake) and object storage
Experience designing and optimizing streaming pipelines using Kafka, Pub/Sub, spark
Strong understanding of dimensional modeling, normalization and schema design for analytics and GenAI integration into data products
Experience with data testing, lineage, monitoring and observability frameworks to ensure data integrity and reliability

About the Company

Kohl's Corp

At Kohl's, our mission is to inspire and empower families to lead fulfilled lives. And there's no more rewarding job than that. Because it's not just about selling things. It's about letting customers know that the things that make their lives better are within their reach. We build great brands, launch new technologies to make shopping easier, contribute our time and dollars to improve the world we live in and dream up ways to empower our customers and Associates to create a life they love. Our Associates make a difference in the lives of our customers. Let us make a difference in yours. Welcome to Kohl's.

COMPANY SIZE

10,000 employees or more

INDUSTRY

Retail

FOUNDED

1962

WEBSITE

http://www.kohls.com