Lead Data Engineer

ExlService Holdings Inc

CA

Apply

JOB DETAILS

SALARY

$130,000–$170,000 Per Year

SKILLS

Amazon Web Services (AWS), Analysis Skills, Apache, Apache HBase, Apache Hadoop, Apache Hive, Apache Pig, Apache Spark, Artificial Intelligence (AI), Best Practices, Big Data, Business Intelligence, Centers for Disease Control and Prevention (CDC), Cloud Computing, Code Reviews, Communication Skills, Computer Science, Cron Job Scheduling, Customer/Client Research, Data Analysis, Data Management, Data Mart, Data Modeling, Data Quality, Data Warehousing, Database Extract Transform and Load (ETL), Detail Oriented, Dimensional Modeling, Economics, Entrepreneurship, GCP (Good Clinical Practices), Leadership, Mathematics, Mentoring, Microsoft Windows Azure, Performance Engineering, Performance Tuning/Optimization, Power BI, Presentation/Verbal Skills, Problem Solving Skills, Project Execution, Python Programming/Scripting Language, Quality Monitoring, Query Optimization, SQL (Structured Query Language), Scalable System Development, Support Documentation, Tableau, Team Lead/Manager, Technical Leadership, Technical Presentation, Technical/Engineering Design, Time Management, Writing Skills

LOCATION

POSTED

30+ days ago

We are looking for a Lead Data Engineer to build and maintain scalable data pipelines and data platforms that support analytics, business Intelligence, reporting, and AI. In this hands-on role, you will work closely with data architects, business stakeholders and analysts to develop reliable data solutions and ensure high-quality data is available across the organization. For more information on benefits and what we offer please visit us at https://www.exlservice.com/us-careers-and-benefits

Salary range: $130k-$170k plus benefits

Collaborate with client stakeholders to gather requirements, structure solutions, and ensure high‑quality, timely delivery.
Experience working in the Databricks tech stack with strong proficiency in SQL, Python, and PySpark
Design and optimize data models, data marts, and Lakehouse/warehouse layers with strong focus on medallion architecture, query optimization, and performance engineering.
Build, orchestrate, and monitor scalable data pipelines on Databricks, ensuring reliable ingestion, transformation, CDC handling, and incremental load strategies.
Manage end‑to‑end pipeline operations including performance tuning, data quality monitoring, alerting, and issue resolution across production workloads.
Lead a project team of data engineers supporting multiple workstreams and provide technical leadership through code reviews, best‑practice guidance, reusable pattern creation, and mentorship to engineering team members.
Prepare and maintain project documentation to support project execution and delivery.

Work Split

50% Technical - Data modeling, hands-on coding, orchestration, and pipeline monitoring.
50% Management- Client Collaboration, requirements gathering, designing technical solutions, presentations, global team management and mentoring.

Lead Data Engineering (Los Angeles)

Job Functions:

Collaborate with client stakeholders to gather requirements, structure solutions, and ensure high‑quality, timely delivery.
Experience working in the Databricks tech stack with strong proficiency in SQL, Python, and PySpark
Design and optimize data models, data marts, and Lakehouse/warehouse layers with strong focus on medallion architecture, query optimization, and performance engineering.
Build, orchestrate, and monitor scalable data pipelines on Databricks, ensuring reliable ingestion, transformation, CDC handling, and incremental load strategies.
Manage end‑to‑end pipeline operations including performance tuning, data quality monitoring, alerting, and issue resolution across production workloads.
Lead a project team of data engineers supporting multiple workstreams and provide technical leadership through code reviews, best‑practice guidance, reusable pattern creation, and mentorship to engineering team members.
Prepare and maintain project documentation to support project execution and delivery.

Expected work split

50% Technical - Data modeling, hands-on coding, orchestration, and pipeline monitoring.
50% Management- Client Collaboration, requirements gathering, designing technical solutions, presentations, global team management and mentoring.

Qualifications (Required):

8-12 years' experience in data engineering and analytics roles
Bachelor's or Master''s degree in analytics, computer science/engineering, economics, mathematics, or related areas.
Experience building and maintaining ETL/ELT pipelines
Solid understanding of data warehousing concepts and dimensional data modeling
Familiarity with workflow orchestration tools such as Airflow or similar
Experience working with cloud data platforms or modern data infrastructure
Entrepreneurial hands-on approach to work. Demonstrated leadership ability and willingness to take initiative
Superior analytical and problem solving skills
Outstanding written and verbal communication skills
Effective time management and attention to detail
Hands on experience in using SQL, Python and Workflow Schedulers (Apache Airflow, Cron)
Experience in leading team and coordinating with internal / external stakeholders
Experience in using Cloud Platforms (AWS / GCP / Azure)
Experience in using Visualization tools (Tableau / Power BI)
Experience with Big Data Technologies (Hadoop, Hive, Hbase, Pig, Spark, etc.)

Lead Data Engineering (Los Angeles)

Job Functions:

Collaborate with client stakeholders to gather requirements, structure solutions, and ensure high‑quality, timely delivery.
Experience working in the Databricks tech stack with strong proficiency in SQL, Python, and PySpark
Design and optimize data models, data marts, and Lakehouse/warehouse layers with strong focus on medallion architecture, query optimization, and performance engineering.
Build, orchestrate, and monitor scalable data pipelines on Databricks, ensuring reliable ingestion, transformation, CDC handling, and incremental load strategies.
Manage end‑to‑end pipeline operations including performance tuning, data quality monitoring, alerting, and issue resolution across production workloads.
Lead a project team of data engineers supporting multiple workstreams and provide technical leadership through code reviews, best‑practice guidance, reusable pattern creation, and mentorship to engineering team members.
Prepare and maintain project documentation to support project execution and delivery.

Expected work split

50% Technical - Data modeling, hands-on coding, orchestration, and pipeline monitoring.
50% Management- Client Collaboration, requirements gathering, designing technical solutions, presentations, global team management and mentoring.

Qualifications (Required):

8-12 years' experience in data engineering and analytics roles
Bachelor's or Master''s degree in analytics, computer science/engineering, economics, mathematics, or related areas.
Experience building and maintaining ETL/ELT pipelines
Solid understanding of data warehousing concepts and dimensional data modeling
Familiarity with workflow orchestration tools such as Airflow or similar
Experience working with cloud data platforms or modern data infrastructure
Entrepreneurial hands-on approach to work. Demonstrated leadership ability and willingness to take initiative
Superior analytical and problem solving skills
Outstanding written and verbal communication skills
Effective time management and attention to detail
Hands on experience in using SQL, Python and Workflow Schedulers (Apache Airflow, Cron)
Experience in leading team and coordinating with internal / external stakeholders
Experience in using Cloud Platforms (AWS / GCP / Azure)
Experience in using Visualization tools (Tableau / Power BI)
Experience with Big Data Technologies (Hadoop, Hive, Hbase, Pig, Spark, etc.)

Lead Data Engineer

ExlService Holdings Inc

CA

About the Company

ExlService Holdings Inc