Data pipeline development ⢠Design, implement, and optimize end-to-end data pipelines for ingesting, processing, and transforming large volumes of structured and unstructured data across client environments ⢠Develop robust ETL and ELT processes to integrate data from diverse sources into modern data ecosystems, including data lakes, data warehouses, and cloud-native platforms ⢠Implement data validation and quality checks to ensure accuracy, consistency, and reliability of data feeds ⢠Evaluate and implement streaming and real-time data processing solutions where applicable ⢠Data modeling and architecture ⢠Design and maintain data models, schemas, and database structures to support analytical, operational, and AI/ML use cases ⢠Optimize data storage and retrieval mechanisms for performance, scalability, and cost efficiency ⢠Evaluate and implement data storage solutions, including relational databases, NoSQL databases, data lakes, data warehouses, and cloud storage services such as AWS S3, Azure Data Lake, Snowflake, and Data bricks ⢠Contribute to the development of modern data architectures aligned with client business objectives and technical requirements ⢠Data integration and API development ⢠Build and maintain integrations with internal and external data sources, APIs, and enterprise systems ⢠Implement RESTful APIs and web services for data access and consumption ⢠Ensure compatibility and interoperability across systems and platforms ⢠Design secure data exchange mechanisms that adhere to information security and compliance requirements ⢠Data infrastructure management ⢠Configure and manage data infrastructure components, including databases, data warehouses, data lakes, and distributed computing frameworks such as Spark and Data bricks ⢠Monitor system performance, troubleshoot issues, and implement optimizations to enhance reliability and efficiency ⢠Implement data security controls and access management policies to protect sensitive client information ⢠Leverage cloud platforms and services, including AWS, Azure, and GCP, to deploy and manage scalable data solutions. ⢠Proficiency in programming languages commonly used in data engineering, including Python and SQL, with Java or Scala a plus ⢠Strong knowledge of database systems, data modeling techniques, and advanced SQL ⢠Hands-on experience with ETL and ELT tools, such as Data bricks, Azure Data Factory, AWS Glue, Informatica, Cloud Talend, dbt, or Airflow ⢠Experience with big data technologies and frameworks, including Spark, Hadoop, Kafka, and Kinesis ⢠Proven experience with cloud platforms and services, such as Snowflake, AWS, and Azure, with Google Cloud Platform familiarity a plus ⢠Strong understanding of data warehousing concepts, dimensional modeling, and modern data architecture patterns ⢠Familiarity with GenAI applications in data engineering, agentic frameworks, and emerging technologies.