Data Engineer - Scientific Applications
inSync Staffing
Indianapolis, IN(remote)
Apply
JOB DETAILS
LOCATION
Indianapolis, IN
POSTED
30+ days ago
Location: Remote
Industry: Pharmaceutical
Job Description:
Theoris Services is assisting our client in their search for a Data Engineer to add to their growing team. Our client is seeking an individual to build and maintain high-performance data pipelines and lakehouse architectures on AWS to integrate, harmonize, and enable fast querying of massive, multi-modal scientific datasets (e.g., compounds, assays, experiments) from 30+ diverse sources, supporting researchers with reliable, scalable data for drug discovery and experimental analysis.
Responsibilities:
- Data Pipeline Development
- Design, build, and optimize data pipelines and ETL processes for scientific data integration across 30+ heterogeneous data sources
- Implement and maintain lakehouse architectures on AWS (S3, Glue, Athena) supporting multibillion-record scientific datasets
- Develop federated query capabilities using Trino and other distributed query engines to enable unified data access
- Build data harmonization solutions to standardize compound, assay, and experimental data across modalities
- Performance & Scalability
- Optimize database performance for PostgreSQL, Iceberg, and other data platforms handling complex analytical workloads
- Implement caching strategies and query optimization techniques to improve response times and user experience
- Monitor and troubleshoot data pipeline performance, addressing bottlenecks proactively
- Design scalable architectures that support growing data volumes and user bases
- Data Quality & Governance
- Implement data validation, quality checks, and monitoring frameworks
- Create and maintain comprehensive data documentation and metadata management
- Ensure compliance with data governance policies and regulatory requirements
Requirements:
- Education & Experience
- Bachelor's degree in Computer Science, Data Engineering, Information Systems, or related technical field
- 3+ years of experience in data engineering, data warehousing, or related roles
- Proven track record of building production-grade data pipelines and platforms
- Technical Skills
- Programming: Strong proficiency in Python and SQL; experience with data manipulation libraries (pandas, PySpark)
- Databases: Deep expertise in relational databases (PostgreSQL, Oracle) and modern data warehouses (Snowflake, Redshift)
- Cloud Platforms: Hands-on experience with AWS services (S3, Glue, Athena, Lambda, RDS)
- Data Processing: Experience with distributed processing frameworks (Spark, Trino, Presto, or similar)
- ETL/ELT: Proficiency with data integration tools and building scalable data pipelines.
- Data Visualization: Experience with visualizaiton tools like spotifire/Power BI
- Version Control: Experience with Git and collaborative development workflows
- Core Competencies
- Strong problem-solving skills with ability to debug complex data issues
- Excellent communication skills to translate technical concepts for non-technical stakeholders
- Ability to work independently and collaboratively in cross-functional teams
- Attention to detail and commitment to data quality and accuracy
Best-In-Class-Benefits:
We are in the people business; treating people right is our ONLY priority. Theoris Services consultants are full-time employees with full benefits, including:
- Robust Health Insurance
- 401(k) plan
- PTO
- Paid holidays
About Theoris:
Our goal is to Fuel Your Career! As a Theoris team member, you join a culture based on people-centered values and an environment that fosters both personal and professional growth. We build long-term relationships with our clients and our consultants. With over 30 years of building strong relationships in the industry, we’re uniquely positioned to make the right connections. This knowledge is used to find the right job placement. Our recruiting teams are experts dedicated to the information technology and engineering staffing space and are highly respected by our client base.
About the Company
i
inSync Staffing
We recognize the VMS program management team is our customer and needs to be serviced with integrity, so we built and continue to improve upon our delivery methods as we strive to provide the highest quality service possible.
inSync Staffing’s management team recognized ten years ago the inevitable changes to the staffing industry being brought about by technology and the growing trend of Fortune 1000 corporations to outsource management of their contingent workforces to meet compliance and cost control goals. Rather than swim upstream against the changes, inSync Staffing has embraced MSP and VMS programs as our customers, not competitors. We asked program managers how they want to be serviced.
The result of their input is that we have structured inSync Staffing as a recruiting and customer service organization, unlike traditional staffing companies who sell directly to the end client. Our delivery model allows us concentrates our resources on how to best supply candidates in a very competitive MSP/VMS program environment.
COMPANY SIZE
50 to 99 employeesINDUSTRY
Staffing/Employment Agencies
FOUNDED
2014
WEBSITE
http://www.insyncstaffing.com/default.html