Data Engineer

About

Epigene Labs ensures cancer drug hunters pilot R&D programs thanks to rich and diverse human genomic data. The company’s technology platform combines artificial intelligence with domain expertise for the aggregation and analysis of multi-dimensional genomic data.

The team initially focuses on the identification of immunotherapy drug targets and biomarkers based on gene expression datasets from multiple public and private databases. Epigene Labs thus aspires to significantly accelerate drug discovery for precision medicine in oncology.

The company is supported by Station F, Agoranov, Cancer Campus, AstraZeneca, and the Harvard Innovation Labs.

Job Description

Epigene Labs is seeking a strongly motivated Data Engineer eager to leverage their skills in our fight against cancer. As a member of the data team, you will collaborate closely with data scientists, software engineers and other key stakeholders to build and maintain robust, scalable and efficient data pipelines that bridge AI/ML algorithms and computational biology.

The responsibilities of this position encompass the following:

  • Design, implement and maintain clinical and gene expression data pipelines and associated infrastructure

  • Identify and resolve bottlenecks and performance issues in data pipelines/infrastructure

  • Support the full lifecycle of machine learning models, including training, evaluation, deployment and monitoring

  • Enhance operational efficiency through automation, CI/CD, monitoring, and troubleshooting of production processes

Preferred Experience

Job skills

  • Proficiency in Python programming

  • Experience with cloud infrastructure and services (Azure preferred); knowledge of infrastructure as code (e.g. terraform) is a bonus

  • Experience with ETL/ELT processes (e.g. Azure Data Factory, Talend, Airflow)

  • Familiarity with AI/ML pipelines and MLOps (e.g. mlflow)

  • Experience with CI/CD processes (e.g. GitHub actions, Azure DevOps, Jenkins)

  • Knowledge of containerization (e.g. Docker)

  • Proficiency in SQL databases

  • Excellent communication and interpersonal skills, with the ability to work effectively in cross-functional teams

Education and experience

  • 3+ years of experience as a Data Engineer

  • Master’s Degree in Computer Science, Data Management or Engineering

  • Working knowledge of healthcare, biopharma and/or oncology is a plus

For this position, the candidate should be based in France.

Recruitment Process

  • Short fit call

  • Technical challenge : A real world challenge to see how you would collaborate with the team

  • Meet the leadership team

  • Discussion with the CEO

Additional Information

  • Contract Type: Full-Time
  • Location: Paris
  • Experience: > 3 years
  • Possible full remote