Data Engineer

  • Station F, Paris, France
  • Full-Time
  • Start Date: 08 October 2021


Epigene Labs helps cancer drug hunters steer their R&D programs using rich and diverse human genomic data. The company’s technology platform combines artificial intelligence with domain expertise to aggregate and analyse multi-dimensional genomic data.

The team initially focuses on the identification of immunotherapy drug targets and biomarkers based on gene expression datasets from multiple public and private databases. Epigene Labs thus aspires to significantly accelerate drug discovery for precision medicine in oncology.

The company is supported by Station F, Agoranov, Cancer Campus, AstraZeneca, and the Harvard Innovation Labs.

Job Description

We are seeking a strongly motivated Data Engineer eager to apply their skills in our fight against cancer. This role will report directly to Epigene’s Head of Data. The team is located in Paris at Station F and benefits from the advice of world-class cancer scientists, data scientists, and technologists, as well as seasoned tech and biotech entrepreneurs and investors, including Daphni.

Key Responsibilities

Epigene Labs' team of computational biologists and data scientists has created pipelines that process clinical and genomic data. These data come from a broad range of sources and vary in size, structure, and update frequency.

The main challenge for the Data Engineer is to build an infrastructure that stores these data efficiently. This is required to enable our pipelines to run on big data while limiting the increase in computing time.

Key responsibilities will include:

  • Setting up a cloud-based data storage architecture and adapting existing tools to use it
  • Optimising pipelines developed by computational biologists and data scientists, especially regarding database access and data storage
  • Designing tools to track data and code versions, leveraging relevant open source tools on the market
  • Advising on monitoring tools for production processes and helping to build compelling dashboards to report to business development teams

Your skills

  • Excellent command of at least one programming language (Python is a plus)
  • Good knowledge of cloud-based data warehouse tools (expertise in Azure is a plus, but not required), with opinions grounded in relevant experience: you don't just trust the hype
  • Autonomous, solution-oriented problem solving, with the ability to prioritise between issues
  • Team spirit and eagerness to collaborate with interdisciplinary team members (ML engineers, data scientists, computational biologists, software engineers, DevOps engineers, business developers, product managers, etc.)
  • Proficiency in English: we operate in a multicultural environment and most of our interactions are in English

Preferred Qualifications

  • Prior experience as a Data Engineer (2 years+)
  • Master's Degree in Computer Science, Data Management or Engineering
  • Working knowledge of healthcare, biopharma and/or oncology

Additional Information

  • Contract Type: Full-Time
  • Start Date: 08 October 2021
  • Location: Paris, France (75013)
  • Education Level: Master's Degree
  • Remote Work: Partial remote possible