AI Scientist

  • Paris
  • Permanent contract (CDI)
  • Start date: 21 October 2024

About

We are a passionate team leading the way in AI innovation, committed to driving the rapid adoption of transformative AI applications. We focus on developing the technical tools that allow any company to build AI applications that natively interact with its structured databases (tabular or graph). Specifically, we develop a modern AI embedding platform that converts any structured database into a vector store, which can then be combined with classic machine learning models for classification, regression or clustering.

As an early-stage AI-driven startup backed by significant funding (>3m), we base our approach on state-of-the-art academic research to drive practical business solutions. We value clear communication and simplicity in our approaches, promoting a constant optimization mindset.

Join Neuralk to be part of a growing team, eager to learn and adapt, united by the belief that our technology can make a significant positive impact and contribute to transforming the AI industry.

Co-founders: Alexandre Pasquiou (CSO) & Antoine Moissenot (CEO).

Neuralk is dedicated to equal opportunity employment and fosters an environment that is open and respectful of diversity. We encourage you to apply even if you don’t meet every requirement. If you are passionate about our mission, learn quickly and believe you can contribute, we want to hear from you.

Job description

About the job offer

Neuralk-AI is looking for an AI Scientist with experience in AI model design and training.

As an AI Scientist, your role will be to improve our in-house AI embedding models for structured data representation and translate them into actionable insights for strategic applications, which you will develop with our industrial and academic partners. You will collaborate closely with our engineering team (~5 people) to enhance the performance, scalability and impact of our AI-driven solutions, while also engaging with clients to address their needs and deliver easily adaptable pre-trained embedding models.

You will report to the CSO of Neuralk and will be located in our Paris offices.

Role & responsibilities

By contributing to the core of our embedding platform, this position directly supports the company’s mission of making AI accessible and useful. You will be responsible for:

  • Algorithms: Contribute to the development of machine learning models for structured data representation in collaboration with dedicated teams.

  • Evaluation: Continuously evaluate and optimize the performance of our ML models by building tailored metrics that reflect our clients’ use cases, drawing on insights from our industrial and academic partners.

  • Active learning and training data optimisation: Participate in the active learning strategy and its implementation to improve sample selection and future model performance. Design and consolidate training and evaluation datasets to optimise the representational and transfer learning abilities of our embedding models.

  • Research: Stay current with the latest ML advancements and suggest optimisations that may improve the embedding models’ performance and capabilities.

  • Pitching & communication: Present ML research concepts to the scientific community and experimental design needs to the ML team.

  • Collaboration: Work closely with ML engineers, data scientists, and clients to deliver promising representation algorithms for downstream applications.

  • Ad-hoc analyses: Run analyses to understand the learning mechanisms of the foundation model, and work on the decodability of the embedding space.

Candidate profile

Profile

  • PhD or M.S. in Computer Science, Machine Learning or a closely related field, with a focus on deep learning.

  • 3+ years of experience in machine learning and software engineering positions that involved training, fine-tuning and evaluating deep learning models (GNNs, Transformers) in the cloud.

  • Strong experience in machine learning, in particular training ML models and designing new learning paradigms.

  • Excellent communication skills in English.

  • Proven ability to work with interdisciplinary teams.

  • Thrives in a fast-paced, evolving startup environment.

  • Self-starter and autonomous.

  • Strong analytical skills and problem-solving ability.

  • Appetite to explore, implement new ideas and innovate.

Expertise

  • Machine Learning: Deep understanding of ML theories and practices, especially related to reproducibility and scalability.

  • Embedding Models: Experience in designing, training and evaluating embedding models.

  • Programming: Proficient in Python and AI frameworks and tools (e.g., scikit-learn, PyTorch, JAX), with experience in software development best practices and version control systems such as Git.

  • Data management: Familiarity with data formats and database systems (Parquet, SQL and NoSQL) to manage and process large datasets efficiently.

  • AI platforms: Experience with deploying and managing machine learning models, including familiarity with PyTorch and containerization technologies (e.g., Docker, Kubernetes).

Bonuses

  • You have a publication record in top-tier ML conferences or journals.

  • You have demonstrated experience in designing and running large-scale ML experiments.

  • You have demonstrated machine learning experience through open-source activity or data science competitions.

  • You have a track record of translating research into business impact.

  • You have experience in developing and debugging in C/C++ and Python.

Recruitment process

  • 45-minute interview with the CSO

  • 45-minute interview with the CEO

  • 2-hour technical exam

  • Informal meeting with the whole team

Additional information

  • Contract type: Permanent (CDI)
  • Start date: 21 October 2024
  • Location: Paris
  • Education level: above Master's (> Bac+5) / PhD
  • Occasional remote work allowed