Image

AI@GetYourGuide

Prague, Tchéquie
2 k abonnés + de 500 relations

Image Image Image

Devenir membre pour voir le profil

À propos

Co-founder of the Gephi open-source project and currently Principal AI Engineer at GetYourGuide, in Berlin. Previously, I built and led ML/AI teams as Director of Engineering (8+ years). Before that, I led data science and engineering teams at LinkedIn, focusing on building data products.

At LinkedIn, I led a team of engineers and data scientists focusing on building end-to-end products fueled by data and machine learning. Our focus was on building relevant and personalized features, leveraging LinkedIn's massive dataset. As an early member of the data science team I also worked on a variety of prominent features such as LinkedIn Skills, Endorsements, InMaps, Reputation and Connected.

Before joining LinkedIn, I co-founded the Gephi open-source project and has been its technical leader since 2007. Gephi is the leading large graph visualization platform and is recognized for its performance, usability and extensible design. The software has been downloaded more than 2M times since its inception and has contributed to thousands of publications, research and articles on graph analytics and network science.

I have a strong background in CS and graduated with a M.Sc. in Computer Science from the University of Technology of Compiègne in France. My specialties include data science and engineering, social network analysis, data mining, machine learning, information retrieval and overall solving hard problems with data and algorithms.

Activité

2 k abonnés

See all activities

Expérience

  • GetYourGuide

    GetYourGuide

    10 ans 3 mois

    • Graphique GetYourGuide

      Principal AI Engineer

      GetYourGuide

      - aujourd’hui 2 ans 2 mois

      Berlin, Germany

      I work on the ambitious AI products:
      * Developed and shipped LLM-based agents to curate our inventory
      * First to build AI agents in Customer Care, automating repetitive workflows
      * First to experiment with conversational interfaces (ChatGPT Plugin, now Apps) , building AI agents to search and filter GetYourGuide's vast catalog

      At the same time, I've been championing company-wide AI adoption:
      * Spearheaded AI adoption programs, bringing daily AI usage company-wide to 66% by…

      I work on the ambitious AI products:
      * Developed and shipped LLM-based agents to curate our inventory
      * First to build AI agents in Customer Care, automating repetitive workflows
      * First to experiment with conversational interfaces (ChatGPT Plugin, now Apps) , building AI agents to search and filter GetYourGuide's vast catalog

      At the same time, I've been championing company-wide AI adoption:
      * Spearheaded AI adoption programs, bringing daily AI usage company-wide to 66% by the end of 2025
      * Founding member of the AI Platform team, focusing on LLMOps
      * Advise leadership on GenAI trends and strategy, and presenting at events (internal conference, all-hands...)

    • Graphique GetYourGuide

      Director of Data Products, ML/AI

      GetYourGuide

      - 8 ans 2 mois

      Berlin Area, Germany

      Built and managed a world-class ML/AI team over a span of 8 years. We had touchpoints in all parts of the business, and an amazing team reputation & culture (~25+ people).

      * Delivered large business impact via shipping new data products, or improving existing ML algorithms via A/B experiments
      * Growing the Data Products team from scratch to 25+ people, covering all domains (Marketplace, Growth, Supply)
      * Established a Machine Learning Platform strategy and team, supporting the…

      Built and managed a world-class ML/AI team over a span of 8 years. We had touchpoints in all parts of the business, and an amazing team reputation & culture (~25+ people).

      * Delivered large business impact via shipping new data products, or improving existing ML algorithms via A/B experiments
      * Growing the Data Products team from scratch to 25+ people, covering all domains (Marketplace, Growth, Supply)
      * Established a Machine Learning Platform strategy and team, supporting the adoption of strong MLOps best practices
      * Member of the Engineering Leadership group, in charge of various department development activities
      * Regularly public speaking at internal and external (meetups, conferences) events
      * Hired my successor, while transitioning to a Principal IC role

    • Graphique GetYourGuide

      Director of Data Science & Analytics

      GetYourGuide

      - 3 ans 2 mois

      Berlin, Germany

      Built GetYourGuide's data team almost from scratch to 25+ people with three pillars: Analytics, Data Platform and Data Products. I'm proud that most of the foundations I've set back in 2016-2017 are still there 7+ years later and flourishing.

      Products and projects we had an impact:
      * Established Data Analytics and Data Engineering teams from scratch and grew them to 20+ people
      * Set foundations in data platform and data infrastructure, essentially introducing data lakes and data…

      Built GetYourGuide's data team almost from scratch to 25+ people with three pillars: Analytics, Data Platform and Data Products. I'm proud that most of the foundations I've set back in 2016-2017 are still there 7+ years later and flourishing.

      Products and projects we had an impact:
      * Established Data Analytics and Data Engineering teams from scratch and grew them to 20+ people
      * Set foundations in data platform and data infrastructure, essentially introducing data lakes and data engineering
      * Grew the Data Science function and transformed it into Data products with end-to-end ownership and software engineering best practices
      * Managed tens of millions of performance marketing budget via automated bidding algorithms on Adwords
      * Introduced ML algorithms on domains such as search ranking, bidding, driving large impact via dozens of A/B experiments

    • Graphique GetYourGuide

      VP Engineering (Interim)

      GetYourGuide

      - 1 an 5 mois

      Berlin Area, Germany

      Stepped up to manage all of Engineering for 15 months (80+ people) and member of the Executive team. Again, foundations set back then are still felt 5 years later.

      * Established first career framework for the department, and introduced the Engineering Management function
      * Complete overhaul of hiring processes to support our ambitious hiring targets (while keeping the bar very high)
      * Hired 3 Director of Engineering (Infrastructure, Marketplace, Analytics) of exceptional…

      Stepped up to manage all of Engineering for 15 months (80+ people) and member of the Executive team. Again, foundations set back then are still felt 5 years later.

      * Established first career framework for the department, and introduced the Engineering Management function
      * Complete overhaul of hiring processes to support our ambitious hiring targets (while keeping the bar very high)
      * Hired 3 Director of Engineering (Infrastructure, Marketplace, Analytics) of exceptional capability
      * Collaborated with founders and other executives on strategy setting
      * Steered the culture towards being proud of our craft away from being purely speed-oriented. Established the engineering blog and lead by example.

  • LinkedIn

    LinkedIn

    4 ans 6 mois

    • Graphique LinkedIn

      Engineering Manager, Data Products

      LinkedIn

      - 1 an 8 mois

      San Francisco Bay Area

      I led a team of talented engineers and data scientists focusing on building the next data-driven products at LinkedIn. We built data products end-to-end tackling prototyping and exploration, API and schema design, machine-learning models, shipping production code, instrumentation, experimentation and maintenance.

      Products and projects we had an impact:
      * Developed first machine-learnt model to rank props (e.g. job change, job anniversary, mentioned in the news etc.) with double-digit…

      I led a team of talented engineers and data scientists focusing on building the next data-driven products at LinkedIn. We built data products end-to-end tackling prototyping and exploration, API and schema design, machine-learning models, shipping production code, instrumentation, experimentation and maintenance.

      Products and projects we had an impact:
      * Developed first machine-learnt model to rank props (e.g. job change, job anniversary, mentioned in the news etc.) with double-digit impact on engagement. Implemented automated training and metrics pipeline.
      * Designed and implemented an augmented pipeline to translate tens of thousands of skill entities in record time.
      * Developed algorithms and tools to update and grow the skills taxonomy. This was the first fully automated standardization data pipeline at LinkedIn.
      * Developed the large machine-learning pipeline behind reputation algorithms, a key search relevance driver.

    • Graphique LinkedIn

      Staff Data Scientist

      LinkedIn

      - 2 ans 11 mois

      San Francisco Bay Area

      I specialized in building large data pipelines and relevancy algorithms to fuel LinkedIn data products.

      Products and projects I had an impact:
      * Leads recommendations: Designed and implemented a recommender system to help discover leads from a member's profile. The algorithm uses large graph analysis, clustering and machine learning to identify the best decision makers and influencers in the company.
      * Endorsements: Skills & Endorsements, one of the fastest growing product in…

      I specialized in building large data pipelines and relevancy algorithms to fuel LinkedIn data products.

      Products and projects I had an impact:
      * Leads recommendations: Designed and implemented a recommender system to help discover leads from a member's profile. The algorithm uses large graph analysis, clustering and machine learning to identify the best decision makers and influencers in the company.
      * Endorsements: Skills & Endorsements, one of the fastest growing product in LinkedIn's history is driven by the skills inference algorithm I developed. For each member, it ranks a set of skills from our dictionary based on the likelihood that the member has this skill.
      * Suggested Skills: Personalized recommender engine for skills. I used machine learning, Hadoop and Pig to recommend skills to members based on their profile data and network. The algorithm is widely used on the site to help members complete their profile and had a double-digit impact on the number of skills added. I also developed a set of tools and metrics to simulate models and track their performance.
      * Skills taxonomy: Skills & Expertise is a set of tens of thousands of entities mined from LinkedIn profiles. I worked on the large Hadoop pipeline and focused on improving the quality of the dictionary through the creation of tools and algorithms.
      * InMaps: Responsible for the InMaps visualization framework, server and client. Worked on both the back-end solution to generate millions of InMaps and the front-end user experience. I developed a novel Javascript client which scaled to hundreds of thousands of elements by mixing a SVG layer with batch-rendered images to display large networks on the browser.

  • Graphique Gephi

    Founder & Tech Lead

    Gephi

    - 3 ans 3 mois

    Gephi is the leading large graph visualization open-source platform. In total, it has been downloaded more than 4M times. I co-founded the project and have been on and off maintaining it ever since. I still actively contribute to it, in my free time.

    * Main developer. Maintain a 150K lines codebase with 100+ modules.
    * Technical leader for Google Summer of Code students – 18 students in total – code review and daily basis communication
    * Write new specifications and drive…

    Gephi is the leading large graph visualization open-source platform. In total, it has been downloaded more than 4M times. I co-founded the project and have been on and off maintaining it ever since. I still actively contribute to it, in my free time.

    * Main developer. Maintain a 150K lines codebase with 100+ modules.
    * Technical leader for Google Summer of Code students – 18 students in total – code review and daily basis communication
    * Write new specifications and drive discussions with the community and researchers
    * Developed a modular and extensible software architecture based on Netbeans RCP and designed a dozen APIs
    * Developed main components, including the OpenGL graph drawing engine
    * Wrote technical documentation for plugin and core developers
    * Built continuous integration environment
    * Created the first prototype in 2008

    Responsible for communication matters, project presentation, writing publications and attending conferences as well.

  • Graphique INIST

    Software engineer

    INIST

    - 1 an

    Paris Area, France

    Designed and developed a Visual Analytics prototype for large academic data. Used NoSQL databases to build real-time query service in large networks. Developed esthetic rich client application for the front-end.

    Java, Neo4j, MongoDB, Processing

  • Graphique Google

    Summer of Code Mentor

    Google

    - 4 mois

    Gephi was a mentoring organization for the Google Summer of Code and hosted several students in summer in the years 2009 to 2013. I was responsible for proposal writing and technical mentorship.

    I personally mentored up to two students every year and supervised the entire group for technical details and code sharing strategies. My role was to make them comfortable in the development environment, set priorities and advise them on software design and good practices. I was also involved in…

    Gephi was a mentoring organization for the Google Summer of Code and hosted several students in summer in the years 2009 to 2013. I was responsible for proposal writing and technical mentorship.

    I personally mentored up to two students every year and supervised the entire group for technical details and code sharing strategies. My role was to make them comfortable in the development environment, set priorities and advise them on software design and good practices. I was also involved in the evaluation process and conducted interviews.

  • Graphique Linkfluence

    Software Engineer

    Linkfluence

    - 6 mois

    Paris Area, France

    Developed network visualization and analysis features for the R&D team. Designed and led implementation of a new modular architecture for Gephi platform and consolidated webcrawler data pipeline by providing a dedicated network layout server. I acquired solid knowledge of user experience design and agile development.

    My job also involved project management and handling communication and promotion of the Gephi open-source project. It was officially presented at the at ICWSM'09 conference…

    Developed network visualization and analysis features for the R&D team. Designed and led implementation of a new modular architecture for Gephi platform and consolidated webcrawler data pipeline by providing a dedicated network layout server. I acquired solid knowledge of user experience design and agile development.

    My job also involved project management and handling communication and promotion of the Gephi open-source project. It was officially presented at the at ICWSM'09 conference in San Jose, CA.

    Java, Netbeans, Swing, API Design

  • Graphique Fondation Maison des sciences de l'homme (FMSH)

    Intern

    Fondation Maison des sciences de l'homme (FMSH)

    - 6 mois

    Paris Area, France

    Took part in the WebAtlas initiative, a research organization interested in Network Science, Data Visualization and Social Network Analysis. My core of work was to rebuild a graph visualization software for large semantic networks extracted from web-crawls. I also drove additional projects, including N-GRAM based language detection, Firefox add-on development and hacking. Working in a Sociology laboratory has given me a closer look at scientific applications and user-centered requirements…

    Took part in the WebAtlas initiative, a research organization interested in Network Science, Data Visualization and Social Network Analysis. My core of work was to rebuild a graph visualization software for large semantic networks extracted from web-crawls. I also drove additional projects, including N-GRAM based language detection, Firefox add-on development and hacking. Working in a Sociology laboratory has given me a closer look at scientific applications and user-centered requirements gathering.

    Java, OpenGL, NLP, XML, Javascript, XUL

Formation

Expériences de bénévolat

  • Image

    Support / IT

    TOIT Nepal

    - 10 mois

    Formation

    TOIT Nepal is a Nepali NGO building a school in Bhaktapur for families who can't financially support their children. I provided support to find sponsors and build their website. As part of a month trip to Nepal I spent some time with the children at school and provided IT support.

Publications

  • LinkedIn Skills: Large-Scale Topic Extraction and Inference

    RecSys Proceedings

    "Skills and Expertise" is a data-driven feature on LinkedIn, the world's largest professional online social network, which allows members to tag themselves with topics representing their areas of expertise. In this work, we present our experiences developing this large-scale topic extraction pipeline, which includes constructing a folksonomy of skills and expertise and implementing an inference and recommender system for skills. We also discuss a consequent set of applications, such as…

    "Skills and Expertise" is a data-driven feature on LinkedIn, the world's largest professional online social network, which allows members to tag themselves with topics representing their areas of expertise. In this work, we present our experiences developing this large-scale topic extraction pipeline, which includes constructing a folksonomy of skills and expertise and implementing an inference and recommender system for skills. We also discuss a consequent set of applications, such as Endorsements, which allows members to tag themselves with topics representing their areas of expertise and for their connections to provide social proof, via an "endorse" action, of that member's competence in that topic.

    Autres auteurs
    Voir la publication Image
  • ForceAtlas2, a Continuous Graph Layout Algorithm for Handy Network Visualization Designed for the Gephi Software

    PloS one

    Gephi is a network visualization software used in various disciplines (social network analysis, biology, genomics…). One of its key features is the ability to display the spatialization process, aiming at transforming the network into a map, and ForceAtlas2 is its default layout algorithm. The latter is developed by the Gephi team as an all-around solution to Gephi users’ typical networks (scale-free, 10 to 10,000 nodes). We present here for the first time its functioning and settings…

    Gephi is a network visualization software used in various disciplines (social network analysis, biology, genomics…). One of its key features is the ability to display the spatialization process, aiming at transforming the network into a map, and ForceAtlas2 is its default layout algorithm. The latter is developed by the Gephi team as an all-around solution to Gephi users’ typical networks (scale-free, 10 to 10,000 nodes). We present here for the first time its functioning and settings. ForceAtlas2 is a force-directed layout close to other algorithms used for network spatialization. We do not claim a theoretical advance but an attempt to integrate different techniques such as the Barnes Hut simulation, degree-dependent repulsive force, and local and global adaptive temperatures. It is designed for the Gephi user experience (it is a continuous algorithm), and we explain which constraints it implies. The algorithm benefits from much feedback and is developed in order to provide many possibilities through its settings. We lay out its complete functioning for the users who need a precise understanding of its behaviour, from the formulas to graphic illustration of the result. We propose a benchmark for our compromise between performance and quality. We also explain why we integrated its various features and discuss our design choices.

    Autres auteurs
    Voir la publication Image
  • Using Computer Games Techniques for Improving Graph Visualization Efficiency

    EuroVis Proceedings

    Creating an efficient, interactive and flexible unified graph visualization system is a difficult problem. We present a hardware accelerated OpenGL graph drawing engine, in conjunction with a flexible preview package. While the interactive OpenGL visualization focuses on performance, the preview focuses on aesthetics and simple network map creation. The system is implemented as Gephi, a modular and extensible open-source Java application built on top of the Netbeans Platform, currently in alpha…

    Creating an efficient, interactive and flexible unified graph visualization system is a difficult problem. We present a hardware accelerated OpenGL graph drawing engine, in conjunction with a flexible preview package. While the interactive OpenGL visualization focuses on performance, the preview focuses on aesthetics and simple network map creation. The system is implemented as Gephi, a modular and extensible open-source Java application built on top of the Netbeans Platform, currently in alpha version 0.7.

    Autres auteurs
    Voir la publication Image
  • Gephi: an open source software for exploring and manipulating networks

    ICWSM Proceedings

    Gephi is an open source software for graph and network analysis. It uses a 3D render engine to display large networks in real-time and to speed up the exploration. A flexible and multi-task architecture brings new possibilities to work with complex data sets and produce valuable visual results. We present several key features of Gephi in the context of interactive exploration and interpretation of networks. It provides easy and broad access to network data and allows for spatializing…

    Gephi is an open source software for graph and network analysis. It uses a 3D render engine to display large networks in real-time and to speed up the exploration. A flexible and multi-task architecture brings new possibilities to work with complex data sets and produce valuable visual results. We present several key features of Gephi in the context of interactive exploration and interpretation of networks. It provides easy and broad access to network data and allows for spatializing, filtering, navigating, manipulating and clustering. Finally, by presenting dynamic features of Gephi, we highlight key aspects of dynamic network visualization.

    Autres auteurs
    Voir la publication Image

Brevets

  • Network insights

    Émis le US 14/700,825

    Autres inventeurs
  • Profile personalization based on viewer of profile

    Émis le US 14/674,755

    Autres inventeurs
  • Standardizing attributes and entities in a social networking system

    Émis le US 14/595,378

    Autres inventeurs
  • Quantifying Social Capital

    Émis le US 14/529,068

    Autres inventeurs
  • Estimating reputation scores in reputation systems

    Émis le US 14/216,797

    Autres inventeurs
  • Querying of reputation scores in reputation systems

    Émis le US 14/694,958

    Autres inventeurs
  • Generating rankings of reputation scores in reputation systems

    Émis le US 14/216,821

    Autres inventeurs
  • Methods and systems for recommending decision makers in an organization

    Émis le US 14/674,755

    Autres inventeurs
  • Inferring and suggesting attribute values for a social networking service

    Émis le US 13/629,241

    Autres inventeurs

Projets

  • Gephi

    - aujourd’hui

    Gephi is the leading open-source platform to visualize and explore large networks.

    Autres créateurs
    Voir le projet Image
  • PalDB

    -

    PalDB is an embeddable write-once key-value store written in Java, it was a side-project of mine and was open-sourced in October 2015.

    Voir le projet Image
  • InMaps

    -

    InMaps is an interactive visual representation of your professional universe. It's a great way to understand the relationships between you and your entire set of LinkedIn connections. With it you can better leverage your professional network to help pass along job opportunities, seek professional advice, gather insights, and more.

    Autres créateurs
    Voir le projet Image
  • LinkedIn Skills

    -

    LinkedIn Skills is a data-driven project with large data processing components.

    Autres créateurs
    Voir le projet Image
  • Lead Recommendations

    -

    Lead Recommendations is a recommendation engine to help uncover decision-makers and influencers around a specific sales account. It was uniquely leveraging graph algorithms.

    Autres créateurs
  • DataFu

    -

    DataFu is a collection of user-defined functions for working with large-scale data in Hadoop and Pig. This library was born out of the need for a stable, well-tested library of UDFs for data mining and statistics. It is used at LinkedIn in many of our off-line workflows for data derived products like “People You May Know” and “Skills”

    Autres créateurs
    Voir le projet Image

Prix et distinctions

  • Duke’s Choice Award 2010

    JavaOne

    The Duke’s Choice Awards recognize and honor extreme innovation in the world of Java technology, and are granted to the most innovative uses of the Java platform

Recommandations reçues

Voir le profil complet de Mathieu

  • Découvrir vos relations en commun
  • Être mis en relation
  • Contacter Mathieu directement
Devenir membre pour voir le profil complet

Ajoutez de nouvelles compétences en suivant ces cours