Médéric HURIER

Freelancer: AI/ML/MLOps Engineer | Data Scientist | MLOps Community Organizer | OpenClassrooms Mentor | Hacker | PhD

Luxembourg, Luxembourg, Luxembourg
4K followers, 500+ connections

About

Note: I'm not available to work on new missions until the 1st of September 2023. Thank you for your understanding.

When I worked as a teacher, I told my students that Artificial Intelligence and Machine Learning are the most effective levers to make a difference. Every day, new AI and ML solutions are released to empower companies and individuals alike. The question is: Is your business ready to provide the best AI/ML products for your customers?

I'm a professional Machine Learning Engineer, Data Scientist, and MLOps Engineer ready to assist you in this quest. I've completed a Ph.D. in Machine Learning and several high-end AI/ML certifications to help you build leading data-driven services. My past experience includes working with companies like Google, BNP Paribas, ArcelorMittal, the European Commission, and Decathlon to frame their needs, create state-of-the-art models, and deliver AI/ML artifacts at scale.

I now work as a freelancer in Luxembourg, and I can carry out missions remotely in other European countries. You can get in touch with me on LinkedIn or at contact@fmind.dev. I'll be happy to collaborate with you or discuss your favorite AI/ML topics in the MLOps Community.

Experience

  • Lead MLOps Engineer
    Decathlon Technology
    Present (1 year 2 months)
    Luxembourg

    Continue the design and implementation of Decathlon's MLOps platform with Databricks and AWS.

  • Mentor for aspiring Data Scientist and AI/ML Engineer
    OpenClassrooms
    Present (8 months)
    France

    Tutoring adult students to become data scientists specializing in machine learning.
    - https://openclassrooms.com/fr/paths/793-data-scientist
    - https://openclassrooms.com/fr/paths/794-machine-learning-engineer
    - https://openclassrooms.com/fr/paths/795-ai-engineer

  • Senior Data Scientist & Project Manager for the European Commission
    Cronos Europa
    10 months
    Luxembourg

    Mission: Enhance the ARACHNE risk scoring tool (fraud detection).

    Main tasks and responsibilities:
    - Develop a new version of Arachne using data mining techniques
    - Manage the development of the Arachne PoC/Project (Scrum)
    - Assist data scientists in their projects (Virtual Assistant, NLP, …)

    Technical stack:
    - Data Science: Python, PostgreSQL, SQLAlchemy, Hugging Face, Haystack
    - Management/Environment: Jira, Confluence, MS Office, AWS, Azure

  • SFEIR
    2 years

    • Project Manager & Machine Learning Engineer for Decathlon
      SFEIR
      1 year 1 month
      Luxembourg

      Mission: Design and implement the next ML/MLOps platform on AWS and GCP.

      Main tasks and responsibilities:
      - Design the functional & technical architecture of the platform
      - Manage the MLOps@Decathlon initiative (tasks, planning)
      - Select the vendor solutions based on a user needs analysis
      - Communicate progress and successes to stakeholders
      - Assist data scientists in their projects (audience, forecast)

      Technical stack:
      - Data Science: Python, TensorFlow, Spark, sklearn, Jupyter, Airflow
      - Management: Google Workspace, Jira, UML, Terraform, Jenkins
      - Environments: AWS (SageMaker), GCP (Vertex AI), Databricks

    • Data Scientist for SFEIR
      SFEIR
      2 months
      Luxembourg, Luxembourg

      Mission: Improve the visibility and assets of SFEIR's Data Team.

      Main tasks and responsibilities:
      - Design and create technical interviews for recruiting data scientists.
      - Become a Professional Machine Learning Engineer on Google Cloud.
      - Propose a strategy to improve the online visibility of SFEIR's data team.
      - Share knowledge about data trends with non-technical staff members.
      - Create a group to write tutorials and kata on AI/ML for SFEIR developers.

    • Data Scientist for ArcelorMittal
      SFEIR
      9 months
      Luxembourg

      Mission: Train and optimize machine learning models to recommend steel prices.

      Main tasks and responsibilities:
      - Create and fine-tune machine learning models (tree-based)
      - Evaluate the performance of the models on real datasets
      - Communicate the results to business stakeholders

      Technical stack:
      - Data Science: Python, XGBoost, sklearn, Jupyter, SQL
      - Analytics: Matplotlib, Seaborn, Tableau, Plotly, Dash
      - Environment: MS-SQL, Azure Cloud, Jira, Papermill

  • University of Luxembourg
    4 years 5 months

    • Research And Development Specialist
      University of Luxembourg
      5 months
      Luxembourg, Luxembourg

      Management and development of Natural Language Understanding (NLU) projects for BGL BNP Paribas.

    • Doctoral researcher
      University of Luxembourg
      4 years 5 months
      Luxembourg, Luxembourg

      Research activities focused on Android security and artificial intelligence.

      Teaching big data, machine learning and Android programming to students.

      Collaboration with Google, San Francisco on finding malicious Android artifacts.

  • Mentor for aspiring Data Scientist
    OpenClassrooms
    1 year 5 months
    France

    Tutoring adult students to become data scientists specializing in machine learning.

  • Security engineer specialized in log management and analysis
    Clearstream
    1 year 5 months
    Luxembourg, Luxembourg

    Selection and deployment of a SIEM solution; participation in security incident response.

  • Web developer and administrator
    Freaxmind
    2 years 1 month
    France

    Various contracts ranging from web development to software maintenance and debugging.

  • Web Developer
    Toul'embal
    3 months
    Toul, France

    Extension of a Prestashop e-commerce website and creation of a portfolio website with WordPress.

  • Web Programmer
    Empreinte Studio
    11 months
    Épernay, France

    Creation of modern websites in PHP and MySQL with professional writers and graphic designers.

  • Software Developer
    GEOVARIANCES
    1 year 1 month
    Avon, France

    Development of a geostatistics application in C++ and Qt with experienced software engineers.

  • Web Developer
    CV Champagne Nicolas Feuillatte
    5 months
    Épernay, France

    Integration of customer and share management modules into J.D. Edwards with PHP and Oracle.

Education

  • University of Luxembourg
    Doctor of Philosophy - PhD, Computer Security and Artificial Intelligence, Very Good
    Activities and Societies: Teach Big Data and Android to students.
    Thesis title: Creating better ground truth to further understand Android malware

  • UFR Mathématiques, Informatique, Mécanique et Automatique, Metz (France)
    Master's degree, computer and information systems security

  • UFR Mathématiques et Informatique de l’Université de Lorraine, Nancy (France)
    Bachelor and master years, computer science applied to business informatics

  • IUT Sénart-Fontainebleau, Fontainebleau (France)
    Professional bachelor's degree, computer security and databases

  • IUT Nancy-Charlemagne, Nancy (France)
    Professional bachelor's degree, web development and integration

  • Lycée François 1er, Vitry-le-François (France)
    Technical degree, network and software development

  • Lycée Marc Chagall, Reims (France)
    Baccalauréat général degree, science, specialized in biology

Volunteer Experience

  • MLOps Community Organizer (Luxembourg)
    MLOps Community
    Science and Technology

    Organize regular meetups and events for the MLOps Community.

    Regular partnership with AWS and the University of Luxembourg.

    https://www.meetup.com/luxembourg-mlops-community/

Licenses & Certifications

Publications

  • Is AI/ML Monitoring just Data Engineering? 🤔
    MLOps Community

    While the future of machine learning and MLOps is being debated, practitioners still need to attend to their machine learning models in production. This is no easy task, as ML engineers must constantly assess the quality of the data that enters and exits their pipelines, and ensure that their models generate the correct predictions. To assist ML engineers with this challenge, several AI/ML monitoring solutions have been developed.

    In this article, I will discuss the nature of AI/ML monitoring and how it relates to data engineering. First, I will present the similarities between AI/ML monitoring and data engineering. Second, I will enumerate additional features that AI/ML monitoring solutions can provide. Third, I will briefly touch on the topic of AI/ML observability and its relation to AI/ML monitoring. Finally, I will provide my conclusion about the field of AI/ML monitoring and how it should be considered to ensure the success of your AI/ML project.

  • A great MLOps project should start with a good Python Package 🐍
    MLOps Community

    In this article, I present the implementation of a Python package on GitHub designed to support MLOps initiatives. The goal of this package is to make the coding workflow of data scientists and ML engineers as flexible, robust, and productive as possible. First, I start by motivating the use of Python packages. Then, I provide some tools and tips you can include in your MLOps project. Finally, I explain the follow-up steps required to take this package to the next level and make it work in your environment.

  • Fixing the MLOps Survey on LLMs with ChatGPT API: Lessons Learned
    MLOps Community

    Large Language Models (LLMs) are such an exciting topic. Since the release of ChatGPT, we have seen a surge of innovation ranging from education mentorship to finance advisory. Each week is a new opportunity for addressing new kinds of problems, increasing human productivity, or improving existing solutions. Yet, we may wonder if this is just a new hype cycle or if organizations are truly adopting LLMs at scale …

    In March 2023, the MLOps Community issued a survey about LLMs in production to picture the state of adoption. The survey is full of interesting insights, but there is a catch: 80% of the questions are open-ended, which means respondents answered the survey freely, from a few keywords to full sentences. I volunteered to clean up the answers with the help of ChatGPT and let the community get a grasp of the survey experiences.

    In this article, I present the steps and lessons learned from my journey to shed some light on the MLOps survey on LLMs. I’m first going to present the goal and questions of the survey. Then, I will explain how I used ChatGPT to review the data and standardize the content. Finally, I’m going to evaluate the performance of ChatGPT compared to a manual review.

  • Kubeflow: The Machine Learning Toolkit for Kubernetes
    MLOps Community in Luxembourg

  • MLflow: An open source platform for the machine learning lifecycle
    MLOps Community in Luxembourg

  • We need POSIX for MLOps
    MLOps Community

    If you work on MLOps, you must navigate an ever-growing landscape of tools and solutions. This is both an intense source of stimulation and fatigue for MLOps practitioners.

    Vendors and users face the same problem: How can we combine all these tools without the combinatorial complexity of creating custom integrations?

    In this article, I propose a solution analogous to POSIX to address this challenge. First, I motivate the creation of common protocols and schemas for combining MLOps tools. Second, I present a high-level architecture to support implementation. Third, I conclude with the benefits and limitations of standardizing MLOps.

  • How to install Kubeflow Pipelines v2 on Apple Silicon
    Medium

    Kubeflow Pipelines (KFP) is a powerful platform for building machine learning pipelines at scale with Kubernetes. The platform is well supported on major cloud platforms such as GCP (Vertex AI Pipelines) or AWS (Kubeflow on AWS). However, installing KFP on Apple Silicon (macOS 12.5.1 with Apple M1 Pro) proved to be more challenging than I imagined. Thus, I wanted to share my experience and tips to install KFP as easily as possible on your shiny Mac.

    In this article, I present 4 steps to install Kubeflow on Apple Silicon, using Rancher Desktop for setting up Docker/Kubernetes. In the end, I list the problems I encountered during the installation of Kubeflow Pipelines.

  • The Programming Trade-Off: Purpose, Productivity, Performance
    Medium

    As programmers, we are continuously looking for languages that are performant, productive, and general purpose. Is there any programming language that currently satisfies these properties? Can we ever create one?

    In this article, I present a fundamental trade-off that affects the design of programming languages and the success of software projects.

  • Creating better ground truth to further understand Android malware: A large scale mining approach based on antivirus labels and malicious artifacts
    University of Luxembourg

    Mobile applications are essential for interacting with technology and other people. With more than 2 billion devices deployed all over the world, Android offers a thriving ecosystem by making accessible the work of thousands of developers on digital marketplaces such as Google Play. Nevertheless, the success of Android also exposes millions of users to malware authors who seek to siphon private information and hijack mobile devices for their benefits.

    To fight against the proliferation of Android malware, the security community embraced machine learning, a branch of artificial intelligence that powers a new generation of detection systems. Machine learning algorithms, however, require a substantial number of qualified samples to learn the classification rules enforced by security experts. Unfortunately, malware ground truths are notoriously hard to construct due to the inherent complexity of Android applications and the global lack of public information about malware. In a context where both information and human resources are limited, the security community is in demand for new approaches to aid practitioners to accurately define Android malware, automate classification decisions, and improve the comprehension of Android malware.

    This dissertation proposes three solutions to assist with the creation of malware ground truths.

  • Euphony: Harmonious Unification of Cacophonous Anti-Virus Vendor Labels for Android Malware
    MSR 2017

    Android malware is now pervasive and evolving rapidly. Thousands of malware samples are discovered every day with new models of attacks. The growth of these threats has come hand in hand with the proliferation of collective repositories sharing the latest specimens. Having access to a large number of samples opens new research directions aiming at efficiently vetting apps. However, automatically inferring a reference ground-truth from those repositories is not straightforward and can inadvertently lead to unforeseen misconceptions. On the one hand, samples are often mislabeled as different parties use distinct naming schemes for the same sample. On the other hand, samples are frequently misclassified due to conceptual errors made during labeling processes.

    In this paper, we analyze the associations between all labels given by different vendors and we propose a system called EUPHONY to systematically unify common samples into family groups. The key novelty of our approach is that no prior knowledge of malware families is needed. We evaluate our approach using reference datasets and more than 0.4 million additional samples outside of these datasets. Results show that EUPHONY provides competitive performance against the state-of-the-art.

  • On the Lack of Consensus in Anti-Virus Decisions: Metrics and Insights on Building Ground Truths of Android Malware
    DIMVA 2016

    There is generally a lack of consensus in Antivirus (AV) engines' decisions on a given sample. This challenges the building of authoritative ground-truth datasets. Instead, researchers and practitioners may rely on unvalidated approaches to build their ground truth, e.g., by considering decisions from a selected set of Antivirus vendors or by setting up a threshold number of positive detections before classifying a sample. Both approaches are biased as they implicitly either decide on ranking AV products, or they consider that all AV decisions have equal weights. In this paper, we extensively investigate the lack of agreement among AV engines.

    To that end, we propose a set of metrics that quantitatively describe the different dimensions of this lack of consensus. We show how our metrics can bring important insights by using the detection results of 66 AV products on 2 million Android apps as a case study. Our analysis focuses not only on AV binary decision but also on the notoriously hard problem of labels that AVs associate with suspicious files, and allows to highlight biases hidden in the collection of a malware ground truth, a foundation stone of any machine learning-based malware detection approach.

Courses

    + + + +
  • + +
    +

    + + Artificial Intelligence + +

    + +

    + + https://www.edx.org + +

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + Data Science at Scale + +

    + +

    + + https://www.coursera.org/ + +

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + Descriptive Statistics + +

    + +

    + + https://www.edx.org/ + +

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + Developing Android Apps + +

    + +

    + + https://eu.udacity.com/ + +

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + Full Stack Foundations + +

    + +

    + + https://www.udacity.com/ + +

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + Functional Programming + +

    + +

    + + https://www.edx.org/ + +

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + Introduction to Logic + +

    + +

    + + https://www.coursera.org/ + +

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + Knowledge-Based AI: Cognitive Systems + +

    + +

    + + https://eu.udacity.com/ + +

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + Machine Learning + +

    + +

    + + https://www.coursera.org/ + +

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + Mathematics for Computer Science + +

    + +

    + + https://ocw.mit.edu/ + +

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + Nanodegree Program: Artificial Intelligence + +

    + +

    + + https://eu.udacity.com/ + +

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + Paradigms of Computer Programming + +

    + +

    + + https://www.edx.org + +

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + Statistics Inference + +

    + +

    + + https://www.edx.org/ + +

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + Statistics Probability + +

    + +

    + + https://www.edx.org/ + +

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + ChatGPT Prompt Engineering for Developers + +

    + +

    + + DeepLearning.ai + +

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + Introduction to Data-Centric AI + +

    + +

    -

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + LangChain for LLM Application Development - DeepLearning.ai + +

    + +

    -

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + LangChain: Chat with Your Data - DeepLearning.ai + +

    + +

    -

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + Hugging Face Course + +

    + +

    + + huggingface.co/course/ + +

    + + + +
    + +
    +
    + + +
  • + +
+ +
+
+ + + + + + + + + + + + + + + + + + + + + + + + + + + +
+ + +

+ Projects +

+ + + +
+ +
    + + + + + + + + + + + + + +
  • + +
    +

    + + MLOps Python Package + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + Kickstart your MLOps initiative with a flexible, robust, and productive Python package.

    https://github.com/fmind/mlops-python-package +

    + +
    + + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + Fixing the MLOps Survey with ChatGPT + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + Fixing the MLOps Survey on LLMs with the ChatGPT API.

    https://fmind.medium.com/fixing-the-mlops-survey-on-llms-with-chatgpt-api-lessons-learned-62d90e721331 +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + Kubeflow Demo + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + Kubeflow demo for the MLOps Community Meetup in Luxembourg. +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + MLflow Demo + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + MLflow demo for the MLOps Community Meetup in Luxembourg. +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + onet + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + Training and prediction procedures for a deep neural network (DNN) used for binary image classification +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + fincrawl + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + Crawl documents, metadata, and files from financial institutions +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + invest + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + Stock market analysis focused on dividends +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + parsoc + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + Convert DOCX files to JSON +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + Bigdata Tutorials + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + Tutorials for the Big Data course @ uni.lu +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + STASE: A set of statistical metrics to better understand and qualify malware datasets + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + A handful of statistical metrics to better understand and qualify malware datasets +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + apkworkers + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + A Celery application to distribute Android malware analysis +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + servalx + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + A set of tools and modules to process Android malware with AndroZoo +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + Euphony: Harmonious Unification of Cacophonous Anti-Virus Vendor Labels for Android Malware + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + Harmonious Unification of Cacophonous Anti-Virus Vendor Labels for Android Malware +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + Automatic Speech Recognition with Tensorflow + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + An automatic speech-recognition system based on TensorFlow +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + Dog Recognition with Tensorflow + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + A machine-learning model trained to recognize dogs, even from human faces +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + genius + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + An implementation of Scheme (a Lisp dialect) written in Haskell +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + Alexa History Skill + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + An Alexa skill that provides year-dated facts on demand +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + Air Cargo Planning System + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + An automated Air Cargo transport system based on AI planning +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + Sign Language Recognition System + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + A sign language recognition system based on Hidden Markov Models +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + AI Agent for the Isolation Game + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + An AI game agent to play the Isolation game +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + Sudoku Solver + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + A Diagonal Sudoku solver implemented in Python +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + lkml + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + Gather emails from https://lkml.org/ +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + Master 2 School Projects + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + School projects from 2013 to 2014 - Master 2 Sécurité des Systèmes d'Information (Metz) +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + chattail + + +

    + +

    + + + + + + + + - + + + +

    + + + +
    + + +
    +

    + Send log streams over XMPP to monitor your systems +

    + +
    +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + Master 1 School Projects + + +

    + +

    + + + + + + + + - + + + +

    + + + + +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + Bachelor School Projects + + +

    + +

    + + + + + + + + - + + + +

    + + + + +
    + + +
  • + + + + + + + + + + + + + + + +
  • + +
    +

    + + + Professional Bachelor School Project + + +

    + +

    + + + + + + + + - + + + +

    + + + + +
    + + +
  • + + + + + + + + + + + + + + + +
+ +
+
+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
+ + +

+ Languages +

+ + + +
+ +
    + + + +
  • + +
    +

    + + French + +

    + +

    + + Native or bilingual proficiency + +

    + + + +
    + +
    +
    + + +
  • + + + + +
  • + +
    +

    + + English + +

    + +

    + + Full professional proficiency + +

    + + + +
    + +
    +
    + + +
  • + +
+ +
+
+ + + + + + + + + + + + + + + + + + + + + +