Cerfacs Enter the world of high performance ...

From 20 May 2019 to 23 May 2019

Machine learning for data science

nasri |  

 

PROGRAMMEE
Deadline for registration: 15 days before the starting date of each training
Duration : 4 days / (28 hours)

Pre-registration

Abstract

This training course enables the participants to reinforce their theoretical and practical knowledge in order to implement machine learning techniques for the automatic analysis of data. The main statistical methods for data analysis are presented, both for data exploration (non-supervised learning) and for prediction (supervised learning). Each method is first presented and commented on a theoretical level, and then illustrated on numerical experiments run with public datasets using R and/or  python/scikit-learn software.

Objective of the training

To know the main algorithms of automatic data analysis, and to know how to use them with R and/or python/scikit-learn.

Learning outcomes

The participants should be able to :

  • recognize the type of problem that they are facing (supervised or non-supervised learning, sequential learning, reinforcement learning…);
  • choose the right algorithm to use;
  • use an R on python implementation of this algorithm.

Target participants

This training session is for students, engineers, and computer scientists who wish to reinforce or extend their theoretical background and practical knowledge on automatic data analysis by statistical learning algorithms.

Prerequisites

Basic knowledge in statistics: elementary probability, statistical tests, Gaussian linear model.

Basic knowledge in algorithmic and programming.

Install  Python 2.7 with Anaconda, R 3.4.2 and IRkernel. Internet access during the sessions in order to get possible updates or to load additional libraries.

In order to verify that the prerequisites are satisfied, the following questionnaire must be completed. You need to get at least 75% of correct answers in order to be authorized to follow this training session. If you don’t succeed it, your subscription will not be validated. You only have two chances to complete it.

Questionnaire 1  https://goo.gl/forms/xL86TzPDFOC5r7ln1

Scientific contacts: Philippe Besse, Sébastien Gerchinovitz, Béatrice Laurent-Bonneau

Fee

  • Traines/PhDs/PostDocs : 280 €
  • Cerfacs shareholders/CNRS/INRIA : 800 €
  • Public : 1600 €

Program

Every day from 9h to 17h30.

Morning: lecture; afternoon: hands-on sessions.

Day 1

General presentation of statistical machine learning and its main approachs. Comparison with traditional statistics and machine learning.
Unsupervised learning:
– Principal component analysis
– Agglomerative Hierarchical Clustering
– k-means, k-medoids and variants
– overview of other methods : Affinity Propagation, dbscan, etc.
Day 2
Supervised learning 1 / 2 :
– k nearest neighbors
– Gaussian linear model, logistic regression, model selection
– LASSO et variants
– Support Vector Machines
Day 3
Supervised learning 2 / 2 :
– Decision Trees
– Bagging, Random Forests, Boosting
– Neural networks, deep learning
Day 4
Sequential learning, multi-armed bandit problems
Super-learning and expert aggregation
Reinforcement learning (introduction)

Final examination

A final exam will be conducted during the training.

 

Pre-registration

 

NEWS

Sparse Days Meeting 2019 at Cerfacs, Toulouse

Brigitte Yzel |  12 May 2019

The annual Sparse Days meeting will be held at CERFACS in Toulouse on 11th and 12th July 2019. Registration for the Sparse Days is free but we ask people who are coming to register as soon as possible although the deadline is June 14th. Please complete the registration form (deadline : June 14th) indicating whether you want to give a talk and whether you wish to attend the conference dinner. Although an emphasis will be on parallel aspects, any talk that has an association with sparsity is welcome.Read more


The Telemac-Mascaret User Conference 2019 will be held in Toulouse on October 15-17th

superadmin |  2 May 2019

The conference  is organized by the European Center for Advanced Research and Training for Computational Science (CERFACS), on the Météo-France campus, in the conference room CIC. The conference will start with a one-day technical workshop (October 15th 2019), followed by a two-day conference on October 16-17th. A gala dinner will take place on the 16th evening for delegates and guests. Please check  more details.Read more

ALL NEWS