Since September 2020, after having been a Professor of Statistics at Ecole Polytechnique and a visiting researcher at Google Brain AI Paris, I hold a new position as Advanced Researcher at Inria. I am a scientific collaborator associated with CMAP Polytechnique and I teach at Ecole Polytechnique, in particular causal inference.
News: Workshop 22-23 June 2021. Leveraging observational data with ML.

– Talk: Supervised learning with missing values (presentation of linear models, random forests, neural networks): slides, videos
– Talk: Causal inference with missing values: slides, videos
– Talk: A missing values tour: slides, video available (starts at 30′)
Intern/PhD/Postdoc positions on causal inference and policy learning for personalized medicine. Contact me. An internship on predicting the need for intubation is available.

Project TrauMatrix: decision tool for intensive care; short video, podcast
– ICUBAM: ICU Bed Availability Monitoring in the Grand Est region during the COVID-19 epidemic. Fork.
R-miss-tastic website, with missing values resources (lectures, workflows, tutorials, etc.). Contribute!
Rforwards: dedicated to widening the participation of minorities in the R community.

Office 226, Inria Montpellier. Member of IDESP.

My main research fields are: missing values (EM algorithms, imputation, supervised learning); causal inference (treatment effect estimation, combining RCT and observational data, survival analysis); visualization with dimensionality reduction (PCA, correspondence analysis); questionnaire analyses; multi-block data; low-rank matrix estimation. The main application is in health, for personalised medicine. Detailed CV


Projects and collaborations:
Causal inference:
– Causal inference with missing values  (with Stefan Wager) 
– Transporting causal effects, combining RCT and observational data (with Shu Wang)
– Survival causal inference, sensitivity analyses, policy learning
Missing values:
– Missing Not At Random data (with Claire Boyer)
– Supervised Learning with missing values: Random Forests, Neural Nets (with Erwan Scornet, Gael Varoquaux)
– Variable selection to control the FDR with missing values (with Gosia Bogdan)
– PCA with missing values, multiple imputation, package missMDA (with François Husson)
Health applications:
– Handling severe trauma patients, with the Traumabase group, J.P. Nadal and Capgemini
– Covid-19: application for bed allocation monitoring / predicting the need for intubation / effect of hydroxychloroquine
– New collaborations: with Jes Frellsen (with a grant) and J.P. Vert at Google
– Distributed computation with hospital data (with Balasubramanian Narasimhan)
– Exploratory data analysis (What was the French school of data analysis?)

Students & group meetings: the missing data and causal inference group at Inria

Associate Editor: JMLR. Past: Journal of Computational & Graphical Statistics, Journal of Statistical Software (7 years). AC for NeurIPS 2020, ICLR 2021.

An overview of my research up to 2016 can be found in my Habilitation (slides).

Software – R

I am involved in the R software community and I am sincerely glad to have been elected as a member of the R Foundation for Statistical Computing. If R is helping you, please help us by supporting it with a donation.

Development of packages:
FactoMineR: visualization with principal components methods
missMDA: missing values (imputation of continuous and categorical data) – matrix completion
For questions on the use of the packages, we have a Google group.
denoiseR: low rank matrix estimation with regularized SVD and bootstrap

My students have also developed R packages associated with our work:
misaem: logistic regression with missing values
mimi: Generalized low-rank models for mixed and incomplete data frames.
lori: contingency table with missing values and covariates

Development of R-miss-tastic:
Project funded by the R Consortium (Infrastructure Steering Committee) to federate the community. Aim: a reference platform on the theme of missing data management (listing existing packages, available literature, tutorials, analysis workflows on data, main actors, etc.).

If you want to do causal inference with missing values, you can use the R package grf, in which a doubly robust method handling missing covariates is implemented, and see the pipeline for comparing different estimators (IPW, DR) and strategies (imputation, etc.).
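
For readers new to the two estimator families named above, here is a minimal sketch of IPW and augmented IPW (doubly robust) estimation of an average treatment effect. It is written in plain Python rather than R, it is not the grf implementation, and it does not handle missing covariates: the data are simulated, the propensity scores and outcome models are assumed known (oracle values), and the function names `ipw_ate`, `aipw_ate` and `simulate` are hypothetical.

```python
import random


def ipw_ate(y, w, e):
    """Inverse probability weighting (IPW) estimate of the average treatment effect.

    y: outcomes, w: binary treatments (0/1), e: propensity scores P(W=1 | X).
    """
    n = len(y)
    return sum(wi * yi / ei - (1 - wi) * yi / (1 - ei)
               for yi, wi, ei in zip(y, w, e)) / n


def aipw_ate(y, w, e, mu0, mu1):
    """Augmented IPW (doubly robust, DR) estimate of the average treatment effect.

    mu0, mu1: predicted outcomes under control and under treatment.
    """
    n = len(y)
    total = 0.0
    for yi, wi, ei, m0, m1 in zip(y, w, e, mu0, mu1):
        # Outcome-model contrast plus inverse-probability-weighted residuals.
        total += (m1 - m0) + wi * (yi - m1) / ei - (1 - wi) * (yi - m0) / (1 - ei)
    return total / n


def simulate(n, seed=0):
    """Toy data (an assumption for illustration): one covariate, true ATE = 2."""
    rng = random.Random(seed)
    x = [rng.random() for _ in range(n)]
    e = [0.2 + 0.6 * xi for xi in x]                  # true propensity score
    w = [1 if rng.random() < ei else 0 for ei in e]   # confounded treatment
    y = [2.0 * wi + xi + rng.gauss(0.0, 0.5) for xi, wi in zip(x, w)]
    return x, w, y, e


if __name__ == "__main__":
    x, w, y, e = simulate(5000)
    mu0 = list(x)                  # oracle E[Y | X, W=0] in this simulation
    mu1 = [xi + 2.0 for xi in x]   # oracle E[Y | X, W=1] in this simulation
    print("IPW estimate: ", ipw_ate(y, w, e))
    print("AIPW estimate:", aipw_ate(y, w, e, mu0, mu1))
```

In practice the propensity scores and outcome models are unknown and have to be estimated, and with incomplete covariates that estimation step is where the strategies mentioned above (imputation, or methods that handle missing values directly) come in.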

Development of ICUBAM (ICU Bed Availability Monitoring) as an open source project with Inria, to visualize the availability of ICU beds. This started as a personal initiative from an intensive care physician in the Grand Est region who identified the need to visualize available COVID+ beds (with a ventilator) in real time. ICUBAM is an operational tool for intensive care physicians in times of crisis to model patient flows, anticipate bed needs and receive patients from overwhelmed areas. ICUBAM has been deployed in 130 ICU wards in 40 départements, and inventories more than 2,000 ICU beds. Slides application, Slides models, paper, github.

I served as an associate editor of the Journal of Statistical Software (2011-2017) and I am involved in Rforwards, which leads the R community forwards in widening the participation of women and other under-represented groups. I am on the R Foundation conference committee and work on the implementation of a Code of Conduct.
With M. Chavent, S. Dray, R. Genuer, F. Husson, B. Liquet and J. Sarracco, we created the « French R board group » to support the organization of Les Rencontres R.

News: Video presentation of Rforwards. Blog posts and multivariate studies of the R community.
Support R with the R consortium.


As a French professor, I taught around 160 hours/year (lectures and computer labs, mainly with the R software) and I supervised master's students' projects and their internships in industry. I was the head of a master's program in Data Science for Business at Ecole Polytechnique. In addition, I give tutorials in different institutes and at conferences. Learn more. Since September 2020, I have taught Causal Inference in the IPP (Institut Polytechnique de Paris) Master of Data Science at Polytechnique. For recent tutorials on missing values, see the R-miss-tastic platform.



Julie Josse's first position was in the statistics department of an agronomy university (Agrocampus Ouest), where she was trained in « the French data analysis school » and had the opportunity to work closely with researchers from other departments, which increased her interest in cross-disciplinary studies. In the meantime, she prepared her PhD, which she defended in 2010 and which was awarded the prize for the best PhD in applied statistics by the French Statistical Society. She has specialized in missing data, visualization and the nonparametric analysis of complex data structures. Her work was rewarded with a Marie Curie European Union grant in 2013 to increase her research potential and to spend a year at Stanford University. She spent a year as a researcher at Inria before joining Polytechnique in 2016 as a Professor of Statistics. At Polytechnique, she was responsible for a master's program in data science for business in collaboration with HEC. She was a visiting researcher at Google Brain Paris for a year (2 days a week) in 2019. In September 2020, she joined Inria as an advanced researcher to set up a team in data science for health. She has published over 50 articles and written 2 books in applied statistics. Her expertise in dealing with incomplete data is recognized by the community: she organized an ICML workshop and the MissData conference, created the R-miss-tastic website, and is often invited to give lectures to share her experience. Her vocation is to push methodological innovation so that her research leads to useful applications for users, in particular in the biosciences and health. Her current research focuses on causal inference techniques for personalized medicine. She leads a project with the Traumabase group dedicated to the management of polytraumatized patients, to help emergency doctors make decisions.
Julie Josse is dedicated to reproducible research with the R statistical software: she has developed packages including FactoMineR, denoiseR and missMDA to transfer her work, and she is a member of the R Foundation and of Rforwards, which aims to increase the participation of minorities in the community.

Personal: I grew up in Africa and French Polynesia. Then I arrived in Brittany, a magnificent French region, and I had the chance to discover Paris and now the south of France. I am passionate about statistics but also about travelling (often on horseback) around the world. I am also fascinated by nature and science (a fan of Wildlife Photographer of the Year). I have a particular interest in humanitarian issues and my long-term goal is to use more of my skills for these purposes.

Interview in Medium.


