One major statistical challenge raised by NGS data is ultra-high dimensionality: the number of measurements explodes while the number of individuals remains low to moderate (NGS potentially yields as many measurements as there are genomic positions). This curse of dimensionality requires the development of new statistical methods, even for standard questions such as clustering and classification. Lasso-type methods based on L1 penalization have received enormous attention in recent years, owing to their combined computational and statistical efficiency. Among the different strategies, fused-lasso penalties were designed to enforce sparsity in spatially organized data. The development of lasso and fused-lasso methods for alignment-based NGS data is the central challenge of this PhD project. NGS data are counts that can be over-dispersed, which makes Generalized Linear Models an appropriate framework for this purpose. Another possible research direction is to develop penalized versions of Partial Least Squares (PLS) methods. PLS is widely used for efficient dimension reduction: it compresses variables on the basis of an empirical covariance criterion. PLS-Lasso strategies would be an interesting way to compress and select relevant biological features from NGS data.
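As a rough illustration of the kind of criterion involved, the sketch below evaluates a fused-lasso penalized objective for count data. It is not the project's method: it assumes the simplest possible setting, a Poisson GLM with log link (which does not itself model over-dispersion; a negative binomial likelihood would be one way to handle that), and the function name and the tuning parameters `lam_sparse` and `lam_fuse` are hypothetical.

```python
import numpy as np

def fused_lasso_poisson_objective(beta, X, y, lam_sparse, lam_fuse):
    """Penalized negative Poisson log-likelihood, up to a constant in y.

    Assumption: coefficients in `beta` are ordered along the genome, so
    neighboring entries correspond to neighboring positions.
    """
    beta = np.asarray(beta, dtype=float)
    eta = X @ beta                                   # linear predictor (log link)
    nll = np.sum(np.exp(eta) - y * eta)              # Poisson negative log-likelihood
    sparsity = lam_sparse * np.sum(np.abs(beta))     # L1 (lasso) term: few nonzero coefficients
    fusion = lam_fuse * np.sum(np.abs(np.diff(beta)))  # fused term: few jumps between neighbors
    return nll + sparsity + fusion
```

The first penalty drives individual coefficients to zero, while the second penalizes differences between adjacent coefficients, which favors piecewise-constant profiles along the genome. Minimizing this objective would require a dedicated solver, which is not shown here.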
This PhD project will be part of the ABS4NGS project, which was recently selected by the “investissement d’avenir” call. ABS4NGS gathers a consortium of Algorithmicians, Bioinformaticians, Statisticians, and Biologists, funded for 4 years to develop mathematical approaches for the analysis of NGS data. We are seeking candidates interested in statistical methodology and applied statistics. The successful candidate will be based at the LBBE, Lyon, and will work in collaboration with Sophie Lambert-Lacroix (TIMC-UPMF, Grenoble), Vivian Viallon (IFSTTAR-ICJ, Lyon) and Franck Picard (LBBE, Lyon).
– Links to ABS4NGS :
http://www.enseignementsup-
– Pages :
http://membres-timc.imag.fr/
http://www.inrets.fr/
http://pbil.univ-lyon1.fr/
– contact fr***********@un********.fr
—
Franck Picard – CNRS
Laboratoire Biometrie et Biologie Evolutive
UCB Lyon 1 – Bât. Grégor Mendel
43 bd du 11 novembre 1918
69622 VILLEURBANNE cedex, France
tel : +33 (0)4 72 44 85 44
fax : +33 (0)4 72 43 13 88
http://lbbe.univ-lyon1.fr/-