logo

European Institute for Statistics, Probability, Stochastic Operations Research and its Applications

About | Research | Events | People | Reports | Alumni | ContactHome


Young European statisticians Workshop (YES-IV)Workshop

Bayesian Nonparametric Statistics
November 8-9-10, 2010

PROGRAMME

Monday 8th November

09.30-10.30 Registration  
10.30-10.45 Welcome Connie Cantrijn
10.45-11.30 Harry van Zanten (I) Asymptotic theory for Gaussian process priors : Introduction
11.35-12.20 Zoubin Gharahmani (I) A Brief Overview of Nonparametric Bayesian Models
12.30-13.30 LUNCH  
13.30- 14.15 Judith Rousseau (I) Concentration properties of the posterior distribution for nonparametric mixture models
14.20-15.05 Yongdai Kim (I) Bayesian Survival analysis : Model, Prior and Posterior
15.15-15.45 Coffee/tea break  
15.45-16.05 J. Arbel Bayesian optimal adaptive estimation using a sieve prior
16.10-16.30 A. Babenko Oracle Posterior Rates in the White Noise Model
16.35-16.55 S. Khazei Nonparametric Bayesian estimation of densities under monotonicity constraint
17.00-17.20 M.A. Di Lucca A Non-parametric Bayesian Autoregressive Model for DNA-sequencing
17.25-17.45 S. Ventz Inference for non-homogeneous Markov Chains using Reinforced Urn Processes, with application to heart transplantation monitoring
18.30 Workshop Dinner  

Tuesday 9th November

09.00-09.45 Harry van Zanten (II) Contraction rates for Gaussian process priors
09.50-10.10 E. Bakra Applying spline smoothing regression on repeated measurements to link different cohorts
10.15-10.45 Coffee/tea Break  
10.45-11.30 Zoubin Gharahmani (II) Gaussian processes
11.35-11.55 R. de Jonge Adaptive Bayesian estimation using tensor product splines
12.00-12.20 B. Knapik Bayesian Inverse problems
12.35-13.30 LUNCH  
13.30-14.15 Zoublin Gharahmani (III) Indian Buffet processes
14.20-15.05 Yongdai Kim (II) Bayesian Survival analysis : Asymptotics
15.15-15.45 Coffee/tea Break  
15.45-16.30 Judith Rousseau (II) Semi-parametric Bayesian estimation
16.35-16.55 D. Bontemps Bernstein-von Mises Theorems for Gaussian regression with increasing number of regressors
17.00-17.20 W. Kruijer On Bayesian estimation of the long-memory parameter in the FEXP-model for Gaussian time series

Wednesday 10th November

09.00-09.45 Yongdai Kim (III) Bayesian Survival analysis : Computations and future works
09.50-10.35 Harry van Zanten (III) Contraction rates for Gaussian process priors
10.45-11.15 Coffee/tea Break  
11.15-12.00 Judith Rousseau (III) Asymptotic behaviour of the Bayes factor in nonparametric tests
12.00-12.30 Discussion  
12.30 Closing  

 



ABSTRACTS

MINI COURSES (keynote)

Zoubin Ghahramani

I. A Brief Overview of Nonparametric Bayesian Models
The flexibility of nonparametric Bayesian (NPB) methods for data modelling has generated an explosion of interest in the last decade in both Statistics and Machine Learning communities. I will give an overview of some of the main NPB models, and focus on the relationships between them. Focusing on the Dirichlet process (DP) and its relatives, I plan to give a whirlwind tour of the DP and Beta process, the associated Chinese restaurant and Indian buffet, hierarchical models such as Kingman's coalescent, the Dirichlet diffusion tree, and the Hierarchical DP (HDP), times series models such as the infinite HMM, dependent models such as the depedent Dirichlet process, and other topics such as completely random measures and stick-breaking constructions, time permitting. I will also try to give an overview of inference methods for NPB models (MCMC and alternatives).

II. Gaussian processes
Gaussian processes (GPs) are a fundamental stochastic process with a long history in Statistics and Machine Learning. GPs define distributions over unknown functions. and offer an elegant framework for Bayesian supervised kernel regression and classification, providing estimates of the uncertainty on quantities of interest and a principled framework for automatic feature selection and for learning the parameters of the kernel. I will give a tutorial on GPs, with Matlab demos, and discussion of the relation to support vector machines (SVMs). The tutorial will be loosely based on material and software from the textbook , "Gaussian Processes for Machine Learning" by Rasmussen and Williams (2006), freely available online and a must-have on every machine learning and statistics researcher's "bookshelf".


III. Indian Buffet processes
Much work in nonparametric Bayesian statistics focuses on the Dirichlet process (DP) and its associated combinatorial object, the Chinese restaurant process (CRP). The DP and CRP have found important uses in mixture modelling, allowing inference in models with a countable but unbounded number of mixture components. In analogy to CRPs, we have recently developed the Indian buffet process (IBP) which defines probability distributions on sparse binary matrices with exchangeable rows and an unbounded number of columns. The IBP makes it possible to define and do inference in models with an unbounded number of latent variables. I will review properties of the IBP, inference algorithms, and a number of applications, including: sparse latent factor and independent components models, time series models with an unbounded number of hidden processes, and nonparametric matrix factorisation models. Time permitting, I will describe recent extensions of the IBP.

PRESENTATION 1 - PRESENTATION 2 - PRESENTATION 3


Yongdai Kim

Bayesian Survival analysis


I. Model, Prior and Posterior
Bayesian analysis for censored data is reviewed. Firstly, Bayesian inference of the survival function with right censored data is considered. Neutral to right process priors are introduced and the corresponding posteriors are derived. Secondly, Bayesian analysis of the cumulative
hazard function is reviewed. In particular, beta processes are explained and shown to be conjugate. Also, Bayesian analysis of counting processes are considered. Thirdly, mixture priors are introduced and the corresponding posteriors are derived. Fourthly, Bayesian analysis of
the proportional hazards model is considered. In particular, the propriety of the posterior distribution with the uniform flat prior on the regression coefficients is explained. Finally, if time is allowed, Indian buffet processes are introduced and their extensions are discussed.

II. Asymptotics
Large sample properties of the posterior distribution of the survival function with right censored data are reviewed. For priors, a class of neutral ro right processes is considered. Firstly, it is shown that posterior consistency does not hold for all neutral to right process
priors and sufficient conditions for the posterior consistency are given. Most of popularly used priors including Dirichlet process, beta processes and gamma processes are shown to be consistent. Secondly, the Bernstein-von Mises theorem is discussed. A class of extended
beta processes is considered and necessary and sufficient conditions for the Bernstein-von Mises theorem are given. Then, sufficient conditions for the Bernstein-von Mises theorem for general neutral to right process priors are explained. Thirdly, the Bernstein-von Mises
theorem of the proportional hazards model is proved. Fourthly, the Bernstein-von Mises theorem with doubly censored data is explained.

III. Computations and future works
Various computational algorithms for Bayesian survival analysis are explained. Firstly, an MCMC algorithm with Dirichlet process prior is explained with complicated censoring data including interval censored data. Secondly, methods of generating sample paths of neutral
to right process priors are explained and applications to MCMC algorithms are discussed. Thirdly, an MCMC algorithm for the proportional hazards model is considered. Fourthly, a Bayesian bootstrap approach and its application to approximation of the posterior distribution are explained. Finally, some possible future researches for Bayesian survival analysis are discussed.

PRESENTATION 1 - PRESENTATION 2 - PRESENTATION 3


Judith Rousseau

I. Concentration properties of the posterior distribution for nonparametric mixture models
 

II. Semi-parametric Bayesian estimation
 

III. Asymptotic behaviour of the Bayes factor in nonparametric tests

Abstract

PRESENTATION 1 - PRESENTATION 2 - PRESENTATION 3
 


Harry van Zanten

Asymptotic theory for Gaussian process priors

I. Introduction
In the first lecture we give a brief general introduction to Bayesian nonparametrics, focussing on the use of Gaussian process priors. We give examples, show how such priors can be used in varous statistical settings, and briefly comment on computational issues.

II. Contraction rates for Gaussian process priors
In the second lecture we discuss the asymptotic behaviour of posterior distributions corresponding to Gaussian process priors. In particular, we explain how contraction rates are connected to the so-called small deviations behaviour of the prior and to the approximation of the function of interest by elements of the so-called RKHS of the prior. We compute contraction rates for several concrete Gaussian priors.

III. Contraction rates for Gaussian process priors
In the third lecture we consider the use of Gaussian process priors in hierarchical Bayes procedures. In particular, we explain how adaptive Bayes procedures can be constructed using conditionally Gaussian priors.

PRESENTATION 1 - PRESENTATION 2 - PRESENTATION 3
 


 

CONTRIBUTED SPEAKERS

Julyan Arbel

Bayesian optimal adaptive estimation using a sieve prior

We study the Bayes estimation of an infinite-dimensional parameter \theta. We propose a family of sieve priors and prove that the resulting Bayes estimators are adaptive optimal, both in posterior concentration rate and in risk convergence for the L2-norm loss. This result is applied to several models: density, regression and white noise. We prove that the same procedure is not optimal nor adaptive for the pointwise L2-norm loss and give a lower bound for the rate.

PRESENTATION


Alexandra Babenko

Oracle Posterior Rates in the White Noise Model

We apply a Bayesian approach to the problem of estimating a signal observed in the White noise model and we study the rate at which the posterior distribution, the main quantity of interest in Bayesian analysis, concentrates around the true value of the signal. A new benchmark for the posterior concentration rate, the so called posterior oracle rate, is proposed and studied. This is the smallest possible rate over a family of posterior rates corresponding to an appropriately chosen family of priors. To complement the upper bound results on the posterior concentration rate, we establish a lower bound result for the oracle rate. We also study implications for the model selection problem and present some simulations.

PRESENTATION


Eleni Bakra

Applying spline smoothing regression on repeated measurements to link different cohorts

Measurement of changes over the complete human lifespan is complex and no one study can expect to represent the complete population even if started at birth. Therefore, techniques need to be used that can draw on information from a range of studies to better understand the complete age range. However, combining studies has challenges to ensure that the study variability and differences can be properly modelled. Data from different cohorts across the life course are combined drawing together random effects and smoothing techniques to link different sources. The aim is to model between individual heterogeneity whilst producing a smoothed line across the life span by using a study where blood pressure measures from three cohorts representing the entire life span have been collected.


Dominique Bontemps

Bernstein-von Mises Theorems for Gaussian regression with increasing number of regressors

This work brings a contribution to the Bayesian theory of nonparametric and semiparametric estimation. We are interested in the asymptotic normality of the posterior distribution in Gaussian linear regression models when the number of regressors increases with the sample size. Two kinds of Bernstein-von Mises Theorems are obtained in this framework: nonparametric theorems for the parameter itself, and semiparametric theorems for functionals of the parameter. We apply them to the Gaussian sequence model and to the regression of functions in Sobolev and $C^{\alpha}$ classes, in which we get the minimax convergence rates. Adaptivity is reached for the Bayesian estimators of functionals in our applications.


Maria Anna Di Lucca

A Non-parametric Bayesian Autoregressive Model for DNA-sequencing

We consider the problem of base calling for data from high through-put sequencing (HTS) experiments. We propose a non-parametric Bayesian approach. The proposed model generalizes earlier approches based on mixtures of normals to mixtures of random probability measures. Complication arises from inherently autoregressive nature of the data (phasing). We use a variation of dependent Dirichlet process models (DDP) that define a non-parametric vector autoregressive model for the four-dimensional output from the four channels of the sequencing experiment.

PRESENTATION


René de Jonge

Adaptive Bayesian estimation using tensor product splines

In this talk I present a nonparametric procedure based on tensor product splines with Gaussian coefficients. The corresponding prior is conditionally Gaussian and (thus) provides a unified approach for a variety of statistical settings such as density estimation and regression. The goal is to show the procedures adapts to the true smoothness and to determine the rate of posterior contraction around the truth.


Soleiman Khazaei

Nonparametric Bayesian estimation of densities under monotonicity constraint

In this paper we discuss consistency of the posterior distribution in cases where the Kullback-Leibler condition is not veried. This condition is stated as : for all \ep > 0 the prior probability of sets in the form {f ; K L(f0, f ) < \ep} where K L(f0, f ) denotes the Kullback-Leibler divergence between the true density f0 of the observations and the density f , is positive. This condition is in almost cases required to lead to weak consistency of the posterior distribution, and thus to lead also to strong consistency. However it is not a necessary condition. We therefore present a new condition to replace the Kullback-Leibler condition, which is usefull in cases such as the estimation of decreasing densities. We then study some specifc families of priors adapted to the estimation of decreasing densities and provide posterior concentration rate for these priors, which is the same rate a the convergence rate of the maximum likelihood estimator. Some simulation results are provided.

PRESENTATION


Bartek Knapik

Bayesian Inverse problems

In this talk I will propose Bayesian approach to inverse problems with Gaussian white noise, based on Gaussian priors. I will focus on two aspects of inverse problems - estimation of the full parameter of interest and linear functional of it. For the ease of the presentation I will talk about so-called mildly ill-posed inverse problems, although the presented theory can be easily adapted to other various settings. Both in nonparametric and linear functional case, the rate of the contraction of the posterior distribution around the truth con be computed. Moreover, under some conditions on the prior Bayesians can construct credible sets that coincide with frequentists' confidence regions. The additional result in the linear functional setting is Bernstein-von Mises theorem, which under suitable conditions on the linear functional and the prior shows that the centered posterior for the linear functional of the truth and the asymptotic distribution of asymptotically efficient estimator centered at the truth are close in total variation norm.

PRESENTATION


Willem Kruijer

On Bayesian estimation of the long-memory parameter in the FEXP-model for Gaussian time series

Abstract


Steffen Ventz

Inference for non-homogeneous Markov Chains using Reinforced Urn Processes, with application to heart transplantation monitoring

Reinforced Urn processes (RUP) are a class of reinforced random walks on a countable state space of Polya Urns. Under suitable recurrence conditions, the RUP can be represented as a unique mixture of Markov chains,
with known mixing measure. We construct a particular class of RUP on a countable state-space and provide sufficiency conditions for recurrence. Under recurrence assumptions, we use the unique mixture representation of the RUP and the exchangeability of the X0-Blocks to induce a nonparametric prior on the space of stochastic transition arrays for non-homogeneous Markov Chains. Our process can be used for Multi-State longitudinal problems, where the dependence through time might be Markovian but not necessarily time homogeneous. Potentially several individuals are repeatedly observed through time. Individuals itself may be judged to be exchangeable, whereas for a fixed individual the repeated measurements are assumed to follow a non-homogeneous Markovian structure, conditional on an unknown transition array. Exact posterior estimates for the unknown transition array and functionals of the transition array can be obtained trough the predictive distribution of the RUP. We apply the constructed RUP to the problem of heart transplantation monitoring.


Last updated 13-dec-2010,
By
PK
 

    P.O. Box 513, 5600 MB  Eindhoven, The Netherlands
tel. +31 40 2478100  fax +31 40 2478190  
  e-mail: mailto:office@eurandom.tue.nl