This event has passed.

YES VI: “Statistics for Complex and High Dimensional Systems”

Name: YES VI: “Statistics for Complex and High Dimensional Systems”
Start: 2013-01-28T00:00:00+00:00
End: 2013-01-29T23:59:59+00:00
Location: MF 11-12 (4th floor MetaForum Building, TU/e)

Jan 28, 2013 - Jan 29, 2013

Summary

The dramatic improvement in data collection and acquisition technologies in the last decades has allowed for the monitoring and study of extremely complex systems, such as biological, social and computer networks. The extremely complex and high-dimensional nature of these systems, and the dramatic growth in dataset sizes gives rise to important research questions: how to perform meaningful statistical inference from very large and potentially corrupted complex data sets? Which properties of these complex systems can be inferred from such data? Can sound inference methodologies be also made computationally feasible? These and other questions have recently attracted the attention of a large number of researchers worldwide and, although important progress has been made in recent years, there are still many open and emerging problems in the general area of statistical inference in complex and high-dimensional systems

The aim of the YES workshop “Statistics for Complex Networks and High Dimensional Systems” is to introduce this broad field of research to young researchers, in particular Ph.D. students, postdoctoral fellows and junior researchers who are interested and eager to tackle new challenges in the study of complex networks. The workshop will consist of tutorial courses given by some of the world experts in the field, each consisting roughly of 3 hours of lectures. This workshop immediately precedes the workshop Statistics for Complex Networks and provides a solid introduction for the general topical area of that workshop.

Organizers

Rui Castro	TU Eindhoven
Geurt Jongbloed	TU Delft

Tutorial Speakers

Eric Kolacyk	Boston University
Johan Koskinen	University of Manchester
Martin Wainwright	University of California – Berkeley

Invited Speakers

Ivan Vujacic	University of Groningen
Abdolreza Mohammadi	University of Groningen
Nynke Niezink	University of Groningen

Programme

Monday (January 28th)

09:45 – 10:05 Coffee and Registration
10:05 – 10:15 Opening Remarks
10:15 – 11:15 Eric Kolaczyk – Statistical Analysis of Network Data (Part I)
11:15 – 11:30 Coffee Break
11:30 – 12:30 Eric Kolaczyk – Statistical Analysis of Network Data (Part II)
12:30 – 13:30 Lunch
13:30 – 14:30 Martin Wainwright – Graphical models and message-passing algorithms (Part I)
14:30 – 14:45 Coffee Break
14:45 – 15:45 Martin Wainwright – Graphical models and message-passing algorithms (Part II)
15:45 – 16:15 Coffee Break
16:15 – 16:45 Ivan Vujacic – Selecting $\ell_1$ penalized Gaussian graphical models using Generalized Information Criterion
16:45 – 17:15 Reza Mohammadi – Network determination based on birth-death MCMC inference
18:30 – Workshop Dinner

Tuesday (January 29th)

09:30 – 10:30 Eric Kolaczyk – Statistical Analysis of Network Data (Part III)
10:30 – 10:45 Coffee Break
10:45 – 11:45 Martin Wainwright- Graphical models and message-passing algorithms (Part III)
11:45 – 12:00 Break
12:00 – 12:30 Nynke Niezink – Co-evolution of social networks and continuous actor attributes
12:30 – 13:30 Lunch
13:30 – 14:30 Johan Koskinen – Statistical analysis of social networks (Part I)
14:30 – 14:45 Break
14:45 – 15:45 Johan Koskinen – Statistical analysis of social networks (Part II)
15:45 – 16:15 Break
16:15 – 17:15 Johan Koskinen – Statistical analysis of social networks (Part III)
17:15 Closing of the workshop

Abstracts

Eric Kolaczyk

Statistical Analysis of Network Data
Over the past decade, the study of so-called “complex networks” — that is, network-based representations of complex systems — has taken the sciences by storm. Researchers from biology to physics, from economics to mathematics, and from computer science to sociology, are more and more involved with the collection, modeling and analysis of network-indexed data. With this enthusiastic embrace of networks across the disciplines comes a multitude of statistical challenges of all sorts — many of them decidedly non-trivial. In this tutorial, we will cover a brief overview of the foundations common to the statistical analysis of network data across the disciplines, from a statistical perspective, in the context of topics like network summary and visualization, network sampling, network modeling and inference, and network processes. Concepts will be illustrated drawing on examples from bioinformatics, computer network traffic analysis, neuroscience, and social networks.

Presentation part 1 ; Presentation part 2 ; Presentation part 3

Johan Koskinen

Statistical analysis of social networks
This tutorial will focus on the analysis of social networks with an emphasis on the type of inferences that are typically sought in social science applications. In the standard social network analysis paradigm, networks are generally of smaller sizes, contained by well-defined contexts, and a detailed understanding of the processes that gave rise to them is strived for. The tutorial has three themes. First we will review basic data structures and the past history of methodological developments – from non-parametric approaches to null-distribution-based testing – with a view to understanding the particular issues of doing statistical analysis for complex network structures. In the second theme we will focus on the analysis of cross-sectional data through exponential random graph models (ERGM). We will approach ERGM both as a pragmatic tool for modelling networks through lower-dimensional statistics and as derived from principled assumptions about dependencies among tie-variables. Issues of estimation and interpretation will be covered by way of empirical examples. An orientation to extensions of ERGM to more general network objects will be given. The third theme will introduce stochastic actor-oriented models (SAOM) for the analysis of dynamic, longitudinal networks. SAOM is the current best method for analysing network panel data and has a very general formulation that lends itself to easy extension to many types of processes. In essence, the SAOM is a discrete Markov process in continuous time, where the emphasis is on modelling the embedded jump-transitions between states. We will go through the basic modelling framework and estimation strategies and then discuss recent and forthcoming extensions.

Presentation (Why statistics) ; Presentation (ERGM) ; Presentation (IntroSAOM)

Martin Wainwright

Graphical models and message-passing algorithms: some introductory lectures
Graphical models combine ideas from probability theory and
graph theory, and play a central role in many sub-disciplines of
statistics, applied mathematics and computer science (among them
computer vision, error-control coding, satisfiability, and
computational biology). In this three-part series, we provide an
introduction to the basics of graphical models, efficient
message-passing algorithms for computing likelihoods and marginals,
and the problem of graphical model selection.

Tutorial notes on basics of graphical models:
http://www.eecs.berkeley.edu/~wainwrig/Graphical/Wai12_Basics.pdf

More advanced research monograph on graphical models and variational methods:
https://people.eecs.berkeley.edu/~wainwrig/Graphical/WaiJor08_FTML.pdf