Semiparametric spectral modeling of the Drosophila connectome

Department of Applied Mathematics and Statistics
Center for Imaging Science
Human Language Technology Center of Excellence
Johns Hopkins University
and
University of Cincinnati
and
HHMI Janelia Research Campus


C.E. Priebe, Y. Park, M. Tang, A, Athreya, V. Lyzinski, J. Vogelstein, Y. Qin, B. Cocanougher, K. Eichler, M. Zlatic, A. Cardona, “Semiparametric spectral modeling of the Drosophila connectome,” Journal of the American Statistical Association Application and Case Studies, submmitted, 2017.

More comprehensive version was published in

Avanti Athreya, Donniell E. Fishkind, Keith Levin, Vince Lyzinski, Youngser Park, Yichen Qin, Daniel L. Sussman, Minh Tang, Joshua T. Vogelstein, Carey E. Priebe, "Statistical inference on random dot product graphs: a survey," Journal of Machine Learning Research, 18(226):1−92, 2018.

Abstract

We present semiparametric spectral modeling of the complete larval Drosophila mushroom body connectome. The resulting connectome code derived via semiparametric Gaussian mixture mod- eling composed with adjacency spectral embedding captures biologically relevant neuronal prop- erties.

Keywords: Connectome; Network; Graph; Spectral embedding; Mixture model; Clustering

Illustration of the Drosophila Mushroom Body circuit as a (K = 4)-block directed stochastic block model. This first-order simplification of the structure of the circuit provides a framework for graph embedding inference, and for subsequent Knowledge Base interaction.

Data

HHMI Janelia recently reconstructed the complete wiring diagram of the higher order parallel fiber system for associative learning in the larval Drosophila brain, the mushroom body (MB). Memories are thought to be stored as functional and structural changes in connections between neurons, but the complete circuit architecture of a higher-order learning center involved in memory formation or storage has not been known in any organism … until now. This data set provides a real and important example for initial investigation into synapse-level structural connectome modeling.

Our MB connectome was obtained via serial section transmission electron microscopy of an entire larval Drosophila nervous system. This connectome contains the entirety of MB intrinsic neurons called Kenyon cells and all of their pre- and post-synaptic partners.

Codes and Experiments

To run the experiemnts in the paper, please follow these steps:

The larval Drosophila mushroom body connectome

Output of this chunk is shown here.

library(mbstructure)
data(MBconnectome)

out <- generate.graph(newrdat, vdf.right)
g <- out$g
vdf <- out$vdf
plotConnections(g, vdf)

Spectral clustering

Output of this chunk is shown here.

dmax <- 50
Xhat <- doEmbed(g, dmax)

Kmax <- 19
mc <- Mclust(Xhat, 2:Kmax)
vdf$cluster <- factor(mc$class)
plotBIC(mc)
plotClustering(Xhat, mc, vdf)

Semiparametric spectral modeling

Output of this chunk is shown here.
Warning: This takes several minutes to run on my laptop!

sout4 <- synthMB(g, Xhat, vdf, labK=vdf$type, Khat=4, dtype="truth", doplot=TRUE)
semiout <- plotMLE(Xhat, vdf) 

Discussion

Directed! Weighted?

Output of this chunk is shown here.
Warning: This takes about half an hour to run on my laptop!

g.w <-out$g.w
compairARI(g, g.w, vdf)

Synthetic validation

Output of this chunk is shown here.
Warning: This takes a few minutes to run on my laptop!

syntheticValidation(g, Xhat, vdf)

Hemispheric validation: right vs. left

Output of this chunk is shown here.
Warning: This takes a few minutes to run on my laptop!

right.vs.left(Xhat, vdf, semiout$out100)

R Package

The latest R source package can be installed via:

install.packages("http://www.cis.jhu.edu/~parky/MBstructure/mbstructure_0.1.0.tar.gz",type="source",method="wget")

or

require(devtools)
devtools::install_github("youngser/mbstructure")
library(help='mbstructure')
##      Information on package 'mbstructure'
## 
## Description:
## 
## Package:       mbstructure
## Type:          Package
## Title:         Semiparametric spectral modeling of the complete
##                larval Drosophila mushroom body connectome
## Version:       0.1.0
## Depends:       R (>= 3.0)
## Imports:       igraph, Matrix, lattice, ggplot2, mclust, irlba,
##                mvtnorm, RColorBrewer, packcircles
## Author:        Youngser Park, Yichen Qin
## Maintainer:    Youngser Park <[email protected]>
## Description:   Routine to perform semiparametric spectral modeling
##                of the complete larval Drosophila mushroom body
##                connectome.
## License:       GPL (>= 2)
## URL:           http://www.cis.jhu.edu/~parky/MBstructure.html
## LazyData:      TRUE
## RoxygenNote:   5.0.1
## Built:         R 3.3.2; ; 2017-02-25 13:26:29 UTC; unix
## 
## Index:
## 
## hello                   Hello, World!

prepared by [email protected] on Wed Mar 1 14:04:30 2017