Linear-nonlinear-Poisson cascade model

The linear-nonlinear-Poisson (LNP) cascade model is a simplified functional model of neural spike responses. [1] [2] [3] It has been successfully used to describe the response characteristics of neurons in early sensory pathways, especially the visual system. The LNP model is generally implicit when using reverse correlation or the spike-triggered average to characterize neural responses with white-noise stimuli.

The Linear-Nonlinear-Poisson Cascade Model

The LNP cascade model comprises three stages. The first is a linear filter, or linear receptive field, which describes how the neuron integrates stimulus intensity over space and time. The output of this filter then passes through a nonlinear function, which gives the neuron's instantaneous spike rate as its output. Finally, the spike rate is used to generate spikes according to an inhomogeneous Poisson process.

The linear filtering stage performs dimensionality reduction, reducing the high-dimensional spatio-temporal stimulus space to a low-dimensional feature space, within which the neuron computes its response. The nonlinearity converts the filter output to a (non-negative) spike rate, and accounts for nonlinear phenomena such as spike threshold (or rectification) and response saturation. The Poisson spike generator converts the continuous spike rate to a series of spike times, under the assumption that the probability of a spike depends only on the instantaneous spike rate.
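
As an illustration of how the three stages compose, the following is a minimal simulation sketch in Python. The random filter, the rectifying nonlinearity, and all constants here are hypothetical choices made for illustration, not part of the model's definition.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes: a 20-sample temporal filter and 1000 time bins of 1 ms.
dim, n_bins, dt = 20, 1000, 0.001

k = rng.normal(size=dim)                  # linear filter (stand-in receptive field)
stimulus = rng.normal(size=n_bins + dim)  # Gaussian white-noise stimulus

# Linear stage: project each bin's stimulus history onto the filter.
drive = np.array([stimulus[t:t + dim] @ k for t in range(n_bins)])

# Nonlinear stage: a rectifier maps filter output to a non-negative rate (Hz).
rate = 50.0 * np.maximum(drive, 0.0)

# Poisson stage: draw a spike count for each bin from its instantaneous rate.
spikes = rng.poisson(rate * dt)
print(spikes.sum(), "spikes in", n_bins, "bins")
```

Each bin's spike count depends on the stimulus only through the instantaneous rate, which is exactly the Poisson assumption described above.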

The model offers a useful approximation of neural activity: it is mathematically simple enough that its parameters can be estimated reliably from experimental data.

Mathematical formulation

Single-filter LNP

Let $\mathbf{x}$ denote the spatio-temporal stimulus vector at a particular instant, and $\mathbf{k}$ denote a linear filter (the neuron's linear receptive field), which is a vector with the same number of elements as $\mathbf{x}$. Let $f$ denote the nonlinearity, a scalar function with non-negative output. Then the LNP model specifies that, in the limit of small time bins,

$$P(\mathrm{spike}) \propto f(\mathbf{k}\cdot\mathbf{x}).$$

For finite-sized time bins, this can be stated precisely as the probability of observing $y$ spikes in a single bin:

$$P(y\ \mathrm{spikes}) = \frac{(\lambda\Delta)^y}{y!}\, e^{-\lambda\Delta},$$

where $\lambda = f(\mathbf{k}\cdot\mathbf{x})$ and $\Delta$ is the bin size.
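
As a quick numeric check of the finite-bin formula, the snippet below evaluates $P(y\ \mathrm{spikes})$ for a few counts $y$; the filter output and the rectifier standing in for $f$ are arbitrary illustrative choices.

```python
import math

# Arbitrary illustrative values: filter output k·x, rate scale, and bin size Δ.
k_dot_x = 1.3
lam = 40.0 * max(k_dot_x, 0.0)  # λ = f(k·x) with f(u) = 40·max(u, 0), in Hz
delta = 0.01                    # 10 ms bin

# P(y spikes) = (λΔ)^y / y! · exp(−λΔ)
for y in range(4):
    p = (lam * delta) ** y / math.factorial(y) * math.exp(-lam * delta)
    print(f"P({y} spikes) = {p:.4f}")
```

Since the counts are Poisson-distributed with mean $\lambda\Delta$, these probabilities sum to 1 over all $y$.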

Multi-filter LNP

For neurons sensitive to multiple dimensions of the stimulus space, the linear stage of the LNP model can be generalized to a bank of linear filters, and the nonlinearity becomes a function of multiple inputs. Let $\mathbf{k}_1, \mathbf{k}_2, \ldots, \mathbf{k}_n$ denote the set of linear filters that capture a neuron's stimulus dependence. Then the multi-filter LNP model is described by

$$P(\mathrm{spike}) \propto f(\mathbf{k}_1\cdot\mathbf{x},\, \mathbf{k}_2\cdot\mathbf{x},\, \ldots,\, \mathbf{k}_n\cdot\mathbf{x}),$$

or

$$P(\mathrm{spike}) \propto f(K\mathbf{x}),$$

where $K$ is a matrix whose rows are the filters $\mathbf{k}_i$.
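
The following sketch passes a stimulus through a small filter bank; the two random filters and the energy-style (sum-of-squares) nonlinearity are illustrative choices of the kind used in complex-cell models, not anything mandated by the model.

```python
import numpy as np

rng = np.random.default_rng(1)
dim, n_filters = 20, 2

K = rng.normal(size=(n_filters, dim))  # rows of K are the filters k_1, ..., k_n
x = rng.normal(size=dim)               # stimulus vector at one instant

# An energy-style multi-input nonlinearity: square and sum the filter outputs.
def f(projections):
    return float(np.sum(projections ** 2))

rate = f(K @ x)  # instantaneous spike rate, proportional to f(Kx)
print(rate)
```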

Estimation

The parameters of the LNP model consist of the linear filters $\{\mathbf{k}_i\}$ and the nonlinearity $f$. The estimation problem (also known as the problem of neural characterization) is the problem of determining these parameters from data consisting of a time-varying stimulus and the set of observed spike times. Techniques for estimating the LNP model parameters include:

- moment-based estimators, such as the spike-triggered average and spike-triggered covariance [4]
- maximum likelihood estimators [5]
- dimension-reduction methods based on Gaussian mixture models [6]
- white-noise (Wiener kernel) analysis [7]

A sketch of the moment-based approach follows this list.
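
As a concrete illustration of the moment-based route, here is a minimal sketch, assuming a Gaussian white-noise stimulus (under which the spike-triggered average is proportional to the true filter) and an illustrative rectifying nonlinearity; the names and constants are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(2)
dim, n_bins, dt = 20, 100_000, 0.001

k_true = np.sin(np.linspace(0, np.pi, dim))  # "unknown" filter to recover
stimulus = rng.normal(size=n_bins + dim)     # Gaussian white-noise stimulus

# Simulate spike counts from an LNP neuron with a rectifying nonlinearity.
windows = np.array([stimulus[t:t + dim] for t in range(n_bins)])
rate = 50.0 * np.maximum(windows @ k_true, 0.0)
spikes = rng.poisson(rate * dt)

# Spike-triggered average: the spike-count-weighted mean stimulus history.
sta = (spikes @ windows) / spikes.sum()

# Under white noise the STA is proportional to k_true; check the correlation.
print(np.corrcoef(sta, k_true)[0, 1])
```

Under correlated (e.g., natural) stimuli the raw STA is biased, which is one motivation for the likelihood-based and dimension-reduction estimators above.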

See also

- Stimulus–response model
- Principal component analysis
- Perceptron
- State-space representation
- Generalized linear model
- Granger causality
- Multilayer perceptron
- Kernel method
- Spike-triggered average
- Oja's rule
- Radial basis function network
- Biological neuron model
- Extended Kalman filter
- Spike-triggered covariance
- Poisson distribution
- Models of neural computation
- Maximally informative dimensions
- Biological motion perception
- Extreme learning machine
- Nonlinear multidimensional signal processing

References

  1. Chichilnisky, E. J. (2001). A simple white noise analysis of neuronal light responses. Network: Computation in Neural Systems 12:199–213.
  2. Simoncelli, E. P., Paninski, L., Pillow, J., & Schwartz, O. (2004). Characterization of neural responses with stochastic stimuli. In M. Gazzaniga (Ed.), The Cognitive Neurosciences, 3rd edn (pp. 327–338). MIT Press.
  3. Schwartz, O., Pillow, J. W., Rust, N. C., & Simoncelli, E. P. (2006). Spike-triggered neural characterization. Journal of Vision 6:484–507.
  4. Brenner, N., Bialek, W., & de Ruyter van Steveninck, R. R. (2000).
  5. Paninski, L. (2004). Maximum likelihood estimation of cascade point-process neural encoding models. Network: Computation in Neural Systems.
  6. Mirbagheri, M. (2012). Dimension reduction in regression using Gaussian mixture models. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP).
  7. Marmarelis, P. Z., & Marmarelis, V. Z. (1978). Analysis of Physiological Systems: The White Noise Approach. London: Plenum Press.