healthy and lesion models for learning based on the joint estimation of stochasticity and volatility

Last update: Nov 01, 2022

Related tags

Overview

health-lesion-stovol

healthy and lesion models for learning based on the joint estimation of stochasticity and volatility

Reference

please cite this paper if you use this code: Piray P and Daw ND, 'A model for learning based on the joint estimation of stochasticity and volatility', 2021, Nature Communications.

Description of the models

This work addresses the problem of learning in noisy environments, in which the agent must draw inferences (e.g., about true reward rates) from observations (individual reward amounts) that are corrupted by two distinct sources of noise: process noise or volatility and observation noise or stochasticity. Volatility captures the speed by which the true value being estimated changes from trial to trial (modeled as Gaussian diffusion); stochasticity describes additional measurement noise in the observation of each outcome around its true value (modeled as Gaussian noise on each trial). The celebrated Kalman filter makes inference based on known value for both stochasticity and volatility, in which volatility and stochasticity have opposite effects on the learning rate (i.e. Kalman gain): whereas volatility increases the learning rate, stochasticity decreases the learning rate.

The learning models implemented here generalize the Kalman filter by also learning both stochasticity and volatility based on observations. An important point is that inferences about volatility and stochasticity are mutually interdependent. But the details of the interdependence are themselves informative. From the learner’s perspective, a challenging problem is to distinguish volatility from stochasticity when both are unknown, because both of them increase the noisiness of observations. Disentangling their respective contributions requires trading off two opposing explanations for the pattern of observations, a process known in Bayesian probability theory as explaining away. This insight results in two lesion models: a stochasticity lesion model that tends to misidentify stochasticity as volatility and inappropriately increases learning rates; and a volatility lesion model that tends to misidentify volatility as stochasticity and inappropriately decreases learning rates.

Description of the code

learning_models.py contains two classes of learning models:

LearningModel that includes the healthy model and two lesion models (stochasticity lesion and volatility lesion models)
LearningModelGaussian is similar to LearningModel with the Gaussian generative processes for stochasticity and volatility diffusion.

Inference in both classes is based on a combination of particle filter and Kalman filter. Given particles for stochasticity and volatility, the Kalman filter updates its estimation of the mean and variance of the state (e.g. reward rate). The main results shown in the reference paper (see below) is very similar for both classes of generative process. The particle filter has been implemented in the particle_filter.py

sim_example.py simulates the healthy model in a 2x2 factorial design (with two different true values for both true stochasticity and volatility). The model does not know about the true values and should learn them from observations. Initial values for both stochasticity and volatility are assumed to be the mean of their corresponding true values (and so not helpful for dissociation). This is akin to Figure 2 of the reference paper.

sim_lesion_example.py also simulates the lesions models in the 2x2 factorial design described above. This is akin to Figure 3 of the reference paper.

Dependencies:

numpy (required for computations in particle_filter.py and learning_models.py) matplotlib (required for visualization in sim_example and sim_lesion_example) seaborn (required for visualization in sim_example and sim_lesion_example) pandas (required for visualization in sim_example and sim_lesion_example)

Other languages

The MATLAB implementation of the model is also available: https://github.com/payampiray/stochasticity_volatility_learning

Author

Payam Piray (ppiray [at] princeton.edu)

healthy and lesion models for learning based on the joint estimation of stochasticity and volatility

Related tags

Overview

health-lesion-stovol

Reference

Description of the models

Description of the code

Dependencies:

Other languages

Author

Owner

The unified machine learning framework, enabling framework-agnostic functions, layers and libraries.

CD) in machine learning projectsImplementing continuous integration & delivery (CI/CD) in machine learning projects

Turns your machine learning code into microservices with web API, interactive GUI, and more.

Implementation of linesearch Optimization Algorithms in Python

Retrieve annotated intron sequences and classify them as minor (U12-type) or major (U2-type)

A repository of PyBullet utility functions for robotic motion planning, manipulation planning, and task and motion planning

A game theoretic approach to explain the output of any machine learning model.

Greykite: A flexible, intuitive and fast forecasting library

An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models

MLFlow in a Dockercontainer based on Azurite and Postgres

A Python library for choreographing your machine learning research.

Forecasting prices using Facebook/Meta's Prophet model

Microsoft contributing libraries, tools, recipes, sample codes and workshop contents for machine learning & deep learning.

MIT-Machine Learning with Python–From Linear Models to Deep Learning

PyHarmonize: Adding harmony lines to recorded melodies in Python

A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.

A Collection of Conference & School Notes in Machine Learning 🦄📝🎉

A Python Module That Uses ANN To Predict A Stocks Price And Also Provides Accurate Technical Analysis With Many High Potential Implementations!

Climin is a Python package for optimization, heavily biased to machine learning scenarios

This is an implementation of the proximal policy optimization algorithm for the C++ API of Pytorch

healthy and lesion models for learning based on the joint estimation of stochasticity and volatility

Related tags

Overview

health-lesion-stovol

Reference

Description of the models

Description of the code

Dependencies:

Other languages

Author

Owner

The unified machine learning framework, enabling framework-agnostic functions, layers and libraries.

CD) in machine learning projectsImplementing continuous integration & delivery (CI/CD) in machine learning projects

Turns your machine learning code into microservices with web API, interactive GUI, and more.

Implementation of linesearch Optimization Algorithms in Python

Retrieve annotated intron sequences and classify them as minor (U12-type) or major (U2-type)

A repository of PyBullet utility functions for robotic motion planning, manipulation planning, and task and motion planning

A game theoretic approach to explain the output of any machine learning model.

﻿Greykite: A flexible, intuitive and fast forecasting library

An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models

MLFlow in a Dockercontainer based on Azurite and Postgres

A Python library for choreographing your machine learning research.

Forecasting prices using Facebook/Meta's Prophet model

Microsoft contributing libraries, tools, recipes, sample codes and workshop contents for machine learning & deep learning.

MIT-Machine Learning with Python–From Linear Models to Deep Learning

PyHarmonize: Adding harmony lines to recorded melodies in Python

A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.

A Collection of Conference & School Notes in Machine Learning 🦄📝🎉

A Python Module That Uses ANN To Predict A Stocks Price And Also Provides Accurate Technical Analysis With Many High Potential Implementations!

Climin is a Python package for optimization, heavily biased to machine learning scenarios

This is an implementation of the proximal policy optimization algorithm for the C++ API of Pytorch

Greykite: A flexible, intuitive and fast forecasting library