Code from the paper "High-Performance Brain-to-Text Communication via Handwriting"

Last update: Jan 03, 2023

Related tags

Overview

High-Performance Brain-to-Text Communication via Handwriting

Overview

This repo is associated with this manuscript, preprint and dataset. The code can be used to run an offline reproduction of the main result: high-performance neural decoding of attempted handwriting movements. The jupyter notebooks included here implement all steps of the process, including labeling the neural data with HMMs, training an RNN to decode the neural data into sequences of characters, applying a language model to the RNN outputs, and summarizing the performance on held-out data.

Results from each step are saved to disk and used in future steps. Intermediate results and models are available with the data - download these to explore certain steps without needing to run all prior ones (except for Step 3, which you'll need to run on your own because it produces ~100 GB of files).

Results

Below are the main results from my original run of this code. Results are shown from both train/test partitions ('HeldOutTrials' and 'HeldOutBlocks') and were generaetd with this notebook. 95% confidence intervals are reported in brackets for each result.

HeldOutTrials

	Character error rate (%)	Word error rate (%)
Raw	2.78 [2.20, 3.41]	12.88 [10.28, 15.63]
Bigram LM	0.80 [0.44, 1.22]	3.64 [2.11, 5.34]
Bigram LM + GPT-2 Rescore	0.34 [0.14, 0.61]	1.97 [0.78, 3.41]

HeldOutBlocks

	Character error rate (%)	Word error rate (%)
Raw	5.32 [4.81, 5.86]	23.28 [21.27, 25.41]
Bigram LM	1.69 [1.32, 2.10]	6.10 [4.97, 7.25]
Bigram LM + GPT-2 Rescore	0.90 [0.62, 1.23]	3.21 [2.37, 4.11]

Train/Test Partitions

Following our manuscript, we use two separate train/test partitions (available with the data): 'HeldOutBlocks' holds out entire blocks of sentences that occur later in each session, while 'HeldOutTrials' holds out single sentences more uniformly.

'HeldOutBlocks' is more challenging because changes in neural activity accrue over time, thus requiring the RNN to be robust to neural changes that it has never seen before from held-out blocks. In 'HeldOutTrials', the RNN can train on other sentences that occur very close in time to each held-out sentence. For 'HeldOutBlocks' we found that training the RNN in the presence of artificial firing rate drifts improved generalization, while this was not necessary for 'HeldOutTrials'.

Dependencies

General
- python>=3.6
- tensorflow=1.15
- numpy (tested with 1.17)
- scipy (tested with 1.1.0)
- scikit-learn (tested with 0.20)
Step 1: Time Warping
- Time warped PCA
Steps 4-5: RNN Training & Inference
- Requires a GPU (calls cuDNN for the GRU layers)
Step 6: Bigram Language Model
- Kaldi
- Puigcerver's custom Kaldi decoders
- Bigram language model files
Step 7: GPT-2 Rescoring
- GPT-2 model files (1558M version)

Code from the paper "High-Performance Brain-to-Text Communication via Handwriting"

Related tags

Overview

High-Performance Brain-to-Text Communication via Handwriting

Overview

Results

HeldOutTrials

HeldOutBlocks

Train/Test Partitions

Dependencies

Owner

Francis R. Willett

IAUnet: Global Context-Aware Feature Learning for Person Re-Identification

Modified prey-predator system - Modified prey–predator model describes the rate of change for each species by adding coupling terms.

(CVPR 2022 - oral) Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry

Tackling Obstacle Tower Challenge using PPO & A2C combined with ICM.

Experiments on continual learning from a stream of pretrained models.

AI4Good project for detecting waste in the environment

Differentiable simulation for system identification and visuomotor control

Gradient Step Denoiser for convergent Plug-and-Play

CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images

Streamlit component for TensorBoard, TensorFlow's visualization toolkit

Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)

Research code for Arxiv paper "Camera Motion Agnostic 3D Human Pose Estimation"

code for paper -- "Seamless Satellite-image Synthesis"

Conflict-aware Inference of Python Compatible Runtime Environments with Domain Knowledge Graph, ICSE 2022

这是一个利用facenet和retinaface实现人脸识别的库，可以进行在线的人脸识别。

The Generic Manipulation Driver Package - Implements a ROS Interface over the robotics toolbox for Python

Pytorch Implementation of "Diagonal Attention and Style-based GAN for Content-Style disentanglement in image generation and translation" (ICCV 2021)

PyTorch Code of "Memory In Memory: A Predictive Neural Network for Learning Higher-Order Non-Stationarity from Spatiotemporal Dynamics"

Extremely simple and fast extreme multi-class and multi-label classifiers.

DilatedNet in Keras for image segmentation