PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Last update: Nov 12, 2021


Abstract

NLP applications for code-mixed (CM) or mix-lingual text have gained significant momentum recently, mainly because language mixing is prevalent in social media communication in multilingual societies such as India, Mexico, Europe, and parts of the USA. Word embeddings are basic building blocks of any NLP system today, yet word embeddings for CM languages remain unexplored territory. The major bottleneck for CM word embeddings is switching points, the locations where the language switches. These locations lack context, and statistical systems fail to model this phenomenon because of the high variance in the seen examples. In this paper we present our initial observations on applying switching point based positional encoding techniques for CM language, specifically Hinglish (Hindi-English). Results are only marginally better than SOTA, but it is evident that positional encoding could be an effective way to train position-sensitive language models for CM text.
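To make the idea of a switching point concrete, here is a minimal sketch. It is not the authors' implementation. It assumes each token already carries a language tag ("HI" or "EN"), and the helper names `switching_points` and `switch_relative_positions` are hypothetical. The second helper shows one possible way a position index could be made relative to switching points, by restarting the count at every switch. The encoding actually used in PESTO may differ.

```python
from typing import List


def switching_points(lang_tags: List[str]) -> List[int]:
    """Mark with 1 every token whose language differs from the previous token's."""
    flags = []
    for i, tag in enumerate(lang_tags):
        # The first token has no predecessor, so it is never a switching point.
        is_switch = i > 0 and tag != lang_tags[i - 1]
        flags.append(1 if is_switch else 0)
    return flags


def switch_relative_positions(lang_tags: List[str]) -> List[int]:
    """Assumed scheme: position index that restarts at 0 at every switching point."""
    positions = []
    offset = 0
    for i, tag in enumerate(lang_tags):
        if i > 0 and tag != lang_tags[i - 1]:
            offset = 0  # language changed, restart the count
        positions.append(offset)
        offset += 1
    return positions


# Illustrative Hinglish sentence with hand-assigned language tags.
tokens = ["main", "kal", "movie", "dekhne", "gaya", "tha"]
tags = ["HI", "HI", "EN", "HI", "HI", "HI"]

print(switching_points(tags))
print(switch_relative_positions(tags))
```

Indices of this kind could then be fed to a learned or sinusoidal embedding table in place of the usual absolute positions, which is the general direction the abstract describes.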

PESTO Architecture

Switch Point Attention

If you find this useful, please cite our paper below:

@inproceedings{ali-etal-relative,
title = {PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages},
author = {Mohsin Ali and Kandukuri Sai Teja and Sumanth Manduru and Parth Patwa and Amitava Das},
booktitle = {Proceedings of the AAAI Conference on Artificial Intelligence},
year = {2022}
}


Owner

Mohsin Ali, Mohammed
