Generating Korean Slogans with phonetic and structural repetition

Last update: May 23, 2022

Related tags

Overview

LexPOS_ko

Generating Korean Slogans with phonetic and structural repetition

Generating Slogans with Linguistic Features

LexPOS is a sequence-to-sequence transformer model that generates slogans with phonetic and structural repetition. For phonetic repetition, it searches for phonetically similar words with user keywords. Both the sound-alike words and user keywords become the lexical constraints while generating slogans. It also adjusts the logits distribution to implement further phonetic constraints. For structural repetition, LexPOS uses POS constraints. Users can specify any repeated phrase structure by POS tags.

Generating slogans with lexical, POS constraints

1. Code

Need to download pretrained Korean word2vec model from here and put it below phonetic_similarity/KoG2P

# clone this repo
git clone https://github.com/yeounyi/LexPOS_ko
cd LexPOS
# generate slogans 
python3 generate_slogans.py --keywords 카드,혜택 --num_beams 3 --temperature 1.2

-keywords: Keywords that you want to be included in slogans. You can enter multiple keywords, delimited by comma
-pos_inputs: You can either specify the particular list of POS tags delimited by comma, or the model will generate slogans with the most frequent syntax used in corpus. POS tags generally follow the format of Konlpy Mecab POS tags.
-num_beams: Number of beams for beam search. Default to 1, meaning no beam search.
-temperature: The value used to module the next token probabilities. Default to 1.0.
-model_path: Path to the pretrained model

2. Examples

Keyword: 카드, 혜택
POS: [NNG, JK, VV, EC, SF, NNG, JK, VV, EF]
Output: 카드를 택하면, 혜택이 바뀐다

Keyword: 안전, 항공
POS: [MM, NNG, SF, MM, NNG, SF]
Output: 새로운 공항, 안전한 항공

Keywords: 추석, 선물
POS: [NNG, JK, MM, NNG, SF, NNG, JK, MM, NNG]
Output: 추석을 앞둔 추억, 당신을 위한 선물

Model Architecture

Pretrained Model

https://drive.google.com/drive/folders/1opkhDApURnjibVYmmhj5bqLTWy4miNe4?usp=sharing

References

https://github.com/scarletcho/KoG2P

Citation

@misc{yi2021lexpos,
  author = {Yi, Yeoun},
  title = {Generating Korean Slogans with Linguistic Constraints using Sequence-to-Sequence Transformer},
  year = {2021},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/yeounyi/LexPOS_ko}}
}

Generating Korean Slogans with phonetic and structural repetition

Related tags

Overview

LexPOS_ko

Generating Slogans with Linguistic Features

Generating slogans with lexical, POS constraints

1. Code

2. Examples

Model Architecture

Pretrained Model

References

Citation

Owner

Yeoun Yi

BERT-based Financial Question Answering System

Predict an emoji that is associated with a text

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Two-stage text summarization with BERT and BART

PyTorch implementation and pretrained models for XCiT models. See XCiT: Cross-Covariance Image Transformer

pyupbit 라이브러리를 활용하여 upbit에서 비트코인을 자동매매하는 코드입니다. 조코딩 유튜브 채널에서 자세한 강의 영상을 보실 수 있습니다.

AllenNLP integration for Shiba: Japanese CANINE model

Official implementation of Meta-StyleSpeech and StyleSpeech

Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"

Cherche (search in French) allows you to create a neural search pipeline using retrievers and pre-trained language models as rankers.

This Project is based on NLTK It generates a RANDOM WORD from a predefined list of words, From that random word it read out the word, its meaning with parts of speech , its antonyms, its synonyms

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Utilities for preprocessing text for deep learning with Keras

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

Final Project for the Intel AI Readiness Boot Camp NLP (Jan)

MiCECo - Misskey Custom Emoji Counter

Research code for the paper "Fine-tuning wav2vec2 for speaker recognition"

Question and answer retrieval in Turkish with BERT

A Plover python dictionary allowing for consistent symbol input with specification of attachment and capitalisation in one stroke.

YACLC - Yet Another Chinese Learner Corpus