Python binding for Morfologik

Morfologik is Polish morphological analyzer. For more information see http://github.com/morfologik/morfologik-stemming/ and http://http://www.morfologik.blogspot.com/

Requirements

This binding works with Python 2 and Python 3.

Installation

Install it from pip

pip install pyMorfologik

or directly from github

git clone https://github.com/dmirecki/pyMorfologik.git

Usage

Now, only simple stems are supported:

>>> from pymorfologik import Morfologik
>>> from pymorfologik.parsing import ListParser
>>>
>>> parser = ListParser()
>>> stemmer = Morfologik()
>>> stemmer.stem(['Ala ma kota'], parser)
[(u'Ala',
  {u'Al': [u'subst:sg:acc:m1+subst:sg:gen:m1'],
   u'Ala': [u'subst:sg:nom:f'],
   u'Alo': [u'subst:sg:acc:m1+subst:sg:gen:m1']}),
 (u'ma',
  {u'mieć': [u'verb:fin:sg:ter:imperf:refl.nonrefl'],
   u'mój': [u'adj:sg:nom.voc:f:pos']}),
 (u'kota', {u'kot': [u'subst:sg:acc:m1'], u'kota': [u'subst:sg:nom:f']})]

Acknowledgements

This repo is based on Morfologik, a great contribution of Marcin Miłowski (http://marcinmilkowski.pl) and Dawid Weiss (http://www.dawidweiss.com).

Contributions

Damian Mirecki

Adrian Bohdanowicz

pyMorfologik MorfologikpyMorfologik - Python binding for Morfologik.

Related tags

Overview

Python binding for Morfologik

Requirements

Installation

Usage

Acknowledgements

Contributions

Owner

Damian Mirecki

translate using your voice

Google AI 2018 BERT pytorch implementation

Code for text augmentation method leveraging large-scale language models

Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021)

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

A Chinese to English Neural Model Translation Project

Officile code repository for "A Game-Theoretic Perspective on Risk-Sensitive Reinforcement Learning"

Neural-Machine-Translation - Implementation of revolutionary machine translation models

Automated question generation and question answering from Turkish texts using text-to-text transformers

Words-per-minute - A terminal app written in python utilizing the curses module that tests the user's ability to type

Chinese Pre-Trained Language Models (CPM-LM) Version-I

Chinese NER with albert/electra or other bert descendable model (keras)

GPT-Code-Clippy (GPT-CC) is an open source version of GitHub Copilot, a language model

NLP codes implemented with Pytorch (w/o library such as huggingface)

Experiments in converting wikidata to ftm

A website which allows you to play with the GPT-2 transformer

Natural Language Processing Tasks and Examples.

Chinese segmentation library

Python package for performing Entity and Text Matching using Deep Learning.

An open source library for deep learning end-to-end dialog systems and chatbots.