A Python framework for transforming natural language questions into queries in a database query language.

Last update: Dec 18, 2022

Overview

  __ _ _   _  ___ _ __  _   _
 / _` | | | |/ _ \ '_ \| | | |
| (_| | |_| |  __/ |_) | |_| |
 \__, |\__,_|\___| .__/ \__, |
    |_|          |_|    |___/

What's quepy?

Quepy is a Python framework for transforming natural language questions into queries in a database query language. It can be easily customized for different kinds of natural language questions and database queries, so with little coding you can build your own system for natural language access to your database.

Currently Quepy supports the SPARQL and MQL query languages. We plan to extend it to other database query languages.

An example

To illustrate what you can do with quepy, we included an example application that accesses DBpedia contents via its SPARQL endpoint.

You can try the example online here: Online demo

Or, you can try the example yourself by doing:

python examples/dbpedia/main.py "Who is Tom Cruise?"

And it will output something like this:

SELECT DISTINCT ?x1 WHERE {
    ?x0 rdf:type foaf:Person.
    ?x0 rdfs:label "Tom Cruise"@en.
    ?x0 rdfs:comment ?x1.
}

Thomas Cruise Mapother IV, widely known as Tom Cruise, is an...

The transformation from natural language to SPARQL is done by first using a special form of regular expressions that match on the lemmas and part-of-speech tags of the words in the question:

from refo import Group, Plus, Question
from quepy.parsing import Lemma, Pos
person_name = Group(Plus(Pos("NNP")), "person_name")
regex = Lemma("who") + Lemma("be") + person_name + Question(Pos("."))

And then using a convenient way to express semantic relations:

person = IsPerson() + HasKeyword(person_name)
definition = DefinitionOf(person)
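
To make the word-level pattern above more concrete, here is a minimal, self-contained sketch of what matching over tagged words could look like. It does not use quepy or refo and is not how either library is implemented; the `Word` type, the `match_who_is` function, and the hand-tagged input are all assumptions made for illustration (quepy obtains lemmas and part-of-speech tags from a tagger rather than from hand-written data).

```python
# Illustrative only: a hand-rolled matcher over (token, lemma, POS) triples.
# Word and match_who_is are hypothetical names, not part of quepy or refo.
from typing import List, NamedTuple, Optional


class Word(NamedTuple):
    token: str
    lemma: str
    pos: str


def match_who_is(words: List[Word]) -> Optional[str]:
    """Mimic Lemma("who") + Lemma("be") + Plus(Pos("NNP")) + Question(Pos("."))."""
    if len(words) < 3:
        return None
    if words[0].lemma != "who" or words[1].lemma != "be":
        return None

    # Plus(Pos("NNP")): consume one or more proper nouns.
    i = 2
    name_tokens = []
    while i < len(words) and words[i].pos == "NNP":
        name_tokens.append(words[i].token)
        i += 1
    if not name_tokens:
        return None

    # Question(Pos(".")): an optional sentence-final punctuation mark.
    if i < len(words) and words[i].pos == ".":
        i += 1

    # The pattern must cover the whole question.
    if i != len(words):
        return None
    return " ".join(name_tokens)


# Hand-tagged input (an assumption; a real system would run a tagger).
tagged = [
    Word("Who", "who", "WP"),
    Word("is", "be", "VBZ"),
    Word("Tom", "tom", "NNP"),
    Word("Cruise", "cruise", "NNP"),
    Word("?", "?", "."),
]
matched_name = match_who_is(tagged)
print(matched_name)
```

Matching on lemmas rather than surface forms is what lets a single pattern cover "Who is", "Who was" and "Who are".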

The rest of the transformation is handled automatically by the framework, which finally produces this SPARQL:

SELECT DISTINCT ?x1 WHERE {
    ?x0 rdf:type foaf:Person.
    ?x0 rdfs:label "Tom Cruise"@en.
    ?x0 rdfs:comment ?x1.
}
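
As a rough picture of that last step, the sketch below renders a list of subject/predicate/object triples into the query text shown above. It is only an illustration under stated assumptions: `render_sparql` and `who_is_triples` are hypothetical helpers, not quepy functions, and the variable names `?x0` and `?x1` are simply copied from the example output.

```python
# Illustrative only: turn triples into SPARQL text. These helpers are
# hypothetical and do not reflect quepy's internal query generation.
from typing import List, Tuple

Triple = Tuple[str, str, str]


def who_is_triples(person_name: str) -> List[Triple]:
    """Triples for 'a person with this label, and that person's comment'."""
    # Note: person_name is inserted verbatim; a real generator would have
    # to escape quotes and backslashes inside the literal.
    return [
        ("?x0", "rdf:type", "foaf:Person"),
        ("?x0", "rdfs:label", f'"{person_name}"@en'),
        ("?x0", "rdfs:comment", "?x1"),
    ]


def render_sparql(target: str, triples: List[Triple]) -> str:
    """Render a SELECT DISTINCT query with one triple pattern per line."""
    lines = [f"SELECT DISTINCT {target} WHERE {{"]
    for subject, predicate, obj in triples:
        lines.append(f"    {subject} {predicate} {obj}.")
    lines.append("}")
    return "\n".join(lines)


sparql_query = render_sparql("?x1", who_is_triples("Tom Cruise"))
print(sparql_query)
```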

Using a very similar procedure, you could generate an MQL query for the same question, obtaining:

[{
    "/common/topic/description": [{}],
    "/type/object/name": "Tom Cruise",
    "/type/object/type": "/people/person"
}]
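
Since an MQL query is a JSON document, the same information can be expressed as plain Python data and serialized with the standard `json` module. The sketch below is an assumption-laden illustration, not quepy's MQL generator; `who_is_mql` is a hypothetical helper, and the property paths are copied from the query above.

```python
# Illustrative only: build the MQL query above as Python data and serialize it.
# who_is_mql is a hypothetical helper, not a quepy function.
import json


def who_is_mql(person_name: str) -> str:
    """Return an MQL query asking for the description of a named person."""
    query = [{
        # [{}] asks the service to fill in the description values.
        "/common/topic/description": [{}],
        "/type/object/name": person_name,
        "/type/object/type": "/people/person",
    }]
    return json.dumps(query, indent=4)


mql_query = who_is_mql("Tom Cruise")
print(mql_query)
```

Building the query as data and letting `json.dumps` handle quoting avoids the escaping problems that come with assembling the text by hand.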

Installation

You need to have docopt and numpy installed. Other than that, you can just type:

pip install quepy

You can get more details on the installation here:

http://quepy.readthedocs.org/en/latest/installation.html
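
If you are unsure whether the two prerequisites are already present, a quick check along the following lines may help. This is only a convenience sketch using the standard library; `missing_modules` is a hypothetical helper and is not shipped with quepy.

```python
# Illustrative only: report which prerequisite modules cannot be imported.
# missing_modules is a hypothetical helper, not part of quepy.
import importlib.util
from typing import Iterable, List


def missing_modules(names: Iterable[str]) -> List[str]:
    """Return the top-level module names that are not importable here."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_modules(["docopt", "numpy"])
if missing:
    print("Install these before quepy:", ", ".join(missing))
else:
    print("docopt and numpy are available.")
```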

Learn more

You can find a tutorial here:

http://quepy.readthedocs.org/en/latest/tutorial.html

And the full documentation here:

http://quepy.readthedocs.org/

Join our mailing list

Contribute!

Want to help develop quepy? Welcome aboard! Find us at http://groups.google.com/group/quepy

Owner

Machinalis
