Stack based programming language that compiles to x86_64 assembly or can alternatively be interpreted in Python

Last update: May 30, 2022

Related tags

Overview

lang

lang is a simple stack based programming language written in Python. It can either be interpreted in Python, or be compiled to x86_64 assembly code using nasm. Note that the compiled executables will only run on 64 bit linux distributions since linux syscalls are used, although the Python simulation mode should work on all modern operating systems.

Installation

Clone repository
Make sure you have nasm and ld in your path as these are used for compiling programs
Install python dependcies:
- pip install pathlib
Try to run one of the example files:
- Try to simulate: ./main.py examples/fib/fib.lang simulate and
- Try to compile: ./main.py examples/dib/fib.lang compile examples/fib/fib.asm
- Try to run the compiled file: ./examples/fib/fib
Add the main.py to your system path somehow, e.g. save a bash script called lang in your ~/bin:
```
~/coding/lang/main.py "$@"
```
Then, you can do lang [program] [simulate | compile <out-file>]

Usage

Write a program with .lang extension, for example program.lang. Then you can either simulate it in Python with lang program.lang simulate. You can compile it to x86_64 assembly with lang program.lang compile program.asm. The compiled assembly code will be stored in the file specified, in this case program.asm. Additionally, an object file program.o will be generated, and the actual executable, simply named program with no extension. To run it, run ./program.

Features

lang (name not final) is a very simple stack based language, and currently does not have features you might be used to like variables, etc. Instead, you work with a stack. A program consists of a series of instructions. Instructions are separated by spaces, and newlines and excess whitespace are ignored. You can use # to type comments, anything afer a # will be ignored.

Currently the only type of data that is supported is signed integers. Nested if-else blocks and loops are supported.

Operation	Syntax	Description
`PUSH`	`int`	Push a number onto the stack, i.e. `45` pushes the number 45 onto the stack
`POP`	`pop`	Pop the top number off the stack
`ADD`	`+`	Pop the top two numbers off the stack, add them, and push the result back onto the stack
`SUB`	`-`	Pop the top two numbers `a` and `b` off the stack, subtract them (`b - a`), then push the result onto the stack
`MUL`	`*`	Pop the top two numbers `a` and `b` off the stack, multiply them, then push the result onto the stack
`DIV`	`/`	Pop the top two numbers `a` and `b` off the stack and performs integer division `b // a` on them. Then, pushes the ratio and remainder onto the stack, in that order, so the remainder is on top.
`DUMP`	`dump`	Pop the top number off the stack, and print it to standard output
`DUP`	`dup[n]`	`dup` will duplicate the top number on the stack and push it on top. `dup2` will duplicate the second number from the top and push it to the top of the stack. You can also do `dup3`, etc.
`SWAP`	`swap`	Swaps the two topmost numbers on the stack
`IF`	`if`	Peeks at the top number off the stack. If it is `0`, go to the next `else` or `end`. If it is nonzero, Go to the next instruction
`EQ`	`=`	Pops the top two numbers off the stack, and checks if they are equal. If they are, push `1` to the stack, otherwise push `0`.
`GE`	`>`	Pops the top two numbers off the stack, and checks if the second number is greater than the top number. If it is, push `1` to the stack, otherwise push `0`.
`GEQ`	`>=`	Pops the top two numbers off the stack, and checks if the second number is greater than or equal to the top number. If it is, push `1` to the stack, otherwise push `0`.
`LE`	`<`	Pops the top two numbers off the stack, and checks if the second number is less than the top number. If it is, push `1` to the stack, otherwise push `0`.
`LEQ`	`<=`	Pops the top two numbers off the stack, and checks if the second number is less than or equal to the top number. If it is, push `1` to the stack, otherwise push `0`.
`ELSE`	`else`	If `if` fails, execution will jump to the `else` if one exists
`END`	`end`	Marks the end of an if-else block or a loop
`WHILE`	`while`	Peeks at the top number on the stack. If it is nonzero, execute the code until the next `end`. Then peek at the top number again and repeat until the top number is zero, then jump to the `end`

Examples

Code examples can be found in the examples directory, but here is an example program that calculates the fibonacci numbers less than or equal to 1000 and prints them to standard output:

1 1 while
  dup dump
  swap dup2 +
  dup 1000 >= if 0 end
end

Stack based programming language that compiles to x86_64 assembly or can alternatively be interpreted in Python

Related tags

Overview

lang

Installation

Usage

Features

Examples

Owner

Christoffer Aakre

A flask application to predict the speech emotion of any .wav file.

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

SEJE is a prototype for the paper Learning Text-Image Joint Embedding for Efficient Cross-Modal Retrieval with Deep Feature Engineering.

Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2021).

Convolutional 2D Knowledge Graph Embeddings resources

Dope Wars game engine on StarkNet L2 roll-up

NLP topic mdel LDA - Gathered from New York Times website

Sequence model architectures from scratch in PyTorch

An open source framework for seq2seq models in PyTorch.

The simple project to separate mixed voice (2 clean voices) to 2 separate voices.

profile tools for pytorch nn models

Unsupervised text tokenizer for Neural Network-based text generation.

ACL'22: Structured Pruning Learns Compact and Accurate Models

Watson Natural Language Understanding and Knowledge Studio

Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.

CMeEE 数据集医学实体抽取

BookNLP, a natural language processing pipeline for books

PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI

Korean Sentence Embedding Repository

Official code for Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset