University Challenge 2021

This repository contains:

The TeX source of the technical write-up describing the University / HYPER Challenge 2021, under latex-doc/
The Python starter-kit for the competition
The Docker starter-kit for the competition with the Python starter-kit inside

Option 1: Hypergraph partitioning using Python

The Python starter-kit is located under hg_tools/. Please see the README.md file within that folder for further instructions. The partition output file is written under hg_tools/output/.

Option 2: Hypergraph partitioning using Docker

The following instructions run the Docker starter-kit in a reproducible way.

Dependencies

Install docker and docker-compose before you begin.

Datasets

Copy your datasets into the hg_tools/data/ folder.
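
The datasets used in the run commands are Matrix Market files (.mtx, optionally gzipped). As a quick sanity check before running the starter-kit, the following is a minimal sketch of loading such a file as a hypergraph using only the Python standard library. It is not the starter-kit's own loader: the function names parse_mtx_lines and load_hypergraph are hypothetical, and treating each matrix row as a hyperedge and each column as a vertex is an assumption that the competition write-up may define differently.

```python
import gzip
from pathlib import Path

# Header qualifiers for which only one triangle of the matrix is stored.
MIRRORED = {"symmetric", "skew-symmetric", "hermitian"}


def parse_mtx_lines(lines):
    """Parse Matrix Market coordinate lines into (num_rows, num_cols, hyperedges).

    Assumption: row i is hyperedge i and column j is vertex j (both 0-based here).
    """
    lines = iter(lines)
    header = next(lines, "")
    if not header.startswith("%%MatrixMarket") or "coordinate" not in header:
        raise ValueError("expected a Matrix Market coordinate file")
    mirrored = header.split()[-1].lower() in MIRRORED

    num_rows = num_cols = None
    hyperedges = {}
    for line in lines:
        fields = line.split()
        if not fields or fields[0].startswith("%"):
            continue  # skip blank lines and comments
        if num_rows is None:
            # The first non-comment line holds: rows, columns, stored entries.
            num_rows, num_cols = int(fields[0]), int(fields[1])
            continue
        row, col = int(fields[0]) - 1, int(fields[1]) - 1  # file indices are 1-based
        hyperedges.setdefault(row, set()).add(col)
        if mirrored and row != col:
            hyperedges.setdefault(col, set()).add(row)
    if num_rows is None:
        raise ValueError("missing size line")
    return num_rows, num_cols, {e: sorted(p) for e, p in sorted(hyperedges.items())}


def load_hypergraph(path):
    """Open a plain or gzipped .mtx file and parse it."""
    path = Path(path)
    opener = gzip.open if path.suffix == ".gz" else open
    with opener(path, "rt") as handle:
        return parse_mtx_lines(handle)


# Tiny inline example: 3 hyperedges over 4 vertices.
SAMPLE = """%%MatrixMarket matrix coordinate pattern general
% illustrative data, not one of the competition datasets
3 4 5
1 1
1 2
2 2
2 3
3 4
""".splitlines()

print(parse_mtx_lines(SAMPLE))
# (3, 4, {0: [0, 1], 1: [1, 2], 2: [3]})
```

With a dataset in place, load_hypergraph("hg_tools/data/sample.mtx") would return the same kind of tuple for that file.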

Build and run within a docker container

To build, type

docker-compose build

To run, type

docker-compose run hg_tools data/sample.mtx 2 0.01
# docker-compose run hg_tools data/CurlCurl_4.mtx.gz 10 0.01
# docker-compose run hg_tools data/wikipedia-20070206.mtx.gz 10 0.01

The partition output file is written under docker-output/.
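
This README does not say what the two trailing arguments in the run commands mean. Assuming they are the number of parts and the allowed imbalance (for example 2 and 0.01), the sketch below shows one way to sanity-check a partition once it has been read into a Python list. Everything here is illustrative: the function names are hypothetical, the balance rule (no part larger than (1 + epsilon) times the ceiling of n / k) and the connectivity-minus-one metric are common conventions in hypergraph partitioning rather than the confirmed competition objective, and the format of the output file is not covered.

```python
import math
from collections import Counter


def is_balanced(assignment, num_parts, epsilon):
    """Check that every part holds at most (1 + epsilon) * ceil(n / k) vertices.

    assignment[v] is the part id (0 .. num_parts - 1) of vertex v.
    """
    sizes = Counter(assignment)
    if any(part not in range(num_parts) for part in sizes):
        return False  # a vertex was assigned to a part id outside the valid range
    limit = (1 + epsilon) * math.ceil(len(assignment) / num_parts)
    return all(size <= limit for size in sizes.values())


def connectivity_minus_one(hyperedges, assignment):
    """Sum over hyperedges of (number of distinct parts touched - 1)."""
    total = 0
    for pins in hyperedges.values():
        if pins:
            total += len({assignment[v] for v in pins}) - 1
    return total


# Illustrative data: 3 hyperedges over 4 vertices, split into 2 parts.
hyperedges = {0: [0, 1], 1: [1, 2], 2: [3]}
assignment = [0, 0, 1, 1]

print(is_balanced(assignment, num_parts=2, epsilon=0.01))  # True
print(connectivity_minus_one(hyperedges, assignment))      # 1
```

Only hyperedge 1 spans both parts in this example, which is why the metric is 1.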
