Use graph-based analysis to re-classify stocks and to improve Markowitz portfolio optimization

Last update: Dec 05, 2022

Overview

Dynamic Stock Industrial Classification

Use graph-based analysis to re-classify stocks and experiment different re-classification methodologies to improve Markowitz portfolio optimization performance in the low-frequency quantitative trading context.

Note that for strategy confidentiality, many files are hidden.

Module Breakdown

This project contains the following five modules:

factor generation: compute and store factors alpha factors and risk factors for low-frequency trading;
backtest: low-frequency backtest framework;
factor combination: combine factors using ML models;
portfolio optimization: Markowitz portfolio optimization, with turnover, industrial exposure, style exposure, and various other constraints.
graph clustering: experiment different graph-based clustering on stocks.

Data

China A-Share stocks, the corresponding major index data (sz50, hs300, zz500, zz1000), and the member stock weights from 20150101 to 20211231, provided by Shanghai Probability Quantitative Investment.

Quick Start

It's very easy to use this platform!

Tips:

run each module at a time;
change config for corresponding module in respective files (file location indicated inside run.py).

To run each module, in current directory:

factor generation: python run.py gen
backtest: python run.py backtest
factor combination: python run.py comb
portfolio optimization: python run.py opt
graph clustering: python run.py cluster

Acknowledgement

Special thanks to coworkers and my best friends at Shanghai Probability Quantitative Investment: Beilei Xu, Zhongyuan Wang, Zhenghang Xie, Cong Chen, Yihao Zhou, Weilin Chen, Yuhan Tao, Wan Zheng, and many others. This project would be impossible without their data, insights, and experiences.

Use graph-based analysis to re-classify stocks and to improve Markowitz portfolio optimization

Related tags

Overview

Dynamic Stock Industrial Classification

Module Breakdown

Data

Quick Start

Acknowledgement

Owner

Sheng Yang

A general, feasible, and extensible framework for classification tasks.

RAANet: Range-Aware Attention Network for LiDAR-based 3D Object Detection with Auxiliary Density Level Estimation

Code for LIGA-Stereo Detector, ICCV'21

Dynamic Attentive Graph Learning for Image Restoration, ICCV2021 [PyTorch Code]

Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.

Official repository of my book: "Deep Learning with PyTorch Step-by-Step: A Beginner's Guide"

Code for binary and multiclass model change active learning, with spectral truncation implementation.

Code for the paper "Asymptotics of ℓ2 Regularized Network Embeddings"

You Only Hypothesize Once: Point Cloud Registration with Rotation-equivariant Descriptors

[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias

MANO hand model porting for the GraspIt simulator

Enhancing Column Generation by a Machine-Learning-BasedPricing Heuristic for Graph Coloring

This is the pytorch code for the paper Curious Representation Learning for Embodied Intelligence.

Hough Transform and Hough Line Transform Using OpenCV

Hyperbolic Procrustes Analysis Using Riemannian Geometry

Implementations of paper Controlling Directions Orthogonal to a Classifier

CTF challenges and write-ups for MicroCTF 2021.

Code for the ICCV2021 paper "Personalized Image Semantic Segmentation"

TensorFlow-based neural network library

A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV2021)