CRISP: Critical Path Analysis of Microservice Traces

This repo contains code to compute and present critical path summary from Jaeger microservice traces. To use first collect the microservice traces of a specific endpoint in a directory (say traces). Let the traces be for OP operation and SVC service (these are Jaeger termonologies). python3 process.py --operationName OP --serviceName SVC -t <path to trace> -o . --parallelism 8 will produce the critical path summary using 8 concurrent processes. The summary will be output in the current directory as an HTML file with a heatmap, flamegraph, and summary text in criticalPaths.html. It will also produce three flamegraphs flame-graph-*.svg for three different percentile values.

The script accepts the following options:

python3 process.py --help
usage: process.py [-h] -a OPERATIONNAME -s SERVICENAME [-t TRACEDIR] [--file FILE] -o OUTPUTDIR
                  [--parallelism PARALLELISM] [--topN TOPN] [--numTrace NUMTRACE] [--numOperation NUMOPERATION]

optional arguments:
  -h, --help            show this help message and exit
  -a OPERATIONNAME, --operationName OPERATIONNAME
                        operation name
  -s SERVICENAME, --serviceName SERVICENAME
                        name of the service
  -t TRACEDIR, --traceDir TRACEDIR
                        path of the trace directory (mutually exclusive with --file)
  --file FILE           input path of the trace file (mutually exclusivbe with --traceDir)
  -o OUTPUTDIR, --outputDir OUTPUTDIR
                        directory where output will be produced
  --parallelism PARALLELISM
                        number of concurrent python processes.
  --topN TOPN           number of services to show in the summary
  --numTrace NUMTRACE   number of traces to show in the heatmap
  --numOperation NUMOPERATION
                        number of operations to show in the heatmap

CRISP: Critical Path Analysis of Microservice Traces

Related tags

Overview

CRISP: Critical Path Analysis of Microservice Traces

Owner

Uber Research

Multiple Pairwise Comparisons (Post Hoc) Tests in Python

DenseClus is a Python module for clustering mixed type data using UMAP and HDBSCAN

Functional tensors for probabilistic programming

Integrate bus data from a variety of sources (batch processing and real time processing).

Pipeline and Dataset helpers for complex algorithm evaluation.

A Python package for the mathematical modeling of infectious diseases via compartmental models

A real-time financial data streaming pipeline and visualization platform using Apache Kafka, Cassandra, and Bokeh.

Shot notebooks resuming the main functions of GeoPandas

ForecastGA is a Python tool to forecast Google Analytics data using several popular time series models.

Python Kalman filtering and optimal estimation library. Implements Kalman filter, particle filter, Extended Kalman filter, Unscented Kalman filter, g-h (alpha-beta), least squares, H Infinity, smoothers, and more. Has companion book 'Kalman and Bayesian Filters in Python'.

A Python package for modular causal inference analysis and model evaluations

A data analysis using python and pandas to showcase trends in school performance.

Dbt-core - dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

PyTorch implementation for NCL (Neighborhood-enrighed Contrastive Learning)

nrgpy is the Python package for processing NRG Data Files

Analysis of a dataset of 10000 passwords to find common trends and mistakes people generally make while setting up a password.

A collection of learning outcomes data analysis using Python and SQL, from DQLab.

A distributed block-based data storage and compute engine

DaCe is a parallel programming framework that takes code in Python/NumPy and other programming languages

Creating a statistical model to predict 10 year treasury yields