VevestaX is an open source Python package for ML Engineers and Data Scientists.

Last update: Dec 14, 2022

Overview

VevestaX

Track failed and successful experiments as well as features.

VevestaX is an open source Python package for ML Engineers and Data Scientists. It includes modules for tracking features sourced from data, engineered features, and variables. The output is an Excel file with three tabs: data sourcing, feature engineering, and modelling. It tracks these values in a Jupyter notebook.

How to install the library:

$ pip install vevestaX

How to import the library and create the object

How to extract features present in the input data

How to extract engineered features

How to track variables used in the modelling section of the code

How to dump the features and modelling variables in an xlsx file
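
As a rough illustration of the workflow outlined by the steps above, the following sketch uses a small stand-in class written in plain Python. It is not VevestaX's code or API: the class `TinyTracker`, its method names, and the idea of passing a namespace dictionary to `start` and `end` are all assumptions made for this example. It returns a dictionary keyed by the three tab names instead of writing an xlsx file.

```python
class TinyTracker:
    """Hypothetical stand-in for illustration only; not part of VevestaX."""

    def __init__(self):
        self.source_columns = []
        self.engineered_columns = []
        self.variables = {}
        self._names_at_start = set()

    def set_source(self, rows):
        # Record the column names of the raw input (each row is a dict).
        self.source_columns = sorted({key for row in rows for key in row})

    def set_engineered(self, rows):
        # Only columns absent from the raw input count as engineered features.
        columns = sorted({key for row in rows for key in row})
        self.engineered_columns = [
            c for c in columns if c not in self.source_columns
        ]

    def start(self, namespace):
        # Remember which names already exist before the modelling section.
        self._names_at_start = set(namespace)

    def end(self, namespace):
        # Keep new names that hold simple values (numbers, strings, booleans).
        for name, value in namespace.items():
            is_new = name not in self._names_at_start
            is_simple = isinstance(value, (int, float, str, bool))
            if is_new and is_simple and not name.startswith("_"):
                self.variables[name] = value

    def dump(self):
        # One entry per tab of the report described in the overview.
        return {
            "data sourcing": self.source_columns,
            "feature engineering": self.engineered_columns,
            "modelling": dict(self.variables),
        }


raw = [{"age": 31, "income": 52000}, {"age": 45, "income": 61000}]
engineered = [
    dict(row, income_per_year_of_age=row["income"] / row["age"]) for row in raw
]

tracker = TinyTracker()
tracker.set_source(raw)
tracker.set_engineered(engineered)

scope = {}                      # stands in for the notebook's variables
tracker.start(scope)
scope["learning_rate"] = 0.1
scope["epochs"] = 20
tracker.end(scope)

report = tracker.dump()
print(report)
```

Comparing the names present before and after the modelling section is only one possible way to detect tracked variables; the library itself may work differently.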

For additional features, explore our tool at www.vevesta.com

Comments

Create a tab in the Excel file created using V.dump. The tab will contain a random set of rows from the input data (pandas DataFrame)

Create a tab named "data" in the Excel file. This tab will contain a randomized snapshot of the input data read from the input file. The snapshot will be extracted from the data frame assigned via V.ds = df.
Labels: enhancement, good first issue

Opened by Priyanka-Vevesta
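
The randomized snapshot requested in this issue could be sketched roughly as follows. This is a minimal standard-library example, not the library's implementation: the function name `snapshot_rows`, the `sample_size` and `seed` parameters, and the list-of-dicts representation of the input data are assumptions made for illustration. With an actual pandas DataFrame, `DataFrame.sample` offers the same kind of row sampling.

```python
import random


def snapshot_rows(rows, sample_size, seed=None):
    """Return up to sample_size rows chosen at random without replacement.

    Hypothetical helper for illustration; not part of VevestaX.
    """
    rng = random.Random(seed)          # a seed makes the snapshot repeatable
    k = min(sample_size, len(rows))    # never ask for more rows than exist
    return rng.sample(rows, k)


rows = [{"id": i, "value": i * i} for i in range(10)]
snapshot = snapshot_rows(rows, 3, seed=0)
print(snapshot)
```

Capping the sample size at the number of available rows avoids an error on small inputs.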

Releases (v6.8.2)

v6.8.2 (Sep 3, 2022)
Simplified the library interface.
vevestaX-6.8.2-py3-none-any.whl (16.26 KB)

v6.7.0 (Jul 13, 2022)
vevestaX-6.7.0-py3-none-any.whl (16.02 KB)

v6.5.3 (Jul 3, 2022)
vevestaX-6.5.3-py3-none-any.whl (15.52 KB)

v6.5.2 (Jul 1, 2022)
vevestaX-6.5.2-py3-none-any.whl (15.51 KB)

v6.5.1 (Jul 1, 2022)
vevestaX-6.5.1-py2.py3-none-any.whl (15.67 KB)

v6.3.0 (Jun 28, 2022)
Added integration with GitHub.
vevestaX-6.3.0-py2.py3-none-any.whl (15.42 KB)

v5.5.0 (Jun 2, 2022)
3D plots were generated.

v5.4.0 (May 19, 2022)
Added box plots for numeric data.
vevestaX-5.4.0-py3-none-any.whl (13.22 KB)

v5.3.0 (May 18, 2022)
Added the following values to the profiling report: Kurtosis, Skewness, Outliers, Outliers (%), Median, Mode, Q1 quantile, Q2 quantile, Q3 quantile, 100th quantile.
vevestaX-5.3.0-py3-none-any.whl (13.00 KB)

v5.2.0 (May 16, 2022)
With this release, we add another tab for data profiling. The data profile calculates the following values for each variable: Distinct, Distinct (%), Missing, Missing (%), Infinite, Infinite (%), Mean, Minimum, Maximum, Zeros, Zeros (%), Negative, Negative (%), Total Memory size.
vevestaX-5.2.0-py3-none-any.whl (12.66 KB)

pysparkCorrelation (May 11, 2022)
vevestaX-5.1.0-py3-none-any.whl (12.25 KB)

pysparkIntegration (May 8, 2022)
Integrated with PySpark.
vevestaX-5.0.0-py3-none-any.whl (11.99 KB)

updatedLibraryDependency (Apr 12, 2022)
vevestaX-3.3.0-py3-none-any.whl (11.44 KB)

updatedDependencies (Apr 11, 2022)
vevestaX-3.0.0-py3-none-any.whl (11.43 KB)

colab/kaggle (Apr 7, 2022)
vevestaX-2.9.0-py3-none-any.whl (11.16 KB)

majorbugfix (Apr 3, 2022)
vevestaX-2.8.0-py3-none-any.whl (10.51 KB)

EDA (Apr 2, 2022)
vevestaX-2.7.0-py3-none-any.whl (10.50 KB)

EDA_extended (Apr 1, 2022)
vevestaX-2.6.0-py3-none-any.whl (10.35 KB)

updatedContent (Mar 27, 2022)
vevestaX-2.5.0-py3-none-any.whl (9.91 KB)

messagesUpdated (Mar 26, 2022)
vevestaX-2.3.0-py3-none-any.whl (9.77 KB)

correlation-plot (Mar 23, 2022)
Added EDA-correlation to the output.
vevestaX-2.1.0-py3-none-any.whl (9.69 KB)

performance-plots (Mar 9, 2022)

mlops (Nov 3, 2021)
Library works with Spyder.
vevestaX-1.0.0-py3-none-any.whl (7.10 KB)
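
To give a feel for the profiling values named in the v5.2.0 and v5.3.0 release notes, here is a minimal sketch that computes a handful of them for one numeric column using only the Python standard library. It is an illustration under stated assumptions, not the library's implementation: the function name `profile_column` is hypothetical, missing values are assumed to be represented by `None`, and the quartiles use the default method of `statistics.quantiles`, which may differ from the method VevestaX uses.

```python
import math
import statistics


def profile_column(values):
    """Compute a few profile statistics for one numeric column.

    Hypothetical helper for illustration; not part of VevestaX.
    Missing entries are assumed to be None.
    """
    total = len(values)
    present = [v for v in values if v is not None]
    finite = [v for v in present if not math.isinf(v)]
    missing = total - len(present)
    # Quartile cut points (needs at least two finite values).
    q1, q2, q3 = statistics.quantiles(finite, n=4)
    return {
        "Distinct": len(set(present)),
        "Missing": missing,
        "Missing (%)": 100 * missing / total,
        "Infinite": len(present) - len(finite),
        "Mean": statistics.mean(finite),
        "Minimum": min(finite),
        "Maximum": max(finite),
        "Zeros": sum(1 for v in finite if v == 0),
        "Negative": sum(1 for v in finite if v < 0),
        "Median": statistics.median(finite),
        "Q1 quantile": q1,
        "Q2 quantile": q2,
        "Q3 quantile": q3,
    }


column = [4, 0, -2, None, 7, 7, float("inf")]
profile = profile_column(column)
for name, value in profile.items():
    print(f"{name}: {value}")
```

Infinite values are counted separately and then excluded, so that the mean, minimum, and maximum stay finite.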

Owner

Vevesta
