Useful tool for inserting DataFrames into the Excel sheet.

Last update: Feb 16, 2022

Related tags

Data Analysis pycellframe

Overview

PyCellFrame

Insert Pandas DataFrames into the Excel sheet with a bunch of conditions

Install

pip install pycellframe

Usage

Examples

Let's suppose that we have an Excel file named "numbers.xlsx" with the sheet named "Dictionary" in which we would like to insert the pandas.DataFrame.

Import pandas and create an example DataFrame (which will be inserted into the Excel sheet):

import pandas as pd


ex = {
    'Num': [1, 2, 3, 4],
    'AfterFirstBlankCol': 'AfterFirstBlank',
    'Descr': ['One', 'Two', 'Three', 'Four'],
    'AfterSecondBlankCol': 'AfterSecondBlank.',
    'Squared': [1, 4, 9, 16],
    'Binary:': ['1', '10', '11', '100']
}

df = pd.DataFrame(ex)

Import openpyxl.load_workbook and open numbers.xlsx - Our Excel workbook;
Get - Dictionary our desired sheet:

from openpyxl import load_workbook


workbook = load_workbook('numbers.xlsx')
worksheet = workbook['Dictionary']

Functions

1. `incell_style(cell_src, cell_dst)`

Let's say, we have a cell in Excel Dictionary sheet that we would like to copy the style from, and it is O3;
Let O4 be our destination cell:

NOTE: If we wanted to copy that style to more than one cell, we would simply use the loop depending on the locations of the destination cells.

from pycellframe import incell_style


incell_style(cell_src=worksheet['O3'], cell_dst=worksheet['O4'])

2. `sheet_to_sheet(filename_sheetname_src, filename_sheetname_dst, calculated)`

Let's say that we have two Excel files, and we need specific sheet from one file to be completely copied to another file's specific sheet;
filename_sheetname_src is the parameter for one file -> sheet the data to be copied from (tuple(['FILENAME_SRC', 'SHEETNAME_SRC']));
worksheet_dst is the parameter for the destination Worksheet the data to be copied to (openpyxl.worksheet.worksheet.Worksheet);
Let's assume that we have file_src.xlsx as src file and for worksheet_src we can use its CopyThisSheet sheet.
We can use output.xlsx -> CopyToThisSheet sheet as the destination worksheet, for which we already declared the Workbook object above.

NOTE: We are assuming that we need all the formulas (where available) from the source sheet, not calculated data, so we set calculated parameter to False

from pycellframe import sheet_to_sheet


worksheet_to = workbook['CopyToThisSheet']

sheet_to_sheet(filename_sheetname_src=('file_src.xlsx', 'CopyThisSheet'),
               worksheet_dst=worksheet_to,
               calculated=False)

3. `incell_frame(worksheet, dataframe, col_range, row_range, num_str_cols, skip_cols, headers)`

From our package pycellframe import function incell_frame;
Insert ex - DataFrame into our sheet twice - with and without conditions:

from pycellframe import incell_frame


# 1 - Simple insertion
incell_frame(worksheet=worksheet, dataframe=df)

# 2 - Insertion with some conditions
incell_frame(worksheet=worksheet,
             dataframe=df,
             col_range=(3, 0),
             row_range=(6, 8),
             num_str_cols=['I'],
             skip_cols=['D', 'F'],
             headers=True)

In the first insertion, we did not give our function any arguments, which means the DataFrame ex will be inserted into the Dictionary sheet in the area A1:F4 (without the headers).

However, with the second insertion we define some conditions:

col_range=(3, 0) - This means that insertion will be started at the Excel column with the index 3 (column C) and will not be stopped until the very end, since we gave 0 as the second element of the tuple
row_range=(6, 8) - Only in between these rows (in Excel) will the DataFrame data be inserted, which means that only the first row (since the headers is set to True) from ex will be inserted into the sheet
num_str_cols=['F'] - Another condition here is to not convert Binary column values to int. If we count, this column will be inserted in the Excel column F, so we tell the function to leave the values in it as string
skip_cols=['D', 'F'] - D and F columns in Excel will be skipped and since our worksheet was blank in the beginning, these columns will be blank (that is why I named the columns in the DataFrame related names)
headers=True - This time, the DataFrame columns will be inserted, too, so the overall insertion area would be C6:J8

For really detailed description of the parameters, please see:

incell_frame.__docs__
sheet_to_sheet.__docs__
incell_style.__docs__

Finally, let's save our changes to a new Excel file:

workbook.save('output.xlsx')

Full Code

import pandas as pd
from openpyxl import load_workbook
from pycellframe import incell_style, \
                        incell_frame, \
                        sheet_to_sheet


ex = {
    'Num': [1, 2, 3, 4],
    'AfterFirstBlankCol': 'AfterFirstBlank',
    'Descr': ['One', 'Two', 'Three', 'Four'],
    'AfterSecondBlankCol': 'AfterSecondBlank.',
    'Squared': [1, 4, 9, 16],
    'Binary:': ['1', '10', '11', '100']
}

df = pd.DataFrame(ex)

workbook = load_workbook('numbers.xlsx')
worksheet = workbook['Dictionary']


# Copy the cell style
incell_style(cell_src=worksheet['O3'], cell_dst=worksheet['O4'])


# Copy the entire sheet
worksheet_to = workbook['CopyToThisSheet']

sheet_to_sheet(filename_sheetname_src=('file_src.xlsx', 'CopyThisSheet'),
               worksheet_dst=worksheet_to,
               calculated=False)


# Insert DataFrame into the sheet

## 1 - Simple insertion
incell_frame(worksheet=worksheet, dataframe=df)

## 2 - Insertion with some conditions
incell_frame(worksheet=worksheet,
             dataframe=df,
             col_range=(3, 0),
             row_range=(6, 8),
             num_str_cols=['I'],
             skip_cols=['D', 'F'],
             headers=True)

workbook.save('output.xlsx')

Useful tool for inserting DataFrames into the Excel sheet.

Related tags

Overview

PyCellFrame

Install

Usage

Examples

Functions

1. `incell_style(cell_src, cell_dst)`

2. `sheet_to_sheet(filename_sheetname_src, filename_sheetname_dst, calculated)`

3. `incell_frame(worksheet, dataframe, col_range, row_range, num_str_cols, skip_cols, headers)`

For really detailed description of the parameters, please see:

Full Code

Owner

Luka Sosiashvili

Tkinter Izhikevich Neuron Model With Python

Fit models to your data in Python with Sherpa.

A collection of robust and fast processing tools for parsing and analyzing web archive data.

BioMASS - A Python Framework for Modeling and Analysis of Signaling Systems

Spaghetti: an open-source Python library for the analysis of network-based spatial data

small package with utility functions for analyzing (fly) calcium imaging data

Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

Example Of Splunk Search Query With Python And Splunk Python SDK

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

EOD Historical Data Python Library (Unofficial)

Python implementation of Principal Component Analysis

Single machine, multiple cards training; mix-precision training; DALI data loader.

A real-time financial data streaming pipeline and visualization platform using Apache Kafka, Cassandra, and Bokeh.

Two phase pipeline + StreamlitTwo phase pipeline + Streamlit

MoRecon - A tool for reconstructing missing frames in motion capture data.

Deep universal probabilistic programming with Python and PyTorch

Data Science Environment Setup in single line

Wafer Fault Detection - Wafer circleci with python

PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)

Ejercicios Panda usando Pandas

Useful tool for inserting DataFrames into the Excel sheet.

Related tags

Overview

PyCellFrame

Install

Usage

Examples

Functions

1. incell_style(cell_src, cell_dst)

2. sheet_to_sheet(filename_sheetname_src, filename_sheetname_dst, calculated)

3. incell_frame(worksheet, dataframe, col_range, row_range, num_str_cols, skip_cols, headers)

For really detailed description of the parameters, please see:

Full Code

Owner

Luka Sosiashvili

Tkinter Izhikevich Neuron Model With Python

Fit models to your data in Python with Sherpa.

A collection of robust and fast processing tools for parsing and analyzing web archive data.

BioMASS - A Python Framework for Modeling and Analysis of Signaling Systems

Spaghetti: an open-source Python library for the analysis of network-based spatial data

small package with utility functions for analyzing (fly) calcium imaging data

Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

Example Of Splunk Search Query With Python And Splunk Python SDK

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

EOD Historical Data Python Library (Unofficial)

Python implementation of Principal Component Analysis

Single machine, multiple cards training; mix-precision training; DALI data loader.

A real-time financial data streaming pipeline and visualization platform using Apache Kafka, Cassandra, and Bokeh.

Two phase pipeline + StreamlitTwo phase pipeline + Streamlit

MoRecon - A tool for reconstructing missing frames in motion capture data.

Deep universal probabilistic programming with Python and PyTorch

Data Science Environment Setup in single line

Wafer Fault Detection - Wafer circleci with python

PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)

Ejercicios Panda usando Pandas

1. `incell_style(cell_src, cell_dst)`

2. `sheet_to_sheet(filename_sheetname_src, filename_sheetname_dst, calculated)`

3. `incell_frame(worksheet, dataframe, col_range, row_range, num_str_cols, skip_cols, headers)`