Making the DAEN information accessible.

Last update: May 10, 2022

Overview

AccessibleAdverseEventNotification

Making the DAEN information accessible.

The purpose of this repository is to make the information on Australian COVID-19 adverse events accessible. The Therapeutics Goods Administration (TGA) keeps a database of adverse reactions to medications including the COVID-19 vaccines. This Database of Adverse Event Notifications (DAEN) is available to the public via this awful web interface. The most recent two weeks is never available.

The DAEN website doesn't provide information in a format that might be useful for analysis. Instead you have to scrape the information by entering each individual day and collecting the results from two tables which might span multiple pages. I've already done that and the code is here (this code isn't great, but it is good enough to get the job done).

Please be aware that the numbers reported in DAEN are probably significantly less than the actual number of adverse events and deaths. As the DAEN website states:

Adverse event reports from consumers and health professionals to the TGA are voluntary, so there is under-reporting by these groups of adverse events related to therapeutic goods in Australia. This is the same around the world.

The scraped data is found in the data directory. These files are tab separated files which you can easily import in to a spreadsheet program. All of the files are only for COVID-19 vaccines.

DAEN_webscrape_simple.txt This file shows the date (twice for reasons that made sense at the time, but don't necessarily make sense anymore), the number of cases reported that day, the number of cases with a single suspected medicine for that day, and the number of deaths reported that day.
DAEN_webscrape_medsummary.txt This file gives a daily count of each adverse event category. Please note that if one patient had multiple adverse events, then each event would be counted in the appropriate category.
DAEN_webscrape_listofreports.txt This file provides the individual reports and includes sex and age (when recorded).

Figure 1 shows some of the basic information such as number of adverse events and deaths reported each day for the COVID-19 vaccines, myocarditis, pericarditis and the more general term cardiac disorder.

Figure 1.

Figure 2 shows a histogram of reported cases of myocarditis and pericarditis from the COVID-19 vaccine. Please note that the age group 10-19 is somewhat distorted as the age 10-11 should not receive the vaccine (although there are cases of 8 year olds getting the vaccine when that should not have occurred). This age group also has a significantly lower uptake than other age groups.

Figure 2.

Figures 3 and 4 plot the reports of myocarditis by age grouped by sex or manufacturer respectively. Figures 5 and 6 are the same for pericarditis. A '-' is used where an age was not given in the report.

Figure 3.

Figure 4.

Figure 5.

Figure 6.

Figure 7 shows how the histogram for myocarditis has progressed over time.

Figure 7.

Figure 8 shows the death rate of people in Australia who contracted COVID-19. Data taken from health.gov on 1/12/2021. Bottom graph is zoomed in to 1% to see what is happening with those under the age of 60.

Figure 8.

Making the DAEN information accessible.

Related tags

Overview

AccessibleAdverseEventNotification

Owner

A tool to compare differences between dataframes and create a differences report in Excel

This is an analysis and prediction project for house prices in King County, USA based on certain features of the house

PyTorch implementation for NCL (Neighborhood-enrighed Contrastive Learning)

Processo de ETL (extração, transformação, carregamento) realizado pela equipe no projeto final do curso da Soul Code Academy.

The OHSDI OMOP Common Data Model allows for the systematic analysis of healthcare observational databases.

💬 Python scripts to parse Messenger, Hangouts, WhatsApp and Telegram chat logs into DataFrames.

An orchestration platform for the development, production, and observation of data assets.

Containerized Demo of Apache Spark MLlib on a Data Lakehouse (2022)

Validation and inference over LinkML instance data using souffle

WaveFake: A Data Set to Facilitate Audio DeepFake Detection

Pipeline to convert a haploid assembly into diploid

Functional Data Analysis, or FDA, is the field of Statistics that analyses data that depend on a continuous parameter.

Data Scientist in Simple Stock Analysis of PT Bukalapak.com Tbk for Long Term Investment

Vaex library for Big Data Analytics of an Airline dataset

Pandas and Spark DataFrame comparison for humans

Efficient matrix representations for working with tabular data

Elasticsearch tool for easily collecting and batch inserting Python data and pandas DataFrames

Python ELT Studio, an application for building ELT (and ETL) data flows.

Tools for working with MARC data in Catalogue Bridge.

[CVPR2022] This repository contains code for the paper "Nested Collaborative Learning for Long-Tailed Visual Recognition", published at CVPR 2022