Statistics and Mathematics for Machine Learning, Deep Learning , Deep NLP

Overview

Stat4ML

Statistics and Mathematics for Machine Learning, Deep Learning , Deep NLP

This is the first course from our trio courses:

  1. Statistics Foundation for ML

https://github.com/Bellman281/Stat4ML/

  1. Introduction to Statistical Learning https://github.com/Bellman281/Intro_Statistical_Learning

  2. Advanced Statistical Learning for DL ( to be anounced)

Registration Form for cohort 2 of STAT4ML:

https://forms.gle/ZqLJLmv1K5nGVx3m7

Notes about the course:

Instructor : Omid Safarzadeh,

LinkedIn: https://www.linkedin.com/in/omidsafarzadeh/

IG : @deepdatascientists

Course Text Book: Statistical Inference 2nd Edition by George Casella (Author), Roger L. Berger (Author) :

https://www.amazon.com/Statistical-Inference-George-Casella-dp-0534243126/dp/0534243126/ref=mt_other?_encoding=UTF8&me=&qid=

Pre Requisitives

Recall from Calculus:

    Derivative
          Chain rule
    Integral
          Techniques of Integration
          Substitution
    Integration by parts

Matrix Algebra Review:

    Matrix operations
    Matrix Multiplication
       Properties of determinants
       Inverse Matrix
       Matrix Transpose
       Properties of transpose
    Partioned Matrices
    Eigenvalues and Eigenvectors
    Matrix decomposition
       LU decomposition
       Cholesky decomposition
       QR decomposition
       SVD
    Matrix Differentiation

Course 1 :

Slide 1 : Probability Theory Foundation

 Sample Space
 Probability Theory Foundation
    Axiomatic Foundations
    The Calculus of Probabilities
 Independence
 Conditional Probability
    Bayes Theorem
 Random Variables
 Probability Function
    Distribution Functions
    Density function

Slide 2: Moments

   Moments
       Expected Value
       Variance
       Covariance and Correlation
   Moment Generating Functions
       Normal mgf
   Matrix Notation for Moments

Slide 3: Distribution Functions

   Distributions
     Discrete Distribution
       Discrete Uniform Distribution
       Binomial Distribution
       Poisson Distribution
     Continuous Distribution
       Uniform Distribution
       Exponential Distribution
       Normal Distribution
       Lognormal Distribution
       Laplace Distribution
       Beta Distribution

Slide 4: Conditional and Multivariate Distributions

Joint and Marginal Distribution
Conditional Distributions and Independence
Bivariate Transformations
Hierarchical Models and Mixture Distribution
Bivariate Normal Distribution
Multivariate Distribution

Slide 5: Convergence Concepts

Random Samples
   Sums of Random Variable from a Random Sample
Inequalities
Convergence Concepts:
   Almost Sure Convergence
   Convergence in Probability
   Convergence in Distribution
The Delta Method

Slide 6: Maximum Likelihood Estimation

Maximum Likelihood Estimation
  Motivation and the Main Ideas
  Properties of the Maximum Likelihood Estimator

Slide 7: Bayesian and posterior distribution Estimation

   Computing the posterior
   Maximum likelihood estimation (MLE)
Maximum a posteriori (MAP) estimation
   Posterior mean
   MAP properties
Bayesian linear regression
Owner
Omid Safarzadeh
Deep Learning Expert, Kaggler
Omid Safarzadeh
NLP tool to extract emotional phrase from tweets 🤩

Emotional phrase extractor Extract phrase in the given text that is used to express the sentiment. Capturing sentiment in language is important in the

Shahul ES 38 Oct 17, 2022
A deep learning-based translation library built on Huggingface transformers

DL Translate A deep learning-based translation library built on Huggingface transformers and Facebook's mBART-Large 💻 GitHub Repository 📚 Documentat

Xing Han Lu 244 Dec 30, 2022
Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS)

TOPSIS implementation in Python Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) CHING-LAI Hwang and Yoon introduced TOPSIS

Hamed Baziyad 8 Dec 10, 2022
Official implementation of MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis

MLP Singer Official implementation of MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis. Audio samples are available on our demo page.

Neosapience 103 Dec 23, 2022
Knowledge Oriented Programming Language

KoPL: 面向知识的推理问答编程语言 安装 | 快速开始 | 文档 KoPL全称 Knowledge oriented Programing Language, 是一个为复杂推理问答而设计的编程语言。我们可以将自然语言问题表示为由基本函数组合而成的KoPL程序,程序运行的结果就是问题的答案。目前,

THU-KEG 62 Dec 12, 2022
Suite of 500 procedurally-generated NLP tasks to study language model adaptability

TaskBench500 The TaskBench500 dataset and code for generating tasks. Data The TaskBench dataset is available under wget http://web.mit.edu/bzl/www/Tas

Belinda Li 20 May 17, 2022
We have built a Voice based Personal Assistant for people to access files hands free in their device using natural language processing.

Voice Based Personal Assistant We have built a Voice based Personal Assistant for people to access files hands free in their device using natural lang

Rushabh 2 Nov 13, 2021
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.

WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.

Google Research Datasets 740 Dec 24, 2022
Practical Machine Learning with Python

Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.

Dipanjan (DJ) Sarkar 2k Jan 08, 2023
The PyTorch based implementation of continuous integrate-and-fire (CIF) module.

CIF-PyTorch This is a PyTorch based implementation of continuous integrate-and-fire (CIF) module for end-to-end (E2E) automatic speech recognition (AS

Minglun Han 24 Dec 29, 2022
Learning Spatio-Temporal Transformer for Visual Tracking

STARK The official implementation of the paper Learning Spatio-Temporal Transformer for Visual Tracking Highlights The strongest performances Tracker

Multimedia Research 485 Jan 04, 2023
BERT score for text generation

BERTScore Automatic Evaluation Metric described in the paper BERTScore: Evaluating Text Generation with BERT (ICLR 2020). News: Features to appear in

Tianyi 1k Jan 08, 2023
A Python script that compares files in directories

compare-files A Python script that compares files in different directories, this is similar to the command filecmp.cmp(f1, f2). I made this script in

Colvin 1 Oct 15, 2021
Fake news detector filters - Smart filter project allow to classify the quality of information and web pages

fake-news-detector-1.0 Lists, lists and more lists... Spam filter list, quality keyword list, stoplist list, top-domains urls list, news agencies webs

Memo Sim 1 Jan 04, 2022
This project consists of data analysis and data visualization (done using python)of all IPL seasons from 2008 to 2019 and answering the most asked questions about the IPL.

IPL-data-analysis This project consists of data analysis and data visualization of all IPL seasons from 2008 to 2019 and answering the most asked ques

Sivateja A T 2 Feb 08, 2022
A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP

A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP

420 Dec 28, 2022
Watson Natural Language Understanding and Knowledge Studio

Material de demonstração dos serviços: Watson Natural Language Understanding e Knowledge Studio Visão Geral: https://www.ibm.com/br-pt/cloud/watson-na

Vanderlei Munhoz 4 Oct 24, 2021
Th2En & Th2Zh: The large-scale datasets for Thai text cross-lingual summarization

Th2En & Th2Zh: The large-scale datasets for Thai text cross-lingual summarization 📥 Download Datasets 📥 Download Trained Models INTRODUCTION TH2ZH (

Nakhun Chumpolsathien 5 Jan 03, 2022
hashily is a Python module that provides a variety of text decoding and encoding operations.

hashily is a python module that performs a variety of text decoding and encoding functions. It also various functions for encrypting and decrypting text using various ciphers.

DevMysT 5 Jul 17, 2022
Labelling platform for text using distant supervision

With DataQA, you can label unstructured text documents using rule-based distant supervision.

245 Aug 05, 2022