Statistics and Mathematics for Machine Learning, Deep Learning , Deep NLP

Last update: Dec 29, 2022

Related tags

Text Data & NLP Stat4ML

Overview

Stat4ML

Statistics and Mathematics for Machine Learning, Deep Learning , Deep NLP

This is the first course from our trio courses:

Statistics Foundation for ML

https://github.com/Bellman281/Stat4ML/

Introduction to Statistical Learning https://github.com/Bellman281/Intro_Statistical_Learning
Advanced Statistical Learning for DL ( to be anounced)

Registration Form for cohort 2 of STAT4ML:

https://forms.gle/ZqLJLmv1K5nGVx3m7

Notes about the course:

Instructor : Omid Safarzadeh,

LinkedIn: https://www.linkedin.com/in/omidsafarzadeh/

IG : @deepdatascientists

Course Text Book: Statistical Inference 2nd Edition by George Casella (Author), Roger L. Berger (Author) :

https://www.amazon.com/Statistical-Inference-George-Casella-dp-0534243126/dp/0534243126/ref=mt_other?_encoding=UTF8&me=&qid=

Pre Requisitives

Recall from Calculus:

    Derivative
          Chain rule
    Integral
          Techniques of Integration
          Substitution
    Integration by parts

Matrix Algebra Review:

    Matrix operations
    Matrix Multiplication
       Properties of determinants
       Inverse Matrix
       Matrix Transpose
       Properties of transpose
    Partioned Matrices
    Eigenvalues and Eigenvectors
    Matrix decomposition
       LU decomposition
       Cholesky decomposition
       QR decomposition
       SVD
    Matrix Differentiation

Course 1 :

Slide 1 : Probability Theory Foundation

 Sample Space
 Probability Theory Foundation
    Axiomatic Foundations
    The Calculus of Probabilities
 Independence
 Conditional Probability
    Bayes Theorem
 Random Variables
 Probability Function
    Distribution Functions
    Density function

Slide 2: Moments

   Moments
       Expected Value
       Variance
       Covariance and Correlation
   Moment Generating Functions
       Normal mgf
   Matrix Notation for Moments

Slide 3: Distribution Functions

   Distributions
     Discrete Distribution
       Discrete Uniform Distribution
       Binomial Distribution
       Poisson Distribution
     Continuous Distribution
       Uniform Distribution
       Exponential Distribution
       Normal Distribution
       Lognormal Distribution
       Laplace Distribution
       Beta Distribution

Slide 4: Conditional and Multivariate Distributions

Joint and Marginal Distribution
Conditional Distributions and Independence
Bivariate Transformations
Hierarchical Models and Mixture Distribution
Bivariate Normal Distribution
Multivariate Distribution

Slide 5: Convergence Concepts

Random Samples
   Sums of Random Variable from a Random Sample
Inequalities
Convergence Concepts:
   Almost Sure Convergence
   Convergence in Probability
   Convergence in Distribution
The Delta Method

Slide 6: Maximum Likelihood Estimation

Maximum Likelihood Estimation
  Motivation and the Main Ideas
  Properties of the Maximum Likelihood Estimator

Slide 7: Bayesian and posterior distribution Estimation

   Computing the posterior
   Maximum likelihood estimation (MLE)
Maximum a posteriori (MAP) estimation
   Posterior mean
   MAP properties
Bayesian linear regression

Statistics and Mathematics for Machine Learning, Deep Learning , Deep NLP

Related tags

Overview

Stat4ML

Registration Form for cohort 2 of STAT4ML:

Pre Requisitives

Recall from Calculus:

Matrix Algebra Review:

Course 1 :

Slide 1 : Probability Theory Foundation

Slide 2: Moments

Slide 3: Distribution Functions

Slide 4: Conditional and Multivariate Distributions

Slide 5: Convergence Concepts

Slide 6: Maximum Likelihood Estimation

Slide 7: Bayesian and posterior distribution Estimation

Owner

Omid Safarzadeh

End-to-End Speech Processing Toolkit

🏆 • 5050 most frequent words in 109 languages

An open collection of annotated voices in Japanese language

Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing

Code for the paper: Sequence-to-Sequence Learning with Latent Neural Grammars

CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus

Code for CodeT5: a new code-aware pre-trained encoder-decoder model.

HuggingSound: A toolkit for speech-related tasks based on HuggingFace's tools

Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.

Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"

Semi-automated vocabulary generation from semantic vector models

Code for Emergent Translation in Multi-Agent Communication

In this Notebook I've build some machine-learning and deep-learning to classify corona virus tweets, in both multi class classification and binary classification.

A python package for deep multilingual punctuation prediction.

InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective

Finding Label and Model Errors in Perception Data With Learned Observation Assertions

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Language-Agnostic SEntence Representations

A python framework to transform natural language questions to queries in a database query language.

A list of NLP(Natural Language Processing) tutorials built on Tensorflow 2.0.