This repository summarized computer vision theories.

Last update: Feb 04, 2022

Related tags

Computer Vision CV_theory

Overview

CV_theory

Basic Overview

This repository summarized computer vision theories.

PSNR

mse = np.mean((img1 - img2) ** 2)
# MSE 구하는 식

PLXEL_MAX = 255.0
# 8bit MAX는 255의 값을 가짐

return 20 * math.log10(PLXEL_MAX/math.sqrt(mse))
#PSNR 구하는 식

[output]
openCV를 이용한 PSNR : 52.37698680492553
주어진 수식을 이용한 함수구현 : 52.37698680492553

Color transform

for i in range(height):
    for j in range(width):
        y2[i][j] = 0.299 * r[i][j] + 0.587 * g[i][j] + 0.114 * b[i][j]
        cb2[i][j] = (-0.172*r[i][j]) - (0.339*g[i][j]) + (0.511*b[i][j]) + 128
        cr2[i][j] = (0.511*r[i][j])- (0.428*g[i][j]) - (0.083*b[i][j]) + 128
# RGB 영상을 YCbCr로 변환 수식

for i in range(height):
    for j in range(width):
        r[i][j] = y2[i][j] + 1.371*(cr2[i][j] - 128)
        g[i][j] = y2[i][j] - 0.698*(cr2[i][j] - 128) - 0.336*(cb2[i][j] - 128)
        b[i][j] = y2[i][j] + 1.732*(cb2[i][j] - 128)
# yCbCr을 RGB 변환 수식

Filterring Smoothing

After converting the original image to Ycrcb, only the Y value was filtered with 3*3 kernels and smoothing was performed.

kernel = np.ones((3, 3), np.float32) / 9
# 3*3 커널값 저장

for i in range(5):
    Y = cv2.filter2D(Y, -1, kernel)
# 5번 필터링

Histogram equalization

height, width, channel = src.shape


hist, bins = np.histogram(Y.flatten(), 256, [0, 256])
# 이미지 히스토그램 구해주기

cdf = hist.cumsum()
# 각 멤버값을 누적하여 더한 1차원 배열 생성

cdf_m = np.ma.masked_equal(cdf, 0)
# cdf에서 값이 0인 부분  mask 처리


cdf_m = (cdf_m - cdf_m.min()) * 255 / (cdf_m.max() - cdf_m.min())
#  균일화 방정식 코드

cdf = np.ma.filled(cdf_m, 0). astype("uint8")
# mask처리된 부분을 o으로 다시 리턴

out = (np.dstack((Y, cr, cb)))
out_rgb = cv2.cvtColor(out, cv2.COLOR_YCrCb2RGB)

img2 = cdf[out_rgb]

dst -> function in cv2 , dst2 -> Self-made function

Hough Line Detection

Contributing

Let's connect 👨‍💻 and forge the future together. 😁 ✌

Check the Repositories and don't forget to give a star. 👇

⭐ From S-jooyoung

This repository summarized computer vision theories.

Related tags

Overview

CV_theory

Basic Overview

PSNR

Color transform

Filterring Smoothing

Histogram equalization

Hough Line Detection

Contributing

Owner

A real-time dolly zoom camera effect

A curated list of resources dedicated to scene text localization and recognition

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

list all open dataset about ocr.

CTPN + DenseNet + CTC based end-to-end Chinese OCR implemented using tensorflow and keras

Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation, CVPR 2020 (Oral)

Distilling Knowledge via Knowledge Review, CVPR 2021

A facial recognition device is a device that takes an image or a video of a human face and compares it to another image faces in a database.

Code for CVPR2021 paper "Learning Salient Boundary Feature for Anchor-free Temporal Action Localization"

Layout Analysis Evaluator for the ICDAR 2017 competition on Layout Analysis for Challenging Medieval Manuscripts

7th place solution

The project is an official implementation of our paper "3D Human Pose Estimation with Spatial and Temporal Transformers".

In this project we will be using the live feed coming from the webcam to create a virtual mouse with complete functionalities.

AdvancedEAST is an algorithm used for Scene image text detect, which is primarily based on EAST, and the significant improvement was also made, which make long text predictions more accurate.https://github.com/huoyijie/raspberrypi-car

The papers published in top-tier AI conferences in recent years.

Open Source Computer Vision Library

Create single line SVG illustrations from your pictures

Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining

Awesome multilingual OCR toolkits based on PaddlePaddle （practical ultra lightweight OCR system, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices）

Official implementation of Character Region Awareness for Text Detection (CRAFT)