“袋鼯麻麻——智能购物平台”能够精准地定位识别每一个商品

Last update: Jan 05, 2023

Overview

袋鼯麻麻——智能购物平台

项目背景

目前在零售行业的实际运营过程中，会产生巨大的人力成本，例如导购、保洁、结算等，而其中，尤其需要花费大量的人力成本和时间成本在识别商品并对其进行价格结算的过程中，并且在此过程中，顾客也因此而需要排队等待。这样一来零售行业人力成本较大、工作效率极低，二来也使得顾客的购物体验下降。

随着计算机视觉技术的发展，以及无人化、自动化超市运营理念的提出，利用图像识别技术及目标检测技术实现产品的自动识别及自动化结算的需求呼之欲出，及自动结账系统（Automatic checkout, ACO）。基于计算机视觉的自动结账系统能有效降低零售行业的运营成本，提高顾客结账效率，从而进一步提升用户在购物过程中的体验感与幸福感。

实现功能

本项目具体实现在零售过程中对用户购买商品的自动结算。即：利用计算机视觉领域中的图像识别及目标检测技术，精准地定位顾客购买的商品，并进行智能化、自动化的价格结算。当顾客将自己选购的商品放置在制定区域的时候，“袋鼯麻麻——智能购物平台”能够精准地定位识别每一个商品，并且能够返回完整地购物清单及顾客应付的实际商品总价格，极大地降低零售行业实际运营过程中巨大的人力成本，提升零售行业无人化、自动化、智能化水平。

整体架构

技术路线

袋鼯麻麻——智能购物平台 主要基于PaddleClas作为主要的功能开发套件，利用其开源的图像识别技术，并通过PaddleInference将其部署于Jetson Nano，并基于QPT打包.exe打通Windows系统，开发一套符合实际应用需求的工业级智能零售购物平台。

图像识别介绍

整个图像识别系统分为三步：
（1）通过一个目标检测模型，检测图像物体候选区域；
（2）对每个候选区域进行特征提取；
（3）与检索库中图像进行特征匹配，提取识别结果。

对于新的未知类别，无需重新训练模型，只需要在检索库补入该类别图像，重新建立检索库，就可以识别该类别。

数据集介绍

【The first one】:Products-10K Large Scale Product Recognition Dataset

【The second one】:RP2K: A Large-Scale Retail Product Dataset for Fine-Grained Image Classification

袋鼯麻麻——智能购物平台基于上述两个数据集，并对此两种数据集进行适应性处理。

目前处理后的数据集已在AIStudio开源。

部署方式

本项目已打通Jetson Nano、Windows、linux系统

使用QPT打包的百度网盘链接：https://pan.baidu.com/s/1pVr4zSZB6qV10VIPvgWCsA 提取码：mpq2

解压后运行启动程序.exe即可
服务器部署

安装python依赖库：pip install -r requestment.txt；

执行python manage.py makemigrations;

执行python manage.py migrate;

执行python manage.py runserver # 默认运行在8000端口
微信小程序打开开发者工具，导入系统文件夹下wx_mini_app文件夹并运行，即可运行小程序端；

“袋鼯麻麻——智能购物平台”能够精准地定位识别每一个商品

Related tags

Overview

袋鼯麻麻——智能购物平台

项目背景

实现功能

整体架构

技术路线

图像识别介绍

数据集介绍

部署方式

bilibili效果演示

Owner

thomas-yanxin

Application of K-means algorithm on a music dataset after a dimensionality reduction with PCA

This is the code of paper ``Contrastive Coding for Active Learning under Class Distribution Mismatch'' with python.

用opencv的dnn模块做yolov5目标检测，包含C++和Python两个版本的程序

Deep Learning for Human Part Discovery in Images - Chainer implementation

DenseNet Implementation in Keras with ImageNet Pretrained Models

PyTorch code of paper "LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question Answering"

A data-driven approach to quantify the value of classifiers in a machine learning ensemble.

code and data for paper "GIANT: Scalable Creation of a Web-scale Ontology"

Civsim is a basic civilisation simulation and modelling system built in Python 3.8.

A very simple baseline to estimate 2D & 3D SMPL-compatible keypoints from a single color image.

Acoustic mosquito detection code with Bayesian Neural Networks

PyTorch implementation of SmoothGrad: removing noise by adding noise.

Pytorch implementation of Straight Sampling Network For Point Cloud Learning (ICIP2021).

DWIPrep is a robust and easy-to-use pipeline for preprocessing of diverse dMRI data.

Self-supervised Label Augmentation via Input Transformations (ICML 2020)

Distributed Deep learning with Keras & Spark

Codes for TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.

CoINN: Correlated-informed neural networks: a new machine learning framework to predict pressure drop in micro-channels

This repo includes the supplementary of our paper "CEMENT: Incomplete Multi-View Weak-Label Learning with Long-Tailed Labels"

[CoRL 2021] A robotics benchmark for cross-embodiment imitation.