A Scalable Solution for Extreme Multi-class Product Classification: An E-commerce Case Study

Date

2018-04-27

Authors

Fathi, Ehsan

Publisher

East Carolina University

Abstract

Image classification is a central task in image processing. Although there have been many advances in recent years, it remains quite a challenge. Meanwhile, driven by progress in technology, e-commerce has emerged as the fastest-growing sector of the U.S. marketplace, and product classification is an extremely important problem in e-commerce. In this work, we propose a scalable, flexible, practical, modular, and efficient architecture that applies image classification techniques to product classification using product images alone. Given the diversity of products offered by online retail stores, it is not surprising that we confront an extremely large number of classes. Our case study is Cdiscount, the biggest non-food e-commerce company in France, which has made about 3 billion euros. Its growth trend indicates that it will have about 30 million products for sale, whereas it had only 10 million two years ago. As the next step toward business expansion, the company decided to employ image processing techniques. The structure of the dataset, the diversity of its products, and its volume make it unique among the available public datasets. We focused on developing a CNN architecture to tackle this challenge and to provide a more general, flexible, scalable, and efficient solution to the Cdiscount image classification business problem. Applying the proposed architecture yields reasonable accuracy, which indicates that the architecture is effective. A comparison between the proposed model and previous models is also provided.
