Pavel Kisilev
Toga Networks Ltd., Israel
Title: Smart data acquisition and analysis for deep learning based systems
Biography
Biography: Pavel Kisilev
Abstract
Deep Learning (DL) becomes the method of choice in many fields of computer science, including computer vision, Natural Data Processing (NLP), autonomous systems, and many others. While DL methods are known for their superior performance, in order to achieve it, large amount of training examples are required. Furthermore, the quality of training examples affects largely the performance of DL training and learnt model quality. In particular, if the training examples represent well the real world phenomena, a good model is expected to learn. If, on the other hand, the examples are highly correlated and represent only sparse knowledge about phenomena, the learnt model quality will be low. In this talk, we present several general principles and methods to diagnose the data quality, and also the suitability of DL architecture to model the data at hand. We also propose several methods to pre-process raw data to better suit the requirements of DL systems. We show several examples of applications of our framework to various datasets, including known large image datasets with millions of images, binary sequence sets, gene datasets, and others. We show the efficacy of the proposed methods to analyze and predict the performance of DL methods on a given data.